From 7421d9b16206ee8182654d04fb47c87b7e2de04e Mon Sep 17 00:00:00 2001 From: Sarah Oberbichler <66369271+soberbichler@users.noreply.github.com> Date: Fri, 8 Nov 2024 01:39:18 +0100 Subject: [PATCH] Update module_2.html --- modules/module_2.html | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/modules/module_2.html b/modules/module_2.html index 10d2362..447b51a 100644 --- a/modules/module_2.html +++ b/modules/module_2.html @@ -75,13 +75,13 @@

Notebooks we will use in class:

Workload (after class):

Finish the exercises of the Python Crash Course (if not finished in class)

-

Create your first Jupyter notebook, clone our course repository, and import Data_exercise_2.

+

Create your first Jupyter notebook, clone our course repository, and import kölnische_Zeitung_erdbeben_artikel.xlsx.

Complete the following tasks:

  1. Clean, tokenizem and lemmatize the corpus
  2. Find the most frequent verbs (use the NLTK package for this task)
  3. -
  4. Use TF-IDF vectorization and identify the most frequent verbs again
  5. +
  6. Visualize the most frequent verbs with a visualization of your choice

Save your notebook in your GitHub repository.