*Class schedule is subject to revision throughout the semester.
W | Date | Due (before class @ 12:30pm) | Topics Tools |
|
#To-do/Homework Project |
||||
1 | 1/7 | [slides] Course introduction, setup | ||
1/9 | #1 | [slides] Data in linguistics | ||
2 | 1/14 | Homework 1 | [slides] Processing linguistic data | |
1/16 | #2 | Data processing fundamentals, statistics | [slides] Python's numpy library | |
3 | 1/21 | #3 | [slides] Data frames with pandas | |
1/23 | #4 | [slides] Text processing, stats intro | ||
4 | 1/28 | Stats crash course | ||
1/30 | #5 | Data visualization | ||
5 | 2/4 | Homework 2 | HW2 review | |
2/6 | Corpus linguistics, annotation | [slides] Corpus concepts, building & processing | ||
6 | 2/11 | #6 | [slides] Annotation, data standards & exchange formats | |
2/13 | #7 | Open access & data publishing | [slides] Guest speakers Lauren Collister and Dominic Bordelon | |
7 | 2/18 | #8 | Data mining and machine learning | [slides] Data-mining web & social media |
2/20 | #9 | [notebook] (Linear) regression modeling | ||
8 | 2/25 | [notebook] Logistic regression and classification | ||
2/27 | #10 | [notebook] Classifiers continued, categorical data | ||
9 | 3/3 | #11 | [notebook] Dimensionality reduction, cross-validation | |
3/5 | Homework 3 | Homework 3 review | ||
No class: Spring break | ||||
10 | 3/17 | Class cancelled | ||
3/19 | ||||
11 | 3/24 | Big data | [slides] Bash and command line. Guest speaker Barry Moore II Command line, BASH, Unix tools |
|
3/26 | [handout] Command line, grep | |||
12 | 3/31 | #12 | Supercomputing at CRC, SSH, command line | |
4/2 | #13 | [slides] Computational efficiency, machine learning big data, word embeddings | ||
13 | 4/7 | Homework 4 troubleshooting | ||
4/9 | Homework 4 (moved to 4/14) | Homework 4 review | ||
14 | 4/14 | Speech & multimedia | Speech data, ASR theory, multimodal data
Praat, Elan |
|
4/16 | #14 | Project presentations | ||
15 | 4/24 | No class: finals week |