- Introduction, overview of the course
- Overview of software tools (to be discussed / mentioned in the course)
- Getting to Known R and RStudio with code examples
- Introduction to corpus linguistics
- Corpus design, representativeness, sampling frame
- Sources of corpus data, corpus compilation
- Corpus indexing, queries, analysis
- Worked example: the Trump Twitter Archive on CQPweb
- Querying corpora for constructions
- Regular Expressions
- Automatic NLP pipelines (trankit)
- Practicals of corpus creation
- Quantitative techniques of corpus linguistics
- Basis: frequency comparison
- Collocations, Keywords, ...
- Pointers to statistics course in summer term