Skip to content

TheRootOf3/ucl-nlp-notebook-series

Repository files navigation

NLP Jupyter Notebooks repo

About this series of notebooks

This mini-course has been prepared with the aim of showing rather a practical side of natural language processing (NLP) than detailed theoretical aspects. Although some fundamental theory is necessary, notebooks contain many fragments of code ready to be copied & pasted into your project. The course will give you a good foundation and prepare you to tackle more advanced topics.


Table of contents

  • Notebook 1 - Data loading and Regular expressions
  • Notebook 2 - Text preprocessing, POS tags, and simple word model
  • Notebook 3 - Machine Learning Basics & Classifiers
  • Notebook 4 - Naive-Bayes and Logistics Regression in NLP
  • Notebook 5 - Introduction to word embeddings

and

  • Notebook 6 - Putting it all together!

The first 5 notebooks introduce important topics and their implementation using Python and some well-known modules. Notebook 6 presents how to solve a complete NLP classification problem using different techniques and visualizations.

Resources

Each notebook comes with a resources.md file containing sources of used graphics, external code, or datasets. The theoretical side has been developed using the sklearn documentation and a fantastic book - Speech and Language Processing by Dan Jurafsky and James H. Martin available here!

Typos & bugs

If you spot any, feel free to email me: [email protected]!

About authors

Developed as a result of the UCL Engineering Summer Studentship 2021 by Andrzej Szablewski - the first year student, supervised by Lisa Andreevna Chalaguine - the academic researcher.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published