Lexical Substitution

This project is a lexical substitution task using:

installation

pip install nltk

python
>>> import nltk
>>> nltk.download()

In the Corpora tab, install wordnet and stopwords.

Install gensim, a vector space modeling package for Python.
Install Huggingface Transformers, BERT implementation by Huggingface (an NLP company), or more specifically their slightly more compact model DistilBERT

pip install gensim
pip install transformers

python lexsub_main.py lexsub_trial.xml > part5.predict

lexical substitution outputs are stored in .predict files

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea		.idea
__pycache__		__pycache__
Readme.md		Readme.md
gold.trial		gold.trial
lexsub_main.py		lexsub_main.py
lexsub_trial.xml		lexsub_trial.xml
lexsub_xml.py		lexsub_xml.py
part2.predict		part2.predict
part3.predict		part3.predict
part4.predict		part4.predict
part5.predict		part5.predict