A Python implementation of a tf-idf based search engine for a subset of the English Wikipedia.
You have to run both commands below. All files must be placed in the same folder.
- 'python generator.py parsed_xml'
- 'python query.py '
The English Wikipedia sub-set has been reduced to make the upload easier.
WikiSearchEngine was developed as a group project for the Information Retrieval course, Master in Language Technology, Uppsala University.
Authors: Giuseppe Della Corte, Ugo Loobuyck
Course: Information Retrieval
Master of Language Technology, Uppsala University