These scripts convert the Wikipedia Dictionary into a TSV file for easy use in your favorite dictionary-reqiuring projects.
It works by downloading a big XML dump (about 500MB) from Wikipedia's dump server, and then Python 2 scripts convert it to gzipped TSV files (one for the English Wiktionary, and one for All Languages), eg:
tsv/enwikt-defs-20180220-all.tsv.gz
tsv/enwikt-defs-20180220-en.tsv.gz
- Make sure
curl
andPython 2
are installed. - Run
./download.sh
(results will be placed in the "xml/" directory; you can run it again to resume the download if it aborts) - Run
./create.sh
(results will be placed in the "tsv/" directory)
This is a fork of a fork of Conrad Irwin's Wiktionary TSV generator. It's no longer running on Wikipedia's TOOL server, so this is the only way to use it at the moment.