Gives some statistics about english language vocabulary. the base vocabulary list is gain using hackerb9/gwordlist and hermitdave/FrequencyWords lists.
- 50,000 english cleaned vocabulary. cleaned using extract-lemmatized-nonstop-words.
- Based on revising 349,066,176,882 words.
- Sorted by relative frequency.
- Relative Frequency percent per word.
- Cumulative Relative Frequency percent perword sorted by relative frequency.
Using Yarn
yarn add vocabulary-list-statistics
Using NPM
npm i --save vocabulary-list-statistics
You can also use the Excel version.
const vocabulary = require('vocabulary-list-statistics');
console.log(vocabulary[12].cumulative);
// logs Cumulative Relative Frequency of 12th word in the list