Vocab index ordering? #75

Tahlor · 2018-04-19T05:02:12Z

Why do we calculate word counts if we ultimately include all words and sort the vocab alphabetically? Mostly I'm wondering if you ordered it with the most common words first, and later reverted to alphabetical ordering for some reason. Would it be bad to have the most common words first? Or do we expect the model would learn better with a random ordering?

In utils.py:

        vocabulary_inv = [x[0] for x in word_counts.most_common()]
        vocabulary_inv = list(sorted(vocabulary_inv))

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vocab index ordering? #75

Vocab index ordering? #75

Tahlor commented Apr 19, 2018 •

edited

Loading

Vocab index ordering? #75

Vocab index ordering? #75

Comments

Tahlor commented Apr 19, 2018 • edited Loading

Tahlor commented Apr 19, 2018 •

edited

Loading