-
Notifications
You must be signed in to change notification settings - Fork 8
Text Analysis
Mihir Singh edited this page Mar 28, 2020
·
10 revisions
- IDE: Visual Studio Code 1.43.2.0 and PowerShell 7.0.0-preview.2
- Language: Python 3.7
- Method: Webscraping
- Tools: Selenium and PhantomJS Web Driver
- IDE: Visual Studio Code 1.43.2.0 and PowerShell 7.0.0-preview.2
- Language: Python 3.7
- Tools: NLTK, Sklearn's TFIDF Feature Extraction and Pickling
A model based on the frequency distribution of words in any new text calculates the most occurring terms and cross-references it with the tags available in our dataset. If the recurring terms or topics are relevant as tags, those are suggested as well.
Lastly, any relevant tags or topics present in the title of the article are added to the mix as well.
A collection of these tags are suggested to the user through the Browser Extension.