Sentiment Analysis on Text Data using Machine Learning

The motive of this project is to find out the customer satisfaction of some residential hotels of Dhaka. This project was done in VS Code with Python programming language. The dataset was created through web scraping using parsing method in RStudio using R programming language. Almost 5,000 reviews were collected of 8 hotels in Dhaka City. In this project almost 600 reviews were used of only 1 hotel.

The methods followed in this project are:

Data Processing
Vectorization
Spliting Dataset
Evaluate the model
Generate Sentiment Score
Visualize Sentiment Analysis

Tools Used

Python programming language
Visual Studio Code IDE

Text Cleaning

Libraries Used

re
openpyxl
pandas

Tokenization

Libraries Used

nltk
pandas
nltk.tokenize (word_tokenize)

Stop Words Removal

Libraries Used

nltk
pandas
nltk.corpus (stopwords)

Stemming/Lemmatization

Libraries Used

nltk
pandas
nltk.stem (PorterStemmer, WordNetLemmatizer)

TF-IDF Vectorization

Libraries Used

json
pandas
sklearn.feature_extraction.text (TfidfVectorizer)

Stemming/Lemmatization

Libraries Used

json
joblib
numpy
sklearn.model_selection (train_test_split)
sklearn.linear_model (LogisticRegression)
sklearn.metrics (accuracy_score, classification_report, confusion_matrix, precision_score, recall_score, f1_score)

For detailed documentation please visit and download files from the link

(https://github.com/Tanzim-prog/sentiment_analysis_ml_stringdata/tree/master/Documentation)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.vscode		.vscode
Assets/Images		Assets/Images
Codes		Codes
Data Sets		Data Sets
Documentation		Documentation
JSON		JSON
Models		Models
Predictions		Predictions
sentiment_analysis.venv		sentiment_analysis.venv
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis on Text Data using Machine Learning

Text Cleaning

Tokenization

Stop Words Removal

Stemming/Lemmatization

TF-IDF Vectorization

Stemming/Lemmatization

For detailed documentation please visit and download files from the link

About

Releases

Packages

Languages

Tanzim-prog/sentiment_analysis_ml_stringdata

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis on Text Data using Machine Learning

Text Cleaning

Tokenization

Stop Words Removal

Stemming/Lemmatization

TF-IDF Vectorization

Stemming/Lemmatization

For detailed documentation please visit and download files from the link

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages