The motive of this project is to find out the customer satisfaction of some residential hotels of Dhaka. This project was done in VS Code with Python programming language. The dataset was created through web scraping using parsing method in RStudio using R programming language. Almost 5,000 reviews were collected of 8 hotels in Dhaka City. In this project almost 600 reviews were used of only 1 hotel.
The methods followed in this project are:
- Data Processing
- Vectorization
- Spliting Dataset
- Evaluate the model
- Generate Sentiment Score
- Visualize Sentiment Analysis
Tools Used
- Python programming language
- Visual Studio Code IDE
Libraries Used
- re
- openpyxl
- pandas
Libraries Used
- nltk
- pandas
- nltk.tokenize (word_tokenize)
Libraries Used
- nltk
- pandas
- nltk.corpus (stopwords)
Libraries Used
- nltk
- pandas
- nltk.stem (PorterStemmer, WordNetLemmatizer)
Libraries Used
- json
- pandas
- sklearn.feature_extraction.text (TfidfVectorizer)
Libraries Used
- json
- joblib
- numpy
- sklearn.model_selection (train_test_split)
- sklearn.linear_model (LogisticRegression)
- sklearn.metrics (accuracy_score, classification_report, confusion_matrix, precision_score, recall_score, f1_score)
(https://github.com/Tanzim-prog/sentiment_analysis_ml_stringdata/tree/master/Documentation)