Build a model to analyze customer sentiment from hotel reviews (positive, neutral, or negative).
URL: TripAdvisor Hotel Reviews on Kaggle The folder contains:
offering.txt
offerings.csv
review.txt
reviews.csv
(Note: In this repository, ignore review.txt
and reviews.csv
due to Git LFS limitations. xD)
In my main.py
, only offerings.csv
and reviews.csv
are used.
df_offerings: [4333]row x [8]col
df_reviews: [810967]row x [11]col
Labels: [
positive
,neutral
,negative
]
Configuration:
Accuracy: 0.87