This data set contains information of about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue. This dataset contans the following attributes:
id - A unique identifier for each movie.
imdb_id - Unique identifier for each movie in IMDB.
popularity - Numeric value representing the movie's popularity.
budget - How much was budgeted to make the movie.
revenue - Proceeds from the movie.
original_title - Title before adaptions.
cast - Names of the lead actors and supporting actors.
homepage - Link to the movie's official homepage.
director - The director(s) of the movie.
tagline - Movie Tagline.
keywords - Words related to or associated with the movie.
overview - Brief summary of the movie.
runtime - Duration of the movie in minutes.
genres - The genre of the movie.
production_companies - The movie production house.
release_date - Date of movie release.
vote_count - Number of ratings the movie received.
vote_average - Average user ratings.
release_year - Year the movie was released.
budget_adj - Budget of associated movie in terms of 2010 dollars, accounting for inflation over time.
revenue_adj - Revenue of associated movie in terms of 2010 dollars, accounting for inflation over time.
- What is the average profit generated by movies produced in each year?
- What movies generated the least profits?
- What is the most popular movie?
- What is the least popular movie?
- What is the relationships between popularity, revenue and user ratings?
- What is the relationship between revenue and profit?