Skip to content

This repository hosts a Python project focused on web scraping data from JustWatch.com, a leading streaming service aggregator.

Notifications You must be signed in to change notification settings

altamashajaz/Web-Scraping-and-EDA-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EDA Project: Web Scraping and Analysis.

Overview

This repository serves as a comprehensive exploration of the streaming entertainment landscape, driven by data analysis techniques. Through the utilization of Python's Beautiful Soup library, data is systematically extracted from JustWatch.com, a reputable streaming service aggregator. This extracted data, encompassing IMDb ratings, runtimes, genres, year of release and streaming service details, undergoes examination and visualization using Pandas, Matplotlib, and Seaborn within a Jupyter Notebook (.ipynb). The project aims to provide valuable insights into viewer preferences, discerning trends such as mean IMDb ratings, average runtimes for movies and shows, top genres, leading streaming services, and the highest-rated movies on Netflix. By delving into this analysis, viewers and content providers alike can gain a deeper understanding of the streaming landscape and make informed decisions accordingly.

Tools Used

  • Python
  • Beautiful Soup
  • Pandas
  • Matplotlib
  • Seaborn
  • Jupyter Notebook

Contents

  • WebScraping.ipynb: Jupyter Notebook containing the code for web scraping, data analysis, and visualization.
  • README.md: This file, providing an overview of the project.

How to Use

  1. Ensure Python is Installed: Make sure you have Python installed on your system.
  2. Install Required Libraries: Install the necessary libraries: beautifulsoup4, requests, pandas, matplotlib, and seaborn.
  3. Clone the Repository: Clone this repository to your local machine.
  4. Open Jupyter Notebook: Open the Web Scraping.ipynb notebook using Jupyter Notebook.
  5. Execute Code Cells: Follow the instructions within the notebook to execute the code cells and perform analysis.
  6. Explore Findings: Explore the findings and visualizations generated from the scraped data.

Data Source

The data is scraped from JustWatch.com, a popular streaming service aggregator.

Note: Ensure you adhere to JustWatch's terms of service and data usage policies while scraping their website.

Feel free to customize and extend the analysis based on your requirements.

For any questions or feedback, please contact Altamash Ajaz at [email protected].

About

This repository hosts a Python project focused on web scraping data from JustWatch.com, a leading streaming service aggregator.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published