This project is a simple web scraper that retrieves the names of the top movies from IMDb's Top 250 movies page. It uses Python's requests library to fetch the page and BeautifulSoup to locate the JSON-LD structured data embedded in the HTML, which is then parsed as JSON.
- Fetches the top movies from the IMDb website.
- Extracts movie names from the JSON-LD data.
- Saves the movie names to a text file (names.txt).
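
The three steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual script: the chart URL, the browser-like User-Agent header, the function names, and the assumed JSON-LD layout (an object with an itemListElement array whose entries hold item.name) are all assumptions made for this example.

```python
import json

# Assumed URL of the Top 250 chart; it is not stated in this README.
TOP_250_URL = "https://www.imdb.com/chart/top/"


def names_from_json_ld(json_ld_text):
    """Return movie names from one JSON-LD document (assumed ItemList layout)."""
    data = json.loads(json_ld_text)
    if not isinstance(data, dict):
        return []
    names = []
    for entry in data.get("itemListElement", []):
        # Assumed entry shape: {"item": {"name": "..."}}; anything else is skipped.
        item = entry.get("item") if isinstance(entry, dict) else None
        if isinstance(item, dict) and item.get("name"):
            names.append(item["name"])
    return names


def fetch_movie_names(url=TOP_250_URL):
    """Download the page and collect names from every JSON-LD script tag."""
    # Third-party imports are local so the helpers above work without them.
    import requests
    from bs4 import BeautifulSoup

    # A browser-like User-Agent is an assumption; some sites reject the default one.
    response = requests.get(url, headers={"User-Agent": "Mozilla/5.0"}, timeout=30)
    response.raise_for_status()

    soup = BeautifulSoup(response.text, "html.parser")
    names = []
    for script in soup.find_all("script", type="application/ld+json"):
        names.extend(names_from_json_ld(script.string or "{}"))
    return names


def save_names(names, path="names.txt"):
    """Write one movie name per line to a UTF-8 text file."""
    with open(path, "w", encoding="utf-8") as handle:
        for name in names:
            handle.write(name + "\n")
```

With these helpers, calling save_names(fetch_movie_names()) would run the whole pipeline and write names.txt in the current directory. If the page changes its embedded JSON-LD structure, names_from_json_ld is the only part that should need adjusting.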
- Python 3
- requests library
- BeautifulSoup from the bs4 library
- JSON parsing
- Clone this repository:
git clone https://github.com/Avadhoot1905/Web-scrapper.git