Brickset Scraper Using Scrapy

Intro

A web scraper that uses Python 3 and Scrapy to scrape Brickset, a LEGO database.

Requirements

  • Python 3.x
  • Scrapy

Usage

scrapy runspider brickset-scraper > brickset-scraper.json

Example output JSON files are here.

By default, sets from the past 5 years are scraped.

Change START_YEAR and END_YEAR in scraper.py to scrape the desired years.
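As a rough illustration of how a year range like this could drive the scrape, the sketch below builds one listing URL per year. It is not taken from scraper.py: the helper name `year_urls`, the default values, and the `https://brickset.com/sets/year-<YEAR>` URL pattern are all assumptions made for the example.

```python
from datetime import date

# Hypothetical defaults: the current year and the four years before it.
END_YEAR = date.today().year
START_YEAR = END_YEAR - 4


def year_urls(start_year, end_year):
    """Return one listing URL per year in the inclusive range."""
    # Assumed URL pattern; check it against the live site before relying on it.
    base = "https://brickset.com/sets/year-{}"
    return [base.format(year) for year in range(start_year, end_year + 1)]


if __name__ == "__main__":
    for url in year_urls(START_YEAR, END_YEAR):
        print(url)
```

A list like this could, for instance, be assigned to a Scrapy spider's `start_urls` attribute.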

Caveat

Example output has been beautified with JSON Lint for readability.
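The example files were formatted with JSON Lint, but similar indentation can probably be produced locally with Python's standard `json` module. The sketch below is one way to do that; the helper name `beautify` and the sample record (including its field names) are hypothetical and not taken from the scraper's real output.

```python
import json


def beautify(raw_json):
    """Re-serialize a JSON string with 4-space indentation."""
    return json.dumps(json.loads(raw_json), indent=4, ensure_ascii=False)


if __name__ == "__main__":
    # Hypothetical record, only to show the formatting.
    sample = '[{"name": "Example Set", "pieces": 100}]'
    print(beautify(sample))
```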

This scraper depends on the page source at Brickset, so it may stop working if the site's markup changes.

Working as of 18 Aug 2018.

The foundation for this scraper can be found here.