Skip to content

subsr97/xkcd-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

XKCD Comics Scraper Using Scrapy

Intro

A Web Scraper that scrapes data off XKCD Comics using Scrapy.

Scraped Contents

  • Title
  • Comic's URL
  • Comic image's URL
  • Transcript of the comic (if present)

Requirements

  • Python 3.x
  • Scrapy

Usage

scrapy crawl xkcd-spider > xkcd.json

Example output JSON and HTML source files are here.

Caveat

Example output has been beautified with JSON Lint for readability.

Working of this web scraper depends on the source at XKCD Comics.

About

A web scraper that scrapes data off XKCD Comics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published