Skip to content

A web scraper for New York Public Library's even calendar. Demo for my PyOhio 2023 talk.

License

Notifications You must be signed in to change notification settings

iamjameswalters/library-eventextractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

library-eventextractor

A web scraper for New York Public Library's event calendar. This is the demo code for my PyOhio 2023 talk: Web Scraping Crash Course! With Python and Scrapy.

APIs are a great way to consume publicly accessible data. But what do you do when there's no API? Enter web scraping: a way you can harvest data out of the same HTML documents you look at in your web browser.

This project demonstrates:

  • How to scaffold out a Scrapy project
  • How to isolate elements in a page to extract data from them
  • How to follow next page links in paginated results
  • How to modify settings.py to alter your scraper's behavior

About

A web scraper for New York Public Library's even calendar. Demo for my PyOhio 2023 talk.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages