Week 9

Objectives

  • Using the Google Books REST API
  • New York Times article scrape
  • Scraping and Parsing XML
  • Fetching and Parsing Data from the Web with OpenRefine, APIs
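As a rough preview of the XML-parsing objective, here is a minimal sketch using Python's standard-library `xml.etree.ElementTree`. The sample document, its tag names, and the `parse_books` helper are made up for illustration; the course notebook may use a different library or a different data source.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; the notebook's real XML will differ.
SAMPLE_XML = """<catalog>
  <book id="b1"><title>Moby-Dick</title><author>Herman Melville</author></book>
  <book id="b2"><title>Walden</title><author>Henry David Thoreau</author></book>
</catalog>"""


def parse_books(xml_text):
    """Return one dict per <book> element found in the XML string."""
    root = ET.fromstring(xml_text)
    return [
        {
            "id": book.get("id"),               # attribute on the <book> tag
            "title": book.findtext("title"),    # text of the child <title>
            "author": book.findtext("author"),  # text of the child <author>
        }
        for book in root.iter("book")
    ]


books = parse_books(SAMPLE_XML)
for book in books:
    print(book["id"], book["title"], book["author"])
```

`findtext` returns `None` when a child element is missing, so records with gaps do not raise an error.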

Exercises

Save the following Jupyter notebook files to the sharedfolder directory on your desktop.

  1. Using the Google Books REST API

  2. New York Times article scrape

  3. Scraping and Parsing XML

  4. Fetching and Parsing Data from the Web with OpenRefine, APIs Example 2: URL Queries and Parsing JSON

  • Using OpenRefine: Start the application on the Mac side of the lab computer. OpenRefine runs only in a browser window. If a browser window doesn't open automatically after you double-click the application, copy and paste http://127.0.0.1:3333/ into your browser's address bar.
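The URL-query and JSON-parsing exercises follow a common pattern: build a query URL, fetch it, and pull fields out of the JSON response. The sketch below assumes the public Google Books volumes endpoint and its `q` and `maxResults` parameters; the helper names and the tiny sample response are hypothetical, and the course notebooks may structure this differently.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Public Google Books volumes endpoint.
BASE_URL = "https://www.googleapis.com/books/v1/volumes"


def build_query_url(search_terms, max_results=5):
    """Encode the search terms into a query string and append it to the endpoint."""
    params = {"q": search_terms, "maxResults": max_results}
    return BASE_URL + "?" + urlencode(params)


def extract_titles(response_text):
    """Parse a JSON response body and return the title of each returned volume."""
    data = json.loads(response_text)
    return [
        item.get("volumeInfo", {}).get("title")
        for item in data.get("items", [])  # "items" is absent when nothing matches
    ]


def fetch_titles(search_terms):
    """Query the live API (needs network access) and return the matching titles."""
    with urlopen(build_query_url(search_terms)) as response:
        return extract_titles(response.read().decode("utf-8"))


# Hypothetical, heavily trimmed response used to exercise the parser offline.
SAMPLE_RESPONSE = '{"items": [{"volumeInfo": {"title": "Walden"}}]}'

print(build_query_url("moby dick"))
print(extract_titles(SAMPLE_RESPONSE))
```

To try it against the live service from a notebook cell, call `fetch_titles("moby dick")`.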

Getting started

Open Terminal in macOS and launch our Docker container:

docker rm -f pcda_ubuntu
docker pull pcda17/ubuntu-container
docker run --name pcda_ubuntu -ti -p 8889:8889 --volume ~/Desktop/sharedfolder/:/sharedfolder/ pcda17/ubuntu-container

In Windows 10, open PowerShell and enter the following to launch the Docker container:

docker rm -f pcda_ubuntu
docker pull pcda17/ubuntu-container
docker run --name pcda_ubuntu -ti -p 8889:8889 --volume C:\Users\***username_here***\Desktop\sharedfolder:/sharedfolder/ pcda17/ubuntu-container

Open any browser and go to the following address to launch your Jupyter Notebook:

localhost:8889
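If the page does not load, it can help to confirm that something is actually listening on the port before troubleshooting further. This is a small sketch to run on the host machine (not inside the container); `is_port_open` is a hypothetical helper name, and port 8889 comes from the `-p 8889:8889` flag above.

```python
import socket


def is_port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if is_port_open("127.0.0.1", 8889):
    print("Something is listening on port 8889; try localhost:8889 in the browser.")
else:
    print("Nothing is listening on port 8889; check that the container is running.")
```

A successful connection only shows that the port is open, not that Jupyter itself has finished starting.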