Digital Data Collection - Digging Deeper

Rolf Fredheim and Yulia Shenderovich
University of Cambridge

24/02/2015

Logging on

Before you sit down:

  • Do you have your MCS password?
  • Do you have your Raven password?
    • If you answered 'no' to either then go to the University Computing Services (just outside the door) NOW!
  • Are you registered? If not, see me!

Download these slides

Follow the link from the course description on the SSRMC pages, or go directly to http://fredheir.github.io/WebScraping/

Download the R file to your computer

Install the following packages:

ggplot2 lubridate plyr jsonlite stringr
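
One way to install the packages listed above is sketched here. The helper names missing_packages and install_if_missing are made up for this illustration and are not part of the course R file; the sketch only assumes the base R functions requireNamespace and install.packages.

```r
# Packages named on this slide
course_packages <- c("ggplot2", "lubridate", "plyr", "jsonlite", "stringr")

# Hypothetical helper: return the packages that are not yet installed
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

# Hypothetical helper: install only what is missing (needs an internet connection)
install_if_missing <- function(pkgs) {
  to_install <- missing_packages(pkgs)
  if (length(to_install) > 0) {
    install.packages(to_install)
  }
  invisible(to_install)
}
```

Running install_if_missing(course_packages) once in the console should be enough; after that, load each package with library() as usual.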

Summing up

Given the URLs of target pages, we can now:

  • download raw JSON data
  • extract fields of interest
  • put this in a function
  • apply the function to a list of targets
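
The four steps above can be sketched as follows. This is a minimal illustration, not the code from the course R file: the field names title and views and the function name extract_fields are hypothetical, and the targets are inline JSON strings so that the example runs without a network connection. jsonlite's fromJSON also accepts a URL in place of each string.

```r
library(jsonlite)

# Steps 1-3: download/parse the raw JSON, extract the fields of interest,
# and wrap both in a function.
# "title" and "views" are hypothetical field names; use the ones in your data.
extract_fields <- function(target) {
  raw <- fromJSON(target)  # target may be a URL, a file path, or a JSON string
  data.frame(title = raw$title, views = raw$views, stringsAsFactors = FALSE)
}

# Stand-ins for the list of target URLs
targets <- c(
  '{"title": "Page A", "views": 10}',
  '{"title": "Page B", "views": 25}'
)

# Step 4: apply the function to every target and stack the rows
results <- do.call(rbind, lapply(targets, extract_fields))
print(results)
```

Using lapply with do.call(rbind, ...) keeps the loop out of sight; a for loop that appends one row at a time would give the same table.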

No class next week!!

So that's it for APIs and JSON. For those who are keen, more advanced material involving APIs and JSON sources (maps? YouTube?) can be found in last year's slides:

Too many loops, variables, and functions?

If this has all been a bit much, below is a link to some extra material on variables, functions and loops.