This is the code and configuration files to accompany my tutorial on getting started with R and Hadoop presented at TDWI Boston 2012's pre-conference workshop, September 15, 2012.
This repository has three main directories:
- bin -- scripts to populate and clear HDFS
- config -- instructions and configuration files to set up the Cloudera demo VM for the tutorial
- R -- all the R code we will work through
- data -- sample data
- presentation -- slide decks, etc.
Jeffrey Breen [email protected]