Big Data Analytics with RevoScaleR
Thursday, September 6, 2012 at 10:00 AM - Friday, September 7, 2012 at 2:00 PM (PDT)
Revo Scale R is an R package designed, built and optimized to solve one of R’s biggest soft spots – dealing with big data. Dealing with big data involves addressing several different issues, such as interfacing with diverse data sources, storing and manipulating big data efficiently and implementing statistical algorithms that can handle large data.
This course is designed for R users who have mastered the basics of R and are interested in learning how to take advantage of the capabilities of ScaleR for high performance analytics on datasets that exceed the normal physical memory limits of R. The class uses a combination of lecture and labs to instruct students on how to effectively use and script ScaleR functions for big data analyses. In addition, students will learn how to visualize the results of ScaleR analyses through use of graphics
Users with some knowledge of R and multivariate modeling who would like to apply such techniques to very large datasets using ScaleR.
- Familiarity with the basics of the R language and some prior hands-on experience
- Understanding of multivariate modeling methods such as linear and logistic regression.
- Windows Laptop/Desktop with Revolution R Enterprise installed.
- Revolution Analytics Training Center Requirements
Introduction: A taste of the power of Big Data Analytics with Revolution Scale R.
- Discussion on the challenges in big data analytics
- Demonstration of importing and exploring big data
- Simple statistical techniques with big data
- Story Telling with visualizations of big data
- Getting help with Revolution Scale R
Data Munging Lab
- Importing different types of data such as delimited text, fixed format
- Dealing with other data sources – SAS/SPSS files, data frames, ODBC
- Transforming and sub setting big data
- Managing Meta Data and Recoding Variables
- Exporting big data
Data Exploration Lab
- Summarizing big data
- Visualizing big data
- Estimating a model (Linear, Logistic, GLM, k-Means)
- Calculating residuals, plot a histogram of residuals
- Predicting on a new dataset
- Repeating with ‘on the fly’ transformations
- Advanced Data Manipulations
- Working Locally or on a Cluster
- Integrating Revo ScaleR into other R packages
- Building your own models using Revo ScaleR
Session recordings will be made available to the participants for a period of 4 weeks.
Disclaimer: We have the right to cancel the event for any reason at any time. Revolution Analytics will refund all monies paid for ticket sales in full in the event of a cancellation. We are not responsible for any travel related expenses incurred by attendees for this event. This includes but not limited to transportation, hotel accommodations or any other travel related expenses secured by the attendee, due to a cancellation on our part.
30 days from event date Full refund less 10% of the paid ticket price
21 days from event date 50% of paid ticket price
Within 15 days of event date Non refundable