Using R with Hadoop (TRA77)

Revolution Analytics, Inc.

Friday, June 6, 2014 from 9:00 AM to 5:00 PM (PDT)

Mountain View, CA

Using R with Hadoop (TRA77)

Ticket Information

Ticket Type Sales End Price Fee Quantity
General Admission Jun 6, 2014 $1,500.00 $9.95
Academic Jun 6, 2014 $1,000.00 $9.95
Student Jun 6, 2014 $600.00 $9.95

Who's Going

Loading your connections...

Share Using R with Hadoop (TRA77)

Event Details

Using R with Hadoop

 

Course Overview

This course introduces Hadoop and its implementation of MapReduce and focuses on RHadoop’s packages to interact with Hadoop’s Distributed File System (HDFS), write and submit MapReduce jobs, and interact with the HBase NoSQL database.

 

Audience

Analysts, modelers, and statisticians already familiar with R

 

Prerequisites

  • 1-2 years’ experience with R and map-reduce
  • Understand the ideas behind parallel programming
  • Know the basics of apply, tapply, lapply, regression, clustering
  • Know how to view and use the Hadoop logs for debugging
  • No instalation will be required, you will need to have an rstudio server-supported browser: chrome or firefox on windows

 

Course Outline

Our courses teach by doing, where short lectures and hands-on exercises are interspersed. By the end of the course, you will learn the following:

  • Overview of Hadoop, MapReduce, and R
  • Options for using R with Hadoop
  • rhdfs
    • Installation
    • Function overview
    • Example: populate HDFS
    • Exercise: Checking the result 
  • rmr2
    • Installation
    • Function overview
    • Components of basic Hadoop MapReduce jobs
    • Exercise: wordcount: the "hello world" of Hadoop
    • Exercise: Airline speed records
    • Exercise: User-based collaborative filtering
    • Advanced features
      • Writing composable map reduce jobs
      • Developing and debugging with the local backend
      • Specifying backend processing options
      • Saving results locally for further analysis
  • rhbase
    • Installation
    • Function overview
    • Exercise: tweets - storing Twitter status messages in HBase
    • Exercise: Twitter users - storing user information about tweet authors  

 


 

Disclaimer:

We have the right to cancel the event for any reason at any time. Revolution Analytics will refund all monies paid for ticket sales in full in the event of a cancellation.  We are not responsible for any travel related expenses incurred by attendees for this event. This includes but not limited to transportation, hotel accommodations or any other travel related expenses secured by the attendee, due to a cancellation on our part.

 

Cancellation Policy:

  • 30 or more days from the event date: Full refund less 10%
  • 16-29 days from the event date: 50% refund
  • 15 or less days from the event date: No refund

 

Note:

  • All related transaction fees PayPal and Eventbrite are not refundable
  • Discount offers cannot be combined
  • A student ID Number is not a proof of full time university enrollment to get the student’s discount.  Proof of enrollment in 9 units or more on a current academic registration document will be required to receive the student's discount.

 

 

 

Have questions about Using R with Hadoop (TRA77)? Contact Revolution Analytics, Inc.

When & Where



Revolution Analytics HQ
2570 West El Camino Real, Suite 222
Mountain View, CA 94040

Friday, June 6, 2014 from 9:00 AM to 5:00 PM (PDT)


  Add to my calendar

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.