This event has ended

Cloudera Hadoop Training: Basic

Monday, July 27, 2009 at 9:00 AM - Tuesday, July 28, 2009 at 5:00 PM (PDT)

Los Angeles, CA

Cloudera Hadoop Training: Basic

Share Cloudera Hadoop Training: Basic

Event Details

Note: We will be hosting this training session in partnership with Fox Interactive Media. This will be on the Fox lot, which has strict security protocols. As such, we need to close registrations the day before to allow time to get your name on the entry list. Registration closes on July 24th at 1PM PDT.

Cloudera's Basic Hadoop Training provides a solid foundation for those seeking to understand large scale data processing with MapReduce and Hadoop. This session is appropriate for attendees who are new to Hadoop, and may have never used the software before. It is also appropriate for new users who are seeking a deeper understanding of the core principles, programming API and basic MapReduce algorithms.

Cloudera will give a series of lectures interleaved with practical, hands-on examples and exercises. Attendees must bring their own laptop with VMware Player (or Fusion for Mac) so they may follow along in a pre-configured virtual machine Cloudera provides.

Attendees seeking more in-depth knowledge may also attend our intermediate and advanced training on the following two days (a discount code is provided in your confirmation email if you miss the early bird registration). 

Those wishing to document their skills and receive the Cloudera Certified Hadoop Professional (CCHP) credential may take the certification exam immediately following advanced training. It covers material from all three sessions. 

During this all-day session, we will cover the following agenda with ample time for questions:


Lecture:

Thinking at Scale: Introduction to Hadoop and Big Data

You know your data is big – you found Hadoop. What implications must you consider when working at this scale? This lecture addresses common challenges and general best practices for scaling with your data.

Lecture:

MapReduce and HDFS

These tools provide the core functionality to allow you to store, process, and analyze big data. This lecture "lifts the curtain" and explains how the technology works. You'll understand how these components fit together and build on one another to provide a scalable and powerful system.

Exercise:

Getting Started with Hadoop

If you'd like a more hands-on experience, this is a good time to download our VM and kick the tires a bit. In this activity, using the provided instructions, you'll get a feel for the tools and run some sample jobs.

Lecture:

The Hadoop Ecosystem

An introduction to other projects surrounding Hadoop, which complete the greater ecosystem of available large-data processing tools.

Lecture:

The Hadoop MapReduce API

Learn how to get started writing programs against Hadoop's API.

Lecture:

Introduction to MapReduce Algorithms

Writing programs for MapReduce requires analyzing problems in a new way. This lecture shows how some common functions can be expressed as part of a MapReduce pipeline.

Exercise:

Writing MapReduce Programs

Now that you're familiar with the tools, and have some ideas about how to write a MapReduce program, this exercise will challenge you to perform a common task when working with big data - building an inverted index. More importantly, it teaches you the basic skills you need to write your own, more interesting data processing jobs.

Lecture:

Hadoop Deployment

Once you understand the basics for working with Hadoop and writing MapReduce applications, you'll need to know how to get Hadoop up and running for your own processing (or at least, get your ops team pointed in the right direction). Before ending the day, we'll make sure you understand how to deploy Hadoop on servers in your own datacenter or on Amazon's EC2.


We will take a one hour break around noon for lunch There are many places to eat while exploring the Fox lot.

Security and Parking:

Security:
At the Galaxy Parking Lot, please let the guard know you are there for the "Fox Audience Network Hadoop Workshop."

Parking:
Galaxy Parking Lot
You can park at the Galaxy Parking garage for the workshop. The Galaxy parking garage entrance is located off of Galaxy Way, which is the street just behind the Fox Plaza. As a reference, the Fox Plaza address is 2121 Avenue of the Stars, Los Angeles, CA 90064.

Have questions about Cloudera Hadoop Training: Basic? Contact the organizer

When & Where



Fox Studios Lot
10201 West Pico Blvd.
Building 104 (right next to the parking structure)
Los Angeles, CA 90035

Monday, July 27, 2009 at 9:00 AM - Tuesday, July 28, 2009 at 5:00 PM (PDT)


  Add to my calendar

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.