This event has ended

Cloudera Hadoop Training: Intermediate

Cloudera

Tuesday, July 28, 2009 from 9:00 AM to 5:00 PM (PDT)

Los Angeles, CA

Cloudera Hadoop Training: Intermediate

Share Cloudera Hadoop Training: Intermediate

Event Details

Note: We will be hosting this training session in partnership with Fox Interactive Media. This will be on the Fox lot, which has strict security protocols. As such, we need to close registrations the day before to allow time to get your name on the entry list. Registration closes on July 27th at 1PM PDT.

Cloudera's Intermediate Hadoop Training builds on our basic training, and is appropriate for those who are already familiar with Hadoop basics and the MapReduce programming model.

Those wishing to document their skills and receive the Cloudera Certified Hadoop Professional (CCHP) credential may take the certification exam after advanced training on the following day.

Intermediate training focuses on importing data into Hadoop and building data processing pipelines. We'll cover more advanced topics such as Hive and Pig and show participants how to use each effectively.

Cloudera will give a series of lectures, interleaved with practical, hands on examples and exercises. Attendees must bring their own laptop with VMware Player (or Fusion for Mac) so they may follow along in a pre-configured virtual machine Cloudera provides.

During this all day session, we will cover the following agenda with ample time for questions:

Lecture:

Augmenting existing systems with Hadoop

To introduce our intermediate trainign session, we'll take a step back and look at data systems more generally. Hadoop rarely replaces existing infrastructure, but rather enables you to do more with your data by providing a scalable batch processing system. This lecture helps you understand how it all fits together.

 Lecture:

Best Practices for Data Processing Pipelines

In order for Hadoop to crunch large volumes of data, first you'll need to get that data into Hadoop. This lecture will help you understand how to import different types of data from various sources into Hadoop for further analysis.

Exercise:

Importing existing databases with Sqoop

Sqoop is a command line tool developed by Cloudera and contributed to the Hadoop project. It provides an easy way to import data from RDBMSs and enable you to work with that data directly using MapReduce, Hive, or Pig.

 Lecture:

Introduction to Pig

Pig is a high-level language for large-scale data analysis programs. Pig exposes many common MapReduce constructs in an simplified processing language, and is often used for ad-hoc analysis.

 Exercise:

 Working with Pig

In this exercise, we'll revisit some common tasks and see how you can accomplish them using Pig.

 Lecture:

 Introduction to Hive - A Data Warehouse for Hadoop

Hive is a powerful data warehousing application built on top of Hadoop which allows you to use SQL to access your data. This lecture will give an overview of Hive and the query language.

 Exercise:

Working with Hive

This exercise will show you exactly how to work with Hive. We'll walk through importing data, creating tables, and making queries.

We will take a one hour break around noon for lunch. There are many places to eat while exploring the Fox lot.

Security and Parking:

Security:
At the Galaxy Parking Lot, please let the guard know you are there for the "Fox Audience Network Hadoop Workshop."

Parking:
Galaxy Parking Lot
You can park at the Galaxy Parking garage for the workshop. The Galaxy parking garage entrance is located off of Galaxy Way, which is the street just behind the Fox Plaza. As a reference, the Fox Plaza address is 2121 Avenue of the Stars, Los Angeles, CA 90064.

Have questions about Cloudera Hadoop Training: Intermediate? Contact Cloudera

When & Where



Fox Studios Lot
10201 West Pico Blvd.
Building 104 (right next to the parking structure)
Los Angeles, CA 90035

Tuesday, July 28, 2009 from 9:00 AM to 5:00 PM (PDT)


  Add to my calendar

Organizer

Cloudera

Cloudera brings Hadoop to enterprise users. We provide a certified distribution based on the most recent stable release from Apache, online and live training, as well as commercial support.


 

  Contact the Organizer

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.