Looks like this event has already ended.

Check out upcoming events by this organizer, or organize your very own event.

View upcoming events Create an event

Hadoop World: Developer Training + Certification

Cloudera

Wednesday, October 13, 2010 at 9:00 AM - Thursday, October 14, 2010 at 5:00 PM (EDT)

New York, United States

Hadoop World: Developer Training + Certification

Ticket Information

Ticket Type Sales End Price Fee Quantity
Regular Registration Ended $1,695.00 $0.00
Groups (5+) Ended $1,395.00 $0.00
SHARE THIS EVENT

Event Details

This is an intermediate-level session for developers who want to leverage Hadoop and MapReduce.  Developers will learn the MapReduce framework and how to write programs against its API.  In addition to learning how to write individual MapReduce jobs, we will discuss design techniques for larger workflows. Finally, we'll discuss advanced skills for debugging MapReduce programs and optimizing their performance.  This training will be augmented by hands-on exercises. 

If you have no prior experience with Hadoop, you will want to take our one-day Introduction to Hadoop course. If you'd like to go deeper after this session, you might also consider one-day courses on HBase.

Agenda:

  • Leveraging Hadoop and MapReduce
    • MapReduce and HDFS
    • Hands-on Exercise: Getting familiar with Hadoop
    • Programming with Hadoop
    • Hands-on Exercise: running a MapReduce job
  • Augmenting Existing Systems with Hadoop
    • Hadoop rarely replaces existing infrastructure, but rather enables you to do more with your data by providing a scalable batch processing system. This section helps you understand how it all fits together.
  • Best Practices for Data Processing Pipelines
    • In order for Hadoop to crunch large volumes of data, first you'll need to get that data into Hadoop. This section will help you understand how to import different types of data from various sources into Hadoop for further analysis.
  • Debugging MapReduce Programs
    • Debugging in the distributed environment is challenging. This lecture will expose you to best practices for program design to mitigate debugging challenges, as well as local testing tools and techniques for debugging at scale.
  • Advanced Hadoop API
    • This lecture probes into the API, covering custom data types and file formats, direct HDFS access,  intermediate data partitioning, and other tools such as the DistributedCache.
  • Optimizing MapReduce Programs
    • This section will discuss advanced topics for optimizing MapReduce jobs.
  • Advanced Algorithms
    • This lecture introduces some graph algorithms that can be adapted for your needs, as well as more involved examples like PageRank. We'll also look at strategies for implementing joins efficiently, and compare different techniques that are appropriate to different data models.
  • Cloudera Certified Hadoop Developer exam

When & Where


One New York Plaza
31st Floor
New York, 10004

Wednesday, October 13, 2010 at 9:00 AM - Thursday, October 14, 2010 at 5:00 PM (EDT)


  Add to my calendar

Organizer

Cloudera

Cloudera brings Hadoop to enterprise users. We provide a certified distribution based on the most recent stable release from Apache, online and live training, as well as commercial support.


 

  Contact the Organizer

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.