Hadoop Summit 2010: Intro to Hadoop
Monday, June 28, 2010 from 9:00 AM to 4:00 PM (PDT)
Santa Clara, CA
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large-scale data processing with MapReduce and Hadoop. This session is appropriate for attendees who are new to Hadoop and may never have used the software before, specifically:
- Developers with no Hadoop experience
- Architects and Technical Leaders seeking to understand where Hadoop is appropriate, and how it fits with existing systems
- Users interested in conducting data analysis using higher-level languages. This course provides the necessary background for our one-day course on Hive.
Attendees seeking more in-depth knowledge may also attend our intermediate two-day developer training (with certification option) on Wednesday and Thursday.
We'll cover the following topics, with ample time for questions and discussion of various use cases.
Thinking at Scale: Introduction to Hadoop
You know your data is big – you found Hadoop. What implications must you consider when working at this scale? This lecture addresses common challenges and general best practices for scaling with your data.
MapReduce and HDFS
These tools provide the core functionality to allow you to store, process, and analyze big data. This lecture "lifts the curtain" and explains how the technology works. You'll understand how these components fit together and build on one another to provide a scalable and powerful system.
The Hadoop Ecosystem
An introduction to other projects surrounding Hadoop, which complete the greater ecosystem of available large-data processing tools.
Introduction to MapReduce Algorithms
Writing programs for MapReduce requires analyzing problems in a new way. This lecture shows how some common functions can be expressed as part of a MapReduce pipeline.
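For a rough sense of what "expressed as part of a MapReduce pipeline" means, here is a minimal sketch of the classic word-count problem split into map, shuffle, and reduce steps. It is an illustration only, not course material: it runs in a single Python process rather than on Hadoop, it does not use the Hadoop API, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical names chosen for this example.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group emitted values by key, standing in for the framework."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reduce: collapse all values for one key into a single total."""
    return word, sum(counts)


def word_count(lines):
    """Run the three steps in order over an in-memory list of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    sample = ["the quick brown fox", "the lazy dog", "The fox"]
    print(word_count(sample))
```

The point of the split is that each `map_phase` call looks at only one line and each `reduce_phase` call looks at only one key, so in a real cluster both steps could run on many machines at once, with the grouping in between handled by the framework.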
Once you understand the basics of working with Hadoop and writing MapReduce applications, you'll need to know how to get Hadoop up and running for your own processing (or at least get your ops team pointed in the right direction). Before ending the day, we'll make sure you understand how to deploy Hadoop on servers in your own datacenter or on Amazon's EC2.
Cloudera brings Hadoop to enterprise users. We provide a certified distribution based on the most recent stable release from Apache, online and live training, as well as commercial support.