Skip Main Navigation
Page Content
This event has ended

Cloudera Administrator Training for Apache Hadoop - SF bay area - Feb 13-15

Cloudera

Monday, February 13, 2012 at 9:00 AM - Wednesday, February 15, 2012 at 5:00 PM (PST)

Redwood City, CA

Cloudera Administrator Training for Apache Hadoop - SF...

Ticket Information

Ticket Type Sales End Price Fee Quantity
Early Bird: Administrator Training + Certification ($100 savings) Ended $2,195.00 $0.00
Administrator Training + Certification Ended $2,295.00 $0.00

Share Cloudera Administrator Training for Apache Hadoop - SF bay area - Feb 13-15

Event Details

This three-day hands-on training course from Cloudera is for system administrators and others responsible for managing Apache Hadoop clusters in production or development environments.

You will learn

  • How the Hadoop Distributed File System and MapReduce work
  • What hardware configurations are optimal for Hadoop clusters
  • What network considerations to take into account when building out your cluster
  • How to configure Hadoop's options for best cluster performance
  • How to configure the FairScheduler to provide service-level agreements for multiple users of a cluster
  • How to maintain and monitor your cluster
  • How to load data into the cluster from dynamically-generated files using Flume, and from relational database management systems using Sqoop
  • What system administration issues exist with other Hadoop projects such as Hive, Pig, and HBase

Hands-On Labs

Throughout the course, hands-on labs help students build their knowledge and apply the concepts being discussed.

Certification Exam

Following the training, attendees will have an opportunity to become a Cloudera Certified Administrator for Apache Hadoop (CCAH).

Course Pre-Requisites

This course is designed for people with at least a basic level of Linux system administration experience. Prior knowledge of Hadoop is not required.


Course Contents

The course covers the following topics:

  • An Introduction To Hadoop And HDFS 
    • Why Hadoop?
    • HDFS
    • MapReduce
    • Hive, Pig, HBase and other ecosystem projects
    • Hands-On Exercise: Installing a pseudo-distributed cluster
  • Planning Your Hadoop Cluster 
    • General Planning Considerations
    • Choosing The Right Hardware
    • Node Topologies
    • Choosing The Right Software
  • Deploying Your Cluster 
    • Installing Hadoop
    • Using SCM Express for easy installation
    • Typical Configuration Parameters
    • Configuring Rack Awareness
    • Using Configuration Management Tools
    • Hands-On Exercise: Installing a Hadoop Cluster
  • Managing and Scheduling Jobs
    • Starting and stopping MapReduce jobs
    • Hands-On Exercise: Managing jobs
    • The FIFO Scheduler
    • The Fair Scheduler
    • Hands-On Exercise: Using the FairScheduler
  • Cluster Maintenance 
    • Checking HDFS with fsck
    • Hands-On Exercise: Breaking the Cluster
    • Copying data with distcp
    • Rebalancing cluster nodes
    • Adding and removing cluster nodes
    • Hands-On Exercise: Verifying the Cluster's Self-Healing Features
    • Backup And Restore
    • Upgrading and Migrating
    • Hands-On Exercise: Backing Up and Restoring the NameNode Metadata
  • Cluster Monitoring, Troubleshooting and Optimizing 
    • Hadoop Log Files
    • Using the NameNode and JobTracker Web UIs
    • Interpreting Job Logs
    • Monitoring with Ganglia
    • Other monitoring tools
    • General Optimization Tips
    • Benchmarking Your Cluster
  • Populating HDFS From External Sources
    • Using Sqoop
    • Using Flume 
    • Best Practices for Data Ingestion
  • Installing And Managing Other Hadoop Projects
    • Hive
    • Pig
    • HBase
    • Hands-On Exercise: Configuring the Hive Shared Metastore
  • Cloudera Certified Administrator Exam
Have questions about Cloudera Administrator Training for Apache Hadoop - SF bay area - Feb 13-15? Contact Cloudera

When & Where


Seaport Conference Center
459 Seaport Ct
Redwood City, CA 94063

Monday, February 13, 2012 at 9:00 AM - Wednesday, February 15, 2012 at 5:00 PM (PST)


  Add to my calendar

Organizer

Cloudera

Cloudera brings Hadoop to enterprise users. We provide a certified distribution based on the most recent stable release from Apache, online and live training, as well as commercial support.


 

  Contact the Organizer

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.