Skip Main Navigation
Page Content
This event has ended

Cloudera Administrator Training for Apache Hadoop - SF bay area - Feb 13-15

Cloudera

Monday, February 13, 2012 at 9:00 AM - Wednesday, February 15, 2012 at 5:00 PM (PST)

Cloudera Administrator Training for Apache Hadoop - SF...

Ticket Information

Ticket Type Sales End Price Fee Quantity
Early Bird: Administrator Training + Certification ($100 savings) Ended $2,195.00 $0.00
Administrator Training + Certification Ended $2,295.00 $0.00

Share Cloudera Administrator Training for Apache Hadoop - SF bay area - Feb 13-15

Event Details

This three-day hands-on training course from Cloudera is for system administrators and others responsible for managing Apache Hadoop clusters in production or development environments.

You will learn

  • How the Hadoop Distributed File System and MapReduce work
  • What hardware configurations are optimal for Hadoop clusters
  • What network considerations to take into account when building out your cluster
  • How to configure Hadoop's options for best cluster performance
  • How to configure the FairScheduler to provide service-level agreements for multiple users of a cluster
  • How to maintain and monitor your cluster
  • How to load data into the cluster from dynamically-generated files using Flume, and from relational database management systems using Sqoop
  • What system administration issues exist with other Hadoop projects such as Hive, Pig, and HBase

Hands-On Labs

Throughout the course, hands-on labs help students build their knowledge and apply the concepts being discussed.

Certification Exam

Following the training, attendees will have an opportunity to become a Cloudera Certified Administrator for Apache Hadoop (CCAH).

Course Pre-Requisites

This course is designed for people with at least a basic level of Linux system administration experience. Prior knowledge of Hadoop is not required.


Course Contents

The course covers the following topics:

  • An Introduction To Hadoop And HDFS 
    • Why Hadoop?
    • HDFS
    • MapReduce
    • Hive, Pig, HBase and other ecosystem projects
    • Hands-On Exercise: Installing a pseudo-distributed cluster
  • Planning Your Hadoop Cluster 
    • General Planning Considerations
    • Choosing The Right Hardware
    • Node Topologies
    • Choosing The Right Software
  • Deploying Your Cluster 
    • Installing Hadoop
    • Using SCM Express for easy installation
    • Typical Configuration Parameters
    • Configuring Rack Awareness
    • Using Configuration Management Tools
    • Hands-On Exercise: Installing a Hadoop Cluster
  • Managing and Scheduling Jobs
    • Starting and stopping MapReduce jobs
    • Hands-On Exercise: Managing jobs
    • The FIFO Scheduler
    • The Fair Scheduler
    • Hands-On Exercise: Using the FairScheduler
  • Cluster Maintenance 
    • Checking HDFS with fsck
    • Hands-On Exercise: Breaking the Cluster
    • Copying data with distcp
    • Rebalancing cluster nodes
    • Adding and removing cluster nodes
    • Hands-On Exercise: Verifying the Cluster's Self-Healing Features
    • Backup And Restore
    • Upgrading and Migrating
    • Hands-On Exercise: Backing Up and Restoring the NameNode Metadata
  • Cluster Monitoring, Troubleshooting and Optimizing 
    • Hadoop Log Files
    • Using the NameNode and JobTracker Web UIs
    • Interpreting Job Logs
    • Monitoring with Ganglia
    • Other monitoring tools
    • General Optimization Tips
    • Benchmarking Your Cluster
  • Populating HDFS From External Sources
    • Using Sqoop
    • Using Flume 
    • Best Practices for Data Ingestion
  • Installing And Managing Other Hadoop Projects
    • Hive
    • Pig
    • HBase
    • Hands-On Exercise: Configuring the Hive Shared Metastore
  • Cloudera Certified Administrator Exam
Have questions about Cloudera Administrator Training for Apache Hadoop - SF bay area - Feb 13-15? Contact Cloudera

When & Where


Seaport Conference Center
459 Seaport Ct
Redwood City, CA 94063

Monday, February 13, 2012 at 9:00 AM - Wednesday, February 15, 2012 at 5:00 PM (PST)


  Add to my calendar

Organizer

Cloudera

Cloudera brings Hadoop to enterprise users. We provide a certified distribution based on the most recent stable release from Apache, online and live training, as well as commercial support.


 

  Contact the Organizer
Cloudera Administrator Training for Apache Hadoop - SF bay area - Feb 13-15
Redwood City, CA Events Class

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.