San Jose Big Data Analytics and Hadoop Training
$0 – $495

Location

Virtual Instructor Led Online | Video Conference

Omni212

San Jose, CA


Description


Next class starting

December 5, 2016


Video Conference link

Big Data Analytics & Hadoop Training
Mon, Dec 5, 2016 8:00 AM - 10:00 AM Pacific Standard Time

Please join my meeting from your computer, tablet or smartphone.
https://global.gotomeeting.com/join/721907101

You can also dial in using your phone.
United States +1 (669) 224-3412

Access Code: 721-907-101

First GoToMeeting? Try a test session: http://help.citrix.com/getready


Training Session Details

There will be 8 online sessions of 2-3 hours each. Every session includes a presentation on theory, concepts, and technology, followed by hands-on lab exercises.


Training Dates

December 5, 7, 9, 12, 14, 16, 19, 21

Timings: 8 AM - 10 AM US Pacific time

Each session will be recorded and the recordings will be shared after each session with students who have paid for the training.


Prerequisites

Desired but not required: exposure to, or working proficiency with, BI, SQL, scripting, data and database management, and Excel.


Software access

Every student will be provided with a Microsoft Azure cloud account, where they will install Hortonworks Hadoop on cloud virtual machines. Students will carry out the hands-on lab exercises with instructor guidance.


Course Outline


Session 1: Big Data Basics

• An introduction to Big Data

• Why Big Data? Why now?

• The Three Dimensions of Big Data (Three Vs)

• Evolution of Big Data

• Big Data versus Traditional RDBMS Databases

• Big Data versus Traditional BI and Analytics

• Big Data versus Traditional Storage

• Key Challenges in Big Data adoption

• Benefits of adoption of Big Data

• Introduction to Big Data Technology Stack

• Apache Hadoop Framework

• Introduction to Microsoft HDInsight – Microsoft’s Big Data Service

Hands-On Lab:

• Creating Azure Storage Account

• Creating HDInsight Cluster

• Using services on HDInsight Cluster


Session 2: The Big Data Technology Stack

• Basics of Hadoop Distributed File System (HDFS)

• Basics of Hadoop Distributed Processing (MapReduce jobs)

Hands-On Lab:

• Loading files to Azure storage account

• Moving files across HDInsight Cluster

• Remote Access to Azure Storage Account and HDInsight Cluster

Session 3: Deep dive into Hadoop Storage System (HDFS) (1 Hour)

• HDFS

• Reading files with HDFS

• Writing files with HDFS

• Error Handling

Hands-On Lab:

• Accessing Hadoop configuration files using HDInsight Cluster

Session 4: Processing Big Data – MapReduce and YARN

• How MapReduce works

• Handling Common Errors

• Bottlenecks with MapReduce

• How YARN (MapReduceV2) works

• Difference between MR1 and MR2

• Error Handling

Hands-On Lab:

• Running a simple MapReduce application (word count)

• Running a custom MapReduce application (census data)

• Running MapReduce via PowerShell

• Running a MapReduce application using PowerShell

• Monitoring application status
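
For readers new to the word-count exercise listed above, here is a minimal Python sketch of the map, shuffle, and reduce phases, run in a single process. It is an illustration only and not the course's lab material: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical, and a real Hadoop job would distribute these phases across the nodes of a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every whitespace-separated word in a line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Sum the counts emitted for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical sample input; the lab uses its own data files.
    print(word_count(["big data big", "data hadoop"]))
```

The split into three small functions mirrors how the mapper and reducer only ever see their own slice of the data, which is what lets the framework run them in parallel.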

Session 5: Big Data Development Framework

• Introduction to Hive

• Introduction to Pig

• HBase

Hands-On Lab:

• Loading data into Hive

• Submitting Pig jobs using HDInsight

• Submitting Pig jobs via PowerShell

Session 6: Big Data Integration and Management

• Big Data Integration using PolyBase

• Big Data Management using Ambari

Hands-On Lab:

• Fetching HDInsight data into SQL

• Using Ambari for managing HDInsight cluster

Session 7: Big Data – BI and Reporting using Power BI

• Introduction to Power BI

• Usual workflow of Power BI

• Power BI Ecosystem

• Getting Data into Power BI

• Reports vs Dashboards

• Additional elements of Power BI Reports

Hands-On Lab:

• Fetching HDInsight data into Power BI Desktop

• Data modelling using Power BI Desktop

• Creating reports using Power BI Desktop

Session 8: PowerBI.com services – Deep dive

• Power BI Dashboards

• Natural Language Query

• Power BI Workspaces – Personal and Group Workspaces

• Sharing using OneDrive for Business

Hands-On Lab:

• Publishing reports to PowerBI.com

• Sharing reports using OneDrive for Business

End-to-End Use Case Implementation – Lab Exercise

• Use case: Healthcare analytics using the Hadoop framework through Microsoft HDInsight and Power BI


Class Size: Maximum 22


Price: $495


PLEASE NOTE:
1. Pre-registration is FREE. You will be able to attend the first few sessions for FREE, subject to availability.
2. However, once the sessions with hands-on lab exercises begin, you will need to purchase a training ticket.
3. Each paying student gets access to a login account on the Microsoft Azure cloud, where they install Hadoop on a cloud virtual machine and perform hands-on lab exercises with instructor guidance. Two experienced Big Data and Hadoop instructors will support the students throughout the class.


ADVANTAGES OF PURCHASING A TRAINING TICKET:
1. Class recordings will be made available.
2. Post-class support.
3. Course material will be made available.
4. Cloud account on Microsoft Azure with hands-on lab exercises under the guidance of two experienced Big Data and Hadoop instructors.
5. Career advancement and job placement assistance.


REFUND POLICY:
1. A 100% refund will be provided only if we DO NOT hold the class, or if we reschedule the class and the new dates and times don't work for you.
2. If the class is held as scheduled and you don't show up, or if you register, purchase a training ticket, and then change your mind, we will not issue a refund.
