Data Vault 2.0 Boot Camp & Private Certification
Event Information
Description
Data Vault 2.0 Boot Camp and Private Certification
Your First Chance in 2016 to take the course directly from Dan Linstedt
NEW FORMAT: 4 Day Class - Includes lecture and 1/2 day workshop + 1/2 day exam
Get Certified Now, Don't Wait!
NEW MATERIALS!!
We will be discussing DV2 on Hive / Hadoop, the benefits, pros and cons, some suggestions on how to build it and leverage it properly. We will be talking about Satellites on HDFS, Hubs & Links on Hive. We'll discuss data modeling implications, and using SERDe definitions at query time. This is the first time ever that this information will be presented in the DV2 class!
Learning Objectives:
- How and when to apply KPA’s and KPI’s for measurement and optimization in business intelligence programs
- What the impacts are of CMMI Level 5 optimization on data warehousing methodologies
- The best practices for automation and generation of ETL / ELT that is highly scalable
- How Data should be laid out in MPP formats
- What Co-Location is, and data re-distribution in MPP environments
- How to deal with joins to unstructured and semi-structured data sets
- What the difference is between Schema on Read and Schema on Write
- How to seamlessly integrate BIG DATA solutions to existing relational database systems
- How to model your data warehouse using Data Vault 2.0 Modeling techniques
These topics, and much much more are covered in the class. The end objective is to enhance your skill set, so that you are a qualified practitioner / expert in Enterprise Business Intelligence projects. Whether it’s data acquisition, data provisioning, change management, or project management you now have the skills to deal with whatever is thrown your way.
Class Agenda: (http://datavaultcertification.com/data-vault-certification/class-agenda/)
- Introduction and Business Justification
- What’s new in Data Vault 2.0
- BI Issues Faced Today
- Uncovering Additional Market Drivers
- Value of Data Vault 2.0
- Managed Self-Service BI (Introduced)
- Data Vault 2.0 – A Valuable Skill Set
- Keys To Success
- Why Data Vault 2.0 is a differentiator
- Who’s using it?
- Endorsements
- Data Vault 2.0 Methodology & Disciplined Agile Delivery
- What are the agility issues?
- What are we building in BI?
- Methodology and Rapid Delivery
- Agile Requirements Gathering
- Technical Numbering
- Team Management
- Roles and Releases
- Methodology: DV2 and CMMI
- KPI’s and Estimations
- Service Level Agreements
- Data Vault 2.0 Systems Architecture
- Data Vault in Business (A Business Case)
- Managed Self Service BI (A little deeper)
- Data Vault 2.0 Architecture Overview
- Big Data / NoSQL - Into to DV on Hive
- Defining Each Architectural Component
- Lowering TCO for the Business
- Unterstanding the Drill & Hadoop Architecture
- Data Vault 2.0 Modeling
- Concept Modeling
- Ontology Modeling
- Business Process Modeling
- Changes to modeling for DV on Hive
- Hashing and Sequences
- Common Terminology
- Core Data Vault Entity Types
- All about Business Keys
- Link to Link & De-Normalization
- Dependent Child & Weak Hub
- Link – Unit of Work
- Link – Driving Key Concepts
- Link – Non-Historized
- Link – Hierarchies and Same As
- Satellite – Applications
- Satellite – Record Source Tracking
- JSON & XML Satellites
- Schema on Read
- Point-in-Time and Bridge Table
- Building a DV Model from a Report
- Building a DV Model from a Dimension
- Data Vault 2.0 Implementation
- Applying Set Logic
- Loading Architecture
- NoSQL & Hive Raw DV Processing Architecture
- Delta / No Delta on Hive
- Data Vault 1 vs Data Vault 2 Load Performance
- Loading Staging Areas
- Loading DV2 Templates
- Big Data and Co-Location (MPP)
- Dealing with Corrupted Data
- Loading Star Schemas
- Example Data Model and Load Code
- Querying the Data Vault
The morning of the 4th day, we offer a hands on working session, followed by a Q&A review and the certification exam. For the first time ever, we will be teaching Raw Data Vault and Staging areas on Hive. We discuss the pros and cons of utilizing Hadoop for Data Warehousing. We introduce the architecture needed to make it work, and we share some of the best kept secrets in the Big Data space!
THIS MAY BE YOUR ONLY CHANCE DURING 2016 TO GET CDVP2 Certification Directly from Dan. Don't wait, register today!