Introduction to THOR - the Extract, Transform and Load (ETL) Process
This class is for developers who want to extend their knowledge of ECL to extract, transform, and load (ETL) data in the HPCC Systems environment. Anyone planning to write or work with ECL code should attend this course.
Course Length: 2 days
Class Prerequisites: Students must have attended the Introduction to ECL Training class. Students are welcome to bring their own laptops so they can take the code and examples from the class home with them.
Course Topics:
- Principles of ETL in ECL
- The TABLE function (Memory Tables)
- TRANSFORM functions (PROJECT, etc.)
- Data Hygiene (Cleaning and Standardization)
- Lookup Tables
- OUTPUT to disk files
- Simple JOINs
Class begins Wednesday at 1pm and concludes at 5pm.
Class begins Thursday at 9am and concludes at 5pm.
Class begins Friday at 9am and concludes at 12pm.
HPCC Systems (High Performance Computing Cluster) is an open-source, massively parallel computing platform for big data processing and analytics.