Remote Training: Advanced ECL (Part 2)

Remote Training: Advanced ECL (Part 2)

This course explores the concept of Super Files in ECL and the techniques for working with XML and JSON data, and much more!

By HPCC Systems

Date and time

May 15 · 6am - May 16 · 1:30pm PDT

Location

Online

Refund Policy

Contact the organizer to request a refund.

About this event

  • 1 day 7 hours

Remote Training: Advanced ECL (Part 2) – Super files, Working with XML, and Free-form Text Parsing

The class will be held remotely via Microsoft Teams. Your instructor will contact you before the class with login information.

RELX and LexisNexis employees should register using your work email address and the approved promo code provided by their manager or contact training@hpccsystems.com.

Note: This course is the second in a two-part advanced series. People who attend the Advanced ECL (Part 1) – Working with Relational Data class should continue with this class.

This course explores the concept of Super Files in ECL and the techniques for working with XML and JSON data, getting it into your HPCC Systems platform, and defining it to work with other data elements. This flows naturally into the detailed ECL support of Natural Language Parsing – creating pattern-matching definitions and using the PARSE function to extract data from either XML or free-form text.

Course Length: 2 days

Class Prerequisites: Introduction to ECL (Part 1) - Concepts and Queries and Introduction to ECL (Part 2) - Introduction to ECL (Part 2) – Data Profiling and Transformation training Students are welcome to bring their own laptops to take away the code and examples from the class.

Topics include:

  • SuperFiles and SuperKeys
  • Simple XML Spray and Dataset Definition
  • Working with XML Data (simple, complex, and nested)
  • Complex XML Spraying and De-spraying
  • PARSE with XML Data
  • Spraying and defining free-form text data
  • PARSE with free-form text
  • New! An Introduction to Machine Learning

Class begins at 9am and concludes at 4:30pm. Breaks will be provided throughout the day for lunch and lab assignments. Your instructor will provide more information before the class begins.

Organized by

HPCC Systems is an open source, massive parallel-processing computing platform for big data processing and analytics.

$1,995