Sales Ended

2017 HPCC Systems Summit Community Day

Event Information

Share this event

Date and Time



Ritz Carlton Buckhead

3434 Peachtree Rd

Atlanta, GA 30326

View Map

Event description


HPCC Systems

LexisNexis Risk Solutions will be hosting the 4th annual HPCC Systems Summit Community Day on Wednesday October 4, 2017. The purpose of the Summit is to gather engineers, data scientists and technology professionals to share knowledge and future roadmap plans for the HPCC Systems platform. This event is dedicated to showcase our community and have industry and academia present their HPCC Systems use cases, research projects and share their experience on how they leverage the HPCC Systems platform.

As part of Community Day, we are excited to hold our 2nd annual HPCC Systems Technical Poster Presentations Competition on Tuesday October 3, showcasing the work of our academia community. The posters will remain on display throughout the October 4 Community Day event. Last year was a big success and we are thrilled to host the competition again this year! We will also welcome our HPCC Systems Sponsored Robotics teams this year who will have their autonomous and semi-autonomous robots on display throughout the 2-day event.

New this year, we are also offering a pre-event workshop, "Mastering Your Big Data with ECL", on October 3 to our external Community Day participants wanting to understand the HPCC Systems platform and learn ECL to build powerful data queries. See the full curriculum below.

The Community Day event on October 4 will be live streamed via the HPCC Systems YouTube channel. The agenda will tentatively run from 8:30am – 5:00pm ET. We will have a fantastic line-up of speakers featuring industry experts and thought leaders and are currently finalizing the agenda. View a sneak peek of what is planned!

Seating is limited, so your timely response is appreciated no later than September 20.

Ticket Options:

  • 2 DAY PASS - October 3 & 4: $150 (50% discount off Community Day fee) to include October 3 training workshop and October 4 Community Day event.

  • 1 DAY PASS - October 3 Training Workshop: $100 to include course, materials, food & beverages.

  • 1 DAY PASS - October 4 Community Day Event: $100 to include access to all talks, poster presentations, networking, food & beverages.

STUDENT DISCOUNT - We are offering current students a 50% discount. Please email from a valid university email address for more information.

October 3, 2017 - Training Workshop: Mastering Your Big Data with ECL

This class is for Community Day participants who want to understand the HPCC Systems platform and learn ECL to build powerful data queries. Anyone who needs a basic familiarity and learn best practices with ECL should attend.

The one day class will take the student through the entire ETL cycle from Spray (Extract) to Transform (THOR) and finally to Load (ROXIE). Code examples and hands-on lessons will be included.

Course Length: 1 day, October 3, 9 AM - 4 PM, Lunch provided.

Course Price: $100 (Going to Community Day as well on October 4? Select the 2-DAY pass and enjoy a 50% discount off the $100 Community Day registration fee. Attend both days for $150!

Class Prerequisites - REQUIRED: Laptop with pre-installation of ECL IDE and connection to training cluster or HPCC VM.

Instructor: Bob Foreman, Senior Software Engineer, LexisNexis Risk Solutions

Topics include:

Part 1: Data Extraction and Transformation

  • Quick overview of THOR cluster, and the parallel distributed data processing concept.
  • Setting up an HPCC Cluster (VM Installation, AWS)
  • ECL Watch overview
  • Spray data to cluster (KJV text file)
  • Getting started with ECL IDE.
  • ECL Language Essentials, Syntax Rules
  • Define RECORD and DATASET
  • Introduction to Data Transformation with ECL (TRANSFORM)
  • Initial cleaning using ROLLUP (discuss DEDUP here as well)
  • Introduce Standard Library Reference Tools
  • Introduction to custom ECL Functions (passing parameters)
  • Transform of free form text to a structured format:
    • Using TABLE
    • Using PROJECT
  • Using ITERATE to seed new field data

Part 2: Prepare the Data Search Engine

  • Defining an INDEX, why?
  • Building an INDEX
  • Getting single results from INDEX
  • Getting batch results from INDEX
  • Design and build word search (Inverted) INDEX
  • Preparing data for Indexing
  • Filtering and Normalization
  • Searching Test
    • Two Word Test (JOIN)
    • Multi-word search (GraphBody FUNCTION)
    • Search Implementation
      • Search FUNCTION
      • PROJECT GRAPH result to RECORD format

Part 3: Write and Publish ROXIE query

  • Call Search created in Part 2
  • Implicit function to convert Book Number to Book Abbreviation
  • Create post processing to process full text record.
  • Publish in ECL Watch
  • Test in WS-ECL

A big thanks to our sponsors!



Datum Software

Share with friends

Date and Time


Ritz Carlton Buckhead

3434 Peachtree Rd

Atlanta, GA 30326

View Map

Save This Event

Event Saved