IBM is offering a free hands-on lab on Apache Spark for clients and practitioners. This is a full day of Spark education with hands-on exercises, taught in person by Spark experts. The lab will provide a detailed overview of Apache Spark. The exercises will be performed in Jupyter notebooks with publicly available datasets. Participants will use IBM’s fully managed cloud platform, which is free for educational purposes.
Who should go:
Anyone interested in learning more about Apache Spark
Attendees should have a working knowledge of coding (Python and/or Scala preferred) and an understanding of distributed computing, Spark, and SQL.
What to expect:
A full day of lectures and hands-on exercises tackling real-world data challenges using Apache Spark. In 8 hours you will learn the essentials of Apache Spark and why it's important to your organization. This workshop will focus on data wrangling and machine learning.
Please sign up for free Bluemix (www.bluemix.net) and DSX (http://datascience.ibm.com) accounts ahead of time. ***** You must bring your own laptop *****
Additionally, please download the files from this GitHub repository to your computer before the class:
Breakfast and lunch will be served.
Full Day Agenda:
8:30am – 9:00am Breakfast, Socialize
9:00am – 10:00am Kickoff, Apache Spark Overview
10:00am – 11:00am Lab 1, Hello Spark - Hands-on exercises
11:00am – 11:15am Apache Spark SQL Overview
11:15am – 12:00pm Lab 2, Spark SQL - Hands-on exercises
12:00pm – 1:00pm Lunch
1:00pm – 1:30pm Overview of Data Science & Machine Learning w/ Apache Spark
1:30pm – 2:30pm Lab 3, Machine Learning w/ Spark - Hands-on exercises
2:30pm – 3:30pm SparkR, Spark Streaming demonstrations
3:30pm – 4:00pm Wrap up – Feedback from attendees