This event has ended

Introduction to Data Mining and Predictive Analytics

RapidMiner

Tuesday, August 12, 2014 at 8:30 AM - Wednesday, August 13, 2014 at 5:00 PM (EDT)

Ticket Information

All ticket sales have ended. Prices are per ticket; the fee on every ticket type was $0.00.

  • Early Bird: $1,525.00
    • Save $300 by registering at least 10 days before the class starts.
  • DUO (2-course package): $1,425.00
    • Buy both of these consecutive courses at the same time and save $800. Note: You must purchase the DUO ticket for each class on the same day for this rate to be honored.
  • Regular: $1,825.00
    • Regular price ticket for the training course.
  • Group of 2: $1,625.00
    • Save $200 each when buying 2 regular tickets at the same time.
  • Group of 3: $1,500.00
    • Save $325 each when buying 3 regular tickets at the same time.
  • Group of 4: $1,325.00
    • Save $500 each when buying 4 or more regular tickets at the same time.

Event Details

Introduction to Data Mining and Predictive Analytics with RapidMiner

Tuesday, August 12th and Wednesday, August 13th from 8:30 am to 5 pm

A light breakfast, lunch, and afternoon snacks will be provided.

Be sure to check out our Intermediate Data Mining and Predictive Analytics course being held in the same location on August 14 and 15. Register for both at the same time to get a special discount.

Class size for this event is limited to 12 students. If the class is sold out and you wish to be added to a waiting list, please contact the event organizer.

This course is a two-day introduction to the foundations of data mining, predictive analytics, and RapidMiner software. To ground these topics in a business context, we will develop a specific business scenario as a through line during the course. The class follows a learn-do model: students first focus on new material as it is explained, then apply that understanding on their own in a lab exercise.

After successfully completing this training, participants will have an understanding of how RapidMiner Studio and RapidMiner Server work and are used. They will be able to create predictive models in the standard data environments found within most analyst positions.

Practical exercises during class prepare participants to apply what they learn to their own data mining problems and solve them more quickly and easily. Because the labs are hands-on and performed on the students' own laptops, students take their classwork home with them, giving them a head start on applying it to real-world problems.

After this course, participants will be able to:

  • perform basic data preparations
  • build initial predictive models
  • evaluate model quality
  • score new data sets


Details

  • Target audience: newcomers, analysts, developers, administrators
  • Previous knowledge: basic knowledge of computer programs and mathematics
  • Methods: lectures, discussions, individual and group work, exercises on realistic data.


Participants may bring their own work and project-specific questions and work out solutions together with the trainer and other participants. The course is aimed at beginners and intermediate learners.

 

Topics

  • Overview
    • Business Scenario
    • Analytics
    • Data Mining in the Enterprise
    • CRISP-DM
  • Basic Usage
    • User Interface
    • Creating and handling RapidMiner repositories
    • Starting a new RapidMiner project
    • Operators and processes
    • Loading data
    • Storing data, processes, and results
  • EDA: Exploratory Data Analysis
    • Data Types
    • Data Hierarchy
    • Quick Summary Statistics
    • Visualizing Data
    • Charting
  • Data Preparation
    • Normalization and standardization
    • Basic transformations of value types
    • Handling missing values
    • Sampling
    • Filtering examples and attributes
    • Handling attribute roles
  • Building Better Processes
    • Organizing
    • Renaming
    • Relative Path
    • Flow Control
    • Subprocesses
    • Building Blocks
    • Breakpoints
  • Predictive Models
    • Correlations
    • k-Nearest Neighbor
    • Naïve Bayes
    • Linear Regression
    • Rules
    • Decision Trees
    • Importance of attributes
  • Model Evaluation
    • Applying models
    • Overfitting
    • Splitting data
    • Evaluation methods
    • Performance criteria
  • Sharing and Collaboration
    • Exporting images
    • RapidMiner Server


Prerequisites

You must bring a laptop to class (Windows, Mac, or Linux). On Windows, Java Runtime Environment (JRE) version 7 is required. On Mac and Linux, Java Development Kit (JDK) version 7 is required. Students will be provided with links to install RapidMiner Studio 6 prior to the class.


Instructor

David Weisman, PhD

David Weisman is a data science consultant with over 35 years of experience in the software field. In addition to consulting, he is a researcher at the University of Massachusetts Boston, working at the intersection of molecular biology and data mining. David searches for cancer biomarkers in enormous volumes of DNA sequence data, identifies biosensors of environmental pollutants in bacterial and plant transcriptomic data, and teaches bioinformatics courses. Prior to obtaining his recent Ph.D. in molecular biology, David ran a successful, long-running software consulting firm specializing in distributed system development, compiler design, operating system development, quantitative finance, network security, and health care informatics.

 

Training Facility Logistics

The training will be held in Cambridge, MA at the RapidMiner Headquarters training center. The facility is near the Alewife T (metro) stop and other public transportation.

There are a variety of lodging options nearby and in other parts of Cambridge and Boston, from which the facility can be reached on foot, by bike, or by public transportation.

The closest reliable public parking is at the Alewife T station. A free public shuttle runs from Alewife to RapidMiner HQ, though the walk is short (perhaps ten minutes).

___

A class requires a minimum of 3 registered students by August 4 in order to be held. If there are insufficient registrants, the class may be cancelled, and all students will be refunded the full registration fee. Students should plan their travel arrangements with this proviso in mind.

___

Can't make it? Sign up for our newsletter to stay in the loop on future events and classes by clicking on the Subscribe button at the top of any page on www.rapidminer.com.

___

Our Refund Policy: Plans change? We get it. But if you can't make it to the class, please email us at training@rapidminer.com no later than August 4.  No refunds will be given after this date.

When & Where


RapidMiner HQ
10 Fawcett Street
5th Floor, Suite 502
Cambridge, MA 02138

Tuesday, August 12, 2014 at 8:30 AM - Wednesday, August 13, 2014 at 5:00 PM (EDT)


Organizer

RapidMiner

RapidMiner is the industry's easiest-to-use Modern Analytics platform that significantly accelerates productivity – from data prep to predictive action – with prebuilt models and one-click deployments. Leveraging its open source heritage, RapidMiner was built by data scientists for data scientists, business analysts, and developers. Unlike traditional analytics providers, RapidMiner enables anyone to make the most of all data in all environments by providing a powerful code-free advantage and the wisdom of over 250,000 users around the world.

RapidMiner offers training courses for business analytics, data mining, predictive analytics, predictive reporting, text and web mining, and related topics. Below are upcoming courses in the USA. We also offer courses in the UK and Germany.
