This event has ended

Advanced Text and Web Mining Techniques with RapidMiner and RapidAnalytics

RapidMiner

Thursday, September 26, 2013 at 8:30 AM - Friday, September 27, 2013 at 5:00 PM (EDT)

Boston, MA

Advanced Text and Web Mining Techniques with RapidMiner and ...

Ticket Information

Ticket Type Remaining Sales End Price Fee Quantity
Early Bird – SAVE $300
Early Bird pricing. Save $300 for registering in advance before Sept. 16 at 5 pm (EDT).
7 Tickets Ended $1,525.00 $0.00
DUO - 2 course Special
Buy both the consecutive Intro and Advanced courses at the same time and save $800! You must buy the DUO ticket on each site at the same time.
7 Tickets Ended $1,425.00 $0.00
Regular
Regular price ticket for training course.
7 Tickets Ended $1,825.00 $0.00
Group of 2 - SAVE $200 each
Receive a discount if buying 2 regular tickets at the same time. Pricing is per ticket.
7 Tickets Ended $1,625.00 $0.00
Group of 3 - SAVE $325 each
Receive a discount if buying 3 regular tickets at the same time. Pricing is per ticket.
7 Tickets Ended $1,500.00 $0.00
Group of 4 - SAVE $500 each
Receive a discount if buying 4 or more regular tickets at the same time. Pricing is per ticket.
7 Tickets Ended $1,325.00 $0.00

Who's Going

Loading your connections...

Share Advanced Text and Web Mining Techniques with RapidMiner and RapidAnalytics

Event Details

Advanced Text and Web Mining Techniques with RapidMiner and RapidAnalytics

 

Thursday, Sept. 26th and Friday, Sept. 27st from 8:30 am to 5 pm

A light breakfast, lunch and afternoon snacks will be provided.


Be sure to check out our Introduction to Data Mining and Predictive Analytics with RapidMiner course being held in the same location Sept. 24 and 25.  Register for both at the same time to get a special discount.


Class size for this event is limited to 12 students.  If the class is sold out and you wish to be added to a waiting list, contact the event organizer.

 

This training course is an introduction into knowledge discovery from unstructured data like text documents. It focuses on the necessary preprocessing steps and the most successful methods for automatic text classification (including Naive Bayes and Support Vector Machines, SVM) and text clustering. Many practical exercises for different settings (for example e-mail spam detection, automatic e-mail routing, adaptive personal news filtering, sentiment analysis of text documents like news, web pages, blogs, e-mail, or PDF documents) will enable the participants to transfer the gained knowledge to own text mining problems.

After the training course the participants will have the ability to:

  • identify the processes for processing unstructured data
  • transform textual data into a structured format
  • apply different statistical text processing methods
  • perform text classification or text clustering
  • work on recent tasks like sentiment analysis or opinion mining


Details

  • Target audience: users, analysts, developers, administrators
  • Previous knowledge: foundations of data mining and RapidMiner
  • Methods: lectures, discussions, individual and group work, exercises on realistic data.

Participants may introduce their own work and project specific questions in order to find particular solutions together with the trainer and other participants. The training course addresses intermediate learners and we recommend visiting the course Introduction to Data Mining and Predictive Analytics with RapidMiner.

 

Topics

  • Loading of texts
    • Loading from flat files
    • Loading from data sets
    • Loading from data bases
    • Loading from process definitions
  • Concepts
    • Documents
    • Tokens
  • Visualization
    • Visualizing documents and tokens
    • High-dimensional visualizations for transformed documents
  • Handling unstructured data
    • Preprocessing of textual data
    • Tokenizing
    • Stemming
    • Filtering of tokens
    • Term frequencies
    • Document frequencies
    • TFIDF
  • Advanced modeling
    • Methods for high-dimensional data
    • Support Vector Machines
    • Text classification
    • Text clustering
  • Web Mining
    • Crawling the web
    • Extracting information from web sites
    • Transforming web sites to documents
    • Information extraction using XPath or regular expressions

 

Prerequisites

You must bring a laptop to class (Windows, Mac or Linux OS is fine).  Should have Java Runtime Environment (JRE) version 1.6 (officially Java 6.0) or later installed.  Students will be provided with links to install the Community Version of RapidMiner prior to the class.

 

Instructor

David Weisman

David Weisman is a data scientist consultant with over 35 years of experience in the software field.  In addition to consulting, he is a researcher at the University of Massachusetts Boston, working at the intersection of molecular biology and data mining.  David is searching for cancer biomarkers in enormous volumes of DNA sequence data, identifying biosensors of environmental pollutants in bacterial and plant transcriptomic data, and teaching bioinformatics courses.  Prior to obtaining his recent Ph.D. in molecular biology, David ran a long-term successful software consulting firm, specializing in distributed system development, compiler design, operating system development, quantitative finance, network security, and health care informatics.


Training Facility Logistics

The training will be held in downtown Boston at the New Horizons/MicroTek training facility.  The facility is near Boston's South Station and public transportation.  Check their website for directions.

___

Classes require a minimum of 3 students by Sept. 17 to be held.  If there are insufficient registrants, the class may be cancelled and all students will be refunded the full registration fee.  Students should organize their travel arrangements accordingly and with this proviso. Rapid-I will promptly notify registered users as soon as the 3 students quota will be met. 

___

Can't make it? Sign up for our newletter at http://rapid-i.com/content/blogsection/5/66/lang,en/ (upper left corner) to stay in the loop on future events and classes.

 

Our Refund Policy: Plans change? We get it. But if you can't make it to the class, please email us at training@rapid-i.com no later than Sept. 18.  No refunds will be given after this timeframe.

Have questions about Advanced Text and Web Mining Techniques with RapidMiner and RapidAnalytics? Contact RapidMiner

When & Where



New Horizons/MicroTek – Boston, MA
75 Federal Street
#105
Boston, MA 02110

Thursday, September 26, 2013 at 8:30 AM - Friday, September 27, 2013 at 5:00 PM (EDT)


  Add to my calendar

Organizer

RapidMiner

Pioneering advanced analytics vendor RapidMiner (formerly known as Rapid-I) is redefining how business analysts use Big Data to predict the future. With an open source heritage, RapidMiner is one of today’s most widely known and used predictive analytics platforms, providing powerful solutions for a wide variety of industries.

RapidMiner focuses on the fields of predictive analytics, data mining, and text mining. The discovery and leverage of unused business intelligence from existing data enables better informed decisions and allows for process optimization.

RapidMiner serves customers globally from offices in Germany and the United States. Furthermore, more than 30 partners can support your data analysis projects using RapidMiner software products.

 

  Contact the Organizer

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.