Following Data Day Texas, we've had many requests for more information on KNIME. We've asked the folks at KNIME.org to come back and offer a full day deep dive on data mining with KNIME. This is a full day extended version of the two hour workshop at Data Day Texas. For those unfamiliar with KNIME, Michael Berthold of the KNIME project will be speaking to the Austin Data Geeks meetup Tuesday evening at Rackspace. RSVP at Meetup.
KNIME, the Konstanz Information Miner (wikipedia), is an open source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface allows assembly of nodes for data preprocessing (ETL: Extraction, Transformation, Loading), for modeling and data analysis and visualization. Since 2006, KNIME has been used in pharmaceutical research, but increasingly it is used in other areas like customer data analysis, business intelligence and financial data analysis.
This course will cover how to use KNIME for data loading, integration, preprocessing, analysis and visualization.
Bring your own laptop in order to follow the hands-on parts of the course. All Training materials will be included on a USB stick.
Introduction to KNIME
Data Reading and Preprocessing w/ Hands-On
Data Manipulation and Aggregation w/ Hands-On
Visualization w/ Hands-On
Training Predictive Models w/ Hands-On
Model Selection and Validation w/ Hands-On
Control Constructs: Looping and Switching w/ Hands-On
Reporting in KNIME w/ Hands-On
Walk Through of Example Workflows: Next Best Offer, Text&Network Mining, a.o.
Michael Berthold, since August 2003, holds the Nycomed-Chair for Bioinformatics and Information Mining at Konstanz University, Germany where his research focuses on using machine learning methods for the interactive analysis of large information repositories in the Life Sciences. Most of the research results are made available to the public via the open source data mining platform KNIME (wikipedia entry). M. Berthold is Past President of the North American Fuzzy Information Processing Society, Associate Editor of several journals and the President of the IEEE System, Man, and Cybernetics Society. He has been involved in the organization of various conferences, most notably the IDA-series of symposia on Intelligent Data Analysis and the conference series on Computational Life Science. Together with David Hand he co-edited the textbook Intelligent Data Analysis: An Introduction which has recently appeared in a completely revised, second edition. He is also co-author of Guide to Intelligent Data Analysis (Springer Verlag) which appeared in summer 2010.