Introduction to Big Data Analysis with the R Language
Thursday, August 21, 2014 from 8:30 AM to 12:00 PM (PDT)
San Jose, CA
San Francisco, California
London, United Kingdom
Making the most of your investment in Big Data requires a good understanding of the new analytical tools and languages that are available. There’s been an explosion of these tools, built to assist analysts and developers with issues such as concurrency, functional programming, and domain-specific problems. The R Project is perhaps the best known and provides a wide variety of statistical, data mining and visualization capabilities. Among the reasons it is popular at this early stage in the evolution of Big Data, is because it can be adopted by analysts at various levels of technical expertise, including those with very little programming experience.
R is a cross-platform environment with comprehensive documentation. As such, it can be a little overwhelming on where and how to start using it. This talk will introduce you to the environment and help you along the path to solve Big Data analytical challenges. A light background in statistics would be advantageous for any attendees of this session, although rank novices will at least gain some insight into the issues, challenges and requirements for the analysis of Big Data.
The workshop will cover:
R Overview & Environment
- Graphics and Visualization
- Data Mining & Machine Learning
- Arguments & Examples
- Values & Variables
- Basic Types: Numeric, Integer, Complex, Logical, Character, Factor
- Complex Types: Sequence, Vector, List, Matrix, Data Frame, Function
RStudio, Graphics, Statistics & Other Languages
- RStudio Specifics
- Types of Graphics
- Mean/ Median, Frequency Distribution, Quantiles/Percentiles, Range/IQR, Deviations, Covariance/Correlation Coefficient
- Python and Others
Big Data & Linked Data
- Graphs & Queries
- Constraining the Predicate, Multiple Constraints, SPARQL Protocol
- Concerns & Solutions
This workshop has received outstanding reviews when presented previously. It will explain in detail how to get started with R and key features you can exploit to enhance your data analysis skills.
PS: This workshop takes place at the same time and place as the Annual NoSQL Now! Conference (www.NoSQLNow.com) and Semantic Technology and Business Conference (www.SemTechBiz.com). Registration for the R workshop will also entitle you to attend the NoSQL Now! and SemtechBiz exhibits at no additional charge.
Speaker: Brian Sletten, President, Bosatsu Consulting, Inc.
Brian Sletten is a liberal arts-educated software engineer with a focus on forward-leaning technologies. His experience has spanned many industries including retail, banking, online games, defense, finance, hospitality and health care. He has a B.S. in Computer Science from the College of William and Mary and lives in Auburn, CA. He focuses on using, training and mentoring on web architecture, resource-oriented computing, social networking, the Semantic Web, data science, scalable systems, security consulting and other technologies of the late 20th and early 21st Centuries.
When & Where
DATAVERSITY™ provides resources for Information Technology (IT) Professionals, Developers and Executives to learn about the uses and management of data. Our worldwide community of practitioners, advisers, and customers participates in and benefits from DATAVERSITY’s educational conferences, discussions, articles, blogs, webinars, news feeds, and more.