€900

Data science with SQL Server and R (2-day workshop)

Event Information

Location

Kohera

33a Veldkant

2550 Kontich

Belgium

Refund Policy

Refunds up to 7 days before event

Event description

Abstract

The workshop introduces the R language and R programming for data scientists, for the purposes of data wrangling, data preparation, statistical analysis and predictive analytics. Covering the basics of statistics and multivariate statistics, along with the most common statistical approaches, provides a solid foundation for advanced algorithms and predictive analytics. The workshop also covers guidelines for working with larger datasets and for working with a relational database (SQL Server) for distributed and parallel high-performance computation.

Objectives

Get to know the R language and learn how to use R for data analysis and predictive analytics in typical day-to-day data science challenges. Explore a variety of algorithms and methods for statistical analysis and predictive analytics, and walk away knowing how, where and when to use them. Explore how to tackle datasets from different sources: SQL Server, text files, the web and social media. You will also see how to tackle performance issues when dealing with large datasets and how to achieve good performance and scalability on your on-premises server.


The workshop will cover

Introducing the basics of the R language and R programming. We will cover the basics of the R language, its environment and data types, then move on to programming in R, data wrangling, munging, scraping and data understanding.


Understanding basic and multivariate statistics for data scientists. This module focuses on basic data understanding and on bivariate and multivariate statistics, covering correlations, regression, analysis of variance, factor analysis, and canonical and discriminant analysis.


Predictive algorithms explained, and working with datasets for predictions in the R language. This module covers commonly used predictive algorithms that every data scientist can use in their daily work: decision trees, gradient boosting with regression, naïve Bayes, clustering and time series.


Exploring the RevoScaleR R package for parallel and distributed computation on large datasets within SQL Server 2017. RevoScaleR addresses several limitations of working with larger datasets in R, especially in corporate environments, which makes predictive analysis against large datasets and databases much more efficient. We will share insights and knowledge on how to use this package with large datasets.


Putting your predictive models into production. Using storytelling, we will sum up all the knowledge gathered and explain how to put your models into production, based on practical examples that can be applied in your company (recommendation system, clustering system, classification, text analysis, fraud detection).


Target audience

This workshop is for anyone who would like to refresh or deepen their knowledge of data science, the R language and working with data for reports or data visualizations. We welcome everyone working with data (data wranglers, data analysts), with reports and marketing campaigns (business and marketing people) and on data analysis (data analysts, data scientists). If your daily work revolves around data and you are curious to learn more about the R language, statistics or predictive analytics, join us.


Required/Suggested Materials and Software

You are very welcome to bring your own laptop and work along. Feel free to download the Developer edition of SQL Server 2017 with R (in-database) installed (link: https://go.microsoft.com/fwlink/?linkid=853016 ) and, in addition, have RStudio (link: www.rstudio.com/download) or R Tools for Visual Studio preinstalled.
