Data Exploration Bootcamp with Python

Dunder Data

This bootcamp is designed to introduce you to data exploration using the Python programming language. Through this intense, weeklong program you will master the skills necessary to manipulate, visualize and explore datasets to extract valuable insights. 
<H2>Is this course for you?</H2>
If you are excited about machine learning and data science but have yet to dive in fully, this course will build your skills quickly so that you can transition into the field smoothly. Data exploration is the foundation of all good data analysis and is where data scientists spend the vast majority of their time. No prior programming experience is needed, as a thorough introduction to Python will be given in a pre-course assignment.
<H2>When</H2>
May 15th - 19th: 9 a.m. - 5 p.m.
<H2>Discounts</H2>
If you are a student or unemployed, use discount code student25 to get a 25% discount. If you are a member of <A HREF="https://www.meetup.com/Houston-Data-Science/" TARGET="_blank" DATA-MSYS-CLICKTRACK="0" REL="nofollow noopener noreferrer">Houston Data Science</A>, use code hds10 to get 10% off. <A HREF="http://www.eventbrite.com/affiliate-register?eid=32411689235&affid=157238147" TARGET="_blank" REL="noopener noreferrer">Become an affiliate</A> and earn $100 for every attendee you refer.
<H2>Structure of Course</H2>
Learning is accomplished by working through difficult assignments and then receiving and reviewing model solutions. In this 'flipped classroom' format, students are expected to read and prepare each day's material before coming to class. In class, students will alternate between instructor-guided lessons and student-focused exercises and projects. The instructor will personally review all code and give feedback on all course assignments. Approximately 300 short-answer questions with detailed solutions will be available. No more than 10 students will be enrolled in the class, ensuring personalized learning and participation.
<H2>Syllabus</H2>
Before the Course: 
Students will need to set aside 10 to 20 hours to set up the programming environment and to complete a thorough overview of the fundamentals of Python. An additional class will be held the week before the bootcamp to make sure all students are on track to complete this assignment.
Day 1: Introduction to Pandas
The Pandas library, perhaps the most popular and widely used open-source data wrangling tool today, will be introduced along with its main data structures, the Series and the DataFrame.
Day 2: Split-Apply-Combine
The split-apply-combine paradigm is crucial for finding insights about particular groupings within your data. Many valuable insights will be discovered from City of Houston public data.
Day 3: Cleaning and Preparing Data for Machine Learning 
Real-world data is messy and rarely ready for consumption by machine learning models. Several methods for cleaning, tidying and preparing data as input to machine learning will be used before some basic machine learning models are deployed.
Day 4: Time Series
Pandas' superior time series functionality, which stems from its original purpose, will be explored by grabbing stock price data and building a prediction model for the major stock indices.
Day 5: Visualization and Assessment
Good data explorations include visualizations that accurately and clearly describe the insights discovered. The fundamental plotting library Matplotlib and its enhancer Seaborn will be introduced. A web application will be deployed using Flask with modern, interactive visualizations from the popular Bokeh library.
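As a rough illustration of the split-apply-combine idea from Day 2, here is a minimal sketch in plain Python. It is not taken from the course materials: the records, the neighborhood names and the helper `split_apply_combine` are all hypothetical and chosen only to show the three steps. In Pandas, the same pattern roughly corresponds to a call such as `df.groupby("neighborhood")["minutes"].mean()`.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: (neighborhood, response time in minutes).
# These values are made up for illustration, not real Houston data.
records = [
    ("Heights", 12.0),
    ("Midtown", 8.0),
    ("Heights", 18.0),
    ("Midtown", 10.0),
    ("Montrose", 9.0),
]


def split_apply_combine(rows, func):
    """Group (key, value) rows by key, apply func to each group, collect results."""
    # Split: gather the values that belong to each key.
    groups = defaultdict(list)
    for key, value in rows:
        groups[key].append(value)
    # Apply and combine: summarize each group and return one result per key.
    return {key: func(values) for key, values in groups.items()}


average_by_group = split_apply_combine(records, mean)
print(average_by_group)
```

Passing a different function, such as `max` or `len`, changes the summary computed for each group without changing the grouping logic.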
<H2>Post-course Assessment</H2>
A mock data science interview assignment will test student progress on all course material. Additionally, a series of short-answer Pandas questions will be assigned intermittently after course completion to ensure retention of knowledge.
<H2>Instructor</H2>
<A HREF="https://www.linkedin.com/in/tedpetrou" TARGET="_self" DATA-MSYS-CLICKTRACK="0" REL="nofollow noopener noreferrer">Ted Petrou</A> is the author of the upcoming <A HREF="https://www.amazon.com/Pandas-Cookbook-Ted-Petrou-ebook/dp/B06W2LXLQK" TARGET="_blank" DATA-MSYS-CLICKTRACK="0" REL="nofollow noopener noreferrer">Pandas Cookbook</A>. He is a data scientist at Schlumberger, where he spends the vast majority of his time exploring data. His projects include using targeted sentiment analysis to discover the root cause of part failure from engineer text, developing customized client/server dashboarding applications, and building real-time web services to avoid mispricing of sales items. Ted received his master's degree in statistics from Rice University and used his analytical skills to play poker professionally and teach math before becoming a data scientist. He is also head of <A HREF="http://www.meetup.com/Houston-Data-Science/" TARGET="_blank" DATA-MSYS-CLICKTRACK="0" REL="nofollow noopener noreferrer">Houston Data Science</A>.

Cowork Lab
