Got files with thousands of rows that are a pain to handle in Excel? It's time to automate your data analysis with Python and its Pandas library! This workshop will guide you through loading, cleaning, and analyzing large datasets in table format.
Prerequisites
- Have a good knowledge of Python programming or have completed the workshops “Introduction to programming with Python (PYT101)” and “Enhancing your Python programming skills (PYT102)”
Agenda
- Pandas and DataFrames (data tables) in Python
- Descriptive statistics and data grouping
- Selecting data rows and columns
- Cleaning undefined data
- Combining datasets
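As a rough preview of these agenda topics, here is a minimal sketch using Pandas. The tables, column names, and values are hypothetical and invented for illustration only; they are not taken from the workshop material, and the workshop itself may approach these operations differently.

```python
import pandas as pd

# Hypothetical sample data, made up for this illustration.
sales = pd.DataFrame({
    "store": ["A", "A", "B", "B"],
    "amount": [10.0, None, 30.0, 50.0],
})
regions = pd.DataFrame({"store": ["A", "B"], "region": ["East", "West"]})

# Descriptive statistics on one column (missing values are ignored).
summary = sales["amount"].describe()

# Cleaning undefined data: drop rows where "amount" is missing.
clean = sales.dropna(subset=["amount"])

# Selecting rows (by condition) and columns (by name).
large = clean.loc[clean["amount"] > 20, ["store", "amount"]]

# Grouping: total amount per store.
totals = clean.groupby("store")["amount"].sum()

# Combining datasets: attach each store's region with a left join.
merged = clean.merge(regions, on="store", how="left")
```

Dropping incomplete rows is only one way to handle undefined values; filling them with a default via `fillna` is a common alternative, depending on the analysis.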
Registration
- Academic: $10 (anyone who studies, teaches, or works at a university, CEGEP, CCTT, or university-affiliated research institute)
- NPO: $10 (anyone who works for a non-profit organization)
- Other: $250 (any other profile)
Instructor
Darcy Quesnel, analyst in advanced research computing at Calcul Québec.
Language
English
Technical prerequisites
We will use the Zoom platform. Because this is a hands-on workshop, a second screen is very useful: you can keep the instructor's window on one screen and your own work on the other.
We will use the JupyterLab interface. Make sure you have a modern web browser such as Google Chrome, Firefox, Edge, or Safari.
Notes
- A certificate of participation will be sent to each participant who attends at least 60% of the workshop.
- The workshop is not recorded.
- The workshop could be canceled if the number of registrations is too low.
Contact
If you have any questions, please write to us at training@calculquebec.ca.
To stay up to date on our upcoming events, subscribe to our Eventbrite page!