Thursday, November 27

Algorythm+/ Data: The Lifeblood of AI Models

By Nomad Engineer: Kat Usop

This lecture will explore the critical role of data, from its collection and cleaning to its transformative effect on model training

Date and time

Location

Online

Good to know

Highlights

2 hours
Online

Refund Policy

Refunds up to 7 days before event

About this event

Science & Tech • Science

This lecture will explore the critical role of data in powering and defining modern Artificial Intelligence (AI) models. Far from being a mere input, data is the lifeblood that shapes an AI model's abilities, biases, and eventual impact.

We'll explore the entire data lifecycle, from collection and cleaning to its transformative effect on model training and performance. Exciting times ahead!

Key Topics Covered

The Data Foundation: Understanding why AI models, particularly those based on machine learning (e.g., neural networks), fundamentally rely on massive datasets to learn patterns, make predictions, and generate outputs. We'll differentiate between various data types (structured, unstructured, semi-structured) and their relevance to different AI applications.

The Data Lifecycle: A step-by-step examination of how data is managed:

Acquisition & Collection: Strategies for gathering relevant and high-quality data.

Cleaning & Preprocessing: The essential—and often time-consuming—processes of handling missing values, standardizing formats, and correcting errors to ensure data quality.

Labeling & Annotation: The necessity of accurate labeling for supervised learning models and the ethical implications of the workforce involved in this process.

Quality vs. Quantity: Analyzing the trade-offs between the sheer volume of data and its integrity, relevance, and diversity. We'll discuss the concept of "Garbage In, Garbage Out" (GIGO) and its profound impact on model reliability.

Bias and Ethics in Data: The most crucial section will focus on how inherent biases present in training data (historical, societal, or selection bias) are absorbed and amplified by AI models, leading to unfair, discriminatory, or inaccurate outcomes. We'll discuss methods like fairness-aware data augmentation and data auditing to mitigate these risks.

Data's Future Role: A brief look at emerging concepts like Synthetic Data and Data-Centric AI, where the focus shifts from endlessly optimizing algorithms to perfecting the data itself.

Who Should Attend

Students, developers, data scientists, researchers, and anyone interested in understanding the practical and ethical foundations required to build responsible and effective AI systems. No prior deep AI knowledge is required, though a basic understanding of computing or statistics will be helpful.

Learning Outcomes

Upon completing this lecture, participants will be able to:

Articulate the fundamental relationship between data and AI model performance.

Identify the key stages and challenges within the data lifecycle for AI development.

Critically evaluate a dataset for potential quality issues and inherent biases.

Understand the ethical responsibilities associated with using data to train AI.

Organized by

Nomad Engineer: Kat Usop

Followers

Events

Hosting

5% off appliedEarly bird discount

From $31.66

Nov 27 · 3:00 AM PST