Skip to main content

course

Exploratory Data Analysis in Python

Intermediate

4.7+

Updated 12/2024

Learn how to explore, visualize, and extract insights from data using exploratory data analysis (EDA) in Python.

Start course for free

Included for FreePremium or Teams

PythonExploratory Data Analysis4 hours14 videos49 exercises4,150 XP58,153Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

So you’ve got some interesting data - where do you begin your analysis? This course will cover the process of exploring and analyzing data, from understanding what’s included in a dataset to incorporating exploration findings into a data science workflow.

Using data on unemployment figures and plane ticket prices, you’ll leverage Python to summarize and validate data, calculate, identify and replace missing values, and clean both numerical and categorical values. Throughout the course, you’ll create beautiful Seaborn visualizations to understand variables and their relationships.

For example, you’ll examine how alcohol use and student performance are related. Finally, the course will show how exploratory findings feed into data science workflows by creating new features, balancing categorical features, and generating hypotheses from findings.

By the end of this course, you’ll have the confidence to perform your own exploratory data analysis (EDA) in Python.You’ll be able to explain your findings visually to others and suggest the next steps for gathering insights from your data!

Prerequisites

Introduction to Statistics in Python Introduction to Data Visualization with Seaborn

1

Getting to Know a Dataset

Initial exploration

Functions for initial exploration

Counting categorical values

Global unemployment in 2021

Data validation

Detecting data types

Validating continents

Validating range

Data summarization

Summaries with .groupby() and .agg()

Named aggregations

Visualizing categorical summaries

2

Data Cleaning and Imputation

Addressing missing data

Dealing with missing data

Strategies for remaining missing data

Imputing missing plane prices

Converting and analyzing categorical data

Finding the number of unique values

Flight duration categories

Adding duration categories

Working with numeric data

Flight duration

Adding descriptive statistics

Handling outliers

What to do with outliers

Identifying outliers

Removing outliers

3

Relationships in Data

Patterns over time

Importing DateTime data

Updating data type to DateTime

Visualizing relationships over time

Correlation

Interpreting a heatmap

Visualizing variable relationships

Visualizing multiple variable relationships

Factor relationships and distributions

Categorical data in scatter plots

Exploring with KDE plots

4

Turning Exploratory Analysis into Action

Considerations for categorical data

Checking for class imbalance

Cross-tabulation

Generating new features

Extracting features for correlation

Calculating salary percentiles

Categorizing salaries

Generating hypotheses

Comparing salaries

Choosing a hypothesis

Congratulations

Exploratory Data Analysis in Python

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Don’t just take our word for it

*4.7

from 46 reviews

80%

17%

2%

0%

0%

Highest to Lowest
Lowest to Highest
Most recent
Top reviews

Dimitris L.

7 days

nice course

Jordan S.

11 days

Good overview of EDA and clear content

Paul C.

about 1 month

This course introduces some more advanced concepts and techniques that are taught thoroughly and clearly.

Nataliya K.

about 1 month

Great class!

PASCAL P.

about 2 months

Great learning experience.

"nice course"

Dimitris L.

"Good overview of EDA and clear content"

Jordan S.

"This course introduces some more advanced concepts and techniques that are taught thoroughly and clearly."

Paul C.

FAQs

Join over 15 million learners and start Exploratory Data Analysis in Python today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.