Skip to main content

course

ETL and ELT in Python

Intermediate

4.6+

Updated 12/2024

Learn to build effective, performant, and reliable data pipelines using Extract, Transform, and Load principles.

Start course for free

Included for FreePremium or Teams

PythonData Engineering4 hours14 videos53 exercises4,450 XP15,759Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Empowering Analytics with Data Pipelines

Data pipelines are at the foundation of every strong data platform. Building these pipelines is an essential skill for data engineers, who provide incredible value to a business ready to step into a data-driven future. This introductory course will help you hone the skills to build effective, performant, and reliable data pipelines.

Building and Maintaining ETL Solutions

Throughout this course, you’ll dive into the complete process of building a data pipeline. You’ll grow skills leveraging Python libraries such as pandas and json to extract data from structured and unstructured sources before it’s transformed and persisted for downstream use. Along the way, you’ll develop confidence tools and techniques such as architecture diagrams, unit-tests, and monitoring that will help to set your data pipelines out from the rest. As you progress, you’ll put your new-found skills to the test with hands-on exercises.

Supercharge Data Workflows

After completing this course, you’ll be ready to design, develop and use data pipelines to supercharge your data workflow in your job, new career, or personal project.

Prerequisites

Data Warehousing Concepts Streamlined Data Ingestion with pandas

1

Introduction to Data Pipelines

Introduction to ETL and ELT Pipelines

Running an ETL Pipeline

ELT in Action

ETL and ELT Pipelines

Building ETL and ELT Pipelines

Building an ETL Pipeline

The "T" in ELT

Extracting, Transforming, and Loading Student Scores Data

2

Building ETL Pipelines

Extracting data from structure sources

Extracting data from parquet files

Pulling data from SQL databases

Building functions to extract data

Transforming data with pandas

Filtering pandas DataFrames

Transforming sales data with pandas

Validating data transformations

Persisting data with pandas

Loading sales data to a CSV file

Customizing a CSV file

Persisting data to files

Monitoring a data pipeline

Logging within a data pipeline

Handling exceptions when loading data

Monitoring and alerting within a data pipeline

3

Advanced ETL Techniques

Extracting non-tabular data

Ingesting JSON data with pandas

Reading JSON data into memory

Transforming non-tabular data

Iterating over dictionaries

Parsing data from dictionaries

Transforming JSON data

Transforming and cleaning DataFrames

Advanced data transformation with pandas

Filling missing values with pandas

Grouping data with pandas

Applying advanced transformations to DataFrames

Loading data to a SQL database with pandas

Loading data to a Postgres database

Validating data loaded to a Postgres Database

4

Deploying and Maintaining a Data Pipeline

Manually testing a data pipeline

Testing data pipelines

Validating a data pipeline at "checkpoints"

Testing a data pipeline end-to-end

Unit-testing a data pipeline

Validating a data pipeline with assert

Writing unit tests with pytest

Creating fixtures with pytest

Unit testing a data pipeline with fixtures

Running a data pipeline in production

Orchestration and ETL tools

Data pipeline architecture patterns

Running a data pipeline end-to-end

Congratulations!

ETL and ELT in Python

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Don’t just take our word for it

*4.6

from 29 reviews

72%

17%

10%

0%

0%

Highest to Lowest
Lowest to Highest
Most recent
Top reviews

Theo N.

12 days

.

Sudipta H.

about 1 month

This was very detailed course, covered a lot of other concepts other than ETL and ELT in Python, like unit testing, error handling.

Domingos D.

about 1 month

Some more emphasis on ELT would be perfect. Overall, a great course. I highly recommend.

Andrea B.

3 months

The ETL and ELT in Python course is clear, easy to follow, and practical. It offers hands-on experience with real-world data workflows, making complex concepts approachable. A great choice for anyone looking to understand ETL/ELT using Python effectively!

Luis V.

4 months

The course content is very appropriate. Thank you.

"."

Theo N.

"This was very detailed course, covered a lot of other concepts other than ETL and ELT in Python, like unit testing, error handling."

Sudipta H.

"Some more emphasis on ELT would be perfect. Overall, a great course. I highly recommend."

Domingos D.

FAQs

Join over 15 million learners and start ETL and ELT in Python today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.