Cleaning Data in SQL Server Databases

Develop the skills you need to clean raw data and transform it into accurate insights.

4 hours | 13 videos | 48 exercises | 9,669 learners | Statement of Accomplishment

Course Description

Did you know that data scientists and data analysts spend a large amount of time cleaning data before they can analyze it? This is because real-world data is messy. To help you navigate messy data, this course teaches you how to clean data stored in a SQL Server database. You’ll learn how to solve common problems such as cleaning messy strings, dealing with empty values, comparing the similarity between strings, and much more. You’ll get hands-on with all these tasks using a wide range of interesting and messy datasets, including monthly airline flights by airport, TV series, and paper shop sales. Are you ready to get your hands messy?
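
As a rough illustration of the string and empty-value problems mentioned above, here is a minimal sketch. It is not taken from the course: it uses Python's built-in sqlite3 module with an in-memory SQLite database as a stand-in for SQL Server, and the `flights` table, its columns, and its rows are hypothetical. The functions in the query (LTRIM, RTRIM, NULLIF, COALESCE) are also available under the same names in SQL Server.

```python
import sqlite3

# Assumption: SQLite stands in for SQL Server so the sketch runs anywhere.
conn = sqlite3.connect(":memory:")

# Hypothetical table and rows, invented for illustration only.
conn.execute("CREATE TABLE flights (registration TEXT, airport TEXT)")
conn.executemany(
    "INSERT INTO flights VALUES (?, ?)",
    [
        ("  JF-2018  ", "JFK"),  # leading and trailing spaces
        ("LA-2018", None),       # missing airport (NULL)
        (" MI-2019", ""),        # empty-string airport
    ],
)

rows = conn.execute(
    """
    SELECT
        LTRIM(RTRIM(registration))               AS clean_registration,
        -- Treat empty strings like NULL, then substitute a placeholder.
        COALESCE(NULLIF(airport, ''), 'Unknown') AS clean_airport
    FROM flights
    ORDER BY clean_registration
    """
).fetchall()

for row in rows:
    print(row)
```

Wrapping NULLIF inside COALESCE is one possible design choice: it folds empty strings and NULLs into the same placeholder, which may or may not be what a given dataset needs.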

  1. Starting with Cleaning Data

    Free

    To begin the course, you will learn why cleaning data is important. You will then solve simple problems such as removing leading and trailing spaces from strings, unifying formats for flight registrations, combining strings, and more.

    Introduction to Cleaning Data
    50 xp
    Unifying flight formats I
    100 xp
    Unifying flight formats II
    100 xp
    Cleaning messy strings
    50 xp
    Trimming strings I
    100 xp
    Trimming strings II
    100 xp
    Unifying strings
    100 xp
    Comparing the similarity between strings
    50 xp
    SOUNDEX() and DIFFERENCE()
    50 xp
    Comparing names with SOUNDEX()
    100 xp
    Comparing names with DIFFERENCE()
    100 xp
  2. Dealing with missing data, duplicate data, and different date formats

    In this chapter, you will deepen your data cleaning knowledge. You will learn how to deal with missing data, avoid duplicate data in your datasets, and work with different date formats.

  3. Dealing with out of range values, different data types, and pattern matching

    In this chapter, you will deal with out-of-range values and inaccurate data. You will also practice converting data between different types. Finally, you will match patterns against your data to find outliers.
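
The checks named in chapters 2 and 3 can be sketched roughly as follows. This is an illustrative assumption, not the course's own material: Python's built-in sqlite3 module stands in for SQL Server, and the `series` table, its `code` column, and all rows are hypothetical. The queries use only GROUP BY / HAVING, BETWEEN, and LIKE with `_` wildcards, which behave the same way in SQL Server.

```python
import sqlite3

# Assumption: SQLite stands in for SQL Server so the sketch runs anywhere.
conn = sqlite3.connect(":memory:")

# Hypothetical table and rows, invented for illustration only.
conn.execute("CREATE TABLE series (name TEXT, rating REAL, code TEXT)")
conn.executemany(
    "INSERT INTO series VALUES (?, ?, ?)",
    [
        ("Alpha", 8.5, "AB-1234"),
        ("Alpha", 8.5, "AB-1234"),  # exact duplicate of the row above
        ("Beta", 12.0, "CD-5678"),  # rating outside the assumed 0..10 range
        ("Gamma", 7.0, "EF5678"),   # code is missing its hyphen
    ],
)

# Duplicate rows: group on every column and keep groups seen more than once.
duplicates = conn.execute(
    "SELECT name, COUNT(*) FROM series "
    "GROUP BY name, rating, code HAVING COUNT(*) > 1"
).fetchall()

# Out-of-range values: ratings that fall outside the assumed valid interval.
out_of_range = conn.execute(
    "SELECT name FROM series WHERE rating NOT BETWEEN 0 AND 10"
).fetchall()

# Pattern matching: each '_' matches exactly one character, so the expected
# shape is two characters, a hyphen, then four characters.
bad_codes = conn.execute(
    "SELECT name FROM series WHERE code NOT LIKE '__-____'"
).fetchall()

print(duplicates, out_of_range, bad_codes)
```

The valid rating range and the code format are assumptions chosen for the example; a real dataset would define its own rules.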


Datasets

Flight statistics dataset and series ratings dataset

Collaborators

Amy Peterson
Miriam Antona

Software Engineer

Miriam is a freelance Software Engineer with 15+ years of experience. She focuses on analyzing, designing, and developing software applications. She also collaborates with the UOC University, supervising bachelor's theses. Miriam loves programming and experimenting with different technologies. She is passionate about databases and enjoys playing with data. She holds a Master of Science degree in Computer Engineering.
