Skip to main content
HomeR

course

Cleaning Data in R

Intermediate
4.4+
28 reviews
Updated 12/2024
Learn to clean data as quickly and accurately as possible to help you move from raw data to awesome insights.
Start course for free

Included for FreePremium or Teams

RData Preparation4 hours13 videos44 exercises3,700 XP52,609Statement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Overcome Common Data Problems Like Removing Duplicates in R

It's commonly said that data scientists spend 80% of their time cleaning and manipulating data and only 20% of their time analyzing it. The time spent cleaning is vital since analyzing dirty data can lead you to draw inaccurate conclusions.

In this course, you’ll learn a variety of techniques to help you clean dirty data using R. You’ll start by converting data types, applying range constraints, and dealing with full and partial duplicates to avoid double-counting.

Delve into Advanced Data Challenges

Once you’ve practiced working on common data issues, you’ll move on to more advanced challenges such as ensuring consistency in measurements and dealing with missing data. After every new concept, you’ll have the chance to complete a hands-on exercise to cement your knowledge and build your experience.

Learn to Use Record Linkage During Data Cleaning

Record Linkage is used to merge datasets together when the values have issues such as typos or different spellings. You’ll explore this useful technique in the final chapter and practice the application by using it to join two restaurant review datasets together into a single dataset.

Prerequisites

Joining Data with dplyr
1

Common Data Problems

Start Chapter
2

Categorical and Text Data

Start Chapter
3

Advanced Data Problems

Start Chapter
4

Record Linkage

Start Chapter
Cleaning Data in R
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Enroll now

Don’t just take our word for it

*4.4
from 28 reviews
68%
21%
4%
4%
4%
  • Napon M.
    4 months

    Great, i learned a lot to apply with my job

  • Tzu-Hao K.
    5 months

    Practical and well-structured

  • Ondřej M.
    12 months

    Needed info, which helped me a lot how to process these crucial first steps in Data analysis.

  • John G.
    over 1 year

    This course was great. It was informative with an excellent instructor who clearly explained the information.

  • Tara P.
    over 1 year

    There were some really nice ideas on here and it was very helpful. I think using the assertive package is not necessary though and would like to see this updated to more base functions and ideas.

"Great, i learned a lot to apply with my job"

Napon M.

"Practical and well-structured"

Tzu-Hao K.

"Needed info, which helped me a lot how to process these crucial first steps in Data analysis."

Ondřej M.

FAQs

Join over 15 million learners and start Cleaning Data in R today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.