Direkt zum Inhalt
StartseiteR

Handling Missing Data with Imputations in R

Diagnose, visualize and treat missing data with a range of imputation techniques with tips to improve your results.

Kurs Kostenlos Starten
4 Stunden13 Videos49 Übungen5.163 LernendeTrophyLeistungsnachweis

Kostenloses Konto erstellen

GoogleLinkedInFacebook

oder

Durch Klick auf die Schaltfläche akzeptierst du unsere Nutzungsbedingungen, unsere Datenschutzrichtlinie und die Speicherung deiner Daten in den USA.
Group

Trainierst du 2 oder mehr?

Versuchen DataCamp for Business

Beliebt bei Lernenden in Tausenden Unternehmen


Kursbeschreibung

Missing data is everywhere. The process of filling in missing values is known as imputation, and knowing how to correctly fill in missing data is an essential skill if you want to produce accurate predictions and distinguish yourself from the crowd. In this course, you’ll learn how to use visualizations and statistical tests to recognize missing data patterns and how to impute data using a collection of statistical and machine learning models. You’ll also gain decision-making skills, helping you decide which imputation method fits best in a particular situation. Finally, you’ll learn to incorporate uncertainty from imputation into your inference and predictions, making them more robust and reliable.
Für Unternehmen

Trainierst du 2 oder mehr?

Verschaffen Sie Ihrem Team Zugriff auf die vollständige DataCamp-Plattform, einschließlich aller Funktionen.
DataCamp Für UnternehmenFür eine maßgeschneiderte Lösung buchen Sie eine Demo.
  1. 1

    The Problem of Missing Data

    Kostenlos

    In this chapter, you’ll find out why missing data can be a risk when analyzing a dataset. You’ll be introduced to the three missing data mechanisms and learn how to recognize them using statistical tests and visualization tools.

    Kapitel Jetzt Abspielen
    Missing data: what can go wrong
    50 xp
    Linear regression with incomplete data
    100 xp
    Analyzing regression output
    50 xp
    Comparing models
    100 xp
    Missing data mechanisms
    50 xp
    Recognizing missing data mechanisms
    100 xp
    t-test for MAR: data preparation
    100 xp
    t-test for MAR: interpretation
    100 xp
    Visualizing missing data patterns
    50 xp
    Aggregation plot
    100 xp
    Spine plot
    100 xp
    Mosaic plot
    100 xp
  2. 2

    Donor-Based Imputation

    Get to know the taxonomy of imputation methods and learn three donor-based techniques: mean, hot-deck, and k-Nearest-Neighbors imputation. You’ll look under the hood to see how these methods work, before learning how to apply them to a real-world tropical weather dataset. Along the way, you’ll also learn useful tricks that you can use to make them work even better for your problems.

    Kapitel Jetzt Abspielen
  3. 3

    Model-Based Imputation

    It’s time to learn how to use statistical and machine learning models, such as linear regression, logistic regression, and random forests, to impute missing data. In this chapter, you’ll look into how the models make their predictions and use this knowledge to draw the imputed values from conditional distributions. This is important as it ensures your imputations are more varied and plausible, making them more similar to the true data.

    Kapitel Jetzt Abspielen
  4. 4

    Uncertainty from Imputation

    Imputed values are not set in stone. They are just estimates and estimates come with some uncertainty. In this final chapter, you’ll discover how bootstrapping and chained equation using the mice package can be used to incorporate imputation uncertainty into your models and analyses to make them more reliable and robust.

    Kapitel Jetzt Abspielen
Für Unternehmen

Trainierst du 2 oder mehr?

Verschaffen Sie Ihrem Team Zugriff auf die vollständige DataCamp-Plattform, einschließlich aller Funktionen.

Datensätze

Biopics datasetTropical Atmosphere Ocean dataset

Mitwirkende

Collaborator's avatar
Amy Peterson
Collaborator's avatar
Adel Nehme
Michał Oleszak HeadshotMichał Oleszak

Machine Learning Engineer

Mehr Anzeigen

Was sagen andere Lernende?

Melden Sie sich an 15 Millionen Lernende und starten Sie Handling Missing Data with Imputations in R Heute!

Kostenloses Konto erstellen

GoogleLinkedInFacebook

oder

Durch Klick auf die Schaltfläche akzeptierst du unsere Nutzungsbedingungen, unsere Datenschutzrichtlinie und die Speicherung deiner Daten in den USA.