Dealing With Missing Data in R

Visualize, explore, and impute missing data with naniar, a tidyverse-friendly approach to missing data.

4 hours · 14 videos · 52 exercises · 15,198 learners · Statement of Accomplishment

Course Description

Missing data is part of any real-world data analysis. It can crop up in unexpected places, making analyses challenging to understand. In this course, you will learn how to use tidyverse tools and the naniar R package to visualize missing values. You'll tidy missing values so they can be used in analysis, explore missing values to find bias in the data, and reveal other underlying patterns of missingness. You will also learn how to "fill in the blanks" of missing values with imputation models, and how to visualize, assess, and make decisions based on these imputed datasets.

In the following tracks

Intermediate Tidyverse Toolbox

1
Why care about missing data?
Free
Chapter 1 introduces you to missing data, explaining what missing values are, how they behave in R, how to detect them, and how to count them. We then introduce missing data summaries: how to summarise missingness across cases and variables, and how to explore these summaries across groups within the data. Finally, we discuss missing data visualizations: how to produce overview visualizations for the entire dataset and over variables, cases, and other summaries, and how to explore these across groups.
Introduction to missing data
50 xp
Using and finding missing values
100 xp
How many missing values are there?
100 xp
Working with missing values
50 xp
Why care about missing values?
50 xp
Summarizing missingness
100 xp
Tabulating Missingness
100 xp
Other summaries of missingness
100 xp
How do we visualize missing values?
50 xp
Your first missing data visualizations
100 xp
Visualizing missing cases and variables
100 xp
Visualizing missingness patterns
100 xp
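
As a rough illustration of the ideas in this chapter, the sketch below counts, summarises, and plots missing values. The data frame `df` and its columns are made up for this example and are not course data; the naniar functions are used as we understand them from the package documentation.

```r
library(naniar)

# Hypothetical data with three missing values
df <- data.frame(
  height = c(1.6, NA, 1.8, 1.7),
  weight = c(60, 72, NA, NA)
)

# NA propagates through arithmetic and most summaries
NA + 1
mean(df$height)                # NA, because one value is missing
mean(df$height, na.rm = TRUE)  # drop missing values first

# Detect and count missing values
anyNA(df)        # base R: is anything missing?
n_miss(df)       # number of missing values
n_complete(df)   # number of non-missing values
prop_miss(df)    # proportion of values that are missing

# Summaries of missingness per variable and per case (row)
miss_var_summary(df)
miss_case_summary(df)

# Overview plots: a heatmap of missingness, and missings per variable
vis_miss(df)
gg_miss_var(df)
```

The summary functions return data frames, so they can be combined with `dplyr::group_by()` when missingness needs to be explored across groups.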
2
Wrangling and tidying up missing values
In chapter two, you will learn how to uncover hidden missing values like "missing" or "N/A" and replace them with `NA`. You will learn how to efficiently handle implicit missing values - those values implied to be missing, but not explicitly listed. We also cover how to explore missing data dependence, discussing Missing Completely at Random (MCAR), Missing At Random (MAR), Missing Not At Random (MNAR), and what they mean for your data analysis.
Searching for and replacing missing values
50 xp
Using miss_scan_count
100 xp
Using replace_with_na
100 xp
Using replace_with_na scoped variants
100 xp
Filling down missing values
50 xp
Fix implicit missings using complete()
100 xp
Fix explicit missings using fill()
100 xp
Using complete() and fill() together
100 xp
Missing Data dependence
50 xp
Differences between MCAR and MAR
50 xp
Exploring missingness dependence
100 xp
Further exploring missingness dependence
50 xp
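
The wrangling steps named in this chapter can be sketched as follows. All three data frames (`survey`, `visits`, `records`) are hypothetical and exist only for illustration; `complete()` and `fill()` come from tidyr, and the naniar calls reflect our reading of the package documentation.

```r
library(naniar)
library(tidyr)

# Hypothetical survey where missing values are coded as strings
survey <- data.frame(
  id    = 1:4,
  score = c("3", "N/A", "missing", "5"),
  stringsAsFactors = FALSE
)

# Count how often the suspicious strings occur in each variable
miss_scan_count(data = survey, search = list("N/A", "missing"))

# Replace those strings with NA in the score column
survey_clean <- replace_with_na(
  survey,
  replace = list(score = c("N/A", "missing"))
)

# Implicit missings: the bob/evening combination is simply absent
visits <- data.frame(
  name  = c("ann", "ann", "bob"),
  time  = c("morning", "evening", "morning"),
  value = c(10, 12, 9),
  stringsAsFactors = FALSE
)

# complete() adds the absent combination as an explicit NA row
visits_full <- complete(visits, name, time)

# Explicit missings: the name was recorded only on the first row of each block
records <- data.frame(
  name  = c("ann", NA, "bob", NA),
  value = c(1, 2, 3, 4),
  stringsAsFactors = FALSE
)

# fill() carries the last observed value downwards
records_filled <- fill(records, name)
```

Filling down is only appropriate when the blank cells really do mean "same as above", which is an assumption about how the data were recorded rather than something the code can check.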
3
Testing missing relationships
In this chapter, you will learn about workflows for working with missing data. We introduce two special data structures, the shadow matrix and nabular data, and demonstrate how to use them in workflows for exploring missing data so that you can link summaries of missingness back to values in the data. You will learn how to use ggplot to explore and visualize how values change as other variables go missing. Finally, you will learn how to visualize missingness across two variables, and how and why to visualize missings in a scatterplot.
Tools to explore missing data dependence
50 xp
Creating shadow matrix data
100 xp
Performing grouped summaries of missingness
100 xp
Further exploring more combinations of missingness
100 xp
Visualizing missingness across one variable
50 xp
Nabular data and filling by missingness
100 xp
Nabular data and summarising by missingness
100 xp
Explore variation by missingness: box plots
100 xp
Visualizing missingness across two variables
50 xp
Exploring missing data with scatter plots
100 xp
Using facets to explore missingness
100 xp
Faceting to explore missingness (multiple plots)
100 xp
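
A minimal sketch of the shadow matrix and nabular workflow described in this chapter, using R's built-in `airquality` data instead of the course datasets (an assumption made so the example is self-contained). The object names `aq_shadow`, `aq_nab`, and `temp_by_ozone_na` are ours.

```r
library(naniar)
library(dplyr)
library(ggplot2)

# Shadow matrix: one column per variable, suffixed _NA,
# recording whether each value is missing ("NA") or not ("!NA")
aq_shadow <- as_shadow(airquality)

# Nabular data: the original data with the shadow matrix bound alongside
aq_nab <- bind_shadow(airquality)

# Summarise one variable by the missingness of another
temp_by_ozone_na <- aq_nab %>%
  group_by(Ozone_NA) %>%
  summarise(mean_temp = mean(Temp), n = n())
temp_by_ozone_na

# How does Temp vary when Ozone is missing? Box plots by missingness
ggplot(aq_nab, aes(x = Ozone_NA, y = Temp)) +
  geom_boxplot()

# Scatter plot that keeps missing values visible by placing them
# below the observed range of each axis
ggplot(airquality, aes(x = Ozone, y = Solar.R)) +
  geom_miss_point() +
  facet_wrap(~Month)
```

Because the shadow columns sit next to the original values, any grouping, colouring, or faceting by missingness needs no extra joins.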
4
Connecting the dots (Imputation)
In this chapter, you will learn about filling in the missing values in your data, which is called imputation. You will learn how to impute and track missing values, and what the good and bad features of imputations are so that you can explore, visualise, and evaluate the imputed data against the original values. You will learn how to use, evaluate, and compare different imputation models, and explore how different imputation models affect the inferences you can draw from the models.
Filling in the blanks
50 xp
Impute data below range with nabular data
100 xp
Visualize imputed values in a scatter plot
100 xp
Create histogram of imputed data
100 xp
What makes a good imputation
50 xp
Evaluating bad imputations
100 xp
Evaluating imputations: The scale
100 xp
Evaluating imputations: Across many variables
100 xp
Performing imputations
50 xp
Using simputation to impute data
100 xp
Evaluating and comparing imputations
100 xp
Evaluating imputations (many models & variables)
100 xp
Evaluating imputations and models
50 xp
Combining and comparing many imputation models
100 xp
Evaluating the different parameters in the model
100 xp
Final Lesson
50 xp
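
The imputation workflow from this chapter might look roughly like the sketch below. It uses the built-in `airquality` data rather than the course datasets, and the model formula `Ozone ~ Temp + Wind` is an arbitrary choice for illustration, not a recommended model. Function usage follows our understanding of the naniar and simputation documentation.

```r
library(naniar)
library(simputation)
library(dplyr)
library(ggplot2)

# Track missingness first, then impute, so imputed values stay identifiable
aq_nab <- bind_shadow(airquality)

# Impute values below the observed range: useful for exploring
# missing values in plots, not as a modelling imputation
aq_below <- aq_nab %>%
  impute_below_all() %>%
  add_label_shadow()

ggplot(aq_below, aes(x = Ozone, y = Solar.R, colour = any_missing)) +
  geom_point()

# Impute Ozone with a linear model from simputation;
# Temp and Wind have no missing values in airquality
aq_lm <- aq_nab %>%
  as.data.frame() %>%
  impute_lm(Ozone ~ Temp + Wind) %>%
  add_label_shadow()

# Compare imputed values against observed ones using the shadow column
ggplot(aq_lm, aes(x = Temp, y = Ozone, colour = Ozone_NA)) +
  geom_point()

aq_lm %>%
  group_by(Ozone_NA) %>%
  summarise(mean_ozone = mean(Ozone), sd_ozone = sd(Ozone))
```

Comparing the spread of imputed and observed values, as in the final summary, is one simple way to judge whether an imputation is distorting the data.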

Contributors

David Campos

Shon Inouye

Chester Ismay

Prerequisites

Introduction to R
Introduction to the Tidyverse
