Data Manipulation with dplyr
4.4+
45 reviewsBeginner
Delve further into the Tidyverse by learning to transform and manipulate data with dplyr.
Start Course for Free4 Hours13 Videos46 Exercises109,214 Learners
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Course Description
Say you've found a great dataset and would like to learn more about it. How can you start to answer the questions you have about the data? You can use dplyr to answer those questions—it can also help with basic transformations of your data. You'll also learn to aggregate your data and add, remove, or change the variables. Along the way, you'll explore a dataset containing information about counties in the United States. You'll finish the course by applying these tools to the babynames dataset to explore trends of baby names in the United States.
- 1
Transforming Data with dplyr
FreeLearn verbs you can use to transform your data, including select, filter, arrange, and mutate. You'll use these functions to modify the counties dataset to view particular observations and answer questions about the data.
Exploring data with dplyr50 xpUnderstanding your data50 xpSelecting columns100 xpThe filter and arrange verbs50 xpArranging observations100 xpFiltering for conditions100 xpFiltering and arranging100 xpThe mutate() verb50 xpCalculating the number of government employees100 xpCalculating the percentage of women in a county100 xpSelect, mutate, filter, and arrange100 xp - 2
Aggregating Data
Now that you know how to transform your data, you'll want to know more about how to aggregate your data to make it more interpretable. You'll learn a number of functions you can use to take many observations in your data and summarize them, including count, group_by, summarize, ungroup, and top_n.
The count verb50 xpCounting by region100 xpCounting citizens by state100 xpMutating and counting100 xpThe group_by, summarize, and ungroup verbs50 xpSummarizing100 xpSummarizing by state100 xpSummarizing by state and region100 xpThe slice_min and slice_max verbs50 xpSelecting a county from each region100 xpFinding the lowest-income state in each region100 xpUsing summarize, slice_max, and count together100 xp - 3
Selecting and Transforming Data
Learn advanced methods to select and transform columns. Also learn about select helpers, which are functions that specify criteria for columns you want to choose, as well as the rename and transmute verbs.
Selecting50 xpSelecting columns100 xpSelect helpers100 xpThe rename verb50 xpRenaming a column after count100 xpRenaming a column as part of a select100 xpThe transmute verb50 xpChoosing among verbs50 xpUsing transmute100 xpMatching verbs to their definitions100 xpChoosing among the four verbs100 xp - 4
Case Study: The babynames Dataset
Work with a new dataset that represents the names of babies born in the United States each year. Learn how to use grouped mutates and window functions to ask and answer more complex questions about your data. And use a combination of dplyr and ggplot2 to make interesting graphs to further explore your data.
The babynames data50 xpFiltering and arranging for one year100 xpFinding the most popular names each year100 xpVisualizing names with ggplot2100 xpGrouped mutates50 xpFinding the year each name is most common100 xpAdding the total and maximum for each name100 xpVisualizing the normalized change in popularity100 xpWindow functions50 xpUsing ratios to describe the frequency of a name100 xpBiggest jumps in a name100 xpCongratulations!50 xp
In the following tracks
Data Analyst with RData Manipulation with RData Scientist with RData Scientist Professional with RR ProgrammerCollaborators

Prerequisites
Introduction to the TidyverseJames Chapman
See MoreCurriculum Manager, DataCamp
James is a Curriculum Manager at DataCamp. He has a Master's degree in Physics and Astronomy from Durham University, where he specialized in quasar detection and tutored Math and English. He joined DataCamp as a learner in 2018, and the data skills learned on DataCamp were quickly integrated into his scientific projects. In his spare time, he enjoys restoring retro toys and electronics.
Follow James on LinkedIn
Follow James on LinkedIn
Don’t just take our word for it
*4.4from 45 reviews
64%
24%
4%
0%
7%
Sort by
- Victor C.14 days
good exercise that not too easy or hard for beginner.
- Asseliya T.28 days
very helpful
- Jiao L.3 months
easy to understand and good practice!
- Tomas K.3 months
I am starting out with R and this course was perfect for people like me. It starts with really basic functions and then gets more challenging at the end. I really enjoyed it!
- Elariz T.3 months
i totally recommend they are teaching on the easiest way possible
Loading ...
"good exercise that not too easy or hard for beginner."
Victor C.
"very helpful"
Asseliya T.
"easy to understand and good practice!"
Jiao L.
Join over 11 million learners and start Data Manipulation with dplyr today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.