Skip to main content

course

Data Manipulation with data.table in R

Beginner

Updated 12/2024

Master core concepts about data manipulation such as filtering, selecting and calculating groupwise statistics using data.table.

Start course for free

Included for FreePremium or Teams

RData Manipulation4 hours15 videos59 exercises5,050 XP24,325Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

The data.table package provides a high-performance version of base R's data.frame with syntax and feature enhancements for ease of use, convenience and programming speed. This course shows you how to create, subset, and manipulate data.tables. You'll also learn about the database-inspired features of data.tables, including built-in groupwise operations. The course concludes with fast methods of importing and exporting tabular text data such as CSV files. Upon completion of the course, you will be able to use data.table in R for a more efficient manipulation and analysis process. Throughout the course you'll explore the San Francisco Bay Area bike share trip dataset from 2014.

Prerequisites

1

Introduction to data.table

Welcome to the course!

data.table pop quiz

Creating a data.table

Introducing bikes data

Filtering rows in a data.table

Filtering rows using positive integers

Filtering rows using negative integers

Filtering rows using logical vectors

Helpers for filtering

I %like% data.tables

Filtering with %in%

Filtering with %between% and %chin%

2

Selecting and Computing on Columns

Selecting columns from a data.table

Selecting a single column

Selecting columns by name

Deselecting specific columns

Computing on columns the data.table way

Computing in j (I)

Computing in j (II)

Advanced computations in j

Computing in j (III)

Combining i and j

3

Groupwise Operations

Computations by groups

Computing stats by groups (I)

Computing stats by groups (II)

Computing multiple stats

Chaining data.table expressions

Ordering rows

What are the top 5 destinations?

What is the most popular destination from each start station?

Combining i, j, and by (I)

Computations in j using .SD

Using .SD (I)

Using .SD (II)

4

Reference Semantics

Adding and updating columns by reference

Adding a new column

Updating an existing column (I)

Updating an existing column (II)

Grouped aggregations

Adding columns by group

Updating columns by group

Advanced aggregations

Adding multiple columns (I)

Adding multiple columns (II)

Combining i, j, and by (II)

5

Importing and Exporting Data

Fast data reading with fread()

Fast reading from disk

Importing a CSV file

Importing selected columns

Importing selected rows

Advanced file reading

Reading large integers

Specifying column classes

Dealing with empty and incomplete lines

Dealing with missing values

Fast data writing with fwrite()

Writing files to disk

Writing date and time columns

Fast writing to disk

Data Manipulation with data.table in R

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Join over 15 million learners and start Data Manipulation with data.table in R today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.