Skip to main content
HomeR

course

Scalable Data Processing in R

Advanced
Updated 12/2024
Learn how to write scalable code for working with big data in R using the bigmemory and iotools packages.
Start course for free

Included for FreePremium or Teams

RSoftware Development4 hours15 videos49 exercises3,950 XP5,852Statement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Datasets are often larger than available RAM, which causes problems for R programmers since by default all the variables are stored in memory. You’ll learn tools for processing, exploring, and analyzing data directly from disk. You’ll also implement the split-apply-combine approach and learn how to write scalable code using the bigmemory and iotools packages. In this course, you'll make use of the Federal Housing Finance Agency's data, a publicly available data set chronicling all mortgages that were held or securitized by both Federal National Mortgage Association (Fannie Mae) and Federal Home Loan Mortgage Corporation (Freddie Mac) from 2009-2015.

Prerequisites

Writing Efficient R Code
1

Working with increasingly large data sets

Start Chapter
2

Processing and Analyzing Data with bigmemory

Start Chapter
3

Working with iotools

Start Chapter
4

Case Study: A Preliminary Analysis of the Housing Data

Start Chapter
Scalable Data Processing in R
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Enroll now

Join over 15 million learners and start Scalable Data Processing in R today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.