Skip to main content

course

Machine Learning with Tree-Based Models in R

Intermediate

Updated 12/2024

Learn how to use tree-based models and ensembles to make classification and regression predictions with tidymodels.

Start course for free

Included for FreePremium or Teams

RMachine Learning4 hours16 videos58 exercises4,850 XP8,277Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Tree-based machine learning models can reveal complex non-linear relationships in data and often dominate machine learning competitions. In this course, you'll use the tidymodels package to explore and build different tree-based models—from simple decision trees to complex random forests. You’ll also learn to use boosted trees, a powerful machine learning technique that uses ensemble learning to build high-performing predictive models. Along the way, you'll work with health and credit risk data to predict the incidence of diabetes and customer churn.

Prerequisites

Modeling with tidymodels in R

1

Classification Trees

Welcome to the course!

Why tree-based methods?

Specify that tree

Train that model

How to grow your tree

Train/test split

Avoiding class imbalances

From zero to hero

Predict and evaluate

Make predictions

Crack the matrix

Are you predicting correctly?

2

Regression Trees and Cross-Validation

Continuous outcomes

Train a regression tree

Predict new values

Inspect model output

Performance metrics for regression trees

In-sample performance

Out-of-sample performance

Bigger mistakes, bigger penalty

Cross-validation

Create the folds

Fit the folds

Evaluate the folds

Bias-variance tradeoff

Call things by their names

Adjust model complexity

In-sample and out-of-sample performance

3

Hyperparameters and Ensemble Models

Tuning hyperparameters

Generate a tuning grid

Tune along the grid

Pick the winner

More model measures

Calculate specificity

Draw the ROC curve

Area under the ROC curve

Bagged trees

Create bagged trees

In-sample ROC and AUC

Check for overfitting

Random forest

Bagged trees vs. random forest

Variable importance

4

Boosted Trees

Introduction to boosting

Bagging vs. boosting

Specify a boosted ensemble

Gradient boosting

Train a boosted ensemble

Evaluate the ensemble

Compare to a single classifier

Optimize the boosted ensemble

Tuning preparation

The actual tuning

Finalize the model

Model comparison

Compare AUC

Plot ROC curves

Machine Learning with Tree-Based Models in R

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Join over 15 million learners and start Machine Learning with Tree-Based Models in R today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.