Skip to main content

course

Machine Learning with Tree-Based Models in Python

Intermediate

4.5+

Updated 12/2024

In this course, you'll learn how to use tree-based models and ensembles for regression and classification using scikit-learn.

Start course for free

Included for FreePremium or Teams

PythonMachine Learning5 hours15 videos57 exercises4,650 XP96,489Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Decision trees are supervised learning models used for problems involving classification and regression. Tree models present a high flexibility that comes at a price: on one hand, trees are able to capture complex non-linear relationships; on the other hand, they are prone to memorizing the noise present in a dataset. By aggregating the predictions of trees that are trained differently, ensemble methods take advantage of the flexibility of trees while reducing their tendency to memorize noise. Ensemble methods are used across a variety of fields and have a proven track record of winning many machine learning competitions. In this course, you'll learn how to use Python to train decision trees and tree-based models with the user-friendly scikit-learn machine learning library. You'll understand the advantages and shortcomings of trees and demonstrate how ensembling can alleviate these shortcomings, all while practicing on real-world datasets. Finally, you'll also understand how to tune the most influential hyperparameters in order to get the most out of your models.

Prerequisites

Supervised Learning with scikit-learn

1

Classification and Regression Trees

Decision tree for classification

Train your first classification tree

Evaluate the classification tree

Logistic regression vs classification tree

Classification tree Learning

Growing a classification tree

Using entropy as a criterion

Entropy vs Gini index

Decision tree for regression

Train your first regression tree

Evaluate the regression tree

Linear regression vs regression tree

2

The Bias-Variance Tradeoff

Generalization Error

Complexity, bias and variance

Overfitting and underfitting

Diagnose bias and variance problems

Instantiate the model

Evaluate the 10-fold CV error

Evaluate the training error

High bias or high variance?

Ensemble Learning

Define the ensemble

Evaluate individual classifiers

Better performance with a Voting Classifier

3

Bagging and Random Forests

Define the bagging classifier

Evaluate Bagging performance

Out of Bag Evaluation

Prepare the ground

OOB Score vs Test Set Score

Random Forests (RF)

Train an RF regressor

Evaluate the RF regressor

Visualizing features importances

4

Boosting

Define the AdaBoost classifier

Train the AdaBoost classifier

Evaluate the AdaBoost classifier

Gradient Boosting (GB)

Define the GB regressor

Train the GB regressor

Evaluate the GB regressor

Stochastic Gradient Boosting (SGB)

Regression with SGB

Train the SGB regressor

Evaluate the SGB regressor

5

Model Tuning

Tuning a CART's Hyperparameters

Tree hyperparameters

Set the tree's hyperparameter grid

Search for the optimal tree

Evaluate the optimal tree

Tuning a RF's Hyperparameters

Random forests hyperparameters

Set the hyperparameter grid of RF

Search for the optimal forest

Evaluate the optimal forest

Congratulations!

Machine Learning with Tree-Based Models in Python

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Don’t just take our word for it

*4.5

from 48 reviews

71%

15%

13%

2%

0%

Highest to Lowest
Lowest to Highest
Most recent
Top reviews

Sait O.

2 months

Comprehensive and in depth

Mariana R.

4 months

Easy to understand

Laerty S.

7 months

This course was very important to me because it helped me to understand deeply some concepts that I was thinking that I already knew.

Rafael C.

7 months

Enables you to set up the foundations and then proceeed at a progressively quicker pace.

Sue D.

9 months

Stunning course with awesome instructor!

"Comprehensive and in depth"

Sait O.

"Easy to understand"

Mariana R.

"This course was very important to me because it helped me to understand deeply some concepts that I was thinking that I already knew."

Laerty S.

FAQs

Join over 15 million learners and start Machine Learning with Tree-Based Models in Python today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.