Skip to main content

course

Model Validation in Python

Intermediate

Updated 01/2025

Learn the basics of model validation, validation techniques, and begin creating validated and high performing models.

Start course for free

Included for FreePremium or Teams

PythonMachine Learning4 hours15 videos47 exercises3,700 XP24,977Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Machine learning models are easier to implement now more than ever before. Without proper validation, the results of running new data through a model might not be as accurate as expected. Model validation allows analysts to confidently answer the question, how good is your model? We will answer this question for classification models using the complete set of tic-tac-toe endgame scenarios, and for regression models using fivethirtyeight’s ultimate Halloween candy power ranking dataset. In this course, we will cover the basics of model validation, discuss various validation techniques, and begin to develop tools for creating validated and high performing models.

Prerequisites

Supervised Learning with scikit-learn

1

Basic Modeling in scikit-learn

Introduction to model validation

Modeling steps

Seen vs. unseen data

Regression models

Set parameters and fit a model

Feature importances

Classification models

Classification predictions

Reusing model parameters

Random forest classifier

2

Validation Basics

Creating train, test, and validation datasets

Create one holdout set

Create two holdout sets

Why use holdout sets

Accuracy metrics: regression models

Mean absolute error

Mean squared error

Performance on data subsets

Classification metrics

Confusion matrices

Confusion matrices, again

Precision vs. recall

The bias-variance tradeoff

Error due to under/over-fitting

Am I underfitting?

3

Cross Validation

The problems with holdout sets

Two samples

Potential problems

Cross-validation

scikit-learn's KFold()

Using KFold indices

sklearn's cross_val_score()

scikit-learn's methods

Implement cross_val_score()

Leave-one-out-cross-validation (LOOCV)

When to use LOOCV

Leave-one-out-cross-validation

4

Selecting the best model with Hyperparameter tuning.

Introduction to hyperparameter tuning

Creating Hyperparameters

Running a model using ranges

RandomizedSearchCV

Preparing for RandomizedSearch

Implementing RandomizedSearchCV

Selecting your final model

Best classification accuracy

Selecting the best precision model

Course completed!

Model Validation in Python

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Join over 15 million learners and start Model Validation in Python today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.