Accéder au contenu principal

cours

Supervised Learning in R: Regression

Intermédiaire

Updated 12/2024

In this course you will learn how to predict future events using linear regression, generalized additive models, random forests, and xgboost.

Commencer le cours gratuitement

Inclus gratuitementPremium or Teams

RMachine learning4 heures19 vidéos65 exercices5,300 XP42,249Déclaration de réalisation

Créez votre compte gratuit

Google LinkedIn Facebook

ou

En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.

Formation de 2 personnes ou plus ?

Essayer DataCamp for Business

Apprécié par les apprenants de milliers d’entreprises

Description du cours

From a machine learning perspective, regression is the task of predicting numerical outcomes from various inputs. In this course, you'll learn about different regression models, how to train these models in R, how to evaluate the models you train and use them to make predictions.

Conditions préalables

Introduction to Regression in R

1

What is Regression?

Commencer le chapitre

Welcome and Introduction

Identify the regression tasks

Linear regression - the fundamental method

Code a simple one-variable regression

Examining a model

Predicting once you fit a model

Predicting from the unemployment model

Multivariate linear regression (Part 1)

Multivariate linear regression (Part 2)

Wrapping up linear regression

2

Training and Evaluating Regression Models

Commencer le chapitre

Evaluating a model graphically

Graphically evaluate the unemployment model

The gain curve to evaluate the unemployment model

Root Mean Squared Error (RMSE)

Calculate RMSE

Calculate R-squared

Correlation and R-squared

Properly Training a Model

Generating a random test/train split

Train a model using test/train split

Evaluate a model using test/train split

Create a cross validation plan

Evaluate a modeling procedure using n-fold cross-validation

3

Issues to Consider

Commencer le chapitre

Categorical inputs

Examining the structure of categorical inputs

Modeling with categorical inputs

Interactions

Modeling an interaction

Modeling an interaction (2)

Transforming the response before modeling

Relative error

Modeling log-transformed monetary output

Comparing RMSE and root-mean-squared Relative Error

Transforming inputs before modeling

Input transforms: the "hockey stick"

Input transforms: the "hockey stick" (2)

4

Dealing with Non-Linear Responses

Commencer le chapitre

Logistic regression to predict probabilities

Fit a model of sparrow survival probability

Predict sparrow survival

Poisson and quasipoisson regression to predict counts

Poisson or quasipoisson

Fit a model to predict bike rental counts

Predict bike rentals on new data

Visualize the bike rental predictions

GAM to learn non-linear transforms

Writing formulas for GAM models

Writing formulas for GAM models (2)

Model soybean growth with GAM

Predict with the soybean model on test data

5

Tree-Based Methods

Commencer le chapitre

The intuition behind tree-based methods

Predicting with a decision tree

Random forests

Build a random forest model for bike rentals

Predict bike rentals with the random forest model

Visualize random forest bike model predictions

One-Hot-Encoding Categorical Variables

vtreat on a small example

Novel levels

vtreat the bike rental data

Gradient boosting machines

Find the right number of trees for a gradient boosting machine

Fit an xgboost bike rental model and predict

Evaluate the xgboost bike rental model

Visualize the xgboost bike rental model

Supervised Learning in R: Regression

Cours
terminé

Earn Déclaration de réalisation

Ajoutez ces informations d’identification à votre profil LinkedIn, à votre CV ou à votre CV
Partagez-le sur les réseaux sociaux et dans votre évaluation de performance

Inclus avecPremium or Teams

S'inscrire maintenant

Pour les entreprises

Formation de 2 personnes ou plus ?

Donnez à votre équipe l’accès à la plateforme DataCamp complète, y compris toutes les fonctionnalités.

Dans les titres suivants

Certification disponibleScientifique de données associé en R

Principes fondamentaux de l'apprentissage automatique en R

Scientifique en apprentissage automatique en R

formateurs

Nina Zumel

Co-founder, Principal Consultant at Win-Vector, LLC

John Mount

Co-founder, Principal Consultant at Win-Vector, LLC

collaborateurs

Sumedh Panchadhar

Richie Cotton

cours ressources

Bikesensemble de données

Blood Pressureensemble de données

Cricketensemble de données

House Pricesensemble de données

Incomeensemble de données

Mpgensemble de données

Soybeanensemble de données

Unemploymentensemble de données

Sparrowensemble de données

Inscrivez-vous 15 millions d’apprenants et commencer Supervised Learning in R: Regression Aujourd’hui!

Créez votre compte gratuit

Google LinkedIn Facebook

ou

En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.