Supervised Learning in R: Regression
In this course, you will learn how to predict numerical outcomes using linear regression, generalized additive models, random forests, and xgboost.
4 hours · 19 videos · 65 exercises · 41,787 learners · Statement of Accomplishment
Course Description
From a machine learning perspective, regression is the task of predicting numerical outcomes from various inputs. In this course, you'll learn about different regression models, how to train these models in R, how to evaluate the models you train, and how to use them to make predictions.
In the following tracks: Machine Learning Fundamentals in R, Machine Learning Scientist in R
1. What is Regression? (Free)
In this chapter we introduce the concept of regression from a machine learning point of view. We will present the fundamental regression method: linear regression. We will show how to fit a linear regression model and how to make predictions from it.
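As a minimal sketch of that workflow (on an invented data frame, not the course's unemployment data), fitting and predicting with lm() might look like this:

```r
# Minimal sketch: fit a one-variable linear regression and predict.
# The data frame and column names below are invented for illustration.
set.seed(42)
train <- data.frame(x = runif(100, 0, 10))
train$y <- 3 + 2 * train$x + rnorm(100)

# Fit the model with a formula: outcome ~ input
model <- lm(y ~ x, data = train)

# Examine the fitted model
summary(model)

# Predict on new data that uses the same column names
newdata <- data.frame(x = c(2.5, 7.0))
predict(model, newdata = newdata)
```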
- Welcome and Introduction (50 xp)
- Identify the regression tasks (50 xp)
- Linear regression - the fundamental method (50 xp)
- Code a simple one-variable regression (100 xp)
- Examining a model (100 xp)
- Predicting once you fit a model (50 xp)
- Predicting from the unemployment model (100 xp)
- Multivariate linear regression (Part 1) (100 xp)
- Multivariate linear regression (Part 2) (100 xp)
- Wrapping up linear regression (50 xp)

2. Training and Evaluating Regression Models
Now that we have learned how to fit basic linear regression models, we will learn how to evaluate how well our models perform. We will review evaluating a model graphically, and look at two basic metrics for regression models. We will also learn how to train a model that will perform well in the wild, not just on training data. Although we will demonstrate these techniques using linear regression, all these concepts apply to models fit with any regression algorithm.
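As a hedged sketch of these ideas (invented data; the course also covers cross-validation plans, which are omitted here), a random test/train split with RMSE and R-squared might look like this:

```r
# Sketch: train/test split, then RMSE and R-squared on the held-out data.
# All data and names here are invented for illustration.
set.seed(1234)
df <- data.frame(x = runif(200, 0, 10))
df$y <- 5 + 1.5 * df$x + rnorm(200, sd = 2)

# Random ~75/25 train/test split
gp    <- runif(nrow(df))
train <- df[gp < 0.75, ]
test  <- df[gp >= 0.75, ]

model <- lm(y ~ x, data = train)

# Evaluate on the held-out test set
test$pred <- predict(model, newdata = test)
res <- test$y - test$pred

rmse <- sqrt(mean(res^2))                                # Root Mean Squared Error
rsq  <- 1 - sum(res^2) / sum((test$y - mean(test$y))^2)  # R-squared
c(rmse = rmse, rsq = rsq)
```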
- Evaluating a model graphically (50 xp)
- Graphically evaluate the unemployment model (100 xp)
- The gain curve to evaluate the unemployment model (100 xp)
- Root Mean Squared Error (RMSE) (50 xp)
- Calculate RMSE (100 xp)
- R-squared (50 xp)
- Calculate R-squared (100 xp)
- Correlation and R-squared (100 xp)
- Properly Training a Model (50 xp)
- Generating a random test/train split (100 xp)
- Train a model using test/train split (100 xp)
- Evaluate a model using test/train split (100 xp)
- Create a cross validation plan (100 xp)
- Evaluate a modeling procedure using n-fold cross-validation (100 xp)

3. Issues to Consider
Before moving on to more sophisticated regression techniques, we will look at some other modeling issues: modeling with categorical inputs, interactions between variables, and when you might consider transforming inputs and outputs before modeling. While more sophisticated regression techniques manage some of these issues automatically, it's important to be aware of them so you understand which methods best handle various issues, and which issues you must still manage yourself.
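The sketch below (invented data and column names) shows how these issues look in R: a categorical input handled by the formula interface, an explicit interaction term, and a log-transformed monetary response:

```r
# Sketch: categorical inputs, an interaction, and a log-transformed outcome.
# Data and column names are invented for illustration.
set.seed(7)
d <- data.frame(
  group = sample(c("A", "B", "C"), 300, replace = TRUE),  # categorical input
  x     = runif(300, 1, 10)                               # numeric input
)
d$income <- exp(1 + 0.2 * d$x + ifelse(d$group == "B", 0.5, 0) + rnorm(300, sd = 0.3))

# lm() expands categorical inputs into indicator variables automatically
m_cat <- lm(log(income) ~ group + x, data = d)

# A ':' term adds an interaction between the categorical and numeric inputs
m_int <- lm(log(income) ~ group + x + group:x, data = d)

# Predictions from m_cat are on the log scale; exponentiate to return to dollars
d$pred_income <- exp(predict(m_cat, newdata = d))
head(d$pred_income)
```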
- Categorical inputs (50 xp)
- Examining the structure of categorical inputs (100 xp)
- Modeling with categorical inputs (100 xp)
- Interactions (50 xp)
- Modeling an interaction (100 xp)
- Modeling an interaction (2) (100 xp)
- Transforming the response before modeling (50 xp)
- Relative error (100 xp)
- Modeling log-transformed monetary output (100 xp)
- Comparing RMSE and root-mean-squared Relative Error (100 xp)
- Transforming inputs before modeling (50 xp)
- Input transforms: the "hockey stick" (100 xp)
- Input transforms: the "hockey stick" (2) (100 xp)

4. Dealing with Non-Linear Responses
Now that we have mastered linear models, we will begin to look at techniques for modeling situations that don't meet the assumptions of linearity. This includes predicting probabilities and frequencies (values bounded between 0 and 1); predicting counts (nonnegative integer values, and associated rates); and responses that have a non-linear but additive relationship to the inputs. These algorithms are variations on the standard linear model.
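As a compact, hedged sketch of these three model families (invented data; the gam() call assumes the mgcv package used in the course):

```r
# Sketch: logistic regression, quasipoisson regression, and a GAM.
# Data and column names are invented for illustration.
library(mgcv)   # assumed: provides gam() and the s() smooth term

set.seed(99)
d <- data.frame(x = runif(400, 0, 10))

# 1. Logistic regression for a probability (0/1 outcome)
d$survived <- rbinom(400, size = 1, prob = plogis(-2 + 0.4 * d$x))
m_logit <- glm(survived ~ x, data = d, family = binomial)
d$p_survive <- predict(m_logit, newdata = d, type = "response")

# 2. Poisson / quasipoisson regression for counts
d$rentals <- rpois(400, lambda = exp(0.5 + 0.2 * d$x))
m_pois <- glm(rentals ~ x, data = d, family = quasipoisson)
d$pred_rentals <- predict(m_pois, newdata = d, type = "response")

# 3. GAM with a smooth term s(x) for a non-linear but additive relationship
d$growth <- sin(d$x) + 0.3 * d$x + rnorm(400, sd = 0.2)
m_gam <- gam(growth ~ s(x), data = d, family = gaussian)
d$pred_growth <- as.numeric(predict(m_gam, newdata = d))
```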
- Logistic regression to predict probabilities (50 xp)
- Fit a model of sparrow survival probability (100 xp)
- Predict sparrow survival (100 xp)
- Poisson and quasipoisson regression to predict counts (50 xp)
- Poisson or quasipoisson (50 xp)
- Fit a model to predict bike rental counts (100 xp)
- Predict bike rentals on new data (100 xp)
- Visualize the bike rental predictions (100 xp)
- GAM to learn non-linear transforms (50 xp)
- Writing formulas for GAM models (50 xp)
- Writing formulas for GAM models (2) (50 xp)
- Model soybean growth with GAM (100 xp)
- Predict with the soybean model on test data (100 xp)

5. Tree-Based Methods
In this chapter we will look at modeling algorithms that do not assume linearity or additivity, and that can learn limited types of interactions among input variables. These algorithms are *tree-based* methods that work by combining ensembles of *decision trees* that are learned from the training data.
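As a hedged end-to-end sketch (invented data, not the course's bike rental data; assumes the ranger, vtreat, and xgboost packages used in the course):

```r
# Sketch: a random forest, then vtreat one-hot encoding feeding xgboost.
# Data and column names are invented for illustration.
library(ranger)    # assumed: random forests
library(vtreat)    # assumed: variable treatment / one-hot encoding
library(xgboost)   # assumed: gradient boosting

set.seed(2024)
d <- data.frame(
  season = sample(c("spring", "summer", "fall", "winter"), 500, replace = TRUE),
  temp   = runif(500, 0, 35)
)
d$rentals <- 50 + 3 * d$temp + ifelse(d$season == "summer", 40, 0) + rnorm(500, sd = 10)

# Random forest: handles the categorical input and non-linearities directly
rf <- ranger(rentals ~ season + temp, data = d, num.trees = 500, seed = 1)
d$pred_rf <- predict(rf, data = d)$predictions

# xgboost needs an all-numeric matrix, so one-hot encode with vtreat first
treatplan <- designTreatmentsZ(d, varlist = c("season", "temp"), verbose = FALSE)
d_treated <- prepare(treatplan, d)   # all-numeric treated frame

xgb <- xgboost(
  data      = as.matrix(d_treated),
  label     = d$rentals,
  nrounds   = 50,
  objective = "reg:squarederror",
  verbose   = 0
)
d$pred_xgb <- predict(xgb, as.matrix(d_treated))
```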
- The intuition behind tree-based methods (50 xp)
- Predicting with a decision tree (50 xp)
- Random forests (50 xp)
- Build a random forest model for bike rentals (100 xp)
- Predict bike rentals with the random forest model (100 xp)
- Visualize random forest bike model predictions (100 xp)
- One-Hot-Encoding Categorical Variables (50 xp)
- vtreat on a small example (100 xp)
- Novel levels (100 xp)
- vtreat the bike rental data (100 xp)
- Gradient boosting machines (50 xp)
- Find the right number of trees for a gradient boosting machine (100 xp)
- Fit an xgboost bike rental model and predict (100 xp)
- Evaluate the xgboost bike rental model (100 xp)
- Visualize the xgboost bike rental model (100 xp)
Prerequisites
Introduction to Regression in R

Instructors
Nina Zumel
Co-founder, Principal Consultant at Win-Vector, LLC

John Mount
Co-founder, Principal Consultant at Win-Vector, LLC
Join over 15 million learners and start Supervised Learning in R: Regression today!