Course

Supervised Learning in R: Regression

Intermediate

Updated 12/2024

In this course you will learn how to predict future events using linear regression, generalized additive models, random forests, and xgboost.

R · Machine learning · 4 hours · 19 videos · 65 exercises · 5,300 XP · 42,247 · Statement of Accomplishment

Course description

From a machine learning perspective, regression is the task of predicting numerical outcomes from various inputs. In this course, you'll learn about different regression models, how to train these models in R, how to evaluate the models you train, and how to use them to make predictions.
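The train, evaluate, predict workflow described above can be sketched in a few lines. The course itself works in R; the block below is only an illustrative Python sketch using the standard library. The data is synthetic, and every name in it (`fit_line`, `predict`, `rmse`, `r_squared`) is a hypothetical helper written for this example, not something taken from the course materials.

```python
import math
import random


def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict(model, xs):
    """Apply a fitted (intercept, slope) pair to new inputs."""
    intercept, slope = model
    return [intercept + slope * x for x in xs]


def rmse(ys, preds):
    """Root mean squared error between outcomes and predictions."""
    return math.sqrt(sum((y - p) ** 2 for y, p in zip(ys, preds)) / len(ys))


def r_squared(ys, preds):
    """Fraction of the outcome variance explained by the predictions."""
    mean_y = sum(ys) / len(ys)
    rss = sum((y - p) ** 2 for y, p in zip(ys, preds))
    tss = sum((y - mean_y) ** 2 for y in ys)
    return 1 - rss / tss


# Synthetic data (an assumption for illustration): y = 2 + 3x plus noise.
rng = random.Random(0)
xs = [i / 10 for i in range(100)]
ys = [2.0 + 3.0 * x + rng.gauss(0, 0.5) for x in xs]

# Random 75/25 train/test split, so the model is scored on unseen rows.
indices = list(range(len(xs)))
rng.shuffle(indices)
train_idx, test_idx = indices[:75], indices[75:]
train_x = [xs[i] for i in train_idx]
train_y = [ys[i] for i in train_idx]
test_x = [xs[i] for i in test_idx]
test_y = [ys[i] for i in test_idx]

model = fit_line(train_x, train_y)
test_pred = predict(model, test_x)

print("intercept, slope:", model)
print("test RMSE:", rmse(test_y, test_pred))
print("test R-squared:", r_squared(test_y, test_pred))
```

Scoring on the held-out rows rather than the training rows gives a fairer estimate of how the model would behave on new data, which is the idea behind the test/train split and cross-validation topics in chapter 2.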

Prerequisites

Introduction to Regression in R

1. What is Regression?

Welcome and Introduction

Identify the regression tasks

Linear regression - the fundamental method

Code a simple one-variable regression

Examining a model

Predicting once you fit a model

Predicting from the unemployment model

Multivariate linear regression (Part 1)

Multivariate linear regression (Part 2)

Wrapping up linear regression

2. Training and Evaluating Regression Models

Evaluating a model graphically

Graphically evaluate the unemployment model

The gain curve to evaluate the unemployment model

Root Mean Squared Error (RMSE)

Calculate RMSE

Calculate R-squared

Correlation and R-squared

Properly Training a Model

Generating a random test/train split

Train a model using test/train split

Evaluate a model using test/train split

Create a cross validation plan

Evaluate a modeling procedure using n-fold cross-validation

3. Issues to Consider

Categorical inputs

Examining the structure of categorical inputs

Modeling with categorical inputs

Interactions

Modeling an interaction

Modeling an interaction (2)

Transforming the response before modeling

Relative error

Modeling log-transformed monetary output

Comparing RMSE and root-mean-squared Relative Error

Transforming inputs before modeling

Input transforms: the "hockey stick"

Input transforms: the "hockey stick" (2)

4. Dealing with Non-Linear Responses

Logistic regression to predict probabilities

Fit a model of sparrow survival probability

Predict sparrow survival

Poisson and quasipoisson regression to predict counts

Poisson or quasipoisson

Fit a model to predict bike rental counts

Predict bike rentals on new data

Visualize the bike rental predictions

GAM to learn non-linear transforms

Writing formulas for GAM models

Writing formulas for GAM models (2)

Model soybean growth with GAM

Predict with the soybean model on test data

5. Tree-Based Methods

The intuition behind tree-based methods

Predicting with a decision tree

Random forests

Build a random forest model for bike rentals

Predict bike rentals with the random forest model

Visualize random forest bike model predictions

One-Hot-Encoding Categorical Variables

vtreat on a small example

Novel levels

vtreat the bike rental data

Gradient boosting machines

Find the right number of trees for a gradient boosting machine

Fit an xgboost bike rental model and predict

Evaluate the xgboost bike rental model

Visualize the xgboost bike rental model
