Feature Engineering in R

Learn the principles of feature engineering for machine learning models and how to implement them using the R tidymodels framework.

4 hours · 14 videos · 58 exercises

Course Description

Discover Feature Engineering for Machine Learning

In this course, you’ll learn about feature engineering, which is at the heart of many types of machine learning models. Because a model's performance depends directly on the features it’s fed, feature engineering places domain knowledge at the center of the process. You’ll become acquainted with the principles of sound feature engineering: reducing the number of variables where possible, making learning algorithms run faster, improving interpretability, and preventing overfitting.

Implement Feature Engineering Techniques in R

You will learn how to implement feature engineering techniques using the R tidymodels framework, with an emphasis on the recipes package, which lets you create, extract, transform, and select the best features for your model.
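
As a rough illustration of what a recipe looks like, here is a minimal sketch. The `toy` data frame and its columns are made up for this example and are not taken from the course; it assumes the recipes package is installed and R 4.1 or later (for the `|>` pipe).

```r
library(recipes)

# Hypothetical toy data: one numeric and one categorical predictor
toy <- data.frame(
  y = c(1.2, 2.3, 3.1, 4.8, 5.0, 6.4),
  x = c(10, 20, 30, 40, 50, 60),
  g = factor(c("a", "b", "a", "b", "a", "b"))
)

# Declare the preprocessing steps; nothing is computed yet
rec <- recipe(y ~ ., data = toy) |>
  step_normalize(all_numeric_predictors()) |>  # center and scale x
  step_dummy(all_nominal_predictors())         # encode g as an indicator column

# prep() estimates the step parameters from the training data;
# bake(new_data = NULL) returns the processed training set
baked <- rec |> prep() |> bake(new_data = NULL)
```

Separating the declaration of the steps from their estimation means the same prepared recipe can later be applied to new data with `bake(new_data = ...)`.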

Engineer Features and Build Better ML Models

When faced with a new dataset, you will be able to identify and select relevant features and disregard non-informative ones to make your model run faster without sacrificing accuracy. You will also become comfortable applying transformations and creating new features to make your models more efficient, interpretable, and accurate!

In the following tracks

Certification available

Data Scientist in R

Machine Learning Scientist in R

1
Introducing Feature Engineering
Free
Raw data does not always come in its best shape for analysis. In this opening chapter, you will get a first look at how to transform and create features that enhance your model's performance and interpretability.
What is feature engineering?
50 xp
A tentative model
100 xp
Manually engineering a feature
100 xp
Creating new features using domain knowledge
50 xp
Setting up your data for analysis
100 xp
Building a workflow
100 xp
Increasing the information content of raw data
50 xp
Identifying missing values
100 xp
Imputing missing values and creating dummy variables
100 xp
Fitting and assessing the model
100 xp
Predicting hotel bookings
100 xp
2
Transforming Features
In this chapter, you’ll learn that, beyond manually transforming features, you can leverage tools from the tidyverse to engineer new variables programmatically. You’ll explore how this approach improves your models' reproducibility and is especially useful when handling datasets with many features.
Why transform existing features?
50 xp
Glancing at your data
50 xp
Normalizing and log-transforming
100 xp
Fit and augment
100 xp
Customize your model assessment
100 xp
Common feature transformations
50 xp
Common transformations
50 xp
Plain recipe
100 xp
Box-Cox transformation
100 xp
Yeo-Johnson transformation
100 xp
Advanced transformations
50 xp
Baseline
100 xp
step_poly()
100 xp
step_percentile()
100 xp
Who's staying?
100 xp
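
The transformations listed in this chapter can be sketched roughly as follows. This is an illustrative example on simulated data, not the course's hotel bookings exercise; the `skewed` data frame is hypothetical, and the sketch assumes the recipes package is installed.

```r
library(recipes)

set.seed(1)
# Hypothetical data: x is right-skewed and strictly positive
skewed <- data.frame(
  y = rnorm(50),
  x = rexp(50)
)

rec <- recipe(y ~ x, data = skewed) |>
  step_YeoJohnson(x) |>  # estimate a power transformation to reduce skew
  step_normalize(x)      # then center and scale the transformed values

transformed <- rec |> prep() |> bake(new_data = NULL)
```

`step_BoxCox()` could be swapped in for strictly positive variables, while `step_YeoJohnson()` also accepts zero and negative values.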
3
Extracting Features
You’ll now learn how models often benefit from reducing dimensionality and extracting features from high-dimensional data, including converting text data into numeric values, encoding categorical data, and ranking the predictive power of variables. You’ll explore methods including principal component analysis, kernel principal component analysis, numerical extraction from text, categorical encodings, and variable importance scores.
Reducing dimensionality
50 xp
Prepping the stage
100 xp
Digging into the structure
50 xp
Percent of variance explained
100 xp
Visualizing variance explained
100 xp
Feature hashing
50 xp
Investigating education field
100 xp
Into the matrix
100 xp
Exploring the hashing
50 xp
Visualizing the hashing
100 xp
Encoding categorical data using supervised learning
50 xp
Setting up your workflow
100 xp
Fitting, augmenting, and assessing
100 xp
Binding models together
100 xp
Variable Importance
50 xp
Create a workflow
100 xp
Fit and augment
100 xp
Which is the main predictor?
100 xp
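
A minimal sketch of the dimensionality-reduction idea from this chapter, using the built-in `mtcars` data rather than the course's dataset (an assumption made purely for illustration). It assumes the recipes package is installed.

```r
library(recipes)

# Principal component analysis inside a recipe: normalize first,
# then replace the numeric predictors with two components
rec <- recipe(mpg ~ ., data = mtcars) |>
  step_normalize(all_numeric_predictors()) |>
  step_pca(all_numeric_predictors(), num_comp = 2)

pcs <- rec |> prep() |> bake(new_data = NULL)

# Percent of variance explained, computed with base R's prcomp()
# on the same predictors (every column except mpg)
pca <- prcomp(mtcars[, -1], center = TRUE, scale. = TRUE)
pct_var <- pca$sdev^2 / sum(pca$sdev^2) * 100
```

Here `pcs` holds the outcome plus the columns `PC1` and `PC2`, and `pct_var` gives each component's share of the total variance.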
4
Selecting Features
You’ll wrap up the course by learning how to select features. You’ll begin with the problems associated with using all available features in a model and the importance of identifying irrelevant and redundant features, and you’ll learn to remove them using embedded methods such as lasso and elastic-net. Next, you’ll explore shrinkage methods such as lasso, ridge, and elastic-net, which can be used to regularize feature weights or select features by setting coefficients to zero. Finally, you’ll create an end-to-end feature engineering workflow, reviewing and practicing the previously learned concepts and functions in a small project.
Reducing the model's features
50 xp
Sifting through variable importance
100 xp
Assessing model performance using all available predictors
100 xp
Building a reduced model
100 xp
Shrinkage methods
50 xp
Manual regularization with Lasso
100 xp
Tuning the penalty
100 xp
Finalizing the model
100 xp
Putting it all together
50 xp
Prep and split
100 xp
Preprocess
100 xp
Model
100 xp
Assess
100 xp
Congratulations!
50 xp
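
The shrinkage idea from this chapter can be sketched as below. This is an assumption-laden illustration on the built-in `mtcars` data, with an arbitrarily chosen penalty of 1 rather than a tuned value; it assumes the parsnip and glmnet packages are installed.

```r
library(parsnip)

# mixture = 1 requests a pure lasso penalty (0 would be ridge,
# values in between give elastic-net)
lasso_spec <- linear_reg(penalty = 1, mixture = 1) |>
  set_engine("glmnet")

lasso_fit <- fit(lasso_spec, mpg ~ ., data = mtcars)

# Coefficients at the requested penalty; predictors that the lasso
# drops from the model have an estimate of exactly zero
coefs <- tidy(lasso_fit)
dropped <- coefs$term[coefs$estimate == 0]
```

In practice the penalty would be chosen by resampling rather than fixed by hand, as the "Tuning the penalty" exercise suggests.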

Collaborators

Maham Khan

Arne Warnke

Prerequisites

Supervised Learning in R: Classification
Supervised Learning in R: Regression

Research Professor
