Intermediate Predictive Analytics in Python
Learn how to prepare and organize your data for predictive analytics.
Commencer Le Cours Gratuitement4 heures15 vidéos56 exercices5 443 apprenantsDéclaration de réalisation
Créez votre compte gratuit
ou
En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.Formation de 2 personnes ou plus ?
Essayer DataCamp for BusinessApprécié par les apprenants de milliers d'entreprises
Description du cours
Building good models only succeeds if you have a decent base table to start with. In this course you will learn how to construct a good base table, create variables and prepare your data for modeling. We finish with advanced topics on the matter.
Formation de 2 personnes ou plus ?
Donnez à votre équipe l’accès à la plateforme DataCamp complète, y compris toutes les fonctionnalités.- 1
Crucial base table concepts
GratuitIn this chapter you will learn how to construct the foundations of your base table, namely the population and the target.
The basetable timeline50 xpTimeline violations50 xpAvailable data100 xpTimeline violation100 xpThe population50 xpSelect the relevant population50 xpA timeline compliant population100 xpRemoving duplicate objects100 xpThe target50 xpCalculate an event target100 xpCalculate an aggregated target100 xp - 2
Creating variables
You will learn how to add variables to the base table that you can use to predict the target.
Adding fixed variables50 xpSelecting the right value50 xpAdding age100 xpAdding the donor segment100 xpAdding living place100 xpAdding aggregated variables50 xpSelecting the appropriate date50 xpMaximum value last year100 xpRecency of donations100 xpAdding evolutions50 xpRatio of last month's and last year's average100 xpAbsolute difference between two years100 xpUsing evolution variables50 xpPerformance of evolution variables100 xpMeaning of evolution100 xp - 3
Data preparation
Once you derived variables from the raw data, it is time to clean the data and prepare it for modeling. In this Chapter we discuss the steps that need to be taken to make your data modeling-ready.
Creating dummies50 xpCreating a dummy from a two-category variable100 xpCreating dummies from a many-categories variable100 xpMissing values50 xpHow to replace missing values50 xpCreating a missing value dummy100 xpReplace missing values with the median value100 xpReplace missing values with a fixed value100 xpHandling outliers50 xpInfluence of outliers on predictive models50 xpHandle outliers with winsorization100 xpHandle outliers with standard deviation100 xpTransformations50 xpInteractions50 xpSquare root transformation100 xpAdding interactions to the basetable100 xp - 4
Advanced base table concepts
In some cases, the target or variables change heavily with the seasons. You will learn how you can deal with seasonality by adding different snapshots to the base table.
Seasonality50 xpSeasonality or not50 xpDetecting seasonality100 xpThe effect of seasonality100 xpUsing multiple snapshots50 xpTarget values50 xpCalculating snapshot targets100 xpCalculating aggregated variables100 xpStacking basetables100 xpThe timegap50 xpEvents during the timegap50 xpCalculating aggregated variables with timegap100 xpAdding age with timegap100 xpCongratulations50 xp
Formation de 2 personnes ou plus ?
Donnez à votre équipe l’accès à la plateforme DataCamp complète, y compris toutes les fonctionnalités.ensembles de données
Donor IDsBasetable with countries and ageBasetable used in Ex 2.13Living place of donorsDonationscollaborateurs
Nele Verbiest
Voir PlusData Scientist at Python Predictions
Qu’est-ce que les autres apprenants ont à dire ?
Inscrivez-vous 15 millions d’apprenants et commencer Intermediate Predictive Analytics in Python Aujourd’hui!
Créez votre compte gratuit
ou
En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.