Pular para o conteúdo principal

Predictive Analytics using Networked Data in R

Learn to predict labels of nodes in networks using network learning and by extracting descriptive features from the network

Comece O Curso Gratuitamente

4 horas14 vídeos56 exercícios4.466 aprendizesDeclaração de Realização

Crie sua conta gratuita

Google LinkedIn Facebook

ou

Ao continuar, você aceita nossos Termos de Uso, nossa Política de Privacidade e que seus dados são armazenados nos EUA.

Treinar 2 ou mais pessoas?

Tentar DataCamp for Business

Amado por alunos de milhares de empresas

Descrição do Curso

In this course, you will learn to perform state-of-the art predictive analytics using networked data in R. The aim of network analytics is to predict to which class a network node belongs, such as churner or not, fraudster or not, defaulter or not, etc. To accomplish this, we discuss how to leverage information from the network and its underlying structure in a predictive way. More specifically, we introduce the idea of featurization such that network features can be added to non-network features as such boosting the performance of any resulting analytical model. In this course, you will use the igraph package to generate and label a network of customers in a churn setting and learn about the foundations of network learning. Then, you will learn about homophily, dyadicity and heterophilicty, and how these can be used to get key exploratory insights in your network. Next, you will use the functionality of the igraph package to compute various network features to calculate both node-centric as well as neighbor based network features. Furthermore, you will use the Google PageRank algorithm to compute network features and empirically validate their predictive power. Finally, we teach you how to generate a flat dataset from the network and analyze it using logistic regression and random forests.

Para Empresas

Treinar 2 ou mais pessoas?

Obtenha acesso à sua equipe à plataforma DataCamp completa, incluindo todos os recursos.

Nas seguintes faixas

Análise de rede in R

Ir para a trilha

1
Introduction, networks and labelled networks
Gratuito
In this chapter you will be introduced to labelled networks, network learning and the challanges that can arise.
Reproduzir Capítulo Agora
Motivation: social networks and predictive analytics
50 xp
Most likely to churn
50 xp
Create a network from an edgelist
100 xp
Labeled networks and network learning
50 xp
Labeling nodes
100 xp
Coloring nodes
100 xp
Visualizing Churners
100 xp
Relational Neighbor Classifier
100 xp
Challenges of network-based inference
50 xp
Challenges in Network learning
50 xp
Probabilistic Relational Neighbor Classifier
100 xp
Collective Inferencing
100 xp
2
Homophily
In this chapter you will learn about homophily and how to compute the two measures that can be used to characterice it, dyadicity and heterophilicty.
Reproduzir Capítulo Agora
Homophily
50 xp
Homophilic networks
50 xp
Extracting types of edges
100 xp
Counting types of edges
100 xp
Counting nodes and computing connectance
100 xp
Dyadicity
50 xp
Same label edges
50 xp
Dyadicity of churners
100 xp
Dyadicity of non-churners
50 xp
Heterophilicity
50 xp
Cross label edges
50 xp
Compute heterophilicity
100 xp
Summary of homophily
50 xp
Dyadicity, Heterophilicity, & Homophily
50 xp
Is the network homophilic?
50 xp
3
Network Featurization
In this chapter you will use the igraph package to compute various network features and add them to the network.
Reproduzir Capítulo Agora
Basic Network features
50 xp
Simple network features
100 xp
Centrality features
100 xp
Transitivity
100 xp
Link-Based Features
50 xp
Adjacency matrices
100 xp
Link-based features
100 xp
Second order link-based features
100 xp
Neighborhood link-based features
100 xp
PageRank
50 xp
Most influential node
50 xp
Changes in PageRank
100 xp
Convergence of PageRank
100 xp
Personalized PageRank
100 xp
Extract PageRank features
100 xp
4
Putting it all together
In this chapter you will use the network from Chapter 3 to create a flat dataset. Using standard data mining techniques, you will build predictive models and measure their performance with AUC and top decile lift.
Reproduzir Capítulo Agora
Extract a dataset
50 xp
Getting a flat dataset
100 xp
Missing Values
50 xp
Replace missing values
100 xp
Correlated variables
100 xp
Building a predictive model
50 xp
Split into train and test
100 xp
Logistic regression model
100 xp
Random forest model
100 xp
Evaluating model performance
50 xp
Predicting churn
100 xp
Measure AUC
50 xp
Measure top decile lift
50 xp
Summary and final thoughts
50 xp

Para Empresas

Treinar 2 ou mais pessoas?

Obtenha acesso à sua equipe à plataforma DataCamp completa, incluindo todos os recursos.

Nas seguintes faixas

Análise de rede in R

Ir para a trilha

conjuntos de dados

Student Customers dataset Student Edge List dataset Student Network dataset

colaboradores

David Campos

Shon Inouye

Chester Ismay

pré-requisitos

Network Analysis in R Supervised Learning in R: Classification

Maria Oskarsdottir

Post-doctoral Researcher

Professor in Analytics and Data Science at KU Leuven

O que os outros alunos têm a dizer?

Junte-se a mais de 15 milhões de alunos e comece Predictive Analytics using Networked Data in R hoje mesmo!

Crie sua conta gratuita

Google LinkedIn Facebook

ou

Ao continuar, você aceita nossos Termos de Uso, nossa Política de Privacidade e que seus dados são armazenados nos EUA.