Skip to main content
HomeR

Predictive Analytics using Networked Data in R

Learn to predict labels of nodes in networks using network learning and by extracting descriptive features from the network

Start Course for Free
4 hours14 videos56 exercises4,475 learnersTrophyStatement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies


Course Description

In this course, you will learn to perform state-of-the art predictive analytics using networked data in R. The aim of network analytics is to predict to which class a network node belongs, such as churner or not, fraudster or not, defaulter or not, etc. To accomplish this, we discuss how to leverage information from the network and its underlying structure in a predictive way. More specifically, we introduce the idea of featurization such that network features can be added to non-network features as such boosting the performance of any resulting analytical model. In this course, you will use the igraph package to generate and label a network of customers in a churn setting and learn about the foundations of network learning. Then, you will learn about homophily, dyadicity and heterophilicty, and how these can be used to get key exploratory insights in your network. Next, you will use the functionality of the igraph package to compute various network features to calculate both node-centric as well as neighbor based network features. Furthermore, you will use the Google PageRank algorithm to compute network features and empirically validate their predictive power. Finally, we teach you how to generate a flat dataset from the network and analyze it using logistic regression and random forests.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.

In the following Tracks

Network Analysis in R

Go To Track
  1. 1

    Introduction, networks and labelled networks

    Free

    In this chapter you will be introduced to labelled networks, network learning and the challanges that can arise.

    Play Chapter Now
    Motivation: social networks and predictive analytics
    50 xp
    Most likely to churn
    50 xp
    Create a network from an edgelist
    100 xp
    Labeled networks and network learning
    50 xp
    Labeling nodes
    100 xp
    Coloring nodes
    100 xp
    Visualizing Churners
    100 xp
    Relational Neighbor Classifier
    100 xp
    Challenges of network-based inference
    50 xp
    Challenges in Network learning
    50 xp
    Probabilistic Relational Neighbor Classifier
    100 xp
    Collective Inferencing
    100 xp
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

In the following Tracks

Network Analysis in R

Go To Track

datasets

Student Customers datasetStudent Edge List datasetStudent Network dataset

collaborators

Collaborator's avatar
David Campos
Collaborator's avatar
Shon Inouye
Collaborator's avatar
Chester Ismay
Maria Oskarsdottir HeadshotMaria Oskarsdottir

Post-doctoral Researcher

María Óskarsdóttir is a post-doctoral researcher and an active R user. She holds a PhD in Business Economics from KU Leuven (Belgium). Her research puts focus on applying social network analytics techniques for predictive modeling in marketing, credit scoring and insurance.
See More
Bart Baesens HeadshotBart Baesens

Professor in Analytics and Data Science at KU Leuven

Bart Baesens is professor in Analytics and Data Science at the Faculty of Economics and Business of KU Leuven, and a lecturer at the University of Southampton (UK). He has done extensive research on big data & analytics, credit risk analytics and fraud analytics. He regularly tutors, advises and provides consulting support to international firms with respect to their big data, analytics and fraud & credit risk management strategy.
See More

What do other learners have to say?

FAQs

Join over 15 million learners and start Predictive Analytics using Networked Data in R today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.