Fraud Detection in R

Intermediate

Updated 12/2024

Learn to detect fraud with analytics in R.

R; Machine Learning; 4 hours; 16 videos; 49 exercises; 3,900 XP; 6,999; Statement of Accomplishment

Course Description

The Association of Certified Fraud Examiners estimates that fraud costs organizations worldwide $3.7 trillion a year and that a typical company loses five percent of its annual revenue to fraud. Fraud attempts are expected to increase further in the future, making fraud detection essential in most industries. This course shows how fraud patterns learned from historical data can be used to fight fraud. Techniques from robust statistics and digit analysis are presented for detecting unusual observations that are likely to be associated with fraud. Two main challenges in building a supervised fraud detection tool are the imbalance, or skewness, of the data and the differing costs of the different types of misclassification. We present techniques to address these issues and work with artificial and real datasets from a wide variety of fraud applications.
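
As a rough illustration of the digit analysis mentioned above, the sketch below compares observed first-digit frequencies with those expected under Benford's law, P(d) = log10(1 + 1/d). It uses base R only. The `amounts` vector and the helper `first_digit` are made up for this example and are not taken from the course material.

```r
# Expected first-digit probabilities under Benford's law: P(d) = log10(1 + 1/d)
benford_expected <- log10(1 + 1 / (1:9))
names(benford_expected) <- 1:9

# Hypothetical helper: leading non-zero digit of each non-zero number
first_digit <- function(x) {
  x <- abs(x[x != 0])
  floor(x / 10^floor(log10(x)))
}

# Made-up amounts, for illustration only
amounts <- c(1200, 18.5, 230, 1.99, 3400, 110, 97, 15000, 2.5, 140)

# Observed relative frequency of each leading digit 1..9
digits <- first_digit(amounts)
observed <- table(factor(digits, levels = 1:9)) / length(digits)

# Side-by-side comparison of observed and expected frequencies
round(cbind(observed = as.numeric(observed), expected = benford_expected), 3)
```

With only ten values the comparison is not meaningful in itself; in practice a much larger sample would be needed before a deviation from the expected frequencies could be read as a warning sign.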

Prerequisites

Unsupervised Learning in R

Supervised Learning in R: Classification

1. Introduction & Motivation

Introduction & Motivation

Imbalanced class distribution

Cost of not detecting fraud

Time features

Circular histogram

Suspicious timestamps

Frequency features

Frequency feature for one account

Frequency feature for multiple accounts

Recency features

Recency feature

Comparing frequency & recency

2. Social network analytics

Social network analytics

Analyzing a network

Overlapping edges

Fraud and social network analysis

Looking for homophily in a network

Visualizing node attributes

Social network based inference

Relational vs non-relational models

Relational neighbor classifier

Social network metrics

Degree, closeness & betweenness

Adding network features

3. Imbalanced class distributions

Dealing with imbalanced datasets

How to deal with class imbalance?

Visualizing patterns in the data

Random over-sampling

Random under-sampling

Shrinking the majority group

Combining ROS & RUS

Synthetic Over-sampling

Have you met SMOTE?

From dataset to detection model

Build your own detection model

True cost of fraud detection

4. Digit analysis and robust statistics

Digit analysis using Benford's law

Benford's Law for first digit

Conformity of census data

Benford's Law for fraud detection

Conformity to Benford's Law

Fire insurance claims

Payments data set

Detecting univariate outliers

Computing robust z-scores

Detecting multivariate outliers

Multivariate outlier detection
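
To make the robust z-score idea from this chapter concrete, here is a minimal sketch. It assumes the common definition that replaces the mean and standard deviation with the median and the median absolute deviation (MAD). The data and the cutoff of 3 are illustrative assumptions, not values from the course.

```r
# Made-up transaction amounts with one extreme value (illustration only)
x <- c(52, 48, 50, 51, 49, 47, 53, 50, 400)

# Classical z-scores: mean and sd are themselves distorted by the outlier
z_classic <- (x - mean(x)) / sd(x)

# Robust z-scores: median and MAD are barely affected by the outlier.
# stats::mad() multiplies by 1.4826 by default, so that it estimates
# the standard deviation for normally distributed data.
z_robust <- (x - median(x)) / mad(x)

# Flag observations whose absolute robust z-score exceeds the assumed cutoff
which(abs(z_robust) > 3)
```

In this toy data the classical z-score of the last value stays below 3, because the outlier inflates both the mean and the standard deviation, whereas the robust z-score flags it clearly.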
