Skip to main content
HomePython

Anomaly Detection in Python

Detect anomalies in your data analysis and expand your Python statistical toolkit in this four-hour course.

Start Course for Free
4 hours16 videos59 exercises4,303 learnersTrophyStatement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies


Course Description

Spot Anomalies in Your Data Analysis


Extreme values or anomalies are present in almost any dataset, and it is critical to detect and deal with them before continuing statistical exploration. When left untouched, anomalies can easily disrupt your analyses and skew the performance of machine learning models.

Learn to Use Estimators Like Isolation Forest and Local Outlier Factor


In this course, you'll leverage Python to implement a variety of anomaly detection methods. You'll spot extreme values visually and use tested statistical techniques like Median Absolute Deviation for univariate datasets. For multivariate data, you'll learn to use estimators such as Isolation Forest, k-Nearest-Neighbors, and Local Outlier Factor. You'll also learn how to ensemble multiple outlier classifiers into a low-risk final estimator. You'll walk away with an essential data science tool in your belt: anomaly detection with Python.

Expand Your Python Statistical Toolkit


Better anomaly detection means better understanding of your data, and particularly, better root cause analysis and communication around system behavior. Adding this skill to your existing Python repertoire will help you with data cleaning, fraud detection, and identifying system disturbances.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Detecting Univariate Outliers

    Free

    This chapter covers techniques to detect outliers in 1-dimensional data using histograms, scatterplots, box plots, z-scores, and modified z-scores.

    Play Chapter Now
    What are anomalies and outliers?
    50 xp
    Print a 5-number summary
    100 xp
    Histograms for outlier detection
    100 xp
    Scatterplots for outlier detection
    100 xp
    Box plots and IQR
    50 xp
    Boxplots for outlier detection
    100 xp
    Calculating outlier limits with IQR
    100 xp
    Using outlier limits for filtering
    100 xp
    Using z-scores for Anomaly Detection
    50 xp
    Finding outliers with z-scores
    100 xp
    Using modified z-scores with PyOD
    100 xp
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

collaborators

Collaborator's avatar
James Chapman
Collaborator's avatar
Maham Khan
Collaborator's avatar
George Boorman

prerequisites

Supervised Learning with scikit-learn
Bex Tuychiyev HeadshotBex Tuychiyev

Kaggle Master, Data Science Content Creator

Bex is a Top 10 AI writer on Medium and a Kaggle Master with over 10k followers. He loves writing detailed guides, tutorials, and notebooks on complex data science and machine learning topics with a bit of a sarcastic style.
See More

What do other learners have to say?

Join over 15 million learners and start Anomaly Detection in Python today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.