Skip to main content

course

Unsupervised Learning in Python

Intermediate

4.4+

Updated 12/2024

Learn how to cluster, transform, visualize, and extract insights from unlabeled datasets using scikit-learn and scipy.

Start course for free

Included for FreePremium or Teams

PythonMachine Learning4 hours13 videos52 exercises4,150 XP148,278Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Say you have a collection of customers with a variety of characteristics such as age, location, and financial history, and you wish to discover patterns and sort them into clusters. Or perhaps you have a set of texts, such as Wikipedia pages, and you wish to segment them into categories based on their content. This is the world of unsupervised learning, called as such because you are not guiding, or supervising, the pattern discovery by some prediction task, but instead uncovering hidden structure from unlabeled data. Unsupervised learning encompasses a variety of techniques in machine learning, from clustering to dimension reduction to matrix factorization. In this course, you'll learn the fundamentals of unsupervised learning and implement the essential algorithms using scikit-learn and SciPy. You will learn how to cluster, transform, visualize, and extract insights from unlabeled datasets, and end the course by building a recommender system to recommend popular musical artists.

Prerequisites

Supervised Learning with scikit-learn

1

Clustering for Dataset Exploration

Unsupervised Learning

How many clusters?

Clustering 2D points

Inspect your clustering

Evaluating a clustering

How many clusters of grain?

Evaluating the grain clustering

Transforming features for better clusterings

Scaling fish data for clustering

Clustering the fish data

Clustering stocks using KMeans

Which stocks move together?

2

Visualization with Hierarchical Clustering and t-SNE

Visualizing hierarchies

How many merges?

Hierarchical clustering of the grain data

Hierarchies of stocks

Cluster labels in hierarchical clustering

Which clusters are closest?

Different linkage, different hierarchical clustering!

Intermediate clusterings

Extracting the cluster labels

t-SNE for 2-dimensional maps

t-SNE visualization of grain dataset

A t-SNE map of the stock market

3

Decorrelating Your Data and Dimension Reduction

Visualizing the PCA transformation

Correlated data in nature

Decorrelating the grain measurements with PCA

Principal components

Intrinsic dimension

The first principal component

Variance of the PCA features

Intrinsic dimension of the fish data

Dimension reduction with PCA

Dimension reduction of the fish measurements

A tf-idf word-frequency array

Clustering Wikipedia part I

Clustering Wikipedia part II

4

Discovering Interpretable Features

Non-negative matrix factorization (NMF)

Non-negative data

NMF applied to Wikipedia articles

NMF features of the Wikipedia articles

NMF reconstructs samples

NMF learns interpretable parts

NMF learns topics of documents

Explore the LED digits dataset

NMF learns the parts of images

PCA doesn't learn parts

Building recommender systems using NMF

Which articles are similar to 'Cristiano Ronaldo'?

Recommend musical artists part I

Recommend musical artists part II

Final thoughts

Unsupervised Learning in Python

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Don’t just take our word for it

*4.4

from 81 reviews

70%

11%

15%

4%

0%

Highest to Lowest
Lowest to Highest
Most recent
Top reviews

Vu H.

5 days

Dimension reduction was new and I was lost at first. But with some additional information from Bing Chat, the picture was gratually emerged. The "compact" presentation was quite interesting! It worked for me! Getting familiar with something new (and quite challenging) is a great experience.

Frauke W.

20 days

Nice

Alois H.

3 months

great teacher, interesting range of topics!

Li D.

4 months

Great course

Anna S.

4 months

Very informative

"Dimension reduction was new and I was lost at first. But with some additional information from Bing Chat, the picture was gratually emerged. The "compact" presentation was quite interesting! It worked for me! Getting familiar with something new (and quite challenging) is a great experience."

Vu H.

"Nice"

Frauke W.

"great teacher, interesting range of topics!"

Alois H.

Join over 15 million learners and start Unsupervised Learning in Python today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.