Data Science Tutorials

Develop your data science skills with tutorials in our blog. We cover everything from intricate data visualizations in Tableau to version control features in Git.
Python

Web Scraping & NLP in Python

Learn to scrape novels from the web and plot word frequency distributions. You will gain experience with the Python packages requests, BeautifulSoup, and nltk.

Hugo Bowne-Anderson

October 13, 2017

Python

Hierarchical indices, groupby and pandas

In this tutorial, you’ll learn about multi-indices for pandas DataFrames and how they arise naturally from groupby operations on real-world data sets.

Hugo Bowne-Anderson

March 31, 2023

Python

Groupby, split-apply-combine and pandas

In this tutorial, you'll learn how to use the pandas groupby operation, which draws from the well-known split-apply-combine strategy, on Netflix movie data.

Hugo Bowne-Anderson

September 26, 2017

Python

Poker Probability and Statistics with Python

Tackle probability and statistics in Python: learn more about combinations and permutations, dependent and independent events, and expected value.

Daniel Poston

September 21, 2017

Python

Graph Optimization with NetworkX in Python

This NetworkX tutorial will show you how to do graph optimization in Python by solving the Chinese Postman Problem.

Andrew Brooks

September 12, 2017

Python

Detecting Fake News with Scikit-Learn

This scikit-learn tutorial will walk you through building a fake news classifier with the help of Bayesian models.

Katharine Jarmul

August 24, 2017

Python

Python Seaborn Tutorial For Beginners: Start Visualizing Data

This Seaborn tutorial introduces you to the basics of statistical data visualization.

Moez Ali

March 15, 2023

Python

Apache Spark Tutorial: ML with PySpark

This Apache Spark tutorial introduces you to big data processing, analysis, and ML with PySpark.

Karlijn Willems

July 28, 2017

Python

Scikit-Learn Tutorial: Baseball Analytics Pt 2

A scikit-learn tutorial on using logistic regression and random forest models to predict which baseball players will be voted into the Hall of Fame.

Daniel Poston

June 20, 2017

Python

5 Tips to Write Idiomatic Pandas Code

This tutorial covers 5 ways to write "pandorable", or more idiomatic, pandas code.

DataCamp Team

May 29, 2017

Python

Asyncio: An Introduction

A short introduction to asynchronous I/O with the asyncio package.

DataCamp Team

May 8, 2017

Python

Scikit-Learn Tutorial: Baseball Analytics Pt 1

A scikit-learn tutorial on predicting MLB wins per season using a KMeans clustering model and linear regression models.

Daniel Poston

May 4, 2017