Skip to main content
HomePython

Statistical Thinking in Python (Part 2)

4.6+
16 reviews
Intermediate

Learn to perform the two key tasks in statistical inference: parameter estimation and hypothesis testing.

Start Course for Free
4 hours15 videos66 exercises90,960 learnersTrophyStatement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies


Course Description

After completing Statistical Thinking in Python (Part 1), you have the probabilistic mindset and foundational hacker stats skills to dive into data sets and extract useful information from them. In this course, you will do just that, expanding and honing your hacker stats toolbox to perform the two key tasks in statistical inference, parameter estimation and hypothesis testing. You will work with real data sets as you learn, culminating with analysis of measurements of the beaks of the Darwin's famous finches. You will emerge from this course with new knowledge and lots of practice under your belt, ready to attack your own inference problems out in the world.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Parameter estimation by optimization

    Free

    When doing statistical inference, we speak the language of probability. A probability distribution that describes your data has parameters. So, a major goal of statistical inference is to estimate the values of these parameters, which allows us to concisely and unambiguously describe our data and draw conclusions from it. In this chapter, you will learn how to find the optimal parameters, those that best describe your data.

    Play Chapter Now
    Optimal parameters
    50 xp
    How often do we get no-hitters?
    100 xp
    Do the data follow our story?
    100 xp
    How is this parameter optimal?
    100 xp
    Linear regression by least squares
    50 xp
    EDA of literacy/fertility data
    100 xp
    Linear regression
    100 xp
    How is it optimal?
    100 xp
    The importance of EDA: Anscombe's quartet
    50 xp
    The importance of EDA
    50 xp
    Linear regression on appropriate Anscombe data
    100 xp
    Linear regression on all Anscombe data
    100 xp
  2. 2

    Bootstrap confidence intervals

    To "pull yourself up by your bootstraps" is a classic idiom meaning that you achieve a difficult task by yourself with no help at all. In statistical inference, you want to know what would happen if you could repeat your data acquisition an infinite number of times. This task is impossible, but can we use only the data we actually have to get close to the same result as an infinitude of experiments? The answer is yes! The technique to do it is aptly called bootstrapping. This chapter will introduce you to this extraordinarily powerful tool.

    Play Chapter Now
  3. 3

    Introduction to hypothesis testing

    You now know how to define and estimate parameters given a model. But the question remains: how reasonable is it to observe your data if a model is true? This question is addressed by hypothesis tests. They are the icing on the inference cake. After completing this chapter, you will be able to carefully construct and test hypotheses using hacker statistics.

    Play Chapter Now
  4. 4

    Hypothesis test examples

    As you saw from the last chapter, hypothesis testing can be a bit tricky. You need to define the null hypothesis, figure out how to simulate it, and define clearly what it means to be "more extreme" in order to compute the p-value. Like any skill, practice makes perfect, and this chapter gives you some good practice with hypothesis tests.

    Play Chapter Now
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

datasets

Anscombe dataBee sperm countsFemale literacy and fertilityFinch beaks (1975)Finch beaks (2012)Fortis beak depth heredityFrog tongue dataMajor League Baseball no-hittersScandens beak depth hereditySheffield Weather Station

collaborators

Collaborator's avatar
Yashas Roy
Collaborator's avatar
Hugo Bowne-Anderson
Justin Bois HeadshotJustin Bois

Lecturer at the California Institute of Technology

Justin Bois is a Teaching Professor in the Division of Biology and Biological Engineering at the California Institute of Technology. He teaches nine different classes there, nearly all of which heavily feature Python. He is dedicated to empowering students in the biological sciences with quantitative tools, particularly data analysis skills. Beyond biologists, he is thrilled to develop courses for DataCamp, whose students are an excited bunch of burgeoning data scientists!
See More

Don’t just take our word for it

*4.6
from 16 reviews
75%
19%
0%
6%
0%
Sort by
  • Kleopatra R.
    about 2 months

    I think that the Datacamp site can benefit from more courses on statistical analysis. This is the connection between python programming and our data analysis. I can always google my way through scripts, but understanding what it is that I am looking for is necessary.

  • Thomas M.
    10 months

    The two Statistical Thinking courses provide an excellent introduction to statistical analysis with Python. The theoretical background is well explained and the hands-on excercises are designed to learn step-by-step how to apply the statistical tools in Python. In my opinion, these 2 courses are among the very best DataCamp has to offer. The lecturer succeeds in translating his fascination with his professional approach to statistics

  • Tim K.
    about 1 year

    Good course that teaches statistics and methods

  • Laura L.
    about 1 year

    Great

  • I T.
    over 1 year

    The instructor is excellent, and the course is interesting. The practice could have been more challenging so the learner could learn more about how to code.

"I think that the Datacamp site can benefit from more courses on statistical analysis. This is the connection between python programming and our data analysis. I can always google my way through scripts, but understanding what it is that I am looking for is necessary."

Kleopatra R.

"Good course that teaches statistics and methods"

Tim K.

"Great"

Laura L.

Join over 15 million learners and start Statistical Thinking in Python (Part 2) today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.