Statistical Thinking in Python (Part 1)

Build the foundation you need to think statistically and to speak the language of your data.

3 hours, 18 videos, 61 exercises, 180,678 learners, Statement of Accomplishment

Course Description

After all of the hard work of acquiring data and getting it into a form you can work with, you ultimately want to draw clear, succinct conclusions from it. This crucial last step of a data analysis pipeline hinges on the principles of statistical inference. In this course, you will start building the foundation you need to think statistically, speak the language of your data, and understand what your data is telling you. The foundations of statistical thinking took decades to build, but can be grasped much faster today with the help of computers. With the power of Python-based tools, you will rapidly get up to speed and begin thinking statistically by the end of this course.

1
Graphical Exploratory Data Analysis
Free
Before diving into sophisticated statistical inference techniques, you should first explore your data by plotting them and computing simple summary statistics. This process, called exploratory data analysis, is a crucial first step in statistical analysis of data.
Introduction to Exploratory Data Analysis
50 xp
What is the goal of statistical inference?
50 xp
Advantages of graphical EDA
50 xp
Plotting a histogram
50 xp
Plotting a histogram of iris data
100 xp
Axis labels!
100 xp
Adjusting the number of bins in a histogram
100 xp
Plot all of your data: Bee swarm plots
50 xp
Bee swarm plot
100 xp
Interpreting a bee swarm plot
50 xp
Plot all of your data: ECDFs
50 xp
Computing the ECDF
100 xp
Plotting the ECDF
100 xp
Comparison of ECDFs
100 xp
Onward toward the whole story!
50 xp
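The ECDF lessons listed above can be illustrated with a minimal sketch. It assumes NumPy is available; the function name `ecdf` and the synthetic sample are illustrative choices, not material taken from the course exercises.

```python
import numpy as np


def ecdf(data):
    """Return the x and y values of the empirical CDF of 1-D data."""
    x = np.sort(np.asarray(data))
    # The i-th smallest point has a fraction i/n of the data at or below it.
    y = np.arange(1, len(x) + 1) / len(x)
    return x, y


# Synthetic measurements (assumed for illustration, not a course dataset).
rng = np.random.default_rng(42)
sample = rng.normal(loc=5.0, scale=1.0, size=200)

x, y = ecdf(sample)
```

Plotting `x` against `y` as unconnected points (for example with matplotlib's `plt.plot(x, y, marker=".", linestyle="none")`) shows every data point without the binning choices a histogram requires.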
2
Quantitative Exploratory Data Analysis
In this chapter, you will compute useful summary statistics, which serve to concisely describe salient features of a dataset with a few numbers.
Introduction to summary statistics: The sample mean and median
50 xp
Means and medians
50 xp
Computing means
100 xp
Percentiles, outliers, and box plots
50 xp
Computing percentiles
100 xp
Comparing percentiles to ECDF
100 xp
Box-and-whisker plot
100 xp
Variance and standard deviation
50 xp
Computing the variance
100 xp
The standard deviation and the variance
100 xp
Covariance and the Pearson correlation coefficient
50 xp
Scatter plots
100 xp
Variance and covariance by looking
50 xp
Computing the covariance
100 xp
Computing the Pearson correlation coefficient
100 xp
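As a rough illustration of the summary statistics named in this chapter, the sketch below computes each one with NumPy. The paired data are synthetic and the variable names are assumptions made for the example.

```python
import numpy as np

# Synthetic paired measurements (assumed for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(10.0, 2.0, size=500)
y = 0.5 * x + rng.normal(0.0, 1.0, size=500)

mean_x = np.mean(x)
median_x = np.median(x)

# 25th, 50th, and 75th percentiles; the 50th is the median.
p25, p50, p75 = np.percentile(x, [25, 50, 75])

# np.var and np.std default to ddof=0 (divide by n).
var_x = np.var(x)
std_x = np.std(x)

# np.cov defaults to ddof=1 (divide by n - 1); entry [0, 1] is cov(x, y).
cov_xy = np.cov(x, y)[0, 1]

# Pearson correlation: covariance scaled by both standard deviations.
pearson_r = np.corrcoef(x, y)[0, 1]
```

Note the differing defaults: `np.var` and `np.std` divide by n unless `ddof=1` is passed, while `np.cov` divides by n - 1. The Pearson coefficient is unaffected as long as the same convention is used throughout.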
3
Thinking Probabilistically-- Discrete Variables
Statistical inference rests upon probability. Because we can very rarely say anything meaningful with absolute certainty from data, we use probabilistic language to make quantitative statements about data. In this chapter, you will learn how to think probabilistically about discrete quantities: those that can only take certain values, like integers.
Probabilistic logic and statistical inference
50 xp
What is the goal of statistical inference?
50 xp
Why do we use the language of probability?
50 xp
Random number generators and hacker statistics
50 xp
Generating random numbers using the np.random module
100 xp
The np.random module and Bernoulli trials
100 xp
How many defaults might we expect?
100 xp
Will the bank fail?
100 xp
Probability distributions and stories: The Binomial distribution
50 xp
Sampling out of the Binomial distribution
100 xp
Plotting the Binomial PMF
100 xp
Poisson processes and the Poisson distribution
50 xp
Relationship between Binomial and Poisson distributions
100 xp
How many no-hitters in a season?
50 xp
Was 2015 anomalous?
100 xp
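The simulation approach this chapter calls hacker statistics can be sketched as follows. The helper `bernoulli_successes` and the parameter values (100 trials, success probability 0.05) are hypothetical, chosen only to show the idea with NumPy's random generator.

```python
import numpy as np

rng = np.random.default_rng(7)


def bernoulli_successes(n, p, rng):
    """Count successes in n independent trials, each succeeding with probability p."""
    return int(np.sum(rng.random(n) < p))


# Hypothetical parameters, chosen for illustration.
n_trials, p_success, n_repeats = 100, 0.05, 10_000

# Repeat the n-trial experiment many times to build up a distribution.
simulated = np.array(
    [bernoulli_successes(n_trials, p_success, rng) for _ in range(n_repeats)]
)

# The same story sampled directly from the Binomial distribution.
binomial = rng.binomial(n_trials, p_success, size=n_repeats)

# A Poisson distribution with mean n*p approximates the Binomial
# when n is large and p is small.
poisson = rng.poisson(n_trials * p_success, size=n_repeats)
```

All three arrays should have a mean close to `n_trials * p_success`, and comparing their ECDFs is one way to see how closely the Poisson approximation tracks the Binomial.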
4
Thinking Probabilistically-- Continuous Variables
It’s time to move on to continuous variables, such as those that can take on any fractional value. Many of the principles are the same, but there are some subtleties. At the end of this final chapter, you will be speaking the probabilistic language you need to launch into the inference techniques covered in the sequel to this course.
Probability density functions
50 xp
Interpreting PDFs
50 xp
Interpreting CDFs
50 xp
Introduction to the Normal distribution
50 xp
The Normal PDF
100 xp
The Normal CDF
100 xp
The Normal distribution: Properties and warnings
50 xp
Gauss and the 10 Deutschmark banknote
50 xp
Are the Belmont Stakes results Normally distributed?
100 xp
What are the chances of a horse matching or beating Secretariat's record?
100 xp
The Exponential distribution
50 xp
Matching a story and a distribution
50 xp
Waiting for the next Secretariat
50 xp
If you have a story, you can simulate it!
100 xp
Distribution of no-hitters and cycles
100 xp
Final thoughts
50 xp
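For the continuous distributions covered in this chapter, a minimal sampling sketch might look like the following. The parameter values and variable names are assumptions for illustration; nothing here uses the course datasets.

```python
import numpy as np

rng = np.random.default_rng(1)
n_samples = 100_000

# Normal distribution with assumed mean 0 and standard deviation 1.
mu, sigma = 0.0, 1.0
normal_samples = rng.normal(mu, sigma, size=n_samples)

# Fraction of samples within one standard deviation of the mean.
frac_within_1sd = np.mean(np.abs(normal_samples - mu) <= sigma)

# Exponential distribution: waiting time between events of a Poisson
# process. The mean waiting time is a hypothetical value.
mean_wait = 3.0
waits = rng.exponential(mean_wait, size=n_samples)

# Simulating a story: total wait for two successive, independent events.
two_step_wait = rng.exponential(mean_wait, size=n_samples) + rng.exponential(
    mean_wait, size=n_samples
)
```

The last array has no single named distribution in this outline, yet it can still be sampled by simulating the story directly, which is the point made by the lesson "If you have a story, you can simulate it!".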

Datasets

2008 election results (all states)
2008 election results (swing states)
Belmont Stakes
Speed of light

Collaborators

Yashas Roy

Hugo Bowne-Anderson

Prerequisites

Python Toolbox

Justin Bois

Lecturer at the California Institute of Technology
