Data Manipulation with pandas
Learn how to import and clean data, calculate statistics, and create visualizations with pandas.
Start Course for Free4 hours15 videos56 exercises415,336 learnersStatement of Accomplishment
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Training 2 or more people?
Try DataCamp for BusinessLoved by learners at thousands of companies
Course Description
Discover Data Manipulation with pandas
With this course, you’ll learn why pandas is the world's most popular Python library, used for everything from data manipulation to data analysis. You’ll explore how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis.With pandas, you’ll explore all the core data science concepts. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas to add to the power of Python.
Work with pandas Data to Explore Core Data Science Concepts
You’ll start by mastering the pandas basics, including how to inspect DataFrames and perform some fundamental manipulations. You’ll also learn about aggregating DataFrames, before moving on to slicing and indexing.You’ll wrap up the course by learning how to visualize the contents of your DataFrames, working with a dataset that contains weekly US avocado sales.
Learn to Manipulate DataFrames
By completing this pandas course, you’ll understand how to use this Python library for data manipulation. You’ll have an understanding of DataFrames and how to use them, as well as be able to visualize your data in Python.Training 2 or more people?
Get your team access to the full DataCamp platform, including all the features.In the following Tracks
Data Manipulation in Python
Go To Track- 1
Transforming DataFrames
FreeLet’s master the pandas basics. Learn how to inspect DataFrames and perform fundamental manipulations, including sorting rows, subsetting, and adding new columns.
- 2
Aggregating DataFrames
In this chapter, you’ll calculate summary statistics on DataFrame columns, and master grouped summary statistics and pivot tables.
Summary statistics50 xpMean and median100 xpSummarizing dates100 xpEfficient summaries100 xpCumulative statistics100 xpCounting50 xpDropping duplicates100 xpCounting categorical variables100 xpGrouped summary statistics50 xpWhat percent of sales occurred at each store type?100 xpCalculations with .groupby()100 xpMultiple grouped summaries100 xpPivot tables50 xpPivoting on one variable100 xpFill in missing values and sum values with pivot tables100 xp - 3
Slicing and Indexing DataFrames
Indexes are supercharged row and column names. Learn how they can be combined with slicing for powerful DataFrame subsetting.
Explicit indexes50 xpSetting and removing indexes100 xpSubsetting with .loc[]100 xpSetting multi-level indexes100 xpSorting by index values100 xpSlicing and subsetting with .loc and .iloc50 xpSlicing index values100 xpSlicing in both directions100 xpSlicing time series100 xpSubsetting by row/column number100 xpWorking with pivot tables50 xpPivot temperature by city and year100 xpSubsetting pivot tables100 xpCalculating on a pivot table100 xp - 4
Creating and Visualizing DataFrames
Learn to visualize the contents of your DataFrames, handle missing data values, and import data from and export data to CSV files.
Visualizing your data50 xpWhich avocado size is most popular?100 xpChanges in sales over time100 xpAvocado supply and demand100 xpPrice of conventional vs. organic avocados100 xpMissing values50 xpFinding missing values100 xpRemoving missing values100 xpReplacing missing values100 xpCreating DataFrames50 xpList of dictionaries100 xpDictionary of lists100 xpReading and writing CSVs50 xpCSV to DataFrame100 xpDataFrame to CSV100 xpWrap-up50 xp
Training 2 or more people?
Get your team access to the full DataCamp platform, including all the features.In the following Tracks
Data Manipulation in Python
Go To TrackMaggie Matsui
See MoreCurriculum Manager at DataCamp
Maggie is a Data Scientist at Etsy. She holds a degree in Statistics and Computer Science from Brown University, where she spent lots of time teaching math, programming, and statistics as a tutor and teaching assistant. She's passionate about teaching all things data-related and making programming accessible to everyone.
Richie Cotton
See MoreData Evangelist at DataCamp
Richie is a Data Evangelist at DataCamp. He has been using R since 2004, in the fields of proteomics, debt collection, and chemical health and safety. He has released almost 30 R packages on CRAN and Bioconductor – most famously the assertive suite of packages – as well as creating and contributing to many others. He also has written two books on R programming, Learning R and Testing R Code.
FAQs
Join over 15 million learners and start Data Manipulation with pandas today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.