Skip to main content
HomePython

Pandas Joins for Spreadsheet Users

Learn how to effectively and efficiently join datasets in tabular format using the Python Pandas library.

Start Course for Free
4 hours12 videos44 exercises3,832 learnersTrophyStatement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies


Course Description

Joining two or more datasets is necessary for almost any real-world analysis. You’ve done it before with spreadsheets using VLOOKUP and related functions. Can you build on this experience as you transition to the world of Python? Yes! In this course you will learn the ins and outs of bringing datasets together with pandas, Python’s gold standard for manipulating tabular data. You’ll apply pandas functions to combine data from the National Football League (NFL) framed in a familiar spreadsheet environment. Armed with these skills you will be able to harness the power of pandas and integrate larger, more complex datasets into any analysis.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Introduction to joining data

    Free

    In this chapter, we'll build a foundation for using pandas to join data. You'll learn about the types of joins and how pandas can improve your effectiveness and productivity.

    Play Chapter Now
    Joining data: a real-world necessity
    50 xp
    The need for joining data
    50 xp
    Working with split data
    100 xp
    Working with complementary data
    100 xp
    Concatenation
    50 xp
    Concatenating rows
    100 xp
    Concatenating rows with duplicated indexes
    100 xp
    Concatenating columns
    100 xp
    Power and flexibility
    50 xp
    Advantages of pandas joins
    100 xp
    Simple coding for complex merges
    100 xp
  2. 3

    One-to-many joins

    In this chapter, we'll focus on one-to-many relationships. You'll practice identifying the relationship of key columns and joining data frames by column. You'll also learn how to join two or more data frames based on their indices.

    Play Chapter Now
  3. 4

    Advanced joins

    In the final chapter, you'll learn advanced joining techniques to use when faced with challenging data. You'll be presented with a challenge of your own in the form of a case study that tests your skills.

    Play Chapter Now
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

datasets

Games PlaysPlayers draftCurrent playersNGSGamesNew weather data

collaborators

Collaborator's avatar
Anneleen Beckers
Collaborator's avatar
Ruanne Van Der Walt
John Miller HeadshotJohn Miller

Data Scientist

John Miller is a senior data scientist who helps companies use machine learning to improve operations. His favorite work involves building predictive models that provide insights into solving difficult problems. John also works as an expert witness and actively participates in the global AI community as a speaker and writer. He holds Master's degrees in business and engineering from MIT and a Bachelor's in engineering from the US Military Academy.
See More

What do other learners have to say?

Join over 15 million learners and start Pandas Joins for Spreadsheet Users today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.