Skip to main content
HomePython

track

Big Data with PySpark

Master how to process big data and leverage it efficiently with Apache Spark using the PySpark API.
Start track for free

Included withPremium or Teams

PythonImporting & Cleaning Data25 hours22,791

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Track Description

Big Data with PySpark

Advance your data skills by mastering Apache Spark. Using the Spark Python API, PySpark, you will leverage parallel computation with large datasets, and get ready for high-performance machine learning. From cleaning data to creating features and implementing machine learning models, you'll execute end-to-end workflows with Spark. The track ends with building a recommendation engine using the popular MovieLens dataset and the Million Songs dataset.

Prerequisites

There are no prerequisites for this track
  • Course

    1

    Introduction to PySpark

    Learn to implement distributed data management and machine learning in Spark using the PySpark package.

  • Course

    Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.

  • Course

    Learn how to make predictions from data with Apache Spark, using decision trees, logistic regression, linear regression, ensembles, and pipelines.

  • Project

    bonus

    Building a Demand Forecasting Model

    Use PySpark to build an e-commerce forecasting model!

Big Data with PySpark
6 courses
Track
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Enroll now

FAQs

Join over 15 million learners and start Big Data with PySpark today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.