Spark courses
Spark reads data into memory, performs operations on it there, and writes the results back, which avoids repeated disk I/O and speeds up execution. Learn the core principles and common packages on DataCamp.
Recommended for Spark beginners
Build your Spark skills with interactive courses curated by real-world experts
Skill Level: Intermediate
4 hours
1.3K
track
Big Data with PySpark
25 hours
114
Browse Spark courses and tracks
9 results
course
Introduction to PySpark
Skill Level: Intermediate
4 hours
1.3K
course
Big Data Fundamentals with PySpark
Skill Level: Advanced
4 hours
912
course
Cleaning Data with PySpark
Skill Level: Advanced
4 hours
471
course
Machine Learning with PySpark
Skill Level: Advanced
4 hours
416
course
Introduction to Spark SQL in Python
Skill Level: Advanced
4 hours
105
course
Feature Engineering with PySpark
Skill Level: Advanced
4 hours
292
course
Building Recommendation Engines with PySpark
Skill Level: Advanced
4 hours
147
course
Foundations of PySpark
Skill Level: Intermediate
4 hours
48
course
Introduction to Spark with sparklyr in R
Skill Level: Intermediate
4 hours
32
Related resources on Spark
blog
The Top 20 Spark Interview Questions
Essential Spark interview questions with example answers for job-seekers, data professionals, and hiring managers.
Tim Lu
blog
Flink vs. Spark: A Comprehensive Comparison
Comparing Flink vs. Spark, two open-source frameworks at the forefront of batch and stream processing.
Maria Eugenia Inzaugarat
8 min
tutorial
Pyspark Tutorial: Getting Started with Pyspark
Discover what PySpark is and how it can be used, with examples.
Natassha Selvaraj
10 min
Ready to apply your skills?
project
Cleaning an Orders Dataset with PySpark
1 hour
1K
project
Building a Demand Forecasting Model
1 hour
1.8K