Skip to main content
Learn

Data science courses

Follow short videos led by expert instructors and then practice what you’ve learned with interactive exercises in your browser.

  • Learn at your own pace
  • Get hands-on experience
  • Complete bite-sized chapters
Screenshot of project code-along
8 results

Introduction to PySpark

Learn to implement distributed data management and machine learning in Spark using the PySpark package.

ClockOver 3 hoursTagProgrammingUserLore DirickLearncourses

Big Data Fundamentals with PySpark

Learn the fundamentals of working with big data with PySpark.

ClockOver 3 hoursTagData EngineeringUserUpendra Kumar DevisettyLearncourses

Cleaning Data with PySpark

Learn how to clean data with Apache Spark in Python.

ClockOver 3 hoursTagData PreparationUserMike MetzgerLearncourses

Machine Learning with PySpark

Learn how to make predictions from data with Apache Spark, using decision trees, logistic regression, linear regression, ensembles, and pipelines.

ClockOver 3 hoursTagMachine LearningUserAndrew CollierLearncourses

Introduction to Spark SQL in Python

Learn how to manipulate data and create machine learning feature sets in Spark using SQL in Python.

ClockOver 3 hoursTagData ManipulationUserMark PlutowskiLearncourses

Feature Engineering with PySpark

Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.

ClockOver 3 hoursTagData ManipulationUserJohn HogueLearncourses

Introduction to Spark with sparklyr in R

Learn how to run big data analysis using Spark and the sparklyr package in R, and explore Spark MLIb in just 4 hours.

ClockOver 3 hoursTagProgrammingUserRichie CottonLearncourses

Technology

Topic

FAQs