Feature Engineering with PySpark

Fortgeschritten

Updated 12.2024

Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.

Kostenloses Konto erstellen

oder

Durch Klick auf die Schaltfläche akzeptierst du unsere Nutzungsbedingungen, unsere Datenschutzrichtlinie und die Speicherung deiner Daten in den USA.

Kursbeschreibung

The real world is messy and your job is to make sense of it. Toy datasets like MTCars and Iris are the result of careful curation and cleaning, even so the data needs to be transformed for it to be useful for powerful machine learning algorithms to extract meaning, forecast, classify or cluster. This course will cover the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering. With size of datasets now becoming ever larger, let's use PySpark to cut this Big Data problem down to size!

Voraussetzungen

Introduction to PySpark Supervised Learning with scikit-learn

Exploratory Data Analysis

Kapitel starten

Where to Begin

Kursbeschreibung

Leistungsnachweis verdienen

Machen Sie mit .css-nklxlk{color:var(--wf-brand--main, #03EF62);}15 Millionen Lernende und starten Sie Feature Engineering with PySpark Heute!

Kostenloses Konto erstellen

Machen Sie mit 15 Millionen Lernende und starten Sie Feature Engineering with PySpark Heute!