Feature Engineering with PySpark

Advanced

Updated 12/2024

Learn the gritty details that data scientists spend 70-80% of their time on: data wrangling and feature engineering.

Course Description

The real world is messy, and your job is to make sense of it. Even toy datasets like MTCars and Iris, which are the result of careful curation and cleaning, need to be transformed before machine learning algorithms can use them to extract meaning, forecast, classify, or cluster. This course covers the gritty details that data scientists spend 70-80% of their time on: data wrangling and feature engineering. With datasets growing ever larger, let's use PySpark to cut this Big Data problem down to size!

Prerequisites

Introduction to PySpark

Supervised Learning with scikit-learn

Exploratory Data Analysis

Where to Begin
