Project

Cleaning an Orders Dataset with PySpark

Advanced
Updated 07/2024
Step into a data engineer's shoes and master data cleaning with PySpark on an e-commerce orders dataset!
Included with Premium or Teams

1 Task | 1,500 XP | 860

Project Description

Data cleaning is an essential skill for any data professional.

In this project, you will step into the role of a data engineer at an e-commerce company and use PySpark, a powerful tool for data processing, to clean an orders dataset.

This hands-on experience will sharpen your ability to format, extract and amend data for further analysis.

Project Tasks

  1. Task 1

Technologies

Python, Spark

Topics

Data Engineering, Data Preparation
Rufat Mustafaev

Data Scientist, Booking.com

What do other learners have to say?