Cleaning an Orders Dataset with PySpark

Step into a data engineer's shoes and master data cleaning with PySpark on an e-commerce orders dataset!

1 Task | 1,500 XP | 524

Project Description

Data cleaning is an essential skill for any data professional.

In this project, you will step into the role of a data engineer at an e-commerce company and use PySpark, a powerful tool for data processing, to clean an orders dataset.

This hands-on experience will sharpen your ability to format, extract and amend data for further analysis.

Project Tasks

  1. Task 1

Technologies

Python, Spark

Topics

Data Engineering, Data Preparation

Rufat Mustafaev

Data Scientist, Booking.com

What do other learners have to say?