Accéder au contenu principal
AccueilSpark

projet

Cleaning an Orders Dataset with PySpark

Avancé
Updated 07/2024
Step into a data engineer's shoes and master data cleaning with PySpark on an e-commerce orders dataset!
Démarrer Le Projet Gratuitement

Inclus avecPremium or Teams

1 Task1,500 XP853

Créez votre compte gratuit

GoogleLinkedInFacebook

ou

En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.
Group

Formation de 2 personnes ou plus ?

Essayer DataCamp for Business

Project Description

Data cleaning is an essential skill for any data professional.

In this project, you will step into a role of a data engineer at an e-commerce company and use PySpark, a powerful tool for data processing, to clean an orders dataset.

This hands-on experience will sharpen your ability to format, extract and amend data for further analysis.

Project Tasks

  1. 1
    Task 1

Technologies

Python Spark

Topics

Data EngineeringData Preparation

Conditions préalables

Cleaning Data with PySpark
Rufat Mustafaev HeadshotRufat Mustafaev

Data Scientist, Booking.com

Voir Plus

What do other learners have to say?