
Project

Cleaning an Orders Dataset with PySpark

Advanced
Updated 07/2024
Step into a data engineer's shoes and master data cleaning with PySpark on an e-commerce orders dataset!

Included with Premium or Teams

1 Task | 1,500 XP | 853


Project Description

Data cleaning is an essential skill for any data professional.

In this project, you will step into the role of a data engineer at an e-commerce company and use PySpark, a powerful tool for data processing, to clean an orders dataset.

This hands-on experience will sharpen your ability to format, extract and amend data for further analysis.

Project Tasks

  1. Task 1

Technologies

Python, Spark

Topics

Data Engineering, Data Preparation

Rufat Mustafaev

Data Scientist, Booking.com


What do other learners have to say?