Cleaning an Orders Dataset with PySpark

Step into a data engineer's shoes and master data cleaning with PySpark on an e-commerce orders dataset!

1 Task | 1,500 XP | 514


Project Description

Data cleaning is an essential skill for any data professional.

In this project, you will step into the role of a data engineer at an e-commerce company and use PySpark, a powerful tool for data processing, to clean an orders dataset.

This hands-on experience will sharpen your ability to format, extract and amend data for further analysis.

Project Tasks

  1. Task 1

Technologies

Python, Spark

Topics

Data Engineering, Data Preparation

Rufat Mustafaev

Data Scientist, Booking.com


What do other learners have to say?