Accéder au contenu principal
AccueilPython

projet

Give Life: Predict Blood Donations

Débutant
Updated 06/2024
Build a binary classifier to predict if a blood donor is likely to donate again.
Démarrer Le Projet Gratuitement

Inclus avecPremium or Teams

11 Tasks1,500 XP6,761

Créez votre compte gratuit

GoogleLinkedInFacebook

ou

En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.
Group

Formation de 2 personnes ou plus ?

Essayer DataCamp for Business

Project Description

"Blood is the most precious gift that anyone can give to another person — the gift of life." ~ World Health Organization

Forecasting blood supply is a serious and recurrent problem for blood collection managers: in January 2019, "Nationwide, the Red Cross saw 27,000 fewer blood donations over the holidays than they see at other times of the year." Machine learning can be used to learn the patterns in the data to help to predict future blood donations and therefore save more lives.

In this Project, you will work with data collected from the donor database of Blood Transfusion Service Center in Hsin-Chu City in Taiwan. The center passes its blood transfusion service bus to one university in Hsin-Chu City to gather blood donated about every three months. The dataset, obtained from the UCI Machine Learning Repository, consists of a random sample of 748 donors. Your task will be to predict if a blood donor will donate within a given time window. You will look at the full model-building process: from inspecting the dataset to using the tpot library to automate your Machine Learning pipeline.

Project Tasks

  1. 1
    Inspecting transfusion.data file
  2. 2
    Loading the blood donations data
  3. 3
    Inspecting transfusion DataFrame
  4. 4
    Creating target column
  5. 5
    Checking target incidence
  6. 6
    Splitting transfusion into train and test datasets
  7. 7
    Selecting model using TPOT
  8. 8
    Checking the variance
  9. 9
    Log normalization
  10. 10
    Training the logistic regression model
  11. 11
    Conclusion

Technologies

Python Python

Topics

Data ManipulationMachine Learning
Dimitri Denisjonok HeadshotDimitri Denisjonok

Python Backend Developer at Futrli

Voir Plus

What do other learners have to say?