Chief Data Scientist at Omni Analytics Group
project
Clustering Bustabit Gambling Behavior
Included withPremium or Teams
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Training 2 or more people?
Try DataCamp for BusinessProject Description
Have you ever wondered if you could quantify the behavior of gamblers at the casino? Some seem to win the most, some can be reckless and risky with their bets, and others are casual about the whole experience. While collecting this data from the casino might be a challenge, there is an online platform called Bustabit in which gamblers can bet Bitcoin. We've collected data on thousands of Bustabit gambling sessions and tracked the user, the amount bet, the amount won, and various properties of the particular game itself. Using this data, you will perform a cluster analysis from start to finish in an attempt to group gamblers based on their gambling behavior.
To complete this project, students should be comfortable with R programming,
the tidyverse
package in particular, as the data manipulation and summarization
routines will use this.
The dataset used includes 10,000 games of Bustabit. Each game tracks the particular gambler, the BustedAt value of the game, and the multiplier at which the gambler cashed out.
Project Tasks
- 1A preliminary look at the Bustabit data
- 2Deriving relevant features for clustering
- 3Creating per-player statistics
- 4Scaling and normalization of the derived features
- 5Cluster the player data using K means
- 6Compute averages for each cluster
- 7Visualize the clusters with a Parallel Coordinate Plot
- 8Visualize the clusters with Principal Components
- 9Analyzing the groups of gamblers our solution uncovered
Technologies
R
Founder of Omni Analytics Group