This course introduces learners to version control using Git. You will discover the importance of version control when working on data science projects and explore how you can use Git to track files, compare differences, modify and save files, undo changes, and enable collaborative development through the use of branches. You will be introduced to the structure of a repository, how to create new repositories and clone existing ones, and how Git stores data. By working through typical data science tasks, you will gain the skills to handle conflicting files.
Introduction to Git (Free)
In the first chapter, you’ll learn what version control is and why it is essential for data projects. Then, you’ll discover what Git is and how to use it for a version control workflow.
Introduction to version control with Git (50 xp)
Using the shell (50 xp)
Checking the version of Git (100 xp)
Saving files (50 xp)
Where does Git store information? (50 xp)
The Git workflow (100 xp)
Adding a file (100 xp)
Adding multiple files (100 xp)
Comparing files (50 xp)
What has changed? (50 xp)
What's going to be committed? (100 xp)
What's in the staging area? (50 xp)
Next, you’ll examine how Git stores data, learn essential commands to compare files and repositories at different times, and understand the process for restoring earlier versions of files in your data projects.
Storing data with Git (50 xp)
Interpreting the commit structure (50 xp)
Viewing a repository's history (50 xp)
Viewing a specific commit (100 xp)
Viewing changes (50 xp)
Comparing to the second most recent commit (50 xp)
Comparing commits (50 xp)
Who changed what? (100 xp)
Undoing changes before committing (50 xp)
How to unstage a file (100 xp)
Undoing changes to unstaged files (100 xp)
Undoing all changes (100 xp)
Restoring and reverting (50 xp)
Restoring an old version of a repo (100 xp)
Deleting untracked files (50 xp)
Restoring an old version of a file (100 xp)
In this chapter, you'll learn tips and tricks for configuring Git to make you more efficient! You'll also discover branches, identify how to create and switch to different branches, compare versions of files between branches, merge branches, and deal with conflicting files across branches.
Configuring Git (50 xp)
Modifying your email address in Git (100 xp)
Creating an alias (100 xp)
Ignoring files (50 xp)
Branches (50 xp)
Branching and merging (50 xp)
Creating new branches (100 xp)
Checking the number of branches (50 xp)
Comparing branches (100 xp)
Working with branches (50 xp)
Switching branches (100 xp)
Merging two branches (100 xp)
Handling conflict (50 xp)
Recognizing conflict syntax (50 xp)
Resolving a conflict (100 xp)
Collaborating with Git
This final chapter is all about collaboration! You'll gain an introduction to remote repositories and learn how to work with them to synchronize content between the cloud and your local computer. You'll also see how to create new repositories and clone existing ones, along with discovering a workflow to minimize the risk of conflicts between local and remote repositories.
Creating repos (50 xp)
Setting up a new repo (100 xp)
Converting an existing project (100 xp)
Working with remotes (50 xp)
Cloning a repo (100 xp)
Defining and identifying remotes (100 xp)
Gathering from a remote (50 xp)
Fetching from a remote (100 xp)
Pulling from a remote (100 xp)
Pushing to a remote (50 xp)
Pushing to a remote repo (100 xp)
Handling push conflicts (100 xp)
Congratulations! (50 xp)
In the following tracks: Data Engineer, Data Scientist Professional with Python, Data Scientist Professional with R, Python Programmer, R Programmer
George Boorman
Curriculum Manager, DataCamp
George is a Curriculum Manager at DataCamp. He holds a PGDip in Exercise for Health and BSc (Hons) in Sports Science and has experience in project management across public health, applied research, and not-for-profit sectors. George is passionate about sports, tech for good, and all things data science.