Skip to main content

course

Introduction to Bioconductor in R

Intermediate

Updated 12/2024

Learn to use essential Bioconductor packages for bioinformatics using datasets from viruses, fungi, humans, and plants!

Start course for free

Included for FreePremium or Teams

RProbability & Statistics4 hours14 videos54 exercises4,050 XP15,047Statement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

Much of the biological research, from medicine to biotech, is moving toward sequence analysis. We are now generating targeted and whole genome big data, which needs to be analyzed to answer biological questions. To help you get started, you will be introduced to The Bioconductor project. Bioconductor is and builds the infrastructure to share software tools (packages), workflows and datasets for the analysis and comprehension of genomic data. Bioconductor is a great platform accessible to you, and it is a community developed open software resource. By the end of this course, you will be able to use essential Bioconductor packages and get a grasp of its infrastructure and some built-in datasets. Using BSgenome, Biostrings, IRanges, GenomicRanges, TxDB, ShortRead and Rqc with real datasets from different species is going to be an exceptional experience!

Prerequisites

Introduction to R Introduction to the Tidyverse

1

What is Bioconductor?

Introduction to the Bioconductor Project

Bioconductor version

BiocManager to install packages

The role of S4 in Bioconductor

S4 class definition

Interaction with classes

Introducing biology of genomic datasets

Discovering the yeast genome

Partitioning the yeast genome

Available genomes

2

Biostrings and When to Use Them?

Introduction to Biostrings

Exploring the Zika virus sequence

Biostrings containers

Manipulating Biostrings

Sequence handling

From a set to a single sequence

Subsetting a set

Common sequence manipulation functions

Why are we interested in patterns?

Searching for a pattern

Finding Palindromes

Finding a conserved region within six frames

Looking for a match

3

IRanges and GenomicRanges

IRanges and Genomic Structures

Constructing IRanges

Interacting with IRanges

Gene of interest

From tabular data to Genomic Ranges

GenomicRanges accessors

ABCD1 mutation

Human genome chromosome X

Manipulating collections of GRanges

A sequence window

Is it there?

More about ABCD1

How many transcripts?

From GRangesList object into a GRanges object

4

Introducing ShortRead

Sequence files

Reading in files

Exploring a fastq file

Extract a sample from a fastq file

Sequence quality

Exploring sequence quality

Base quality plot

Try your own nucleotide frequency plot

Match and filter

Filtering reads on the go!

Removing duplicates

More filtering!

Multiple assessment

Plotting cycle average quality

Introduction to Bioconductor

Introduction to Bioconductor in R

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Included withPremium or Teams

Join over 15 million learners and start Introduction to Bioconductor in R today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.