Direkt zum Inhalt

Introduction to Bioconductor in R

Learn to use essential Bioconductor packages for bioinformatics using datasets from viruses, fungi, humans, and plants!

Kurs Kostenlos Starten

4 Stunden14 Videos54 Übungen14.899 LernendeLeistungsnachweis

Kostenloses Konto erstellen

Google LinkedIn Facebook

oder

Durch Klick auf die Schaltfläche akzeptierst du unsere Nutzungsbedingungen, unsere Datenschutzrichtlinie und die Speicherung deiner Daten in den USA.

Trainierst du 2 oder mehr?

Versuchen DataCamp for Business

Beliebt bei Lernenden in Tausenden Unternehmen

Kursbeschreibung

Much of the biological research, from medicine to biotech, is moving toward sequence analysis. We are now generating targeted and whole genome big data, which needs to be analyzed to answer biological questions. To help you get started, you will be introduced to The Bioconductor project. Bioconductor is and builds the infrastructure to share software tools (packages), workflows and datasets for the analysis and comprehension of genomic data. Bioconductor is a great platform accessible to you, and it is a community developed open software resource. By the end of this course, you will be able to use essential Bioconductor packages and get a grasp of its infrastructure and some built-in datasets. Using BSgenome, Biostrings, IRanges, GenomicRanges, TxDB, ShortRead and Rqc with real datasets from different species is going to be an exceptional experience!

Für Unternehmen

Trainierst du 2 oder mehr?

Verschaffen Sie Ihrem Team Zugriff auf die vollständige DataCamp-Plattform, einschließlich aller Funktionen.

In den folgenden Tracks

Genomische Daten analysieren in R

1
What is Bioconductor?
Kostenlos
In this chapter, you will get hands-on with Bioconductor. Bioconductor is the specialized repository for bioinformatics software, developed and maintained by the R community. You will learn how to install and use bioconductor packages. You'll be introduced to S4 objects and functions, because most packages within Bioconductor inherit from S4. Additionally, you will use a real genomic dataset of a fungus to explore the BSgenome package.
Kapitel Jetzt Abspielen
Introduction to the Bioconductor Project
50 xp
Bioconductor version
100 xp
BiocManager to install packages
100 xp
The role of S4 in Bioconductor
50 xp
S4 class definition
50 xp
Interaction with classes
100 xp
Introducing biology of genomic datasets
50 xp
Discovering the yeast genome
100 xp
Partitioning the yeast genome
100 xp
Available genomes
50 xp
2
Biostrings and When to Use Them?
Biostrings are memory efficient string containers. Biostring has matching algorithms, and other utilities, for fast manipulation of large biological sequences or sets of sequences. How efficient you can become by using the right containers for your sequences? You will learn about alphabets, and sequence manipulation by using the tiny genome of a virus.
Kapitel Jetzt Abspielen
Introduction to Biostrings
50 xp
Exploring the Zika virus sequence
100 xp
Biostrings containers
50 xp
Manipulating Biostrings
100 xp
Sequence handling
50 xp
From a set to a single sequence
100 xp
Subsetting a set
50 xp
Common sequence manipulation functions
100 xp
Why are we interested in patterns?
50 xp
Searching for a pattern
50 xp
Finding Palindromes
100 xp
Finding a conserved region within six frames
100 xp
Looking for a match
100 xp
3
IRanges and GenomicRanges
The IRanges and GenomicRanges packages are also containers for storing and manipulating genomic intervals and variables defined along a genome. These packages provide infrastructure and support to many other Bioconductor packages because of their enriching features. You will learn how to use these containers and their associated metadata, for manipulation of your sequences. The dataset you will be looking at is a special gene of interest in the human genome.
Kapitel Jetzt Abspielen
IRanges and Genomic Structures
50 xp
IRanges
50 xp
Constructing IRanges
100 xp
Interacting with IRanges
100 xp
Gene of interest
50 xp
From tabular data to Genomic Ranges
100 xp
GenomicRanges accessors
100 xp
ABCD1 mutation
50 xp
Human genome chromosome X
100 xp
Manipulating collections of GRanges
50 xp
A sequence window
50 xp
Is it there?
50 xp
More about ABCD1
100 xp
How many transcripts?
100 xp
From GRangesList object into a GRanges object
100 xp
4
Introducing ShortRead
ShortRead is the package for input, manipulation and assessment of fasta and fastq files. You can subset, trim and filter the sequences of interest, and even do a report of quality. An extra bonus towards the last exercises will give you the tools for parallel quality assessment, wink, wink Rqc. Exciting enough, for this you will use plant genome sequences!
Kapitel Jetzt Abspielen
Sequence files
50 xp
Why fastq?
50 xp
Reading in files
50 xp
Exploring a fastq file
100 xp
Extract a sample from a fastq file
100 xp
Sequence quality
50 xp
Exploring sequence quality
100 xp
Base quality plot
50 xp
Try your own nucleotide frequency plot
100 xp
Match and filter
50 xp
Filtering reads on the go!
100 xp
Removing duplicates
50 xp
More filtering!
100 xp
Multiple assessment
50 xp
Plotting cycle average quality
100 xp
Introduction to Bioconductor
50 xp

Für Unternehmen

Trainierst du 2 oder mehr?

Verschaffen Sie Ihrem Team Zugriff auf die vollständige DataCamp-Plattform, einschließlich aller Funktionen.

In den folgenden Tracks

Genomische Daten analysieren in R

Datensätze

Zika Genomic DNA dataset A. Thaliana Short Reads with Quality dataset Human Gene & Transcript ID dataset Yeast Genome dataset

Mitwirkende

David Campos

Shon Inouye

Richie Cotton

Voraussetzungen

Introduction to R Introduction to the Tidyverse

Curriculum Manager, DataCamp

Data Scientist and Bioinformatician

Was sagen andere Lernende?

Melden Sie sich an 15 Millionen Lernende und starten Sie Introduction to Bioconductor in R Heute!

Kostenloses Konto erstellen

Google LinkedIn Facebook

oder

Durch Klick auf die Schaltfläche akzeptierst du unsere Nutzungsbedingungen, unsere Datenschutzrichtlinie und die Speicherung deiner Daten in den USA.