Introduction to Bioconductor in R
Learn to use essential Bioconductor packages for bioinformatics using datasets from viruses, fungi, humans, and plants!
Commencer Le Cours Gratuitement4 heures14 vidéos54 exercices14 899 apprenantsDéclaration de réalisation
Créez votre compte gratuit
ou
En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.Formation de 2 personnes ou plus ?
Essayer DataCamp for BusinessApprécié par les apprenants de milliers d'entreprises
Description du cours
Much of the biological research, from medicine to biotech, is moving toward sequence analysis. We are now generating targeted and whole genome big data, which needs to be analyzed to answer biological questions. To help you get started, you will be introduced to The Bioconductor project. Bioconductor is and builds the infrastructure to share software tools (packages), workflows and datasets for the analysis and comprehension of genomic data. Bioconductor is a great platform accessible to you, and it is a community developed open software resource. By the end of this course, you will be able to use essential Bioconductor packages and get a grasp of its infrastructure and some built-in datasets. Using BSgenome, Biostrings, IRanges, GenomicRanges, TxDB, ShortRead and Rqc with real datasets from different species is going to be an exceptional experience!
Formation de 2 personnes ou plus ?
Donnez à votre équipe l’accès à la plateforme DataCamp complète, y compris toutes les fonctionnalités.Dans les titres suivants
Analyse des données génomiques en R
Aller à la piste- 1
What is Bioconductor?
GratuitIn this chapter, you will get hands-on with Bioconductor. Bioconductor is the specialized repository for bioinformatics software, developed and maintained by the R community. You will learn how to install and use bioconductor packages. You'll be introduced to S4 objects and functions, because most packages within Bioconductor inherit from S4. Additionally, you will use a real genomic dataset of a fungus to explore the BSgenome package.
Introduction to the Bioconductor Project50 xpBioconductor version100 xpBiocManager to install packages100 xpThe role of S4 in Bioconductor50 xpS4 class definition50 xpInteraction with classes100 xpIntroducing biology of genomic datasets50 xpDiscovering the yeast genome100 xpPartitioning the yeast genome100 xpAvailable genomes50 xp - 2
Biostrings and When to Use Them?
Biostrings are memory efficient string containers. Biostring has matching algorithms, and other utilities, for fast manipulation of large biological sequences or sets of sequences. How efficient you can become by using the right containers for your sequences? You will learn about alphabets, and sequence manipulation by using the tiny genome of a virus.
Introduction to Biostrings50 xpExploring the Zika virus sequence100 xpBiostrings containers50 xpManipulating Biostrings100 xpSequence handling50 xpFrom a set to a single sequence100 xpSubsetting a set50 xpCommon sequence manipulation functions100 xpWhy are we interested in patterns?50 xpSearching for a pattern50 xpFinding Palindromes100 xpFinding a conserved region within six frames100 xpLooking for a match100 xp - 3
IRanges and GenomicRanges
The IRanges and GenomicRanges packages are also containers for storing and manipulating genomic intervals and variables defined along a genome. These packages provide infrastructure and support to many other Bioconductor packages because of their enriching features. You will learn how to use these containers and their associated metadata, for manipulation of your sequences. The dataset you will be looking at is a special gene of interest in the human genome.
IRanges and Genomic Structures50 xpIRanges50 xpConstructing IRanges100 xpInteracting with IRanges100 xpGene of interest50 xpFrom tabular data to Genomic Ranges100 xpGenomicRanges accessors100 xpABCD1 mutation50 xpHuman genome chromosome X100 xpManipulating collections of GRanges50 xpA sequence window50 xpIs it there?50 xpMore about ABCD1100 xpHow many transcripts?100 xpFrom GRangesList object into a GRanges object100 xp - 4
Introducing ShortRead
ShortRead is the package for input, manipulation and assessment of fasta and fastq files. You can subset, trim and filter the sequences of interest, and even do a report of quality. An extra bonus towards the last exercises will give you the tools for parallel quality assessment, wink, wink Rqc. Exciting enough, for this you will use plant genome sequences!
Sequence files50 xpWhy fastq?50 xpReading in files50 xpExploring a fastq file100 xpExtract a sample from a fastq file100 xpSequence quality50 xpExploring sequence quality100 xpBase quality plot50 xpTry your own nucleotide frequency plot100 xpMatch and filter50 xpFiltering reads on the go!100 xpRemoving duplicates50 xpMore filtering!100 xpMultiple assessment50 xpPlotting cycle average quality100 xpIntroduction to Bioconductor50 xp
Formation de 2 personnes ou plus ?
Donnez à votre équipe l’accès à la plateforme DataCamp complète, y compris toutes les fonctionnalités.Dans les titres suivants
Analyse des données génomiques en R
Aller à la pisteensembles de données
Zika Genomic DNA datasetA. Thaliana Short Reads with Quality datasetHuman Gene & Transcript ID datasetYeast Genome datasetcollaborateurs
James Chapman
Voir PlusCurriculum Manager, DataCamp
Paula Martinez
Voir PlusData Scientist and Bioinformatician
Qu’est-ce que les autres apprenants ont à dire ?
Inscrivez-vous 15 millions d’apprenants et commencer Introduction to Bioconductor in R Aujourd’hui!
Créez votre compte gratuit
ou
En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.