Metabarcoding
genomics
metabarcoding
Notes for dealing with metabarcoding sequence data.
Metabarcoding
Analysis of amplicon metabarcoding next generation sequencing data
🔗 Useful resources
- Microbiome notes by Mikhail Dozmorov
- Metagenomics R packages
- Introduction to the microbiome R package by Leo Lahti, Sudarshan Shetty et al., core microbiome - Tools for microbiome analysis; with multiple example data sets from published studies; extending the phyloseq class. The package is in Bioconductor and aims to provide a comprehensive collection of tools and tutorials, with a particular focus on amplicon sequencing data.
- How to give life to your microbiome data by Ruth Schmidt - A tutorial on how to use Plotly’s R graphing library for microbiome data analysis and visualization. Also see GitHub for more useful info.
- omicplotR - An R package to visualize high-throughput sequencing data.
- MICrobial Community Analysis (micca) - software pipeline for the processing of amplicon sequencing data, from raw sequences to OTU tables, taxonomy classification and phylogenetic tree inference. The pipeline can be applied to a range of highly conserved genes/spacers, such as 16S rRNA gene, Internal Transcribed Spacer (ITS) 18S and 28S rRNA.
- phyloseq tutoial by Daniel Vaulot - This document explains the use of the phyloseq R library to analyze metabarcoding data. -Otago Uni eDNA GitHub page -Amplicon analysis with QIIME2 - VL microbiome project by Rachael Lappan
- Workflow for Microbiome Data Analysis: from raw reads to community analyses by Benjamin J Callahan
- MicrobiomeR - An R package for microbiome analysis that incorporates phyloseq, metacoder, taxa, and microbiome in order to standardize and simplify common microbiome workflows.
- microeco: An R package for data mining in microbial community ecology by Chi Liu, Minjie Yao
- ampvis2 - ampvis2 is an R-package to conveniently visualise and analyse 16S rRNA amplicon data in different ways.
- microbiomeseq - This R package is developed to enhance the available statistical analysis procedures in R by providing more analysis produre and visualisation of results for microbial communities data obtained from 16S rRNA.
- Analysis of Microbiome Community Data in R by Grunwald lab
- MCSMRT Microbiome Classifier for SMRT PacBio data - a tool to cluster PacBio FL16S amplicon microbiome sequences into Operational Taxonomic Units (OTU) and assign species level taxonomic classifications. Outputs include a table of read counts assigned to each OTU centroid sequence (per sample) with corresponding taxonomic lineage, and a a table of read specific metrics (e.g., CCS count, expected error, length, primer matching result, etc.).
- Bioplatforms Australia - Operational taxonomic unit (OTU) query system - BPA-OTU is a web-based portal into Operational Taxonomic Unit (OTU) data, developed to access data from the Australian Microbiome.
- AusMicrobiome metagenome - In development An assembly and annotation pipeline using standardised tools to characterise shotgun metagenomes produced by the Marine Microbes and Australian Microbiome Project.
Additional analysis
- decontam R package - R package that outlines benchmarking, and demonstrates the benefits of
decontam
-inating your data for more accurate profiling of microbial communities. - The PR3 primer database - A database of eukaryotic rRNA primers and primer sets for metabarcoding studies compiled from the literature.
- Managing Batch Effects in Microbiome Data by Yiwen Wang, Kim-Anh Lê Cao
RShiny
- Biome-Shiny - A Shiny R app for microbiome visualization.
- Dynamic Assessment of Microbial Ecology (DAME) - This R Shiny app is an open source platform that uses the R environment to perform microbial ecology data analyses (designed for QIIME1 output).
- animalcules - animalcules is an R package/R Shiny app for utilizing up-to-date data analytics, visualization methods, and machine learning models to provide users an easy-to-use interactive microbiome analysis framework.
- An R package for the analysis and visualization of microbial communities by Janina Reeder - The microbiomeExplorer package and the Shiny application contained with it provides methods and visualizations to explore the results of 16S rRNA amplicon sequencing experiment.
Sample size calculator
Links to some useful resources for calculating power and sample size in microbiome studies. Good for grant and project proposals to show you have thought about sample size etc.
Some general points to consider
- What population should I target?
- How should I collect and store the samples? (CONTROLS!!!)
- What am I targeting and how should i extract?
- What sequencing approach should I use, amplicon or whole shotgun metagenome? Will there be PCR involved or do I want a pre-PCR library preparation method?
- How deeply should I sequence?
References
- Mattiello F, Verbist B, Faust K, Raes J, Shannon WD, Bijnens L, Thas O. A web application for sample size and power calculation in case-control microbiome studies. Bioinformatics. 2016 Jul 1;32(13):2038-40. doi: 10.1093/bioinformatics/btw099
- Kelly BJ, Gross R, Bittinger K, Sherrill-Mix S, Lewis JD, Collman RG, Bushman FD, Li H. Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA. Bioinformatics. 2015 Aug 1;31(15):2461-8. doi: 10.1093/bioinformatics/btv183
- Casals-Pascual C, González A, Vázquez-Baeza Y, Song SJ, Jiang L, Knight R. Microbial Diversity in Clinical Microbiome Studies: Sample Size and Statistical Power Considerations. Gastroenterology. 2020 May;158(6):1524-1528. doi: 10.1053/j.gastro.2019.11.305.