Bioinformatics - set up
REQUIRED - We will be using RStudio to analyse the data set. It is recommended that you have the following installed: RStudio version 1.4 or later and R version 4.0 or later. Further details on getting started in RStudio are available here.
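If you are comfortable with a terminal, the R version requirement above can be checked with a short script. This is only a sketch: the function names `version_at_least` and `check_r_version` are made up for illustration, and it assumes `Rscript` is on your PATH and that your `sort` supports the `-V` (version sort) flag, as GNU coreutils and recent macOS versions do. The RStudio version is easiest to check from within RStudio itself (Help > About RStudio).

```shell
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical, not part of the workshop material.

# Succeeds (exit status 0) when version $1 is at least version $2.
# ASSUMPTION: `sort -V` (version sort) is available.
version_at_least() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Reports whether the installed R meets the required version (default 4.0).
# ASSUMPTION: Rscript is installed and on the PATH.
check_r_version() {
  local required="${1:-4.0}"
  local found
  if ! command -v Rscript >/dev/null 2>&1; then
    echo "Rscript not found on PATH; is R installed?" >&2
    return 1
  fi
  # getRversion() returns the running R version, e.g. 4.3.1
  found="$(Rscript -e 'cat(as.character(getRversion()))')"
  if version_at_least "$found" "$required"; then
    echo "R $found is installed (>= $required)"
  else
    echo "R $found is older than $required; please update R" >&2
    return 1
  fi
}
```

After sourcing the script, running `check_r_version` (or `check_r_version 4.0`) prints the installed R version and whether it meets the requirement.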
Optional (not needed for today’s workshop) - We will not be doing the sequence pre-processing steps today, but if you want to do them yourself you will need to install conda and QIIME2.
Raw amplicon 16S sequence data from West et al. (2020) Gut 69, 1452-1459. doi: 10.1136/gutjnl-2019-319620. Download the raw data from the NCBI Sequence Read Archive (project number PRJNA493625) via https://sra-explorer.info/.
Rename the fastq.gz files so that they follow the format SAMPLEID_L001_R1_001.fastq.gz (forward) or SAMPLEID_L001_R2_001.fastq.gz (reverse).
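The renaming step can be scripted rather than done by hand. The sketch below is one possible approach, not the method used to prepare the workshop data. It assumes the downloaded files are named SAMPLEID_1.fastq.gz (forward) and SAMPLEID_2.fastq.gz (reverse); that naming is an assumption, so check what your download actually produced and adjust the pattern if it differs. The function name `rename_fastq` is made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch only: rename paired-end FASTQ files to the
# SAMPLEID_L001_R1_001.fastq.gz / SAMPLEID_L001_R2_001.fastq.gz layout.
# ASSUMPTION: input files are named <SAMPLEID>_1.fastq.gz (forward)
# and <SAMPLEID>_2.fastq.gz (reverse). Adjust the glob if yours differ.
rename_fastq() {
  local dir="${1:-.}"   # directory holding the fastq.gz files (default: current)
  local f base sample mate
  for f in "$dir"/*_[12].fastq.gz; do
    [ -e "$f" ] || continue               # skip if nothing matches the pattern
    base="$(basename "$f" .fastq.gz)"     # e.g. SAMPLEID_1
    sample="${base%_[12]}"                # strip the trailing _1 or _2
    mate="${base##*_}"                    # 1 (forward) or 2 (reverse)
    # -n: never overwrite an existing file
    mv -n -- "$f" "$dir/${sample}_L001_R${mate}_001.fastq.gz"
  done
}
```

After sourcing the script, `rename_fastq path/to/fastq_dir` renames every matching file in that directory. It is worth trying it on a copy of the data first.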
You are not required to download this for today’s tutorial, but if you want to, you can use this data and follow the sequence processing page.
Pre-processed QIIME2 sequence data, as outlined in sequence processing, is available for download on FigShare link. Download the files and view them using QIIME2 View.
The easiest way to follow along with this tutorial is to download this GitHub repository using either option 1 or 2 below:
Option 1: Go to https://github.com/siobhon-egan/BIO514-microbiome and click on the green Code button. Select Download ZIP, then open/unzip the file. Open the .Rmd files in RStudio and you will be able to follow along with the data analysis.
Option 2: Use the terminal to clone the GitHub repo.
git clone https://github.com/siobhon-egan/BIO514-microbiome.git
You should only need to do this once to install required libraries.
I recommend you run this bit of code line by line, i.e. copy and paste one line of code directly into the console and press enter.
For some reason running the whole script at once doesn’t seem to work.
Also, the order in which the packages are installed is important, so please stick to this order.
Remember it assumes you have R version 4.0 and RStudio version 1.4.
if (!requireNamespace("BiocManager", quietly = TRUE))
  install.packages("BiocManager")
# Phyloseq
BiocManager::install("phyloseq")
# Tidyverse
install.packages("tidyverse")
# Install microbiome
BiocManager::install("microbiome")
# Install microbiome utilities
install.packages("devtools")
devtools::install_github("microsud/microbiomeutilities")
# Install DirichletMultinomial
BiocManager::install("DirichletMultinomial")
# Install MicrobiotaProcess
BiocManager::install("MicrobiotaProcess")
# Install ampvis2
install.packages("remotes")
remotes::install_github("MadsAlbertsen/ampvis2")
# Install vegan
install.packages("vegan")
# Install tidyverse
install.packages("tidyverse")
# Install ggpubr
install.packages("ggpubr")
# R color brewer
install.packages("RColorBrewer")
# Install reshape2
install.packages("reshape2")
# Install plotly
install.packages("plotly")
Note that I have not included packages relating to “knitting” the document here. Everything should be executable within the R markdown files by running the code chunks.