Chapter 4 Community Profiling Part II

4.1 Structural and Functional Approaches to study microbiomes

Alves, L. D. F., Westmann, C. A., Lovate, G. L., de Siqueira, G. M. V., Borelli, T. C., & Guazzaroni, M. E. (2018). Metagenomic approaches for understanding new concepts in microbial science. International journal of genomics, 2018.
Alves, L. D. F., Westmann, C. A., Lovate, G. L., de Siqueira, G. M. V., Borelli, T. C., & Guazzaroni, M. E. (2018). Metagenomic approaches for understanding new concepts in microbial science. International journal of genomics, 2018.

4.1.1 16S rRNA as an evolutionary chronometer

Case, R. J., Boucher, Y., Dahllöf, I., Holmström, C., Doolittle, W. F., & Kjelleberg, S. (2007). Use of 16S rRNA and rpoB genes as molecular markers for microbial ecology studies. Applied and environmental microbiology, 73(1), 278-288.
Case, R. J., Boucher, Y., Dahllöf, I., Holmström, C., Doolittle, W. F., & Kjelleberg, S. (2007). Use of 16S rRNA and rpoB genes as molecular markers for microbial ecology studies. Applied and environmental microbiology, 73(1), 278-288.
  1. Ubiquitous – present in all known life (excluding viruses)
  2. Functionally constant wrt translation and secondary structure
  3. Evolves very slowly – mutations are extremely rare
  4. Large enough to extract information for evolutionary inference
  5. Limited exchange – limited examples of rRNA gene sharing between organisms

4.1.2 16S rRNA vs rpoB (RNA polymerase β subunit gene)

Case, R. J., Boucher, Y., Dahllöf, I., Holmström, C., Doolittle, W. F., & Kjelleberg, S. (2007). Use of 16S rRNA and rpoB genes as molecular markers for microbial ecology studies. Applied and environmental microbiology, 73(1), 278-288.
Case, R. J., Boucher, Y., Dahllöf, I., Holmström, C., Doolittle, W. F., & Kjelleberg, S. (2007). Use of 16S rRNA and rpoB genes as molecular markers for microbial ecology studies. Applied and environmental microbiology, 73(1), 278-288.

4.1.2.1 16S rRNA hypervariable regions

Yang, B., Wang, Y., & Qian, P. Y. (2016). Sensitivity and correlation of hypervariable regions in 16S rRNA genes in phylogenetic analysis. BMC bioinformatics, 17(1), 1-8.
Yang, B., Wang, Y., & Qian, P. Y. (2016). Sensitivity and correlation of hypervariable regions in 16S rRNA genes in phylogenetic analysis. BMC bioinformatics, 17(1), 1-8.
Yang, B., Wang, Y., & Qian, P. Y. (2016). Sensitivity and correlation of hypervariable regions in 16S rRNA genes in phylogenetic analysis. BMC bioinformatics, 17(1), 1-8.
Yang, B., Wang, Y., & Qian, P. Y. (2016). Sensitivity and correlation of hypervariable regions in 16S rRNA genes in phylogenetic analysis. BMC bioinformatics, 17(1), 1-8.

Illustration of different hypervariable regions of 16S rRNA

4.2 Basic Workflow for 16S Gene Based Sequencing

Jo, J. H., Kennedy, E. A., & Kong, H. H. (2016). Research techniques made simple: bacterial 16S ribosomal RNA gene sequencing in cutaneous research. Journal of Investigative Dermatology, 136(3), e23-e27.
Jo, J. H., Kennedy, E. A., & Kong, H. H. (2016). Research techniques made simple: bacterial 16S ribosomal RNA gene sequencing in cutaneous research. Journal of Investigative Dermatology, 136(3), e23-e27.

4.3 Addressing the ‘fine print’ while generating 16S rRNA gene amplicon libraries

  1. Sample Collection
  • Sample collection significantly influences the microbiome profiler after sequencing
  • Sample storage
  1. DNA isolation
  • Template concentration
  • Template extraction protocol
  1. PCR amplification
  • PCR bias and inhibitors
  • Amplification of contaminants

J. Microbiol Methods (2018), App. Environ. Microbiol. (2014), Microbiome (2015)

4.4 Steps Involved

  1. Experimental Design: How many samples can be included in the sequencing run?
  • By using barcoded primers, numerous samples can be sequenced simultaneously (multiplexing)
V4 Region
V4 Region

4.4.1 Samples

  1. More the number of samples, more cost effective the run (sequencing depth will be compromised)

Comparison of multiplexing capacity by sequencing system

  1. It is critical to have a ‘library prep manifest’ to document the position of each sample with its associated barcode along with additional metadata information

https://www.youtube.com/watch?v=3SEz-i517Oo&t=5s

4.4.2 Include Controls

  1. Between run repeat (process any sample in duplicate per run to measure reproducibility across runs)
  2. Within run repeat (process any sample in duplicate per plate to measure reproducibility)
  3. Water used during PCR (water blank- to determine if any contaminant was introduced during PCR reaction)
  4. Water spiked with known bacterial DNA (mock bacterial communities- enables quantification of sequencing errors, minimizes bias during sampling and library preparation )

4.4.3 DNA extraction protocol

  1. Effect of mechanical lysis methods for extraction
  2. Presence of inhibitors such as organic matter, humic acid, bile salts, polysaccharides
  3. DNA yield post extraction and reproducibility

Effect of bead beating was larger than sampling time over 5 months

The effect of bead beating on the observed microbial community composition:

Albertsen, M., Karst, S. M., Ziegler, A. S., Kirkegaard, R. H., & Nielsen, P. H. (2015). Back to basics–the influence of DNA extraction and primer choice on phylogenetic analysis of activated sludge communities. PloS one, 10(7), e0132783.
Albertsen, M., Karst, S. M., Ziegler, A. S., Kirkegaard, R. H., & Nielsen, P. H. (2015). Back to basics–the influence of DNA extraction and primer choice on phylogenetic analysis of activated sludge communities. PloS one, 10(7), e0132783.

A. Percentage read abundance of the 11 most abundant phyla as a result of bead beating intensity

B. PCA of samples with different bead beating intensities vs. samples taken at different dates

4.4.4 Selection of primers and region of 16S gene influence microbial profile

V2, V4, V6-V7 regions produced consistent results

Barb, J. J., Oler, A. J., Kim, H. S., Chalmers, N., Wallen, G. R., Cashion, A., … & Ames, N. J. (2016). Development of an analysis pipeline characterizing multiple hypervariable regions of 16S rRNA using mock samples. PLoS One, 11(2), e0148047.
Barb, J. J., Oler, A. J., Kim, H. S., Chalmers, N., Wallen, G. R., Cashion, A., … & Ames, N. J. (2016). Development of an analysis pipeline characterizing multiple hypervariable regions of 16S rRNA using mock samples. PLoS One, 11(2), e0148047.
  1. V2, V3 and V6 contain maximum nucleotide heterogeneity
  2. V6 is the shortest hypervariable region with the maximum sequence heterogeneity
  3. V1 is best target for distinguishing pathogenic S aureus
  4. V2 and V3 are excellent targets for speciation among Staph and Strep pathogens as well as Clostridium and Neisseria species
  5. V2 especially useful for speciation of Mycobacterium sp. and detection of E coli O157:H7
  6. V3 useful for speciation of Haemophilus sp
  7. V6 best target for probe based PCR assays to identify CDC select agents (bio-terrorism agents)
Chakravorty, S., Helb, D., Burday, M., Connell, N., & Alland, D. (2007). A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria. Journal of microbiological methods, 69(2), 330-339.
Chakravorty, S., Helb, D., Burday, M., Connell, N., & Alland, D. (2007). A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria. Journal of microbiological methods, 69(2), 330-339.

4.4.5 Purification of Amplicons

After one –step or two-step PCR, products are cleaned up using AMpure beads

  1. Gel Electrophoresis and quantification of cleaned amplicon products
  • Qubit
  1. Sample pooling – equimolar concentrations (how many samples do you want to pool? How many reads per sample?
  2. Gel extraction of pooled product
  3. Final clean up (Qiagen kit) and QC
16S Library Prep
16S Library Prep

Amplicon Sequencing Library Prep - PacBio

4.4.5.1 Overview of generic amplicon workflow

Amplicon Workflow
Amplicon Workflow

4.5 Oxford Nanopore Sequencing

Wang, Y., Zhao, Y., Bollas, A., Wang, Y., & Au, K. F. (2021). Nanopore sequencing technology, bioinformatics and applications. Nature biotechnology, 39(11), 1348-1365.
Wang, Y., Zhao, Y., Bollas, A., Wang, Y., & Au, K. F. (2021). Nanopore sequencing technology, bioinformatics and applications. Nature biotechnology, 39(11), 1348-1365.

4.5.1 How does it work?

  1. Proetin pore
    • nanoscale
    • biosensor
    • motor protein ratchets DNA/RNA through
  2. Ionic current
    • constant voltage
    • in electrolytic solution
    • disrupted by nucleotide sequence
    • changes in current correspond to sequence

Watch the video below and answer questions:

https://www.youtube.com/watch?v=RcP85JHLmnI

  1. How does the DNA bind to the pore?
  2. Does something help guide the DNA to the pore?
  3. What is the signal produced by the DNA?

https://www.youtube.com/watch?v=E9-Rm5AoZGw

Applications for ONT sequencing:

Wang, Y., Zhao, Y., Bollas, A., Wang, Y., & Au, K. F. (2021). Nanopore sequencing technology, bioinformatics and applications. Nature biotechnology, 39(11), 1348-1365.
Wang, Y., Zhao, Y., Bollas, A., Wang, Y., & Au, K. F. (2021). Nanopore sequencing technology, bioinformatics and applications. Nature biotechnology, 39(11), 1348-1365.

4.6 Metagenomics and Metatranscriptomics

Metagenomics: Untargeted sequencing of all microbial genomes present in a sample.

4.6.1 Shotgun Metagenomics

  • Study design and experimental protocol
  • Computational pre-processing
  • Sequence analysis
  • Post-processing
  • Validation
Quince, C., Walker, A. W., Simpson, J. T., Loman, N. J., & Segata, N. (2017). Shotgun metagenomics, from sampling to analysis. Nature biotechnology, 35(9), 833-844.
Quince, C., Walker, A. W., Simpson, J. T., Loman, N. J., & Segata, N. (2017). Shotgun metagenomics, from sampling to analysis. Nature biotechnology, 35(9), 833-844.

4.6.2 Sample collection and DNA extraction

  1. Sample collection and preservation methods can affect quality and accuracy of metagenomic data
    • Collect sufficient biomass
    • Minimize contamination
    • Enrichment methods where applicable
  2. DNA extraction methods can affect the composition of downstream sequence data
  • Method must be effective for diverse microbial taxa
    • Mechanical lysis (bead beating) method is considered superior, however, data will be biased for easy-to-lyse microbes
    • Bead beating will result in short DNA fragments and lead to DNA loss during library prep methods.

4.6.3 Sources of contamination

  1. Kit or lab reagents
  2. Low biomass samples are vulnerable to contamination as there is less ‘real’ signal to compete with low levels of contamination
    • Use ultraclean kits
    • Include blank sequencing controls
  3. Cross- over from previous sequencing runs
  4. PhiX control DNA
  5. Human/ host DNA

4.6.4 Coverage and Sequencing considerations

  1. No published guidelines for ‘correct’ amount of coverage for a given environment
  • Choose a system that maximizes output in order to recover sequences from as many low-abundance members of the microbiome as possible

  • HiSeq 2500 or 4000, NextSeq and NovaSeq produce high volume data (120Gb- 1.5 Tb per run) – suited for metagenomics study

  • Multiplexing prudently will enable desired per-sample sequencing depth

4.6.5 Illumina Sequencers and Yield

https://www.illumina.com/systems/sequencing-platforms.html

Quince, C., Walker, A. W., Simpson, J. T., Loman, N. J., & Segata, N. (2017). Shotgun metagenomics, from sampling to analysis. Nature biotechnology, 35(9), 833-844.
Quince, C., Walker, A. W., Simpson, J. T., Loman, N. J., & Segata, N. (2017). Shotgun metagenomics, from sampling to analysis. Nature biotechnology, 35(9), 833-844.

4.6.6 Generalized workflow of metagenomic next-generation sequencing for diagnostic clinical use

Gu, W., Miller, S., & Chiu, C. Y. (2019). Clinical metagenomic next-generation sequencing for pathogen detection. Annual Review of Pathology: Mechanisms of Disease, 14, 319-338.
Gu, W., Miller, S., & Chiu, C. Y. (2019). Clinical metagenomic next-generation sequencing for pathogen detection. Annual Review of Pathology: Mechanisms of Disease, 14, 319-338.

4.6.7 Generic Analysis Workflow

Oulas, A., Pavloudi, C., Polymenakou, P., Pavlopoulos, G. A., Papanikolaou, N., Kotoulas, G., … & Iliopoulos, L. (2015). Metagenomics: tools and insights for analyzing next-generation sequencing data derived from biodiversity studies. Bioinformatics and biology insights, 9, BBI-S12462.
Oulas, A., Pavloudi, C., Polymenakou, P., Pavlopoulos, G. A., Papanikolaou, N., Kotoulas, G., … & Iliopoulos, L. (2015). Metagenomics: tools and insights for analyzing next-generation sequencing data derived from biodiversity studies. Bioinformatics and biology insights, 9, BBI-S12462.

4.6.8 Strengths and weaknesses of assembly-based and read-based metagenomics analysis

Quince, C., Walker, A. W., Simpson, J. T., Loman, N. J., & Segata, N. (2017). Shotgun metagenomics, from sampling to analysis. Nature biotechnology, 35(9), 833-844.
Quince, C., Walker, A. W., Simpson, J. T., Loman, N. J., & Segata, N. (2017). Shotgun metagenomics, from sampling to analysis. Nature biotechnology, 35(9), 833-844.

4.6.10 Benefits and limitations of whole genome metagenomics

Benefits

  • Integrative meta-omics
  • Strain-level profiling
  • Longitudinal study design
  • Capability of sequencing large regions or entire genome
  • Identification of organisms in addition to bacteria, archaea
  • Increased prediction of genes and functional pathways

Limitations

  • Expensive
  • Compute intensive
  • Incomplete databases
  • Biases in functional profiling
  • Unvalidated data in the public space
  • Live or dead dilemma
Malla, M. A., Dubey, A., Kumar, A., Yadav, S., Hashem, A., & Abd_Allah, E. F. (2019). Exploring the human microbiome: the potential future role of next-generation sequencing in disease diagnosis and treatment. Frontiers in Immunology, 9, 2868.
Malla, M. A., Dubey, A., Kumar, A., Yadav, S., Hashem, A., & Abd_Allah, E. F. (2019). Exploring the human microbiome: the potential future role of next-generation sequencing in disease diagnosis and treatment. Frontiers in Immunology, 9, 2868.

“What are they doing?” - Metatranscriptomics

Bikel, S., Valdez-Lara, A., Cornejo-Granados, F., Rico, K., Canizales-Quinteros, S., Soberón, X., … & Ochoa-Leyva, A. (2015). Combining metagenomics, metatranscriptomics and viromics to explore novel microbial interactions: towards a systems-level understanding of human microbiome. Computational and structural biotechnology journal, 13, 390-401.
Bikel, S., Valdez-Lara, A., Cornejo-Granados, F., Rico, K., Canizales-Quinteros, S., Soberón, X., … & Ochoa-Leyva, A. (2015). Combining metagenomics, metatranscriptomics and viromics to explore novel microbial interactions: towards a systems-level understanding of human microbiome. Computational and structural biotechnology journal, 13, 390-401.
NCGR
NCGR

4.6.11 Benefits of Metatranscriptomics

NCGR
NCGR
knitr::write_bib(c(
  .packages(), 'bookdown', 'knitr', 'rmarkdown'
), 'packages.bib')