Day 5 part 1 microbial community analysis using 16S rRNA

Environmental Biotechnology

๐ŸŒ Introduction: Why Study Microbial Communities?

Microbes are everywhere โ€” in air, soil, water, humans, and even in extreme environments. They drive:

  • Natural cycles (carbon, nitrogen, etc.)
  • Human health and biotechnology ๐Ÿงซ
  • Biogas, water treatment, nutrient recovery, and even bioplastics production โ™ป๏ธ

Understanding whoโ€™s there (identification) and what they do (function) is key. But bacteria are invisible ๐Ÿ‘€ โ€” so we need methods to identify them!


๐Ÿ”ฌ Methods to Identify Microbes

1. Cultivation / Isolation

Grow bacteria on plates. โœ… Pros: allows study of metabolism and traits. โŒ Cons: only a small fraction can be cultured, slow, and selective. Metaphor: studying wild wolves ๐Ÿบ by raising a poodle ๐Ÿถ โ€” not the same!


2. FISH (Fluorescent In Situ Hybridization)

Youโ€™ve probably used this before! โœ… Visualizes bacteria in situ, no need to culture. โŒ Slow, limited by probe design (you only see what you look for). Good for morphology and spatial distribution.


3. DNA Sequencing (16S rRNA gene)

The core of modern microbiome analysis.

  • Targets the 16S ribosomal RNA gene, about 1500 bp long.
  • Contains:
    • Conserved regions ๐Ÿงฑ โ†’ for universal primer binding
    • Hypervariable regions ๐ŸŽจ โ†’ unique bacterial โ€œfingerprintsโ€

โœ… Fast, affordable, and cultivation-independent. โŒ You get names, not functions directly, and biases exist (primer mismatch, copy number variation).


๐Ÿงฌ The 16S Workflow: From Sample to Data Table

1. Sampling

Critical first step. Must represent the environment (soil, wastewater, etc.). Bad sample = bad data.


2. DNA Extraction

Goal: isolate pure DNA.

  1. Break cells open ๐Ÿ”จ
  2. Remove proteins, RNA, and contaminants
  3. Store clean DNA (โˆ’20 ยฐC or โˆ’80 ยฐC)

3. Library Preparation

Prepare DNA for sequencing:

  • Amplify specific 16S regions via PCR
  • Add adapters (for sequencing) and barcodes (for identifying samples)
  • Clean up PCR products (remove junk)
  • Check quality via gel or TapeStation

Samples are pooled โ†’ reduces cost.


๐Ÿงซ Sequencing Technologies

๐Ÿงฉ Illumina

  • Short-read sequencing (e.g., 2ร—250 bp)
  • Uses bridge amplification and sequencing-by-synthesis (detects fluorescent signals per base). Widely used, high accuracy.

๐Ÿ”„ PacBio

  • Long-read sequencing.
  • Reads a circularized DNA template multiple times for accuracy. Also detects light, like Illumina.

โšก Nanopore

  • Long-read sequencing using a tiny pore ๐Ÿ•ณ๏ธ
  • DNA passes through โ†’ changes in electrical current reveal bases.
  • Portable and real-time sequencing (common in many labs today).

๐Ÿง  Bioinformatics Pipeline (after sequencing)

  1. Raw reads โ†’ remove primers, barcodes, adapters.
  2. Merge forward/reverse reads.
  3. Detect unique sequences (ASVs) or cluster (OTUs).
    • OTU: 97 % similarity threshold.
    • ASV: 100 % identical sequences (higher resolution).
  4. Remove chimeras (artifacts).
  5. Demultiplex (assign reads to samples).
  6. Taxonomic assignment โ†’ map sequences to a database. Output: a feature table (ASV ร— sample matrix).

๐Ÿงฎ Databases for 16S Classification

DatabaseTraitsNotes
SILVABroad, high-qualityNot updated since 2018 โš ๏ธ
RDPClassic but outdatedโš ๏ธ old
GreengenesUpdated but variable quality
MiDAS ๐ŸงซEcosystem-specific (wastewater, biogas)In-house, high-quality full-length sequences

๐Ÿ‘‰ Use ecosystem-specific DBs when possible for better functional inference.


๐Ÿ“Š The Output: The ASV Table

A matrix like this:

ASV IDSample 1Sample 2...PhylumGenusSpecies
ASV110055...ProteobacteriaNitrospira...

This connects identity (taxonomy) with abundance (how much is there).


๐ŸŒฟ Diversity Metrics

๐Ÿงฉ Alpha Diversity (within-sample)

Measures how rich and even a single community is.

  • Richness = number of species ๐ŸŽ๐ŸŒ๐Ÿ‡
  • Evenness = balance between them

High diversity โ†’ more stable ecosystems.


๐Ÿ” Beta Diversity (between-samples)

Compares similarity or dissimilarity between samples. Example: microbiomes in healthy vs sick individuals ๐Ÿงโ€โ™‚๏ธ๐Ÿงโ€โ™€๏ธ Similar composition โ†’ similar state.


๐Ÿ’Ž Core Communities

Bacterial groups found consistently across many samples. Helps identify โ€œkey playersโ€ and environmental drivers in ecosystems.


๐Ÿงฉ Key Takeaways

  • Microbes are fundamental in environmental processes ๐ŸŒŽ
  • 16S rRNA sequencing is the gold standard for community profiling
  • Each step (sampling โ†’ extraction โ†’ sequencing โ†’ analysis) matters
  • ASVs offer fine taxonomic resolution
  • Interpretation relies on diversity, taxonomy, and ecology metrics

Quiz

Score: 0/30 (0%)