The goal of this session is to gain hands-on experience in microbiome analysis using R and the ampvis2 (AMBIS2) package. You’ll explore real microbiome data from wastewater treatment plants (WWTPs) in Denmark — focusing on how microbial communities vary, how to visualize them, and how to interpret basic ecological patterns.
In microbiome studies, raw DNA sequences (from 16S rRNA or similar) are processed into:
🧩 Without metadata, the data are meaningless — you’d know what was sequenced but not where or when.
In this course, the LSB table and metadata are already provided, so students focus on data exploration, not preprocessing.
Before analysis, the dataset must be filtered:
Once data are clean:
Visualize which species are abundant in which samples. Rows = species; columns = samples; color intensity = abundance. ➡️ Turns giant unreadable tables into intuitive color-coded maps 🎨.
Used to compare distributions — for example, how the abundance of a certain taxon differs between treatment plants or seasons.
Explore how similar or different microbial communities are across samples. Common methods:
Each point = sample → close points share similar microbial communities. This helps find clusters, gradients, or outliers in microbial composition.
Although not deeply elaborated, this typically refers to within-sample diversity (e.g., Shannon, Simpson indices). It measures how many species and how evenly they are distributed.
Track changes in microbial composition over time. Useful for studying seasonal variation or system stability.
Once taxonomic patterns are clear, further analysis can identify metabolic functions or gene content of the microbial community — though this part is only briefly mentioned here.
This is the main analysis tool.
💡 Designed for both teaching and real-world research.
The folder data_for_hands-on contains:
📁 All files should be kept in one folder for smoother R operation (setwd() to that folder).
Not to produce graded results — but to practice, learn, and gain confidence in microbiome data analysis.
By the end, participants understand: