Differential abundance analysis (DAA)

Thursday, August 28, 2025

Differential abundance analysis (DAA)

Goal: identify taxa with different abundance between groups
Approaches:
- Classical statistical tests
- Microbiome-specific methods
Challenges: compositionality, sparsity, multiple testing

Elementary methods provide more replicable results in microbial differential abundance analysis

Relative abundances with a Wilcoxon test
Log-transformed relative abundances with a t-test
Presence/absence of taxa with logistic regression

Pelto et al. 2025 show that simple methods often give more replicable results.

Classical tests are linear models

Relative abundances with a Wilcoxon test
- Non-parametric, but can be expressed as a rank-based linear model

\[ \mathrm{Rank}(\mathrm{Relative\ abundance}) \sim \mathrm{Group} \]

Log-transformed relative abundances with a t-test
- Standard linear model

\[ \log(\mathrm{Relative\ abundance}) \sim \mathrm{Group} \]

Presence/absence of taxa with logistic regression
- Generalized linear model with logit link

\[ \mathrm{Presence} \sim \mathrm{Group}, \quad \mathrm{Presence} \in \{0,1\} \]

Wilcoxon vs t-test

Wilcoxon: compares medians, robust to outliers, non-parametric
t-test: compares means, assumes normality, sensitive to outliers

Demonstration

library(maaslin3)

maaslin3_out <- maaslin3(
    input_data = tse,
    formula = "~ patient_status",
    normalization = "TSS",
    transform = "LOG",
)

Exercises

From OMA online book, Chapter 17: Differential abundance

All exercises

References