Association studies have linked microbiome alterations with many human diseases. However, they have not always reported consistent results, thereby necessitating cross-study comparisons. Here, a meta-analysis of eight geographically and technically diverse fecal shotgun metagenomic studies of colorectal cancer (CRC, n = 768), which was controlled for several confounders, identified a core set of 29 species significantly enriched in CRC metagenomes (false discovery rate (FDR) < 1 × 10-5). CRC signatures derived from single studies maintained their accuracy in other studies. By training on multiple studies, we improved detection accuracy and disease specificity for CRC. Functional analysis of CRC metagenomes revealed enriched protein and mucin catabolism genes and depleted carbohydrate degradation genes. Moreover, we inferred elevated production of secondary bile acids from CRC metagenomes, suggesting a metabolic link between cancer-associated gut microbes and a fat- and meat-rich diet. Through extensive validations, this meta-analysis firmly establishes globally generalizable, predictive taxonomic and functional microbiome CRC signatures as a basis for future diagnostics.
- MeSH
- Adenoma genetics microbiology MeSH
- Models, Biological MeSH
- Databases, Genetic MeSH
- Species Specificity MeSH
- Feces microbiology MeSH
- Cohort Studies MeSH
- Colorectal Neoplasms genetics microbiology MeSH
- Middle Aged MeSH
- Humans MeSH
- Metagenome * MeSH
- Biomarkers, Tumor metabolism MeSH
- Reproducibility of Results MeSH
- Aged MeSH
- Gastrointestinal Microbiome genetics MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Meta-Analysis MeSH
- Research Support, N.I.H., Extramural MeSH
Several studies have investigated links between the gut microbiome and colorectal cancer (CRC), but questions remain about the replicability of biomarkers across cohorts and populations. We performed a meta-analysis of five publicly available datasets and two new cohorts and validated the findings on two additional cohorts, considering in total 969 fecal metagenomes. Unlike microbiome shifts associated with gastrointestinal syndromes, the gut microbiome in CRC showed reproducibly higher richness than controls (P < 0.01), partially due to expansions of species typically derived from the oral cavity. Meta-analysis of the microbiome functional potential identified gluconeogenesis and the putrefaction and fermentation pathways as being associated with CRC, whereas the stachyose and starch degradation pathways were associated with controls. Predictive microbiome signatures for CRC trained on multiple datasets showed consistently high accuracy in datasets not considered for model training and independent validation cohorts (average area under the curve, 0.84). Pooled analysis of raw metagenomes showed that the choline trimethylamine-lyase gene was overabundant in CRC (P = 0.001), identifying a relationship between microbiome choline metabolism and CRC. The combined analysis of heterogeneous CRC cohorts thus identified reproducible microbiome biomarkers and accurate disease-predictive models that can form the basis for clinical prognostic tests and hypothesis-driven mechanistic studies.
- MeSH
- Choline metabolism MeSH
- Databases, Genetic MeSH
- Species Specificity MeSH
- Cohort Studies MeSH
- Colorectal Neoplasms diagnosis metabolism microbiology MeSH
- Humans MeSH
- Lyases genetics metabolism MeSH
- Metagenomics * MeSH
- Biomarkers, Tumor metabolism MeSH
- Gastrointestinal Microbiome MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
- Publication type
- Published Erratum MeSH