2022 Apr 18;4(4):458–475. doi: 10.1038/s42255-022-00558-0

© The Author(s) 2022, corrected publication 2023

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Extended Data Fig. 1. a, Abundance of top CRC-enriched and differentially abundant bacteria in tissue per stage (TCGA). Boxplots show medians with 1st and 3rd quartiles. Whiskers extend from the hinges to the smallest and largest values within 1.5 × the interquartile range (IQR). n=108 healthy, n=102 Stage I, n=209 Stage II, n=163 Stage III and n=86 Stage IV biologically independent donor samples; p=0.000705, p=0.00101, p=0.0000154 and p=0.00153 for Bacteroides in Stages I-IV vs. Healthy, respectively; p=0.0112 and p=0.0402 for Campylobacter in Stages II and IV vs. Healthy, respectively; p=0.0223 for Fusobacterium in Stage II vs. Healthy; p=0.0293, p=0.000608 and p=0.00267 for Gemella in Stages I, II and IV vs. Healthy, respectively; pairwise t test. b, Fn abundance in the EGA cohort. PathSeq analysis of Fn abundance in matching normal vs. adenocarcinoma tissue on RNA-seq data, p=0.00165, paired two-tailed t test. c, Fn tissue abundance distribution in CRC patients of the EGA cohort. Fn was detected via RNA-seq and analyzed for the distribution of the logarithmic bacterial score (n=69). Quantile-based classification (color code) was applied for the target bacterium (quantcut function from the gtools R package). d, Correlation of Fn abundance with consensus molecular subtypes (CMS) in CRC. The cohort in c was subjected to gene expression analysis and further classified into CMS via the CMScaller R package as described⁶⁷. Colored, segregated bars show the proportion of patients with differing fusobacterial loads per CMS. Chi-squared tests were performed to compare fusobacterial load across all CMS. No significant differences were observed. CMS1, n=17; CMS2, n=19; CMS3, n=7; CMS4, n=5. e, IPA analysis of Fusobacterium^{high vs. no} differential gene expression analysis of the TCGA dataset. Plot shows z-scores, p-values, and the number of molecules per pathway. Selected significant pathways are shown (-log(p-value)>1.3). f, KEGG-based GSEA of Fusobacterium^{high vs. no} differential gene expression analysis of the TCGA dataset (pathfindR R package). Plot shows fold enrichment, p-values, and the number of genes per pathway. All significant pathways are shown (p<0.05). g, IPA analysis of Fusobacterium^{high vs. no} differential gene expression analysis of the EGA dataset. Plot shows z-scores, p-values, and the number of enriched molecules per pathway. Selected significant pathways are shown (-log(p-value)>1.3). *p<0.05, **p<0.01, ***p<0.001.
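The quantile-based classification in panel c and the pathway cutoff in panels e and g can be illustrated with a minimal Python sketch. This is not the authors' code: the legend names the R function quantcut, whereas the helper names `quantile_bins` and `passes_threshold`, the choice of four bins, and the example scores below are all hypothetical. The sketch also assumes that the logarithm in "-log(p-value)>1.3" is base 10, which would make the cutoff roughly equivalent to p<0.05.

```python
import math
from statistics import quantiles


def quantile_bins(values, q=4):
    """Assign each value to one of q quantile-based bins (0 = lowest).

    Hypothetical stand-in for a quantcut-style classification;
    the number of bins used in the study is not stated in the legend.
    """
    # q - 1 cut points computed from the sample itself
    cuts = quantiles(values, n=q, method="inclusive")
    # A value's bin index is the number of cut points it strictly exceeds,
    # so a value equal to a cut point falls into the lower bin.
    return [sum(v > c for c in cuts) for v in values]


def passes_threshold(p_value, cutoff=1.3):
    """Return True if -log10(p) exceeds the cutoff (base 10 is an assumption)."""
    return -math.log10(p_value) > cutoff


# Made-up logarithmic abundance scores, for illustration only
scores = [0.0, 0.2, 0.5, 1.1, 1.8, 2.4, 3.0, 4.2]
print(quantile_bins(scores))
print(passes_threshold(0.05), passes_threshold(0.06))
```

Under the base-10 assumption, -log10(0.05) is about 1.301, so a pathway with p=0.05 just clears the 1.3 cutoff while one with p=0.06 does not.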