2024 Sep 26;15:8261. doi: 10.1038/s41467-024-52598-7

Fig. 4. Evidence-supported genome mining and disease association.

a Schematic representation of our proposed BGC prioritization strategy, an adapted version of the BiGMAP workflow. Metagenomic assembly is performed for each sample, followed by BGC prediction. Next, all samples are aligned against all core biosynthetic genes of the predicted BGCs. Coverage information is extracted, and downstream analysis is performed. b Volcano plot of the differential BGC coverage analysis results. Only matching combinations of biospecimen and initial BGC contig are shown, which constitute only a fraction of all results. Unadjusted p-values from two-tailed unpaired Wilcoxon tests are plotted; the two horizontal lines mark the 0.05 threshold before and after p-value adjustment. c Predicted host species distribution of the assembled DNA fragments on which significantly associated core biosynthetic genes reside. Color reflects the number of significant BGCs. d Comparison of the highest-correlating effect sizes from the differential BGC coverage analysis between alternative diets and diseases. The y-axis shows the effect size of the vegetarian-omnivore comparison. The x-axis shows the effect size of the cohort named above each panel compared against the healthy cohort. For the fourth panel, the minimum effect size across all cohort comparisons is taken for each BGC and compared against the diet comparison.
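The statistical step behind panel b can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes a hypothetical coverage matrix with one row per BGC and one column per sample, uses SciPy's `mannwhitneyu` as the two-tailed unpaired Wilcoxon (rank-sum) test, and assumes Benjamini-Hochberg as the adjustment method, which the caption does not name. The function names `bh_adjust` and `differential_coverage` are invented for this example.

```python
import numpy as np
from scipy.stats import mannwhitneyu


def bh_adjust(pvals):
    """Benjamini-Hochberg adjusted p-values.

    Assumption: the caption does not state which adjustment was used.
    """
    p = np.asarray(pvals, dtype=float)
    n = p.size
    order = np.argsort(p)
    # Scale each sorted p-value by n / rank.
    scaled = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working down from the largest p-value.
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    adjusted = np.empty(n)
    adjusted[order] = np.clip(scaled, 0.0, 1.0)
    return adjusted


def differential_coverage(coverage, is_case):
    """Test each BGC for a coverage difference between two cohorts.

    coverage: array of shape (n_bgcs, n_samples), hypothetical layout.
    is_case:  boolean mask over samples (True = disease cohort).
    Returns (unadjusted p-values, adjusted p-values).
    """
    coverage = np.asarray(coverage, dtype=float)
    is_case = np.asarray(is_case, dtype=bool)
    pvals = np.array([
        mannwhitneyu(row[is_case], row[~is_case],
                     alternative="two-sided").pvalue
        for row in coverage
    ])
    return pvals, bh_adjust(pvals)


# Toy data: BGC 0 differs clearly between cohorts, BGC 1 does not.
coverage = [
    [10, 11, 12, 13, 14, 1, 2, 3, 4, 5],
    [1, 4, 5, 8, 9, 2, 3, 6, 7, 10],
]
is_case = [True] * 5 + [False] * 5
pvals, padj = differential_coverage(coverage, is_case)
```

A volcano plot like the one described would then place an effect size on the x-axis and `-log10(pvals)` on the y-axis, with horizontal lines at the unadjusted 0.05 threshold and at the unadjusted p-value that corresponds to an adjusted value of 0.05.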