Skip to main content
. Author manuscript; available in PMC: 2025 Sep 1.
Published in final edited form as: Nat Metab. 2025 Feb 18;7(3):617–630. doi: 10.1038/s42255-025-01220-1

Extended Data Fig. 1 ∣. MEDI benchmarks.

Extended Data Fig. 1 ∣

(a) Genomic distance (1 - ANI) vs. macronutrient distance (euclidean, in g/100 g). The blue line denotes a smooth spline regression and shaded area denotes the 95% confidence interval of the mean spline regression. (b) Benchmark of cached and batched processing using MEDI (6 CPUs per process, see Methods). 888 samples were divided into two batches of 500 and 388 FASTQ files and processes separately in parallel. Each point denotes a single FASTQ file and colors denote the batch. Vertical line denotes median classification rate. (c) Relationship between (haploid) genome/assembly size and food abundance in the iHMP data set. Shown are only genomes/assemblies with at least 1 million basepairs.