Skip to main content
. Author manuscript; available in PMC: 2024 Jan 12.
Published in final edited form as: Nat Mach Intell. 2023 Mar 13;5(3):284–293. doi: 10.1038/s42256-023-00627-3

Extended Data Figure 5: The training sample size needed to reach a great predictive performance scales linearly with the number of species in synthetic data for MelonnPan, MiMeNet, and mNODE.

Extended Data Figure 5:

Synthetic data in this figure are generated by the microbial consumer-resource model with nutrient sampling probability pn=1.0. For the case with 100 species and varying number of metabolites (100, 200, or 300), three metrics are used for comparing model performances: the mean SCC ρ, the top-50 mean SCC ρ50, and the number of metabolites with SCCs larger than 0.8 divided by the number of metabolites Nρ>0.8/Nm.