Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2025 Dec 3:2025.12.01.691707. [Version 1] doi: 10.64898/2025.12.01.691707

A foundation model for microbial growth dynamics

Zachary A Holmes, Irida Shyti, Alexandra L Hoffman, Katherine E Duncker, Helena R Ma, Zhengqing Zhou, Dongheon Lee, Rohan Maddamsetti, Kyeri Kim, Emrah Şimşek, Grayson S Hamrick, Hyein Son, César A Villalobos, Jia Lu, Yuanchi Ha, Ashwini R Shende, Zhixiang Yao, Sizhe Liu, Daniel M Shapiro, Kseniia Kholina, Harris Davis, Yasa Baig, Feilun Wu, Shangying Wang, Xiran Wang, Pranam Chatterjee, Michael Lynch, Allison J Lopatkin, Lawrence David, Emma Chory, Lingchong You
PMCID: PMC12822751  PMID: 41573938

Abstract

Microbial growth dynamics contain rich information about microbial populations, which support applications from antibiotic testing to microbiome engineering. However, the high dimensionality of growth data and the scarcity of large, task-specific datasets have limited generalizable modeling analysis across systems. Here, we develop a foundation model for microbial growth dynamics. It is a large-scale, self-supervised representation model trained on ∼370,000 experimental and simulated growth curves spanning diverse microbial species, environmental conditions, and community contexts. The model learns lower-dimensional latent embeddings that capture essential dynamical features of raw growth data and enable accurate reconstruction of these data. The concise representations enhance predictive performance in diverse downstream applications. Using these embedding, we achieve few-shot learning for antibiotic classification and concentration prediction, accurate forecasting of simulated and experimental communities, and inference of total abundance from relative-abundance data. By extracting transferable representations from heterogeneous datasets, our model provides a general framework for analyzing and predicting microbial community dynamics from limited measurements.

Full Text Availability

The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES