Emerging whole-cell modeling principles and methods

Arthur P Goldberg; Balázs Szigeti; Yin Hoon Chew; John A P Sekar; Yosef D Roth; Jonathan R Karr

doi:10.1016/j.copbio.2017.12.013

. Author manuscript; available in PMC: 2019 Jun 1.

Published in final edited form as: Curr Opin Biotechnol. 2017 Dec 21;51:97–102. doi: 10.1016/j.copbio.2017.12.013

Emerging whole-cell modeling principles and methods

Arthur P Goldberg ^1,^2,^*, Balázs Szigeti ^1,^2,^*, Yin Hoon Chew ^1,^2,^*, John A P Sekar ^1,^2,^*, Yosef D Roth ^1,², Jonathan R Karr ^1,^2,^†

PMCID: PMC5997489 NIHMSID: NIHMS928198 PMID: 29275251

Abstract

Whole-cell computational models aim to predict cellular phenotypes from genotype by representing the entire genome, the structure and concentration of each molecular species, each molecular interaction, and the extracellular environment. Whole-cell models have great potential to transform bioscience, bioengineering, and medicine. However, numerous challenges remain to achieve whole-cell models. Nevertheless, researchers are beginning to leverage recent progress in measurement technology, bioinformatics, data sharing, rule-based modeling, and multi-algorithmic simulation to build the first whole-cell models. We anticipate that ongoing efforts to develop scalable whole-cell modeling tools will enable dramatically more comprehensive and more accurate models, including models of human cells.

Graphical abstract

graphic file with name nihms928198u1.jpg

INTRODUCTION

Whole-cell (WC) computational models aim to predict cellular phenotypes from genotype and the environment by representing the function of each gene, gene product, and metabolite [1]. WC models could unify our understanding of cell biology and enable researchers to perform in silico experiments with complete control, scope, and resolution [2, 3]. WC models could also help bioengineers rationally design microorganisms that can produce useful chemicals and act as biosensors, and help physicians design personalized therapies tailored to each patient's genome.

Despite their potential, there is little consensus on how WC models should represent cells, what phenotypes WC models should predict, or how to achieve WC models. Nevertheless, we and others are beginning to leverage advances in measurement technology, bioinformatics, rule-based modeling, and multi-algorithmic simulation to develop WC models [4–9]. However, substantial work remains to achieve WC models [10, 11].

To build consensus on WC modeling, we propose a set of key physical and chemical mechanisms that WC models should aim to represent, and a set of key phenotypes that WC models should aim to predict. We also summarize the experimental and computational progress that is making WC modeling feasible, and outline several technological advances that would help accelerate WC modeling.

Note, our proposals focus on defining WC models that are needed for research studies and applications such as bioengineering and personalized medicine which depend on understanding the molecular details of the majority of intracellular processes. However, research that depends on fewer intracellular processes could be served by smaller, more focused models.

PHYSICS AND CHEMISTRY THAT WC MODELS SHOULD AIM TO REPRESENT

We propose that WC models aim to represent all of the chemical reactions in a cell and all of the physical processes that influence their rates (Figure 1a). This requires representing (a) the sequence of each chromosome, RNA, and protein; the location of each chromosomal feature, including each gene, operon, promoter, and terminator; and the location of each site on each RNA and protein; (b) the structure of each molecule, including atom-level information about small molecules, the domains and sites of macromolecules, and the subunit composition of complexes; (c) the subcellular organization of cells into organelles and microdomains; (d) the participants and effect of each molecular interaction, including the molecules that are consumed, produced, and transported, the molecular sites that are modified, and the bonds that are broken and formed, (e) the kinetic parameters of each interaction; (f) the concentration of each species in each organelle and microdomain; and (g) the concentration of each species in the extracellular environment. In addition, to enable WC models to be rigorously tested, each WC model should represent a single, well-defined experimental system. To minimize the complexity of WC models, we recommend modeling small, fast-growing, non-adherent, autonomous, self-renewing cells growing on defined, rich, homogeneous media. Together, this would enable WC models to describe how cellular behavior emerges from the combined function of each gene and genetic variant, and capture how cells respond to changes in their internal and external environments.

PHENOTYPES THAT WC MODELS SHOULD AIM TO PREDICT

We also propose that WC models aim to predict the behavioral trajectories of single cells over their life cycles, with each simulation representing a different cell within a heterogeneous clonal population (Figure 1b). This should include behaviors within individual cells such as the stochastic dynamics of each molecular interaction; the temporal dynamics of the concentration of each species; the spatial dynamics of the concentration of each species in each organelle and microdomain; and complex phenotypes such as cell shape, growth rate, motility, and fate, as well as the variation in the behavior of single cells within clonal populations. Together, this would enable WC models to capture how stochastic and single-cell variation can generate phenotypic diversity; how a cell responds to external cues such as nutrients, growth factors and drugs; and how a cell coordinates critical events such as the G1/S transition. This would also enable WC models to generate predictions that could be embedded into higher-order multiscale models. For example, WC models could predict the timing and speed of chemotaxis, which could help multiscale models predict tumor metastasis.

AVAILABLE RESOURCES

Achieving WC models will require extensive data to constrain every parameter. Fortunately, measurement technology is rapidly advancing. Here, we review the latest methods for generating data for WC models, and highlight repositories and other resources that contain useful data for WC modeling.

Measurement methods

Advances in single-cell and genomic measurement are rapidly generating data that could be used for WC modeling [12–14] (Table S1). For example, Meth-Seq can assess epigenetic modifications [15], Hi-C can determine chromosome structures [16], ChIP-seq can determine protein-DNA interactions [17], fluorescence microscopy can determine protein localizations, mass-spectrometry can quantitate metabolite and protein concentrations, FISH [18] and scRNA-seq [19] can quantitate the dynamics and single-cell variation of RNA abundances, and fluorescence microscopy and mass cytometry [20] can quantitate the dynamics and single-cell variation of protein abundances. In particular, WC models can be constrained by combining high-dimensional measurement methods with multiple genetic and environmental perturbations, frequent temporal observations, and cutting-edge distributed parameter estimation methods. However, substantial work remains to develop methods that can measure non-model organisms including small, slow-growing, and unculturable cells.

Data repositories

Researchers are also rapidly aggregating much of the data needed for WC modeling into public repositories (Table S2). For example, UniProt contains a multitude of information about proteins [21]; BioCyc contain extensive information about interactions [22]; ECMDB [23], ArrayExpress [24], and PaxDb [25] contain metabolite, RNA, and protein abundances, respectively; and SABIO-RK contains kinetic parameters [26]. Furthermore, meta-databases such as Nucleic Acid Research's Database Summary contain lists of repositories [27].

Prediction tools

For certain types of data, accurate prediction tools can be superior to direct experimental evidence which may have incomplete coverage or may be limited to a small number of genotypes and environments. Currently, many tools can predict properties such as operons, RNA folds, and protein localizations (Table S3). For example, PSORTb predicts the localization of bacterial proteins [28]. However, many current prediction tools lack sufficient accuracy for WC modeling.

Published models

WC models can also incorporate separately published models of individual pathways. Currently, there are several model repositories which contain numerous cell cycle, circadian rhythm, electrical signaling, signal transduction, and metabolism models (Table S4 S5). However, most pathways such as RNA degradation do not yet have genome-scale dynamical models, many reported models are not publicly available, and it is difficult to merge most published models because they often use different assumptions and representations.

Emerging methods and tools

Recent advances in data aggregation, model design, model representation, and simulation (Table S6) are also rapidly making WC modeling feasible. We expect that ongoing efforts to adapt and combine these advances will accelerate WC modeling [9] (Figure 2). Here, we summarize the most important emerging methods and tools for WC modeling.

Emerging WC modeling methodology. (a) Data should be aggregated from thousands of publications, repositories, and prediction tools and organized into a PGDB. (b) Models should be designed, calibrated, and validated from PGDBs and described using rules. (c) Models should be simulated using parallel, network-free, multi-algorithmic simulators and their results should be stored in a database. (d) Simulation results should be visualized and analyzed. (e) Results should be validated by comparison to experimental measurements. Importantly, all of these steps should be collaborative.

Data aggregation and organization

For optimal accuracy and scope, WC modeling should be tightly coupled with targeted experimentation. Nevertheless, we believe that WC modeling currently can be most cost-effectively advanced by leveraging the extensive array of public data. To make this public data usable for modeling, researchers are developing automated methods for extracting data from publications [29], building central public repositories [30], and creating tools for programmatically accessing repositories [32]. Pathway/genome database (PGDB) tools such as Pathway Tools [32] are well-suited to organizing this data because they support structured representations of metabolites, DNA, RNA, proteins, and their interactions. However, they provide limited support for non-metabolic pathways and quantitative data. To overcome these limitations, we developed the WholeCellKB tool to organize data for WC modeling [33].

Scalable model design

Several new tools can help researchers develop large models. For example, the Cell Collective facilitates collaborative model design [34], MetaFlux facilitates the design of constraint-based models from PGDBs [35], PySB facilitates programmatic model construction [36], SEEK facilitates model design from data tables [37], and Virtual Cell facilitates model design from KEGG and SABIO-RK [38].

Model languages

Researchers have developed several languages for representing biochemical models. SBML can represent several types of models including flux balance analysis models, deterministic dynamical models, and stochastic dynamical models [39]. Rule-based languages such as BioNetGen can efficiently describe the combinatorial complexity of protein-protein interactions [40].

Simulation

Numerous tools can simulate biomodels. For example, COPASI [41] and Virtual Cell [38] support deterministic, stochastic, hybrid deterministic/stochastic, network-free, and spatial simulation; COBRApy supports constraint-based simulation [42]; and E-Cell supports multi-algorithmic simulation [43].

Calibration

New tools such as saCeSS [44] support distributed calibration of large biochemical models. In addition, aerospace and mechanical engineers have developed methods for using reduced surrogate models to efficiently calibrate large models [45].

Verification

Researchers have begun to adapt formal model checking techniques to biomodeling. For example, BioLab [46] and PRISM [47] can verify BioNetGen-encoded and SBML-encoded models, respectively.

Simulation results analysis

Tools such as COPASI [41] and Virtual Cell [38] can visualize simulation results. We have developed the WholeCellSimDB [48] simulation results database to help researchers organize, search, and share WC simulation results. We have also developed the WholeCellViz [49] simulation results dashboard to help researchers visualize WC simulation results in their biological context.

TECHNOLOGICAL CHALLENGES

Beyond these emerging tools, several technological advances are needed to enable WC models. Here, we summarize the most critically needed technologies.

Experimental measurement

While substantial data about cellular populations already exists, additional data would enable better models. In particular, we need metabolome-wide and proteome-wide measurement technologies that can quantitate the dynamics and single-cell variation of each metabolite and protein. Additionally, we need technologies that can measure kinetic parameters at the interactome scale and technologies that can measure cellular phenotypes across multiple genetic and environmental conditions. Furthermore, to enable WC models of a broad range of organisms, we also need technologies that can measure non-model organisms, including small, slow-growing, motile, and unculturable organisms.

Prediction tools

While existing tools can predict many properties of metabolites, DNA, RNA, and proteins, additional tools are needed to accurately predict the molecular effects of insertions, deletions, and structural variants. Such tools would help WC models design microbial genomes and predict the phenotypes of individual patients.

Data aggregation

As described above, extensive data is now available for WC modeling. However, this data is scattered across many repositories and publications; spans a wide range of data types, organisms, and environments; is described using inconsistent identifiers and units; and often is not annotated or normalized. To make this data more usable for modeling, we are developing a framework for aggregating data from repositories; merging data from multiple species, environmental conditions, and experimental procedures; standardizing data to common units; and identifying the most relevant data for a model.

Scalable, data-driven model design

To scale WC modeling, we need tools for collaboratively building large models directly from experimental data, recording how data is used to build models, and identifying gaps and inconsistencies in models. As described above, several tools support each of these functions. To accelerate WC modeling, the field must develop an extensible platform that supports all of these functions at the scale required for WC modeling.

Rule-based model representation

Several languages can represent individual biological processes, but no existing language supports all of the biological processes that WC models must represent [50, 51]. To overcome this limitation, we are developing a rule-based language that can represent each molecular species at multiple levels of granularity (for example, as a single species, as a set of sites, and as a sequence); the combinatorial complexity of each molecular species and interaction; composite, multi-algorithmic models; and the data used to build models.

Scalable multi-algorithmic simulation

Simulating WC models requires a simulator that supports both network-free interpretation of rule-based model descriptions and multi-algorithmic co-simulation of submodels that are described using different simulation algorithms. However, no existing simulator supports both network-free and multi-algorithmic simulation. To scalably simulate WC models, we are using Rete algorithms and parallel discrete event simulation to develop a parallel, network-free, multi-algorithmic simulator [9].

Calibration and verification

Scalable tools are needed to calibrate and verify WC models. Although we and others have begun to explore surrogate strategies for efficiently calibrating and validating WC models [52], further work is needed to formalize these methods.

Simulation analysis

We and others have developed tools for organizing and visualizing simulation results, but they provided limited support for large datasets or custom visualizations such as pathway maps. To visualize WC simulation results, researchers should use distributed database and data processing technologies to search and reduce simulation results, standard visualization grammars to enable flexible and custom visualizations, and high-performance visualization toolkits to handle terabyte-scale simulation results.

Collaboration

Ultimately, achieving WC models will require extensive teamwork. To facilitate collaboration, the field must develop collaborative model design tools, version control systems for models, standards for annotating and verifying submodels, and protocols for merging separately developed submodels.

CONCLUSION

WC models have great potential to advance bioscience, bioengineering, and medicine. However, significant challenges remain to achieve WC models. To advance WC modeling, we have proposed how WC models should represent cells and the phenotypes that WC models should predict, and summarized the best emerging methods and resources. We have also outlined several technological solutions to the most immediate WC modeling challenges. Specifically, we must develop new tools for scalably and collaboratively designing, simulating, calibrating, validating and analyzing models. We must also develop new methods for measuring the dynamics and single-cell variation of the metabolome and proteome and for measuring kinetic parameters at the interactome scale. Despite these challenges, we and others are building the first WC models, developing the first WC modeling tools, and beginning to form a WC modeling community [50, 52]. We anticipate that these efforts will enable comprehensive models of cells.

Supplementary Material

supplement

NIHMS928198-supplement.xlsx^{(80.5KB, xlsx)}

Highlights.

Whole-cell models predict phenotype from genotype by representing each gene function
Whole-cell models could transform bioscience, bioengineering, and medicine
There are many challenges to achieve whole-cell models
New measurement and modeling technologies are rapidly enabling whole-cell modeling
Ongoing efforts to build scalable modeling tools will accelerate whole-cell modeling

Acknowledgments

We thank Saahith Pochiraju for critical feedback. This work was supported by a National Institute of Health MIRA award [grant number 1 R35 GM119771-01]; a National Science Foundation INSPIRE award [grant number 1649014]; and the National Science Foundation/ERASynBio [grant numbers 1548123, 335672].

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

REFERENCES AND RECOMMENDED READING

Papers of particular interest, published within the period of review, have been highlighted as:

• of special interest

•• outstanding interest

1.Karr JR, Takahashi K, Funahashi A. The principles of whole-cell modeling. Curr Opin Microbiol. 2015;27:18–24. doi: 10.1016/j.mib.2015.06.004. •• Describes the principles of WC modeling including mechanistic representation of each gene, molecular species, and molecular interaction over the lifecycle of an organism. [DOI] [PubMed] [Google Scholar]
2.Tomita M. Whole-cell simulation: a grand challenge of the 21st century. Trends Biotechnol. 2001;19:205–210. doi: 10.1016/s0167-7799(01)01636-5. • Describes the need to develop WC models to understand biology and personalize medicine. [DOI] [PubMed] [Google Scholar]
3.Carrera J, Covert MW. Why build whole-cell models? Trends Cell Biol. 2015;25:719–722. doi: 10.1016/j.tcb.2015.09.004. •• Describes several potential uses of WC models including data integration, knowledge evaluation, phenotype prediction, hypothesis generation, and genome design. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Tomita M, Hashimoto K, Takahashi K, Shimizu TS, Matsuzaki Y, Miyoshi F, Saito K, Tanida S, Yugi K, Venter JC, et al. E-CELL: software environment for whole-cell simulation. Bioinformatics. 1999;15:72–84. doi: 10.1093/bioinformatics/15.1.72. • Describes an early effort to build models that represent each gene product. [DOI] [PubMed] [Google Scholar]
5.Atlas J, Nikolaev E, Browning S, Shuler M. Incorporating genome-wide DNA sequence information into a dynamic whole-cell model of Escherichia coli: application to DNA replication. IET Syst Biol. 2008;2:369–382. doi: 10.1049/iet-syb:20070079. • Describes an early effort to build WC models. [DOI] [PubMed] [Google Scholar]
6.Roberts E, Magis A, Ortiz JO, Baumeister W, Luthey-Schulten Z. Noise contributions in an inducible genetic switch: a whole-cell simulation study. PLoS Comput Biol. 2011;7:e1002010. doi: 10.1371/journal.pcbi.1002010. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Karr JR, Sanghvi JC, Macklin DN, Gutschow MV, Jacobs JM, Bolival B, Assad-Garcia N, Glass JI, Covert MW. A whole-cell computational model predicts phenotype from genotype. Cell. 2012;150:389–401. doi: 10.1016/j.cell.2012.05.044. •• Describes a novel approach for combining heterogeneous data and models, and the first model that represents the function of each characterized gene of an organism. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Bordbar A, McCloskey D, Zielinski DC, Sonnenschein N, Jamshidi N, Palsson BO. Personalized whole-cell kinetic models of metabolism for discovery in genomics and pharmacodynamics. Cell Syst. 2015;1:283–292. doi: 10.1016/j.cels.2015.10.003. • Describes the development of genome-scale kinetic models of cellular metabolism. [DOI] [PubMed] [Google Scholar]
9.Goldberg AP, Chew YH, Karr JR. Toward scalable whole-cell modeling of human cells; Proc 2016 Annu ACM Conf SIGSIM Princip Adv Discrete SimulACM; 2016. pp. 259–262. • Proposes a parallel algorithm for simulating multi-algorithmic WC models. [Google Scholar]
10.Szigeti B, Roth YD, Sekar JAP, Goldberg AP, Pochiraju S, Karr JR. A blueprint for human whole-cell modeling. Curr Opin Syst Biol. 2018;7:8–15. doi: 10.1016/j.coisb.2017.10.005. •• Describes the major bottlenecks to WC modeling and proposes a community project to overcome these bottlenecks. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Macklin DN, Ruggero NA, Covert MW. The future of whole-cell modeling. Curr Opin Biotechnol. 2014;28:111–115. doi: 10.1016/j.copbio.2014.01.012. •• Summarizes the challenges to achieve WC models including aggregating the data needed to build WC models; merging pathway submodels; simulating, calibrating, and validating large models; and coordinating large modeling teams. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Macaulay IC, Voet T. Single cell genomics: advances and future perspectives. PLoS Genet. 2014;10:e1004126. doi: 10.1371/journal.pgen.1004126. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Altelaar AM, Munoz J, Heck AJ. Next-generation proteomics: towards an integrative view of proteome dynamics. Nat Rev Genet. 2013;14:35. doi: 10.1038/nrg3356. [DOI] [PubMed] [Google Scholar]
14.Fuhrer T, Zamboni N. High-throughput discovery metabolomics. Curr Opinion Biotechnol. 2015;31:73–78. doi: 10.1016/j.copbio.2014.08.006. [DOI] [PubMed] [Google Scholar]
15.Laird PW. Principles and challenges of genome-wide DNA methylation analysis. Nat Rev Genetics. 2010;11:191. doi: 10.1038/nrg2732. [DOI] [PubMed] [Google Scholar]
16.Dekker J, Marti-Renom MA, Mirny LA. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat Rev Genet. 2013;14:390. doi: 10.1038/nrg3454. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Park PJ. ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet. 2009;10:669. doi: 10.1038/nrg2641. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Lee JH, Daugharthy ER, Scheiman J, Kalhor R, Yang JL, Ferrante TC, Terry R, Jeanty SS, Li C, Amamoto R, et al. Highly multiplexed subcellular RNA sequencing in situ. Science. 2014;343:1360–1363. doi: 10.1126/science.1250212. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Saliba AE, Westermann AJ, Gorski SA, Vogel J. Single-cell RNA-seq: advances and future challenges. Nucleic Acids Res. 2014;42:8845–8860. doi: 10.1093/nar/gku555. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Bendall SC, Nolan GP, Roederer M, Chattopadhyay PK. A deep profiler’s guide to cytometry. Trends Immunol. 2012;33:323–332. doi: 10.1016/j.it.2012.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Consortium TU. UniProt: the universal protein knowledgebase. Nucleic Acids Res. 2017;45:D158–D169. doi: 10.1093/nar/gkw1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Caspi R, Billington R, Ferrer L, Foerster H, Fulcher CA, Keseler IM, Kothari A, Krummenacker M, Latendresse M, Mueller LA, et al. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 2016;44:D471–D480. doi: 10.1093/nar/gkv1164. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Sajed T, Marcu A, Ramirez M, Pon A, Guo AC, Knox C, Wilson M, Grant JR, Djoumbou Y, Wishart DS. ECMDB 2.0: A richer resource for understanding the biochemistry of E. coli. Nucleic Acids Res. 2016;44:D495–D501. doi: 10.1093/nar/gkv1060. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Kolesnikov N, Hastings E, Keays M, Melnichuk O, Tang YA, Williams E, Dylag M, Kurbatova N, Brandizi M, Burdett T, et al. ArrayExpress update–simplifying data submissions. Nucleic Acids Res. 2015;43:D1113–D1116. doi: 10.1093/nar/gku1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Wang M, Herrmann CJ, Simonovic M, Szklarczyk D, Mering C. Version 4.0 of PaxDb: protein abundance data, integrated across model organisms, tissues, and cell-lines. Proteomics. 2015;15:3163–3168. doi: 10.1002/pmic.201400441. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Wittig U, Kania R, Golebiewski M, Rey M, Shi L, Jong L, Algaa E, Weidemann A, Sauer-Danzwith H, Mir S, et al. SABIO-RK–database for biochemical reaction kinetics. Nucleic Acids Res. 2012;40:D790–D796. doi: 10.1093/nar/gkr1046. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Galperin MY, Fernández-Suárez XM, Rigden DJ. The 24th annual Nucleic Acids Research database issue: a look back and upcoming changes. Nucleic Acids Res. 2017;45:D1–D11. doi: 10.1093/nar/gkw1188. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Yu NY, Wagner JR, Laird MR, Melli G, Rey S, Lo R, Dao P, Sahinalp SC, Ester M, Foster LJ, et al. PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics. 2010;26:1608–1615. doi: 10.1093/bioinformatics/btq249. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Cohen PR. DARPA’s Big Mechanism program. Phys Biol. 2015;12:045008. doi: 10.1088/1478-3975/12/4/045008. [DOI] [PubMed] [Google Scholar]
30.Pampel H, Vierkant P, Scholze F, Bertelmann R, Kindling M, Klump J, Goebelbecker HJ, Gundlach J, Schirmbacher P, Dierolf U. Making research data repositories visible: The re3data.org Registry. PloS One. 2013;8:e78080. doi: 10.1371/journal.pone.0078080. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Cokelaer T, Pultz D, Harder LM, Serra-Musach J, Saez-Rodriguez J. BioServices: a common Python package to access biological web services programmatically. Bioinformatics. 2013;29:3241–3242. doi: 10.1093/bioinformatics/btt547. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Karp PD, Latendresse M, Paley SM, Krummenacker M, Ong QD, Billington R, Kothari A, Weaver D, Lee T, Subhraveti P, et al. Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology. Brief Bioinform. 2016;17:877–890. doi: 10.1093/bib/bbv079. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Karr JR, Sanghvi JC, Macklin DN, Arora A, Covert MW. WholeCellKB: model organism databases for comprehensive whole-cell models. Nucleic Acids Res. 2013;41:D787–D792. doi: 10.1093/nar/gks1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Helikar T, Kowal B, Rogers J. A cell simulator platform: the Cell Collective. Clin Pharmacol Ther. 2013;93:393–395. doi: 10.1038/clpt.2013.41. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Latendresse M, Krummenacker M, Trupp M, Karp PD. Construction and completion of flux balance models from pathway databases. Bioinformatics. 2012;28:388–396. doi: 10.1093/bioinformatics/btr681. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Lopez CF, Muhlich JL, Bachman JA, Sorger PK. Programming biological models in Python using PySB. Mol Syst Biol. 2013;9:646. doi: 10.1038/msb.2013.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Wolstencroft K, Owen S, Krebs O, Nguyen Q, Stanford NJ, Golebiewski M, Weidemann A, Bittkowski M, An L, Shockley D, et al. SEEK: a systems biology data and model management platform. BMC Syst Biol. 2015;9:33. doi: 10.1186/s12918-015-0174-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Resasco DC, Gao F, Morgan F, Novak IL, Schaff JC, Slepchenko BM. Virtual Cell: computational tools for modeling in cell biology. Wiley Interdiscip Rev Syst Biol Med. 2012;4:129–140. doi: 10.1002/wsbm.165. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, Arkin AP, Bornstein BJ, Bray D, Cornish-Bowden A, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19:524–531. doi: 10.1093/bioinformatics/btg015. [DOI] [PubMed] [Google Scholar]
40.Harris LA, Hogg JS, Tapia JJ, Sekar JA, Gupta S, Korsunsky I, Arora A, Barua D, Sheehan RP, Faeder JR. BioNetGen 2.2: advances in rule-based modeling. Bioinformatics. 2016;32:3366–3368. doi: 10.1093/bioinformatics/btw469. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Mendes P, Hoops S, Sahle S, Gauges R, Dada J, Kummer U. Computational modeling of biochemical networks using COPASI. Methods Mol Biol. 2009;500:17–59. doi: 10.1007/978-1-59745-525-1_2. [DOI] [PubMed] [Google Scholar]
42.Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. Cobrapy: Constraints-based reconstruction and analysis for python. BMC Syst Biol. 2013;7:74. doi: 10.1186/1752-0509-7-74. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Dhar PK, Takahashi K, Nakayama Y, Tomita M. E-Cell: Computer simulation of the cell. Rev Cell Biol Mol Med. 2012 • Describes a multi-algorithmic simulator. [Google Scholar]
44.Penas DR, González P, Egea JA, Doallo R, Banga JR. Parameter estimation in large-scale systems biology models: a parallel and self-adaptive cooperative strategy. BMC Bioinformatics. 2017;18:52. doi: 10.1186/s12859-016-1452-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Forrester AI, Keane AJ. Recent advances in surrogate-based optimization. Prog Aerospace Sci. 2009;45:50–79. [Google Scholar]
46.Clarke EM, Faeder JR, Langmead CJ, Harris LA, Jha SK, Legay A. Statistical model checking in BioLab: Applications to the automated analysis of T-cell receptor signaling pathway; Int Conf Comput Meth Syst Biol; 2008. pp. 231–250. [Google Scholar]
47.Kwiatkowska M, Norman G, Parker D. PRISM 4.0: Verification of probabilistic real-time systems. Computer Aided Verification. 2011:585–591. [Google Scholar]
48.Karr JR, Phillips NC, Covert MW. WholeCellSimDB: a hybrid relational/HDF database for whole-cell model predictions. Database. 2014:bau095. doi: 10.1093/database/bau095. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Lee R, Karr JR, Covert MW. WholeCellViz: data visualization for whole-cell models. BMC Bioinformatics. 2013;14:253. doi: 10.1186/1471-2105-14-253. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Waltemath D, Karr JR, Bergmann FT, Chelliah V, Hucka M, Krantz M, Liebermeister W, Mendes P, Myers CJ, Pir P, et al. Toward community standards and software for whole-cell modeling. IEEE Trans Biomed Eng. 2016;63:2007–2014. doi: 10.1109/TBME.2016.2560762. [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Medley JK, Goldberg AP, Karr JR. Guidelines for reproducibly building and simulating systems biology models. IEEE Trans Biomed Eng. 2016;63:2015–2020. doi: 10.1109/TBME.2016.2591960. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Karr JR, Williams AH, Zucker JD, Raue A, Steiert B, Timmer J, Kreutz C, Wilkinson S, Allgood BA, Bot BM, et al. Summary of the DREAM8 parameter estimation challenge: toward parameter identification for whole-cell models. PLoS Comput Biol. 2015;11:e1004096. doi: 10.1371/journal.pcbi.1004096. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplement

NIHMS928198-supplement.xlsx^{(80.5KB, xlsx)}

[R1] 1.Karr JR, Takahashi K, Funahashi A. The principles of whole-cell modeling. Curr Opin Microbiol. 2015;27:18–24. doi: 10.1016/j.mib.2015.06.004. •• Describes the principles of WC modeling including mechanistic representation of each gene, molecular species, and molecular interaction over the lifecycle of an organism. [DOI] [PubMed] [Google Scholar]

[R2] 2.Tomita M. Whole-cell simulation: a grand challenge of the 21st century. Trends Biotechnol. 2001;19:205–210. doi: 10.1016/s0167-7799(01)01636-5. • Describes the need to develop WC models to understand biology and personalize medicine. [DOI] [PubMed] [Google Scholar]

[R3] 3.Carrera J, Covert MW. Why build whole-cell models? Trends Cell Biol. 2015;25:719–722. doi: 10.1016/j.tcb.2015.09.004. •• Describes several potential uses of WC models including data integration, knowledge evaluation, phenotype prediction, hypothesis generation, and genome design. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Tomita M, Hashimoto K, Takahashi K, Shimizu TS, Matsuzaki Y, Miyoshi F, Saito K, Tanida S, Yugi K, Venter JC, et al. E-CELL: software environment for whole-cell simulation. Bioinformatics. 1999;15:72–84. doi: 10.1093/bioinformatics/15.1.72. • Describes an early effort to build models that represent each gene product. [DOI] [PubMed] [Google Scholar]

[R5] 5.Atlas J, Nikolaev E, Browning S, Shuler M. Incorporating genome-wide DNA sequence information into a dynamic whole-cell model of Escherichia coli: application to DNA replication. IET Syst Biol. 2008;2:369–382. doi: 10.1049/iet-syb:20070079. • Describes an early effort to build WC models. [DOI] [PubMed] [Google Scholar]

[R6] 6.Roberts E, Magis A, Ortiz JO, Baumeister W, Luthey-Schulten Z. Noise contributions in an inducible genetic switch: a whole-cell simulation study. PLoS Comput Biol. 2011;7:e1002010. doi: 10.1371/journal.pcbi.1002010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Karr JR, Sanghvi JC, Macklin DN, Gutschow MV, Jacobs JM, Bolival B, Assad-Garcia N, Glass JI, Covert MW. A whole-cell computational model predicts phenotype from genotype. Cell. 2012;150:389–401. doi: 10.1016/j.cell.2012.05.044. •• Describes a novel approach for combining heterogeneous data and models, and the first model that represents the function of each characterized gene of an organism. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Bordbar A, McCloskey D, Zielinski DC, Sonnenschein N, Jamshidi N, Palsson BO. Personalized whole-cell kinetic models of metabolism for discovery in genomics and pharmacodynamics. Cell Syst. 2015;1:283–292. doi: 10.1016/j.cels.2015.10.003. • Describes the development of genome-scale kinetic models of cellular metabolism. [DOI] [PubMed] [Google Scholar]

[R9] 9.Goldberg AP, Chew YH, Karr JR. Toward scalable whole-cell modeling of human cells; Proc 2016 Annu ACM Conf SIGSIM Princip Adv Discrete SimulACM; 2016. pp. 259–262. • Proposes a parallel algorithm for simulating multi-algorithmic WC models. [Google Scholar]

[R10] 10.Szigeti B, Roth YD, Sekar JAP, Goldberg AP, Pochiraju S, Karr JR. A blueprint for human whole-cell modeling. Curr Opin Syst Biol. 2018;7:8–15. doi: 10.1016/j.coisb.2017.10.005. •• Describes the major bottlenecks to WC modeling and proposes a community project to overcome these bottlenecks. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Macklin DN, Ruggero NA, Covert MW. The future of whole-cell modeling. Curr Opin Biotechnol. 2014;28:111–115. doi: 10.1016/j.copbio.2014.01.012. •• Summarizes the challenges to achieve WC models including aggregating the data needed to build WC models; merging pathway submodels; simulating, calibrating, and validating large models; and coordinating large modeling teams. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Macaulay IC, Voet T. Single cell genomics: advances and future perspectives. PLoS Genet. 2014;10:e1004126. doi: 10.1371/journal.pgen.1004126. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] 13.Altelaar AM, Munoz J, Heck AJ. Next-generation proteomics: towards an integrative view of proteome dynamics. Nat Rev Genet. 2013;14:35. doi: 10.1038/nrg3356. [DOI] [PubMed] [Google Scholar]

[R14] 14.Fuhrer T, Zamboni N. High-throughput discovery metabolomics. Curr Opinion Biotechnol. 2015;31:73–78. doi: 10.1016/j.copbio.2014.08.006. [DOI] [PubMed] [Google Scholar]

[R15] 15.Laird PW. Principles and challenges of genome-wide DNA methylation analysis. Nat Rev Genetics. 2010;11:191. doi: 10.1038/nrg2732. [DOI] [PubMed] [Google Scholar]

[R16] 16.Dekker J, Marti-Renom MA, Mirny LA. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat Rev Genet. 2013;14:390. doi: 10.1038/nrg3454. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] 17.Park PJ. ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet. 2009;10:669. doi: 10.1038/nrg2641. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Lee JH, Daugharthy ER, Scheiman J, Kalhor R, Yang JL, Ferrante TC, Terry R, Jeanty SS, Li C, Amamoto R, et al. Highly multiplexed subcellular RNA sequencing in situ. Science. 2014;343:1360–1363. doi: 10.1126/science.1250212. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Saliba AE, Westermann AJ, Gorski SA, Vogel J. Single-cell RNA-seq: advances and future challenges. Nucleic Acids Res. 2014;42:8845–8860. doi: 10.1093/nar/gku555. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Bendall SC, Nolan GP, Roederer M, Chattopadhyay PK. A deep profiler’s guide to cytometry. Trends Immunol. 2012;33:323–332. doi: 10.1016/j.it.2012.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Consortium TU. UniProt: the universal protein knowledgebase. Nucleic Acids Res. 2017;45:D158–D169. doi: 10.1093/nar/gkw1099. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Caspi R, Billington R, Ferrer L, Foerster H, Fulcher CA, Keseler IM, Kothari A, Krummenacker M, Latendresse M, Mueller LA, et al. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 2016;44:D471–D480. doi: 10.1093/nar/gkv1164. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] 23.Sajed T, Marcu A, Ramirez M, Pon A, Guo AC, Knox C, Wilson M, Grant JR, Djoumbou Y, Wishart DS. ECMDB 2.0: A richer resource for understanding the biochemistry of E. coli. Nucleic Acids Res. 2016;44:D495–D501. doi: 10.1093/nar/gkv1060. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Kolesnikov N, Hastings E, Keays M, Melnichuk O, Tang YA, Williams E, Dylag M, Kurbatova N, Brandizi M, Burdett T, et al. ArrayExpress update–simplifying data submissions. Nucleic Acids Res. 2015;43:D1113–D1116. doi: 10.1093/nar/gku1057. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Wang M, Herrmann CJ, Simonovic M, Szklarczyk D, Mering C. Version 4.0 of PaxDb: protein abundance data, integrated across model organisms, tissues, and cell-lines. Proteomics. 2015;15:3163–3168. doi: 10.1002/pmic.201400441. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Wittig U, Kania R, Golebiewski M, Rey M, Shi L, Jong L, Algaa E, Weidemann A, Sauer-Danzwith H, Mir S, et al. SABIO-RK–database for biochemical reaction kinetics. Nucleic Acids Res. 2012;40:D790–D796. doi: 10.1093/nar/gkr1046. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Galperin MY, Fernández-Suárez XM, Rigden DJ. The 24th annual Nucleic Acids Research database issue: a look back and upcoming changes. Nucleic Acids Res. 2017;45:D1–D11. doi: 10.1093/nar/gkw1188. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Yu NY, Wagner JR, Laird MR, Melli G, Rey S, Lo R, Dao P, Sahinalp SC, Ester M, Foster LJ, et al. PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics. 2010;26:1608–1615. doi: 10.1093/bioinformatics/btq249. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Cohen PR. DARPA’s Big Mechanism program. Phys Biol. 2015;12:045008. doi: 10.1088/1478-3975/12/4/045008. [DOI] [PubMed] [Google Scholar]

[R30] 30.Pampel H, Vierkant P, Scholze F, Bertelmann R, Kindling M, Klump J, Goebelbecker HJ, Gundlach J, Schirmbacher P, Dierolf U. Making research data repositories visible: The re3data.org Registry. PloS One. 2013;8:e78080. doi: 10.1371/journal.pone.0078080. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] 31.Cokelaer T, Pultz D, Harder LM, Serra-Musach J, Saez-Rodriguez J. BioServices: a common Python package to access biological web services programmatically. Bioinformatics. 2013;29:3241–3242. doi: 10.1093/bioinformatics/btt547. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] 32.Karp PD, Latendresse M, Paley SM, Krummenacker M, Ong QD, Billington R, Kothari A, Weaver D, Lee T, Subhraveti P, et al. Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology. Brief Bioinform. 2016;17:877–890. doi: 10.1093/bib/bbv079. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Karr JR, Sanghvi JC, Macklin DN, Arora A, Covert MW. WholeCellKB: model organism databases for comprehensive whole-cell models. Nucleic Acids Res. 2013;41:D787–D792. doi: 10.1093/nar/gks1108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] 34.Helikar T, Kowal B, Rogers J. A cell simulator platform: the Cell Collective. Clin Pharmacol Ther. 2013;93:393–395. doi: 10.1038/clpt.2013.41. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] 35.Latendresse M, Krummenacker M, Trupp M, Karp PD. Construction and completion of flux balance models from pathway databases. Bioinformatics. 2012;28:388–396. doi: 10.1093/bioinformatics/btr681. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] 36.Lopez CF, Muhlich JL, Bachman JA, Sorger PK. Programming biological models in Python using PySB. Mol Syst Biol. 2013;9:646. doi: 10.1038/msb.2013.1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R37] 37.Wolstencroft K, Owen S, Krebs O, Nguyen Q, Stanford NJ, Golebiewski M, Weidemann A, Bittkowski M, An L, Shockley D, et al. SEEK: a systems biology data and model management platform. BMC Syst Biol. 2015;9:33. doi: 10.1186/s12918-015-0174-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] 38.Resasco DC, Gao F, Morgan F, Novak IL, Schaff JC, Slepchenko BM. Virtual Cell: computational tools for modeling in cell biology. Wiley Interdiscip Rev Syst Biol Med. 2012;4:129–140. doi: 10.1002/wsbm.165. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] 39.Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, Arkin AP, Bornstein BJ, Bray D, Cornish-Bowden A, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19:524–531. doi: 10.1093/bioinformatics/btg015. [DOI] [PubMed] [Google Scholar]

[R40] 40.Harris LA, Hogg JS, Tapia JJ, Sekar JA, Gupta S, Korsunsky I, Arora A, Barua D, Sheehan RP, Faeder JR. BioNetGen 2.2: advances in rule-based modeling. Bioinformatics. 2016;32:3366–3368. doi: 10.1093/bioinformatics/btw469. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] 41.Mendes P, Hoops S, Sahle S, Gauges R, Dada J, Kummer U. Computational modeling of biochemical networks using COPASI. Methods Mol Biol. 2009;500:17–59. doi: 10.1007/978-1-59745-525-1_2. [DOI] [PubMed] [Google Scholar]

[R42] 42.Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. Cobrapy: Constraints-based reconstruction and analysis for python. BMC Syst Biol. 2013;7:74. doi: 10.1186/1752-0509-7-74. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R43] 43.Dhar PK, Takahashi K, Nakayama Y, Tomita M. E-Cell: Computer simulation of the cell. Rev Cell Biol Mol Med. 2012 • Describes a multi-algorithmic simulator. [Google Scholar]

[R44] 44.Penas DR, González P, Egea JA, Doallo R, Banga JR. Parameter estimation in large-scale systems biology models: a parallel and self-adaptive cooperative strategy. BMC Bioinformatics. 2017;18:52. doi: 10.1186/s12859-016-1452-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] 45.Forrester AI, Keane AJ. Recent advances in surrogate-based optimization. Prog Aerospace Sci. 2009;45:50–79. [Google Scholar]

[R46] 46.Clarke EM, Faeder JR, Langmead CJ, Harris LA, Jha SK, Legay A. Statistical model checking in BioLab: Applications to the automated analysis of T-cell receptor signaling pathway; Int Conf Comput Meth Syst Biol; 2008. pp. 231–250. [Google Scholar]

[R47] 47.Kwiatkowska M, Norman G, Parker D. PRISM 4.0: Verification of probabilistic real-time systems. Computer Aided Verification. 2011:585–591. [Google Scholar]

[R48] 48.Karr JR, Phillips NC, Covert MW. WholeCellSimDB: a hybrid relational/HDF database for whole-cell model predictions. Database. 2014:bau095. doi: 10.1093/database/bau095. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R49] 49.Lee R, Karr JR, Covert MW. WholeCellViz: data visualization for whole-cell models. BMC Bioinformatics. 2013;14:253. doi: 10.1186/1471-2105-14-253. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R50] 50.Waltemath D, Karr JR, Bergmann FT, Chelliah V, Hucka M, Krantz M, Liebermeister W, Mendes P, Myers CJ, Pir P, et al. Toward community standards and software for whole-cell modeling. IEEE Trans Biomed Eng. 2016;63:2007–2014. doi: 10.1109/TBME.2016.2560762. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] 51.Medley JK, Goldberg AP, Karr JR. Guidelines for reproducibly building and simulating systems biology models. IEEE Trans Biomed Eng. 2016;63:2015–2020. doi: 10.1109/TBME.2016.2591960. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R52] 52.Karr JR, Williams AH, Zucker JD, Raue A, Steiert B, Timmer J, Kreutz C, Wilkinson S, Allgood BA, Bot BM, et al. Summary of the DREAM8 parameter estimation challenge: toward parameter identification for whole-cell models. PLoS Comput Biol. 2015;11:e1004096. doi: 10.1371/journal.pcbi.1004096. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Emerging whole-cell modeling principles and methods

Arthur P Goldberg

Balázs Szigeti

Yin Hoon Chew

John A P Sekar

Yosef D Roth

Jonathan R Karr

Abstract

Graphical abstract

INTRODUCTION

PHYSICS AND CHEMISTRY THAT WC MODELS SHOULD AIM TO REPRESENT

Figure 1.

PHENOTYPES THAT WC MODELS SHOULD AIM TO PREDICT

AVAILABLE RESOURCES

Measurement methods

Data repositories

Prediction tools

Published models

Emerging methods and tools

Figure 2.

Data aggregation and organization

Scalable model design

Model languages

Simulation

Calibration

Verification

Simulation results analysis

TECHNOLOGICAL CHALLENGES

Experimental measurement

Prediction tools

Data aggregation

Scalable, data-driven model design

Rule-based model representation

Scalable multi-algorithmic simulation

Calibration and verification

Simulation analysis

Collaboration

CONCLUSION

Supplementary Material

Highlights.

Acknowledgments

Footnotes

REFERENCES AND RECOMMENDED READING

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases