MEMOTE for standardized genome-scale metabolic model testing

Christian Lieven; Moritz E Beber; Brett G Olivier; Frank T Bergmann; Meric Ataman; Parizad Babaei; Jennifer A Bartell; Lars M Blank; Siddharth Chauhan; Kevin Correia; Christian Diener; Andreas Dräger; Birgitta E Ebert; Janaka N Edirisinghe; José P Faria; Adam M Feist; Georgios Fengos; Ronan M T Fleming; Beatriz García-Jiménez; Vassily Hatzimanikatis; Wout van Helvoirt; Christopher S Henry; Henning Hermjakob; Markus J Herrgård; Ali Kaafarani; Hyun Uk Kim; Zachary King; Steffen Klamt; Edda Klipp; Jasper J Koehorst; Matthias König; Meiyappan Lakshmanan; Dong-Yup Lee; Sang Yup Lee; Sunjae Lee; Nathan E Lewis; Filipe Liu; Hongwu Ma; Daniel Machado; Radhakrishnan Mahadevan; Paulo Maia; Adil Mardinoglu; Gregory L Medlock; Jonathan M Monk; Jens Nielsen; Lars Keld Nielsen; Juan Nogales; Intawat Nookaew; Bernhard O Palsson; Jason A Papin; Kiran R Patil; Mark Poolman; Nathan D Price; Osbaldo Resendis-Antonio; Anne Richelle; Isabel Rocha; Benjamín J Sánchez; Peter J Schaap; Rahuman S Malik Sheriff; Saeed Shoaie; Nikolaus Sonnenschein; Bas Teusink; Paulo Vilaça; Jon Olav Vik; Judith A H Wodke; Joana C Xavier; Qianqian Yuan; Maksim Zakhartsev; Cheng Zhang

doi:10.1038/s41587-020-0446-y

letter

. 2020 Mar 2;38(3):272–276. doi: 10.1038/s41587-020-0446-y

MEMOTE for standardized genome-scale metabolic model testing

Christian Lieven ^1,^#, Moritz E Beber ^1,^#, Brett G Olivier ², Frank T Bergmann ³, Meric Ataman ⁴, Parizad Babaei ¹, Jennifer A Bartell ¹, Lars M Blank ⁵, Siddharth Chauhan ⁶, Kevin Correia ⁷, Christian Diener ^8,⁹, Andreas Dräger ^10,^11,¹², Birgitta E Ebert ^5,¹³, Janaka N Edirisinghe ¹⁴, José P Faria ¹⁴, Adam M Feist ^1,⁶, Georgios Fengos ⁴, Ronan M T Fleming ¹⁵, Beatriz García-Jiménez ^16,⁴⁰, Vassily Hatzimanikatis ⁴, Wout van Helvoirt ^17,¹⁸, Christopher S Henry ¹⁴, Henning Hermjakob ¹⁹, Markus J Herrgård ¹, Ali Kaafarani ¹, Hyun Uk Kim ²⁰, Zachary King ⁶, Steffen Klamt ²¹, Edda Klipp ²², Jasper J Koehorst ²³, Matthias König ²², Meiyappan Lakshmanan ²⁴, Dong-Yup Lee ^24,²⁵, Sang Yup Lee ^1,²⁰, Sunjae Lee ^26,²⁷, Nathan E Lewis ^6,²⁸, Filipe Liu ¹⁴, Hongwu Ma ²⁹, Daniel Machado ³⁰, Radhakrishnan Mahadevan ^7,³¹, Paulo Maia ³², Adil Mardinoglu ^26,²⁷, Gregory L Medlock ³³, Jonathan M Monk ⁶, Jens Nielsen ^1,³⁴, Lars Keld Nielsen ^1,¹³, Juan Nogales ¹⁶, Intawat Nookaew ^34,³⁵, Bernhard O Palsson ^1,⁶, Jason A Papin ³³, Kiran R Patil ³⁰, Mark Poolman ³⁶, Nathan D Price ⁹, Osbaldo Resendis-Antonio ⁸, Anne Richelle ²⁸, Isabel Rocha ^37,³⁸, Benjamín J Sánchez ^1,³⁴, Peter J Schaap ²³, Rahuman S Malik Sheriff ¹⁹, Saeed Shoaie ^26,²⁷, Nikolaus Sonnenschein ^1,^✉, Bas Teusink ², Paulo Vilaça ³², Jon Olav Vik ¹⁷, Judith A H Wodke ²², Joana C Xavier ³⁹, Qianqian Yuan ²⁹, Maksim Zakhartsev ¹⁷, Cheng Zhang ²⁶

¹Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark

²Systems Biology Lab, Amsterdam Institute of Molecular and Life Sciences (AIMMS), Vrije Universiteit Amsterdam, Amsterdam, the Netherlands

³BioQUANT/COS, Heidelberg University, Heidelberg, Germany

⁴Ecole Polytechnique Fédérale de Lausanne, Laboratory of Computational Systems Biotechnology, Lausanne, Switzerland

⁵iAMB-Institute of Applied Microbiology, ABBt-Aachen Biology and Biotechnology, RWTH Aachen University, Aachen, Germany

⁶Department of Bioengineering, University of California, La Jolla, CA USA

⁷Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario Canada

⁸Human Systems Biology Laboratory, Instituto Nacional de Medicina Genomica & Coordinación de la Investigación Científica-Red de Apoyo a la Investigación, UNAM, Mexico City, Mexico

⁹Institute for Systems Biology, Seattle, WA USA

¹⁰Computational Systems Biology of Infection and Antimicrobial-Resistant Pathogens, Institute for Biomedical Informatics (IBMI), Tübingen, Germany

¹¹Department of Computer Science, University of Tübingen, Tübingen, Germany

¹²German Center for Infection Research (DZIF), partner site Tübingen, Tübingen, Germany

¹³Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, Queensland Australia

¹⁴Argonne National Laboratory, Lemont, IL USA

¹⁵Analytical Biosciences, Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research, Leiden University, Leiden, the Netherlands

¹⁶Department of Systems Biology, Centro Nacional de Biotecnología, Consejo Superior de Investigaciones Científicas (CNB-CSIC), Madrid, Spain

¹⁷Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life Sciences, Oslo, Norway

¹⁸Hanze University of Applied Sciences, Groningen, the Netherlands

¹⁹European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus, Cambridge, UK

²⁰Department of Chemical and Biomolecular Engineering (BK21 Plus Program), BioProcess Engineering Research Center, BioInformatics Research Center, Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea

²¹Analysis and Redesign of Biological Networks, Max Planck Institute for Dynamics of Complex Technical Systems Magdeburg, Magdeburg, Germany

²²Theoretical Biophysics, Humboldt-Universität zu Berlin, Berlin, Germany

²³Department of Agrotechnology and Food Sciences, Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, the Netherlands

²⁴Bioprocessing Technology Institute, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore

²⁵School of Chemical Engineering Sungkyunkwan University, Jangan-gu Suwon, Gyeonggi-do Republic of Korea

²⁶Science for Life Laboratory, KTH - Royal Institute of Technology, Stockholm, Sweden

²⁷Centre for Host-Microbiome Interactions, Faculty of Dentistry, Oral & Craniofacial Sciences, King’s College London, London, UK

²⁸Department of Pediatrics and Novo Nordisk Foundation Center for Biosustainability, University of California, San Diego School of Medicine, La Jolla, CA USA

²⁹Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, P.R. China

³⁰Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany

³¹Institute of Biomaterials and Biomedical Engineering, University of Toronto, Toronto, Canada

³²SilicoLife Lda., Braga, Portugal

³³Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia USA

³⁴Chalmers University of Technology, Department of Biology and Biological Engineering, Division of Systems and Synthetic Biology, Göteborg, Sweden

³⁵Department of Biomedical Informatics, College of Medicine, University of Arkansas for Medical Sciences (UAMS), Little Rock, AR USA

³⁶Oxford Brookes University, Oxford, UK

³⁷Centre of Biological Engineering, University of Minho, Braga, Portugal

³⁸Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa (ITQB-NOVA), Oeiras, Portugal

³⁹Institute for Molecular Evolution, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany

⁴⁰Present Address: Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) – Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Madrid, Spain

^✉

Corresponding author.

^#

Contributed equally.

PMCID: PMC7082222 NIHMSID: NIHMS1572784 PMID: 32123384

To the Editor — Reconstructing metabolic reaction networks enables the development of testable hypotheses of an organism’s metabolism under different conditions¹. State-of-the-art genome-scale metabolic models (GEMs) can include thousands of metabolites and reactions that are assigned to subcellular locations. Gene–protein–reaction (GPR) rules and annotations using database information can add meta-information to GEMs. GEMs with metadata can be built using standard reconstruction protocols², and guidelines have been put in place for tracking provenance and enabling interoperability, but a standardized means of quality control for GEMs is lacking³. Here we report a community effort to develop a test suite named MEMOTE (for metabolic model tests) to assess GEM quality.

Incompatible description formats and missing annotations⁴ limit GEM reuse. Moreover, numerical errors⁵ and omission of essential cofactors⁶ in a single biomass objective function can have substantial impact on the predictive performance of a GEM. Failure to make checks for flux cycles and imbalances can render model predictions untrustworthy⁷.

Every year, increasing numbers of manually curated and automatically generated GEMs are published, including those for human and cancer tissue models⁸. We believe that it is essential to optimize GEM reproducibility and reuse. Researchers need models that are software-agnostic, with components that have standardized, database-independent identifiers. Default conditions and mathematically specified modeling formulations must be precisely defined to allow reproduction of the original model predictions. Models must produce feasible phenotypes under various conditions. Finally, data used to build any model must be made available in a reusable format.

A dual approach could be used to improve GEM reuse and reproducibility. First, we advocate adoption of the latest version of the Systems Biology Markup Language (SBML) level 3 flux balance constraints (SBML3FBC) package⁹ as the primary description and exchange format. The SBML3FBC package adds structured, semantic descriptions for domain-specific model components such as flux bounds, multiple linear objective functions, GPR rules, metabolite chemical formulas, charge and annotations. The SBML and constraint-based modeling communities collaboratively develop this package, updating it based on user input. It has been adopted by a wide range of constraint-based modeling software and public model repositories (http://cbmpy.sourceforge.net/ and refs. ^10–15), and should therefore be considered the standard for encoding GEMs.

Second, we present MEMOTE (/’mi:moʊt/ in international phonetic alphabet notation), an open-source Python software that represents a unified approach to ensure the formally correct definition of SBML3FBC and provides quality control and continuous quality assurance of metabolic models with tools and best practices already used in software development^16,17. MEMOTE accepts stoichiometric models encoded in SBML3FBC and previous versions as input. In addition to structural validation analogous to the SBML validator¹⁸, MEMOTE benchmarks metabolic models using consensus tests from four general areas: annotation, basic tests, biomass reaction and stoichiometry.

Annotation tests check that a model is annotated according to community standards with minimum information required in annotation of models (MIRIAM)-compliant cross-references¹⁹, that all primary identifiers belong to the same namespace rather than being fractured across several namespaces, and that components are described using Systems Biology Ontology (SBO) terms²⁰. A lack of explicit, standardized annotations complicates the use, comparison and extension of GEMs, and thus strongly hampers collaboration^3,4.

Basic tests check the formal correctness of a model and verify the presence of components such as metabolites, compartments, reactions and genes. These tests also check for metabolite formula and charge information, and GPR rules. General quality metrics, such as the degree of metabolic coverage representing the ratio of reactions and genes²¹, are also checked.

A model is tested for production of biomass precursors in different conditions, for biomass consistency, for nonzero growth rate and for direct precursors. The biomass reaction is based on the biomass composition of the modeled organism and expresses its ability to produce the necessary precursors for in silico cell growth and maintenance. Thus, an extensive, well-formed biomass reaction is crucial for accurate predictions with a GEM⁶.

Stoichiometric inconsistency, erroneously produced energy metabolites⁷ and permanently blocked reactions are identified by MEMOTE. Errors in stoichiometries may result in the production of ATP or redox cofactors from nothing² and are detrimental to the performance of the model when using flux-based analysis⁴.

MEMOTE enables a quick comparison of any two given models, in which individual test results are quantified and condensed to calculate an overall score (Supplementary Note 1). In addition to these consensus tests, researchers can supply experimental data from growth and gene perturbation studies in a range of input formats (.csv, .tsv, .xls or .xslx) in MEMOTE. To support reproducibility, researchers can configure MEMOTE to recognize specific data types as input to predefined experimental tests for model validation (Supplementary Note 2).

There are two main workflows for MEMOTE (Fig. 1a and Supplementary Figs. 1–3). For peer review, MEMOTE can produce either a ‘snapshot report’ or a ‘diff report’ that display MEMOTE test results of one single or multiple models, respectively. For model reconstruction, MEMOTE helps users to create a version-controlled repository of the model and to activate continuous integration toward building a ‘history report’ that records the results of each tracked edit of the model. Although a model repository can be used offline, we encourage community collaboration via distributed version control development platforms, such as GitHub (https://github.com), GitLab (https://gitlab.com/) or BioModels¹² (http://wwwdev.ebi.ac.uk/biomodels/). MEMOTE is tightly integrated with GitHub. Models generated and versioned in MEMOTE can easily be uploaded to GitLab and BioModels. Collaborative model reconstruction with MEMOTE as benchmark can occur using all three software platforms (Fig. 1b).

We validated MEMOTE using models from seven GEM collections (Fig. 2, Supplementary Table 1 and Supplementary Methods), that comprise manually and (semi)-automatically reconstructed GEMs (10,780 models in total). Most GEM collections have already made models available in SMBL format. A nonlinear dimensional reduction of the normalized test results (Supplementary Methods) using t-distributed stochastic neighbor embedding (t-SNE; Fig. 2a) indicates that models from the same source are generally more similar to each other than to models from other sources. Nevertheless, several model sources reveal internal subgroupings (Fig. 2a). With the exception of Path2Models²², which relies on pathway resources that contain problematic reaction information on stoichiometry and directionality²³, automatically reconstructed GEMs were stoichiometrically consistent (Fig. 2b) and mass-balanced (Supplementary Fig. 4). Of the manually reconstructed GEMs we tested, most models in BiGG¹³ are stoichiometrically consistent, but there is wide variation among published models, with ~70% of models having at least one stoichiometrically unbalanced metabolite. Stoichiometrically inconsistent models cannot be mass-balanced, but missing formula annotations, from which molecular masses are calculated, further contribute to reactions being counted as unbalanced. The problems that we identified in published models underpin the need for application of MEMOTE during peer-review process (but ideally before submission) of GEMs.

During GEM reconstruction, metabolic reactions are defined based on functional gene annotations, and this information is output as GPR rules. We found that ~15% of reactions in models we tested are not annotated with GPR rules (Fig. 2c). For published models, subgroups of models contain up to 85% of reactions without GPR rules. This could be due to a large number of modeling-specific reactions, spontaneous reactions²⁴ and known reactions with undiscovered genes, or if GPR rules were annotated in nonstandard ways.

CarveMe²⁵ and Path2Models²² have a very low fraction of universally blocked reactions, whereas models from AGORA²⁶ and KBase¹⁴ contain ~30% blocked reactions, and BiGG¹³ models and OptFlux¹⁵ models contain ~20% blocked reactions (Fig. 2d). Similarly, orphan and dead-end metabolites (Supplementary Figs. 5 and 6) are also present in all of these published collections. We note that blocked reactions and dead-end metabolites are not indicators of low-quality models but that a large proportion (for example, >50%) of universally blocked reactions can indicate problems in reconstruction that need solving.

AGORA, KBase and BiGG are the only collections with SBML-compliant metabolite and reaction annotations. Gene annotations are only present in KBase models and selected BiGG models (Supplementary Figs. 7–9). Each collection uses its own system of identifiers for each model component, but there is some overlap between all three (Supplementary Figs. 10 and 11), and partial overlaps for models from KBase and BiGG (Supplementary Figs. 12–16), or AGORA and BiGG (Supplementary Figs. 17 and 18), but not KBase and AGORA. BiGG is the only collection with models using MetaNetX²⁷ annotations (Supplementary Fig. 19). MetaNetX consolidates biochemical namespaces by establishing a mapping between them through a set of unique identifiers. Hence, knowing the MetaNetX identifier for a given entity often means also knowing the identifiers for other databases (Supplementary Methods).

MEMOTE tests cover semantic and conceptual requirements, which are fundamental to SBML3FBC and constraint-based modeling, respectively. They are extensible to allow the validation of a model’s performance against experimental data and can be executed as a stand-alone tool or integrated into existing reconstruction pipelines. Capitalizing on robust workflows established in modern software development, MEMOTE promotes openness and collaboration by granting the community tangible metrics to support their research and to discuss assumptions or limitations openly.

Application of a set of defined metabolic model tests is not dependent on implementation in MEMOTE, and for some users it may be more desirable to implement each test separately to streamline the user experience.

We propose that an independent, central library of tests and a tool to run them offers an unbiased approach to quality control because the tests are continuously reviewed by the community. This resource will be maintained under stewardship of Nikolaus Sonnenschein by the openCOBRA consortium (https://github.com/opencobra). To encourage integration as opposed to duplication, MEMOTE provides a Python application programing interface (API) as well as being available as a web service. MEMOTE has already been integrated in several services and tools (Supplementary Note 3). We discuss alternatives and future perspectives of MEMOTE in Supplementary Notes 4 and 5, respectively.

We recommend that MEMOTE users reach out to GEM authors to report any errors and thereby enable community improvement of models as resources. Using inconsistent GEMs for hypothesis generation could lead researchers down blind alleys, so we weighed the influence of ‘consistency’ and ‘stoichiometric consistency’ and SBO terms higher than tests for metabolite, reaction and gene annotations.

We are committed to keeping MEMOTE open to support community principles. Robust benchmarking will only work if it is actively supported by the whole community, and we call on any interested experts to join this endeavor and enable its continual improvement.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Supplementary information

Supplementary Materials^{(28.6MB, pdf)}

Supplementary Figs. 1–162, Notes 1–5, Methods, and Tables 1 and 2

Reporting Summary^{(67KB, pdf)}

Acknowledgements

We acknowledge D. Dannaher and A. Lopez for their supporting work on the Angular parts of MEMOTE; resources and support from the DTU Computing Center; J. Cardoso, S. Gudmundsson, K. Jensen and D. Lappa for their feedback on conceptual details; and P. D. Karp and I. Thiele for critically reviewing the manuscript. We thank J. Daniel, T. Kristjánsdóttir, J. Saez-Saez, S. Sulheim, and P. Tubergen for being early adopters of MEMOTE and for providing written testimonials. J.O.V. received the Research Council of Norway grants 244164 (GenoSysFat), 248792 (DigiSal) and 248810 (Digital Life Norway); M.Z. received the Research Council of Norway grant 244164 (GenoSysFat); C.L. received funding from the Innovation Fund Denmark (project “Environmentally Friendly Protein Production (EFPro2)”); C.L., A.K., N. S., M.B., M.A., D.M., P.M, B.J.S., P.V., K.R.P. and M.H. received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement 686070 (DD-DeCaF); B.G.O., F.T.B. and A.D. acknowledge funding from the US National Institutes of Health (NIH, grant number 2R01GM070923-13); A.D. was supported by infrastructural funding from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), Cluster of Excellence EXC 2124 Controlling Microbes to Fight Infections; N.E.L. received funding from NIGMS R35 GM119850, Novo Nordisk Foundation NNF10CC1016517 and the Keck Foundation; A.R. received a Lilly Innovation Fellowship Award; B.G.-J. and J. Nogales received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement no 686585 for the project LIAR, and the Spanish Ministry of Economy and Competitivity through the RobDcode grant (BIO2014-59528-JIN); L.M.B. has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement 633962 for project P4SB; R.F. received funding from the US Department of Energy, Offices of Advanced Scientific Computing Research and the Biological and Environmental Research as part of the Scientific Discovery Through Advanced Computing program, grant DE-SC0010429; A.M., C.Z., S.L. and J. Nielsen received funding from The Knut and Alice Wallenberg Foundation, Advanced Computing program, grant #DE-SC0010429; S.K.’s work was in part supported by the German Federal Ministry of Education and Research (de.NBI partner project “ModSim” (FKZ: 031L104B)); E.K. and J.A.H.W. were supported by the German Federal Ministry of Education and Research (project “SysToxChip”, FKZ 031A303A); M.K. is supported by the Federal Ministry of Education and Research (BMBF, Germany) within the research network Systems Medicine of the Liver (LiSyM, grant number 031L0054); J.A.P. and G.L.M. acknowledge funding from US National Institutes of Health (T32-LM012416, R01-AT010253, R01-GM108501) and the Wagner Foundation; G.L.M. acknowledges funding from a Grand Challenges Exploration Phase I grant (OPP1211869) from the Bill & Melinda Gates Foundation; H.H. and R.S.M.S. received funding from the Biotechnology and Biological Sciences Research Council MultiMod (BB/N019482/1); H.U.K. and S.Y.L. received funding from the Technology Development Program to Solve Climate Changes on Systems Metabolic Engineering for Biorefineries (grants NRF-2012M1A2A2026556 and NRF-2012M1A2A2026557) from the Ministry of Science and ICT through the National Research Foundation (NRF) of Korea; H.U.K. received funding from the Bio & Medical Technology Development Program of the NRF, the Ministry of Science and ICT (NRF-2018M3A9H3020459); P.B., B.J.S., Z.K., B.O.P., C.L., M.B., N.S., M.H. and A.F. received funding through Novo Nordisk Foundation through the Center for Biosustainability at the Technical University of Denmark (NNF10CC1016517); D.-Y.L. received funding from the Next-Generation BioGreen 21 Program (SSAC, PJ01334605), Rural Development Administration, Republic of Korea; G.F. was supported by the RobustYeast within ERA net project via SystemsX.ch; V.H. received funding from the ETH Domain and Swiss National Science Foundation; M.P. acknowledges Oxford Brookes University; J.C.X. received support via European Research Council (666053) to W.F. Martin; B.E.E. acknowledges funding through the CSIRO-UQ Synthetic Biology Alliance; C.D. is supported by a Washington Research Foundation Distinguished Investigator Award. I.N. received funding from National Institutes of Health (NIH)/National Institute of General Medical Sciences (NIGMS) (grant P20GM125503).

Source data

Source Data Fig. 2^{(1.4MB, xlsx)}

Data availability

The model collection is available at 10.5281/zenodo.2636858. Individual results and aggregated tables, as well as analysis code, are available at 10.5281/zenodo.2638234.

Code availability

MEMOTE source code is available at https://github.com/opencobra/memote under the Apache license, version 2.0. Supporting documentation is available at https://memote.readthedocs.io/en/latest/. The MEMOTE web interface is hosted at https://memote.io. A detailed list of all tests in MEMOTE is available at https://memote.readthedocs.io/en/latest/autoapi/index.html.

Competing interests

The authors declare no competing interests.

Footnotes

These authors contributed equally: Christian Lieven, Moritz E. Beber.

Change history

3/19/2020

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

Supplementary information

Supplementary information is available for this paper at 10.1038/s41587-020-0446-y.

References

1.Palsson, B.Ø. Systems Biology: Constraint-based Reconstruction and Analysis (Cambridge Univ. Press, 2015).
2.Thiele I, Palsson BØ. Nat. Protoc. 2010;5:93–121. doi: 10.1038/nprot.2009.203. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Heavner BD, Price ND. Curr. Opin. Biotechnol. 2015;34:105–109. doi: 10.1016/j.copbio.2014.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Ravikrishnan A, Raman K. Brief. Bioinform. 2015;16:1057–1068. doi: 10.1093/bib/bbv003. [DOI] [PubMed] [Google Scholar]
5.Chan, S.H.J., Cai, J., Wang, L., Simons-Senftle, M.N. & Maranas, C.D. Bioinformatics10.1093/bioinformatics/btx453 (2017). [DOI] [PubMed]
6.Xavier JC, Patil KR, Rocha I. Metab. Eng. 2017;39:200–208. doi: 10.1016/j.ymben.2016.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Fritzemeier CJ, Hartleb D, Szappanos B, Papp B, Lercher MJ. PLoS Comput. Biol. 2017;13:e1005494. doi: 10.1371/journal.pcbi.1005494. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Jerby L, Ruppin E. Clin. Cancer Res. 2012;18:5572–5584. doi: 10.1158/1078-0432.CCR-12-1856. [DOI] [PubMed] [Google Scholar]
9.Olivier BG, Bergmann FT. J. Integr. Bioinform. 2018;15:20170082. doi: 10.1515/jib-2017-0082. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Heirendt L, et al. Nat. Protoc. 2019;14:639–702. doi: 10.1038/s41596-018-0098-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. BMC Syst. Biol. 2013;7:74. doi: 10.1186/1752-0509-7-74. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Chelliah V, et al. Nucleic Acids Res. 2015;43:D542–D548. doi: 10.1093/nar/gku1181. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.King ZA, et al. Nucleic Acids Res. 2016;44:D515–D522. doi: 10.1093/nar/gkv1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Arkin AP, et al. Nat. Biotechnol. 2018;36:566–569. doi: 10.1038/nbt.4163. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Rocha I, et al. BMC Syst. Biol. 2010;4:45. doi: 10.1186/1752-0509-4-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Cooper J, Vik JO, Waltemath D. Prog. Biophys. Mol. Biol. 2015;117:99–106. doi: 10.1016/j.pbiomolbio.2014.10.001. [DOI] [PubMed] [Google Scholar]
17.Beaulieu-Jones BK, Greene CS. Nat. Biotechnol. 2017;35:342–346. doi: 10.1038/nbt.3780. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Bornstein BJ, Keating SM, Jouraku A, Hucka M. Bioinformatics. 2008;24:880–881. doi: 10.1093/bioinformatics/btn051. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Le Novère N, et al. Nat. Biotechnol. 2005;23:1509–1515. doi: 10.1038/nbt1156. [DOI] [PubMed] [Google Scholar]
20.Courtot M, et al. Mol. Syst. Biol. 2011;7:543. doi: 10.1038/msb.2011.77. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Monk J, Nogales J, Palsson BO. Nat. Biotechnol. 2014;32:447–452. doi: 10.1038/nbt.2870. [DOI] [PubMed] [Google Scholar]
22.Büchel F, et al. BMC Syst. Biol. 2013;7:116. doi: 10.1186/1752-0509-7-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Yuan Q, et al. PLoS One. 2017;12:e0169437. doi: 10.1371/journal.pone.0169437. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Keller MA, Piedrafita G, Ralser M. Curr. Opin. Biotechnol. 2015;34:153–161. doi: 10.1016/j.copbio.2014.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Machado D, Andrejev S, Tramontano M, Patil KR. Nucleic Acids Res. 2018;46:7542–7553. doi: 10.1093/nar/gky537. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Magnúsdóttir S, et al. Nat. Biotechnol. 2017;35:81–89. doi: 10.1038/nbt.3703. [DOI] [PubMed] [Google Scholar]
27.Moretti S, et al. Nucleic Acids Res. 2016;44:D523–D526. doi: 10.1093/nar/gkv1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Steffensen JL, Dufault-Thompson K, Zhang Y, Dandekar T. PSAMM: a portable system for the analysis of metabolic models. PLOS Comput. Biol. 2016;12:e1004732. doi: 10.1371/journal.pcbi.1004732. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Sidiropoulos N, et al. SinaPlot: an enhanced chart for simple and truthful representation of single observations over multiple classes. J. Comput. Graph. Stat. 2018;27:673–676. [Google Scholar]
30.Ebrahim A, et al. Do genome-scale models need exact solvers or clearer standards? Mol. Syst. Biol. 2015;11:831. doi: 10.15252/msb.20156157. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials^{(28.6MB, pdf)}

Supplementary Figs. 1–162, Notes 1–5, Methods, and Tables 1 and 2

Reporting Summary^{(67KB, pdf)}

Data Availability Statement

The model collection is available at 10.5281/zenodo.2636858. Individual results and aggregated tables, as well as analysis code, are available at 10.5281/zenodo.2638234.

MEMOTE source code is available at https://github.com/opencobra/memote under the Apache license, version 2.0. Supporting documentation is available at https://memote.readthedocs.io/en/latest/. The MEMOTE web interface is hosted at https://memote.io. A detailed list of all tests in MEMOTE is available at https://memote.readthedocs.io/en/latest/autoapi/index.html.

[CR1] 1.Palsson, B.Ø. Systems Biology: Constraint-based Reconstruction and Analysis (Cambridge Univ. Press, 2015).

[CR2] 2.Thiele I, Palsson BØ. Nat. Protoc. 2010;5:93–121. doi: 10.1038/nprot.2009.203. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Heavner BD, Price ND. Curr. Opin. Biotechnol. 2015;34:105–109. doi: 10.1016/j.copbio.2014.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Ravikrishnan A, Raman K. Brief. Bioinform. 2015;16:1057–1068. doi: 10.1093/bib/bbv003. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Chan, S.H.J., Cai, J., Wang, L., Simons-Senftle, M.N. & Maranas, C.D. Bioinformatics10.1093/bioinformatics/btx453 (2017). [DOI] [PubMed]

[CR6] 6.Xavier JC, Patil KR, Rocha I. Metab. Eng. 2017;39:200–208. doi: 10.1016/j.ymben.2016.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Fritzemeier CJ, Hartleb D, Szappanos B, Papp B, Lercher MJ. PLoS Comput. Biol. 2017;13:e1005494. doi: 10.1371/journal.pcbi.1005494. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Jerby L, Ruppin E. Clin. Cancer Res. 2012;18:5572–5584. doi: 10.1158/1078-0432.CCR-12-1856. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Olivier BG, Bergmann FT. J. Integr. Bioinform. 2018;15:20170082. doi: 10.1515/jib-2017-0082. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Heirendt L, et al. Nat. Protoc. 2019;14:639–702. doi: 10.1038/s41596-018-0098-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. BMC Syst. Biol. 2013;7:74. doi: 10.1186/1752-0509-7-74. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Chelliah V, et al. Nucleic Acids Res. 2015;43:D542–D548. doi: 10.1093/nar/gku1181. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.King ZA, et al. Nucleic Acids Res. 2016;44:D515–D522. doi: 10.1093/nar/gkv1049. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Arkin AP, et al. Nat. Biotechnol. 2018;36:566–569. doi: 10.1038/nbt.4163. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Rocha I, et al. BMC Syst. Biol. 2010;4:45. doi: 10.1186/1752-0509-4-45. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Cooper J, Vik JO, Waltemath D. Prog. Biophys. Mol. Biol. 2015;117:99–106. doi: 10.1016/j.pbiomolbio.2014.10.001. [DOI] [PubMed] [Google Scholar]

[CR17] 17.Beaulieu-Jones BK, Greene CS. Nat. Biotechnol. 2017;35:342–346. doi: 10.1038/nbt.3780. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Bornstein BJ, Keating SM, Jouraku A, Hucka M. Bioinformatics. 2008;24:880–881. doi: 10.1093/bioinformatics/btn051. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Le Novère N, et al. Nat. Biotechnol. 2005;23:1509–1515. doi: 10.1038/nbt1156. [DOI] [PubMed] [Google Scholar]

[CR20] 20.Courtot M, et al. Mol. Syst. Biol. 2011;7:543. doi: 10.1038/msb.2011.77. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Monk J, Nogales J, Palsson BO. Nat. Biotechnol. 2014;32:447–452. doi: 10.1038/nbt.2870. [DOI] [PubMed] [Google Scholar]

[CR22] 22.Büchel F, et al. BMC Syst. Biol. 2013;7:116. doi: 10.1186/1752-0509-7-15. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Yuan Q, et al. PLoS One. 2017;12:e0169437. doi: 10.1371/journal.pone.0169437. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Keller MA, Piedrafita G, Ralser M. Curr. Opin. Biotechnol. 2015;34:153–161. doi: 10.1016/j.copbio.2014.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Machado D, Andrejev S, Tramontano M, Patil KR. Nucleic Acids Res. 2018;46:7542–7553. doi: 10.1093/nar/gky537. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Magnúsdóttir S, et al. Nat. Biotechnol. 2017;35:81–89. doi: 10.1038/nbt.3703. [DOI] [PubMed] [Google Scholar]

[CR27] 27.Moretti S, et al. Nucleic Acids Res. 2016;44:D523–D526. doi: 10.1093/nar/gkv1117. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Steffensen JL, Dufault-Thompson K, Zhang Y, Dandekar T. PSAMM: a portable system for the analysis of metabolic models. PLOS Comput. Biol. 2016;12:e1004732. doi: 10.1371/journal.pcbi.1004732. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Sidiropoulos N, et al. SinaPlot: an enhanced chart for simple and truthful representation of single observations over multiple classes. J. Comput. Graph. Stat. 2018;27:673–676. [Google Scholar]

[CR30] 30.Ebrahim A, et al. Do genome-scale models need exact solvers or clearer standards? Mol. Syst. Biol. 2015;11:831. doi: 10.15252/msb.20156157. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

MEMOTE for standardized genome-scale metabolic model testing

Christian Lieven

Moritz E Beber

Brett G Olivier

Frank T Bergmann

Meric Ataman

Parizad Babaei

Jennifer A Bartell

Lars M Blank

Siddharth Chauhan

Kevin Correia

Christian Diener

Andreas Dräger

Birgitta E Ebert

Janaka N Edirisinghe

José P Faria

Adam M Feist

Georgios Fengos

Ronan M T Fleming

Beatriz García-Jiménez

Vassily Hatzimanikatis

Wout van Helvoirt

Christopher S Henry

Henning Hermjakob

Markus J Herrgård

Ali Kaafarani

Hyun Uk Kim

Zachary King

Steffen Klamt

Edda Klipp

Jasper J Koehorst

Matthias König

Meiyappan Lakshmanan

Dong-Yup Lee

Sang Yup Lee

Sunjae Lee

Nathan E Lewis

Filipe Liu

Hongwu Ma

Daniel Machado

Radhakrishnan Mahadevan

Paulo Maia

Adil Mardinoglu

Gregory L Medlock

Jonathan M Monk

Jens Nielsen

Lars Keld Nielsen

Juan Nogales

Intawat Nookaew

Bernhard O Palsson

Jason A Papin

Kiran R Patil

Mark Poolman

Nathan D Price

Osbaldo Resendis-Antonio

Anne Richelle

Isabel Rocha

Benjamín J Sánchez

Peter J Schaap

Rahuman S Malik Sheriff

Saeed Shoaie

Nikolaus Sonnenschein

Bas Teusink

Paulo Vilaça

Jon Olav Vik

Judith A H Wodke

Joana C Xavier

Qianqian Yuan

Maksim Zakhartsev

Cheng Zhang

Fig. 1. Graphical summary of MEMOTE.

Fig. 2. Quality of manually reconstructed GEMs from collections without quality control or quality assurance.

Reporting Summary

Supplementary information

Acknowledgements

Source data

Data availability

Code availability

Competing interests