A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology

Markus J Herrgård; Neil Swainston; Paul Dobson; Warwick B Dunn; K Yalçin Arga; Mikko Arvas; Nils Blüthgen; Simon Borger; Roeland Costenoble; Matthias Heinemann; Michael Hucka; Nicolas Le Novère; Peter Li; Wolfram Liebermeister; Monica L Mo; Ana Paula Oliveira; Dina Petranovic; Stephen Pettifer; Evangelos Simeonidis; Kieran Smallbone; Irena Spasić; Dieter Weichart; Roger Brent; David S Broomhead; Hans V Westerhoff; Betül Kırdar; Merja Penttilä; Edda Klipp; Bernhard Ø Palsson; Uwe Sauer; Stephen G Oliver; Pedro Mendes; Jens Nielsen; Douglas B Kell

doi:10.1038/nbt1492

. Author manuscript; available in PMC: 2014 May 12.

Published in final edited form as: Nat Biotechnol. 2008 Oct;26(10):1155–1160. doi: 10.1038/nbt1492

A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology

Markus J Herrgård ^1,^¶, Neil Swainston ^2,^3,^¶, Paul Dobson ^3,⁴, Warwick B Dunn ^3,⁴, K Yalçin Arga ⁵, Mikko Arvas ⁶, Nils Blüthgen ^3,⁷, Simon Borger ⁸, Roeland Costenoble ⁹, Matthias Heinemann ⁹, Michael Hucka ¹⁰, Nicolas Le Novère ¹¹, Peter Li ^2,³, Wolfram Liebermeister ⁸, Monica L Mo ¹, Ana Paula Oliveira ¹², Dina Petranovic ¹², Stephen Pettifer ^2,³, Evangelos Simeonidis ^3,⁷, Kieran Smallbone ^3,¹³, Irena Spasić ^2,³, Dieter Weichart ^3,⁴, Roger Brent ¹⁴, David S Broomhead ^3,¹³, Hans V Westerhoff ^3,^7,¹⁵, Betül Kırdar ⁵, Merja Penttilä ⁶, Edda Klipp ⁸, Bernhard Ø Palsson ¹, Uwe Sauer ⁹, Stephen G Oliver ^3,¹⁶, Pedro Mendes ^2,^3,¹⁷, Jens Nielsen ^12,¹⁸, Douglas B Kell ^3,^4,^*

¹Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0412, USA

²School of Computer Science, The University of Manchester, Oxford Rd, Manchester M13 9PL, UK

³The Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocentre, The University of Manchester, 131 Princess St, Manchester M1 7DN, UK

⁴School of Chemistry, The University of Manchester, Manchester M13 9PL

⁵Department of Chemical Engineering, Boğaziçi University, Bebek 34342, Istanbul, Turkey

⁶VTT Biotechnology Espoo, PO Box 1500, FIN-02044, Finland

⁷School of Chemical Engineering and Analytical Science, The University of Manchester

⁸Max-Planck-Institut für Molekulare Genetik, Ihnestrasse 73, 14195 Berlin, Germany

⁹Institut für Molekulare Systembiologie, ETH Zürich, Wolfgang-Pauli-Str. 16, 8093 Zürich, Switzerland

¹⁰Control and Dynamical Systems, California Institute of Technology, Pasadena, CA 91125, USA

¹¹Computational Neurobiology, EMBL-EBI, Wellcome-Trust Genome Campus, Hinxton, Cambridge CB10 1SD, U.K.

¹²Center for Microbial Biotechnology, Department of Systems Biology, Technical University of Denmark, Building 223, DK-2800 Kgs. Lyngby, Denmark

¹³School of Mathematics, The University of Manchester, Manchester M13 9PL

¹⁴The Molecular Sciences Institute, 2168 Shattuck Avenue, Berkeley, CA 94704, USA

¹⁵Department of Molecular Cell Physiology, Vrije Universiteit, de Boelelaan 1085, 1081 HV Amsterdam, The Netherlands

¹⁶Department of Biochemistry, University of Cambridge, Sanger Building, 80 Tennis Court Road, Cambridge CB2 1GA, UK

¹⁷Virginia Bioinformatics Institute, Virginia Tech, Washington St 0477, Blacksburg, VA 24061, USA

¹⁸Department of Chemical and Biological Engineering, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden

Corresponding author dbk@manchester.ac.ukwww.mcisb.org

^¶

These authors contributed equally to this work

Present addresses: Synthetic Genomics, Inc., 11149 N. Torrey Pines Rd, La Jolla, CA 92037 (MJH)

Department of Chemical and Biological Engineering, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden (DP)

PMCID: PMC4018421 NIHMSID: NIHMS574724 PMID: 18846089

Abstract

Genomic data now allow the large-scale manual or semi-automated reconstruction of metabolic networks. A network reconstruction represents a highly curated organism-specific knowledge base. A few genome-scale network reconstructions have appeared for metabolism in the baker’s yeast Saccharomyces cerevisiae. These alternative network reconstructions differ in scope and content, and further have used different terminologies to describe the same chemical entities, thus making comparisons between them difficult. The formulation of a ‘community consensus’ network that collects and formalizes the ‘community knowledge’ of yeast metabolism is thus highly desirable. We describe how we have produced a consensus metabolic network reconstruction for S. cerevisiae. Special emphasis is laid on referencing molecules to persistent databases or using database-independent forms such as SMILES or InChI strings, since this permits their chemical structure to be represented unambiguously and in a manner that permits automated reasoning. The reconstruction is readily available via a publicly accessible database and in the Systems Biology Markup Language, and we describe the manner in which it can be maintained as a community resource. It should serve as a common denominator for system biology studies of yeast. Similar strategies will be of benefit to communities studying genome-scale metabolic networks of other organisms.

Introduction

One of the goals of integrative systems biology is the accurate representation of biochemical, metabolic and signaling networks by mathematical models. This undertaking can be divided into four stages ¹. The first is a qualitative stage in which are listed all the reactions that are known to occur in the system or organism of interest; in the modern era, and especially for metabolic networks, these reaction lists are often derived in part from genomic annotations ^{2, 3} with curation based on literature (‘bibliomic’) data ⁴. A second stage, again qualitative, adds known effectors, while the third and fourth stages – essentially amounting to molecular enzymology – include the known kinetic rate equations and the values of their parameters. Armed with such information it is then possible to provide a stochastic or ordinary differential equation model of the entire metabolic network of interest. An attractive feature of metabolism, for the purposes of modeling, is that, in contrast to signaling pathways, metabolism is subject to direct thermodynamic and (in particular) stoichiometric constraints ³. Our focus here is on the first two stages of the reconstruction process, especially as it pertains to the mapping of experimental metabolomics data onto metabolic network reconstructions.

Besides being an industrial workhorse for a variety of biotechnological products, baker’s yeast (Saccharomyces cerevisiae) is a highly developed model organism for biochemical, genetic, pharmacological and post-genomic studies (e.g. ⁵). It is especially attractive because of its sequenced genome ⁶, the availability of a whole series of bar-coded deletion ^{7, 8} and other ⁹ strains, extensive experimental omics data (e.g. ^10–14), and the ability to grow it for extended periods in highly controlled conditions ¹⁵. The scientific community that works on S. cerevisiae is very active and has a history of collaborative research projects that have led to significant advances in our understanding of eukaryotic biology (e.g. ^{6, 8, 13, 16, 17}). Furthermore, yeast metabolic physiology has been the subject of intensive study and most of the components of the yeast metabolic network are relatively well characterized. Taken together, these factors make yeast metabolism an attractive topic to test a community approach to model building in systems biology.

A number of groups (e.g. ^18–21) have reconstructed the metabolic network of yeast from genomic and literature data and made the reconstructions freely available. However, due to different approaches utilized in the reconstruction as well as different interpretations of the literature, the existing reconstructions have many differences. Additionally, the naming of metabolites and enzymes in the existing reconstructions was, at best, inconsistent, and there were no systematic annotations of the chemical species in the form of links to external databases that store chemical compound information. This lack of model annotation complicated the use of the models for data analysis and integration. Members of the yeast systems biology community therefore recognized that a single ‘consensus’ reconstruction and annotation of the metabolic network was highly desirable as a starting point for further investigations.

A crucial factor that enabled the building of a consensus network reconstruction is the ability to describe and exchange biochemical network models in a standard format, the Systems Biology Markup Language (SBML; www.sbml.org) ²². The SBML format is utilized by most commonly used software applications for visualization, simulation, and analysis of biochemical networks, and also in pathway databases. SBML also provides the necessary standardized means (MIRIAM ²³) to annotate models with information that is required to identify uniquely network components, including metabolites, proteins, and genes. Representing the consensus metabolic network reconstruction in a MIRIAM-compatible SBML format allows widespread use of the reconstruction as well as assisting in its continued curation, expansion and revision.

We developed this consensus reconstruction using a ‘jamboree’ approach, a large, focused work meeting, where we defined the protocol for the curation process as well as resolving the majority of discrepancies between the existing reconstructions. The jamboree event was followed by an extended process of curation of remaining discrepancies and careful annotation of components of the reconstructions by a smaller group of people. The overall goal of the effort was, by careful curation and comprehensive annotation of the network model and its components, to make the consensus reconstruction useful for the broadest possible set of users. The general reconstruction could then be used directly in bioinformatics applications aimed at integration of, for example, metabolomics and proteomics data or as a starting point for building predictive models using a number of different approaches (e.g. ^{24, 25}), and for other purposes outlined below.

Here we describe how an initial ‘community consensus’ reconstruction of the yeast metabolic network was carried out. We make some further proposals for how this reconstruction of the yeast metabolic network may evolve as more information is acquired. We also discuss the possibility of using a similar approach to build consensus models of metabolic and other networks in other organisms.

Consensus reconstruction process

As a starting point for the development of a consensus reconstruction, we chose two separately developed freely available metabolic network reconstructions, iMM904 (http://gcrg.ucsd.edu/In_Silico_Organisms/Yeast) and iLL672 ²⁰, containing 904 and 672 yeast genes, respectively. Both of these reconstructions were derived from the first genome-scale metabolic network reconstruction for yeast iFF708 ¹⁸ (for the basis of this terminology see ²⁶), but the process of curating the original reconstruction was significantly different for the two derived reconstructions. The iMM904 reconstruction has eight different compartments and it was developed by curating and expanding an earlier reconstruction, iND750 ¹⁹. In contrast, the iLL672 reconstruction ²⁰ was directly derived from iFF708 by extensively curating the reconstruction in order to improve the ability of the flux balance model derived from the reconstruction to predict gene deletion phenotypes ²⁷. It should be noted that yeast metabolic pathways in the KEGG ²⁸ and SGD (see e.g. ^{29, 30}) databases were used to establish the starting point for building the original iFF708 reconstruction and also for curating the iLL672 and iMM904 reconstructions. Hence, the information from early versions of these two reconstructions is included implicitly in the consensus reconstruction.

Due to the lack of common metabolite names and annotations, the comparison of the two starting-point reconstructions required first manually defining the correspondences between metabolites. After these had been assigned, the overall metabolite and reaction content of the two reconstructions could be compared (Table 1). The majority of metabolites (444) were found in both reconstructions, whereas 8 were only in iLL672 and 269 only in iMM904. In terms of reactions, 566 were in both reconstructions, 177 reactions were only found in iLL672, and 836 only in iMM904. The large number of additional reactions in iMM904 is mostly due to the expanded number of compartments represented in this reconstruction.

Table 1.

Comparison of starting point-reconstructions.

	iMM9 04	iLL672	Common	iMM904 only	iLL672 only
Metabolites	713	452	444	269	8
Reactions	1402	743	566^*	836	177
Genes	904	659	646	258	13
Compartments	8	2	2	6	0

Open in a new tab

Reaction comparisons were done by considering every reaction to be reversible and without taking into account water and extracellular or intracellular protons (explicitly accounted for in iMM904).

The actual reconstruction jamboree was held at The University of Manchester, UK, in April 2007. The comparison between the iLL672 and iMM904 reconstructions shown in Table 1, proposed at a meeting of the YSBN (www.ysbn.eu/) in Helsinki, Finland in June 2006, formed the starting point for the reconstruction. The three-day Manchester event concentrated on three separate areas: (1) defining standards for curation as well as for representation of the annotated reconstruction in SBML, (2) annotating the metabolites with reference to external compound databases, and (3) resolving discrepancies between the reaction/metabolite sets in the two reconstructions. The presence of experts in relevant fields such as yeast genetics and physiology, systems modeling, metabolomics, standards (SBML/MIRIAM/metabolomics), and database/ontology development allowed the group to make significant progress in all three areas during the event. The annotation/curation was aided by a version of the B-Net database ³¹, and is provided as SBML (see Supplementary Table 1). After the jamboree, a sub-group of the authors verified the curation/annotation and resolved the remaining discrepancies between models. In the following we discuss in detail some of the major components of the curation and annotation processes.

Metabolite naming conventions

The initial comparison made it very clear that the naming conventions used in the two models were completely different, such that it was hard in some cases even for experts to know which chemical entities were meant. In addition, some of the reactions involved ‘generic’ structures (molecules with R-groups or so-called ‘Markush’ structures) which are not effectively represented in stoichiometric metabolic models, while some of the named entities represented ‘composite’ substances such as mixtures of different lipids or ‘biomass’. Without standardized names, it is extremely hard to enable computer software to reason about the similarities and differences between different models ^32–37; this problem is even more significant in the case of reconstructions of the larger human metabolic network (e.g. ^{4, 38}).

However, SBML allows one to annotate species such as metabolites with external references, and this has permitted us to relate them to molecules in ChEBI ³⁹, KEGG ²⁸, and PubChem ⁴⁰, and to identify them definitively using database-independent representations of small molecules such as SMILES ⁴¹ and InChI ^{36, 42}. We took advantage of this aspect of SBML to identify and annotate (manually) which chemical species were being described. In general the procedure we followed was to search these databases with the contents of the species’ name attribute field in the SBML representation or by the chemical formula of the compound sought. The order of annotation was such that we annotated metabolite species using ChEBI identifiers and InchI strings (if possible). If these did not exist or could not be resolved, then we used KEGG IDs (or in two cases HMDB identifiers ⁴³) followed by PubChem IDs and finally PubMed references. This activity generated, for the first time, a representation that allows computational comparisons to be performed.

Because some individual molecules have multiple states (e.g. because of acid-base reactions), it would be desirable to use the chemical entities believed to be most common at the pH of the relevant compartment. However, in this version of the consensus reconstruction, all species are assumed to be in the form that corresponds to the most common protonation state at pH 7.2. Whenever possible, the metabolites were annotated with a database entry with the correct protonation state but, in a number of cases, the databases only contained the metabolite in a neutral form or otherwise in an incorrect or incorrectly annotated protonation state.

Annotation of large-scale metabolic models in SBML

While large-scale metabolic network reconstructions and models are now commonly represented in SBML, there has not thus far been a standard way to annotate these models. As part of the consensus reconstruction effort, we tried to develop such a standard that is compliant with MIRIAM ²³. While the annotation of metabolites is quite straightforward, standardized annotation of the reaction content (molecules and reactions) of the reconstructed network proved to be more involved.

Where possible, we annotated reactions using literature references encoded as PubMed IDs, using the MIRIAM- and SBML-compliant "isDescribedBy" RDF annotation tag. In addition, reaction annotations include modifiers (enzymes/enzyme complexes) where possible. If a given reaction can be catalyzed by two or more isozymes, we generated an individual reaction for each isozyme (or complex). We represented the formation of protein complexes by separate reactions. Proteins and genes were finally annotated by references to SGD ²⁹ and UniProt ⁴⁴. In addition, we annotated cellular compartments using GO terms ⁴⁵. In all cases where annotations were used, the MIRIAM ²³ web services (http://www.ebi.ac.uk/compneur-srv/miriam-main/mdb?section=ws) were consulted to ensure correct annotation. Examples of fully annotated species and reaction entries are shown in Figure 2 and in Supplementary Figure 1.

An example of the SBML annotation of a metabolite species using the example of ATP, as used in the reconstruction of the consensus network, illustrating its use of the Systems Biology Ontology (http://www.ebi.ac.uk/sbo/) and its MIRIAM-compliance. A. Relevant parts of the SBML code. B. An indication of the kinds of annotations included (for clarity not all are shown).

Contents of the consensus reconstructions

In all, the resulting consensus network consists of 2153 species (1168 metabolites, 832 genes, 888 proteins, and 96 catalytic protein complexes) and 1857 reactions (1761 metabolic reactions and 96 complex formation reactions). Reactions and species can be localized to 15 compartments (see Table 2) including membrane compartments. The network contains 664 distinct chemical entities (i.e. ATP is present in the nucleus, cytoplasm, Golgi, mitochondrion, peroxisome and vacuole and as such represents 6 metabolites but here we used just one chemical species, ignoring, e.g., Mg⁺⁺ liganding). Of these distinct chemical entities, 554 are annotated with ChEBI identifiers, 564 with InChI identifiers, 78 with KEGG identifiers, 10 with PubChem identifiers, two with HMDB identifiers, and five only have PubMed references. In addition, 26 compounds are currently unannotated in this way – the majority of these are fatty acyl CoA’s or Acyl carrier proteins (ACPs) where the corresponding fatty acid is in public databases, but the fatty acyl CoA or ACP is currently not (but will be submitted to them).

Table 2.

Summary of the consensus reconstruction by compartment

Compartment	Reactions	Metabolites
Cytoplasm	835	590
Extracellular	15	158
Golgi	2	13
Mitochondrion	188	235
Nucleus	30	42
Endoplasmic reticulum	32	28
Vacuole	2	22
Peroxisome	77	80
Mitochondrial membrane	142	0
Plasma membrane	311	0
Peroxisomal membrane	44	0
ER membrane	17	0
Vacuolar membrane	35	0
Golgi membrane	5	0
Nuclear membrane	26	0

Open in a new tab

The network includes 1312 unique chemical transformations, of which 911 occur within a single compartment and the remaining 401 are transport reactions. The overall distribution of metabolites and reactions between the various compartments in the consensus network is given in Table 2. EC number and PubMed reference annotations are provided for 738 and 478 unique transformations in the network, respectively. Each reaction includes all of its cofactors (sometimes known as ‘currency metabolites’) such as ATP, NADH and CoA. In addition, and while we recognize that there is a certain arbitrariness about this, we have assigned pathway names for each reaction in the network.

We have removed various reactions from the initial networks, especially where they contained Markush structures or ambiguities. This has led to the under-representation of lipids, where there are many combinatorial issues (e.g. ⁴⁶). We anticipate that lipid pathways will be added in the future, but “lipidomics” experiments will eventually be necessary to define the full complement of different lipid species present in S. cerevisiae. In a similar vein, composite items such as ‘biomass’ are excluded; while these are required for flux balance analysis (FBA), our purpose here is to provide the basic inventory of metabolites and network structure that can be used e.g. to compare the network with experimental metabolomics data. This inventory can then form the basis for setting up flux balance models using different assumptions required for setting up these kinds of model, e.g. assumptions on the biomass composition, reversibility of reactions and lumping of the reactions into fewer compartments.

Figure 1 depicts the degree distribution (see e.g. ⁴⁷) of the complete metabolite network, and a version where the currency metabolites were ignored as described earlier by Wagner and Fell ⁴⁸. The complete network (Fig 1A) has an average clustering coefficient of 0.742, average node degree of 13.166, characteristic path length of 2.186, and betweenness centralization of 0.3897. The network without currency metabolites (Fig 1B) has an average clustering coefficient of 0.421, average node degree of 5.138, characteristic path length of 4.178, and betweenness centralization of 0.2329. In the full network the largest value for the shortest distance between any two metabolites (‘diameter’) is only 4 reaction steps while in the one without “currency” metabolites it is 11 reaction steps (between dTTP and Heme A). These statistics indicate that the “currency” metabolites should not be ignored as is sometimes done; without them the network is considerably less connected and several unconnected subnets appear, thus leaving some areas of metabolism unconnected from the rest. The center metabolite in the complete network is the proton, while in the smaller one it is coenzyme A. Table 3 lists the top 15 most-connected metabolites of each network.

The metabolic reaction network was first summarized in a metabolite network, where metabolites are the nodes and one edge links two metabolites that co-occur in a reaction (in any role as substrates or products) as described in ⁴⁸. For this analysis transport steps were not considered, nor were protein-protein binding reactions. The figures plot the distribution of the degree of connectivity, *P(k)*, expressed as the fraction of metabolites that have k links out of the total number of metabolites plotted against the number of links (k) **(A)** in the complete network and, **(B)** in a network where the following metabolites were not considered: {water, proton, carbon dioxide, dioxygen, phosphate³⁻, diphosphate⁴⁻, ammonium, ATP, ADP, AMP, NAD⁺, NADH, NADP⁺, NADPH} (to be comparable with the analysis in ⁴⁸).

Table 3.

Most connected nodes in the metabolite network. The degree is the number of metabolites that co-occur in metabolic reactions; the betweeness quantifies the number of paths between any two pairs of metabolites in the network that this one mediates (a global property)

Complete metabolite network (as in Fig 1A)			Abbreviated metabolite network (as in Fig 1B)
Metabolite	Degree	Betweeness	Metabolite	Degree	Betweeness
Proton	506	0.391	Coenzyme A	106	0.237
Water	390	0.226	L-glutamate¹⁻	71	0.232
ATP	268	0.099	Acetyl-CoA	66	0.065
Diphosphate⁴⁻	181	0.062	2-oxoglutarate²⁻	48	0.078
Phosphate³⁻	178	0.041	Hydrogen peroxide	45	0.070
ADP	173	0.026	Pyruvate	38	0.070
NADPH	166	0.021	Glycine	31	0.041
NADP⁺	166	0.021	S-adenosyl-L-methionine	28	0.063
NAD⁺	143	0.018	Acetate	26	0.021
Carbon dioxide	139	0.020	S-adenosyl-L-homocysteine	25	0.039
NADH	139	0.017	(6S)-5,6,7,8-tetrahydrofolic acid	25	0.031
AMP	128	0.029	L-glutamine	25	0.025
Coenzyme A	119	0.021	Succinate²⁻	24	0.029
Dioxygen	116	0.014	Acyl-carrier protein	23	0.013
Ammonium	92	0.011	L-cysteine	22	0.023

Open in a new tab

Dissemination and future curation of the reconstruction

An SBML-encoded version of the base model (with and without compartments) is available at http://www.comp-sys-bio.org/yeastnet. Specifically, the SBML representation of the model is made available under the Creative Commons Attribution-Share Alike 3.0 Unported License (see www.creativecommons.org/). This is the preferred source for using the complete model with systems biology software. We have tested the SBML using various XML validators, and shown that it loads successfully into the COPASI biochemical software simulator ⁴⁹. COPASI shows that there are a total of 307 mass conservation relations, which were calculated from the stoichiometry matrix using the method of Vallabhajosyula ⁵⁰, which is now standard in COPASI ⁴⁹. We have also loaded the model successfully into some versions of Cytoscape ⁵¹ and CellDesigner ⁵². The software is fully libSBML-compliant (see e.g. http://sbml.org/software/libsbml/ and ^{53, 54}).

Recognizing that for many applications only subsets of this model are going to be relevant, we also make it available in an online database that facilitates searching the model. We used the database schema B-Net ³¹ which already supported all of the features required for our SBML model, including a structured mechanism for MIRIAM annotations. This B-Net representation of the model can be searched using synonyms and it also allows the user to navigate through the network, for example going from a metabolite to all its reactions, then to the genes that encode the enzymes catalyzing those reactions, etc. The database is also available at http://www.comp-sys-bio.org/yeastnet.

The B-Net database provides another important function as it is also the preferred means by which the community will be able to edit the model. It will thus be the primary source for the model. The advantages of using a database for managing data are well known and need not be reiterated here; suffice to say that, since there is no redundancy in the database, any change in any component immediately becomes global. For the time being, editing the model is limited to a few curators to ensure that the current standards are maintained. However, community annotation has major benefits ^{55, 56}. Thus we also have included at the database a mechanism that collects annotations from anyone who wishes to communicate corrections or additions to the model. These annotations will then be reviewed and incorporated into the model for future releases of new versions.

Discussion

We have brought together a large segment of the community engaged in research involving genome-scale metabolic networks of yeast to create a consensus network that is freely available without restrictions and that can form the basis for future improvements. The SBML representation of the reconstruction is freely available under a Creative Commons License, and representations of the network were designed to facilitate future improvements.

Although annotation was semi-automated, a considerable element of manual annotation was still required, especially the parsing of the starting models. One of the biggest problems was the use of non-standard and often arcane synonyms for referring to the same chemical entity. A number of commentators have recognized the difficulties caused by issues of synonyms (e.g. ^{4, 33, 38}). For these purposes, we believe and strongly recommend that the best solution to the synonym problem is to reference chemical entities in persistent databases and with database-independent representations such as SMILES ⁴¹ and InChI ⁴². Referencing the true chemical entity intended requires detailed consideration of its stereochemistry and the anomeric specificity of reactions in which it is involved, and not all databases have the required level of precision. We would also recommend that one should first build these networks in an assumption-free manner, and only then introduce (and annotate) extra features or assumptions that may be required for specific purposes – for instance adding composite compounds for flux balance analyses. A further benefit of the jamboree approach is the access to experts necessary to annotate details such as the precise gene-protein relationships underlying specific reactions.

The reconstruction presented here is currently the most comprehensive and consistent stoichiometric representation of yeast metabolism, from which predictive (sub)models, for example for genome-scale flux balance analysis, can be extracted and deployed. Presently, the reconstruction lacks information on effectors, reaction kinetics and parametrization. However, the basic framework of B-Net coupled to SBML models that can easily be populated with such data enables these to be added as they become available, and thus kinetic models that can be directly linked to the genome-scale metabolic network can be built. Some parameters are already available at SABIO-RK (http://sabio.villa-bosch.de/SABIORK/).

Network reconstruction approaches have developed rapidly in recent years. When they reach the genome-scale, they can be viewed as systems-level genome annotations ⁵⁷. Genome-annotation is produced by a community-driven process to reach a consensus annotation that represents the state of knowledge about the genome of the target organism. Annotations are then updated based on new information and they serve as a common denominator for genome science studies of the target organism. The yeast metabolic reconstruction presented here represents an analogous process for systems biology studies of a target organism. With the successful achievement of the first consensus reconstruction the systems biology community can look forward to similar two-dimensional annotation jamborees for other organisms.

The metabolite nomenclature proposed here will hopefully become the standard terminology for metabolic models because the compounds themselves are essentially identical in all species. In the present case, we believe that the semantically annotated reconstruction we here provide will have especial utility in a number of areas. First is the basic exploration of metabolic pathways and well-curated connections between gene products. Further, the reconstruction will allow the automated interpretation and visualization of metabolomics data as well as data on metabolic proteins, genes and transcripts. The network can form the basis of phenotype predictions (including product yield) in response to genetic/environmental perturbations using a variety of methods (including FBA and logical approaches ⁵⁸), and it can be used in metabolic flux estimation based on isotopomer data (e.g. ⁵⁹), for filling gaps in metabolic pathways, and for exploring questions related to comparative metabolomics ⁶⁰ and of metabolic pathway evolution. The widespread use of a consensus starting point will make both the comparison and the integration of such studies considerably easier.

Supplementary Material

Supp Fig 1

NIHMS574724-supplement-Supp_Fig_1.doc^{(53.5KB, doc)}

Supp Table

NIHMS574724-supplement-Supp_Table.txt^{(199.8KB, txt)}

Acknowledgments

The Manchester groups thank the UK BBSRC and EPSRC for financial support including for the Manchester Centre for Integrative Systems Biology (www.mcisb.org/). The UCSD participants thank NIH for financial support (NIH R01 GM071808). We thank Sarah Keating and Norman Paton for many useful discussions. The Jamboree was held under the auspices, and with the sponsorship of, the Yeast Systems Biology Network (EC Contract: LSHG-CT-2005-018942).

Footnotes

Author contributions. All authors conceived the idea of the consensus reconstruction, the majority were present during the jamboree itself, and all contributed to the writing of, and approved, the manuscript.

REFERENCES

1.Kell DB. Metabolomics, modelling and machine learning in systems biology: towards an understanding of the languages of cells. The 2005 Theodor Bücher lecture. FEBS J. 2006;273:873–894. doi: 10.1111/j.1742-4658.2006.05136.x. [DOI] [PubMed] [Google Scholar]
2.Arakawa K, Yamada Y, Shinoda K, Nakayama Y, Tomita M. GEM System: automatic prototyping of cell-wide metabolic pathway models from genomes. BMC Bioinformatics. 2006;7:168. doi: 10.1186/1471-2105-7-168. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Palsson BØ. Systems biology: properties of reconstructed networks. Cambridge: Cambridge University Press; 2006. [Google Scholar]
4.Duarte NC, et al. Global reconstruction of the human metabolic network based on genomic and bibliomic data. Proc Natl Acad Sci. 2007;104:1777–1782. doi: 10.1073/pnas.0610772104. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Mager WH, Winderickx J. Yeast as a model for medical and medicinal research. Trends Pharmacol Sci. 2005;26:265–273. doi: 10.1016/j.tips.2005.03.004. [DOI] [PubMed] [Google Scholar]
6.Goffeau A, et al. Life With 6000 Genes. Science. 1996;274:546–567. doi: 10.1126/science.274.5287.546. [DOI] [PubMed] [Google Scholar]
7.Winzeler EA, et al. Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Science. 1999;285:901–906. doi: 10.1126/science.285.5429.901. [DOI] [PubMed] [Google Scholar]
8.Giaever G, et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002;418:387–391. doi: 10.1038/nature00935. [DOI] [PubMed] [Google Scholar]
9.Yen K, Gitsham P, Wishart J, Oliver SG, Zhang N. An improved tetO promoter replacement system for regulating the expression of yeast genes. Yeast. 2003;20:1255–1262. doi: 10.1002/yea.1037. [DOI] [PubMed] [Google Scholar]
10.Hughes TR, et al. Functional discovery via a compendium of expression profiles. Cell. 2000;102:109–126. doi: 10.1016/s0092-8674(00)00015-5. [DOI] [PubMed] [Google Scholar]
11.Allen JK, et al. High-throughput characterisation of yeast mutants for functional genomics using metabolic footprinting. Nature Biotechnol. 2003;21:692–696. doi: 10.1038/nbt823. [DOI] [PubMed] [Google Scholar]
12.Zhu H, et al. Global analysis of protein activities using proteome chips. Science. 2001;293:2101–2105. doi: 10.1126/science.1062191. [DOI] [PubMed] [Google Scholar]
13.Castrillo JI, et al. Growth control of the eukaryote cell: a systems biology study in yeast. J. Biol. 2007;6:4. doi: 10.1186/jbiol54. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Delneri D, et al. Identification and characterization of high-flux-control genes of yeast through competition analyses in continuous cultures. Nat Genet. 2008;40:113–117. doi: 10.1038/ng.2007.49. [DOI] [PubMed] [Google Scholar]
15.Wu J, Zhang N, Hayes A, Panoutsopoulou K, Oliver SG. Global analysis of nutrient control of gene expression in Saccharomyces cerevisiae during growth and starvation. Proc Natl Acad Sci U S A. 2004;101:3148–3153. doi: 10.1073/pnas.0308321100. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Oliver SG. A network approach to the systematic analysis of gene function. Trends Genet. 1996;12:241–242. doi: 10.1016/0168-9525(96)30053-x. [DOI] [PubMed] [Google Scholar]
17.Suter B, Auerbach D, Stagljar I. Yeast-based functional genomics and proteomics technologies: the first 15 years and beyond. Biotechniques. 2006;40:625–644. doi: 10.2144/000112151. [DOI] [PubMed] [Google Scholar]
18.Förster J, Famili I, Fu P, Palsson BØ, Nielsen J. Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. Genome Res. 2003;13:244–253. doi: 10.1101/gr.234503. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Duarte NC, Herrgard MJ, Palsson BØ. Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res. 2004;14:1298–1309. doi: 10.1101/gr.2250904. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Kuepfer L, Sauer U, Blank LM. Metabolic functions of duplicate genes in Saccharomyces cerevisiae. Genome Res. 2005;15:1421–1430. doi: 10.1101/gr.3992505. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Caspi R, et al. MetaCyc: a multiorganism database of metabolic pathways and enzymes. Nucleic Acids Res. 2006;34:D511–D516. doi: 10.1093/nar/gkj128. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Hucka M, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19:524–531. doi: 10.1093/bioinformatics/btg015. [DOI] [PubMed] [Google Scholar]
23.Le Novère N, et al. Minimum information requested in the annotation of biochemical models (MIRIAM) Nat Biotechnol. 2005;23:1509–1515. doi: 10.1038/nbt1156. [DOI] [PubMed] [Google Scholar]
24.Çakir T, et al. Integration of metabolome data with metabolic networks reveals reporter reactions. Mol Syst Biol. 2006;2:50. doi: 10.1038/msb4100085. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Kümmel A, Panke S, Heinemann M. Putative regulatory sites unraveled by network-embedded thermodynamic analysis of metabolome data. Mol Syst Biol. 2006;2 doi: 10.1038/msb4100074. 2006 0034. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Reed JL, Vo TD, Schilling CH, Palsson BØ. An expanded genome-scale model of Escherichia coli K12 (iJR904 GSM/GPR) Genome Biol. 2003;4:R54. doi: 10.1186/gb-2003-4-9-r54. (open access at http://genomebiology.com/2003/2004/2009/R2054) [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Förster J, Famili I, Palsson BØ, Nielsen J. Large-scale evaluation of in silico deletions in Saccharomyces cerevisiae. Omics. 2003;7:193–202. doi: 10.1089/153623103322246584. [DOI] [PubMed] [Google Scholar]
28.Kanehisa M, et al. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006;34:D354–D357. doi: 10.1093/nar/gkj102. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Nash R, et al. Expanded protein information at SGD: new pages and proteome browser. Nucleic Acids Res. 2007;35:D468–D471. doi: 10.1093/nar/gkl931. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Caspi R, et al. The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res. 2008;36:D623–D631. doi: 10.1093/nar/gkm900. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Li XJ, et al. In: Metabolic profiling: its role in biomarker discovery and gene function analysis. Harrigan GG, Goodacre R, editors. Boston: Kluwer Academic Publishers; 2003. pp. 293–309. [Google Scholar]
32.Goble C, Wroe C. The Montagues and the Capulets. Comp Func Genomics. 2004;5:623–632. doi: 10.1002/cfg.442. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Ananiadou S, Kell DB, Tsujii J-i. Text Mining and its potential applications in Systems Biology. Trends Biotechnol. 2006;24:571–579. doi: 10.1016/j.tibtech.2006.10.002. [DOI] [PubMed] [Google Scholar]
34.Poolman MG, Bonde BK, Gevorgyan A, Patel HH, Fell DA. Challenges to be faced in the reconstruction of metabolic networks from public databases. IEE Proc Syst Biol. 2006;153:379–384. doi: 10.1049/ip-syb:20060012. [DOI] [PubMed] [Google Scholar]
35.Spasić I, et al. Facilitating the development of controlled vocabularies for metabolomics with text mining. BMC Bioinformatics. 2008;9:S5. doi: 10.1186/1471-2105-9-S5-S5. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Williams AJ. Internet-based tools for communication and collaboration in chemistry. Drug Discov Today. 2008;13:502–506. doi: 10.1016/j.drudis.2008.03.015. [DOI] [PubMed] [Google Scholar]
37.Williams AJ. A perspective of publicly accessible/open-access chemistry databases. Drug Discov Today. 2008;13:495–501. doi: 10.1016/j.drudis.2008.03.017. [DOI] [PubMed] [Google Scholar]
38.Ma H, et al. The Edinburgh human metabolic network reconstruction and its functional analysis. Mol Syst Biol. 2007;3:135. doi: 10.1038/msb4100177. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Brooksbank C, Cameron G, Thornton J. The European Bioinformatics Institute's data resources: towards systems biology. Nucleic Acids Res. 2005;33:D46–D53. doi: 10.1093/nar/gki026. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Wheeler DL, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2007;35:D5–D12. doi: 10.1093/nar/gkl1031. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Weininger D. SMILES, a chemical language and information system.1. Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 1988;28:31–36. [Google Scholar]
42.Coles SJ, Day NE, Murray-Rust P, Rzepa HS, Zhang Y. Enhancement of the chemical semantic web through the use of InChI identifiers. Organic & Biomolecular Chemistry. 2005;3:1832–1834. doi: 10.1039/b502828k. [DOI] [PubMed] [Google Scholar]
43.Wishart DS, et al. HMDB: the Human Metabolome Database. Nucleic Acids Res. 2007;35:D521–D526. doi: 10.1093/nar/gkl923. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.The UniProt Consortium The universal protein resource (UniProt) Nucleic Acids Res. 2008;36:D190–D195. doi: 10.1093/nar/gkm895. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Ashburner M, et al. Gene Ontology: tool for the unification of biology. Nature Genetics. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Sud M, et al. LMSD: LIPID MAPS structure database. Nucleic Acids Res. 2007;35:D527–D532. doi: 10.1093/nar/gkl838. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Barabási A-L, Oltvai ZN. Network biology: understanding the cell's functional organization. Nat. Rev. Genet. 2004;5:101–113. doi: 10.1038/nrg1272. [DOI] [PubMed] [Google Scholar]
48.Wagner A, Fell DA. The small world inside large metabolic networks. Proc R Soc Lond B Biol Sci. 2001;268:1803–1810. doi: 10.1098/rspb.2001.1711. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Hoops S, et al. COPASI: a COmplex PAthway SImulator. Bioinformatics. 2006;22:3067–3074. doi: 10.1093/bioinformatics/btl485. [DOI] [PubMed] [Google Scholar]
50.Vallabhajosyula RR, Chickarmane V, Sauro HM. Conservation analysis of large biochemical networks. Bioinformatics. 2006;22:346–353. doi: 10.1093/bioinformatics/bti800. [DOI] [PubMed] [Google Scholar]
51.Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Funahashi A, Tanimura N, Morohashi M, Kitano H. CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. Biosilico. 2003;1:159–162. [Google Scholar]
53.Li P, Oinn T, Stoiland S, Kell DB. Automated manipulation of systems biology models using libSBML within Taverna workflows. Bioinformatics. 2008;24:287–289. doi: 10.1093/bioinformatics/btm578. [DOI] [PubMed] [Google Scholar]
54.Bornstein BJ, Keating SM, Jouraku A, Hucka M. LibSBML: an API library for SBML. Bioinformatics. 2008;24:880–881. doi: 10.1093/bioinformatics/btn051. [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Surowiecki J. The wisdom of crowds: why the many are smarter than the few. London: Abacus; 2004. [Google Scholar]
56.Tapscott D, Williams A. Wikinomics: how mass collaboration changes everything. New Paradigm; 2007. [Google Scholar]
57.Palsson B. Two-dimensional annotation of genomes. Nat Biotechnol. 2004;22:1218–1219. doi: 10.1038/nbt1004-1218. [DOI] [PubMed] [Google Scholar]
58.Whelan KE, King RD. Using a logical model to predict the growth of yeast. BMC Bioinformatics. 2008;9 doi: 10.1186/1471-2105-9-97. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Blank LM, Kuepfer L, Sauer U. Large-scale 13C-flux analysis reveals mechanistic principles of metabolic network robustness to null mutations in yeast. Genome Biol. 2005;6:R49. doi: 10.1186/gb-2005-6-6-r49. [DOI] [PMC free article] [PubMed] [Google Scholar]
60.Kell DB. Systems biology, metabolic modelling and metabolomics in drug discovery and development. Drug Disc Today. 2006;11:1085–1092. doi: 10.1016/j.drudis.2006.10.004. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp Fig 1

NIHMS574724-supplement-Supp_Fig_1.doc^{(53.5KB, doc)}

Supp Table

NIHMS574724-supplement-Supp_Table.txt^{(199.8KB, txt)}

[R1] 1.Kell DB. Metabolomics, modelling and machine learning in systems biology: towards an understanding of the languages of cells. The 2005 Theodor Bücher lecture. FEBS J. 2006;273:873–894. doi: 10.1111/j.1742-4658.2006.05136.x. [DOI] [PubMed] [Google Scholar]

[R2] 2.Arakawa K, Yamada Y, Shinoda K, Nakayama Y, Tomita M. GEM System: automatic prototyping of cell-wide metabolic pathway models from genomes. BMC Bioinformatics. 2006;7:168. doi: 10.1186/1471-2105-7-168. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Palsson BØ. Systems biology: properties of reconstructed networks. Cambridge: Cambridge University Press; 2006. [Google Scholar]

[R4] 4.Duarte NC, et al. Global reconstruction of the human metabolic network based on genomic and bibliomic data. Proc Natl Acad Sci. 2007;104:1777–1782. doi: 10.1073/pnas.0610772104. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Mager WH, Winderickx J. Yeast as a model for medical and medicinal research. Trends Pharmacol Sci. 2005;26:265–273. doi: 10.1016/j.tips.2005.03.004. [DOI] [PubMed] [Google Scholar]

[R6] 6.Goffeau A, et al. Life With 6000 Genes. Science. 1996;274:546–567. doi: 10.1126/science.274.5287.546. [DOI] [PubMed] [Google Scholar]

[R7] 7.Winzeler EA, et al. Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Science. 1999;285:901–906. doi: 10.1126/science.285.5429.901. [DOI] [PubMed] [Google Scholar]

[R8] 8.Giaever G, et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002;418:387–391. doi: 10.1038/nature00935. [DOI] [PubMed] [Google Scholar]

[R9] 9.Yen K, Gitsham P, Wishart J, Oliver SG, Zhang N. An improved tetO promoter replacement system for regulating the expression of yeast genes. Yeast. 2003;20:1255–1262. doi: 10.1002/yea.1037. [DOI] [PubMed] [Google Scholar]

[R10] 10.Hughes TR, et al. Functional discovery via a compendium of expression profiles. Cell. 2000;102:109–126. doi: 10.1016/s0092-8674(00)00015-5. [DOI] [PubMed] [Google Scholar]

[R11] 11.Allen JK, et al. High-throughput characterisation of yeast mutants for functional genomics using metabolic footprinting. Nature Biotechnol. 2003;21:692–696. doi: 10.1038/nbt823. [DOI] [PubMed] [Google Scholar]

[R12] 12.Zhu H, et al. Global analysis of protein activities using proteome chips. Science. 2001;293:2101–2105. doi: 10.1126/science.1062191. [DOI] [PubMed] [Google Scholar]

[R13] 13.Castrillo JI, et al. Growth control of the eukaryote cell: a systems biology study in yeast. J. Biol. 2007;6:4. doi: 10.1186/jbiol54. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Delneri D, et al. Identification and characterization of high-flux-control genes of yeast through competition analyses in continuous cultures. Nat Genet. 2008;40:113–117. doi: 10.1038/ng.2007.49. [DOI] [PubMed] [Google Scholar]

[R15] 15.Wu J, Zhang N, Hayes A, Panoutsopoulou K, Oliver SG. Global analysis of nutrient control of gene expression in Saccharomyces cerevisiae during growth and starvation. Proc Natl Acad Sci U S A. 2004;101:3148–3153. doi: 10.1073/pnas.0308321100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Oliver SG. A network approach to the systematic analysis of gene function. Trends Genet. 1996;12:241–242. doi: 10.1016/0168-9525(96)30053-x. [DOI] [PubMed] [Google Scholar]

[R17] 17.Suter B, Auerbach D, Stagljar I. Yeast-based functional genomics and proteomics technologies: the first 15 years and beyond. Biotechniques. 2006;40:625–644. doi: 10.2144/000112151. [DOI] [PubMed] [Google Scholar]

[R18] 18.Förster J, Famili I, Fu P, Palsson BØ, Nielsen J. Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. Genome Res. 2003;13:244–253. doi: 10.1101/gr.234503. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Duarte NC, Herrgard MJ, Palsson BØ. Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res. 2004;14:1298–1309. doi: 10.1101/gr.2250904. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Kuepfer L, Sauer U, Blank LM. Metabolic functions of duplicate genes in Saccharomyces cerevisiae. Genome Res. 2005;15:1421–1430. doi: 10.1101/gr.3992505. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Caspi R, et al. MetaCyc: a multiorganism database of metabolic pathways and enzymes. Nucleic Acids Res. 2006;34:D511–D516. doi: 10.1093/nar/gkj128. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Hucka M, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19:524–531. doi: 10.1093/bioinformatics/btg015. [DOI] [PubMed] [Google Scholar]

[R23] 23.Le Novère N, et al. Minimum information requested in the annotation of biochemical models (MIRIAM) Nat Biotechnol. 2005;23:1509–1515. doi: 10.1038/nbt1156. [DOI] [PubMed] [Google Scholar]

[R24] 24.Çakir T, et al. Integration of metabolome data with metabolic networks reveals reporter reactions. Mol Syst Biol. 2006;2:50. doi: 10.1038/msb4100085. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Kümmel A, Panke S, Heinemann M. Putative regulatory sites unraveled by network-embedded thermodynamic analysis of metabolome data. Mol Syst Biol. 2006;2 doi: 10.1038/msb4100074. 2006 0034. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Reed JL, Vo TD, Schilling CH, Palsson BØ. An expanded genome-scale model of Escherichia coli K12 (iJR904 GSM/GPR) Genome Biol. 2003;4:R54. doi: 10.1186/gb-2003-4-9-r54. (open access at http://genomebiology.com/2003/2004/2009/R2054) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Förster J, Famili I, Palsson BØ, Nielsen J. Large-scale evaluation of in silico deletions in Saccharomyces cerevisiae. Omics. 2003;7:193–202. doi: 10.1089/153623103322246584. [DOI] [PubMed] [Google Scholar]

[R28] 28.Kanehisa M, et al. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006;34:D354–D357. doi: 10.1093/nar/gkj102. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Nash R, et al. Expanded protein information at SGD: new pages and proteome browser. Nucleic Acids Res. 2007;35:D468–D471. doi: 10.1093/nar/gkl931. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Caspi R, et al. The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res. 2008;36:D623–D631. doi: 10.1093/nar/gkm900. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] 31.Li XJ, et al. In: Metabolic profiling: its role in biomarker discovery and gene function analysis. Harrigan GG, Goodacre R, editors. Boston: Kluwer Academic Publishers; 2003. pp. 293–309. [Google Scholar]

[R32] 32.Goble C, Wroe C. The Montagues and the Capulets. Comp Func Genomics. 2004;5:623–632. doi: 10.1002/cfg.442. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Ananiadou S, Kell DB, Tsujii J-i. Text Mining and its potential applications in Systems Biology. Trends Biotechnol. 2006;24:571–579. doi: 10.1016/j.tibtech.2006.10.002. [DOI] [PubMed] [Google Scholar]

[R34] 34.Poolman MG, Bonde BK, Gevorgyan A, Patel HH, Fell DA. Challenges to be faced in the reconstruction of metabolic networks from public databases. IEE Proc Syst Biol. 2006;153:379–384. doi: 10.1049/ip-syb:20060012. [DOI] [PubMed] [Google Scholar]

[R35] 35.Spasić I, et al. Facilitating the development of controlled vocabularies for metabolomics with text mining. BMC Bioinformatics. 2008;9:S5. doi: 10.1186/1471-2105-9-S5-S5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] 36.Williams AJ. Internet-based tools for communication and collaboration in chemistry. Drug Discov Today. 2008;13:502–506. doi: 10.1016/j.drudis.2008.03.015. [DOI] [PubMed] [Google Scholar]

[R37] 37.Williams AJ. A perspective of publicly accessible/open-access chemistry databases. Drug Discov Today. 2008;13:495–501. doi: 10.1016/j.drudis.2008.03.017. [DOI] [PubMed] [Google Scholar]

[R38] 38.Ma H, et al. The Edinburgh human metabolic network reconstruction and its functional analysis. Mol Syst Biol. 2007;3:135. doi: 10.1038/msb4100177. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] 39.Brooksbank C, Cameron G, Thornton J. The European Bioinformatics Institute's data resources: towards systems biology. Nucleic Acids Res. 2005;33:D46–D53. doi: 10.1093/nar/gki026. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] 40.Wheeler DL, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2007;35:D5–D12. doi: 10.1093/nar/gkl1031. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] 41.Weininger D. SMILES, a chemical language and information system.1. Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 1988;28:31–36. [Google Scholar]

[R42] 42.Coles SJ, Day NE, Murray-Rust P, Rzepa HS, Zhang Y. Enhancement of the chemical semantic web through the use of InChI identifiers. Organic & Biomolecular Chemistry. 2005;3:1832–1834. doi: 10.1039/b502828k. [DOI] [PubMed] [Google Scholar]

[R43] 43.Wishart DS, et al. HMDB: the Human Metabolome Database. Nucleic Acids Res. 2007;35:D521–D526. doi: 10.1093/nar/gkl923. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R44] 44.The UniProt Consortium The universal protein resource (UniProt) Nucleic Acids Res. 2008;36:D190–D195. doi: 10.1093/nar/gkm895. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] 45.Ashburner M, et al. Gene Ontology: tool for the unification of biology. Nature Genetics. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R46] 46.Sud M, et al. LMSD: LIPID MAPS structure database. Nucleic Acids Res. 2007;35:D527–D532. doi: 10.1093/nar/gkl838. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R47] 47.Barabási A-L, Oltvai ZN. Network biology: understanding the cell's functional organization. Nat. Rev. Genet. 2004;5:101–113. doi: 10.1038/nrg1272. [DOI] [PubMed] [Google Scholar]

[R48] 48.Wagner A, Fell DA. The small world inside large metabolic networks. Proc R Soc Lond B Biol Sci. 2001;268:1803–1810. doi: 10.1098/rspb.2001.1711. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R49] 49.Hoops S, et al. COPASI: a COmplex PAthway SImulator. Bioinformatics. 2006;22:3067–3074. doi: 10.1093/bioinformatics/btl485. [DOI] [PubMed] [Google Scholar]

[R50] 50.Vallabhajosyula RR, Chickarmane V, Sauro HM. Conservation analysis of large biochemical networks. Bioinformatics. 2006;22:346–353. doi: 10.1093/bioinformatics/bti800. [DOI] [PubMed] [Google Scholar]

[R51] 51.Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R52] 52.Funahashi A, Tanimura N, Morohashi M, Kitano H. CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. Biosilico. 2003;1:159–162. [Google Scholar]

[R53] 53.Li P, Oinn T, Stoiland S, Kell DB. Automated manipulation of systems biology models using libSBML within Taverna workflows. Bioinformatics. 2008;24:287–289. doi: 10.1093/bioinformatics/btm578. [DOI] [PubMed] [Google Scholar]

[R54] 54.Bornstein BJ, Keating SM, Jouraku A, Hucka M. LibSBML: an API library for SBML. Bioinformatics. 2008;24:880–881. doi: 10.1093/bioinformatics/btn051. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R55] 55.Surowiecki J. The wisdom of crowds: why the many are smarter than the few. London: Abacus; 2004. [Google Scholar]

[R56] 56.Tapscott D, Williams A. Wikinomics: how mass collaboration changes everything. New Paradigm; 2007. [Google Scholar]

[R57] 57.Palsson B. Two-dimensional annotation of genomes. Nat Biotechnol. 2004;22:1218–1219. doi: 10.1038/nbt1004-1218. [DOI] [PubMed] [Google Scholar]

[R58] 58.Whelan KE, King RD. Using a logical model to predict the growth of yeast. BMC Bioinformatics. 2008;9 doi: 10.1186/1471-2105-9-97. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R59] 59.Blank LM, Kuepfer L, Sauer U. Large-scale 13C-flux analysis reveals mechanistic principles of metabolic network robustness to null mutations in yeast. Genome Biol. 2005;6:R49. doi: 10.1186/gb-2005-6-6-r49. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R60] 60.Kell DB. Systems biology, metabolic modelling and metabolomics in drug discovery and development. Drug Disc Today. 2006;11:1085–1092. doi: 10.1016/j.drudis.2006.10.004. [DOI] [PubMed] [Google Scholar]

PERMALINK

A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology

Markus J Herrgård

Neil Swainston

Paul Dobson

Warwick B Dunn

K Yalçin Arga

Mikko Arvas

Nils Blüthgen

Simon Borger

Roeland Costenoble

Matthias Heinemann

Michael Hucka

Nicolas Le Novère

Peter Li

Wolfram Liebermeister

Monica L Mo

Ana Paula Oliveira

Dina Petranovic

Stephen Pettifer

Evangelos Simeonidis

Kieran Smallbone

Irena Spasić

Dieter Weichart

Roger Brent

David S Broomhead

Hans V Westerhoff

Betül Kırdar

Merja Penttilä

Edda Klipp

Bernhard Ø Palsson

Uwe Sauer

Stephen G Oliver

Pedro Mendes

Jens Nielsen

Douglas B Kell

Abstract

Introduction

Consensus reconstruction process

Table 1.

Metabolite naming conventions

Annotation of large-scale metabolic models in SBML

Figure 2.

Contents of the consensus reconstructions

Table 2.

Figure 1. Degree distribution of the metabolic network.

Table 3.

Dissemination and future curation of the reconstruction

Discussion

Supplementary Material

Acknowledgments

Footnotes

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases