Skip to main content
G3: Genes | Genomes | Genetics logoLink to G3: Genes | Genomes | Genetics
. 2017 Feb 28;7(4):1239–1249. doi: 10.1534/g3.116.037218

The Pivotal Role of Protein Phosphorylation in the Control of Yeast Central Metabolism

Panayotis Vlastaridis *, Athanasios Papakyriakou *,, Anargyros Chaliotis *, Efstratios Stratikos , Stephen G Oliver ‡,§,1, Grigorios D Amoutzias *,1
PMCID: PMC5386872  PMID: 28250014

Abstract

Protein phosphorylation is the most frequent eukaryotic post-translational modification and can act as either a molecular switch or rheostat for protein functions. The deliberate manipulation of protein phosphorylation has great potential for regulating specific protein functions with surgical precision, rather than the gross effects gained by the over/underexpression or complete deletion of a protein-encoding gene. In order to assess the impact of phosphorylation on central metabolism, and thus its potential for biotechnological and medical exploitation, a compendium of highly confident protein phosphorylation sites (p-sites) for the model organism Saccharomyces cerevisiae has been analyzed together with two more datasets from the fungal pathogen Candida albicans. Our analysis highlights the global properties of the regulation of yeast central metabolism by protein phosphorylation, where almost half of the enzymes involved are subject to this sort of post-translational modification. These phosphorylated enzymes, compared to the nonphosphorylated ones, are more abundant, regulate more reactions, have more protein–protein interactions, and a higher fraction of them are ubiquitinated. The p-sites of metabolic enzymes are also more conserved than the background p-sites, and hundreds of them have the potential for regulating metabolite production. All this integrated information has allowed us to prioritize thousands of p-sites in terms of their potential phenotypic impact. This multi-source compendium should enable the design of future high-throughput (HTP) mutation studies to identify key molecular switches/rheostats for the manipulation of not only the metabolism of yeast, but also that of many other biotechnologically and medically important fungi and eukaryotes.

Keywords: yeast, metabolism, phosphorylation, comparative phosphoproteomics


Since the advent of the functional genomic technologies, there has been an ongoing community effort to characterize and model the complete metabolic network of the yeast, Saccharomyces cerevisiae (Herrgård et al. 2008). The aim is to be able to simulate yeast metabolism in silico and so generate accurate predictions of the phenotypic consequences of genetic manipulations, including multiple gene deletions (Szappanos et al. 2011) and the recruitment of foreign genes to construct novel biosynthetic pathways (Szczebara et al. 2003; Galanie et al. 2015; Nielsen 2015). Genome-scale stoichiometric models of the yeast metabolic network that allow the computation of the steady-state distribution of metabolic fluxes (Flux Balance Analysis) have proved useful in this regard (Dobson et al. 2010; Orth et al. 2010).

Despite these successes, there is an urgent need to improve these models by incorporating metabolic control and biomass composition in an accurate and context-dependent manner (Dikicioglu et al. 2015), as well as the various levels of transcriptional and post-transcriptional regulation (Pir et al. 2012). Several studies have clearly revealed the high importance of post-transcriptional regulation (Gygi et al. 1999; Greenbaum et al. 2003; Castrillo et al. 2007; Schwanhäusser et al. 2011). Physiological perturbations can trigger a rapid reconfiguration of the fluxes through the metabolic network and the immediacy of such responses is thought to be largely due to changes at the level of enzyme activity, rather than changes in the expression of enzyme-encoding genes (Ralser et al. 2009; Bouwman et al. 2011; Oliveira et al. 2012; Kochanowski et al. 2013). These alterations in enzyme activity are often the consequence of the interactions of these protein catalysts with small molecules, including substrates and cofactors. However, the post-translational modification of enzyme molecules, e.g., by phosphorylation, are likely to play an important role in metabolic adaptations since they also have rapid kinetics (Oliveira et al. 2012; Oliveira and Sauer 2012; Schulz et al. 2014; Tripodi et al. 2015; Chen and Nielsen 2016). Intriguingly, the energetic cost of protein synthesis is nine times higher than that of transcription (Schwanhäusser et al. 2011); therefore, post-translational regulation via amino acid modifications seems to be a very rapid and energy efficient level of regulation.

Protein phosphorylation is the most abundant post-translational modification that may alter the structure, function, localization, molecular interactions, or degradation of a protein (Nishi et al. 2014), and may therefore function as a molecular switch or rheostat of enzyme activity (Chen and Nielsen 2016). The importance of this level of regulation is highlighted by the fact that up to 23% of intracellular ATP may be utilized by protein kinases for phosphorylating their numerous targets (Ptacek et al. 2005; Carpy et al. 2014). Furthermore, this type of regulation is expected to be tightly controlled, otherwise the ATP supply would be rapidly depleted (Krebs and Stull 1975). The identification of crucial p-sites in key proteins offers synthetic biologists the prospect of manipulating molecular pathways or organismal phenotypes with greater precision than can be achieved by either the deletion or under/overexpression of complete genes (Oliveira et al. 2012; Oliveira and Sauer 2012).

The advent of HTP phosphoproteomic technologies in the last decade has revolutionized the field, since hundreds or even thousands of p-sites may be identified within a single HTP experiment. Nevertheless, serious concerns have been raised about the quality of these p-site identifications in terms of both technical and biological noise (Lienhard 2008); indeed, it has been suggested that up to 65% of these p-sites may be nonfunctional (Landry et al. 2009, 2014). In addition, the various phosphoproteomic protocols capture distinct fractions of the total phosphoproteome with moderate overlap among them (Bodenmiller et al. 2007). Hence, any analysis of phosphoproteomic data poses a series of challenges (Lee et al. 2015; Vlastaridis et al. 2016). Thus, before identifying p-sites with potentially significant impact on protein function and organismal phenotype, there is an urgent need to: (i) stringently filter these HTP data and (ii) compile datasets from many and diverse protocols to ameliorate any potential biases (Amoutzias et al. 2012).

The goal of this study is to employ a compendium of stringently filtered and diverse phosphoproteomic data from the best-studied model eukaryote, S. cerevisiae and the pathogenic fungus Candida albicans together with evolutionary, functional genomic, and phenotypic data so as to: (i) reveal the impact of protein phosphorylation on central metabolism, and (ii) prioritize the metabolism-related yeast p-sites in terms of biological significance and assess their potential as targets of future mutation studies with a focus on biotechnological and medical applications. Furthermore, by identifying crucial phosphorylation switches that regulate yeast metabolism, it should be possible, with minimal effort, to significantly improve the predictive accuracy of metabolic flux balance analyses.

Methods

For S. cerevisiae, a high quality compendium of p-sites has been employed from another computational analysis of our group concerning the estimation of the total number of phosphoproteins and p-sites in several eukaryotic species (Vlastaridis et al. 2017). This compendium was generated from 20 HTP phosphoproteomic experiments found in 18 publications (Gruhler et al. 2005; Chi et al. 2007; Li et al. 2007; Albuquerque et al. 2008; Bodenmiller et al. 2008, 2010; Beltrao et al. 2009; Huber et al. 2009; Holt et al. 2009; Gnad et al. 2009; Soufi et al. 2009; Aguiar et al. 2010; Saleem et al. 2010; Wu et al. 2011; Oliveira et al. 2012; Mascaraque et al. 2013; Lee et al. 2013; Weinert et al. 2014). Very stringent criteria were applied, such as 99% correct phosphopeptide identification and 99% correct p-site localization (see Supplemental Material, File S1; spreadsheet: yeast p-sites). This compendium was an update of a previous yeast compendium from 12 HTP datasets (Amoutzias et al. 2012). In addition, the PhosphoGRID 2 dataset of manually curated low-throughput (LTP) p-sites (serving as a “gold standard”) (Sadowski et al. 2013) was integrated into the compendium. For comparative phosphoproteomic analyses, two datasets from C. albicans (Beltrao et al. 2009; Willger et al. 2015) were mined and filtered by applying the same stringent criteria as for S. cerevisiae. All filtered p-sites from the two species are organized in two spreadsheets (S. cerevisiae p-sites and C. abicans p-sites) within File S1 and File S3.

For the comparative phosphoproteomics analysis between S. cerevisiae S288C (Goffeau et al. 1996) and C. albicans (SC5314 Assembly 21, haploid protein complement), orthologous relationships were retrieved from the Candida Gene Order Browser (Maguire et al. 2013) using synteny or, if not available, the best Blast hit. To estimate the conservation of yeast p-sites in the orthologs of various ascomycetes, orthologies were retrieved from the fungal orthogroups repository (Byrne and Wolfe 2005; Wapinski et al. 2007). For each orthologous pair of sequences, pairwise global alignment was performed with the SSearch software (Pearson 2000) and orthologous amino acids were retrieved from each alignment.

Once orthologs had been identified, the conservation of a p-site in certain ascomycetae ancestors was assessed by two different methods. In the first method, a pairwise comparison of the homologous amino acids between S. cerevisiae and another ascomycete was performed. If the amino acid phosphorylated in S. cerevisiae was also found conserved as serine, threonine, or tyrosine in the other species, then the p-site was assumed to be present in their common ancestor. In the second method, the ancestral amino acid was inferred by maximum likelihood ancestral sequence reconstruction, using the MEGA7 software (Kumar et al. 2016). Conservation of S. cerevisiae p-sites in the other ascomycetes is stored in the Excel spreadsheets “conservation_pairwise_comp” and “conservation_MEGA_ASR” of File S3. Divergence dates between extant fungi were retrieved from the TimeTree database (Hedges et al. 2015).

For the functional and statistical analyses, many publicly available functional genomics datasets were integrated, such as three protein abundance datasets from two publications (Ghaemmaghami et al. 2003; Newman et al. 2006), two protein half-lives datasets (Belle et al. 2006; Christiano et al. 2014), one compendium/list of highly confident essential genes (Giaever et al. 2002; Steinmetz et al. 2002; Pache et al. 2009), one protein ubiquitination dataset (Peng et al. 2003), one dataset of highly confident genetic interactions (Costanzo et al. 2010), one compendium of highly confident protein–protein interactions (Batada et al. 2006), a list of genes and the metabolic reactions that they are involved in (included in the updated version 7.6 of the yeast metabolic model) (Dobson et al. 2010), and a dataset of biotechnologically important genes that have been annotated as such in the Saccharomyces Genome Database (Cherry et al. 2012). The integrated functional data are stored in the Excel spreadsheets “yeast p-sites” and “functional_information” of File S1 and File S2. Of note, many of the above properties/measurements may be context dependent or change significantly from one physiological condition to another.

A negative phosphoproteome was also defined, which comprised an extension of the negative phosphoproteome from our previous study of 2012 (Amoutzias et al. 2012). More specifically, in the 2012 study, a nonphosphoproteome comprised 2219 ORFs that had no evidence of phosphorylation, even with less stringent filtering criteria. In this updated analysis, any of these 2219 ORFs that were now found to be phosphorylated were removed from the negative phosphoproteome, resulting in an updated negative set of 2167 ORFs.

Data integration was performed with the PERL programming language and statistical analyses with the R programming language (https://www.R-project.org/) (R Core Team 2015). Mapping of the yeast phosphoregulated enzymes to the KEGG metabolic map was performed with the KEGG mapper computational tool (Kanehisa et al. 2012), using the Uniprot identifiers of the yeast phosphorylated proteins.

To control for protein abundance as a potential confounding factor (Levy et al. 2012) in the comparison between the phosphoproteome and the negative phosphoproteome, relevant abundance measurements [based on the most thorough dataset of Ghaemmaghami et al. (2003)] were converted to log10 values and binned in 8–10 groups. Equal numbers of phosphoproteins and nonphosphoproteins were randomly selected from each bin, thus generating a Protein Abundance Controlled phosphoproteome and negative phosphoproteome. The same procedure was followed for the metabolic phosphoproteome and the metabolic negative phosphoproteome.

For the structural analyses, the available X-ray crystal structures of selected enzymes were retrieved from RCSB PDB (Rose et al. 2015). The interactions of p-sites with surrounding residues and ligands or substrates were identified and then all heteroatoms were removed prior to the simulations. Molecular dynamics (MD) were performed for selected enzymes in their native and phosphorylated states with all-atom representation in explicit solvent using AMBER 14 and the ff14SB force field (Case et al. 2005; Hornak et al. 2006). The phosphorylated enzymes were prepared by mutating the corresponding residues to their phosphorylated forms (net charge of −2e), which were treated with the optimized parameters of the phosaa10 force field (Homeyer et al. 2006). Simulations were performed for 100 ns using the GPU-version of the PMEMD program (Salomon-Ferrer et al. 2013) and the trajectory analysis was performed with the CPPTRAJ module of AmberTools 15 (Roe and Cheatham 2013) after mass-weighted RMSD fitting with respect to the initial coordinates of the backbone atoms. Visual inspection of the trajectories and rendering of the figures was performed with VMD (v1.9) (Humphrey et al. 1996).

Data availability

The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article.

Results and Discussion

The updated yeast p-site compendium

The new S. cerevisiae compendium consists of 14,339 p-sites in 2633 ORFs (see Table 1 and Excel spreadsheet “yeast p-sites” of File S1) and constitutes a significant increase of 47% (for p-sites) over a previous compendium of 12 publicly available HTP phosphoproteomic datasets (Amoutzias et al. 2012). It is designated as 21UHQ, where 21 stands for the number of datasets, U stands for phosphopeptides uniquely matched to only one protein, and HQ stands for high-quality phosphopeptides, based predominantly on 99% correct peptide identification and 99% correct p-site localization. Compared to the original yeast p-site compendium, the new one has been expanded by eight more HTP datasets and also includes the latest version of the PhosphoGRID 2 (PG2) subset (Sadowski et al. 2013), which is based on manually curated LTP p-sites. PhosphoGRID is considered the gold standard of yeast p-sites.

Table 1. The number of unique p-sites and phosphoproteins identified in the various phosphorylation compendiums and subsets.

Total p-Sites Total p-Sites Found in PFAM Domains Highly Confident p-Sites Highly Confident p-Sites Found in PFAM Domains Phosphoproteins Phosphoproteins with Highly Confident p-Sites
12UHQ 9783 2059 2566 (26%) 431 2374 1112 (47%)
20UHQ (only HTP) 13,244 2625 4156 (31%) 698 2587 1421 (55%)
21UHQ (including PG2) 14,339 3036 5519 (38%) 1175 2633 1557 (59%)
21UHQ metabolism (including PG2) 1668 527 499 99 412 197
21UHQ metabolism essential proteins (including PG2) 339 153 79 34 71 36

p-sites identified in three or more experiments are designated as Highly Confident. 12UHQ refers to the Amoutzias et al. (2012) dataset. 20UHQ (only HTP) refers to the p-sites identified by 20 HTP experiments in Vlastaridis et al. (2017). 21UHQ (including PG2) refers to the p-sites identified by 20 HTP experiments and by PhosphoGrid2 in Vlastaridis et al. (2017). 21UHQ metabolism (including PG2) refers to the p-sites identified by 20 HTP experiments and by PhosphoGrid2 in Vlastaridis et al. (2017) that were in metabolic proteins. 21UHQ metabolism essential proteins (including PG2) refers to p-sites identified by 20 HTP experiments and by PhosphoGrid2 in Vlastaridis et al. (2017) that were in essential metabolic proteins. p-site, phosphorylation site; HTP, high-throughput.

Due to concerns about technical and biological noise in phosphoproteomic data (Lienhard 2008; Landry et al. 2009), we constructed a highly confident subset consisting of 5519 p-sites in 1557 ORFs that includes p-sites identified in three or more HTP experiments and/or any of the PG2 LTP data (see Table 1). The criterion for three or more experiments was based on simulations and a series of five different analyses with the original compendium (Amoutzias et al. 2012). The corresponding highly confident subset is now designated as 21UHQ_HC, where HC stands for High Confidence.

A crucial issue is the reliability of p-sites that have been identified only once or twice by HTP experiments, since biological and technical noise are serious concerns. In order to address this, the PG2 dataset was employed to perform a crude extrapolation. Of the 536 highly confident p-sites that are detected both by PG2 and any of the 20 HTP experiments, 270 were found in three or more HTP experiments (designated as high overlap HTP), whereas 266 are found in one or two HTP experiments (designated as low overlap HTP). The ratio for high/low overlap is almost one. Therefore, for every high overlap HTP p-site there exists one low overlap HTP p-site that has a high probability of being valid. Based on this empirical ratio, an extrapolation was performed on the whole phosphoproteomic dataset, which has 4156 high overlap and 9088 low overlap HTP p-sites. Thus, we predict that 46% of those 9088 low overlap sites could eventually be verified as highly confident by new experiments. The above crude extrapolation estimates that two thirds (66%) of the current total (low + high overlap) HTP p-sites will turn out to be functional. This is in moderately good agreement with an independent estimation by Landry et al. (2009) that was based on other datasets and an evolutionary analysis, where they estimated that 65% of their HTP p-sites could be nonfunctional. Nevertheless, in their dataset, they applied less stringent detection criteria than those we employed and probably included a higher fraction of noisy p-sites.

Our own literature mining revealed (at that time) that the phosphoproteomic data available for fungi other than S. cerevisiae were rather limited, although very recently a comparative phosphoproteomic analysis has been performed in 18 fungi (Studer et al. 2016). The only other closely related ascomycete for which sufficient phosphoproteomic data were available to allow meaningful comparative analyses was C. albicans, with two datasets comprising 9438 nonredundant p-sites. By identifying the homologous amino acids between S. cerevisiae and C. albicans (see Methods), comparative phosphoproteomics revealed that only 7% (692) of those 9438 C. albicans p-sites have also been identified as phosphorylated in S. cerevisiae. Interestingly, 12% (81/692) of these conserved phosphorylation events between the two species had a mutation from serine to threonine and vice versa to one of the two species. We did not observe such a mutation for phosphorylated tyrosines, most probably due to their very low number (five). These observations are explained by the lack of tyrosine kinases in yeast and the dual specificity of certain serine/threonine kinases that may further phosphorylate some tyrosines (Stern et al. 1991; Hunter and Plowman 1997; Zhu et al. 2000; Manning et al. 2002). Moreover, the use of the 9438 C. albicans p-sites together with amino acid conservation in S. cerevisiae suggests phosphorylation for another 2122 homologous serines, threonines, and tyrosines in S. cerevisiae that have not been detected as phosphorylated yet in that species, but that are likely to be detected by future studies; these would increase the S. cerevisiae phosphoproteome by 15%, from 14,399 to 16,461 p-sites.

The fact that few p-sites appear as phosphorylated and conserved between the two species is not surprising. It could be attributed to several factors, such as the incompleteness of the p-site compendia of the two species and experimental biases, since the C. albicans compendium was based only on two experiments (Boekhorst et al. 2008, 2011). In a recently published study, our group has estimated that the total S. cerevisiae phosphoproteome may be ca. 40,000 p-sites (Vlastaridis et al. 2017). Other contributing factors could be the evolutionary distance of ∼300 million years between S. cerevisiae and C. albicans (Hedges et al. 2015), the high evolutionary turnover generally observed for p-sites, the fast network rewiring at the phosphorylation-regulatory level, and the relaxed localization constraints for p-site conservation (Iakoucheva et al. 2004; Moses et al. 2007; Landry et al. 2009, 2014; Beltrao et al. 2009; Shou et al. 2011; Freschi et al. 2014). Overall, the extrapolation enabled by this comparative phosphoproteomics analysis did not have a profound effect on the quantitative expansion of the S. cerevisiae dataset. On the other hand, the conserved p-sites that have withstood such strong forces of mutation and evolution and are detected by these yet imperfect technologies are expected to be of very high functional importance; thus, in qualitative terms, the gain may be greater than the simple increase in p-site numbers implies.

A substantial part of the yeast central metabolism is regulated by phosphorylation

The Yeast 7.6 genome-scale metabolic model is manually curated by experts and contains 2302 reactions that have been assigned one or more of the 909 (15%) protein-coding genes to be catalyzing these specific reactions. Based on the stringency criteria to define a p-site (designated as ALL for all p-sites and HC), 412 (45%) or 197 (22%) of the metabolic proteins are phosphorylated and may control 1176 or 656 reactions, respectively. Thus, protein phosphorylation is likely to exert significant control over the yeast central metabolism (see Figure 1). A previous analysis on older and less filtered datasets also identified half of the metabolic proteins as being phosphorylated (Oliveira and Sauer 2012). Similarly, a review focused on yeast carbon metabolism reported more than half of the relevant enzymes to be targets of post-translational modifications (Tripodi et al. 2015), whereas another review has identified 41 phosphoregulated enzymes that have been experimentally verified (Chen and Nielsen 2016). Furthermore, genetic perturbations of the yeast kinome revealed significant changes in concentrations of hundreds of intracellular metabolites (Schulz et al. 2014). Although the current phosphoproteomic data are incomplete in terms of individual p-site detection, an analysis by our group has revealed that most of the phosphoproteins have already been detected (Vlastaridis et al. 2017), thus these conclusions appear robust.

Figure 1.

Figure 1

Protein phosphorylation is likely to exert significant control over S. cerevisiae central metabolism. Nodes represent metabolites and lines represent reactions in the Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic map. Blue color is for reactions that are controlled by at least one enzyme that undergoes phosphorylation. Red color is for reactions that are controlled by at least one enzyme that contains High Confidence (HC) p-site/s. Mapping was performed with the KEGG mapper tool (Kanehisa et al. 2012), using the Uniprot identifiers of the yeast phosphorylated enzymes.

A significant proportion of metabolic proteins are phosphorylated and yet there does not seem to be any major enrichment or depletion for phosphorylation in metabolic enzymes compared to the rest of the proteome (45 and 27% for ALL and HC, respectively). Twelve percent (1668/14339) of ALL and 9% (499/5519) of HC p-sites are found in metabolic proteins (designated as phosphometabolic proteins). On average, phosphometabolic proteins have 4 and 2.5 p-sites (ALL and HC, respectively), whereas the rest of the phosphoproteome has 5.7 and 3.7 p-sites, respectively, a statistically significant difference (Wilcoxon p-value < 0.006). In addition, 31% of ALL metabolic and 20% of HC metabolic p-sites are found within PFAM domains, indicating a potentially significant impact on structure, and probably on function. In contrast, 21% of ALL and 21% of HC p-sites are found within PFAM domains (see Table 1). Nevertheless, the next section shows that important enzymes tend to be regulated by phosphorylation.

The general properties of yeast central metabolism likely to be regulated by phosphorylation

The general properties of the phosphorylated metabolic proteins (designated as phosphometabolic), compared to the negative phosphometabolic proteins, are summarized in Figure 2 and in more detail in File S2, Excel spreadsheet “stats.” All subsequent reported differences are statistically significant (p-value < 0.05) and were performed with the appropriate Wilcoxon or χ-squared test. Phosphometabolic proteins are: (i) significantly more abundant (305–540% higher), (ii) have more kinase–target interactions (1–1.4 vs. 0.3–0.4; 185–327% higher), (iii) have longer total length (602–682 vs. 369–388 amino acids; 55–76% higher), (iv) longer intrinsically disordered regions (159–204 vs. 71–74 amino acids; 114–175% higher), (v) more protein–protein interactions (1–1.5 vs. 0.5; 75–194% higher), and (vi) regulate more reactions (4–5 vs. 3.7–3.8; 3–36% higher). Furthermore, a higher fraction of them are ubiquitinated (37–53% vs. 19–23%; 64–176% higher). It seems that some synergism exists between protein phosphorylation and ubiquitination in the proteins of the yeast metabolic network (Tripodi et al. 2015). All of the above conclusions hold true even when controlling for protein abundance as a confounding factor. GOSlim analysis with Bingo (Maere et al. 2005) revealed an enrichment for the GO term “Vacuole,” when phosphometabolic proteins were compared to the background (all metabolic proteins). In general, phosphometabolic proteins retain many of the general properties of the whole phosphoproteome (see Figure 2), except the higher number of genetic interactions, the shorter protein half-lives [only for the Belle et al. (2006) dataset; conflicting results for the Christiano et al. (2014) dataset], and the higher fraction of essential genes (when controlling for protein abundance). Reassuringly, analyses on this updated whole phosphoproteome compendium confirm the global properties observed in a previous analysis (Amoutzias et al. 2012) with a compendium of 12 HTP datasets, even when controlling for protein abundance. This was expected, since the 2012 dataset comprised 2372 phosphoproteins, whereas the new compendium comprises 2633 phosphoproteins. This is another indication of the view that the majority of the yeast phosphoproteome has been discovered (Vlastaridis et al. 2017).

Figure 2.

Figure 2

The general properties of the phosphoproteome, compared to the negative phosphoproteome. The bars show which properties of the phosphoproteome are higher/lower (% difference), compared to the negative phosphoproteome. Only statistically significant differences are shown. This is estimated for various datasets. HC, High Confidence subset of the phosphoproteome; MetPhos_vs_MetNegPhos, metabolic proteins of the phosphoproteome vs. metabolic proteins of the negative phosphoproteome set. PAC, Protein Abundance Controlled dataset; Phos_vs_NegPhos: phosphoproteome vs. negative phosphoproteome.

In terms of evolution, gene duplications and especially the whole-genome duplication (WGD) that occurred in the hemiascomycete lineage ∼100 million years ago (Pöhlmann and Philippsen 1996; Wolfe and Shields 1997; Kellis et al. 2004; Dietrich et al. 2004; Scannell et al. 2007) are known to have played a significant role in shaping the yeast genome and especially metabolism (Papp et al. 2004; Conant and Wolfe 2007; Conant 2014). Intriguingly, a significant fraction of total kinase–substrate relationships may have been rewired during this period by the evolutionary forces of nonfunctionalization, neofunctionalization, and subfunctionalization, suggesting rapid adaption at this level (Amoutzias et al. 2010; Freschi et al. 2011). On average, 19% (1096/5884) of S. cerevisiae protein-coding genes are present in duplicate as a result of the WGD, whereas 23% (207/909) of the genes encoding metabolic proteins are WGD paralogs. For the metabolic enzymes that are phosphoproteins, this proportion increases to 28% (ALL:115/412) and 32% (HC: 64/197). All the above differences are statistically significant (p-value < 0.05), according to the hypergeometric test. This agrees with a previous observation, based on a smaller dataset, that phosphorylation is a factor that affects the survival of genes after WGD (Amoutzias et al. 2010).

Phosphorylation sites of metabolic proteins tend to be more conserved than average

Our comparative phosphoproteomic analysis reveals that 115 p-sites, conserved and phosphorylated in both S. cerevisiae and C. albicans, could regulate 72 metabolic proteins that in turn are involved in 271 reactions of the yeast 7.6 metabolic model (see File S2; spreadsheet “reactions-proteins”). The fraction of conserved phosphorylations between the two species that are involved in metabolism is higher than expected by chance (17%–115/692 vs. a background of 12%–1668/14339; hypergeometric test p-value < 2e−5), thus, revealing that p-sites of enzymes tend to be more conserved than the background p-sites. Phosphoproteomic data are not so abundant in other ascomycetes and the observed small overlap may also be attributed to missing data and experimental biases (Boekhorst et al. 2008, 2011). We have recently estimated the total yeast phosphoproteome at ∼40,000 p-sites (Vlastaridis et al. 2017). To control for this factor, the level of conservation of metabolic protein p-sites in other ascomycetes was also assessed, but only at the amino acid level. Genomic and evolutionary data, together with pairwise comparisons between two extant species or with ancestral sequence reconstruction, were used to infer the homologous amino acid of a yeast p-site in various common ancestors. If the yeast p-site was conserved as serine, threonine, or tyrosine in the inferred ascomycete ancestor, then the assumption was that the phosphorylation event was also present in that ancestor. Next, a comparison of the conservation at the amino acid level was performed for all the p-sites found in S. cerevisiae central metabolism vs. p-sites in the rest of the proteome (Figure 3). It is clear that the yeast p-sites that are found in metabolic proteins are more conserved than the p-sites in other proteins, and this difference is always statistically significant (Wilcoxon test p-value < 0.05), independent of the method/datasets used. In addition, as the evolutionary distance increases, so does the relative level of conservation of metabolic protein p-sites. Based on the ancestral sequence reconstruction analysis, it is estimated that 1257 budding yeast p-sites identified in 345 ORFs, which in turn are involved in 1003 reactions, could be conserved in the common ancestor between S. cerevisiae and C. albicans/Debaryomyces hansenii.

Figure 3.

Figure 3

Conservation of p-sites in various ascomycete ancestors. Conservation was inferred by two different methods: pairwise comparison of yeast with other extant species (red boxes in the phylogenetic tree) and ancestral sequence reconstruction. Dark blue bars show % conservation of all metabolic p-sites. Light blue bars show % conservation of all nonmetabolic p-sites. Red bars show % conservation of High Confidence (HC) metabolic p-sites. Orange bars show conservation of HC nonmetabolic p-sites.

Identification of p-sites in proteins that have a biotechnologically interesting phenotype related to metabolism and molecule production

The Saccharomyces Genome Database has mined and stored phenotypes caused by various gene perturbations, such gene over/underexpression or even gene deletion. We manually inspected the phenotypes and focused on the ones that, in our opinion, are biotechnologically interesting. These phenotype terms mapped to 850 proteins, of which 408 were phosphoproteins, harboring 2363 p-sites. These phosphoproteins were not all annotated as participating in metabolism. By applying a stringent criterion of HC p-sites situated within conserved domains, we identified 180 of them in 73 phosphoproteins. These findings are summarized in Table 2. Obviously, there exist a significant number of very good candidate p-sites that may regulate biotechnologically important phenotypes, especially those related to increased chemical compound excretion and increased respiratory growth. These candidates should be the initial targets of future studies, e.g., to examine the phenotypic impact of deleting specific p-sites. Due to the inherent technical and biological noise of phosphorylation data, prioritization of p-sites for detailed study is an important task (Beltrao et al. 2012; Xiao et al. 2016). Readers can perform their own customized prioritization on these data using File S1.

Table 2. Number of p-sites that regulate proteins with a biotechnologically interesting phenotype.

Phenotype Terms p-Sites/Proteins (ALL) p-Sites Within Domains/Proteins (ALL) p-Sites/Proteins (HC) p-Sites Within Domains/Proteins (HC)
Chemical compound excretion: increased 1497/248 284/189 564/147 109/43
Fermentative growth: increased 7/3 1/1 2/1 0/0
Fermentative metabolism: increased 85/10 10/6 38/10 3/3
Growth rate in exponential phase: increased 73/8 14/5 38/6 9/2
Nutrient uptake/utilization: increased 124/20 40/8 37/13 13/5
Respiratory growth: increased 416/75 116/41 170/46 43/18
Respiratory metabolism: increased 331/61 70/24 121/38 31/11
Utilization of carbon source: increased 36/8 9/5 16/4 5/2
Vegetative growth: increased 8/5 4/2 0/0 0/0
Viability: increased 67/17 16/9 24/9 2/2
ALL_RELATED_phenotypes 2363/408 496/183 887/247 180/73

p-site, phosphorylation site; ALL, all p-sites; HC, high confidence p-sites.

Structural simulations of selected phosphorylation sites in two essential metabolic proteins predict a significant impact of phosphorylation on function

Yeast p-sites identified in many experiments, within essential enzymes and also found conserved and phosphorylated in C. albicans, could have great potential not only for the manipulation of metabolism (and thus affect the growth rate of S. cerevisiae), but also for medical purposes related to other closely related pathogenic fungi. In order to quickly assess the importance of this p-site subset, computational structural analyses were performed on two selected enzymes as a case study. The first enzyme investigated was phosphoglycerate mutase 1 (Gpm1p), which mediates the conversion of 3-phosphoglycerate to 2-phosphoglycerate during glycolysis and the reverse reaction during gluconeogenesis (Heinisch et al. 1991). This enzyme has a very promising p-site at Ser116 that was found phosphorylated in 11 HTP experiments. Visual inspection of the crystal structure revealed that this p-site is close to the catalytic site (see Figure 4A). The second protein investigated was aspartyl-tRNA synthetase (Dps1p), an aminoacyl-tRNA synthetase responsible for the charging of tRNAAsp with its cognate amino acid (Sellami et al. 1985). Dps1p is a characteristic enzyme of a superfamily that is crucial for the fidelity of translation of the genetic code (Chaliotis et al. 2017). Dps1p harbors a very promising p-site at Ser301 that was found phosphorylated in 10 HTP experiments. Examination of the crystal structure of the Dps1p complex with tRNA revealed that this p-site is in direct contact with the substrate (see Figure 4B).

Figure 4.

Figure 4

Molecular representations of two p-sites examined with molecular dynamic simulations in (A) the yeast phosphoglycerate mutase (Gpm1p) and (B) aspartyl-tRNA (transfer RNA) synthetase (Dps1p). The X-ray crystal structures of the enzymes are illustrated with cartoons colored by secondary structure and the p-site serine residues are shown with spheres (green C, red O, and blue N atoms). Distances between the p-sites and the catalytic active sites are indicated with dashed lines between Ser116 and a sulfate ion in Gpm1p [ Protein Data Bank identifier (PDB ID): 5pgm], and between Ser301 and adenine-1 (A-1) of tRNA in Dps1p (PDB ID: 1asy). The red arrows indicate regions close to the active sites of the enzymes that display altered dynamics upon phosphorylation. (C and D) Plots of the atomic fluctuations of the backbone Cα atoms extracted from 100-ns MD simulations of the native and phosphorylated enzymes.

To examine the potential effect of phosphorylation at the enzyme sites described above, we employed a comparative MD study of each enzyme in its native and phosphorylated state. The simulation systems were based on the crystallographic coordinates of the yeast enzymes in the substrate-free forms (PDB IDs: 5pgm for Gpm1p and 1eov for Dps1p) (Rigden et al. 1999; Sauter et al. 2000). Our simulations indicate that phosphorylation at either Gpm1p-Ser116 or Dps1p-Ser301 can affect substrate binding, either directly or via perturbation of the structural dynamics in regions of the enzymes close to the active site (see Figure 4, C and D). Using their own criteria, readers can use File S1 to prioritize future structural simulations before proceeding to wet lab experiments. With the current datasets, there exist at least 36 p-sites in essential metabolic proteins that have been detected as phosphorylated in both species and need to be investigated with wet lab experiments.

In summary, the integration of HTP data from various genomic, proteomic, functional, and evolutionary sources has highlighted the pivotal role of protein phosphorylation in the control of yeast central metabolism, where almost half of the enzymes involved are phosphorylated. These phosphorylated enzymes, compared to the nonphosphorylated ones, are more abundant, have more protein–protein interactions, regulate more reactions, and a higher fraction of them are ubiquitinated. Furthermore, the p-sites of metabolic proteins are more conserved than the background p-sites. This analysis has also successfully identified and prioritized potential high-confidence p-sites that are likely to have a major impact on enzyme function and should be targets of biotechnological and medical importance. The crucial question in this new era of HTP and integrative science is whether the numerous top-priority targets identified in silico will be investigated by LTP validation studies or by highly automated robotic procedures (King et al. 2004, 2009).

Supplementary Material

Supplemental material is available online at www.g3journal.org/lookup/suppl/doi:10.1534/g3.116.037218/-/DC1.

Acknowledgments

G.D.A. acknowledges financial support from the “ARISTEIA II” Action of the “Operational Programme Education and Lifelong Learning” that is cofunded by the European Social Fund and National Resources (code 4288 to G.D.A.). S.G.O. acknowledges the University of Cambridge for the award of sabbatical leave that allowed him to work with G.D.A. at the University of Thessaly, Greece.

Footnotes

Communicating editor: B. J. Andrews

Literature Cited

  1. Aguiar M., Haas W., Beausoleil S. A., Rush J., Gygi S. P., 2010.  Gas-phase rearrangements do not affect site localization reliability in phosphoproteomics data sets. J. Proteome Res. 9: 3103–3107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Albuquerque C. P., Smolka M. B., Payne S. H., Bafna V., Eng J., et al. , 2008.  A multidimensional chromatography technology for in-depth phosphoproteome analysis. Mol. Cell. Proteomics 7: 1389–1396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Amoutzias G. D., He Y., Gordon J., Mossialos D., Oliver S. G., et al. , 2010.  Posttranslational regulation impacts the fate of duplicated genes. Proc. Natl. Acad. Sci. USA 107: 2967–2971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Amoutzias G. D., He Y., Lilley K. S., Van de Peer Y., Oliver S. G., 2012.  Evaluation and properties of the budding yeast phosphoproteome. Mol. Cell. Proteomics 11: M111.009555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Batada N. N., Reguly T., Breitkreutz A., Boucher L., Breitkreutz B.-J., et al. , 2006.  Stratus not altocumulus: a new view of the yeast protein interaction network. PLoS Biol. 4: e317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Belle A., Tanay A., Bitincka L., Shamir R., O’Shea E. K., 2006.  Quantification of protein half-lives in the budding yeast proteome. Proc. Natl. Acad. Sci. USA 103: 13004–13009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Beltrao P., Trinidad J. C., Fiedler D., Roguev A., Lim W. A., et al. , 2009.  Evolution of phosphoregulation: comparison of phosphorylation patterns across yeast species. PLoS Biol. 7: e1000134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Beltrao P., Albanèse V., Kenner L. R., Swaney D. L., Burlingame A., et al. , 2012.  Systematic functional prioritization of protein posttranslational modifications. Cell 150: 413–425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bodenmiller B., Mueller L. N., Mueller M., Domon B., Aebersold R., 2007.  Reproducible isolation of distinct, overlapping segments of the phosphoproteome. Nat. Methods 4: 231–237. [DOI] [PubMed] [Google Scholar]
  10. Bodenmiller B., Campbell D., Gerrits B., Lam H., Jovanovic M., et al. , 2008.  PhosphoPep–a database of protein phosphorylation sites in model organisms. Nat. Biotechnol. 26: 1339–1340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bodenmiller B., Wanka S., Kraft C., Urban J., Campbell D., et al. , 2010.  Phosphoproteomic analysis reveals interconnected system-wide responses to perturbations of kinases and phosphatases in yeast. Sci. Signal. 3: rs4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Boekhorst J., van Breukelen B., Heck A., Snel B., 2008.  Comparative phosphoproteomics reveals evolutionary and functional conservation of phosphorylation across eukaryotes. Genome Biol. 9: R144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Boekhorst J., Boersema P. J., Tops B. B. J., van Breukelen B., Heck A. J. R., et al. , 2011.  Evaluating experimental bias and completeness in comparative phosphoproteomics analysis. PLoS One 6: e23276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bouwman J., Kiewiet J., Lindenbergh A., van Eunen K., Siderius M., et al. , 2011.  Metabolic regulation rather than de novo enzyme synthesis dominates the osmo-adaptation of yeast. Yeast 28: 43–53. [DOI] [PubMed] [Google Scholar]
  15. Byrne K. P., Wolfe K. H., 2005.  The yeast gene order browser: combining curated homology and syntenic context reveals gene fate in polyploid species. Genome Res. 15: 1456–1461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Carpy A., Krug K., Graf S., Koch A., Popic S., et al. , 2014.  Absolute proteome and phosphoproteome dynamics during the cell cycle of Schizosaccharomyces pombe (fission yeast). Mol. Cell. Proteomics 13: 1925–1936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Case D. A., Cheatham T. E., Darden T., Gohlke H., Luo R., et al. , 2005.  The Amber biomolecular simulation programs. J. Comput. Chem. 26: 1668–1688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Castrillo J. I., Zeef L. A., Hoyle D. C., Zhang N., Hayes A., et al. , 2007.  Growth control of the eukaryote cell: a systems biology study in yeast. J. Biol. 6: 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chaliotis A., Vlastaridis P., Mossialos D., Ibba M., Becker H. D., et al. , 2017.  The complex evolutionary history of aminoacyl-tRNA synthetases. Nucleic Acids Res. 45: 1059–1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Chen Y., Nielsen J., 2016.  Flux control through protein phosphorylation in yeast. FEMS Yeast Res. DOI: 10.1093/femsyr/fow096. [DOI] [PubMed] [Google Scholar]
  21. Cherry J. M., Hong E. L., Amundsen C., Balakrishnan R., Binkley G., et al. , 2012.  Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 40: D700–D705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Chi A., Huttenhower C., Geer L. Y., Coon J. J., Syka J. E. P., et al. , 2007.  Analysis of phosphorylation sites on proteins from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry. Proc. Natl. Acad. Sci. USA 104: 2193–2198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Christiano R., Nagaraj N., Fröhlich F., Walther T. C., 2014.  Global proteome turnover analyses of the yeasts S. cerevisiae and S. pombe. Cell Rep. 9: 1959–1965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Conant G. C., 2014.  Comparative genomics as a time machine: how relative gene dosage and metabolic requirements shaped the time-dependent resolution of yeast polyploidy. Mol. Biol. Evol. 31: 3184–3193. [DOI] [PubMed] [Google Scholar]
  25. Conant G. C., Wolfe K. H., 2007.  Increased glycolytic flux as an outcome of whole-genome duplication in yeast. Mol. Syst. Biol. 3: 129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Costanzo M., Baryshnikova A., Bellay J., Kim Y., Spear E. D., et al. , 2010.  The genetic landscape of a cell. Science 327: 425–431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Dietrich F. S., Voegeli S., Brachat S., Lerch A., Gates K., et al. , 2004.  The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome. Science 304: 304–307. [DOI] [PubMed] [Google Scholar]
  28. Dikicioglu D., Kırdar B., Oliver S. G., 2015.  Biomass composition: the “elephant in the room” of metabolic modelling. Metabolomics 11: 1690–1701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Dobson P. D., Smallbone K., Jameson D., Simeonidis E., Lanthaler K., et al. , 2010.  Further developments towards a genome-scale metabolic model of yeast. BMC Syst. Biol. 4: 145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Freschi L., Courcelles M., Thibault P., Michnick S. W., Landry C. R., 2011.  Phosphorylation network rewiring by gene duplication. Mol. Syst. Biol. 7: 504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Freschi L., Osseni M., Landry C. R., 2014.  Functional divergence and evolutionary turnover in mammalian phosphoproteomes. PLoS Genet. 10: e1004062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Galanie S., Thodey K., Trenchard I. J., Filsinger Interrante M., Smolke C. D., 2015.  Complete biosynthesis of opioids in yeast. Science 349: 1095–1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Ghaemmaghami S., Huh W.-K., Bower K., Howson R. W., Belle A., et al. , 2003.  Global analysis of protein expression in yeast. Nature 425: 737–741. [DOI] [PubMed] [Google Scholar]
  34. Giaever G., Chu A. M., Ni L., Connelly C., Riles L., et al. , 2002.  Functional profiling of the Saccharomyces cerevisiae genome. Nature 418: 387–391. [DOI] [PubMed] [Google Scholar]
  35. Gnad F., de Godoy L. M. F., Cox J., Neuhauser N., Ren S., et al. , 2009.  High-accuracy identification and bioinformatic analysis of in vivo protein phosphorylation sites in yeast. Proteomics 9: 4642–4652. [DOI] [PubMed] [Google Scholar]
  36. Goffeau A., Barrell B. G., Bussey H., Davis R. W., Dujon B., et al. , 1996.  Life with 6000 genes. Science 274(546): 563–567. [DOI] [PubMed] [Google Scholar]
  37. Greenbaum D., Colangelo C., Williams K., Gerstein M., 2003.  Comparing protein abundance and mRNA expression levels on a genomic scale. Genome Biol. 4: 117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Gruhler A., Olsen J. V., Mohammed S., Mortensen P., Faergeman N. J., et al. , 2005.  Quantitative phosphoproteomics applied to the yeast pheromone signaling pathway. Mol. Cell. Proteomics 4: 310–327. [DOI] [PubMed] [Google Scholar]
  39. Gygi S. P., Rochon Y., Franza B. R., Aebersold R., 1999.  Correlation between protein and mRNA abundance in yeast. Mol. Cell. Biol. 19: 1720–1730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hedges S. B., Marin J., Suleski M., Paymer M., Kumar S., 2015.  Tree of life reveals clock-like speciation and diversification. Mol. Biol. Evol. 32: 835–845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Heinisch J., von Borstel R. C., Rodicio R., 1991.  Sequence and localization of the gene encoding yeast phosphoglycerate mutase. Curr. Genet. 20: 167–171. [DOI] [PubMed] [Google Scholar]
  42. Herrgård M. J., Swainston N., Dobson P., Dunn W. B., Arga K. Y., et al. , 2008.  A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology. Nat. Biotechnol. 26: 1155–1160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Holt L. J., Tuch B. B., Villén J., Johnson A. D., Gygi S. P., et al. , 2009.  Global analysis of Cdk1 substrate phosphorylation sites provides insights into evolution. Science 325: 1682–1686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Homeyer N., Horn A. H. C., Lanig H., Sticht H., 2006.  AMBER force-field parameters for phosphorylated amino acids in different protonation states: phosphoserine, phosphothreonine, phosphotyrosine, and phosphohistidine. J. Mol. Model. 12: 281–289. [DOI] [PubMed] [Google Scholar]
  45. Hornak V., Abel R., Okur A., Strockbine B., Roitberg A., et al. , 2006.  Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins 65: 712–725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Huber A., Bodenmiller B., Uotila A., Stahl M., Wanka S., et al. , 2009.  Characterization of the rapamycin-sensitive phosphoproteome reveals that Sch9 is a central coordinator of protein synthesis. Genes Dev. 23: 1929–1943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Humphrey W., Dalke A., Schulten K., 1996.  VMD: visual molecular dynamics. J. Mol. Graph. 14: 33–38. [DOI] [PubMed] [Google Scholar]
  48. Hunter T., Plowman G. D., 1997.  The protein kinases of budding yeast: six score and more. Trends Biochem. Sci. 22: 18–22. [DOI] [PubMed] [Google Scholar]
  49. Iakoucheva L. M., Radivojac P., Brown C. J., O’Connor T. R., Sikes J. G., et al. , 2004.  The importance of intrinsic disorder for protein phosphorylation. Nucleic Acids Res. 32: 1037–1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kanehisa M., Goto S., Sato Y., Furumichi M., Tanabe M., 2012.  KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40: D109–D114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Kellis M., Birren B. W., Lander E. S., 2004.  Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428: 617–624. [DOI] [PubMed] [Google Scholar]
  52. King R. D., Whelan K. E., Jones F. M., Reiser P. G. K., Bryant C. H., et al. , 2004.  Functional genomic hypothesis generation and experimentation by a robot scientist. Nature 427: 247–252. [DOI] [PubMed] [Google Scholar]
  53. King R. D., Rowland J., Oliver S. G., Young M., Aubrey W., et al. , 2009.  The automation of science. Science 324: 85–89. [DOI] [PubMed] [Google Scholar]
  54. Kochanowski K., Sauer U., Chubukov V., 2013.  Somewhat in control–the role of transcription in regulating microbial metabolic fluxes. Curr. Opin. Biotechnol. 24: 987–993. [DOI] [PubMed] [Google Scholar]
  55. Krebs E. G., Stull J. T., 1975.  Protein phosphorylation and metabolic control. Ciba Found. Symp. 31: 355–367. [DOI] [PubMed] [Google Scholar]
  56. Kumar S., Stecher G., Tamura K., 2016.  MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33: 1870–1874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Landry C. R., Levy E. D., Michnick S. W., 2009.  Weak functional constraints on phosphoproteomes. Trends Genet. 25: 193–197. [DOI] [PubMed] [Google Scholar]
  58. Landry C. R., Freschi L., Zarin T., Moses A. M., 2014.  Turnover of protein phosphorylation evolving under stabilizing selection. Front. Genet. 5: 245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Lee D. C. H., Jones A. R., Hubbard S. J., 2015.  Computational phosphoproteomics: from identification to localization. Proteomics 15: 950–963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Lee J., Reiter W., Dohnal I., Gregori C., Beese-Sims S., et al. , 2013.  MAPK Hog1 closes the S. cerevisiae glycerol channel Fps1 by phosphorylating and displacing its positive regulators. Genes Dev. 27: 2590–2601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Levy E. D., Michnick S. W., Landry C. R., 2012.  Protein abundance is key to distinguish promiscuous from functional phosphorylation based on evolutionary information. Philos. Trans. R. Soc. Lond. B Biol. Sci. 367: 2594–2606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Li X., Gerber S. A., Rudner A. D., Beausoleil S. A., Haas W., et al. , 2007.  Large-scale phosphorylation analysis of alpha-factor-arrested Saccharomyces cerevisiae. J. Proteome Res. 6: 1190–1197. [DOI] [PubMed] [Google Scholar]
  63. Lienhard G. E., 2008.  Non-functional phosphorylations? Trends Biochem. Sci. 33: 351–352. [DOI] [PubMed] [Google Scholar]
  64. Maere S., Heymans K., Kuiper M., 2005.  BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21: 3448–3449. [DOI] [PubMed] [Google Scholar]
  65. Maguire S. L., ÓhÉigeartaigh S. S., Byrne K. P., Schröder M. S., O’Gaora P., et al. , 2013.  Comparative genome analysis and gene finding in Candida species using CGOB. Mol. Biol. Evol. 30: 1281–1291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Manning G., Plowman G. D., Hunter T., Sudarsanam S., 2002.  Evolution of protein kinase signaling from yeast to man. Trends Biochem. Sci. 27: 514–520. [DOI] [PubMed] [Google Scholar]
  67. Mascaraque V., Hernáez M. L., Jiménez-Sánchez M., Hansen R., Gil C., et al. , 2013.  Phosphoproteomic analysis of protein kinase C signaling in Saccharomyces cerevisiae reveals Slt2 mitogen-activated protein kinase (MAPK)-dependent phosphorylation of eisosome core components. Mol. Cell. Proteomics 12: 557–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Moses A. M., Hériché J.-K., Durbin R., 2007.  Clustering of phosphorylation site recognition motifs can be exploited to predict the targets of cyclin-dependent kinase. Genome Biol. 8: R23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Newman J. R. S., Ghaemmaghami S., Ihmels J., Breslow D. K., Noble M., et al. , 2006.  Single-cell proteomic analysis of S. cerevisiae reveals the architecture of biological noise. Nature 441: 840–846. [DOI] [PubMed] [Google Scholar]
  70. Nielsen J., 2015.  Yeast cell factories on the horizon. Science 349: 1050–1051. [DOI] [PubMed] [Google Scholar]
  71. Nishi H., Shaytan A., Panchenko A. R., 2014.  Physicochemical mechanisms of protein regulation by phosphorylation. Front. Genet. 5: 270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Oliveira A. P., Sauer U., 2012.  The importance of post-translational modifications in regulating Saccharomyces cerevisiae metabolism. FEMS Yeast Res. 12: 104–117. [DOI] [PubMed] [Google Scholar]
  73. Oliveira A. P., Ludwig C., Picotti P., Kogadeeva M., Aebersold R., et al. , 2012.  Regulation of yeast central metabolism by enzyme phosphorylation. Mol. Syst. Biol. 8: 623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Orth J. D., Thiele I., Palsson B. Ø., 2010.  What is flux balance analysis? Nat. Biotechnol. 28: 245–248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Pache R. A., Babu M. M., Aloy P., 2009.  Exploiting gene deletion fitness effects in yeast to understand the modular architecture of protein complexes under different growth conditions. BMC Syst. Biol. 3: 74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Papp B., Pál C., Hurst L. D., 2004.  Metabolic network analysis of the causes and evolution of enzyme dispensability in yeast. Nature 429: 661–664. [DOI] [PubMed] [Google Scholar]
  77. Pearson W. R., 2000.  Flexible sequence similarity searching with the FASTA3 program package. Methods Mol. Biol. 132: 185–219. [DOI] [PubMed] [Google Scholar]
  78. Peng J., Schwartz D., Elias J. E., Thoreen C. C., Cheng D., et al. , 2003.  A proteomics approach to understanding protein ubiquitination. Nat. Biotechnol. 21: 921–926. [DOI] [PubMed] [Google Scholar]
  79. Pir P., Gutteridge A., Wu J., Rash B., Kell D. B., et al. , 2012.  The genetic control of growth rate: a systems biology study in yeast. BMC Syst. Biol. 6: 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Pöhlmann R., Philippsen P., 1996.  Sequencing a cosmid clone of Saccharomyces cerevisiae chromosome XIV reveals 12 new open reading frames (ORFs) and an ancient duplication of six ORFs. Yeast 12: 391–402. [DOI] [PubMed] [Google Scholar]
  81. Ptacek J., Devgan G., Michaud G., Zhu H., Zhu X., et al. , 2005.  Global analysis of protein phosphorylation in yeast. Nature 438: 679–684. [DOI] [PubMed] [Google Scholar]
  82. R Core Team , 2015.  R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
  83. Ralser M., Wamelink M. M. C., Latkolik S., Jansen E. E. W., Lehrach H., et al. , 2009.  Metabolic reconfiguration precedes transcriptional regulation in the antioxidant response. Nat. Biotechnol. 27: 604–605. [DOI] [PubMed] [Google Scholar]
  84. Rigden D. J., Walter R. A., Phillips S. E. V., Fothergill-Gilmore L. A., 1999.  Sulphate ions observed in the 2.12 Å structure of a new crystal form of S. cerevisiae phosphoglycerate mutase provide insights into understanding the catalytic mechanism. J. Mol. Biol. 286: 1507–1517. [DOI] [PubMed] [Google Scholar]
  85. Roe D. R., Cheatham T. E., 2013.  PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 9: 3084–3095. [DOI] [PubMed] [Google Scholar]
  86. Rose P. W., Prlić A., Bi C., Bluhm W. F., Christie C. H., et al. , 2015.  The RCSB protein data bank: views of structural biology for basic and applied research and education. Nucleic Acids Res. 43: D345–D356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Sadowski I., Breitkreutz B.-J., Stark C., Su T.-C., Dahabieh M., et al. , 2013.  The PhosphoGRID Saccharomyces cerevisiae protein phosphorylation site database: version 2.0 update. Database (Oxford) 2013: bat026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Saleem R. A., Rogers R. S., Ratushny A. V., Dilworth D. J., Shannon P. T., et al. , 2010.  Integrated phosphoproteomics analysis of a signaling network governing nutrient response and peroxisome induction. Mol. Cell. Proteomics 9: 2076–2088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Salomon-Ferrer R., Götz A. W., Poole D., Le Grand S., Walker R. C., 2013.  Routine microsecond molecular dynamics simulations with AMBER on GPUs. 2. Explicit solvent particle mesh Ewald. J. Chem. Theory Comput. 9: 3878–3888. [DOI] [PubMed] [Google Scholar]
  90. Sauter C., Lorber B., Cavarelli J., Moras D., Giegé R., 2000.  The free yeast aspartyl-tRNA synthetase differs from the tRNAAsp-complexed enzyme by structural changes in the catalytic site, hinge region, and anticodon-binding domain. J. Mol. Biol. 299: 1313–1324. [DOI] [PubMed] [Google Scholar]
  91. Scannell D. R., Frank A. C., Conant G. C., Byrne K. P., Woolfit M., et al. , 2007.  Independent sorting-out of thousands of duplicated gene pairs in two yeast species descended from a whole-genome duplication. Proc. Natl. Acad. Sci. USA 104: 8397–8402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Schulz J. C., Zampieri M., Wanka S., von Mering C., Sauer U., 2014.  Large-scale functional analysis of the roles of phosphorylation in yeast metabolic pathways. Sci. Signal. 7: rs6. [DOI] [PubMed] [Google Scholar]
  93. Schwanhäusser B., Busse D., Li N., Dittmar G., Schuchhardt J., et al. , 2011.  Global quantification of mammalian gene expression control. Nature 473: 337–342. [DOI] [PubMed] [Google Scholar]
  94. Sellami M., Prévost G., Bonnet J., Dirheimer G., Gangloff J., 1985.  Isolation and characterization of the yeast aspartyl-tRNA synthetase gene. Gene 40: 349–352. [DOI] [PubMed] [Google Scholar]
  95. Shou C., Bhardwaj N., Lam H. Y. K., Yan K.-K., Kim P. M., et al. , 2011.  Measuring the evolutionary rewiring of biological networks. PLOS Comput. Biol. 7: e1001050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Soufi B., Kelstrup C. D., Stoehr G., Fröhlich F., Walther T. C., et al. , 2009.  Global analysis of the yeast osmotic stress response by quantitative proteomics. Mol. Biosyst. 5: 1337–1346. [DOI] [PubMed] [Google Scholar]
  97. Steinmetz L. M., Scharfe C., Deutschbauer A. M., Mokranjac D., Herman Z. S., et al. , 2002.  Systematic screen for human disease genes in yeast. Nat. Genet. 31: 400–404. [DOI] [PubMed] [Google Scholar]
  98. Stern D. F., Zheng P., Beidler D. R., Zerillo C., 1991.  Spk1, a new kinase from Saccharomyces cerevisiae, phosphorylates proteins on serine, threonine, and tyrosine. Mol. Cell. Biol. 11: 987–1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Studer R. A., Rodriguez-Mias R. A., Haas K. M., Hsu J. I., Viéitez C., et al. , 2016.  Evolution of protein phosphorylation across 18 fungal species. Science 354: 229–232. [DOI] [PubMed] [Google Scholar]
  100. Szappanos B., Kovács K., Szamecz B., Honti F., Costanzo M., et al. , 2011.  An integrated approach to characterize genetic interaction networks in yeast metabolism. Nat. Genet. 43: 656–662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Szczebara F. M., Chandelier C., Villeret C., Masurel A., Bourot S., et al. , 2003.  Total biosynthesis of hydrocortisone from a simple carbon source in yeast. Nat. Biotechnol. 21: 143–149. [DOI] [PubMed] [Google Scholar]
  102. Tripodi F., Nicastro R., Reghellin V., Coccetti P., 2015.  Post-translational modifications on yeast carbon metabolism: regulatory mechanisms beyond transcriptional control. Biochim. Biophys. Acta 1850: 620–627. [DOI] [PubMed] [Google Scholar]
  103. Vlastaridis P., Oliver S. G., Van de Peer Y., Amoutzias G. D., 2016.  The challenges of interpreting phosphoproteomics data: a critical view through the bioinformatics lens, pp. 196–204 in Computational Intelligence Methods for Bioinformatics and Biostatistics, edited by Angelini C., Rancoita P. M., Rovetta S. Springer International Publishing, Cham. [Google Scholar]
  104. Vlastaridis P., Kyriakidou P., Chaliotis A., Van de Peer Y., Oliver S. G., et al. , 2017.  Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes. GigaScience. DOI: 10.1093/gigascience/giw015https://doi.org/10.1093/gigascience/giw015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Wapinski I., Pfeffer A., Friedman N., Regev A., 2007.  Natural history and evolutionary principles of gene duplication in fungi. Nature 449: 54–61. [DOI] [PubMed] [Google Scholar]
  106. Weinert B. T., Iesmantavicius V., Moustafa T., Schölz C., Wagner S. A., et al. , 2014.  Acetylation dynamics and stoichiometry in Saccharomyces cerevisiae. Mol. Syst. Biol. 10: 716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Willger S. D., Liu Z., Olarte R. A., Adamo M. E., Stajich J. E., et al. , 2015.  Analysis of the Candida albicans phosphoproteome. Eukaryot. Cell 14: 474–485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Wolfe K. H., Shields D. C., 1997.  Molecular evidence for an ancient duplication of the entire yeast genome. Nature 387: 708–713. [DOI] [PubMed] [Google Scholar]
  109. Wu R., Dephoure N., Haas W., Huttlin E. L., Zhai B., et al. , 2011.  Correct interpretation of comprehensive phosphorylation dynamics requires normalization by protein expression changes. Mol. Cell. Proteomics 10: M111.009654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Xiao Q., Miao B., Bi J., Wang Z., Li Y., 2016.  Prioritizing functional phosphorylation sites based on multiple feature integration. Sci. Rep. 6: 24735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Zhu H., Klemic J. F., Chang S., Bertone P., Casamayor A., et al. , 2000.  Analysis of yeast protein kinases using protein chips. Nat. Genet. 26: 283–289. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article.


Articles from G3: Genes|Genomes|Genetics are provided here courtesy of Oxford University Press

RESOURCES