Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2010 Apr 27.
Published in final edited form as: Nature. 2009 Jun 17;460(7252):274–277. doi: 10.1038/nature08102

Unlimited multistability in multisite phosphorylation systems

Matthew Thomson 1, Jeremy Gunawardena 2
PMCID: PMC2859978  NIHMSID: NIHMS184849  PMID: 19536158

Abstract

Reversible phosphorylation on serine, threonine and tyrosine is the most widely studied posttranslational modification of proteins1,2. The number of phosphorylated sites on a protein (n) shows a significant increase from prokaryotes, with n≤7 sites, to eukaryotes, with examples having n≥150 sites3. Multisite phosphorylation has many roles4,5 and site conservation indicates that increasing numbers of sites cannot be due merely to promiscuous phosphorylation. A substrate with n sites has an exponential number (2n) of phospho-forms and individual phospho-forms may have distinct biological effects6,7. The distribution of these phospho-forms and how this distribution is regulated have remained unknown. Here we show that, when kinase and phosphatase act in opposition on a multisite substrate, the system can exhibit distinct stable phospho-form distributions at steady state and that the maximum number of such distributions increases with n. Whereas some stable distributions are focused on a single phospho-form, others are more diffuse, giving the phosphoproteome the potential to behave as a fluid regulatory network able to encode information and flexibly respond to varying demands. Such plasticity may underlie complex information processing in eukaryotic cells8 and suggests a functional advantage in having many sites. Our results follow from the unusual geometry of the steady-state phospho-form concentrations, which we show to constitute a rational algebraic curve, irrespective of n. We thereby reduce the complexity of calculating steady states from simulating 3×2n differential equations to solving two algebraic equations, while treating parameters symbolically. We anticipate that these methods can be extended to systems with multiple substrates and multiple enzymes catalysing different modifications, as found in posttranslational modification ‘codes’9 such as the histone code10,11. Whereas simulations struggle with exponentially increasing molecular complexity, mathematical methods of the kind developed here can provide a new language in which to articulate the principles of cellular information processing12.

A major difficulty in studying multisite phosphorylation from a systems perspective has been the lack of information regarding when different sites on the same protein are simultaneously phosphorylated13. Sites may be intricately dependent on each other14 and both the number and the position of phosphates can affect biological outcome6,7. The phospho-form distribution—the relative stoichiometries of each of the 2n phospho-forms—is thus the appropriate measure of phosphorylation state for a multisite substrate. Such distributions are starting to be measured15,16, prompted by interest in posttranslational modification (PTM) codes. A theoretical understanding will improve our ability to interpret such data, uncover biological principles and design appropriate experiments. Although we focus on phosphorylation, we hope to lay a foundation for analysing multiple PTMs.

Figure 1 summarizes a general model of multisite phosphorylation. A substrate S, with n sites, is acted on by a kinase E and a phosphatase F. Substrates may have multiple kinases and phosphatases in vivo but a single enzyme often addresses multiple sites and we focus here on the minimal enzymatic machinery needed for any n. Each enzyme may act distributively (Fig. 1a) as well as processively (Fig. 1b) using a standard biochemical scheme (Fig. 1c) and arbitrary preferences for site order. These assumptions are more general than in previous models17-22: by choosing the details appropriately, any kinase, phosphatase and substrate system can be represented. ATP is assumed to be recharged by some process that is not modelled. It is therefore not treated as a variable but its effect is absorbed into the site-specific parameters (auX,buX,cu,vX defined in Fig. 1c). Phosphorylation and dephosphorylation are assumed to take place on a fast time scale in comparison to synthesis and degradation of the component proteins. The model is, therefore, effectively closed: there is no flux of material through it and the total amounts of substrate, Stot, and enzymes, Etot and Ftot, remain constant at all times. With mass-action kinetics, these assumptions give rise to 3×2n nonlinear differential equations for the state variables (Fig. 1d).

Figure 1. General model of multisite phosphorylation with substrate S having n sites, kinase E and phosphatase F.

Figure 1

The phospho-forms are denoted Su, where u is a bit string indicating the absence/presence (0/1) of phosphate. Kinase reactions for n=3 with one phosphorylation per reaction (a, distributivity29) and multiple phosphorylations per reaction (b, processivity29). Phosphatases act similarly in removing phosphates. c, Each enzyme (X=E or F) uses a standard biochemical mechanism30 but may form multiple products, with associated parameters (auX,buX,cu,vX) for mass-action kinetics. ATP is assumed to be held constant and synthesis and degradation are ignored. d, The 3×2n differential equations, where uXv signifies that X converts Su to Sv. The same symbol is used for a chemical species and for its concentration. Note that X(Su) here indicates the product of X and Su.

With limited information on site-specific parameters, numerical simulations can be undertaken for randomly selected parameter values in an attempt to discern typical behaviours. However, doing so in a state space of dimension 3×2n rapidly becomes intractable as n increases. Here we introduce a new method of analysis, which allows strong conclusions to be drawn about steady states without having to specify parameter values in advance. Experimental evidence indicates that biological systems attain quasi-steady states in vivo23,24, including systems in which multisite phosphorylation has a significant role25. While the steady-state assumption must be confirmed in each experimental context, it has been widely used in modelling multisite phosphorylation17-22.

At steady state, the second equation in Fig. 1d yields an expression for XSu in terms of X and Su. Substituting this into the first equation in Fig. 1d makes that equation linear in Su. The coefficients are algebraic expressions in the site-specific parameters (collectively denoted a) and the auxiliary parameter t=E/F, the steady-state ratio of free kinase to free phosphatase. These expressions may be regarded as elements of a set of coefficients, ℝ(a, t), in which the a values and the t have been adjoined to the ordinary numbers, ℝ, as uninterpreted symbols that can be added, subtracted, multiplied and divided as if they were numbers (see the Supplementary Information for more details). The elements of ℝ(a, t) correspond to rational functions, or ratios of polynomials, in these parameters (Supplementary Information). By treating the parameters symbolically in this way, they can be used in calculations without their numerical values being known in advance.

The linearized equations for the Su can be solved by Gaussian elimination, which works as well over the coefficients ℝ(a, t) as over ℝ (Supplementary Information). The steady-state phospho-forms can thereby be shown to satisfy (Supplementary Information)

Su=S0ru(t) (1)

where S0 is the unphosphorylated phospho-form and ru(t) is a rational function of t with coefficients in ℝ(a). Although these rational functions are complex, they can be explicitly calculated for any given model (Supplementary Information).

If numerical values are to be given to the site-specific parameters, it is important to know that the rational functions ru(t) remain well defined. For instance, the rational function c/(1 – c) becomes undefined when c=1. We show that, for any positive site-specific parameter values and positive t, ru(t) is always well defined and positive (‘positivity’, Supplementary Information), ruling out such problems.

Equation (1) implies that the steady-state phospho-forms can be described through a single auxiliary variable, t, so that they form a one-dimensional geometric object, or ‘curve’. Despite increasing numbers of sites, and the exponentially increasing size of the model, the steady-state phospho-forms always remain a curve, providing the basis for an exponential reduction in complexity. What changes with n is the extent to which the curve undulates, which determines how many steady states can co-exist for given amounts of substrate and enzymes (see below). Not all curves can be described by rational functions; those that can are of considerable geometric interest, as explored in an earlier paper26 and discussed further in the Supplementary Information.

The upshot of equation (1) is that, at steady state, the 3×2n state variables are determined by S0, E and F. Because the substrate and enzyme totals remain constant, we can formulate three equations for these three unknowns. The equation for S0 can be solved directly in terms of Stot, leaving a pair of equations (defined in the Supplementary Information):

Φ1(E,F)=EtotandΦ2(E,F)=Ftot (2)

which determine the steady-state E and F values corresponding to any given substrate and enzyme totals. Equation (2) exactly characterizes the steady states of the model (Supplementary Information). To find steady states, it is no longer necessary to numerically simulate 3×2n differential equations; this can be done by only solving two algebraic equations. The complexity arising from the dynamics has been distilled away. This exponential reduction of complexity is the key to what follows.

Figure 2a shows an example with four sites and five steady states. (We assume sequentiality, as in Fig. 3a, but merely for convenience.) Only stable states are detected experimentally, or found by numerical simulation, and this example is tristable; the corresponding stable phospho-form distributions are markedly distinct. Whereas distributions 1 and 3 are each focused on a single phospho-form, distribution 2 is broader. With multiple stable states a system can encode many outcomes, or several bits of information, enabling complex information encoding and processing8. Such multistability is believed to underlie cellular differentiation and other decisions8,27 but experimental examples have, so far, only demonstrated bistability23-25. Bistability in two-site phosphorylation was previously shown by modelling20.

Figure 2. Multistability for an n=4 distributive, sequential system, as in Fig. 3a.

Figure 2

For parameter values, see Supplementary Table 1. a, Plots of equation (2) for Stot=10 μM and Etot=Ftot=2.8 μM. The intersections correspond to the steady states. Filled squares are stable: 1 (red), 2 (black) and 3 (blue); open squares are unstable. Stability was determined by standard methods (Supplementary Information). The inset shows the corresponding phospho-form distributions, following the notation in Fig. 3a. b, Time courses of S4 reaching its three stable values from initial conditions S0Stot, S4=(1 – α)Stot and X=0 for all other variables, with α chosen randomly from the uniform distribution on [0, 1] (100 samples), determined by simulation (Supplementary Information).

Figure 3. Multistability by kinetic trapping.

Figure 3

a, A distributive, sequential system with n=4; E phosphorylates in order, F dephosphorylates in reverse order. The phospho-forms are denoted Si, i=0,…, 4, where i is the number of phosphates. b, Rate functions for production of S1 from S0 by E and of S0 from S1 by F are approximated as Michaelis–Menten hyperbolas30, with the phosphatase curve to the left and above the kinase curve. If the system is initiated with substrate entirely in S0 and in excess over both enzymes and saturating them, S0 will sequester E and produce S1 at nearly maximal rate (point 1). F, however, will be unoccupied, so that as S1 increases (grey arrow), phosphorylation and dephosphorylation will balance (point 2). Any leak of S1 into S2 can be opposed by F, which is not sequestered. The system will hence reach steady state with substrate remaining predominantly unphosphorylated. If similar conditions are applied to S3 and S4, but reversed with respect to E and F, then S4 can be similarly trapped. The parameter values required for this argument are discussed in the Supplementary Information.

It remained unknown how the number of stable states depends on n. We found that multistability tends to occur when substrate is in excess over enzymes (as in Fig. 2a). In this approximation, the two equations for E and F in equation (2) can be reduced to one polynomial equation for t=E/F (Supplementary Information):

P(t)=αNtN+αN1tN1++α1t+α0=0 (3)

where ai ∈ ℝ(a) and N lies between n+1 and 2n depending on the model. Positivity of the ru(t) is essential here. Positive solutions of P(t)=0 correspond approximately to steady states. We show that the discrepancy between the exact steady states found by equation (2) and the approximate ones found by equation (3) can be made as small as desired by increasing the excess of Stot over Etot and Ftot (Supplementary Information).

The advantage of equation (3) over equation (2) is that we can readily construct solutions of the former. If n is even and we choose any n+1 distinct positive numbers, then we can always find a model for which the corresponding P(t) has these numbers as solutions (Supplementary Information). Moreover, Stot can be chosen arbitrarily, so that the approximation of equation (2) by equation (3) is as accurate as desired. It follows that the corresponding model has n+1 steady states. The example in Fig. 2a meets this bound for n=4. If n is odd the same can be achieved for any set of n distinct positive numbers (Supplementary Information). Note that odd numbers of steady states are constructed in both cases.

This is also what we find in simulations. Furthermore, if the steady states are ordered by their corresponding E/F values then unstable states always occur between stable ones (Supplementary Information), as in Fig. 2a. Hence, if there are 2k+1 steady states, k+1 of them are stable. It follows that a model with n sites can have as many as ⌊(n+2)/2⌋ stable states (⌊x⌋ being the greatest integer not greater than x). We see that the tristability in Fig. 2a is only the tip of the iceberg: the maximum number of stable states increases with increasing numbers of sites.

Experimental detection of stable states requires an understanding of how they arise dynamically. Figure 3b considers the inter-conversion of S0 and S1, for the sequential system in Fig. 3a. An informal argument based on the Michaelis–Menten approximation suggests that, for suitable parameter values, if the system is started entirely in S0 then substrate can remain trapped predominantly in that state, as in distribution 1 in Fig. 2a. A similar argument applies to trapping of S4, as in distribution 3. Simulations show that all three stable distributions can be reached by starting from suitable mixtures of S0 and S4 (Fig. 2b).

In vivo, the most likely way that a phosphorylation system is regulated is by modulating its enzymes. Because the enzymes have been assumed to be in their active states, this corresponds to altering Etot or Ftot. Figure 4a shows, for the tristable system in Fig. 2, that changes in Etot can switch the system between stable states 1 and 3. It is possible that more complex modulations, involving both Etot and Ftot, could access all three stable states or that different parameter values could facilitate additional switching capability. Figure 4b shows another option, in which changes in Etot can switch between three stable states even though there is only a narrow window of Etot values for which three stable states coexist. In other words, the system may not need to exhibit robust tristability to have access to three stable states.

Figure 4. Switching between stable states.

Figure 4

a, The system in Fig. 2a with Etot=Ftot=2.8 μM and Stot=10 μM is taken in a cycle (grey arrows) using simulation (Supplementary Information). The free kinase is repeatedly changed by a small amount and the system allowed to relax back to a stable state. Starting on the lower branch, corresponding to distribution 1 in Fig. 2a, the system switches abruptly to the higher branch (distribution 3), remains on that branch as Etot is lowered, and then switches abruptly back down to the lower branch (hysteresis). b, The system with Etot=Ftot=2.8 μM, Stot=5 μM has only two stable states (not shown) but occupies three when Etot is cycled. Three stable states only coexist in a narrow window around the dotted line. The solid lines mark Etot=2.8 μM.

DNA provides a static, structural mechanism for encoding information at a capacity of 2 bits per base pair. The idea that PTMs provide a dynamic mechanism for information encoding has been broadly influential10,11 but the mechanistic details remain a matter of debate9,28 and no estimate of information capacities has emerged for any such code. Our result provides the first demonstration of a PTM mechanism that can, in principle, encode an arbitrary amount of information, along with an estimate of its information capacity. If natural selection has found such a capability useful, that may help account for the emergence of large numbers of phosphorylation sites.

Supplementary Material

1

Acknowledgements

This work was supported in part by the NIH under grant R01-GM081578. We thank A. Manrai for scientific discussions, R. Ward for editorial help and HMS RITG for support with cluster computing. We acknowledge the encouragement of the late Stephen Thomson (1946–2006) and Charles Gunawardena (1929–2007).

Footnotes

Supplementary Information is linked to the online version of the paper at www.nature.com/nature.

Reprints and permissions information is available at www.nature.com/reprints.

References

  • 1.Walsh CT. Posttranslational Modification of Proteins. Roberts and Company; 2006. [Google Scholar]
  • 2.Cohen P. The role of reversible protein phosphorylation in health and disease. Eur. J. Biochem. 2001;268:5001–5010. doi: 10.1046/j.0014-2956.2001.02473.x. [DOI] [PubMed] [Google Scholar]
  • 3.Gnad F, et al. PHOSIDA (phosphorylation site database): management, structural and evolutionary investigation, and prediction of phosphosites. Genome Biol. 2007;8:R250. doi: 10.1186/gb-2007-8-11-r250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Cohen P. The regulation of protein function by multisite phosphorylation — a 25 year update. Trends Biochem. Sci. 2000;25:596–601. doi: 10.1016/s0968-0004(00)01712-6. [DOI] [PubMed] [Google Scholar]
  • 5.Holmberg CI, Tran SEF, Eriksson JE, Sistonen L. Multisite phosphorylation provides sophisticated regulation of transcription factors. Trends Biochem. Sci. 2002;27:619–627. doi: 10.1016/s0968-0004(02)02207-7. [DOI] [PubMed] [Google Scholar]
  • 6.Wu RC, et al. Selective phosphorylations of the SRC–3/AIB1 coactivator integrate genomic responses to multiple cellular signaling pathways. Mol. Cell. 2004;15:937–949. doi: 10.1016/j.molcel.2004.08.019. [DOI] [PubMed] [Google Scholar]
  • 7.Park K-S, Mohapatra DP, Misonou H, Trimmer JS. Graded regulation of the Kv2.1 potassium channel by variable phosphorylation. Science. 2006;313:976–979. doi: 10.1126/science.1124254. [DOI] [PubMed] [Google Scholar]
  • 8.Nurse P. Life, logic and information. Nature. 2008;454:424–426. doi: 10.1038/454424a. [DOI] [PubMed] [Google Scholar]
  • 9.Sims RJ, Reinberg D. Is there a code embedded in proteins that is based on posttranslational modification? Nature Rev. Mol. Cell Biol. 2008;9:815–820. doi: 10.1038/nrm2502. [DOI] [PubMed] [Google Scholar]
  • 10.Jenuwein T, Allis CD. Translating the histone code. Science. 2001;293:1074–1080. doi: 10.1126/science.1063127. [DOI] [PubMed] [Google Scholar]
  • 11.Turner B. Cellular memory and the histone code. Cell. 2002;111:285–291. doi: 10.1016/s0092-8674(02)01080-2. [DOI] [PubMed] [Google Scholar]
  • 12.Cohen JE. Mathematics is biology's next microscope, only better; biology is mathematics' next physics, only better. PLoS Biol. 2004;2:e439. doi: 10.1371/journal.pbio.0020439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hunter T. The age of crosstalk: phosphorylation, ubiquitination and beyond. Mol. Cell. 2007;28:730–738. doi: 10.1016/j.molcel.2007.11.019. [DOI] [PubMed] [Google Scholar]
  • 14.Ferrarese A, et al. Chemical dissection of the APC repeat 3 multistep phosphorylation by the concerted action of protein kinases CK1 and GSK3. Biochemistry. 2007;46:11902–11910. doi: 10.1021/bi701674z. [DOI] [PubMed] [Google Scholar]
  • 15.Phanstiel D, et al. Mass spectrometry identifies and quantifies 74 unique histone H4 isoforms in differentiating human embryonic stem cells. Proc. Natl Acad. Sci. USA. 2008;105:4093–4098. doi: 10.1073/pnas.0710515105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Pesavento JJ, Bullock CR, LeDuc RD, Mizzen CA, Kelleher NL. Combinatorial modification of human histone H4 quantitated by two-dimensional liquid chromatography coupled with top down mass spectrometry. J. Biol. Chem. 2008;283:14927–14937. doi: 10.1074/jbc.M709796200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Goldbeter A, Koshland DE. An amplified sensitivity arising from covalent modification in biological systems. Proc. Natl Acad. Sci. USA. 1981;78:6840–6844. doi: 10.1073/pnas.78.11.6840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lisman JE. A mechanism for memory storage insensitive to molecular turnover: a bistable autophosphorylating kinase. Proc. Natl Acad. Sci. USA. 1985;82:3055–3057. doi: 10.1073/pnas.82.9.3055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Salazar C, Höfer T. Allosteric regulation of the transcription factor NFAT1 by multiple phosphorylation sites: a mathematical analysis. J. Mol. Biol. 2003;327:31–45. doi: 10.1016/s0022-2836(03)00085-8. [DOI] [PubMed] [Google Scholar]
  • 20.Markevich NI, Hoek JB, Kholodenko BN. Signalling switches and bistability arising from multisite phosphorylation in protein kinase cascades. J. Cell Biol. 2004;164:353–359. doi: 10.1083/jcb.200308060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Gunawardena J. Multisite protein phosphorylation makes a good threshold but can be a poor switch. Proc. Natl Acad. Sci. USA. 2005;102:14617–14622. doi: 10.1073/pnas.0507322102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kim SY, Ferrell JE. Substrate competition as a source of ultrasensitivity in the inactivation of Wee1. Cell. 2007;128:1133–1145. doi: 10.1016/j.cell.2007.01.039. [DOI] [PubMed] [Google Scholar]
  • 23.Ozbudak EM, Thattai M, Lim HN, Shraiman BI, van Oudenaarden A. Multistability in the lactose utilization network of Escherichia coli. Nature. 2004;427:737–740. doi: 10.1038/nature02298. [DOI] [PubMed] [Google Scholar]
  • 24.Sha W, et al. Hysteresis drives cell-cycle transitions in Xenopus laevis egg extracts. Proc. Natl Acad. Sci. USA. 2003;100:975–980. doi: 10.1073/pnas.0235349100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ferrell JE, Jr, Machleder EM. The biochemical basis of an all-or-none cell fate switch in Xenopus oocytes. Science. 1998;280:895–898. doi: 10.1126/science.280.5365.895. [DOI] [PubMed] [Google Scholar]
  • 26.Manrai A, Gunawardena J. The geometry of multisite phosphorylation. Biophys. J. 2008;95:5533–5543. doi: 10.1529/biophysj.108.140632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Monod J, Jacob F. General conclusions: teleonomic mechanisms in cellular metabolism, growth and differentiation. Cold Spring Harb. Symp. Quant. Biol. 1961;26:389–401. doi: 10.1101/sqb.1961.026.01.048. [DOI] [PubMed] [Google Scholar]
  • 28.Berger SL. The complex language of chromatin regulation during transcription. Nature. 2007;447:407–411. doi: 10.1038/nature05915. [DOI] [PubMed] [Google Scholar]
  • 29.Huang C-YF, Ferrell JE. Ultrasensitivity in the mitogen-activated protein kinase cascade. Proc. Natl Acad. Sci. USA. 1996;93:10078–10083. doi: 10.1073/pnas.93.19.10078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Cornish-Bowden A. Fundamentals of Enzyme Kinetics. 2nd edn Portland Press; 1995. pp. 23–28. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES