Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Mar 13.
Published in final edited form as: Cell Rep. 2014 Sep 15;8(6):1832–1844. doi: 10.1016/j.celrep.2014.07.055

Intrinsically Disordered Segments Affect Protein Half-Life in the Cell and during Evolution

Robin van der Lee 1,2,*, Benjamin Lang 1, Kai Kruse 1, Jörg Gsponer 3, Natalia Sánchez de Groot 1, Martijn A Huynen 2, Andreas Matouschek 4, Monika Fuxreiter 5, M Madan Babu 1,*
PMCID: PMC4358326  NIHMSID: NIHMS669198  PMID: 25220455

Summary

Precise control of protein turnover is essential for cellular homeostasis. The ubiquitin-proteasome system is well established as a major regulator of protein degradation, but an understanding of how inherent structural features influence the lifetimes of proteins is lacking. We report that yeast, mouse, and human proteins with terminal or internal intrinsically disordered segments have significantly shorter half-lives than proteins without these features. The lengths of the disordered segments that affect protein half-life are compatible with the structure of the proteasome. Divergence in terminal and internal disordered segments in yeast proteins originating from gene duplication leads to significantly altered half-life. Many paralogs that are affected by such changes participate in signaling, where altered protein half-life will directly impact cellular processes and function. Thus, natural variation in the length and position of disordered segments may affect protein half-life and could serve as an underappreciated source of genetic variation with important phenotypic consequences.

Introduction

Protein degradation is the endpoint of gene expression, and correct turnover of proteins is essential for cellular function. Indeed, protein half-life impacts virtually all cellular processes including the cell cycle (Pagano et al., 1995), DNA repair (Lakin and Jackson, 1999), apoptosis and cell survival (Rutkowski et al., 2006), alternative splicing (Irimia et al., 2012), circadian rhythm (van Ooijen et al., 2011), cell differentiation (Ramakrishna et al., 2011), development (Hirata et al., 2004), and immunity (Babon et al., 2006). Altered protein half-life can lead to abnormal development and diseases such as cancer and neurodegeneration (Ciechanover, 2012). For instance, artificially extending the half-life of the Hes7 transcription factor by ∼8 min severely disorganizes embryonic development in mice (Hirata et al., 2004). Missense mutations in succinate dehydrogenase that increase turnover rates contribute to neuroendocrine tumors (Yang et al., 2012).

The proteasome mediates controlled and selective degradation of most proteins in eukaryotic cells, and access to the proteasome is key to controlling the half-life of substrates (Goldberg, 2003; Hershko and Ciechanover, 1998). Substrate recruitment to the proteasome is primarily mediated through their polyubiquitination by ubiquitin ligases (Komander and Rape, 2012; Ravid and Hochstrasser, 2008; Varshavsky, 2012). This mechanism regulates the half-life of proteins, which ranges from seconds to days (Belle et al., 2006; Kristensen et al., 2013; Schwanhäusser et al., 2011). The large number of ubiquitin ligases and deubiquitinating enzymes encoded in eukaryotic genomes highlights the importance of this system (Hutchins et al., 2013; Komander et al., 2009). Although the role of ubiquitination in delivering proteins to the proteasome is well established, it remains unclear to what extent intrinsic structural features of substrates influence their half-life once bound to the proteasome and whether such features have been exploited to alter half-life during evolution.

An important feature implicated in affecting protein half-life is the presence of polypeptide regions that do not adopt a defined 3D structure, typically called intrinsically disordered, or unstructured regions (van der Lee et al., 2014). Disordered regions are present in a large number of eukaryotic proteins and play key roles in protein function along with structured domains (Babu et al., 2012). A number of genome-scale studies have investigated the relationship between the overall fraction of disordered residues of a protein and its half-life, but these have yielded contradictory results ranging from no correlation (Yen et al., 2008) to weak correlation (Tompa et al., 2008) to a strong effect (Gsponer et al., 2008). The reason for the inconsistencies is perhaps that these studies investigated correlations without the guidance provided by the biochemical mechanism by which disordered regions might contribute to protein turnover.

In this work, we develop a theory of how disordered segments influence protein half-life, through a systematic analysis of multiple data sets describing sequence, structure, expression, evolutionary relationships, and experimental half-life measurements from both unicellular and multicellular organisms. We present evidence that proteins with a long terminal or internal disordered segment have a significantly shorter in vivo half-life in yeast on a genomic scale. The same relationship is found in mouse and human. Upon gene duplication, divergence in terminal and internal disordered segments leads to altered half-life of paralogous proteins. Many affected paralogs participate in signaling pathways, where altered half-life will influence signaling outcomes. We suggest specific biochemical mechanisms by which disordered segments may influence degradation rates, how these changes might modulate cellular function and phenotype, and how natural variation in the length and position of intrinsically disordered protein regions may contribute to the evolution of protein half-life.

Results

To investigate the relationship between the structural architecture of proteins and their cellular stability, we inferred the disorder status of every residue in the proteomes of yeast, mouse, and human using the DISOPRED2 (Ward et al., 2004), IUPRED (Dosztányi et al., 2005), and PONDR VLS1 (Obradovic et al., 2005) software. In vivo protein half-life data for yeast were obtained from a study that used strains in which proteins expressed from their endogenous promoter contained a tandem affinity purification (TAP) tag at the C terminus (Belle et al., 2006). After inhibition of protein synthesis, protein abundance was measured at three time points by western blotting with TAP antibodies. Protein turnover in mouse and human cells was measured using isotope labeling and mass spectrometry (Kristensen et al., 2013; Schwanhäusser et al., 2011). We combined the information on protein half-life, and other large-scale data sets, with the position and length of disordered segments and analyzed the data using appropriate statistical tests (3,273 proteins in yeast, 4,502 in mouse, and 3,971 in human; Experimental Procedures; Table S1A; Figure S1A).

Long N-Terminal Disordered Segments Contribute to Short Protein Half-Life In Vivo

We first classified yeast proteins into two groups depending on the length of the disordered termini, treating the N and C termini separately: those with short (≤30 residues) and those with long (>30 residues) disordered tails (Figure 1A). The length cutoff was based on recent molecular models of the proteasome (da Fonseca et al., 2012; Lander et al., 2012; Lasker et al., 2012) and on in vitro biochemical studies using purified proteasomes showing that there is a critical minimum length of ∼30 residues that allows a disordered terminus of a ubiquitinated substrate to efficiently initiate degradation (Inobe et al., 2011). Indeed, analysis of the yeast data confirms that protein half-life does not depend linearly on the length of disordered segments (Figure S1B; Supplemental Experimental Procedures).

Figure 1. The Effects of Terminal Disordered Segments on Protein Half-Life.

Figure 1

(A) A total of 3,273 yeast proteins were grouped based on the length of the disordered segment at the N terminus. Long (dark red) and short (light red) terminal disordered segments were defined as stretches of >30 and ≤30 disordered residues.

(B–D) Boxplots of protein half-life distributions. Proteins were classified based on the length of the disordered segment at the N terminus (B) or the C terminus (C) and the presence of N-terminal disordered or structured segments (D, long N-terminal structured regions [dark gray] were defined as >30 structured residues).

(E) Boxplots of the distributions of N-terminal disorder length for different half-life groups, indicated with schematic exponential degradation curves (from short half-life [dark green] to long half-life [light green]).

Central boxplot notches mark the median and the 95% confidence interval. Colored boxes represent the 50% of data points above (×0.75) and below (×0.25) the median (×0.50). Vertical lines (whiskers) connected to the boxes by the horizontal dashed lines represent the largest and the smallest nonoutlier data points. Outliers are not shown to improve visualization. p values reported are from Mann-Whitney U (B–D) and Kruskal-Wallis (E) tests. p values, the number of data points (n), and differences between the half-life medians of the ∼ compared groups (ΔH̃) are shown to the right.

See also Figures S1 and S3 and Table S1.

Proteins with a long disordered N terminus have a significantly shorter half-life compared to proteins with a short disordered N terminus (p = 5 × 10−6, Mann-Whitney U test, a nonparametric test for assessing whether two samples come from the same underlying distribution [H0]; Figure 1B). The approach for measuring half-lives in yeast involved C-terminal tagging with a TAP tag, which is 186 amino acids long and largely structured. Since all proteins had identical C termini due to the TAP tag, we should see little difference in half-life between proteins with long and short C-terminal disorder as characterized from the original genome sequence. Indeed, these groups display similar distributions of protein half-life (p = 0.99, Mann-Whitney U test; Figure 1C).

In order to assess to what extent the disordered state of the N terminus affects half-life, we performed three analyses. First, we investigated proteins with a highly structured N terminus (>30 residues predicted to be structured) and found that they display a longer half-life compared to proteins with a long disordered N terminus (p = 2 × 10−7, Mann-Whitney U test; Figure 1D). Second, we classified the proteome into three groups of roughly equal size, based on their half-life: (1) short-lived proteins (half-life ≤ 30 min), (2) medium half-life proteins (31–70 min), and (3) long-lived proteins (>70 min) (Figure S1A). The distributions of the length of N-terminal disorder differ significantly across the three groups in a manner consistent with the above observations: proteins with a shorter half-life tend to have longer N-terminal disordered segments (p = 3 × 10−6, Kruskal-Wallis test, which extends the Mann-Whitney U test to three or more groups, Figures 1E and S1F). Again, this relationship is not true for the C terminus, because the TAP tag causes all proteins to have the same C terminus (p = 0.2, Kruskal-Wallis test; Figures S1E and S1F). Third, we quantified the effects of disordered segments on half-life by comparing conditional probabilities for finding proteins with and without long N-terminal disorder within specific half-life ranges. The likelihood of finding a protein with a short half-life among those that have long N-terminal disorder was two times higher than the “reverse” probability of finding proteins with long N-terminal disorder among those with short half-life (p[short half-life given long N-terminal disorder] = 0.44; p[long N-terminal disorder given short half-life] = 0.18; Tables 1A and S3A). This indicates that the presence of a long disordered N terminus often results in short half-life but proteins with short half-life need not always have a long N-terminal disordered segment. Thus, the presence of a disordered N terminus is linked to short half-life, but other properties also affect protein turnover (see Discussion).

Table 1. Conditional Probabilities for Intrinsically Disordered Segments and Protein Half-Life.

N-Terminal Disordered Segment Half-Life graphic file with name nihms669198t1.jpg graphic file with name nihms669198t2.jpg graphic file with name nihms669198t3.jpg Total
graphic file with name nihms669198t4.jpg 210 158 111 479
graphic file with name nihms669198t5.jpg 930 1023 841 2794
Total 1140 1181 952 3273
Internal Disordered Segment
graphic file with name nihms669198t6.jpg 564 432 264 1260
graphic file with name nihms669198t7.jpg 576 749 688 2013
Total 1140 1181 952 3273
Event (E) Condition (C) Probability (Event | Condition) = P(E⋂C)/P(C)
A Short half-life Long N-terminal disorder
P(Hs|NL)=210479=0.44
Long N-terminal disorder Short half-life
P(NL|HS)=2101140=0.18
B Short half-life Internal disordered segment
P(HS|Ip)=5641260=0.45
Internal disordered segment Short half-life
P(Ip|HS)=5641140=0.49
C Short half-life Long N-terminal or internal disorder
P(HS|NLIp)=6531513=0.43
Long N-terminal or internal disorder Short half-life
P(NLIp|HS)=6531140=0.57
Short half-life Long N-terminal and internal disorder
P(HS|NLIp)=121226=0.54
Long N-terminal and internal disorder Short half-life
P(NLIp|HS)=1211140=0.11
Short N-terminal and no internal disorder Short half-life
P(NSIa|HS)=4871140=0.43
Long N-terminal and internal disorder Long half-life
P(NLIp|HL)=39952=0.04

See the top panel of this table and Figures 1 and 2 for a description of the definitions. See also Tables S3A (for part A) and S3B (for part B).

Internal Disordered Segments Also Contribute to Short Protein Half-Life

The proteasome not only digests proteins starting from their termini but also can cleave or initiate from disordered regions in the middle of the chain (Fishbain et al., 2011; Liu et al., 2003; Piwko and Jentsch, 2006; Prakash et al., 2004; Takeuchi et al., 2007; Zhao et al., 2010). The catalytic residues for proteolysis are buried deep within the proteasome core particle, accessible only through a long narrow channel, and the same is true for the ATPase motor that drives protein substrates through the degradation channel (da Fonseca et al., 2012; Lander et al., 2012; Lasker et al., 2012). To reach these sites, a disordered segment in the middle of a protein has to be longer than a segment at a protein terminus (Fishbain et al., 2011). Therefore, to investigate whether the presence of internal disorder influences protein half-life, we identified proteasome-susceptible internal disordered segments as continuous stretches of at least 40 disordered amino acids (see Discussion). Proteins that contain such an internal disordered segment have a significantly shorter half-life than proteins that do not (p = 3 × 10−29, Mann-Whitney U test; Figure 2A). This observation is robust to our choice of cutoff used for detecting internal disordered segments, but systematically varying the length cutoff revealed that maximal difference in median half-life is obtained for a value of 40 amino acids (Table S1E). Further, the relationship is independent of N-terminal disorder, as the half-life of proteins with internal disordered segments is significantly lower than of those without, regardless of the length of the disordered terminus (Figure 2B).

Figure 2. The Effects of Internal Disordered Segments on Protein Half-Life.

Figure 2

(A) Boxplots of protein half-life distributions for different groups of yeast proteins that contain (dark red) or lack (light red) an internal disordered segment (defined as a continuous stretch of ≥40 disordered residues), subclassified based on (B and D) the length of N-terminal disorder (as in Figure 1: long, >30 residues or short, ≤ 30 residues) and (C) the number of internal disordered segments (from zero, top, to three or more, bottom). Each protein is present in only one category per panel. See Figure 1 for further information. See also Figures S2 and S3 and Tables S1 and S4.

To quantify the contribution of internal disorder to protein half-life, we computed conditional probabilities for finding proteins with and without internal disordered segments within specific half-life ranges. The probability of observing a protein with a short half-life among those that contain an internal disordered segment is high and comparable to the “reverse” probability of finding a protein containing an internal disordered segment among those with short half-life (p[short half-life given internal disordered segment] = 0.45; p[internal disordered segment given short half-life] = 0.49; Tables 1B and S3B). This suggests that presence or absence of an internal disordered segment is an important determinant of the half-life of a protein.

Terminal and Internal Disordered Segments Have Combined Effects on Half-Life

Interestingly, proteins with multiple internal disordered segments have even shorter half-lives than proteins with a single segment (Figures 2C and S2C). This prompted us to investigate the combinatorial effects of terminal and internal disordered segments. Indeed, proteins that have both a long terminal disordered segment and an internal disordered segment tend to have the shortest half-lives (Figure 2D). Furthermore, the probability of having either a terminal or an internal disordered segment given that a protein has a short half-life is the highest (p[long N-terminal or internal disorder given short half-life] = 0.57; Table 1C). Consistent with this observation, we find that the probability of having both terminal and internal disordered segments among proteins with a long half-life is very low (p[long N-terminal and internal disorder given long half-life] = 0.04; Table 1C). Taken together, these results suggest that disordered segments are modular in their ability to affect protein half-life and that these segments can act in a combinatorial manner to accentuate their effects.

The Effects of Disordered Segments on Half-Life Are Independent of the Overall Disorder Degree

So far, we have investigated the effects of continuous stretches of disordered residues (i.e., disordered segments) on protein turnover. However, the fraction of disordered residues (i.e., overall degree of disorder), which is an estimate of the packing, folding, and structural stability of a protein, also correlates with half-life, although previous studies disagree on the extent of the effect (Gsponer et al., 2008; Tompa et al., 2008; Yen et al., 2008). Proteins with a greater overall disorder degree generally contain longer terminal and internal disordered segments (Figure S3A). To determine whether the effects of disordered segments on protein turnover (Figures 1 and 2) are independent of the overall degree of disorder, we matched proteins that have a similar fraction of disordered residues but have varying combinations of disordered segments (long or short N-terminal disorder and/or presence or absence of internal disordered segments; Supplemental Results; Figure S3B).

Comparison of the half-life distributions of proteins from different classes with similar overall disorder degrees (Figure S3C) reveals similar trends as the analysis that uses all proteins (Figure 2D): proteins with both long N-terminal and internal disordered segments typically have the shortest half-lives, followed by proteins with either long internal or long N-terminal disordered segments. Proteins without disordered segments typically have the longest half-lives. The effect sizes of the differences between the half-life distributions are comparable when using all or only proteins with matched overall disorder degree (Figure S3D, upper triangles). Furthermore, most half-life distributions are significantly different, though p values are less significant due to smaller sample sizes (Figure S3D, lower triangles). These results indicate that long disordered segments at the N terminus or internally are important intrinsic features that contribute to shorter protein half-life in living cells and that these effects are independent of the fraction of disordered residues across the whole protein. It should, however, be noted that this does not rule out an additional effect of the overall disorder degree on half-life, i.e., among proteins that do or do not have a disordered segment, proteins with higher degrees of overall disorder tend to have a lower half-life compared to those with a lower degree of disorder (see Discussion).

Disordered Segments Have Direct Effects on Half-Life Rather than Acting Indirectly by Embedding Destruction Signals

Disordered segments could influence half-life either indirectly, by embedding short peptide motifs that serve as destruction signals such as ubiquitination sites or docking sites for ubiquitinating enzymes (Ravid and Hochstrasser, 2008), or directly, by better initiating degradation by the proteasome (Gödderz et al., 2011; Inobe et al., 2011; Peña et al., 2009; Piwko and Jentsch, 2006; Prakash et al., 2004; Takeuchi et al., 2007; Verhoef et al., 2009; Zhao et al., 2010). To investigate the indirect effects, we collected data on four known destruction signals: experimentally determined ubiquitination sites as well as predicted KEN box motifs, destruction box motifs, and PEST sequences. More than half (56%) of all proteins with long terminal or internal disordered segments do not contain any of these destruction signals in their disordered segments that could account for the short half-life (Supplemental Results). Consistently, half-life distributions of proteins with and without predicted destruction signals within the disordered regions were not significantly different (p = 0.1 for N-terminal disorder; p = 0.2 for internal disorder; Mann-Whitney U test; Supplemental Results). Indeed, the majority of experimentally determined ubiquitination sites involved in degradation are in structured rather than disordered regions (Hagai et al., 2011). Furthermore, sequence analysis revealed that disordered segments of proteins with short half-life lack enriched, uncharacterized sequence motifs that could result in rapid degradation, for example by serving as docking sites for ubiquitin ligases (Supplemental Results). Together, these findings suggest that disordered segments do not affect half-life primarily indirectly by embedding destruction motifs. Rather, the general characteristics of disordered segments seem to directly result in short half-life by forming initiation sites for degradation by the proteasome.

Disordered Segments Have Similar Effects on Protein Turnover in Mouse and Human

Given that the ubiquitin-proteasome system and the architecture of the proteasome itself are conserved from yeast to mammals (da Fonseca et al., 2012; Lasker et al., 2012), we hypothesized that the observed relationships may be evolutionarily conserved. We investigated the effects of terminal and internal disordered segments on protein degradation in mouse NIH 3T3 fibroblasts (Schwanhäusser et al., 2011) and in human THP-1 myelomonocytic leukemia cells (Kristensen et al., 2013) and found similar trends: the presence of N-terminal and internal disordered segments is linked with significantly faster protein turnover in both mouse (shorter half-lives) and human (higher degradation rates) (Figure 3; Table S5). In the mouse and human studies, protein degradation was monitored using isotope labeling and mass spectrometry, so that proteins did not need to be tagged at the either terminus (in contrast to the yeast study). Therefore, we could assess the contribution of the disordered segment at the C terminus and found that proteins with a long C-terminal disordered segment display increased turnover in mouse and human, though in mouse, the effect seems smaller than for N-terminal disorder and is not statistically significant (Figures 3B and 3E). These results collectively suggest that the effects of disordered regions on protein half-life are evolutionary conserved.

Figure 3. The Effects of Disordered Segments on Protein Turnover in Mouse and Human.

Figure 3

Boxplots of the distributions of half-life values in Mus musculus (A–C), and relative degradation rates in Homo sapiens (D–F), for proteins with long and short N-terminal, C-terminal, and internal disordered segments. Note that the scale for protein half-life is in hours for mouse, rather than minutes as in yeast. Values are reversed for the human data: proteins with a short half-life have a high relative degradation rate. See Figure 1 for further information. See also Table S5.

Divergence in Disordered Segments during Evolution Can Impact Protein Half-Life

Our observations suggest that protein turnover rates could be tuned by divergence in terminal or internal disordered segments during evolution (Figure 4A). To test this, we investigated protein pairs in yeast that arose from gene duplication (i.e., paralogs) (Experimental Procedures). Since paralogs are encoded within the same genome, this makes it possible to compare half-lives between evolutionarily related proteins under similar conditions. We specifically asked whether paralogs diverged in the length of N-terminal disorder or in the number of internal disordered segments (but are otherwise largely similar) and, if they did, whether this corresponded to changes in their half-life. Protein half-life data are available for both paralogs of 1,440 pairs (Table S7), and many of these paralog pairs have diverged in the length and number of terminal and internal disordered segments (Figures 4B and 4C, Tables S7 and S8).

Figure 4. Divergence in Disordered Segments during Evolution Can Impact Protein Half-Life.

Figure 4

(A) Schematic depiction of how the half-life of paralogs could be altered by changes in N-terminal and/or internal disordered segments during evolution. The dark and light green degradation curves denote a short and long half-life. This schematic is not intended to cover all possible scenarios for divergence of disordered segments between paralogs.

(B) Distributions of half-life differences (ΔH) in pairs of yeast paralogs, grouped according to the difference in the length of their N-terminal disordered segments. Top: one paralog has a short and the other paralog a long disordered N terminus (SL). Bottom: both paralogs have short (both ≤30 residues; SS) or both have long (both >30 residues; LL) disordered N termini.

(C) Distribution of half-life differences (ΔH) in paralog pairs, grouped according to the difference in the number of internal disordered regions (ΔI). Top: pairs where one of the two paralogs has a higher number of internal disordered segments (ΔI ≥ 1). Bottom: pairs with identical numbers of internal disordered segment (ΔI = 0).

Each paralog pair is arranged so that ΔL = L1 − L2 (B) and ΔI = I1 − I2 (C) are always positive (i.e., L1 ≥ L2 and I1 ≥ I2). This order is used for the ΔH calculation (that is, the half-life of the paralog with the shortest N-terminal disorder, or the smallest number of internal disordered segments, will be subtracted from the half-life of the other one; ΔH = H1 − H2). As a result, ΔH will be negative for pairs where an increase in N-terminal or internal disorder coincides with a shorter half-life (Experimental Procedures). For ΔI = 0 (C, bottom), two ΔH distributions were obtained by ordering the paralogs within a pair according to the total length of all internal disordered segments (increasing and decreasing to simulate gain and loss of internal disorder length during evolution; Table S6A). See Figure 1 for further information. See also Figure S4 and Table S6.

We classified the pairs of paralogs into (1) those that during evolution maintained N-terminal disorder of roughly equal length (i.e., both proteins have a short [≤30 residues] or both have a long [>30 residues] disordered segment at the N terminus; 1,049 pairs) and (2) pairs with disordered N termini of different length (i.e., one protein of the pair has a short and the other has a long disordered segment; 391 pairs). Paralogous protein pairs that diverged in the length of N-terminal disorder show significantly larger differences in half-lives than pairs that maintained roughly equal N-terminal disorder (p = 9 × 10−6, Mann-Whitney U test; Figure 4B), in a manner that agrees with the trends reported above: the protein with the longer N-terminal disordered segment usually has a shorter half-life than its paralog with a shorter disordered segment. More precisely, (1) paralogous proteins with similar length of terminal disorder tend to have similar half-life values (median difference in half-life is close to zero; Figure 4B, bottom boxplot; Table S6A), and (2) the half-life of proteins with longer N-terminal disordered regions tends to be 14 min shorter (median) than that of their paralogous partners (Figure 4B, top boxplot; Table S6A). The converse is also true, as paralogous pairs with large half-life changes show a large divergence in the length of N-terminal disorder (Supplemental Results; Figure S4A). A 14 min difference in half-life between paralogs is substantial in the context of yeast biology, as this is comparable to the time from division to budding (G1 phase) in laboratory strains growing exponentially in rich media at 30°C, which is 15–37 min (Di Talia et al., 2007). Thus, altered half-life due to divergence in the length of terminal disorder could have a significant impact on the duration for which a protein can impart its function in a cell and thus affect cellular behavior.

Paralogous proteins that differ in the number of internal disordered segments also show significantly larger changes in half-life than pairs with the same number of internal disordered regions (p = 1 x 10−5, Mann-Whitney U test; Figure 4C). The half-life of proteins with more internal disordered regions tends to be 7 min shorter (median) than that of their paralogs (Table S6A), which again can be a considerable amount of time considering the doubling time of yeast. In paralogous pairs that have the same number of internal disordered segments but diverged in the total internal disorder length (i.e., the sum of all internal disordered segments), the half-life of the protein with the longer total internal disorder also tends to be shorter (median half-life difference is 5 min; Table S6A).

Analysis of conditional probability values allowed us to quantify the trends (Tables 2 and S8). The majority (73%) of paralogous pairs that diverged in the length of terminal or number of internal disordered segments show a consistent change in half-life: the paralog with the longest terminal disordered segment or largest number of internal disordered segments tends to have the shorter half-life (p[shorter half-life given divergence of N-terminal or internal disorder] = 0.73; Table 2C). This effect is large even if only segments at the N terminus or only internal segments are considered (p [shorter half-life given divergence of N-terminal disorder] = 0.64 and p[shorter half-life given divergence of internal disorder] = 0.58; Tables 2A and 2B). Again, the converse is also true as the probability of observing a paralogous pair that has diverged in both terminal and internal disorder, and in which the paralog with the longest terminal disordered segment and most internal disordered segments has the longer half-life, is very small (p[divergence of N-terminal and internal disorder given longer half-life] = 0.01; Table 2C). Taken together, the results suggest that the gain or loss of long terminal or internal disordered segments can significantly influence the half-life of a protein upon gene duplication during evolution. Thus evolution of intrinsic features such as disordered segments may be an important contributor to the degradation rate of proteins.

Table 2. Conditional Probabilities for Intrinsically Disordered Segments and Protein Half-Life in Pairs of Paralogs.

Event (E) Condition (C) Probability (Event | Condition) = P(E⋂C)/P(C)
A Shorter half-life divergence of N-terminal disorder
P(ΔH<0min|NSL)=252391=0.64
Divergence of N-terminal disorder shorter half-life
P(NSL|ΔH<0min)=252787=0.32
B Shorter half-life divergence of internal disordered segments
P(ΔH<0min|Inn+x)=323561=0.58
Divergence of internal disordered segments shorter half-life
P(Inn+x|ΔH<0min)=323799=0.40
C Shorter half-life divergence of N-terminal or internal disordered segments
P(ΔH<0min|NSLInn+x)=542741=0.73
Divergence of internal disordered segments shorter half-life
P(NSLInn+x|ΔH<0min)=5421176=0.46
Shorter half-life divergence of N-terminal and internal disordered segments
P(ΔH<0min|NSLInn+x)=33211=0.16
Divergence of N-terminal and internal disordered segments shorter half-life
P(NSLInn+x|ΔH<0min)=331176=0.03
Divergence of N-terminal and internal disordered segments longer half-life
P(NSLInn+x|ΔH>0min)=111013=0.01

See Figure 4 for a description of the definitions. NSL denotes pairs where one paralog has a short and the other paralog a long N-terminal disordered segment. In = > n+x denotes pairs where one of the two paralogs has a higher number of internal disordered segments (ΔI ≥ 1). See also Tables S8A (for part A) and S8B (for part B).

Functional Analysis and Literature Evidence Support the Role of Disordered Segments in Governing Protein Half-Life and Phenotype

Disordered segments that affect half-life could be important for governing phenotypes, because precise protein turnover is important for many cellular processes. An analysis of function annotations of proteins with long N-terminal or internal disordered segments revealed enrichment for protein kinases and phosphoproteins and associations with regulatory and transcription functions, as well as cell-cycle processes (Tables S1F and S4B). Paralogs that have diverged in terminal or internal disordered segments have similar functions and are additionally involved in, for example, ATP and nucleotide binding and ubiquitin conjugation activities (Table S6C). These are all functions involved in signaling and regulation, where alteration of protein half-life can significantly affect the duration of activity of the protein and thereby impact cellular phenotype (Legewie et al., 2008) (see Discussion).

A literature search revealed several examples where changes in disordered segments lead to phenotypic differences through altered protein half-life. Stabilizing the half-life of the yeast kinase Ime2, a positive regulator of meiosis, by deletion of an internal disordered region results in altered sporulation efficiency (Guttmann-Raviv et al., 2002). Similarly, deletion of a highly disordered 47-amino-acid stretch at the N terminus of yeast Cdc6 prevents its degradation, although in this case, the deletion also abolishes the interaction with a ubiquitin ligase complex (Drury et al., 1997). Deletion of the first 31 residues of the human nuclear receptor Nurr1 significantly reduces its degradation by the ubiquitin-proteasome pathway and consequently leads to increased activation as a transcription factor (Alvarez-Castelao et al., 2013). Interestingly, the deleted region corresponds completely to a putative disordered segment at the N terminus and the size of the deletion could now explain the effects on half-life. These selected examples illustrate the importance of disordered segments for maintaining correct protein turnover.

Discussion

Ubiquitination by E3 ligases has a dominant role in deciding when a protein gets targeted for proteasomal degradation, but it has remained unclear how intrinsic features affect the lifetime of a protein and whether such features have been exploited to alter half-life during evolution. Here, we uncovered genome-scale principles of how intrinsically disordered segments influence protein turnover in the cell and during evolution. On a genomic scale, in vivo, sufficiently long disordered regions at the termini or in the middle of proteins can directly decrease half-life (Figure 5). A large number of control calculations confirmed that the reported trends are independent of confounding factors, such as the cutoffs used to group the proteins, the disorder prediction method, the statistical tests used, protein abundance and length, subcellular localization, membrane proteins, and the nature of the N-terminal residue (Supplemental Results). Finally, we found that changes in the length and number of disordered segments upon gene duplication are linked with altered half-life, suggesting that such variation can contribute to the tuning of half-life during evolution.

Figure 5. Concept Describing the General Relationship between the Presence of Long Terminal or Internal Disordered Segments and Protein Half-Life.

Figure 5

Disordered segments influence protein half-life by permitting efficient initiation of degradation by the proteasome. Ubiquitination and other factors contributing to substrate targeting to the proteasome are not depicted.

The Structure and Composition of the Proteasome Suggest Molecular Mechanisms to Explain the Observations

The structure of the 19S regulatory particle (da Fonseca et al., 2012; Lander et al., 2012; Lasker et al., 2012) provides insights into the mechanisms by which disordered segments may increase the efficiency of proteasomal degradation and affect protein half-life. The distance between the two ubiquitin receptors, Rpn10 and Rpn13, and the ATPase unfolding channel is ∼70–80 Å. The essential deubiquitinating enzyme Rpn11 sits ∼60 Å from the ATPase ring. A terminal disordered segment of 30 residues would comfortably span these distances and could serve as a degradation initiation site. Similarly, 40 residues would be enough for an internal disordered segment to reach into the ATPase ring of the regulatory particle, even when folding back on itself. The precise distance requirements for a disordered segment to serve as an initiation site will depend on specific properties of the proteasome and the geometry and binding position of the substrate. For example, at least five different substrate receptors associate with the yeast proteasome, and some of them exhibit extensive conformational flexibility (Finley, 2009). Substrate-specific aspects that affect the distances include the state of the termini, which are frequently subject to maturation through cleavage and trimming (Lange and Overall, 2013), and properties of the polyubiquitin tag such as the linkage type (e.g., K48 and K11) and number of ubiquitin moieties (Komander and Rape, 2012), and the attachment point to the substrate (Hagai et al., 2011; Inobe et al., 2011). This could explain why, with in vivo data for thousands of different proteins, we do not observe a strict length cutoff for when disordered segments influence protein half-life: cutoffs of about 30 terminal and 40 internal disordered residues produce the largest differences between the half-lives of proteins with and without disordered segments, but shorter and longer segments also contribute to shorter half-life (Table S1E). Thus, individual proteins are likely to have specific length requirements of disordered segments that depend on a variety of factors and contribute to the range of lengths at which disordered segments decrease protein half-life on a global scale.

The ATP-independent regulators PA28 and P200 can also facilitate opening of the 20S proteasome entry gate and contribute to substrate degradation (Stadtmueller and Hill, 2011). The ATPase complex p97/VCP perhaps serves as an alternative cap that directly binds the 20S core particle as well (Barthelme and Sauer, 2012). All these complexes may have different requirements for disordered segments in the substrate proteins. In fact, it has been suggested that p97/VCP may unfold proteins lacking disordered regions (Beskow et al., 2009). In vitro, the 20S proteasome core particle by itself can degrade highly disordered proteins in a process termed degradation by default and it may also be able to do so in vivo (Tsvetkov et al., 2008). The average distance between the entry pore and the proteolytic sites in the 20S core particle is ∼70 Å (da Fonseca et al., 2012; Lasker et al., 2012). An internal disordered segment of at least 40 residues is able to span twice this distance and thus could be cleaved by the core particle alone (Figure S2D). Thus, proteins with disordered segments of specific length may be processed quickly due to efficient initiation of degradation as discussed.

The overall disorder degree of a substrate might further affect its half-life. Upon initiation of degradation, the proteasome may quickly degrade proteins with high overall levels of disorder, because its ATPase subunits spend less time to unfold these disordered proteins once they are engaged compared to proteins of similar length that are structured and need to be unfolded before they can be processed. Indeed, biochemical evidence suggests increasingly structured and stable substrates have higher turnover times and energy costs (Henderson et al., 2011; Peth et al., 2013).

Disordered Segments Influence Half-Life as an Intrinsic Feature that Can Be Modulated by Other Mechanisms

Although proteins with long terminal or internal disordered segments tend to have a short half-life, various factors can increase or decrease the half-life of individual proteins (see also Supplemental Discussion). For example, the presence of a highly structured N-terminal domain may shield proteins with internal disordered segments from degradation (Simister et al., 2011). Disordered proteins may also be protected by forming protein complexes or through interactions with other proteins. For instance, several specialized proteins have been shown to bind to and stabilize disordered proteins (Tsvetkov et al., 2009). Furthermore, specific low-complexity sequences or tandem repeats in the degradation initiation site can attenuate initiation or progression of degradation, thereby affecting substrate half-life (Sharipo et al., 1998; Tian et al., 2005; Zhang and Coffino, 2004). This is consistent with the idea that, although disordered segments are all similar in that they lack the ability to independently fold into a compact structure, many types of sequences exist within this definition that have different biophysical and conformational characteristics. For example, some disordered sequences are relatively globular and collapsed while others are expanded, and this is determined by the overall charge and sequence composition of the disordered region (Mao et al., 2013). Indeed, the proteasome has clear amino acid sequence preferences as degradation of model substrates that differ only in their disordered initiation regions varies over at least an order of magnitude (Fishbain et al., 2014). The broad distributions of half-lives observed in our study support this, as they reflect the combined properties of many possible subtypes of disordered segments, some of which are able to efficiently initiate degradation, while others may not.

In short, one can distinguish at least three distinct determinants of protein half-life. (1) Sequence motifs and the presence of regulatory proteins such as ubiquitin ligases contribute to the overall half-life of a protein by determining when substrates will be ubiquitinated and hence targeted to the proteasome. (2) Upon recognition of the ubiquitinated substrate by the proteasome, the presence of disordered segments of sufficient size either at the terminus or internally may facilitate efficient degradation initiation, thereby leading to lower half-life. (3) The overall degree of disorder may contribute to a general trend in lowering half-life by increasing the processivity of degradation upon engagement (after recognition and initiation) by the proteasome. Thus, our observations suggest that disordered segments influence protein half-life as an underlying factor that can be modulated by other cellular mechanisms, sequence determinants and the structural stability of the substrate.

Disordered Segments Could Influence the Dynamics and Regulation of Signaling Pathways

Disordered regions are prominent in regulatory and signaling proteins (Tables S1F, S4B, and S6C) (van der Lee et al., 2014). Since divergence in disordered segments may affect protein half-life, this could influence the response kinetics of signaling and regulatory pathways involving such proteins (see also Supplemental Discussion). In fact, among the paralogous pairs that show the largest divergence in the length of N-terminal disorder and half-life in yeast (Tables S6C and S7), there are several regulatory protein kinases such as MAP kinases (MKK1, MKK2, HOG1, and STE7), serine/threonine kinases (YPK1, YPK2, and KIN28), and cyclin-dependent kinases (PHO85 and CAK1). Paralogs that have diverged in terminal or internal disordered segments are generally enriched in nucleotide binding, kinase regulatory activity, and phosphoproteins. Alterations in the degradation rate of kinases, for instance, can have significant implications for the dynamics of signaling networks (Legewie et al., 2008; Purvis and Lahav, 2013). Such effects have been shown for the yeast kinase Ime2 (Guttmann-Raviv et al., 2002) and mouse transcription factor Hes7 (Hirata et al., 2004), where mutations in disordered segments lead to changes in protein half-life, which in turn severely deregulate signaling and development, respectively.

Our observations raise the possibility that proteins with a long terminal disordered segment might be better presented as antigens to the immune system. This is because the immunoproteasome, a variant of the canonical proteasome, may better process such proteins into peptides for presentation by the major histocompatibility complex (MHC) molecules (Groettrup et al., 2010). In line with this idea, it has been shown that (1) N-extended epitopes are efficiently processed by the immunoproteasome and serve as better substrates for antigen presentation (Cascio et al., 2001) and (2) the presence of a disordered region determines the direction of degradation, which in turn determines the spectrum of generated peptides (Berko et al., 2012). It is tempting to speculate that one could improve vaccine efficiency by adding or extending terminal disordered regions to epitope-containing proteins. Our findings also call for careful interpretation of half-life measurements made on proteins that are tagged at their termini using constructs with a varying degree of structure or intrinsic disorder (e.g., GFP, TAP tag, His-tag).

Divergence in Disordered Segments Provides a Means for Tuning Protein Half-Life during Evolution and Could Generate Phenotypic Variation

We observed that divergence in disordered regions might influence protein half-life among paralogs. An outstanding question is whether such changes in half-life through divergence of disordered segments are under selective pressure. Natural variation leading to alteration of disordered regions may provide a simple means for regulatory subfunctionalization of paralogous proteins upon gene duplication. It also suggests a mechanism for divergence of half-life among orthologous proteins between species. Thus, while it is clear that the emergence of destruction signals such as ubiquitination sites and dedicated ubiquitin ligases affect targeting of a protein for degradation, variation in disordered segments may provide a simple evolutionary mechanism for fine-tuning protein turnover rates.

Several genetic and molecular mechanisms may generate diversity in terminal or internal disordered segments. These include repeat expansion, alternative splicing, and alternative transcription start sites, all of which can influence the length of terminal and/or internal disorder of protein products, thereby potentially influencing the half-life. This idea is supported by the observation that protein disorder is common in insertions and deletions (Light et al., 2013). Furthermore, given that in multicellular eukaryotes (1) alternative transcription start sites commonly generate variation in N termini (Carninci et al., 2006) and (2) alternatively spliced exons are enriched in intrinsic disorder (Buljan et al., 2013), it is likely that such events that generate diversity in protein sequences in different cell types within an individual will have an effect on protein half-life. Similarly, given that disordered regions often contain homopolymeric repeat sequences (Tompa, 2003), and because tandem repeats in DNA sequences can lead to expansion or deletion of genetic material through strand slippage during replication (Levinson and Gutman, 1987), it is plausible that individuals in a population harbor genetic variants that code for proteins with altered length of disordered segments and thus have different half-lives. Changes in protein turnover in turn may disturb protein abundance and could lead to disease (Babu et al., 2011; Hirata et al., 2004; Yang et al., 2012), especially in the case of pleiotropic, regulatory, or signaling proteins. Thus mechanisms that generate diversity in the length or number of disordered segments could serve as a source of genetic variation that may have important phenotypic consequences.

Experimental Procedures

Protein Half-Life Data and Calculation of Disordered Segments

Protein half-life data and other data (Table S1A) were collected for yeast (Saccharomyces cerevisiae), mouse, and human. Intrinsic disorder was predicted for all reviewed protein sequences of these organisms (downloaded from UniProtKB/Swiss-Prot; http://www.uniprot.org/) using three complementary methods: DISOPRED2, IUPRED long, and PONDR VLS1. The presence and length of N-terminal, C-terminal, and internal disordered segments were then calculated using different algorithms and integrated with the half-life data. Proteomes were classified into groups according to the length of disordered segments: (1) proteins with short and long disordered termini (length cutoff 30 residues, treating the N and C termini separately) and (2) proteins with and without internal disordered segments (at least 40 disordered residues). The overall degree of disorder of a protein was calculated as the fraction of disordered residues (number of disordered residues divided by sequence length). The distributions of half-life values and protein disorder were analyzed using appropriate statistical tests.

See the Supplemental Experimental Procedures for more details.

Paralog Data and Calculations

Yeast paralog pairs were obtained from an all-against-all sequence comparison using BLASTClust (Altschul et al., 1990). More divergent paralogs from the yeast whole-genome duplication event (Wolfe and Shields, 1997) were added to the list. To calculate the differences in half-life (ΔH) and N-terminal disorder length (ΔL) between the individual proteins in a paralog pair, ΔL is defined to be always positive and obtained by subtracting the N-terminal disorder length of paralog 2 from the N-terminal disorder length of paralog 1 (ΔL = L1 − L2; L1 ≥ L2). To calculate ΔH, the order of paralogs in a pair is maintained, so that ΔH can be positive or negative (ΔH=H1 − H2). Thus, ΔH is negative whenever the relationship “longer disordered N terminus = shorter half-life” holds true. Similarly, the difference in the number of internal disordered segments ΔI is defined to always be positive (ΔI= I1 − I2; I1 ≥ I2), and ΔH is calculated accordingly. Paralog pairs were separated into categories according to the divergence in N-terminal (pairs that maintained or that diverged N-terminal disorder) or internal disordered segments (pairs with an identical or different number of segments).

See the Supplemental Experimental Procedures for more details.

Supplementary Material

Supplementary information

Acknowledgments

We acknowledge the Medical Research Council (U105185859). We thank A. Bertolotti, C. Chothia, C. Ravarani, D. Komander, G. Chalancon, J. Marsh, R. ter Horst, M. Buljan, R. Hegde, R. Weatheritt, S. Balaji, S. Chavali, S. Teichmann, and the anonymous reviewers for their comments on this work. We acknowledge financial support from HFSP (RGY0073/2010, R.v.d.L., B.L., and M.M.B.), the Virgo consortium funded by the Dutch government (FES0908) and the Netherlands Genomics Initiative (050-060-452; R.v.d.L. and M.A.H.), the Herchel Smith Research Studentship Fund (B.L.), the LMB International PhD Programme (K.K.), an EMBO STF 312-2011 and Momentum award (LP2012-41/2012) by the Hungarian Academy of Sciences (M.F.), FEBS and FP7-PEOPLE-2012-IEF 299105 fellowships (N.S.d.G.), PrioNet Canada and UBC (J.G.), NIH (R01GM63004 and U54GM105816, A.M.), and the Welch Foundation (F-1817, A.M.). M.M.B. acknowledges the MRC, EMBO Young Investigator Programme, Trinity College, and ERASysBio+ for support and funding his research program.

Footnotes

Supplemental Information: Supplemental Information includes Supplemental Results, Supplemental Discussion, Supplemental Experimental Procedures, four figures, and eight tables and can be found with this article online at http://dx.doi.org/10.1016/j.celrep.2014.07.055.

Author Contributions: R.v.d.L. and M.M.B. designed the project and analyzed and interpreted the results. R.v.d.L. performed the majority of the calculations. B.L. performed additional calculations, and K.K., J.G., N.S.d.G., and M.F. carried out specific analyses. B.L., K.K., M.A.H., M.F., and A.M. assisted in interpretation. R.v.d.L. and M.M.B. wrote the manuscript with help from B.L., M.A.H., M.F., and A.M.

References

  1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  2. Alvarez-Castelao B, Losada F, Ahicart P, Castaño JG. The N-terminal region of Nurr1 (a.a 1-31) is essential for its efficient degradation by the ubiquitin proteasome pathway. PLoS ONE. 2013;8:e55999. doi: 10.1371/journal.pone.0055999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Babon JJ, McManus EJ, Yao S, DeSouza DP, Mielke LA, Sprigg NS, Willson TA, Hilton DJ, Nicola NA, Baca M, et al. The structure of SOCS3 reveals the basis of the extended SH2 domain function and identifies an unstructured insertion that regulates stability. Mol Cell. 2006;22:205–216. doi: 10.1016/j.molcel.2006.03.024. [DOI] [PubMed] [Google Scholar]
  4. Babu MM, van der Lee R, de Groot NS, Gsponer J. Intrinsically disordered proteins: regulation and disease. Curr Opin Struct Biol. 2011;21:432–440. doi: 10.1016/j.sbi.2011.03.011. [DOI] [PubMed] [Google Scholar]
  5. Babu MM, Kriwacki RW, Pappu RV. Structural biology. Versatility from protein disorder Science. 2012;337:1460–1461. doi: 10.1126/science.1228775. [DOI] [PubMed] [Google Scholar]
  6. Barthelme D, Sauer RT. Identification of the Cdc4•d20S proteasome as an ancient AAA+ proteolytic machine. Science. 2012;337:843–846. doi: 10.1126/science.1224352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Belle A, Tanay A, Bitincka L, Shamir R, O'Shea EK. Quantification of protein half-lives in the budding yeast proteome. Proc Natl Acad Sci USA. 2006;103:13004–13009. doi: 10.1073/pnas.0605420103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Berko D, Tabachnick-Cherny S, Shental-Bechor D, Cascio P, Mioletti S, Levy Y, Admon A, Ziv T, Tirosh B, Goldberg AL, Navon A. The direction of protein entry into the proteasome determines the variety of products and depends on the force needed to unfold its two termini. Mol Cell. 2012;48:601–611. doi: 10.1016/j.molcel.2012.08.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Beskow A, Grimberg KB, Bott LC, Salomons FA, Dantuma NP, Young P. A conserved unfoldase activity for the p97 AAA-ATPase in proteasomal degradation. J Mol Biol. 2009;394:732–746. doi: 10.1016/j.jmb.2009.09.050. [DOI] [PubMed] [Google Scholar]
  10. Buljan M, Chalancon G, Dunker AK, Bateman A, Balaji S, Fuxreiter M, Babu MM. Alternative splicing of intrinsically disordered regions and rewiring of protein interactions. Curr Opin Struct Biol. 2013;23:443–450. doi: 10.1016/j.sbi.2013.03.006. [DOI] [PubMed] [Google Scholar]
  11. Carninci P, Sandelin A, Lenhard B, Katayama S, Shimokawa K, Ponjavic J, Semple CA, Taylor MS, Engströ m PG, Frith MC, et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat Genet. 2006;38:626–635. doi: 10.1038/ng1789. [DOI] [PubMed] [Google Scholar]
  12. Cascio P, Hilton C, Kisselev AF, Rock KL, Goldberg AL. 26S proteasomes and immunoproteasomes produce mainly N-extended versions of an antigenic peptide. EMBO J. 2001;20:2357–2366. doi: 10.1093/emboj/20.10.2357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Ciechanover A. Intracellular protein degradation: from a vague idea thru the lysosome and the ubiquitin-proteasome system and onto human diseases and drug targeting. Biochim Biophys Acta. 2012;1824:3–13. doi: 10.1016/j.bbapap.2011.03.007. [DOI] [PubMed] [Google Scholar]
  14. da Fonseca PC, He J, Morris EP. Molecular model of the human 26S proteasome. Mol Cell. 2012;46:54–66. doi: 10.1016/j.molcel.2012.03.026. [DOI] [PubMed] [Google Scholar]
  15. Di Talia S, Skotheim JM, Bean JM, Siggia ED, Cross FR. The effects of molecular noise and size control on variability in the budding yeast cell cycle. Nature. 2007;448:947–951. doi: 10.1038/nature06072. [DOI] [PubMed] [Google Scholar]
  16. Dosztányi Z, Csizmók V, Tompa P, Simon I. The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. J Mol Biol. 2005;347:827–839. doi: 10.1016/j.jmb.2005.01.071. [DOI] [PubMed] [Google Scholar]
  17. Drury LS, Perkins G, Diffley JF. The Cdc4/34/53 pathway targets Cdc6p for proteolysis in budding yeast. EMBO J. 1997;16:5966–5976. doi: 10.1093/emboj/16.19.5966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Finley D. Recognition and processing of ubiquitin-protein conjugates by the proteasome. Annu Rev Biochem. 2009;78:477–513. doi: 10.1146/annurev.biochem.78.081507.101607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fishbain S, Prakash S, Herrig A, Elsasser S, Matouschek A. Rad23 escapes degradation because it lacks a proteasome initiation region. Nat Commun. 2011;2:192. doi: 10.1038/ncomms1194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fishbain S, Inobe T, Israeli E, Zokarkar A, Chavali S, Babu MM, Matouschek A. The sequence composition of disordered regions affects protein half-life by controlling the initiation step of proteasomal degradation. Nat Struct Mol Biol. 2014 doi: 10.1038/nsmb.2958. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gödderz D, Schäfer E, Palanimurugan R, Dohmen RJ. The N-terminal unstructured domain of yeast ODC functions as a transplantable and replaceable ubiquitin-independent degron. J Mol Biol. 2011;407:354–367. doi: 10.1016/j.jmb.2011.01.051. [DOI] [PubMed] [Google Scholar]
  22. Goldberg AL. Protein degradation and protection against misfolded or damaged proteins. Nature. 2003;426:895–899. doi: 10.1038/nature02263. [DOI] [PubMed] [Google Scholar]
  23. Groettrup M, Kirk CJ, Basler M. Proteasomes in immune cells: more than peptide producers? Nat Rev Immunol. 2010;10:73–78. doi: 10.1038/nri2687. [DOI] [PubMed] [Google Scholar]
  24. Gsponer J, Futschik ME, Teichmann SA, Babu MM. Tight regulation of unstructured proteins: from transcript synthesis to protein degradation. Science. 2008;322:1365–1368. doi: 10.1126/science.1163581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Guttmann-Raviv N, Martin S, Kassir Y. Ime2, a meiosis-specific kinase in yeast, is required for destabilization of its transcriptional activator, Ime1. Mol Cell Biol. 2002;22:2047–2056. doi: 10.1128/MCB.22.7.2047-2056.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hagai T, Azia A, Tóth-Petróczy Á, Levy Y. Intrinsic disorder in ubiquitination substrates. J Mol Biol. 2011;412:319–324. doi: 10.1016/j.jmb.2011.07.024. [DOI] [PubMed] [Google Scholar]
  27. Henderson A, Erales J, Hoyt MA, Coffino P. Dependence of proteasome processing rate on substrate unfolding. J Biol Chem. 2011;286:17495–17502. doi: 10.1074/jbc.M110.212027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hershko A, Ciechanover A. The ubiquitin system. Annu Rev Biochem. 1998;67:425–479. doi: 10.1146/annurev.biochem.67.1.425. [DOI] [PubMed] [Google Scholar]
  29. Hirata H, Bessho Y, Kokubu H, Masamizu Y, Yamada S, Lewis J, Kageyama R. Instability of Hes7 protein is crucial for the somite segmentation clock. Nat Genet. 2004;36:750–754. doi: 10.1038/ng1372. [DOI] [PubMed] [Google Scholar]
  30. Hutchins AP, Liu S, Diez D, Miranda-Saavedra D. The repertoires of ubiquitinating and deubiquitinating enzymes in eukaryotic genomes. Mol Biol Evol. 2013;30:1172–1187. doi: 10.1093/molbev/mst022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Inobe T, Fishbain S, Prakash S, Matouschek A. Defining the geometry of the two-component proteasome degron. Nat Chem Biol. 2011;7:161–167. doi: 10.1038/nchembio.521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Irimia M, Denuc A, Ferran JL, Pernaute B, Puelles L, Roy SW, Garcia-Fernàndez J, Marfany G. Evolutionarily conserved A-to-I editing increases protein stability of the alternative splicing factor Nova1. RNA Biol. 2012;9:12–21. doi: 10.4161/rna.9.1.18387. [DOI] [PubMed] [Google Scholar]
  33. Komander D, Rape M. The ubiquitin code. Annu Rev Biochem. 2012;81:203–229. doi: 10.1146/annurev-biochem-060310-170328. [DOI] [PubMed] [Google Scholar]
  34. Komander D, Clague MJ, Urbé S. Breaking the chains: structure and function of the deubiquitinases. Nat Rev Mol Cell Biol. 2009;10:550–563. doi: 10.1038/nrm2731. [DOI] [PubMed] [Google Scholar]
  35. Kristensen AR, Gsponer J, Foster LJ. Protein synthesis rate is the predominant regulator of protein expression during differentiation. Mol Syst Biol. 2013;9:689. doi: 10.1038/msb.2013.47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Lakin ND, Jackson SP. Regulation of p53 in response to DNA damage. Oncogene. 1999;18:7644–7655. doi: 10.1038/sj.onc.1203015. [DOI] [PubMed] [Google Scholar]
  37. Lander GC, Estrin E, Matyskiela ME, Bashore C, Nogales E, Martin A. Complete subunit architecture of the proteasome regulatory particle. Nature. 2012;482:186–191. doi: 10.1038/nature10774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lange PF, Overall CM. Protein TAILS: when termini tell tales of proteolysis and function. Curr Opin Chem Biol. 2013;17:73–82. doi: 10.1016/j.cbpa.2012.11.025. [DOI] [PubMed] [Google Scholar]
  39. Lasker K, Förster F, Bohn S, Walzthoeni T, Villa E, Unverdorben P, Beck F, Aebersold R, Sali A, Baumeister W. Molecular architecture of the 26S proteasome holocomplex determined by an integrative approach. Proc Natl Acad Sci USA. 2012;109:1380–1387. doi: 10.1073/pnas.1120559109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Legewie S, Herzel H, Westerhoff HV, Blüthgen N. Recurrent design patterns in the feedback regulation of the mammalian signalling network. Mol Syst Biol. 2008;4:190. doi: 10.1038/msb.2008.29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Levinson G, Gutman GA. Slipped-strand mispairing: a major mechanism for DNA sequence evolution. Mol Biol Evol. 1987;4:203–221. doi: 10.1093/oxfordjournals.molbev.a040442. [DOI] [PubMed] [Google Scholar]
  42. Light S, Sagit R, Ekman D, Elofsson A. Long indels are disordered: a study of disorder and indels in homologous eukaryotic proteins. Biochim Biophys Acta. 2013;1834:890–897. doi: 10.1016/j.bbapap.2013.01.002. [DOI] [PubMed] [Google Scholar]
  43. Liu CW, Corboy MJ, DeMartino GN, Thomas PJ. Endoproteolytic activity of the proteasome. Science. 2003;299:408–411. doi: 10.1126/science.1079293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Mao AH, Lyle N, Pappu RV. Describing sequence-ensemble relationships for intrinsically disordered proteins. Biochem J. 2013;449:307–318. doi: 10.1042/BJ20121346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Obradovic Z, Peng K, Vucetic S, Radivojac P, Dunker AK. Exploiting heterogeneous sequence properties improves prediction of protein disorder. Proteins. 2005;61(Suppl 7):176–182. doi: 10.1002/prot.20735. [DOI] [PubMed] [Google Scholar]
  46. Pagano M, Tam SW, Theodoras AM, Beer-Romero P, Del Sal G, Chau V, Yew PR, Draetta GF, Rolfe M. Role of the ubiquitin-proteasome pathway in regulating abundance of the cyclin-dependent kinase inhibitor p27. Science. 1995;269:682–685. doi: 10.1126/science.7624798. [DOI] [PubMed] [Google Scholar]
  47. Peña MM, Melo SP, Xing YY, White K, Barbour KW, Berger FG. The intrinsically disordered N-terminal domain of thymidylate synthase targets the enzyme to the ubiquitin-independent proteasomal degradation pathway. J Biol Chem. 2009;284:31597–31607. doi: 10.1074/jbc.M109.038455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Peth A, Nathan JA, Goldberg AL. The ATP costs and time required to degrade ubiquitinated proteins by the 26 S proteasome. J Biol Chem. 2013;288:29215–29222. doi: 10.1074/jbc.M113.482570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Piwko W, Jentsch S. Proteasome-mediated protein processing by bidirectional degradation initiated from an internal site. Nat Struct Mol Biol. 2006;13:691–697. doi: 10.1038/nsmb1122. [DOI] [PubMed] [Google Scholar]
  50. Prakash S, Tian L, Ratliff KS, Lehotzky RE, Matouschek A. An unstructured initiation site is required for efficient proteasome-mediated degradation. Nat Struct Mol Biol. 2004;11:830–837. doi: 10.1038/nsmb814. [DOI] [PubMed] [Google Scholar]
  51. Purvis JE, Lahav G. Encoding and decoding cellular information through signaling dynamics. Cell. 2013;152:945–956. doi: 10.1016/j.cell.2013.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Ramakrishna S, Suresh B, Lim KH, Cha BH, Lee SH, Kim KS, Baek KH. PEST motif sequence regulating human NANOG for proteasomal degradation. Stem Cells Dev. 2011;20:1511–1519. doi: 10.1089/scd.2010.0410. [DOI] [PubMed] [Google Scholar]
  53. Ravid T, Hochstrasser M. Diversity of degradation signals in the ubiquitin-proteasome system. Nat Rev Mol Cell Biol. 2008;9:679–690. doi: 10.1038/nrm2468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Rutkowski DT, Arnold SM, Miller CN, Wu J, Li J, Gunnison KM, Mori K, Sadighi Akha AA, Raden D, Kaufman RJ. Adaptation to ER stress is mediated by differential stabilities of pro-survival and pro-apoptotic mRNAs and proteins. PLoS Biol. 2006;4:e374. doi: 10.1371/journal.pbio.0040374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Schwanhäusser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, Chen W, Selbach M. Global quantification of mammalian gene expression control. Nature. 2011;473:337–342. doi: 10.1038/nature10098. [DOI] [PubMed] [Google Scholar]
  56. Sharipo A, Imreh M, Leonchiks A, Imreh S, Masucci MG. A minimal glycine-alanine repeat prevents the interaction of ubiquitinated I kappaB alpha with the proteasome: a new mechanism for selective inhibition of proteolysis. Nat Med. 1998;4:939–944. doi: 10.1038/nm0898-939. [DOI] [PubMed] [Google Scholar]
  57. Simister PC, Schaper F, O'Reilly N, McGowan S, Feller SM. Self-organization and regulation of intrinsically disordered proteins with folded N-termini. PLoS Biol. 2011;9:e1000591. doi: 10.1371/journal.pbio.1000591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Stadtmueller BM, Hill CP. Proteasome activators. Mol Cell. 2011;41:8–19. doi: 10.1016/j.molcel.2010.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Takeuchi J, Chen H, Coffino P. Proteasome substrate degradation requires association plus extended peptide. EMBO J. 2007;26:123–131. doi: 10.1038/sj.emboj.7601476. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Tian L, Holmgren RA, Matouschek A. A conserved processing mechanism regulates the activity of transcription factors Cubitus interruptus and NF-kappaB. Nat Struct Mol Biol. 2005;12:1045–1053. doi: 10.1038/nsmb1018. [DOI] [PubMed] [Google Scholar]
  61. Tompa P. Intrinsically unstructured proteins evolve by repeat expansion. BioEssays. 2003;25:847–855. doi: 10.1002/bies.10324. [DOI] [PubMed] [Google Scholar]
  62. Tompa P, Prilusky J, Silman I, Sussman JL. Structural disorder serves as a weak signal for intracellular protein degradation. Proteins. 2008;71:903–909. doi: 10.1002/prot.21773. [DOI] [PubMed] [Google Scholar]
  63. Tsvetkov P, Asher G, Paz A, Reuven N, Sussman JL, Silman I, Shaul Y. Operational definition of intrinsically unstructured protein sequences based on susceptibility to the 20S proteasome. Proteins. 2008;70:1357–1366. doi: 10.1002/prot.21614. [DOI] [PubMed] [Google Scholar]
  64. Tsvetkov P, Reuven N, Shaul Y. The nanny model for IDPs. Nat Chem Biol. 2009;5:778–781. doi: 10.1038/nchembio.233. [DOI] [PubMed] [Google Scholar]
  65. van der Lee R, Buljan M, Lang B, Weatheritt RJ, Daughdrill GW, Dunker AK, Fuxreiter M, Gough J, Gsponer J, Jones DT, et al. Classification of intrinsically disordered regions and proteins. Chem Rev. 2014;114:6589–6631. doi: 10.1021/cr400525m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. van Ooijen G, Dixon LE, Troein C, Millar AJ. Proteasome function is required for biological timing throughout the twenty-four hour cycle. Curr Biol. 2011;21:869–875. doi: 10.1016/j.cub.2011.03.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Varshavsky A. The ubiquitin system, an immense realm. Annu Rev Biochem. 2012;81:167–176. doi: 10.1146/annurev-biochem-051910-094049. [DOI] [PubMed] [Google Scholar]
  68. Verhoef LG, Heinen C, Selivanova A, Halff EF, Salomons FA, Dantuma NP. Minimal length requirement for proteasomal degradation of ubiquitin-dependent substrates. FASEB J. 2009;23:123–133. doi: 10.1096/fj.08-115055. [DOI] [PubMed] [Google Scholar]
  69. Ward JJ, Sodhi JS, McGuffin LJ, Buxton BF, Jones DT. Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J Mol Biol. 2004;337:635–645. doi: 10.1016/j.jmb.2004.02.002. [DOI] [PubMed] [Google Scholar]
  70. Wolfe KH, Shields DC. Molecular evidence for an ancient duplication of the entire yeast genome. Nature. 1997;387:708–713. doi: 10.1038/42711. [DOI] [PubMed] [Google Scholar]
  71. Yang C, Matro JC, Huntoon KM, Ye DY, Huynh TT, Fliedner SM, Breza J, Zhuang Z, Pacak K. Missense mutations in the human SDHB gene increase protein degradation without altering intrinsic enzymatic function. FASEB J. 2012;26:4506–4516. doi: 10.1096/fj.12-210146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Yen HC, Xu Q, Chou DM, Zhao Z, Elledge SJ. Global protein stability profiling in mammalian cells. Science. 2008;322:918–923. doi: 10.1126/science.1160489. [DOI] [PubMed] [Google Scholar]
  73. Zhang M, Coffino P. Repeat sequence of Epstein-Barr virus-encoded nuclear antigen 1 protein interrupts proteasome substrate processing. J Biol Chem. 2004;279:8635–8641. doi: 10.1074/jbc.M310449200. [DOI] [PubMed] [Google Scholar]
  74. Zhao M, Zhang NY, Zurawel A, Hansen KC, Liu CW. Degradation of some polyubiquitinated proteins requires an intrinsic proteasomal binding element in the substrates. J Biol Chem. 2010;285:4771–4780. doi: 10.1074/jbc.M109.060095. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary information

RESOURCES