Abstract
In eukaryotes, disordered regions cover up to 50% of proteomes and mediate fundamental cellular processes. In contrast to globular domains, where about half of the amino acids are buried in the protein interior, disordered regions show higher solvent accessibility, which makes them prone to engage in non-functional interactions. Such interactions are exacerbated by the law of mass action, prompting the question of how they are minimized in abundant proteins. We find that interaction propensity or “stickiness” of disordered regions negatively correlates with their cellular abundance, both in yeast and human. Strikingly, considering yeast proteins where a large fraction of the sequence is disordered, the correlation between stickiness and abundance reaches R = − 0.55. Beyond this global amino-acid composition bias, we identify three rules by which amino-acid composition of disordered regions adjusts with high abundance. First, lysines are preferred over arginines, consistent with the latter amino acid being stickier than the former. Second, compensatory effects exist, whereby a sticky region can be tolerated if it is compensated by a distal non-sticky region. Third, such compensation requires a lower average stickiness at the same abundance when compared to a scenario where stickiness is homogeneous throughout the sequence. We validate these rules experimentally, employing them as different strategies to rescue an otherwise sticky protein fragment from aggregation. Our results highlight that non-functional interactions represent a significant constraint in cellular systems and reveal simple rules by which protein sequences adapt to that constraint. Data from this work are deposited in Figshare, at https://doi.org/10.6084/m9.figshare.8068937.v3.
Keywords: intrinsic disorder, disordered regions, non-functional interactions, protein abundance, aggregation
Abbreviation: IDR, intrinsically disordered region; YFP, yellow fluorescent protein; DHFR, dihydrofolate reductase
Highlights
-
•
Proteins span a wide range of abundance and can engage in non-functional interactions (NFI).
-
•
Minimizing NFI is a dominant constraint on the composition of disordered regions.
-
•
This composition bias may cause under-detection of disorder in low abundance proteins.
-
•
This composition bias does not need to be global across the entire protein sequence.
-
•
Constraints of NFI can serve to reverse-engineer principles of protein solubility.
Graphical abstract
Introduction
Intrinsically disordered regions (IDRs) are found in proteins across all three domains of life and predominantly among eukaryotes where they cover 35% to 45% of proteomes [1]. IDRs are generally depleted of hydrophobic amino acids, which make them unable to fold autonomously [2], [3]. Consequently, they are characterized by a lack of stable tertiary structure under physiological conditions [4]. The high-conformational variability associated with IDRs grants them unique functional and mechanical properties [4], [5], [6], [7], [8], [9], [10], [11]. For example, nucleoporins are size-filtering devices of the nuclear pore complex containing disordered regions rich in phenylalanine–glycine repeats that influence the nuclear pore complex gating behavior [12]. In addition, disordered regions are notorious for their role in mediating and scaffolding protein-protein interactions [13], [14], [15], [16], [17]. One recognition mechanism involves domain–peptide interactions, where a globular domain such as WW, PDZ, or SH3 recognizes a linear motif in a disordered region [18], [19], [20], [21], [22], [23], [24], [25]. Another class of recognition mechanism has been described as “induced fit” [26], [27], where folding occurs concurrently with binding, promoting highly specific and transient interactions [15], [22], [28], [29], [30] often found in cell signaling and regulation processes [19], [23], [31], [32], [33]. Disordered regions thus play a pivotal role in the evolution [34], [35], [36], [37], [38] and the wiring of protein-protein interactions networks [34], [35], [36], [37], [39], [40], [41], [42].
Importantly, however, properties of disordered regions that promote functional recognition are also expected to promote dysfunctional recognition [43], [44], [45]. Indeed, disordered regions naturally exhibit high solvent exposure, and we know that exposed protein surfaces show a natural tendency to bind one another promiscuously [46], [47], suggesting that a delicate balance exists in cells, between functional (selected) and non-functional (non-selected) binding [44], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60]. Such a delicate balance is reflected in the dosage sensitivity associated with over-expression of proteins that contain disordered regions or that are highly promiscuous [43], [44], [61], [62]. Consistent with this view, proteins containing disordered regions are under tight regulation with shorter half-lives, slower translation, and faster degradation, so as to minimize their presence when they are not needed [48], [57]. In addition, such potential for promiscuous binding is exploited by viruses to hijack cell regulation [63], [64], [65]. Finally, beyond promiscuous and dysfunctional interactions, disordered regions should exhibit specific properties so as not to be recognized as misfolded proteins by cellular chaperones, thereby threatening cellular homeostasis [66], [67], [68], [69], [70], [71].
As for any chemical equilibrium, dysfunctional interactions are expected to be concentration dependent [72], [73]. Therefore, we anticipate that sequence properties protecting against dysfunctional interactions [45], [57], [72], [74], [75], [76], [77] are enriched in IDRs of abundant proteins. To investigate this hypothesis, we considered the proteomes from Saccharomyces cerevisiae and Homo sapiens and compared properties of disordered regions present in low versus high-abundance proteins. As observed before, we saw that disordered regions are twice as rare in the latter [44], [48]. Importantly, however, because abundance spans several orders of magnitude, the small fraction of disordered regions in abundant proteins represents a large mass of disordered residues in the cell, suggesting that these regions must be endowed with properties that render them compatible with high abundance. We show that IDRs’ interaction propensity (stickiness) covaries with abundance more than other physicochemical properties do, including hydrophobicity and beta-amyloid formation propensity. Thus, dysfunctional interactions constrain the overall chemical composition of disordered regions. Analyzing the distribution of stickiness among disordered regions revealed that compensatory effects exist, whereby a non-sticky region in one part of a protein can buffer a sticky region hundreds of amino acids away. We verified this prediction experimentally and showed that an aggregation-prone misfolded protein can be rescued by the addition of charged residues delocalized relative to the sticky, aggregation-prone region.
Results and Discussion
Disorder content and protein abundance are inversely related
As discussed above, disordered regions are prone to promiscuous binding [43], [78], [79]. The fact that binding is a concentration-dependent process prompts us to compare the distribution of disordered regions among proteins of high versus low abundance. It has been shown before that cellular systems have evolved to regulate the availability of IDRs, whereby proteins enriched in IDRs are less abundant on average [48]. Here, we first confirm this finding. We examined protein disorder content (i.e., the percentage of disordered amino acids per protein) as a function of protein abundance for yeast and human. We defined 10 classes of abundance, henceforth referred to as “bins of abundance” (Figs. 1a and S1a, Table 1). Each bin contains approximately the same number of proteins. Abundance thresholds associated with these bins (in parts per million, or ppm) are kept identical throughout the work, but the number of proteins contained in each bin may differ in subsequent analyses when we concentrate on proteome subsets (e.g., proteins with at least 30 disordered residues). For all non-membrane proteins, we show the distribution of protein disorder content (Figs. 1b and S1b) for each bin of abundance.
Table 1.
Bins | Deciles (%) |
S. cerevisiae (yeast) |
H. sapiens (human) |
||
---|---|---|---|---|---|
A (ppm) | C (nM) | A (ppm) | C (nM) | ||
1 | 0–10 | 0.003–1.4 | 0.0–6.5 | 1e−5–0.1 | 0.0–0.5 |
2 | 10–20 | 1.4–3.0 | 6.5–13 | 0.1–0.3 | 0.5–1.5 |
3 | 20–30 | 3.0–7.8 | 13.0–33.9 | 0.3–0.5 | 1.5–2.5 |
4 | 30–40 | 7.8–14.1 | 33.9–65.3 | 0.5–1.0 | 2.5–5.0 |
5 | 40–50 | 14.1–23.4 | 65.3–111.1 | 1.0–2.1 | 5.0–10.5 |
6 | 50–60 | 23.4–37.9 | 111.1–180.9 | 2.1–4.5 | 10.5–22.4 |
7 | 60–70 | 37.9–64.9 | 180.9–311.5 | 4.5–10.1 | 22.4–50.8 |
8 | 70–80 | 64.9–119 | 311.5–583.1 | 10.1–24.8 | 50.8–123.6 |
9 | 80–90 | 119–351 | 583.1–1700 | 24.8–75 | 123.6–398.7 |
10 | 90–100 | 351–21,870 | 1700–109,000 | 75–30,590 | 398.7–152,400 |
Considering the first four bins, we notice that disorder content increases with abundance both in yeast (Fig. 1b) and human (Fig. S1b), although the potential for promiscuous interactions is theoretically increasing. We suggest two possible origins for this trend: (i) Below the abundance range of the fourth bin, disordered regions are not selected against due to very low concentrations equivalent to 5–30 nM (1–7 ppm, Table 1). Such concentrations are indeed below typical affinity constants of domain-peptide interactions (Kd ∼ 0.1–150 μM) [80]. In line with this idea, linear motifs in disordered regions are also depleted in this range of abundance in yeast (Fig. S2). (ii) Disorder prediction methods may under-predict disordered regions [81] among proteins with low abundance due to their enrichment in sticky amino acids that are predominantly hydrophobic, as we will see in the next section. In support of this notion, Pfam [82] and SUPERFAMILY [83] domains become rarer as protein abundance decreases (Fig. 1c), suggesting that disorder should increase rather than decrease.
From the fifth to tenth bins of abundance, however, we observe a continuous decrease of protein disorder content, both in yeast and in human. Protein sequences in the fifth bin contain ∼ 20% disorder on average (∼ 40% in human, Fig. S1), whereas proteins with the highest abundance (tenth bin) contain only ∼ 10% (∼ 25% in human). This difference is highly significant in both species (yeast: p = 3.8 × 10−07, human: p = 1.8 × 10−21, Wilcoxon one-sided test). The depletion of disordered regions among high-abundance proteins confirms the findings of Gsponer et al. [48], where IDR-poor proteins exhibit significantly higher abundances than IDR-rich proteins. These ideas also match results from Vavouri et al. [44], who observed that disordered regions and linear motifs were the main determinants of protein toxicity upon over-expression in yeast. Accordingly, we find that disordered regions become depleted of linear motifs in high-abundance proteins (Fig. S2). Overall, these results are consistent with binding promiscuity of disordered regions being a constraint, causing high-abundance proteins to adapt by decreasing their disordered content. Importantly, however, while the depletion is significant, it is not complete. In other words, since protein abundances span several orders of magnitude inside a cell, the ∼ 10% of disordered regions present in high-abundance proteins contribute as much as 60%–75% of all disordered residues in the cell (Fig. 1d). This prompted us to investigate whether these regions bear specific properties that minimize their potential for promiscuous interactions and aggregation.
The stickiness of disordered regions is dependent on protein abundance
We saw in the previous section that disordered regions are expected to exhibit properties minimizing their susceptibility for promiscuous interactions. To evaluate the tendency of disordered regions to engage in promiscuous interactions, we measured their interaction propensity as the average of their amino-acid interaction propensities (Figs. 2a and S3a). Amino-acid interaction propensities were taken from Levy et al. [78] and correspond to the log-ratio of their frequency at protein-protein interfaces relative to protein surfaces (Fig. S4a). Thus, in this “stickiness” scale, the higher the score of an amino acid, the more frequent it is at protein–protein interaction interfaces relative to protein surfaces. The stickiness scale shares similarities with amino-acid solubility, hydrophobicity, and aggregation scales as shown in Fig. S4b. Although the stickiness scale was defined on structured domains, we expect it to reflect the binding propensity of disordered regions as well, because similar frequencies of amino acids are seen at the interface cores of both types of complexes [87].
Importantly, disordered regions often cover only a small fraction of the protein, and solvent-exposed amino acids outside of these regions may influence the global protein stickiness. For this reason, we used three data sets of proteins with “standard,” “medium,” and “high” disorder content (Figs. 2a and S3a, see Methods) and we anticipate that data sets with higher disorder content should better reflect compositional constraints. We show the distributions of protein length, disorder content, and disorder fraction in these three data sets, for yeast and human (Fig. S5).
We compared the average stickiness (S) of IDRs to their abundance (A) in the cell and observed a negative correlation, consistent with a recent report in yeast [51] and with a similar compositional bias seen among structured domains [78]. The S/A correlation appears stronger in yeast (R = − 0.36, Fig. 2b) than in human (R = − 0.21, Fig. S3b), and is significant in both species (yeast: p = 5.2 × 10−68 human: p = 1.7 × 10−75, Spearman’s rank correlation test). The lower correlation seen in human may reflect the ambiguous nature of protein abundance data in multicellular organisms, where it is averaged over tissues. The S/A correlation increased substantially when considering the “median” and “high” disorder-content data sets, to R = − 0.41 and R = − 0.55 in yeast (Fig. 2b), and R = − 0.25 and R = − 0.33 in human (Fig. S3b). Furthermore, if we consider an alternative data set of regions not matched by a known domain (we call this set of regions in protein sequences “anti-domains”), the S/A correlation increases further, reaching − 0.58 for 1453 yeast proteins (Fig. S6). Finally, we saw a similar correlation (R = − 0.37, Fig. S7a) when using disorder predictions from the D2P2 database [88], which provides a consensus of several methods (PONDR, PrDOS, PV2, IUPred/ANCHOR, Espritz) [2], [84], [89], [90], [91], [92]. We also saw similar correlations when separating proteins in three groups according to their size (Fig. S7).
For all three disorder-content data sets, the decrease in IDR-stickiness with increasing protein abundance is highlighted with regression lines (Figs. 2b and S3b). The regression shows that IDR stickiness for low-abundance proteins (< 0.1 ppm) is above 0.0, and it decreases down to − 0.2 for proteins with the highest abundance. For reference, we compare this stickiness difference to that seen between structured domains (median, 0.12) and disordered regions (median, − 0.07; Fig. 2c), revealing a similar amplitude and stressing that the compositional bias we report is substantial.
Together, these results indicate that disordered regions present in high-abundance proteins exhibit biased compositions, whereby the fraction of their sticky amino acids is minimized. This observation supports the paradigm of a concentration-dependent optimization of amino acid composition in disordered regions so as to minimize dysfunctional interactions.
Hydrophobicity and β-amyloid propensity exhibit weaker concentration dependence
We explored alternative amino-acid interaction rules to explain the concentration-dependent adaptation in amino-acid composition. We tested whether hydrophobicity was at the origin of the correlations observed. Three distinct hydrophobicity scales (Wimley and White [93], Kyte and Doolittle [94], and Roseman [95]) as well as a sequence-based predictor of protein solubility (CamSol [96]) gave significant S/A correlations, albeit of lower magnitude than that observed with the stickiness scale (Figs. 3 and S8). We reached similar conclusions with β-amyloid and aggregation propensity predictors, based on FoldAmyloid [97], Aggrescan [67], Pawar et al. [98], and PASTA 2.0 [99].
A notable feature of the stickiness scale compared to other scales is the fact that it discriminates lysine (K) and arginine (R) as amino acids with a large difference in stickiness (Fig. S9). While lysine is the most under-represented amino acid at protein–protein interfaces (and thus the least sticky), arginine is found with nearly equal frequency at interfaces and surfaces [47], making it a significantly stickier amino acid than lysine. In line with this notion, Warwicker et al. [100] observed that high lysine and low arginine content correlate with protein solubility. This led us to define a minimal stickiness measure based solely on these two amino acids, the RK-ratio (R/(R + K)), which is the number of arginines over the number of both lysines and arginines in disordered regions of a particular protein. The RK-ratio yielded S/A correlation coefficients nearly as high or higher than those obtained using other scales both in yeast (R = − 0.24) and human (R = − 0.12), for the standard data set. For the high disorder-content data set, these values reached R = − 0.36 in yeast (Fig. 3) and R = − 0.2 in human (Fig. S8). The fact that arginine and lysine are considered largely equivalent by other scales (Fig. S9) and that arginine is the most hydrophilic amino acid according to hydrophobicity scales highlights the distinct nature of the stickiness scale and further suggests that promiscuous interactions are selected against among abundant proteins.
Local versus global optimization of stickiness
We observed that chemical properties of disordered regions change as a function of their concentration, with interaction-resilient amino acids being enriched in high-abundance proteins. This observation prompted us to ask whether stickiness must be optimized throughout the sequence, or whether compensation mechanisms can occur between different regions, whereby a non-sticky region in one part of the protein may compensate for a sticky one in a distal part and increase protein solubility. To address this question, we defined the local stickiness ωk as the average stickiness score in a window of k residues sliding through each position i of a protein sequence (Fig. 4a). For each protein, we recorded the maximum local stickiness value as Ωk = max(ωk) (see Methods for details). Then, we calculated the Spearman rank correlation between protein abundance and Ωk (Fig. 4b). We reasoned that the window size maximizing the correlation should reflect the distance in the polypeptide chain over which a highly sticky region can be compensated by a distal non-sticky region. We observed that larger windows improve the abundance-stickiness correlation up to a size of ∼ 200 residues for the standard data set and ∼ 800 residues for the high disorder-content data set, beyond which the correlation stabilizes (Figs. 4b and S10). It is difficult to ascribe a meaning to the actual size of these windows, notably because IDRs that are far in sequence may be close in space, for example, due to the protein’s structure, or due to conformational sampling [101]. However, the fact that the correlation increases with k exceeding several hundred residues does suggest the existence of compensatory effects, whereby a sticky segment in one part of a protein can be tolerated if a non-sticky segment elsewhere in the sequence can compensate for it.
Heterogeneous distribution of stickiness among disordered segments
The long-range stickiness effects suggested by the above analysis led us to analyze stickiness homogeneity versus heterogeneity along the sequence. We introduced a measure Δ = Ωk − ΩL, the difference between the maximal stickiness in a window of size k and the average stickiness of the protein. A large Δ value indicates a heterogeneous stickiness across the protein, and conversely, a low value would be observed when the distribution of stickiness is homogeneous throughout the sequence. For this analysis, we arbitrarily used in both species a value k equal to 41 for the window size, but the results are robust against different values of k (Fig. S11).
We asked whether proteins with high versus low values of Δ show different average stickiness (Figs. 5 and S12). Among high-abundance proteins (bins 6–10), proteins with heterogeneous stickiness are less sticky on average when compared to proteins with homogeneous stickiness (yeast: p = 4.4 × 10−06, human: 2.3 × 10−29, Wilcoxon one-sided test). This result could be influenced by the window size (k = 41) as well as by protein length, since long proteins are expected to show larger heterogeneity by chance alone, when compared to shorter proteins. Thus, we controlled for these two parameters (Figs. S11 and S13) and observed a consistently lower stickiness in the heterogeneous class when compared to the homogeneous class. For proteins with lower abundance, the average stickiness of the heterogeneous class did not show a consistent difference relative to the homogeneous class.
These results suggest that delocalized non-sticky regions are not as effective to promote solubility when compared to regions with homogeneous stickiness, among high-abundance proteins. Why different proteins employ different strategies to remain soluble remains to be investigated. We can speculate that functional constraints might require heterogeneous distribution of sticky residues among disordered regions. For example, the highly conserved protein PEX5 contains a long N-terminal IDR harboring several “WxxxF/Y” motifs that mediate recognition and import of proteins from the cytoplasm to the peroxisome [102]. Furthermore, random evolutionary processes are likely to result in heterogeneous stickiness patterns. Indeed, considering a sticky and solvent-exposed region, there would be many more ways of compensating it with non-sticky residues elsewhere in the sequence than by mutating the region itself.
Experimentally assessing how distal non-sticky regions impact protein solubility
The results of the two previous sections led to three main conclusions. First, stickiness is a global sequence property that can involve compensatory effects in protein sequences. Second, the chemically analogous amino acids R and K exhibit markedly different stickiness properties. Third, homogeneous stickiness throughout the sequence is more effective at promoting solubility than heterogeneous stickiness.
To assess these concepts experimentally, we expressed chimeric fluorescent proteins in yeast cells and measured their solubility under the assumption that higher solubility promotes higher abundance in the cytosol. Maximal cytosolic abundance was observed for the wild-type Venus yellow fluorescent protein (YFP) [103], while minimal cytosolic abundance was observed for YFP fused to a sticky polypeptide derived from the mouse dihydrofolate reductase (DHFR) [104], thereafter referred to as YFP-DH (Fig. 6).
We subsequently aimed to increase the solubility of YFP-DH using different non-sticky fusion tags. First, we employed non-sticky tags composed of different combinations of positively and/or negatively charged amino acids fused at the C-terminus of YFP-DH. Overall, all non-sticky tags containing six charged amino acids increased the cytosolic abundance of YFP-DH, confirming that a non-sticky region can, at least partially, rescue from insolubility caused by a distal sticky region. Consistent with previous findings [105], [106], [107], [108], negative charges promoted solubility better than positive charges, with tags based on six aspartate or glutamate yielding cytosolic abundances up to 1.5 times higher than a tag containing six lysines. The increased solubility seen with negatively charged amino acids is at odds with the stickiness scale, where lysine is the least sticky amino acid. This apparent discrepancy may originate in the avoidance of arginine at protein surfaces, leaving lysine as the only non-sticky positively charged amino acid [47]. In line with this idea, the six-arginine tag led to a 2-fold reduction in cytosolic abundance when compared to the six-lysine tag. Tags that were electrostatically neutral gave intermediate cytosolic abundance. Next, we tested the impact of stickiness heterogeneity on solubility. We created two additional non-sticky tags containing 12 and 18 charged residues. These tags decreased the average stickiness and introduced a high heterogeneity. In parallel, we designed two mutant sequences matching the average stickiness obtained with 12 or 18 charged residues, although the mutations were introduced throughout the DH sequence to create a more homogeneous stickiness profile. The addition of 6 charged residues (from 6 to 12) increased the abundance of YFP-DH, but the further addition of charges (from 12 to 18) had little effect, suggesting that delocalized non-sticky regions cannot fully compensate for a distal sticky region. The strategy consisting of mutating sticky regions directly (e.g., mutations introduced in the DH sequence) showed comparatively higher abundance, consistent with homogeneous stickiness leading to higher solubility.
In our interpretation of these experiments, we hypothesize that fusion of the DH sequence to YFP induces its aggregation and degradation. Alternatively, the fusion of the DH sequence may slow down the translation of YFP-DH relative to YFP, reduce its mRNA stability, or decrease the average YFP fluorescence due to interference with the folding of YFP for example. However, it is unlikely that the diverse sequences of the non-sticky tags could consistently rescue from such effects, as their only common property is to be non-sticky. For this reason, the increase of YFP-DH solubility by the non-sticky tags is the most parsimonious explanation explaining their respective increase in fluorescence.
Conclusion
It has been observed previously that proteins containing disordered regions are constrained in terms of their regulation [48], [57]. In addition to regulatory mechanisms, our results show that disordered regions tune their amino-acid composition to their cellular abundance. We observed this trend globally, across the yeast and human proteomes, thus generalizing previous observations made on fewer proteins of known structures [78] and in a recent study [51]. Uniquely, our approach allowed comparing how amino-acid propensities associated with specific properties (hydrophobicity, amyloid formation, aggregation, and protein interactions) covaried with abundance. Interaction propensity or “stickiness” showed the strongest covariation, implying that avoidance of promiscuous interactions is a key property shaping protein sequences. Non-functional interactions can indeed be deleterious due to their potential to sequester proteins into aggregates of misfolded proteins [43], [62], [109], [110], [111] or agglomerates of folded proteins [58], and they may also compete with functional interactions [55], [57], [64], [65], [79], [112]. The local optimization of stickiness in the sequence is expected to minimize deleterious effects associated with binary promiscuous interactions. In contrast, the fact that stickiness can also be optimized globally, across different segments separated by 200 or more amino acids, suggests an adaptation mechanism against promiscuous self-interactions leading to aggregation (Fig. 7). We experimentally validated this concept by rescuing an aggregation-prone protein-fragment after adding non-sticky residues delocalized relative to the sticky region.
In addition to the insights into sequence features preventing non-functional interactions, our work has several implications. Practically, our results imply that methods may under- and over-predict disorder in low and high-abundance proteins respectively. For example, IUPred predicts disordered regions as those lacking hydrophobic amino acids to stabilize a folded protein core. Such predictions might be improved further if taking into consideration protein abundance to model the expected hydrophobic and hydrophilic character of globular domains and disordered regions. Our findings may also have important evolutionary implications since protein abundance is known to be the main determinant of sequence conservation across species [113], [114], [115], [116], [117], [118], [119]. It will therefore be important to consider stickiness in comparative sequence analyses, for example, to examine patterns of hydrophobic amino acids [120], and more generally, to understand routes and mechanisms of adaptation to high cellular abundance.
Methods
Filtering proteins
Some proteins have a biased amino-acid composition due to their cellular function. In particular, proteins with transmembrane helices are enriched in hydrophobic residues but these are unlikely to become solvent-exposed. Hence, we discarded all membrane proteins from this study. In order to identify membrane proteins, we used predictions from TMHMM [121] and meta-predictions from TOPCONS [122], [123], [124], [125], [126], [127], [128]. A total of 1913 (29%) yeast and 7297 (36%) human proteins were discarded for which at least one transmembrane segment was predicted according to the consensus prediction from TOPCONS. In the absence of consensus, we relied on the agreement between predictions from TMHMM and three out of five predictors compiled in TOPCONS.
Protein abundance data
Protein abundances were obtained from Pax-Db (v4.0, May 2015) [129], which provides relative abundances for unicellular and multicellular organisms including tissue-specific data. For both organisms, we used overall abundance in the whole organism, inferred from all available data sets (integrated data set). The unit conversion from relative abundance in ppm to cellular concentration in molar was obtained by the following formula: C = (k · A) / NA where k ≈ 3·106 proteins/fL as an estimate of cellular density of protein molecules per femtoliter (fL) obtained from [130], the Avogadro constant NA = 6.02 × 1023 molecules/mol, and A is the abundance (e.g., 1 ppm yields A = 10-6 and results in C ≈ 5 nM). Table 1 provide the estimated range of cellular concentrations for the 10 bins of abundance.
Intrinsic disorder and domain predictions
We predicted disordered regions in both yeast and human proteomes using IUPred [84], combining short and long disorder predictions. In yeast, we selected the 20% amino acids with the highest disorder probabilities. The human proteome is known to exhibit higher disorder content [88] when compared to yeast, prompting us to use 40% of amino acids with the highest disorder probabilities.
To predict domains, we aligned profiles from Pfam-A (v27.0, May 2013) and Superfamily (v1.75, March 2013) to sequences of both proteomes and kept predictions with an E-value score below 10−3.
Defining the three sets of disordered proteins used in the analyses
We defined three data sets containing proteins with increasing disorder content. We refer to these data sets as “standard,” “medium,” and “high.” In yeast, the standard data set consists of 2217 proteins (7619 for human) where IDRs represent at least 10% (20% for human) of the sequence and contain at least 30 residues in total (50 for human). In yeast, the medium data set consists of 1563 proteins (3649 for human) where IDRs represent at least 10% (50% for human) of the sequence and contain at least 80 residues in total (50 in human). Finally, the high disorder data set contains 175 yeast proteins (944 for human), with at least 70% (80% for human) of the sequence being predicted as disordered, and a minimum of 25 disordered residues per sequence (40 residues in human). These criteria are summarized in Fig. 2a (Fig. S3a for human).
Interface propensity (stickiness) scale
Amino-acid interface propensities were obtained from Levy et al. [78]. In this scale, amino-acid interface propensities were calculated by the log-ratio of the frequency at the interface-core versus at the surface, based on a non-redundant set of Escherichia coli proteins of known structure.
This scale (Fig. S4) measures the propensity for an amino acid to interact with other amino acids and thus can be viewed as a “stickiness” score, which serves to estimate the interaction propensity of disordered regions.
Other propensity scales
We employed three distinct amino-acid hydrophobicity scales: (1) Kyte and Doolittle [94] determined vacuum-to-water transfer free energies of amino acid side chain. (2) Roseman [95], [131] estimated the free-energies of completely transferring the amino acids side-chain along with the polypeptide backbone from water to a hydrocarbon phase, correcting for self-solvation effects of polar residues induced by the proximity of flanking non-hydrogen bonded peptide bond. (3) Wimley and White [93] measured the transfer free energy for amino acids from water to the interface of a phosphatidylcholine membrane bilayer.
For CamSol [96], sequence-based predictor of protein solubility, we used opposite values (i.e., scores were multiplied by − 1) to maintain a comparable ranking of amino acids with other scales.
We also considered amino-acid potentials derived from experimental data [132] for three methods that assess β-amyloid formation: Aggrescan [67], FoldAmyloid [97], and pH 7 scale from Pawar et al. [98].
Finally, we applied PASTA 2.0 [99] on protein sequence and kept the highest energy prediction for evaluating the stability of putative cross-beta pairings among disordered regions.
Calculation of global and local average stickiness of disordered regions
The stickiness of a sequence is calculated by the mean of its amino-acid stickiness scores (Fig. S4a). The global stickiness of IDRs in a protein is calculated using residues predicted as disordered, and amino acids outside of IDRs are ignored from the calculation. The local stickiness ωk corresponds to the mean stickiness score calculated within a window of k residues sliding through each position i of a protein sequence. For each protein, we recorded the maximum local stickiness value as Ωk = max(ωk). Disordered regions are scattered along the sequence and can consist of few residues only. Thus, we imposed restrictions to avoid inferring a local stickiness value when the number of disordered residues was too small compared to the size of the window. Specifically, we inferred a local stickiness value only when the number of disordered residues in the window was equal or greater than the lowest value between:
-
-
half the window size h = (k + 1)/2
-
-
the size D of the largest disordered segment within the protein
Scanning windows of increasing size, ranging from one residue to the protein length L enabled us to assess distance-dependent fluctuations of the local maximum stickiness. For example, with a window consisting of a single amino acid (k = 1), Ω1 corresponds to the most sticky amino acid of the protein considered. With a window of size three (k = 3), Ω3 corresponds to the average stickiness of the most sticky stretch of three consecutive amino acids, and so on, and when the window size is the length of a protein L, ΩL is the average stickiness.
Solubility tags and fusion proteins
Fusion proteins were expressed under the GPD promoter in the p413 plasmid [133]. As an unfolded and insoluble polypeptide chain we used the N-terminal fragment of the mouse DHFR enzyme. This protein fragment contains 108 residues out of 188 in the full-length protein and was fused at the C-terminus of the YFP. Various synthetic sequence tags were also fused at the C-terminus of the DHFR fragment. All cloning was carried out using the PIPE cloning method [134]. The different sequence tags consisted of stretches of charged amino acids separated by glycine and serine residues. All sequences can be found in Table S3. The mutations in DH were determined manually to decrease the stickiness of the fragment while matching the stickiness of the medium and long non-sticky tag variants. Synthetic DNA sequences were ordered from IDT and were cloned by PIPE at the C-terminus of YFP, to yield the “mut-DH” variants.
Imaging and fluorescence quantification
Imaging of cells and processing of the data was carried out as described in Ref. [135]. Briefly, cells were inoculated from their glycerol stock in 384-well glass-bottom optical plates (Matrical) with a pintool (FP1 pins, V&P Scientific) operated by a Tecan robot (Tecan Evo200 with MCA384 head). Cells were grown in YPD for a minimum of 10 h before they reached an optical density of at most 1, and were imaged with a confocal spinning disk microscope using a 60 × plan apo oil-immersion objective. Each image set was composed of two brightfield (BF) images (one in focus and one defocused to facilitate cell segmentation, each with 50-ms exposure) as well as one image quantifying fluorescence with 300-ms exposure. Individual cells were segmented from the brightfield images, and fluorescence intensity was estimated from the 50th quantile of pixel intensity within each segmented cell. For each strain, eight technical replicates were imaged, the median fluorescence across cells (typically over 100 per construct) was calculated, and the mean of the eight replicates and associated standard error are reported.
The following are the supplementary data related to this article.
Acknowledgments
We thank Keith Dunker, Hagen Hofmann, Amnon Horovitz, Ben Lehner, Koby Levy, Tzachi Pilpel, Claus Wilke, and Shoshana Wodak for insightful discussions. We thank Shoshana Wodak and Mauricio Macossay-Castillo for helpful comments on the manuscript. We thank Joseph Georgeson for help with operating the microscope, and Harry Greenblatt for help with computer systems. This work was supported by the Israel Science Foundation (1452/18); by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (Grant Agreement No. 819318); by a research grant from A.-M. Boucher; and by research grants from the Estelle Funk Foundation, the Estate of Fannie Sherr, the Estate of Albert Delighter, the Merle S. Cahn Foundation, Mrs. Mildred S. Gosden, the Estate of Elizabeth Wachsman, and the Arnold Bortman Family Foundation. E.D.L. is incumbent of the Recanati Career Development Chair of Cancer Research.
Authors Contributions: B.D. and E.D.L. designed the analyses and experiments. B.D. carried out the computational analyses. B.D. and E.D.L. analyzed the data. O.M. created the yeast strains and carried out the microscopy screen. B.D. analyzed the microscopy data. B.D. and E.D.L. wrote the manuscript.
Edited by Monika Fuxreiter
References
- 1.Xue B., Dunker A.K., Uversky V.N. Orderly order in protein intrinsic disorder distribution: disorder in 3500 proteomes from viruses and the three domains of life. J. Biomol. Struct. Dyn. 2012;30:137–149. doi: 10.1080/07391102.2012.675145. [DOI] [PubMed] [Google Scholar]
- 2.Romero P., Obradovic Z., Li X., Garner E.C., Brown C.J., Dunker A.K. Sequence complexity of disordered protein. Proteins. 2001;42:38–48. doi: 10.1002/1097-0134(20010101)42:1<38::aid-prot50>3.0.co;2-3. [DOI] [PubMed] [Google Scholar]
- 3.Uversky V.N., Gillespie J.R., Fink A. Why Are “natively unfolded” proteins unstructured under physiologic conditions? Proteins: Struct. Funct. Bioinf. 2000;41:415–427. doi: 10.1002/1097-0134(20001115)41:3<415::aid-prot130>3.0.co;2-7. [DOI] [PubMed] [Google Scholar]
- 4.Dunker A.K., Lawson J.D., Brown C.J., Williams R.M., Romero P., Oh J.S., Oldfield C.J., Campen A.M., Ratliff C.M., Hipps K.W., Ausio J., Nissen M.S., Reeves R., Kang C., Kissinger C.R., Bailey R.W., Griswold M.D., Chiu W., Garner E.C., Obradovic Z. Intrinsically disordered protein. J. Mol. Graph. Model. 2001;19:26–59. doi: 10.1016/s1093-3263(00)00138-8. [DOI] [PubMed] [Google Scholar]
- 5.Pavletich N.P., Pabo C.O. Zinc finger-DNA recognition: crystal structure of a Zif268–DNA complex at 2.1 A. Science. 1991;252:809–817. doi: 10.1126/science.2028256. [DOI] [PubMed] [Google Scholar]
- 6.Dunker A.K., Brown C.J., Lawson J.D., Iakoucheva L.M., Obradović Z. Intrinsic disorder and protein function. Biochemistry. 2002;41:6573–6582. doi: 10.1021/bi012159+. [DOI] [PubMed] [Google Scholar]
- 7.Tompa P. The interplay between structure and function in intrinsically unstructured proteins. FEBS Lett. 2005;579:3346–3354. doi: 10.1016/j.febslet.2005.03.072. [DOI] [PubMed] [Google Scholar]
- 8.Van Der Lee R., Buljan M., Lang B., Weatheritt R.J., Daughdrill G.W., Dunker A.K., Fuxreiter M., Gough J., Gsponer J., Jones D.T. Others, classification of intrinsically disordered regions and proteins. Chem. Rev. 2014;114:6589–6631. doi: 10.1021/cr400525m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Schueler-Furman O., Wodak S.J. Computational approaches to investigating allostery. Curr. Opin. Struct. Biol. 2016;41:159–171. doi: 10.1016/j.sbi.2016.06.017. [DOI] [PubMed] [Google Scholar]
- 10.Dyson H.J., Wright P.E. Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell Biol. 2005;6:197–208. doi: 10.1038/nrm1589. [DOI] [PubMed] [Google Scholar]
- 11.Forman-Kay J.D., Mittag T. From sequence and forces to structure, function, and evolution of intrinsically disordered proteins. Structure. 2013;21:1492–1499. doi: 10.1016/j.str.2013.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Patel S.S., Belmont B.J., Sante J.M., Rexach M.F. Natively unfolded nucleoporins gate protein diffusion across the nuclear pore complex. Cell. 2007;129:83–96. doi: 10.1016/j.cell.2007.01.044. [DOI] [PubMed] [Google Scholar]
- 13.Domanski M., Hertzog M., Coutant J., Gutsche-Perelroizen I., Bontems F., Carlier M.-F., Guittet E., van Heijenoort C. Coupling of folding and binding of thymosin beta4 upon interaction with monomeric actin monitored by nuclear magnetic resonance. J. Biol. Chem. 2004;279:23637–23645. doi: 10.1074/jbc.M311413200. [DOI] [PubMed] [Google Scholar]
- 14.Dyson H.J., Wright P.E. Coupling of folding and binding for unstructured proteins. Curr. Opin. Struct. Biol. 2002;12:54–60. doi: 10.1016/s0959-440x(02)00289-0. [DOI] [PubMed] [Google Scholar]
- 15.Mészáros B., Tompa P., Simon I., Dosztányi Z. Molecular principles of the interactions of disordered proteins. J. Mol. Biol. 2007;372:549–561. doi: 10.1016/j.jmb.2007.07.004. [DOI] [PubMed] [Google Scholar]
- 16.Pancsa R., Fuxreiter M. Interactions via intrinsically disordered regions: what kind of motifs? IUBMB Life. 2012;64:513–520. doi: 10.1002/iub.1034. [DOI] [PubMed] [Google Scholar]
- 17.Stein A., Mosca R., Aloy P. Three-dimensional modeling of protein interactions and complexes is going ‘omics. Curr. Opin. Struct. Biol. 2011;21:200–208. doi: 10.1016/j.sbi.2011.01.005. [DOI] [PubMed] [Google Scholar]
- 18.Davey N.E., Van Roey K., Weatheritt R.J., Toedt G., Uyar B., Altenberg B., Budd A., Diella F., Dinkel H., Gibson T.J. Attributes of short linear motifs. Mol. BioSyst. 2012;8:268–281. doi: 10.1039/c1mb05231d. [DOI] [PubMed] [Google Scholar]
- 19.Dinkel H., Michael S., Weatheritt R.J., Davey N.E., Van Roey K., Altenberg B., Toedt G., Uyar B., Seiler M., Budd A., Jödicke L., Dammert M.A., Schroeter C., Hammer M., Schmidt T., Jehl P., McGuigan C., Dymecka M., Chica C., Luck K., Via A., Chatr-aryamontri A., Haslam N., Grebnev G., Edwards R.J., Steinmetz M.O., Meiselbach H., Diella F., Gibson T.J. ELM—the database of eukaryotic linear motifs. Nucleic Acids Res. 2012;40:D242–D251. doi: 10.1093/nar/gkr1064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hu H., Columbus J., Zhang Y., Wu D., Lian L., Yang S., Goodwin J., Luczak C., Carter M., Chen L., James M., Davis R., Sudol M., Rodwell J., Herrero J.J. A map of WW domain family interactions. Proteomics. 2004;4:643–655. doi: 10.1002/pmic.200300632. [DOI] [PubMed] [Google Scholar]
- 21.Ivarsson Y. Plasticity of PDZ domains in ligand recognition and signaling. FEBS Lett. 2012;586:2638–2647. doi: 10.1016/j.febslet.2012.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Mészáros B., Dosztányi Z., Simon I. Disordered binding regions and linear motifs—bridging the gap between two models of molecular recognition. PLoS One. 2012;7 doi: 10.1371/journal.pone.0046829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Van Roey K., Uyar B., Weatheritt R.J., Dinkel H., Seiler M., Budd A., Gibson T.J., Davey N.E. Short linear motifs: ubiquitous and functionally diverse protein interaction modules directing cell regulation. Chem. Rev. 2014;114:6733–6778. doi: 10.1021/cr400585q. [DOI] [PubMed] [Google Scholar]
- 24.Xin X., Gfeller D., Cheng J., Tonikian R., Sun L., Guo A., Lopez L., Pavlenco A., Akintobi A., Zhang Y., Rual J.-F., Currell B., Seshagiri S., Hao T., Yang X., Shen Y.A., Salehi-Ashtiani K., Li J., Cheng A.T., Bouamalay D., Lugari A., Hill D.E., Grimes M.L., Drubin D.G., Grant B.D., Vidal M., Boone C., Sidhu S.S., Bader G.D. SH3 interactome conserves general function over specific form. Mol. Syst. Biol. 2013;9:652. doi: 10.1038/msb.2013.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kelil A., Levy E.D., Michnick S.W. Evolution of domain–peptide interactions to coadapt specificity and affinity to functional diversity. Proc. Natl. Acad. Sci. U. S. A. 2016;113:E3862–E3871. doi: 10.1073/pnas.1518469113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Koshland D.E. The key–lock theory and the induced fit theory. Angew. Chem. Int. Ed Engl. 1995;33:2375–2378. [Google Scholar]
- 27.Koshland D.E. Application of a theory of enzyme specificity to protein synthesis. Proc. Natl. Acad. Sci. U. S. A. 1958;44:98–104. doi: 10.1073/pnas.44.2.98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Uversky V.N. The multifaceted roles of intrinsic disorder in protein complexes. FEBS Lett. 2015;589:2498–2506. doi: 10.1016/j.febslet.2015.06.004. [DOI] [PubMed] [Google Scholar]
- 29.Oldfield C.J., Meng J., Yang J.Y., Yang M.Q., Uversky V.N., Dunker A.K. Flexible nets: disorder and induced fit in the associations of p53 and 14-3-3 with their partners. BMC Genomics. 2008;9:S1. doi: 10.1186/1471-2164-9-S1-S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hsu W.-L., Oldfield C., Meng J., Huang F., Xue B., Uversky V.N., Romero P., Dunker A.K. Intrinsic protein disorder and protein–protein interactions. Pac. Symp. Biocomput. 2012:116–127. [PubMed] [Google Scholar]
- 31.Diella F., Haslam N., Chica C., Budd A., Michael S., Brown N.P., Trave G., Gibson T.J. Understanding eukaryotic linear motifs and their role in cell signaling and regulation. Front. Biosci. 2008;13:6580–6603. doi: 10.2741/3175. [DOI] [PubMed] [Google Scholar]
- 32.Wright P.E., Dyson H.J. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 2015;16:18–29. doi: 10.1038/nrm3920. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Berlow R.B., Dyson H.J., Wright P.E. Hypersensitive termination of the hypoxic response by a disordered protein switch. Nature. 2017;543:447–451. doi: 10.1038/nature21705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bhattacharyya R.P., Reményi A., Yeh B.J., Lim W.A. Domains, motifs, and scaffolds: the role of modular interactions in the evolution and wiring of cell signaling circuits. Annu. Rev. Biochem. 2006;75:655–680. doi: 10.1146/annurev.biochem.75.103004.142710. [DOI] [PubMed] [Google Scholar]
- 35.Mosca R., Pache R.A., Aloy P. The role of structural disorder in the rewiring of protein interactions through evolution. Mol. Cell. Proteomics. 2012;11 doi: 10.1074/mcp.M111.014969. M111.014969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Beltrao P., Serrano L. Specificity and evolvability in eukaryotic protein interaction networks. PLoS Comput. Biol. 2007;3 doi: 10.1371/journal.pcbi.0030025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Buljan M., Chalancon G., Eustermann S., Wagner G.P., Fuxreiter M., Bateman A., Babu M.M. Tissue-specific splicing of disordered segments that embed binding motifs rewires protein interaction networks. Mol. Cell. 2012;46:871–883. doi: 10.1016/j.molcel.2012.05.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Pechmann S., Frydman J. Interplay between chaperones and protein disorder promotes the evolution of protein networks. PLoS Comput. Biol. 2014;10 doi: 10.1371/journal.pcbi.1003674. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Ekman D., Light S., Björklund A.K., Elofsson A. What properties characterize the hub proteins of the protein–protein interaction network of Saccharomyces cerevisiae? Genome Biol. 2006;7:R45. doi: 10.1186/gb-2006-7-6-r45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Van Roey K., Gibson T.J., Davey N.E. Motif switches: decision-making in cell regulation. Curr. Opin. Struct. Biol. 2012;22:378–385. doi: 10.1016/j.sbi.2012.03.004. [DOI] [PubMed] [Google Scholar]
- 41.Uversky V.N. Advances in Protein Chemistry and Structural Biology. Elsevier; 2018. Intrinsic disorder, protein–protein interactions, and disease; pp. 85–121. [DOI] [PubMed] [Google Scholar]
- 42.Tompa P., Fuxreiter M. Fuzzy complexes: polymorphism and structural disorder in protein–protein interactions. Trends Biochem. Sci. 2008;33:2–8. doi: 10.1016/j.tibs.2007.10.003. [DOI] [PubMed] [Google Scholar]
- 43.Marcotte E.M., Tsechansky M. Disorder, promiscuity, and toxic partnerships. Cell. 2009;138:16–18. doi: 10.1016/j.cell.2009.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Vavouri T., Semple J.I., Garcia-Verdugo R., Lehner B. Intrinsic protein disorder and interaction promiscuity are widely associated with dosage sensitivity. Cell. 2009;138:198–208. doi: 10.1016/j.cell.2009.04.029. [DOI] [PubMed] [Google Scholar]
- 45.Pechmann S., Levy E.D., Tartaglia G.G., Vendruscolo M. Physicochemical principles that regulate the competition between functional and dysfunctional association of proteins. Proc. Natl. Acad. Sci. U. S. A. 2009;106:10159–10164. doi: 10.1073/pnas.0812414106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Garcia-Seisdedos H., Empereur-Mot C., Elad N., Levy E.D. Proteins evolve on the edge of supramolecular self-assembly. Nature. 2017;548:244–247. doi: 10.1038/nature23320. [DOI] [PubMed] [Google Scholar]
- 47.Levy E.D. A simple definition of structural regions in proteins and its use in analyzing interface evolution. J. Mol. Biol. 2010;403:660–670. doi: 10.1016/j.jmb.2010.09.028. [DOI] [PubMed] [Google Scholar]
- 48.Gsponer J., Futschik M.E., Teichmann S.A., Babu M.M. Tight regulation of unstructured proteins: from transcript synthesis to protein degradation. Science. 2008;322:1365–1368. doi: 10.1126/science.1163581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Levy E.D., Kowarzyk J., Michnick S.W. High-resolution mapping of protein concentration reveals principles of proteome architecture and adaptation. Cell Rep. 2014;7:1333–1340. doi: 10.1016/j.celrep.2014.04.009. [DOI] [PubMed] [Google Scholar]
- 50.Landry C.R., Levy E.D., Abd Rabbo D., Tarassov K., Michnick S.W. Extracting insight from noisy cellular networks. Cell. 2013;155:983–989. doi: 10.1016/j.cell.2013.11.003. [DOI] [PubMed] [Google Scholar]
- 51.Macossay-Castillo M., Marvelli G., Guharoy M., Jain A., Kihara D., Tompa P., Wodak S.J. The balancing act of intrinsically disordered proteins: enabling functional diversity while minimizing promiscuity. J. Mol. Biol. 2019 doi: 10.1016/j.jmb.2019.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zhang J., Maslov S., Shakhnovich E.I. Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size. Mol. Syst. Biol. 2008;4:210. doi: 10.1038/msb.2008.48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Heo M., Maslov S., Shakhnovich E. Topology of protein interaction network shapes protein abundances and strengths of their functional and nonspecific interactions. Proc. Natl. Acad. Sci. U. S. A. 2011;108:4258–4263. doi: 10.1073/pnas.1009392108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Johnson M.E., Hummer G. Nonspecific binding limits the number of proteins in a cell and shapes their interaction networks. Proc. Natl. Acad. Sci. U. S. A. 2011;108:603–608. doi: 10.1073/pnas.1010954108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Levy E.D., Landry C.R., Michnick S.W. How perfect can protein interactomes be? Sci. Signal. 2009;2 doi: 10.1126/scisignal.260pe11. [DOI] [PubMed] [Google Scholar]
- 56.Yang J.-R., Yang J.-R., Liao B.-Y., Zhuang S.-M., Zhang J. Protein misinteraction avoidance causes highly expressed proteins to evolve slowly. Proc. Natl. Acad. Sci. 2012;109:E831–E840. doi: 10.1073/pnas.1117408109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Gsponer J., Babu M.M. Cellular strategies for regulating functional and nonfunctional protein aggregation. Cell Rep. 2012;2:1425–1437. doi: 10.1016/j.celrep.2012.09.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Garcia-Seisdedos H., Villegas J.A., Levy E.D. Infinite assembly of folded proteins in evolution, disease, and engineering. Angew. Chem. Int. Ed Engl. 2018 doi: 10.1002/anie.201806092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Schreiber G., Keating A.E. Protein binding specificity versus promiscuity. Curr. Opin. Struct. Biol. 2011;21:50–61. doi: 10.1016/j.sbi.2010.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Schweke H., Mucchielli M.H., Sacquin-Mora S., Bei W. Protein interaction energy landscapes are shaped by functional and also non-functional partners. bioRxiv. 2018 doi: 10.1016/j.jmb.2019.12.047. [Preprint] [DOI] [PubMed] [Google Scholar]
- 61.Tomala K., Korona R. Evaluating the fitness cost of protein expression in Saccharomyces cerevisiae. Genome Biol. Evol. 2013;5:2051–2060. doi: 10.1093/gbe/evt154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Bolognesi B., Lorenzo Gotor N., Dhar R., Cirillo D., Baldrighi M., Tartaglia G.G., Lehner B. A concentration-dependent liquid phase separation can cause toxicity upon increased protein expression. Cell Rep. 2016;16:222–231. doi: 10.1016/j.celrep.2016.05.076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Hagai T., Azia A., Babu M.M., Andino R. Use of host-like peptide motifs in viral proteins is a prevalent strategy in host–virus interactions. Cell Rep. 2014;7:1729–1739. doi: 10.1016/j.celrep.2014.04.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Babu M.M., van der Lee R., de Groot N.S., Gsponer J. Intrinsically disordered proteins: regulation and disease. Curr. Opin. Struct. Biol. 2011;21:432–440. doi: 10.1016/j.sbi.2011.03.011. [DOI] [PubMed] [Google Scholar]
- 65.Davey N.E., Travé G., Gibson T.J. How viruses hijack cell regulation. Trends Biochem. Sci. 2011;36:159–169. doi: 10.1016/j.tibs.2010.10.002. [DOI] [PubMed] [Google Scholar]
- 66.Balch W.E., Morimoto R.I., Dillin A., Kelly J.W. Adapting proteostasis for disease intervention. Science. 2008;319:916–919. doi: 10.1126/science.1141448. [DOI] [PubMed] [Google Scholar]
- 67.Sanchez de Groot N., Torrent M., Villar-Piqué A., Lang B., Ventura S., Gsponer J., Babu M.M. Evolutionary selection for protein aggregation. Biochem. Soc. Trans. 2012;40:1032–1037. doi: 10.1042/BST20120160. [DOI] [PubMed] [Google Scholar]
- 68.Dobson C.M. Protein folding and misfolding. Nature. 2003;426:884–890. doi: 10.1038/nature02261. [DOI] [PubMed] [Google Scholar]
- 69.Geiler-Samerotte K.A., Dion M.F., Budnik B.A., Wang S.M., Hartl D.L., Drummond D.A. Misfolded proteins impose a dosage-dependent fitness cost and trigger a cytosolic unfolded protein response in yeast. Proc. Natl. Acad. Sci. U. S. A. 2011;108:680–685. doi: 10.1073/pnas.1017570108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Bucciantini M., Giannoni E., Chiti F., Baroni F., Formigli L., Zurdo J., Taddei N., Ramponi G., Dobson C.M., Stefani M. Inherent toxicity of aggregates implies a common mechanism for protein misfolding diseases. Nature. 2002;416:507–511. doi: 10.1038/416507a. [DOI] [PubMed] [Google Scholar]
- 71.Sormanni P., Vendruscolo M. Protein solubility predictions using the CamSol method in the study of protein homeostasis. Cold Spring Harb. Perspect. Biol. 2019 doi: 10.1101/cshperspect.a033845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Ciryam P., Tartaglia G.G., Morimoto R.I., Dobson C.M., Vendruscolo M. Widespread aggregation and neurodegenerative diseases are associated with supersaturated proteins. Cell Rep. 2013;5:781–790. doi: 10.1016/j.celrep.2013.09.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Levy E.D., Michnick S.W., Landry C.R. Protein abundance is key to distinguish promiscuous from functional phosphorylation based on evolutionary information. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 2012;367:2594–2606. doi: 10.1098/rstb.2012.0078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Albu R.F., Chan G.T., Zhu M., Wong E.T.C., Taghizadeh F., Hu X., Mehran A.E., Johnson J.D., Gsponer J., Mayor T. A feature analysis of lower solubility proteins in three eukaryotic systems. J. Proteome. 2015;118:21–38. doi: 10.1016/j.jprot.2014.10.011. [DOI] [PubMed] [Google Scholar]
- 75.Chiti F., Stefani M., Taddei N., Ramponi G., Dobson C.M. Rationalization of the effects of mutations on peptide and protein aggregation rates. Nature. 2003;424:805–808. doi: 10.1038/nature01891. [DOI] [PubMed] [Google Scholar]
- 76.Tartaglia G.G., Pechmann S., Dobson C.M., Vendruscolo M. A relationship between mRNA expression levels and protein solubility in E. coli. J. Mol. Biol. 2009;388:381–389. doi: 10.1016/j.jmb.2009.03.002. [DOI] [PubMed] [Google Scholar]
- 77.Ventura S. Sequence determinants of protein aggregation: tools to increase protein solubility. Microb. Cell Factories. 2005;4:11. doi: 10.1186/1475-2859-4-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Levy E.D., De S., Teichmann S.A. Cellular crowding imposes global constraints on the chemistry and evolution of proteomes. Proc. Natl. Acad. Sci. U. S. A. 2012;109:20461–20466. doi: 10.1073/pnas.1209312109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Meyer K., Kirchner M., Uyar B., Cheng J.-Y., Russo G., Hernandez-Miranda L.R., Szymborska A., Zauber H., Rudolph I.-M., Willnow T.E., Akalin A., Haucke V., Gerhardt H., Birchmeier C., Kühn R., Krauss M., Diecke S., Pascual J.M., Selbach M. Mutations in disordered regions can cause disease by creating dileucine motifs. Cell. 2018;175:239–253.e17. doi: 10.1016/j.cell.2018.08.019. [DOI] [PubMed] [Google Scholar]
- 80.Neduva V., Linding R., Su-Angrand I., Stark A., de Masi F., Gibson T.J., Lewis J., Serrano L., Russell R.B. Systematic discovery of new recognition peptides mediating protein interaction networks. PLoS Biol. 2005;3 doi: 10.1371/journal.pbio.0030405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Nielsen J.T., Mulder F.A.A. Quality and bias of protein disorder predictors. Sci. Rep. 2019;9 doi: 10.1038/s41598-019-41644-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.El-Gebali S., Mistry J., Bateman A., Eddy S.R., Luciani A., Potter S.C., Qureshi M., Richardson L.J., Salazar G.A., Smart A., Sonnhammer E.L.L., Hirsh L., Paladin L., Piovesan D., Tosatto S.C.E., Finn R.D. The Pfam protein families database in 2019. Nucleic Acids Res. 2019;47:D427–D432. doi: 10.1093/nar/gky995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Pandurangan A.P., Stahlhacke J., Oates M.E., Smithers B., Gough J. The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver. Nucleic Acids Res. 2019;47:D490–D494. doi: 10.1093/nar/gky1130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Dosztanyi Z., Csizmok V., Tompa P., Simon I. The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. J. Mol. Biol. 2005;347:827–839. doi: 10.1016/j.jmb.2005.01.071. [DOI] [PubMed] [Google Scholar]
- 85.Bateman A., Birney E., Cerruti L., Durbin R., Etwiller L., Eddy S.R., Griffiths-Jones S., Howe K.L., Marshall M., Sonnhammer E.L.L. The Pfam protein families database. Nucleic Acids Res. 2002;30:276–280. doi: 10.1093/nar/30.1.276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Gough J., Karplus K., Hughey R., Chothia C. Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure1. J. Mol. Biol. 2001;313:903–919. doi: 10.1006/jmbi.2001.5080. [DOI] [PubMed] [Google Scholar]
- 87.Wong E.T.C., Na D., Gsponer J. On the importance of polar interactions for complexes containing intrinsically disordered proteins. PLoS Comput. Biol. 2013;9 doi: 10.1371/journal.pcbi.1003192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Oates M.E., Romero P., Ishida T., Ghalwash M., Mizianty M.J., Xue B., Dosztányi Z., Uversky V.N., Obradovic Z., Kurgan L., Dunker A.K., Gough J. D2P2: database of disordered protein predictions. Nucleic Acids Res. 2013;41:D508–D516. doi: 10.1093/nar/gks1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Li X., Romero P., Rani M., Dunker A.K., Obradovic Z. Predicting protein disorder for N-, C-, and internal regions. Genome Inform. Ser. Workshop Genome Inform. 1999;10:30–40. [PubMed] [Google Scholar]
- 90.Ishida T., Kinoshita K. PrDOS: prediction of disordered protein regions from amino acid sequence. Nucleic Acids Res. 2007;35:W460–W464. doi: 10.1093/nar/gkm363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Ghalwash M.F., Dunker A.K., Obradović Z. Uncertainty analysis in protein disorder prediction. Mol. BioSyst. 2012;8:381–391. doi: 10.1039/c1mb05373f. [DOI] [PubMed] [Google Scholar]
- 92.Walsh I., Martin A.J.M., Di Domenico T., Tosatto S.C.E. ESpritz: accurate and fast prediction of protein disorder. Bioinformatics. 2012;28:503–509. doi: 10.1093/bioinformatics/btr682. [DOI] [PubMed] [Google Scholar]
- 93.Wimley W.C., White S.H. Experimentally determined hydrophobicity scale for proteins at membrane interfaces. Nat. Struct. Biol. 1996;3:842–848. doi: 10.1038/nsb1096-842. [DOI] [PubMed] [Google Scholar]
- 94.Kyte J., Doolittle R.F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982;157:105–132. doi: 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
- 95.Roseman M.A. Hydrophobicity of the peptide C = O…H–N hydrogen-bonded group. J. Mol. Biol. 1988;201:621–623. doi: 10.1016/0022-2836(88)90642-0. [DOI] [PubMed] [Google Scholar]
- 96.Sormanni P., Aprile F.A., Vendruscolo M. The CamSol method of rational design of protein mutants with enhanced solubility. J. Mol. Biol. 2015;427:478–490. doi: 10.1016/j.jmb.2014.09.026. [DOI] [PubMed] [Google Scholar]
- 97.Garbuzynskiy S.O., Lobanov M.Y., Galzitskaya O.V. FoldAmyloid: a method of prediction of amyloidogenic regions from protein sequence. Bioinformatics. 2010;26:326–332. doi: 10.1093/bioinformatics/btp691. [DOI] [PubMed] [Google Scholar]
- 98.Pawar A.P., DuBay K.F., Zurdo J., Chiti F., Vendruscolo M., Dobson C.M. Prediction of “aggregation-prone” and “aggregation-susceptible” regions in proteins associated with neurodegenerative diseases. J. Mol. Biol. 2005;350:379–392. doi: 10.1016/j.jmb.2005.04.016. [DOI] [PubMed] [Google Scholar]
- 99.Walsh I., Seno F., Tosatto S.C.E., Trovato A. PASTA 2.0: an improved server for protein aggregation prediction. Nucleic Acids Res. 2014;42:W301–W307. doi: 10.1093/nar/gku399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Warwicker J., Charonis S., Curtis R.A. Lysine and arginine content of proteins: computational analysis suggests a new tool for solubility design. Mol. Pharm. 2014;11:294–303. doi: 10.1021/mp4004749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Das R.K., Pappu R.V. Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues. Proc. Natl. Acad. Sci. U. S. A. 2013;110:13392–13397. doi: 10.1073/pnas.1304749110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Barros-Barbosa A., Rodrigues T.A., Ferreira M.J., Pedrosa A.G., Teixeira N.R., Francisco T., Azevedo J.E. The intrinsically disordered nature of the peroxisomal protein translocation machinery. FEBS J. 2019;286:24–38. doi: 10.1111/febs.14704. [DOI] [PubMed] [Google Scholar]
- 103.Nagai T., Ibata K., Park E.S., Kubota M., Mikoshiba K., Miyawaki A. A variant of yellow fluorescent protein with fast and efficient maturation for cell-biological applications. Nat. Biotechnol. 2002;20:87–90. doi: 10.1038/nbt0102-87. [DOI] [PubMed] [Google Scholar]
- 104.Remy I., Michnick S.W. Clonal selection and in vivo quantitation of protein interactions with protein-fragment complementation assays. Proc. Natl. Acad. Sci. U. S. A. 1999;96:5394–5399. doi: 10.1073/pnas.96.10.5394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Ruschak A.M., Rose J.D., Coughlin M.P., Religa T.L. Engineered solubility tag for solution NMR of proteins. Protein Sci. 2013;22:1646–1654. doi: 10.1002/pro.2337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Kramer R.M., Shende V.R., Motl N., Pace C.N., Scholtz J.M. Toward a molecular understanding of protein solubility: increased negative surface charge correlates with increased solubility. Biophys. J. 2012;102:1907–1915. doi: 10.1016/j.bpj.2012.01.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Trevino S.R., Scholtz J.M., Pace C.N. Measuring and increasing protein solubility. J. Pharm. Sci. 2008;97:4155–4166. doi: 10.1002/jps.21327. [DOI] [PubMed] [Google Scholar]
- 108.Trevino S.R., Scholtz J.M., Pace C.N. Amino acid contribution to protein solubility: Asp, Glu, and Ser contribute more favorably than the other hydrophilic amino acids in RNase Sa. J. Mol. Biol. 2007;366:449–460. doi: 10.1016/j.jmb.2006.10.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Mogk A., Bukau B., Kampinga H.H. Cellular handling of protein aggregates by disaggregation machines. Mol. Cell. 2018;69:214–226. doi: 10.1016/j.molcel.2018.01.004. [DOI] [PubMed] [Google Scholar]
- 110.Cumberworth A., Lamour G., Babu M.M., Gsponer J. Promiscuity as a functional trait: intrinsically disordered regions as central players of interactomes. Biochem. J. 2013;454:361–369. doi: 10.1042/BJ20130545. [DOI] [PubMed] [Google Scholar]
- 111.Gemayel R., Chavali S., Pougach K., Legendre M., Zhu B., Boeynaems S., van der Zande E., Gevaert K., Rousseau F., Schymkowitz J., Babu M.M., Verstrepen K.J. Variable glutamine-rich repeats modulate transcription factor activity. Mol. Cell. 2015;59:615–627. doi: 10.1016/j.molcel.2015.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Olzscha H., Schermann S.M., Woerner A.C., Pinkert S., Hecht M.H., Tartaglia G.G., Vendruscolo M., Hayer-Hartl M., Hartl F.U., Vabulas R.M. Amyloid-like aggregates sequester numerous metastable proteins with essential cellular functions. Cell. 2011;144:67–78. doi: 10.1016/j.cell.2010.11.050. [DOI] [PubMed] [Google Scholar]
- 113.Drummond D.A., Bloom J.D., Adami C., Wilke C.O., Arnold F.H. Why highly expressed proteins evolve slowly. Proc. Natl. Acad. Sci. U. S. A. 2005;102:14338–14343. doi: 10.1073/pnas.0504070102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Pal C., Papp B., Hurst L.D. Highly expressed genes in yeast evolve slowly. Genetics. 2001;158:927–931. doi: 10.1093/genetics/158.2.927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Rocha E.P., Danchin A. An analysis of determinants of amino acids substitution rates in bacterial proteins. Mol. Biol. Evol. 2004;21:108–116. doi: 10.1093/molbev/msh004. [DOI] [PubMed] [Google Scholar]
- 116.Drummond D.A., Raval A., Wilke C.O. A single determinant dominates the rate of yeast protein evolution. Mol. Biol. Evol. 2006;23:327–337. doi: 10.1093/molbev/msj038. [DOI] [PubMed] [Google Scholar]
- 117.Kimura M. Evolutionary rate at the molecular level. Nature. 1968;217:624–626. doi: 10.1038/217624a0. [DOI] [PubMed] [Google Scholar]
- 118.Krylov D.M., Wolf Y.I., Rogozin I.B., Koonin E.V. Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. Genome Res. 2003;13:2229–2235. doi: 10.1101/gr.1589103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Zhang J., Yang J.R. Determinants of the rate of protein sequence evolution. Nat. Rev. Genet. 2015;16:409–420. doi: 10.1038/nrg3950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Foy S.G., Wilson B.A., Bertram J., Cordes M.H.J., Masel J. A shift in aggregation avoidance strategy Marks a long-term direction to protein evolution. Genetics. 2019;211:1345–1355. doi: 10.1534/genetics.118.301719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Krogh A., Larsson B., von Heijne G., Sonnhammer E.L. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 2001;305:567–580. doi: 10.1006/jmbi.2000.4315. [DOI] [PubMed] [Google Scholar]
- 122.Bernsel A., Viklund H., Hennerdal A., Elofsson A. TOPCONS: consensus prediction of membrane protein topology. Nucleic Acids Res. 2009;37:W465–W468. doi: 10.1093/nar/gkp363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Viklund H., Elofsson A. OCTOPUS: improving topology prediction by two-track ANN-based preference scores and an extended topological grammar. Bioinformatics. 2008;24:1662–1668. doi: 10.1093/bioinformatics/btn221. [DOI] [PubMed] [Google Scholar]
- 124.Käll L., Krogh A., Sonnhammer E.L.L. An HMM posterior decoder for sequence feature prediction that includes homology information. Bioinformatics. 2005;21(Suppl. 1):i251–i257. doi: 10.1093/bioinformatics/bti1014. [DOI] [PubMed] [Google Scholar]
- 125.Reynolds S.M., Käll L., Riffle M.E., Bilmes J.A., Noble W.S. Transmembrane topology and signal peptide prediction using dynamic bayesian networks. PLoS Comput. Biol. 2008;4 doi: 10.1371/journal.pcbi.1000213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Bernsel A., Viklund H., Falk J., Lindahl E., von Heijne G., Elofsson A. Prediction of membrane-protein topology from first principles. Proc. Natl. Acad. Sci. U. S. A. 2008;105:7177–7181. doi: 10.1073/pnas.0711151105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Viklund H., Bernsel A., Skwark M., Elofsson A. SPOCTOPUS: a combined predictor of signal peptides and membrane protein topology. Bioinformatics. 2008;24:2928–2929. doi: 10.1093/bioinformatics/btn550. [DOI] [PubMed] [Google Scholar]
- 128.Hessa T., Meindl-Beinker N.M., Bernsel A., Kim H., Sato Y., Lerch-Bader M., Nilsson I., White S.H., von Heijne G. Molecular code for transmembrane-helix recognition by the Sec61 translocon. Nature. 2007;450:1026–1030. doi: 10.1038/nature06387. [DOI] [PubMed] [Google Scholar]
- 129.Wang M., Herrmann C.J., Simonovic M., Szklarczyk D., von Mering C. Version 4.0 of PaxDb: protein abundance data, integrated across model organisms, tissues, and cell-lines. Proteomics. 2015 doi: 10.1002/pmic.201400441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Milo R. What is the total number of protein molecules per cell volume? A call to rethink some published values. Bioessays. 2013;35:1050–1055. doi: 10.1002/bies.201300066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Roseman M.A. Hydrophilicity of polar amino acid side-chains is markedly reduced by flanking peptide bonds. J. Mol. Biol. 1988;200:513–522. doi: 10.1016/0022-2836(88)90540-2. [DOI] [PubMed] [Google Scholar]
- 132.Ahmed A.B., Kajava A.V. Breaking the amyloidogenicity code: methods to predict amyloids from amino acid sequence. FEBS Lett. 2013;587:1089–1095. doi: 10.1016/j.febslet.2012.12.006. [DOI] [PubMed] [Google Scholar]
- 133.Mumberg D., Muller R., Funk M. Yeast vectors for the controlled expression of heterologous proteins in different genetic backgrounds. Gene. 1995;156:119–122. doi: 10.1016/0378-1119(95)00037-7. [DOI] [PubMed] [Google Scholar]
- 134.Klock H.E., Koesema E.J., Knuth M.W., Lesley S.A. Combining the polymerase incomplete primer extension method for cloning and mutagenesis with microscreening to accelerate structural genomics efforts. Proteins. 2008;71:982–994. doi: 10.1002/prot.21786. [DOI] [PubMed] [Google Scholar]
- 135.Matalon O., Steinberg A., Sass E., Hausser J., Levy E.D. Reprogramming protein abundance fluctuations in single cells by degradation. bioRxiv. 2018 [Preprint] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.