Skip to main content
Molecular & Cellular Proteomics : MCP logoLink to Molecular & Cellular Proteomics : MCP
. 2012 Mar 2;11(7):M111.014969. doi: 10.1074/mcp.M111.014969

The Role of Structural Disorder in the Rewiring of Protein Interactions through Evolution*

Roberto Mosca ‡,§, Roland A Pache ‡,§, Patrick Aloy ‡,‖,**
PMCID: PMC3394948  PMID: 22389433

Abstract

Structurally disordered regions play a key role in protein-protein interaction networks and the evolution of highly connected proteins, enabling the molecular mechanisms for multiple binding. However, the role of protein disorder in the evolution of interaction networks has only been investigated through the analysis of individual proteins, making it impossible to distinguish its specific impact in the (re)shaping of their interaction environments. Now, the availability of large interactomes for several model organisms permits exploration of the role of disorder in protein interaction networks not only at the level of the interacting proteins but of the interactions themselves. By comparing the interactomes of human, fly, and yeast, we discovered that, despite being much more abundant, disordered interactions are significantly less conserved than their ordered counterparts. Furthermore, our analyses provide evidence that this happens not only because disordered proteins are less conserved but also because they display a higher capacity to rewire their interaction neighborhood through evolution. Overall, our results support the hypothesis that conservation of disorder gives a clear evolutionary advantage, facilitating the change of interaction partners during evolution. Moreover, this mechanism is not exclusive of a few anecdotal cases but a global feature present in the interactome networks of entire organisms.


The classical dogma of structural biology maintains that proteins need to adopt a specific well defined three-dimensional structure to perform their functions. However, there is growing evidence suggesting that this paradigm does not hold for a large fraction of the proteome in higher organisms. Indeed, there are many proteins that lack a defined stable secondary or tertiary structure in solution and yet play crucial cellular roles (13). It is now clear that these intrinsically unstructured or disordered proteins (IDPs)1 might provide some interesting functional advantages over their more structured counterparts. For instance, because they often offer a large and flexible interaction surface area, IDPs are ideal candidates to mediate in signaling cascades where specific interactions with fast association/dissociation rates are required (4, 5). However, maintaining a pool of IDPs is not easy for the cell machinery because they tend to aggregate and are more prone to proteolysis, and their overexpression is often damaging (6, 7). Thus, it is not surprising that they are subject to tight regulation from transcript synthesis to protein degradation (2).

From an evolutionary perspective, disordered regions have been classified as fast evolving (8), because they show a different pattern of accepted point mutations and present higher rates of insertions and deletions (9, 10). However, IDPs will not accept and incorporate any random mutation, and it has been recently shown that long unstructured stretches in proteins are significantly more conserved than their flanking ordered regions or loops in well structured proteins (11, 12). It is thus clear that there is selective pressure to maintain structural disorder. Indeed, there is a direct relationship between the level of protein disorder and the tree of life (11, 13), suggesting that disorder itself is a major evolutionary tool to increase organism complexity, from bacteria to higher eukaryotes.

With an increasing level of species complexity often comes the implementation of novel regulatory mechanisms, many of which are imprinted as control circuits in protein interaction networks (14). Accordingly, the role of structural disorder has been largely studied in the context of such networks. Interactions between IDPs are common in interactome networks (15) and, in particular, among network hubs (i.e., highly connected proteins) because disorder provides a solution for a protein to interact with many structurally different partners (8, 1619). The binding mechanisms mediated by disorder encompass increased flexibility and adaptability to multiple interaction interfaces, a facilitated regulation via diverse post-translational modifications and an increased interaction interface exposed by disordered regions (20). These characteristics, found to be particularly present in hub proteins, have also been proposed to be crucial in the evolution of protein interaction networks by improving the fitness of such proteins for binding through becoming longer and acquiring more disordered regions (16). Furthermore, it was shown that hubs, and particularly singlish-interface hubs (i.e., those that bind all their partners using only one or two interaction interfaces (8)) evolve at a faster rate because of their disordered regions being subject to fewer evolutionary constraints (8). However, although very inspiring, all of these studies have investigated the role of disorder in the evolution of interaction networks only through the analysis of single proteins.

Additionally, the last decade has seen the emergence of studies devoted to analyzing interactome evolution from a systemic perspective. For instance, several works have tried to estimate the rate of interaction gain and loss among different organisms (2123), and different models have been proposed on how this may shape the structure of interactomes and influence their global properties (2429). All of these advances have been recently reviewed and summarized by Levy and Pereira-Leal (30). In this context, it is our opinion that a lot still needs to be done to clarify which part of the observed effects of structural disorder are related to the evolution of the proteins as individual biological entities and which can entirely be attributed to their role in (re)shaping their interaction environment inside the network.

In this work, we make use of existing large scale interaction data to assess systematically and quantitatively the role of structural disorder in network evolution and rewiring. In particular, we build and compare interactome networks for human, fly, and yeast and classify the interactions into ordered and disordered. We then study whether the two types of interactions are equally conserved through evolution or if, on the contrary, either type is preferred to change the cellular repertoire of interactions, regardless of the level of conservation of their constituent proteins (Fig. 1).

Fig. 1.

Fig. 1.

Conservation of interactions involving disordered proteins. We compared the interactomes of human, fly, and yeast to identify conserved subnetworks and tested for enrichment/depletion of interactions involving ordered (green, yellow, and blue) and disordered proteins (red). A, A′, and A″, as well as B, B′, and B″, are examples for orthologous proteins between the three species and illustrate how disordered interactions can change during evolution.

EXPERIMENTAL PROCEDURES

Collection of High Confidence Proteomes

We collected high confidence sets of proteins for yeast (Saccharomyces cerevisiae) and fly (Drosophila melanogaster) through the mapping of the Saccharomyces Genome Database (http://downloads.yeastgenome.org/, January 2010) and FlyBase (http://www.flybase.org, March 2010) (44) proteins onto UniProt (45) and removed all pseudogenes and dubious ORFs. For human, we collected all UniProt sequences belonging to Homo sapiens. In all three cases, we only considered UniProt entries with evidence at the protein or transcript level and discarded fragments. We then added all splice variants available in Swissprot and removed redundancy by clustering sequences using UniRef100 (45). The final high confidence, nonredundant proteomes contain 5,747 sequences for yeast, 18,547 for fly, and 59,898 for human (see Table I).

Table I. General figures for the model organisms considered.

The table reports the total number of proteins (including splice variants), their structural classification, and the number of such proteins that are conserved (have orthologs in the other organisms). Furthermore, the table shows the size of the binary interactomes and the conserved subnetworks (see main text for details).

Human Fly Yeast
Proteome
    Total number of proteins 59,898 18,547 5,747
    Ordered 17,682 (30%) 5,761 (31%) 1,944 (34%)
    Disordered 17,347 (29%) 5,741 (31%) 1,142 (20%)
    Uncertain proteins 24,869 (42%) 7,045 (38%) 2,661 (46%)
Interactome (complete)
    Number of unique interactions 25,007 21,977 19,812
    Number of unique interactors 8,642 7,360 4,967
    Ordered proteins 2,227 (26%) 2,140 (29%) 1,629 (33%)
    Disordered proteins 2,601 (30%) 2,397 (33%) 1,031 (21%)
    Unclassified proteins 3,814 (44%) 2,823 (38%) 2,307 (46%)
    Ordered interactions 1,848 (8%) 1,221 (6%) 1,649 (8%)
    Disordered interactions 12,372 (49%) 13,384 (61%) 8,861 (45%)
    Unclassified interactions 10,787 (43%) 7,372 (34%) 9,302 (47%)
Versus fly Versus yeast Versus human Versus yeast Versus human Versus fly
Likely conserved interactions 4,822 359 1,297 75 598 815
Subnetworks (complete)
    Number of unique interactions 9,873 3,077 7,467 2,642 5,596 5,664
    Number of unique interactors 3,905 1,699 3,215 1,646 1,842 1,940
    Ordered proteins 1,017 (26%) 674 (40%) 934 (29%) 639 (39%) 722 (39%) 771 (40%)
    Disordered proteins 1,025 (26%) 274 (16%) 863 (27%) 307 (19%) 266 (14%) 285 (15%)
    Unclassified proteins 1,863 (48%) 751 (44%) 1,418 (44%) 700 (43%) 854 (46%) 884 (46%)
    Ordered interactions 900 (9%) 647 (21%) 578 (8%) 410 (16%) 677 (12%) 674 (12%)
    Disordered interactions 4,126 (42%) 848 (28%) 3,650 (49%) 900 (34%) 1790 (32%) 1,883 (33%)
    Unclassified interactions 4,847 (49%) 1,582 (51%) 3,239 (43%) 1,332 (50%) 3,129 (56%) 3,107 (55%)
Construction of Binary Interactomes

We collected all experimentally determined binary interactions stored in the public databases IntAct (46), MINT (47), HPRD (48), BioGRID (49), MatrixDB (50), and DIP (51), mapping protein identification codes to UniProt Accession Codes, using the mapping file provided by UniProt. We remapped the different interactors through UniRef100 to remove redundancy and discarded those interactions involving proteins not present in our high confidence proteomes, as well as those that could not be traced back to their original publications. We then added interactions found in three-dimensional structures (in biological units to avoid crystal contacts), downloaded from the Protein Data Bank (http://www.pdb.org) (36). We considered two amino acid chains to be interacting if they had at least five residue-residue contacts (i.e., hydrogen bonds, disulfide and salt bridges, and van der Waals interactions), not considering obvious physical clashes (i.e., pairs of atoms closer than the sum of their average covalent radii plus 0.5 Å).

Determination of Orthology Relationships

We determined orthology relationships between human, fly, and yeast proteins using proteome-wide reciprocal BLAST searches (52). To remove spurious hits, we required an E -value of <10−10 and considered only hits in the top 10 of the BLASTP output. This resulted in 11,430 yeast/fly, 17,534 yeast/human, and 75,415 fly/human orthologous pairs. To test the robustness of our findings with respect to the orthology assignments, we repeated all of the analyses using Inparanoid assignments (53, 54), both many-to-many and one-to-one relationships (supplemental Table 1), obtaining identical trends (supplemental Fig. 2).

Estimation of Likely Conserved Interactions

We estimated evolutionary distances between homologous proteins as the number of amino acid substitutions per site (d) calculated from the fraction of identical residues (q) using the general equation derived by Grishin (55) that accounts for substitution rate variations both between different types of amino acids and between different sites.

graphic file with name zjw00712-4167-m01.jpg

We solved this equation numerically by iteration, using d = (1/q) − 1, which allows for the substitution rate to vary only among sites, as the starting point, until the difference between subsequent estimates of d was smaller than 10−10 (default parameter).

Then, for each pair of homologs (A/A′, B/B′) that interact in at least one of the interactomes, we calculated the probability of the respective interaction being conserved as the posterior probability of interaction conservation given the difference ΔdA/A′, B/B′ between the evolutionary distances of A and A′, and B and B′.

graphic file with name zjw00712-4167-m02.jpg

This calculation is based on the likelihood ratio of observing the respective Δd under a conservation model C (all pairs of homologs with a conserved interaction) and a null model N (106 random pairs of homologs). We calculated the posterior probability using Bayes' theorem,

graphic file with name zjw00712-4167-m03.jpg

with the prior probability set to

graphic file with name zjw00712-4167-m04.jpg

such that the pair of homologous proteins (X/X′, Y/Y′) with the highest likelihood ratio is assigned an interaction conservation probability of 0.9 (default parameter). Likelihood ratios were smoothed using monotone regression (Pool Adjacent Violators Algorithm, PAVA (56)). We accepted all predicted interactions with a posterior probability bigger than 0.5.

Computation of Disorder Enrichments/Depletions

To assess whether disordered interactions are more or less conserved than ordered interactions, we computed their enrichment/depletion in the interactome networks of the given species (Fig. 2). For this, we calculated the ratio of the fraction of disordered interactions in the two species (B + D) that are conserved in one of the species (B) and the fraction of ordered interactions (A + C) that are also conserved (A), followed by log2 transformation to get a symmetrical range of values.

graphic file with name zjw00712-4167-m05.jpg

We checked the statistical significance of the enrichments by applying a two-sided Fisher's exact test with a standard p value threshold of 0.05. For clarity, we transformed the log2 enrichments into fold enrichments and use negative numbers for depletions so that, for instance, a log2 enrichment of 2 becomes a 4-fold enrichment, whereas a log2 depletion of −2 corresponds to a ratio of 0.25 (i.e., 4-fold depletion indicated by −4).

Fig. 2.

Fig. 2.

General strategy employed in the analysis of conservation of ordered/disordered interactions. After construction of binary interactomes for human, fly, and yeast, we predicted disordered residues in the three species proteomes using VSL2 (31, 32) and classified the interactions into ordered (i.e., interactions between two ordered proteins), disordered (i.e., interactions involving at least one disordered protein), and partially disordered (see main text for details). We then performed pairwise alignments of the binary interactomes to identify conserved and not conserved interactions. Finally, we computed the enrichment/depletion of conserved ordered/disordered interactions in the interactome networks.

RESULTS AND DISCUSSION

To carry out our comparative analysis, the first step was to collect and assemble a high confidence set of protein sequences for human (H. sapiens), fly (D. melanogaster), and yeast (S. cerevisiae) and predict unstructured stretches in these proteins. We did this by means of the disorder predictor VSL2 (31, 32), which is one of the most precise predictors, according to the results of the last critical assessment of structure prediction rounds (33, 34), with an accuracy of ∼81% (32). We then classified each protein as ordered or disordered if more than 70% of its residues were predicted to be structured or unstructured, respectively. We applied such a strict filter to avoid potential artifacts caused by mispredictions. All of the remaining proteins were classified as partially disordered or uncertain. To ensure the robustness of the classification, we repeated the analysis with a different disorder prediction program, DISOPRED2 (14), obtaining very similar results (data not shown). Table I reports the number of proteins included in this study for each species as well as their ordered/disordered structural classification. It can be appreciated how the fraction of ordered and disordered proteins is fairly similar, ∼30% for human and fly and 20% in yeast, in the three species considered and agrees well with current estimations (14).

Whole Interactome Comparisons

After collecting the different proteomes and classifying the proteins as ordered/disordered, we built the interactome networks for each organism and compared their level of conservation among the different species. We retrieved binary protein-protein interactions for the three model organisms from the main data repositories, and made the set nonredundant, as described under “Experimental Procedures” and summarized in Fig. 2. It is worth noting that we deliberately decided not to include experiments reporting multiprotein complexes to avoid errors in the assignment of direct physical interactions between their components and only considered interactions reported by binary detection methods (35). In total, we compiled over 20,000 interactions for each species involving between 5,000 and 8,600 proteins. The percentages of fully ordered and disordered proteins present in the interactome networks range from 26 to 33% and from 21 to 33%, respectively, and they are consistent with the composition observed in full proteomes (Table I). To certify that the strict levels of >70% ordered/disordered residues were comparable between whole proteins and interaction interfaces and discard the possibility that the interactions are mediated by the remaining 30% of disordered/ordered residues, we analyzed all the 970 human, fly, and yeast interactions for which there is a structure available in the Protein Data Bank (36) and found that, for disordered interactions (structured upon association), the interface contains between 64 and 72% of disordered residues on average, whereas the figure drops to 22–24% for ordered interactions. These numbers are fully compatible with our thresholds and show that whenever an interaction is classified as ordered/disordered, the classification is also consistent in the interfaces.

To compare the species interactomes and assess the level of conservation of the interactions, we first inferred orthology relationships between the proteins in each organism and overlaid the networks. We consider an interaction between the two proteins A and B to be conserved if it is also observed between the corresponding orthologs A′ and B′ in the other species. By performing pairwise alignments of the collected binary interactomes for the three organisms, we discovered that interactions involving at least one disordered protein (i.e., disordered interactions) are significantly less conserved than interactions between ordered proteins (i.e., ordered interactions), as determined by enrichment analysis (see “Experimental Procedures”), and this holds for all organism pairs (p values in the range [1.91 × 10−56 and 2.74 × 10−14], two-sided Fisher's exact test). In particular, we found disordered interactions to be between 3.3- and 9.6-fold less conserved than ordered ones (Fig. 3A).

Fig. 3.

Fig. 3.

Conservation of disordered interactions in interactome networks. The values report fold enrichments (depletions, because they are all negative) of conserved disordered interactions in the interactome networks of the three model organisms considered. The numbers refer to the interactome of the organisms in the rows when comparing them to the organisms in the columns for complete interactomes (A), complete interactomes including likely conserved interactions (B), high confidence interactomes including likely conserved interactions (C), and only subnetworks with proteins present in both species (D) (see the main text for details). The degree of enrichment/depletion is shown in different blue tones, from light to dark. All of the depletions were statistically significant with a standard p value threshold of 0.05 (two-sided Fisher's exact test).

Surprisingly, although there is no change in the fraction of ordered/disordered proteins in whole proteomes and interactomes, the figures change considerably when we consider not individual proteins but protein-protein interactions. In this case, we see how the fraction of ordered interactions drops to ∼6–8%, whereas the disordered ones account for 45–61% of the interactions (Table I). However, even if disordered interactions are the main components of interactome networks, if we analyze the interactions that are conserved across different species, our results show that disordered interactions are unequivocally and consistently depleted, and ordered interactions are by far more conserved.

Accounting for Undetected or False Interactions

Although the initial results show a clear trend, we need to consider several factors that could potentially affect the analysis. For instance, it is well known that current interactomes are incomplete, and only a small fraction of the total number of estimated interactions have already been identified (37). The high fraction of missing interactions has been attributed to two main causes: on the one hand, the sampling of the interaction space for the different model organisms is incomplete, and on the other, the discovery methodologies available cannot detect all types of interactions (38). In practical terms, this means that we could wrongly classify an interaction as nonconserved only because it has not been studied or detected in one of the two species compared. We tried to address the partiality introduced by the different proteome sampling by considering only those proteins that had been included in each study. However, unfortunately, this information in not available for most experiments because they only report positive interactions. Nevertheless, what we could do is to check whether there is a bias toward ordered or disordered proteins being more sampled in the different interactomes that could affect our results. Accordingly, we computed the fraction of ordered/disorders proteins in the proteome/interactome of each organism and calculated the ratio for each species comparison (supplemental Fig. 1). We found that although there are some minor differences, they are unlikely to affect our conclusions because they occur in both directions (i.e., either ordered or disordered proteins being slightly oversampled). Finally, it is worth noting that, to the best of our knowledge, this type of structural information was never used to design the very many interaction discovery experiments that are included in our study, being thus unlikely to cause any serious bias.

To compensate for the lack of detection sensitivity of the different interaction discovery methods, we implemented a procedure to predict likely conserved interactions in the interactome alignment procedure. Our strategy is founded on the concept of “interologs ” (i.e., orthologous pairs of interacting proteins) (39) and explicitly incorporates evolutionary considerations. In particular, we profit from the observation that interacting proteins evolve at rates significantly closer than expected by chance (40) (even within the same functional module (41)) to predict the probabilities for likely conserved interactions based on the difference of the evolutionary distances between the protein pairs involved in the interactions (see “Experimental Procedures”). The inclusion of likely conserved interactions notably increased the number of interactions considered by adding up to 4,822 interactions in the human versus fly comparison (Table I). We then repeated the above enrichment analysis and very consistently observed significant depletion of conserved interactions in disordered interactions with respect to ordered ones (p values in the range 2.07 × 10−128 to 1.58 × 10−25, two-sided Fisher's exact test), with disordered interactions still being 1.6–9.3-fold less conserved than ordered ones (Fig. 3B).

We also sought to exclude any potential artifact from our conclusions coming from false positives in the networks. Accordingly, we built high confidence networks by considering only those interactions reported by at least two independent publications or extracted from biological units in crystal structures. This significantly reduced the number of interactions considered to 14,337, 5,006, and 5,668 for human, fly, and yeast, respectively. We repeated the enrichment analysis and, again, found a 1.5–8.5-fold significant depletion of conserved disordered interactions (p values in the range 3.1 × 10−102 to 3.3 × 10−5, two-sided Fisher's exact test; Fig. 3C).

The results of this second set of enrichment analyses, where we deal with missing and dubious interactions, are very consistent with what we observed in complete interactome networks, showing a clear depletion in the conservation of disordered interactions. In addition, they also support the notion that, even if incomplete and error prone, current versions of interactomes already contain enough quality information to permit the analysis of their global and emerging properties (42).

Evolutionary Pressure on Network Rewiring

It has been repeatedly reported that well structured proteins are, in general, more conserved than disordered ones (43). We thus need to assess to what extent our observation that disordered interactions are significantly less conserved in interactome networks is due to the lower level of conservation of the individual interacting proteins or evolution is indeed acting on the interactions themselves: that is, to check whether interactions are not conserved because one of the interacting orthologs in the other species is missing or not detectable or because there is real rewiring of interactions among pairs of proteins that have orthologs in the other organism. Accordingly, we built interactome subnetworks consisting of those interactions whose protein components have detectable orthologs in the other organism. Note that because we are using conserved pools of proteins, interactome subnetworks for each organism do now vary depending on the species that they are compared with. As expected, the number of interactions decreased considerably, ranging from 9,873 for the human-fly comparison to 2,642 in the case of fly-yeast (Table I). We then reran the enrichment analysis and again observed the same trend: a 1.4–2.8-fold depletion of conserved disordered interactions (Fig. 3D), with p values in the range 3.13 × 10−14 to 2.58 × 10−3, in a two-sided Fisher's exact test. In this case, we found a considerable decrease of the depletion of conserved disordered interactions observed when comparing the human and fly interactomes to yeast, from 7.2- and 9.6-fold depletion to 1.4- and 1.9-fold, respectively. This effect is not observed when taking the yeast interactome as a template and comparing it with human and fly, and it can be attributed to the nonexistence of yeast orthologs for many human and fly proteins, whereas most yeast proteins have counterparts in the other species.

This final analysis reveals that, even accounting for the different levels of protein conservation by considering only common subnetworks, there is a clear depletion of conserved disordered interactions in interactomes. This suggests a selective pressure acting directly on the interactions, independent of that acting on the individual partners, that might have a direct effect on the rewiring of cell networks.

Concluding Remarks

Through the comprehensive and quantitative analysis of the conservation of protein-protein interactions across human, fly, and yeast, we have found that disordered interactions are significantly less conserved than those between well ordered interactors. By compensating for several caveats in current interactomes, we have shown that disordered interactions are much less conserved than their ordered counterparts not only because of the higher evolutionary rates of unstructured proteins but also because of their ability to rewire their interaction neighborhood by changing partners during evolution. Moreover, analyzing disordered interactions of known three-dimensional structure, likely involving disorder to order transitions upon binding, we found evidence that disorder plays a direct role at the interaction interface between proteins. Our findings complement previous reports showing that proteins exploit structural disorder to adapt their binding interfaces to several different interactors (20) and that disorder actively participates in the evolution of hubs, providing the molecular basis for their large interaction capacity (8, 16).

Overall, our results support the hypothesis that conservation of disorder gives a clear evolutionary advantage, facilitating the change of interaction partners during evolution. Moreover, this mechanism is not exclusive of a few anecdotal cases but a global feature present in the interactome networks of entire organisms.

Acknowledgments

We thank Miquel Pons, Pau Bernadó, Amelie Stein, and Arnaud Céol (Institute for Research in Biomedicine Barcelona) for helpful discussions and for critically reading the manuscript.

Footnotes

* This work was supported in part by Spanish Ministerio de Ciencia e Innovación Grants PSE-010000-2009-1 and BIO2010-22073) and the European Commission under FP7 Grant Agreement 223101 (AntiPathoGN). Roland Pache is a recipient of the Spanish FPU (Formación del Profesorado Universitario) fellowship.

Inline graphic This article contains supplemental materials.

1 The abbreviation used is:

IDP
intrinsically disordered protein.

REFERENCES

  • 1. Dunker A. K., Uversky V. N. (2008) Signal transduction via unstructured protein conduits. Nat. Chem. Biol. 4, 229–230 [DOI] [PubMed] [Google Scholar]
  • 2. Gsponer J., Futschik M. E., Teichmann S. A., Babu M. M. (2008) Tight regulation of unstructured proteins: From transcript synthesis to protein degradation. Science 322, 1365–1368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Tompa P. (2011) Unstructural biology coming of age. Curr. Opin. Struct. Biol. 21, 419–425 [DOI] [PubMed] [Google Scholar]
  • 4. Sugase K., Dyson H. J., Wright P. E. (2007) Mechanism of coupled folding and binding of an intrinsically disordered protein. Nature 447, 1021–1025 [DOI] [PubMed] [Google Scholar]
  • 5. Liu J., Faeder J. R., Camacho C. J. (2009) Toward a quantitative theory of intrinsically disordered proteins and their function. Proc. Natl. Acad. Sci. U.S.A. 106, 19819–19823 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Dyson H. J., Wright P. E. (2005) Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell Biol. 6, 197–208 [DOI] [PubMed] [Google Scholar]
  • 7. Vavouri T., Semple J. I., Garcia-Verdugo R., Lehner B. (2009) Intrinsic protein disorder and interaction promiscuity are widely associated with dosage sensitivity. Cell 138, 198–208 [DOI] [PubMed] [Google Scholar]
  • 8. Kim P. M., Sboner A., Xia Y., Gerstein M. (2008) The role of disorder in interaction networks: A structural analysis. Mol. Syst. Biol. 4, 179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Brown C. J., Johnson A. K., Daughdrill G. W. (2010) Comparing models of evolution for ordered and disordered proteins. Mol. Biol. Evol. 27, 609–621 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Brown C. J., Johnson A. K., Dunker A. K., Daughdrill G. W. (2011) Evolution and disorder. Curr. Opin. Struct. Biol. 21, 441–446 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Liu J., Tan H., Rost B. (2002) Loopy proteins appear conserved in evolution. J. Mol. Biol. 322, 53–64 [DOI] [PubMed] [Google Scholar]
  • 12. Schlessinger A., Schaefer C., Vicedo E., Schmidberger M., Punta M., Rost B. (2011) Protein disorder-a breakthrough invention of evolution? Curr. Opin. Struct. Biol. 21, 412–418 [DOI] [PubMed] [Google Scholar]
  • 13. Dunker A. K., Obradovic Z., Romero P., Garner E. C., Brown C. J. (2000) Intrinsic protein disorder in complete genomes. Genome Inform Ser. Workshop Genome Inform. 11, 161–171 [PubMed] [Google Scholar]
  • 14. Ward J. J., Sodhi J. S., McGuffin L. J., Buxton B. F., Jones D. T. (2004) Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J. Mol. Biol. 337, 635–645 [DOI] [PubMed] [Google Scholar]
  • 15. Shimizu K., Toh H. (2009) Interaction between intrinsically disordered proteins frequently occurs in a human protein-protein interaction network. J. Mol. Biol. 392, 1253–1265 [DOI] [PubMed] [Google Scholar]
  • 16. Dosztányi Z., Chen J., Dunker A. K., Simon I., Tompa P. (2006) Disorder and sequence repeats in hub proteins and their implications for network evolution. J. Proteome Res. 5, 2985–2995 [DOI] [PubMed] [Google Scholar]
  • 17. Dunker A. K., Cortese M. S., Romero P., Iakoucheva L. M., Uversky V. N. (2005) Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J 272, 5129–5148 [DOI] [PubMed] [Google Scholar]
  • 18. Haynes C., Oldfield C. J., Ji F., Klitgord N., Cusick M. E., Radivojac P., Uversky V. N., Vidal M., Iakoucheva L. M. (2006) Intrinsic disorder is a common feature of hub proteins from four eukaryotic interactomes. PLoS Comput. Biol. 2, e100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Patil A., Kinoshita K., Nakamura H. (2010) Domain distribution and intrinsic disorder in hubs in the human protein-protein interaction network. Protein Sci. 19, 1461–1468 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Gsponer J., Babu M. M. (2009) The rules of disorder or why disorder rules. Prog. Biophys. Mol. Biol. 99, 94–103 [DOI] [PubMed] [Google Scholar]
  • 21. Wagner A. (2003) How the global structure of protein interaction networks evolves. Proc. Biol. Sci. 270, 457–466 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Beltrao P., Serrano L. (2007) Specificity and evolvability in eukaryotic protein interaction networks. PLoS Comput. Biol. 3, e25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Shou C., Bhardwaj N., Lam H. Y., Yan K. K., Kim P. M., Snyder M., Gerstein M. B. (2011) Measuring the evolutionary rewiring of biological networks. PLoS Comput. Biol. 7, e1001050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Barabasi A. L., Albert R. (1999) Emergence of scaling in random networks. Science 286, 509–512 [DOI] [PubMed] [Google Scholar]
  • 25. Eisenberg E., Levanon E. Y. (2003) Preferential attachment in the protein network evolution. Phys. Rev. Lett. 91, 138701. [DOI] [PubMed] [Google Scholar]
  • 26. Kunin V., Pereira-Leal J. B., Ouzounis C. A. (2004) Functional evolution of the yeast protein interaction network. Mol. Biol. Evol. 21, 1171–1176 [DOI] [PubMed] [Google Scholar]
  • 27. Pagel M., Meade A., Scott D. (2007) Assembly rules for protein networks derived from phylogenetic-statistical analysis of whole genomes. BMC Evol. Biol. 7, (Suppl. 1) S16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Gibson T. A., Goldberg D. S. (2011) Improving evolutionary models of protein interaction networks. Bioinformatics 27, 376–382 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Robertson D. L., Lovell S. C. (2009) Evolution in protein interaction networks: Co-evolution, rewiring and the role of duplication. Biochem. Soc. Trans. 37, 768–771 [DOI] [PubMed] [Google Scholar]
  • 30. Levy E. D., Pereira-Leal J. B. (2008) Evolution and dynamics of protein interactions and networks. Curr. Opin. Struct. Biol. 18, 349–357 [DOI] [PubMed] [Google Scholar]
  • 31. Obradovic Z., Peng K., Vucetic S., Radivojac P., Dunker A. K. (2005) Exploiting heterogeneous sequence properties improves prediction of protein disorder. Proteins 61, (Suppl. 7) 176–182 [DOI] [PubMed] [Google Scholar]
  • 32. Peng K., Radivojac P., Vucetic S., Dunker A. K., Obradovic Z. (2006) Length-dependent prediction of protein intrinsic disorder. BMC Bioinformatics 7, 208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Bordoli L., Kiefer F., Schwede T. (2007) Assessment of disorder predictions in CASP7. Proteins 69, (Suppl. 8) 129–136 [DOI] [PubMed] [Google Scholar]
  • 34. Noivirt-Brik O., Prilusky J., Sussman J. L. (2009) Assessment of disorder predictions in CASP8. Proteins 77, (Suppl. 9), 210–216 [DOI] [PubMed] [Google Scholar]
  • 35. Rual J. F., Venkatesan K., Hao T., Hirozane-Kishikawa T., Dricot A., Li N., Berriz G. F., Gibbons F. D., Dreze M., Ayivi-Guedehoussou N., Klitgord N., Simon C., Boxem M., Milstein S., Rosenberg J., Goldberg D. S., Zhang L. V., Wong S. L., Franklin G., Li S., Albala J. S., Lim J., Fraughton C., Llamosas E., Cevik S., Bex C., Lamesch P., Sikorski R. S., Vandenhaute J., Zoghbi H. Y., Smolyar A., Bosak S., Sequerra R., Doucette-Stamm L., Cusick M. E., Hill D. E., Roth F. P., Vidal M. (2005) Towards a proteome-scale map of the human protein-protein interaction network. Nature 437, 1173–1178 [DOI] [PubMed] [Google Scholar]
  • 36. Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov I. N., Bourne P. E. (2000) The Protein Data Bank. Nucleic Acids Res. 28, 235–242 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Stumpf M. P., Thorne T., de Silva E., Stewart R., An H. J., Lappe M., Wiuf C. (2008) Estimating the size of the human interactome. Proc. Natl. Acad. Sci. U.S.A. 105, 6959–6964 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Venkatesan K., Rual J. F., Vazquez A., Stelzl U., Lemmens I., Hirozane-Kishikawa T., Hao T., Zenkner M., Xin X., Goh K. I., Yildirim M. A., Simonis N., Heinzmann K., Gebreab F., Sahalie J. M., Cevik S., Simon C., de Smet A. S., Dann E., Smolyar A., Vinayagam A., Yu H., Szeto D., Borick H., Dricot A., Klitgord N., Murray R. R., Lin C., Lalowski M., Timm J., Rau K., Boone C., Braun P., Cusick M. E., Roth F. P., Hill D. E., Tavernier J., Wanker E. E., Barabási A. L., Vidal M. (2009) An empirical framework for binary interactome mapping. Nat. Methods 6, 83–90 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Matthews L. R., Vaglio P., Reboul J., Ge H., Davis B. P., Garrels J., Vincent S., Vidal M. (2001) Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or “interologs.” Genome Res. 11, 2120–2126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Fraser H. B., Hirsh A. E., Steinmetz L. M., Scharfe C., Feldman M. W. (2002) Evolutionary rate in the protein interaction network. Science 296, 750–752 [DOI] [PubMed] [Google Scholar]
  • 41. Chen Y., Dokholyan N. V. (2006) The coordinated evolution of yeast proteins is constrained by functional modularity. Trends Genet. 22, 416–419 [DOI] [PubMed] [Google Scholar]
  • 42. Pujol A., Mosca R., Farrés J., Aloy P. (2010) Unveiling the role of network and systems biology in drug discovery. Trends Pharmacol. Sci. 31, 115–123 [DOI] [PubMed] [Google Scholar]
  • 43. Brown C. J., Takayama S., Campen A. M., Vise P., Marshall T. W., Oldfield C. J., Williams C. J., Dunker A. K. (2002) Evolutionary rate heterogeneity in proteins with long disordered regions. J. Mol. Evol. 55, 104–110 [DOI] [PubMed] [Google Scholar]
  • 44. Tweedie S., Ashburner M., Falls K., Leyland P., McQuilton P., Marygold S., Millburn G., Osumi-Sutherland D., Schroeder A., Seal R., Zhang H., Consortium F. (2009) FlyBase: Enhancing Drosophila Gene Ontology annotations. Nucleic Acids Res. 37, D555–D559 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Consortium U. (2010) The Universal Protein Resource (UniProt) in 2010. Nucleic Acids Res. 38, D142–D148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Aranda B., Achuthan P., Alam-Faruque Y., Armean I., Bridge A., Derow C., Feuermann M., Ghanbarian A. T., Kerrien S., Khadake J., Kerssemakers J., Leroy C., Menden M., Michaut M., Montecchi-Palazzi L., Neuhauser S. N., Orchard S., Perreau V., Roechert B., van Eijk K., Hermjakob H. (2010) The IntAct molecular interaction database in 2010. Nucleic Acids Res. 38, D525–D531 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Ceol A., Chatr Aryamontri A., Licata L., Peluso D., Briganti L., Perfetto L., Castagnoli L., Cesareni G. (2010) MINT, the molecular interaction database: 2009 update. Nucleic Acids Res. 38, D532–D539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Keshava Prasad T. S., Goel R., Kandasamy K., Keerthikumar S., Kumar S., Mathivanan S., Telikicherla D., Raju R., Shafreen B., Venugopal A., Balakrishnan L., Marimuthu A., Banerjee S., Somanathan D. S., Sebastian A., Rani S., Ray S., Harrys Kishore C. J., Kanth S., Ahmed M., Kashyap M. K., Mohmood R., Ramachandra Y. L., Krishna V., Rahiman B. A., Mohan S., Ranganathan P., Ramabadran S., Chaerkady R., Pandey A. (2009) Human Protein Reference Database: 2009 update. Nucleic Acids Res. 37, D767-D772 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Stark C., Breitkreutz B. J., Chatr-Aryamontri A., Boucher L., Oughtred R., Livstone M. S., Nixon J., Van Auken K., Wang X., Shi X., Reguly T., Rust J. M., Winter A., Dolinski K., Tyers M. (2011) The BioGRID Interaction Database: 2011 update. Nucleic Acids Res. 39, D698–D704 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Chautard E., Ballut L., Thierry-Mieg N., Ricard-Blum S. (2009) MatrixDB, a database focused on extracellular protein-protein and protein-carbohydrate interactions. Bioinformatics 25, 690–691 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Salwinski L., Miller C. S., Smith A. J., Pettit F. K., Bowie J. U., Eisenberg D. (2004) The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 32, D449–D451 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Altschul S. F., Madden T. L., Schäffer A. A., Zhang J., Zhang Z., Miller W., Lipman D. J. (1997) Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Remm M., Storm C. E., Sonnhammer E. L. (2001) Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J. Mol. Biol. 314, 1041–1052 [DOI] [PubMed] [Google Scholar]
  • 54. Berglund A. C., Sjölund E., Ostlund G., Sonnhammer E. L. (2008) InParanoid 6: Eukaryotic ortholog clusters with inparalogs. Nucleic Acids Res. 36, D263–D266 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Grishin N. V. (1995) Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites. J. Mol. Evol. 41, 675–679 [DOI] [PubMed] [Google Scholar]
  • 56. Kelley B. P., Sharan R., Karp R. M., Sittler T., Root D. E., Stockwell B. R., Ideker T. (2003) Conserved pathways within bacteria and yeast as revealed by global protein network alignment. Proc. Natl. Acad. Sci. U.S.A. 100, 11394–11399 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Molecular & Cellular Proteomics : MCP are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES