Abstract
The architecture of the network of protein–protein physical interactions in Saccharomyces cerevisiae is exposed through the combination of two complementary theoretical network measures, betweenness centrality and ‘Q-modularity’. The yeast interactome is characterized by well-defined topological modules connected via a small number of inter-module protein interactions. Should such topological inter-module connections turn out to constitute a form of functional coordination between the modules, we speculate that this coordination is occurring typically in a pairwise fashion, rather than by way of high-degree hub proteins responsible for coordinating multiple modules. The unique non-hub-centric hierarchical organization of the interactome is not reproduced by gene duplication-and-divergence stochastic growth models that disregard global selective pressures.
INTRODUCTION
The set of all physical protein–protein interactions in a cell—the interactome—presents a foundational picture for biology, sitting at the lowest level of description at which it is possible to have an holistic view of a cell rather then just an isolated study of its individual components. In this article, we make a small contribution to the ongoing effort to understand the global architecture of this fundamental physical network (1–10). The interactome can be represented in an abstract way as a network of nodes connected by links, where nodes stand for proteins and links for direct physical interactions between proteins. In recent years, there has been much interest in applying statistical mechanics to the study of such complex networks (11,12). However, the validity of such an approach is always conditional on the fundamental assumptions of statistical mechanics being satisfied. For instance, many statistical measures will not be able to distinguish between a network with an intrinsic hierarchical topology and one without it (Figure 1).
RESULTS AND DISCUSSION
Our approach sidesteps the limitations alluded to in the preceding paragraph and focuses directly on how the interactome topology relates to two broad biological concepts. The first of these, hierarchical organization, is in this context the notion that there may exist a hierarchy in the role of proteins (10). On one hand, there are proteins that perform very specific, local functions, relevant only within the context of a particular biological process. On the other hand, some proteins may possess a global, high level role, perhaps acting as mediators of distinct biological processes. To study the topological hierarchy in the interactome, we use the graph theoretical betweenness centrality measure (13–15) (Supplementary Data). Betweenness centrality (denoted ‘traffic’, henceforth) for a node is the total number of shortest paths (between any two other nodes) in the network that pass through that node. A high traffic value for a protein therefore correlates with that protein being topologically central in the interactome. The second of these is the concept of biological functional modularity (16). In the context of proteins, at the extreme, this takes the form of protein machines performing specific functions in a cell (7). More generally, it consists of an expectation that the density of protein–protein interactions will rise as we zoom into an increasingly functionally related set of proteins. To assess modularity in the interactome topology, we use the ‘Q-modularity’ measure of Newman (17), which assigns a modularity score, Q, to any given partition of the network into modules. The modularity Q is defined as the difference between the ratio (intra-module edges)/(total edges) for the network in question, and the expected value for this ratio if edges in the network were randomized, subject to every node maintaining its original degree. We use the algorithm of Clauset et al. (18) to find an interactome partition into modules that corresponds to a large Q value.
We now explain how to produce an interactome polar map (Figure 2) by combining the information contained in the modularity and traffic analyses. The position of every protein in the map is specified in terms of its radial and angular coordinates. The radial coordinate is a function of its traffic (19). More precisely, it is proportional to log(max traffic/protein's traffic), where ‘max traffic’ is the maximum node traffic in the network (Supplementary Data). A logarithmic scale is used due to the long tail of the traffic distribution (20). The protein angular coordinates are assigned such that all proteins in the same module fall within the same angular range (Supplementary Data). This way, an interactome map is created where topologically increasingly central proteins are radially increasingly closer to the center of the map, while angular sectors correspond to topological modules. To determine the circular ordering of the modules in the map, we introduce a Ring Ordering Algorithm that, based on the interactome inter-module connectivity, attempts to place closer to each other, to the extent that it is possible, modules that are more topologically related (Supplementary Data).
We apply this analysis and discuss its implications in the context of the Saccharomyces cerevisiae interactome. The interactome dataset we use is the higher confidence ‘filtered yeast interactome’ (FYI) (10), consisting of interactions supported either by small-scale screens as reported in the MIPS database (21) or by at least two distinct methods from amongst (i) high-throughput yeast two-hybrid experiments (22,23), (ii) computational predictions based on gene co-occurrence (24,25), gene neighborhood (24,26) or gene fusion (24,25), (iii) high-throughput affinity purification/mass spectrometric protein complex identification experiments (27,28) and (iv) small-scale or module-scale experimental identification of protein complexes as reported in MIPS (21) (Supplementary Data). We consider only the giant connected component produced by this dataset, a network containing 1741 interactions amongst 741 proteins. Our dataset choice reflects a desire to bias the data towards a thorough and accurate coverage of a limited region of the interactome as opposed to a wider, but likely shallower and more error prone, sampling. Note that when defining whether two proteins interact, a binary description is being imposed on what ideally would be characterized in terms of an affinity constant. Experimentally, effectively the aim has been for a cut-off that is high enough to exclude indiscriminate low-affinity interactions, such as those that occur between a general protein and proteasomal, ribosomal, or heat shock proteins, since in principle these are less informative interactions (29,30). Hence, such interactions are largely absent from datasets. With the vast majority of the proteins at the periphery of the map and well-defined modules connected through a handful of more central proteins, the yeast interactome polar map (Figure 2) presents what we term a coordinated-functionality architecture. Next, we discuss how this interactome architecture fits in with the biology of the cell.
An examination of the MIPS database functional (biological process) annotation of the proteins (21) demonstrates that the topological modules make very good sense as biological functional modules. On average 88.8% of the proteins in a module share a similar function based on their MIPS classification (21) (Supplementary Data), confirming earlier studies connecting topology and function (31–40). Part of the mismatch between the topological modules and current protein functional annotations will likely vanish, once more complete and accurate interactome datasets become available. However, it is the disparities not due to dataset limitations that are truly interesting, for those are instances where the functional modules based on the interactome topology are not the ones we currently assign in functional classification schema. In view of (i) the good overall match found and (ii) the foundational role of the interactome in the cell, we propose that the interactome topology represents a fundamental source for the division of proteins into functional groups. As such, its modularity analysis provides a rigorous alternative to the currently subjective functional annotation present in protein databases. In accordance, we name the topological modules found so as to reflect their perceived biological roles (Figure 2).
The average degree of essential (41,42) proteins in the dataset is 5.7, while that of non-essential proteins is 3.9, a difference that is too large to be attributable to chance alone (43) (Supplementary Data). This difference may indeed be biologically meaningful. Although physically knocking out a gene associated with a high or a low degree protein, say of degree 2 or 10 respectively, may be considered equivalent, from a mathematical network perspective it is not. In one case, it involves deleting 1 node and 2 links, in the other it involves deleting 1 node and 10 links. Note that such an explanation would not involve ascribing any out of the ordinary, higher-level role to hubs, the large degree proteins. Alternatively, the observed higher average degree of essential proteins could still stem from lingering systematic biases in the FYI dataset.
While the translation of the interactome modular topology into biological functional modularity is straightforward, this is not the case for the interactome topological hierarchical organization. A fundamental question that the traffic analysis gives rise to is how the interactome topological hierarchical organization phenomenologically expresses itself. For instance, do more hierarchically central proteins in fact perform a higher-level, coordinating role in the cell? At present, these are open biological questions.
Comparing proteins of equal degree, we find no significant correlation between a protein's traffic and its essentiality (41,42) (Supplementary Data), something not too surprising, as knocking out a key protein that renders an essential functional module inoperant is plausibly more damaging than knocking out a protein that mediates two distinct processes that nonetheless can still function independently. An intriguing, though at the moment still unsupported hypothesis is that, if a protein is disrupted, its traffic level correlates with the likelihood of causing non-lethal side effects in multiple areas of the biology of the cell. Speculating further, perhaps our representation of the interactome can provide clues as to where those side effects may arise—a matter of critical importance in drug development. A different possibility is that, for some of these module-connecting proteins, interacting with multiple modules is not a sign of a role in coordinating the functionality of the modules, but rather just a result of the protein being independently used in those modules. In opposition to a true functional ‘connector’, we would call such a protein a ‘bolt’ (alternatively, ‘widget’), in reference to how, analogously, a mechanical bolt can be used in multiple functional modules of a human engineered machine, while playing no role in coordinating their functionality. Finally, note that a priori the observed 47 inter-module interactions are particularly susceptible to be false-positives, because a false-positive interaction between two random unrelated proteins is likely to result in an inter-module, high traffic interaction. However, significantly, 45 out of these 47 inter-module interactions belong to the set of interactions supported by small-scale targeted experiments and arguably it is not very likely that a false-positive interaction of the type just described would go unnoticed in a targeted experiment and further make it into the peer-reviewed literature. Of the remaining 2 inter-module interactions, one is reported in the Ito et al. (23) and Uetz et al. (22) high-throughput yeast two-hybrid datasets as well as in the MIPS dataset of protein complexes identified via small- or module-scale experiments (21), while the other is reported in the Gavin et al. (27) high-throughput protein complex identification study and again in the Uetz et al. (22) yeast two-hybrid dataset. Out of the 45 supported by small-scale experiments, four are also reported in a high-throughput yeast two-hybrid study (22,23), three in a high-throughput protein complex identification study (27,28) and two in the MIPS dataset of protein complexes identified via small or module-scale experiments (21). Whatever biological role central proteins turn out to play, we submit that they call for further experimental investigation, given their unique topological placement in the interactome.
Having hierarchically classified the proteins with the traffic measure, we are now in a position to consider network degree distribution related questions without falling prey to previously noted statistical problems (Figure 1). Of particular interest is how degree changes as one moves hierarchically across the interactome (14) (Figure 3a). Surprisingly, nodes of different degree are rather homogeneously hierarchically spread across the interactome: note the large spread between the green, red and blue curves relative to their small positive slopes; or, for a more quantifiable attribute, how the 10% of nodes with the largest degree in the periphery of the interactome have a significantly larger degree than the average degree at the center of the interactome. Thus, the interactome is not hierarchically stratified by degree. In particular, the interactome has a non-hub-centric hierarchical organization. Further, note that should the inter-module protein interactions indeed represent a form of functional coordination, then this coordination is apparently occurring overwhelmingly in a pair-wise fashion: out of the 76 proteins that possess links to modules outside their own, only 4 connect their home module to more than one other module (TAF25 and SRP1 have links to four other modules, NUP1 and CLB2 to two other modules). The other 72 proteins connect their home module to a single other module (Supplementary Data). It is also noteworthy that, amongst these 76 connecting proteins, the higher degree ones in general belong clearly in their assigned home modules (specifically, for connecting proteins of degree 4 or higher, let us exclude the two interactions that each of these proteins must have by default to connect it to its home module and to one linked module; then, 95.3% of the remaining interactions of connecting proteins are with the protein's respective home module. Supplementary Data). This pairwise-coordination is in sharp contrast with the picture of a hub protein connecting and mediating multiple modules (11,43,44).
The non-hub-centric organization runs contrary to a number of network growth models that have been proposed to explain the topology of the interactome (11,45–48). The models are based on evolution by stochastic gene duplication and divergence (49,50). Amongst other reproduced statistics, the models are able to generate the power-law degree distribution observed in early interactome datasets (51). Since these models do not make an appeal to evolutionary selection pressures, a major conclusion taken from their success was that natural selection is not required to reproduce the global structure of the interactome; instead, stochastic gene duplication and divergence suffices to give rise to that topology (45,47). However, here we report that these gene duplication models lead to hierarchically hub-centric networks. In Figure 3b, we show data pertaining to an interactome built using the model of Pastor-Satorras et al. (45). By comparison with the same plot for the yeast interactome, this time the network is clearly stratified by degree, with the larger degree nodes concentrated at the hierarchical center of the network. Now it is the average degree at the center of the network that is significantly larger than the average degree of the 10% of nodes with largest degree at the periphery. The models of Vázquez (46) (slightly different implementation of the gene duplication and divergence process) and of Wagner (47) (emphasizing a continuous divergence in the form of gain and loss of interactions amongst existing proteins) produce similar hub-centric networks (Supplementary Data). In summary, there are at least three possible explanations for the non-hub-centric hierarchy we observe in yeast: (i) it is a spurious effect associated with the limitations of existing data and the interactome is in fact hub-centric; (ii) there is some crucial feature of the gene duplication and divergence mechanics that is not understood and/or is not captured by current models or (iii) the non-hub-centric hierarchy is in fact shaped by natural selection pressures on the global interactome structure.
Our study is based on the present-day knowledge of the yeast interactome, which is still rather deficient (52–58). We minimized false-positives using the higher confidence yeast FYI dataset as the source for our study. The good correspondence between the topological modular breakdown of the interactome and the known functional annotation of proteins corroborates that false-positives are not an overriding problem in this dataset. Nonetheless, we repeated the interactome analysis using a dataset of interactions reported in the MIPS database that are validated through small-scale screenings, the most reliable source of data (55) (giant component: 392 proteins, 675 interactions. Supplementary Data). The correspondence between topological and functional modules (now on average 92% of the proteins in a module shared a similar function based on their MIPS classification), as well as the reported non-hub-centric hierarchy were again supported by this small-scale dataset (Supplementary Data). Regarding the limited coverage of our datasets [FYI ≈12% coverage, small-scale ≈7% coverage, assuming ≈6000 proteins in yeast (23)], it is of note that the doubling in size of the network, going from the small-scale to the FYI dataset, did not dilute the observed non-hub-centric topology nor the pair-wise inter-module connectivity pattern. In fact, the FYI dataset produces an even slightly less hub-centric interactome than the small-scale dataset (Supplementary Data). Likewise, the pairwise inter-module connectivity pattern is no less present in the FYI than in the small-scale dataset (where out of the 47 proteins with inter-module links, four connect their home module to more than one other module. Supplementary Data). The observation that higher degree connecting proteins are in general strongly attached to their home modules is equally confirmed in both datasets (the earlier mentioned 95.3% of home module interactions for the FYI higher degree connecting proteins, now becomes 95.7% in the small-scale dataset). Still, it is important to bear in mind the limitations of current datasets. For instance, regarding the typical pairwise coordination, the possibility that this observation is only the result of a high-number of false-negatives for inter-module protein interactions cannot be ruled out. Note, for example, how in the map there are no interactions between the translation initiation module and the ribosomal subunit modules, even though such interactions must certainly exist. Ultimately, only the generation of more accurate and comprehensive interactome datasets can unequivocally confirm or disprove some of the results and hypotheses put forward in this article (52,53).
So far our analysis has focused on the global interactome topology. Now we would also like to highlight its potential as a framework for exploiting the wealth of interactome data. The interactome can form a valuable platform for crystalizing biological thought. We briefly introduce two relevant extensions to our work. First, one may zoom into a module of interest in the interactome and locally repeat the analysis, producing a single module polar map (Figures in Supplementary Data). Such a local map provides one with a starting point to discuss the biology of the process under study, interpret and design experiments, and generate new biological hypotheses. Second, we note that the entire proteome is rarely, if ever, evenly expressed by the cell (49). Therefore, perhaps the interactome is best viewed as a potential network at the cell's disposal, with different parts of it being turned on and off to different degrees, as biologically required. Integrating mRNA expression data with the interactome polarmap (59) (Figure 4), permits a proper, unified analysis of this dynamical network.
The interactome represents an elementary abstraction of the multitude of complex biochemical interactions taking place in the rich physiological environment of the cell. In the trade-off between simplicity and realism, arguably some facts may be beneficially incorporated in future interactome models: For instance, protein interactions vary within a continuum of binding affinities and post-translational modifications, as well as allosteric interactions effectively change the possible binding partners of a protein, to name a few of the more prominent omissions at present (49). We end by noting that the organization of a network through a procedure akin to the one used in this paper may also be of relevance to the problem of network motif finding (60), as certain motifs may turn out to occur sparsely overall and yet be statistically significant in specific regions of the network (e.g. only in the high traffic central area, or only in a particular module).
SUPPLEMENTARY DATA
Supplementary data are available at NAR online
Acknowledgments
A.V. thanks above all R. Fagerstrom, and also A. Sarkar S. Milstein, P.-O. Vidalain, M. Drezje, J. Han, S. Tee, K.-H. Lin and M. Boxem for their comments and support, as well as the entire Vidal lab at Dana-Farber for its hospitality during part of the year. Finally, A.V. is indebted to Howard Stone at Harvard University, under whose orientiation this work was initiated and to Phil Prorok, Gary Gao and Xiaoxia Lin for their unconditional support at a critical stage of this work. This work was supported by DFCI Sponsored Research. Funding to pay the Open Access publication charges for this article was provided by the NIH.
Conflict of interest statement. None declared.
REFERENCES
- 1.Eisenberg D., Marcotte E.M., Xenarios I., Yeates T.O. Protein function in the post-genomic era. Nature. 2000;405:823–826. doi: 10.1038/35015694. [DOI] [PubMed] [Google Scholar]
- 2.Lu H., Zhu X., Liu H., Skogerbo G., Zhang J., Zhang Y., Cai L., Zhao Y., Sun S., Xu J., et al. The interactome as a tree—an attempt to visualize the protein-protein interaction network in yeast. Nucleic Acids Res. 2004;32:4804–4811. doi: 10.1093/nar/gkh814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Vidal M. A biological atlas of functional maps. Cell. 2001;104:333–339. doi: 10.1016/s0092-8674(01)00221-5. [DOI] [PubMed] [Google Scholar]
- 4.Bader G.D., Heilbut A., Andrews B., Tyers M., Hughes T., Boone C. Functional genomics and proteomics: charting a multidimensional map of the yeast cell. Trends Cell Biol. 2003;13:344–356. doi: 10.1016/s0962-8924(03)00127-2. [DOI] [PubMed] [Google Scholar]
- 5.Sharom J.R., Bellows D.S., Tyers M. From large networks to small molecules. Curr. Opin. Chem. Biol. 2004;8:81–90. doi: 10.1016/j.cbpa.2003.12.007. [DOI] [PubMed] [Google Scholar]
- 6.Tucker C.L., Gera J.F., Uetz P. Towards an understanding of complex protein networks. Trends Cell Biol. 2001;11:102–106. doi: 10.1016/s0962-8924(00)01902-4. [DOI] [PubMed] [Google Scholar]
- 7.Alberts B. The cell as a collection of protein machines: preparing the next generation of molecular biologists. Cell. 1998;92:291–294. doi: 10.1016/s0092-8674(00)80922-8. [DOI] [PubMed] [Google Scholar]
- 8.Bork P., Jensen L.J., von Mering C., Ramani A.K., Lee I., Marcotte E.M. Protein interaction networks from yeast to human. Curr. Opin. Struc. Biol. 2004;14:292–299. doi: 10.1016/j.sbi.2004.05.003. [DOI] [PubMed] [Google Scholar]
- 9.Gavin A.-C., Superti-Fuga G. Protein complexes and proteome organization from yeast to man. Curr. Opin. Chem. Biol. 2003;7:21–27. doi: 10.1016/s1367-5931(02)00007-8. [DOI] [PubMed] [Google Scholar]
- 10.Han J.D., Bertin N., Hao T., Goldberg D.S., Berriz G.F., Zhang L.V., Dupuy D., Walhout A.J., Cusick M.E., Roth F.P., et al. Evidence for dynamically organized modularity in the yeast protein–protein interaction network. Nature. 2004;430:88–93. doi: 10.1038/nature02555. [DOI] [PubMed] [Google Scholar]
- 11.Barabási A.-L., Oltvai Z.N. Network biology: understanding the cell's functional organization. Nature Rev. Genet. 2004;112:101–114. doi: 10.1038/nrg1272. [DOI] [PubMed] [Google Scholar]
- 12.Newman M.E.J. The structure and function of complex networks. SIAM Rev. 2003;45:167–256. [Google Scholar]
- 13.Freeman L. A set of measures of centrality based upon betweenness. Sociometry. 1977;40:35–41. [Google Scholar]
- 14.Joy M.P., Brock A., Ingber D.E., Huang S. High betweenness proteins in the yeast protein interaction network. J. Biomed. Biotechnol. 2005;2:96–103. doi: 10.1155/JBB.2005.96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hahn M.W., Kern A.D. Comparative genomics of centrality and essentiality in three eukaryotic protein interaction networks. Mol. Biol. Evol. 2005;22:803–806. doi: 10.1093/molbev/msi072. [DOI] [PubMed] [Google Scholar]
- 16.Hartwell L.H., Hopfield J.J., Leibler S., Murray A.W. From molecular to modular cell biology. Nature. 1999;402:C47–C51. doi: 10.1038/35011540. [DOI] [PubMed] [Google Scholar]
- 17.Newman M.E.J. Fast algorithm for detecting community structure in networks. Phys. Rev. 2004;E69 doi: 10.1103/PhysRevE.69.066133. Art. No. 066133. [DOI] [PubMed] [Google Scholar]
- 18.Clauset A., Newman M.E.J., Moore C. Finding community structure in very large networks. Phys. Rev. 2004;E70 doi: 10.1103/PhysRevE.70.066111. art. no. 066111. [DOI] [PubMed] [Google Scholar]
- 19.Brandes U. Communicating centrality in policy network drawings. IEEE Trans Vis. Comput. Graph. 2003;9:241–253. [Google Scholar]
- 20.Goh K.-I., Kahng B., Kim D. Universal behavior of load distribution in scale-free networks. Phys. Rev. Lett. 2001;87 doi: 10.1103/PhysRevLett.87.278701. art. no. 278701. [DOI] [PubMed] [Google Scholar]
- 21.Mewes H.W., Frishman D., Guldener U., Mannhaupt G., Mayer K., Mokrejs M., Morgenstern D., Munsterkotter M., Rudd S., Weil B. MIPS: a database for genomes and protein sequences. Nucleic Acids Res. 2002;30:31–34. doi: 10.1093/nar/30.1.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Uetz P., Giota L, Cagney G., Mansfield T.A., Judson R.S., Knight J.R., Lockshon D., Narayan V., Srinivasan M., Pochart P. A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae. Nature. 2000;403:623–627. doi: 10.1038/35001009. [DOI] [PubMed] [Google Scholar]
- 23.Ito T., Chiba T., Ozawa R., Yoshida M., Hattori M., Sakaki Y. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Nature. 2001;403:623–627. doi: 10.1073/pnas.061034498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.von Mering C., Krause R., Snel B., Cornell M., Oliver S.G., Fields S., Bork P. Comparative assessment of large-scale data sets of protein–protein interactions. Nature. 2002;417:399–403. doi: 10.1038/nature750. [DOI] [PubMed] [Google Scholar]
- 25.Marcotte E.M., Pellegrini M., Thompson M.J., Yeates T.O., Eisenberg D. A combined algorithm for genome-wide prediction of protein function. Nature. 1999;402:83–86. doi: 10.1038/47048. [DOI] [PubMed] [Google Scholar]
- 26.Dandekar T., Snel B., Huynen M., Bork P. Conservation of gene order: a fingerprint of proteins that physically interact. Trends Biochem. Sci. 1998;23:324–328. doi: 10.1016/s0968-0004(98)01274-2. [DOI] [PubMed] [Google Scholar]
- 27.Gavin A.-C., Bosche M., Krause R., Grandi P., Marzioch M., Bauer A., Schultz J., Rick J.M., Michon A.M., Cruciat C.M., et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature. 2002;415:141–146. doi: 10.1038/415141a. [DOI] [PubMed] [Google Scholar]
- 28.Ho Y., Gruhler A., Heilbut A., Bader G.D., Moore L., Adams S.L., Millar A., Taylor P., Bennett K., Yang L., et al. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature. 2002;415:180–183. doi: 10.1038/415180a. [DOI] [PubMed] [Google Scholar]
- 29.Krause K., von Mering C., Bork P., Dandekar T. Shared components of protein complexes—versatile building blocks or biochemical artefacts? BioEssays. 2004;26:1333–1343. doi: 10.1002/bies.20141. [DOI] [PubMed] [Google Scholar]
- 30.Soti C., Pal C., Papp B., Csermely P. Molecular chaperones as regulatory elements of cellular networks. Curr. Opin. Cell. Biol. 2005;17:210–215. doi: 10.1016/j.ceb.2005.02.012. [DOI] [PubMed] [Google Scholar]
- 31.Schwikowski B., Uetz P., Fields S. A network of protein–protein interactions in yeast. Nat. Biotechnol. 2000;18:1257–1261. doi: 10.1038/82360. [DOI] [PubMed] [Google Scholar]
- 32.Rives A.W., Galitski T. Modular organization of cellular networks. Proc. Natl Acad. Sci. USA. 2003;3:1128–1133. doi: 10.1073/pnas.0237338100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Bu D., Zhao Y., Cai L., Xue H., Zhu X., Lu H., Zhang J., Sun S., Ling L., Zohang N., et al. Topological structure analysis of the protein–protein interaction network in budding yeast. Nucleic Acids Res. 2003;31:2443–2450. doi: 10.1093/nar/gkg340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Vázquez A., Flammini A., Maritan A., Vespignani A. Global protein function prediction from protein–protein interaction networks. Nat. Biotechnol. 2003;21:697–700. doi: 10.1038/nbt825. [DOI] [PubMed] [Google Scholar]
- 35.Spirin V., Mirny A. Protein complexes and functional modules in molecular networks. Proc. Natl Acad. Sci. USA. 2003;21:12123–12128. doi: 10.1073/pnas.2032324100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zhou H. Distance, dissimilarity and network community structure. Phys. Rev. 2003;E67 doi: 10.1103/PhysRevE.67.061901. art. no. 06101. [DOI] [PubMed] [Google Scholar]
- 37.Pereira-Leal J.B., Enright J.A., Ouzounis C.A. Detection of functional modules from protein interaction networks. Proteins. 2004;54:49–57. doi: 10.1002/prot.10505. [DOI] [PubMed] [Google Scholar]
- 38.Brun C., Herrmann C., Guénoche A. Clustering proteins from interaction networks for the prediction of cellular functions. BMC Bioinformatics. 2004;5 doi: 10.1186/1471-2105-5-95. Art. No. 95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dunn R., Dudbridge F., Sanderson C.M. The use of edge-betweenness clustering to investigate biological function in protein interaction networks. BMC Bioinformatics. 2005;6 doi: 10.1186/1471-2105-6-39. Art. no. 39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Nabieva E., Jim K., Agarwal A., Chazelle B., Singh M. Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps. Bioinformatics. 2005;21:I302–I310. doi: 10.1093/bioinformatics/bti1054. [DOI] [PubMed] [Google Scholar]
- 41.Winzeler E.A., Shoemaker D.D., Astromoff A., Liang H., Anderson K., Andre B., Baugham R., Boeke J.D., Bussey H., Chu A.M., Connelly C., et al. Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Science. 1999;285:901–906. doi: 10.1126/science.285.5429.901. [DOI] [PubMed] [Google Scholar]
- 42.Giaever G., Chu A.M., Ni L., Connelly C., Riles L., Veronneau S., Dow S., Lacau-Danila A., Anderson K., Andre B., Arkin A.P., et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002;418:387–391. doi: 10.1038/nature00935. [DOI] [PubMed] [Google Scholar]
- 43.Jeong H., Mason S.P., Barabási A.-L., Oltvai Z.N. Lethality and centrality in protein networks. Nature. 2001;411:41–42. doi: 10.1038/35075138. [DOI] [PubMed] [Google Scholar]
- 44.Ravasz E., Barabási A.-L. Hierarchical organization in complex networks. Phys. Rev. 2003;E67 doi: 10.1103/PhysRevE.67.026112. Art. no. 026112. [DOI] [PubMed] [Google Scholar]
- 45.Pastor-Satorras R., Smith B., Solé R.V. Evolving protein interaction networks through gene duplication. J. Theor. Biol. 2003;222:199–210. doi: 10.1016/s0022-5193(03)00028-6. [DOI] [PubMed] [Google Scholar]
- 46.Vázquez A., Flammini A., Maritan A., Vespignani A. Modeling protein in teraction networks. Complexus. 2003;1:38–44. [Google Scholar]
- 47.Wagner A. How the global structure of protein interaction networks evolves. Proc. Biol. Sci. 2003;270:457–466. doi: 10.1098/rspb.2002.2269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Middendorf M., Ziv E., Wiggins C.H. Inferring network mechanisms: the Drosophila melanogaster protein interaction network. Proc. Natl Acad. Sci. USA. 2005;102:3192–3197. doi: 10.1073/pnas.0409515102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Alberts B., Johnson A., Lewis J., Raff M., Roberts K., Walter P. Molecular Biology of the Cell. 4th edn. NY: Garland Science; 2002. [Google Scholar]
- 50.Ohno S. Evolution by Gene Duplication. Berlin: Springer; 1970. [Google Scholar]
- 51.Tanaka R., Yi T.-M., Doyle J. Some protein interaction data do not exhibit power law statistics. FEBS Lett. 2005;579:5140–5144. doi: 10.1016/j.febslet.2005.08.024. [DOI] [PubMed] [Google Scholar]
- 52.Cornell M., Paton N.W., Oliver S.G. A critical and integrated view of the yeast interactome. Comp. Funct. Genom. 2004;5:382–402. doi: 10.1002/cfg.412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Hakes L., Robertson D.L., Oliver S.G. Effect of dataset selection on the topological interpretation of protein interaction networks. BMC Genomics. 2005;6:131. doi: 10.1186/1471-2164-6-131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Ito T., Ota K., Kubota H., Yamayuchi Y., Chiba T., Sakuraba K., Yoshida M. Roles for the two-hybrid system in exploration of the yeast protein interactome. Mol. Cell Proteomics. 2002;1:561–566. doi: 10.1074/mcp.r200005-mcp200. [DOI] [PubMed] [Google Scholar]
- 55.von Mering C., Krause R., Snel B., Cornell M., Oliver S.G., Fields S., Bork P. Comparative assessment of large-scale data sets of protein–protein interactions. Nature. 2002;417:399–403. doi: 10.1038/nature750. [DOI] [PubMed] [Google Scholar]
- 56.Bader G.D., Hogue C.W. Analyzing yest protein–protein interaction data obtained from different sources. Nat. Biotechnol. 2002;20:991–997. doi: 10.1038/nbt1002-991. [DOI] [PubMed] [Google Scholar]
- 57.Aloy P., Russell R.B. The third dimension for protein interactions and complexes. Trends Biochem. Sci. 2002;27:633–638. doi: 10.1016/s0968-0004(02)02204-1. [DOI] [PubMed] [Google Scholar]
- 58.Edwards A.M., Kus B., Jansen R., Greenbaum D., Greenblatl J., Gerstein M. Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet. 2002;18:529–536. doi: 10.1016/s0168-9525(02)02763-4. [DOI] [PubMed] [Google Scholar]
- 59.Lichtenberg U., Jensen L.J., Brunak S., Bork P. Dynamic complex formation during the yeast cell cycle. Science. 2005;307:724–727. doi: 10.1126/science.1105103. [DOI] [PubMed] [Google Scholar]
- 60.Milo R., Shen-Orr S., Itzkovitz S., Kashtan N., Chklovskii D., Alon U. Network motifs: simple building blocks of complex networks. Science. 2002;298:824–827. doi: 10.1126/science.298.5594.824. [DOI] [PubMed] [Google Scholar]
- 61.Maslov S., Sneppen K. Specificity and stability in topology of protein networks. Science. 2002;296:910–913. doi: 10.1126/science.1065103. [DOI] [PubMed] [Google Scholar]
- 62.Gasch A.P., Spellman P.T., Kao C.M., Carmel-Harel O., Eisen M.B., Storz G., Botstein D., Brown P.O. Genomic expression programs in the response of yeast cells to envoironmental changes. Mol. Biol. Cell. 2000;11:4241–4257. doi: 10.1091/mbc.11.12.4241. [DOI] [PMC free article] [PubMed] [Google Scholar]