Author manuscript; available in PMC: 2014 Jul 1.
Published in final edited form as: Mol Biosyst. 2014 Jan;10(1):9–17. doi: 10.1039/c3mb70225a

Exploring mechanisms of human disease through structurally resolved protein interactome networks

Jishnu Das a,b, Robert Fragoza b,c,#, Hao Ran Lee a,b,#, Nicolas A Cordero b,#, Yu Guo b,c, Michael J Meyer a,b,d, Tommy V Vo b,c, Xiujuan Wang a,b, Haiyuan Yu a,b,*
PMCID: PMC4061614  NIHMSID: NIHMS590853  PMID: 24096645

Abstract

The study of the molecular basis of human disease has gained increasing attention over the past decade. With significant improvements in sequencing efficiency and throughput, a wealth of genotypic data has become available. However, the translation of this information into concrete advances in diagnostic and clinical setups has proved far more challenging. Two major reasons for this are the lack of functional annotation for genomic variants and the complex nature of genotype-to-phenotype relationships. One fundamental approach to bypass these issues is to examine the effects of genetic variation at the level of proteins, as they are directly involved in carrying out biological functions. Within the cell, proteins function by interacting with other proteins as a part of an underlying interactome network. This network can be determined using interactome mapping – a combination of high-throughput experimental toolkits and curation from small-scale studies. Integrating structural information from co-crystals with the network allows generation of a structurally resolved network. Within the context of this network, the structural principles of disease mutations can be examined and used to generate reliable mechanistic hypotheses regarding disease pathogenesis.

Introduction

Over the last decade and a half, there has been a dramatic increase in the efficiency and a substantial decrease in the cost of sequencing. With the sequencing of the human genome, there was the promise of significant advances in translational medicine.1,2 However, while there has been a rapid accumulation of genomic data, the corresponding expansion in our understanding of pathogenic processes has been much slower. There are two major reasons for this. First, while there has been an explosion in the accumulation of genomic variants and disease-associated mutations, most of them have not been functionally annotated (Fig. 1A). This is reflected in the fact that while the number of single-nucleotide polymorphisms (SNPs) available from dbSNP3 and disease-associated mutations from HGMD4 have grown 3500% and 260%, respectively, over the last twelve years, the number of FDA-approved drugs has grown only 20% (Fig. 1A). Second, the difficulty in obtaining functional annotation is primarily attributable to the complex relationships between genotype and phenotype. A single gene can affect multiple traits (gene pleiotropy) and the same trait can be linked to numerous causal genes (locus heterogeneity). Furthermore, epistasis brings additional complexity to genotype-to-phenotype relationships.5 To sidestep these complexities, numerous large-scale efforts have been undertaken to correlate sequence variants with an observable phenotype, but it has been difficult to extend the observed correlation into causation. This has often been the main critique of GWA-like studies6 and has resulted in a large fraction of phenotypes with unknown molecular mechanisms (Fig. 1B).

Fig. 1

Growth of genomic data and our understanding of pathogenesis. (A) Accumulation of dbSNP data, HGMD mutations, disease genes and drug targets over the past 12 years (number of dbSNP variations: ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/chr_rpts/; number of HGMD mutations: http://www.hgmd.cf.ac.uk/ac/hahaha.php; number of disease genes: ftp://ftp.eimb.ru/omim/; number of FDA-approved drugs: http://www.fda.gov/AboutFDA/WhatWeDo/History/ProductRegulation/SummaryofNDAApprovalsReceipts1938tothepresent). (B) Distribution of OMIM phenotype entries by knowledge of molecular basis (http://www.omim.org/statistics/entry).

One fundamental way to bypass the complexity of genotype-to-phenotype relationships is to directly examine the functional consequences of mutations and variants within coding regions at the protein level. Although a large number of variants are in non-coding regions, it has been shown that disease mutations and trait-associated SNPs are enriched in coding regions.7 Moreover, within the cellular environment, proteins rarely act in isolation. Interactions between proteins within the cell define major functional pathways crucial to physiological processes. The set of all interactions within the cell, or the protein interactome, can be represented as a network in which proteins are nodes and interactions between them are undirected edges. Maintenance of this network is critical to cellular function, and disease phenotypes can be viewed as perturbations to this network.8–10 Thus, the protein network can be used to gain insights into complex dependencies in pathogenic processes.8,9 It has also been shown to be useful in understanding disease sub-types and predicting disease prognosis.11,12 However, one limitation of this approach is that while such a representation is inherently two-dimensional, proteins are complex macromolecules with intricate three-dimensional structures. In this review, we outline experimental techniques used to identify protein–protein interactions and discuss recent methods developed to overlay structural information onto these interactions to construct structurally resolved protein networks. We then elucidate the importance of these networks in understanding molecular mechanisms of human disease.
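The network representation described above (proteins as nodes, interactions as undirected edges) can be sketched in a few lines of Python. This is a minimal illustration rather than any published pipeline: the function names are assumptions, and `PROTEIN_Z` is a hypothetical placeholder partner, while the MLH1–PMS2 and CBS–CBS pairs are taken from interactions discussed later in this review.

```python
from collections import defaultdict


def build_interactome(interactions):
    """Build an undirected network: proteins are nodes, interactions are edges."""
    network = defaultdict(set)
    for a, b in interactions:
        network[a].add(b)
        network[b].add(a)  # undirected edge; a homodimer (a == b) becomes a self-loop
    return dict(network)


def degree(network, protein):
    """Number of distinct interaction partners of a protein."""
    return len(network.get(protein, set()))


# Toy edge list. PROTEIN_Z is a hypothetical placeholder, not a real protein.
edges = [("MLH1", "PMS2"), ("CBS", "CBS"), ("MLH1", "PROTEIN_Z")]
net = build_interactome(edges)
```

An adjacency map of sets keeps edge lookups cheap and makes the symmetry of undirected edges explicit, but it stores no three-dimensional information, which is the limitation noted above.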

High-throughput experimental toolkit for interactome mapping

There are two ways in which protein interactome networks are determined – literature-curation of small-scale studies and high-throughput (HT) experiments. In literature curation, interaction data are collected from thousands of small-scale studies each of which focuses on one or a few proteins and their interactions. On the other hand, HT experiments are much larger in scale and are typically set up as an unbiased screen of a large space. The repertoire of techniques used to determine these networks using such experiments is referred to as interactome mapping.13

Interactome mapping can generate binary interactions and co-complex associations.14,15 The former represents direct biophysical interactions between two proteins while the latter merely denotes membership of a complex and can often include indirect associations. There are several widely-used databases – BioGrid,16 IntAct,17 HPRD,18 iRefWeb,19 DIP,20 MINT,21 MIPS22 and VisAnt23 – that curate both categories of interactions for humans and other model organisms. However, it has been shown that the same degree of confidence cannot be associated with all interactions and those that have been validated by only one assay typically tend to be of lower quality than those that are validated by two or more assays.14,24,25 Numerous hypothesis-driven studies rely on specific interactions to design downstream experiments. Using low-quality or erroneous interactions could lead to incorrect hypotheses and futile downstream experiments. To address this, we built a repository of high-quality protein interactome networks – HINT.15 HINT also distinguishes between interactions curated from small-scale studies and those obtained from high-throughput experiments. This is essential because it has been shown that small-scale studies often contain sampling biases that make networks generated using them unsuitable for global topological analyses.14,15 In this review, we discuss five major high-throughput assays that can be used to generate binary interactome networks. To construct structurally resolved networks, it is essential for the interactions to be binary because the concept of interaction interface does not apply to indirect associations.
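The filtering idea in this paragraph, keeping only binary interactions and preferring those supported by two or more assays, can be illustrated as follows. This is a sketch under stated assumptions and not the actual HINT procedure: the record layout (`a`, `b`, `assay`, `type`), the function name, and the threshold of two assays are all illustrative choices, and the proteins `P1` to `P5` are hypothetical.

```python
def high_quality_binary(records, min_assays=2):
    """Keep binary interactions supported by at least `min_assays` distinct assays."""
    support = {}
    for rec in records:
        if rec["type"] != "binary":
            continue  # co-complex associations may be indirect, so skip them
        pair = frozenset((rec["a"], rec["b"]))  # order-independent key
        support.setdefault(pair, set()).add(rec["assay"])
    return {pair: assays for pair, assays in support.items()
            if len(assays) >= min_assays}


# Hypothetical evidence records, for illustration only.
records = [
    {"a": "P1", "b": "P2", "assay": "Y2H", "type": "binary"},
    {"a": "P2", "b": "P1", "assay": "PCA", "type": "binary"},
    {"a": "P1", "b": "P3", "assay": "Y2H", "type": "binary"},
    {"a": "P4", "b": "P5", "assay": "AP-MS", "type": "co-complex"},
    {"a": "P4", "b": "P5", "assay": "co-IP", "type": "co-complex"},
]
filtered = high_quality_binary(records)
```

Using a `frozenset` as the key means that P1–P2 and P2–P1 are counted as the same undirected interaction when their evidence is pooled.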

Yeast two-hybrid (Y2H) (Fig. 2A) was developed by Stanley Fields and Ok-Kyu Song as a genetic system to identify protein–protein interactions.26 The assay relies on the split functionality of particular eukaryotic transcription factors, for example Gal4, in which the transcription factor is split into two parts: a sequence-specific DNA-binding domain (DB) and a transcriptional-activation domain (AD). Protein–protein interactions are tested by fusing a “bait” protein X to the DB and fusing a “prey” protein Y to the AD. Each fusion protein is then expressed in haploid strains of yeast of opposite mating type. Upon mating, if protein X and Y interact, transcription factor activity will be reconstituted, allowing for downstream reporter gene expression and diploid yeast growth on selective media. The original system has undergone numerous technical modifications to make it amenable to high throughput with improved assay precision and sensitivity.27,28

Fig. 2

Schematic representations of high-throughput assays used to generate binary interactome networks. (A) Yeast two-hybrid (Y2H). (B) Protein fragment complementation assays (PCA). (C) Luminescence-based mammalian interactome mapping (LUMIER). (D) Well-based nucleic acid programmable protein array (wNAPPA). (E) Mammalian protein–protein interaction trap (MAPPIT). (F) A high-quality reference human binary interactome comprising ~40 000 interactions generated from several large-scale interactome mapping efforts and thousands of small-scale studies.

Protein complementation assay (PCA) (Fig. 2B) is another popular approach for testing protein–protein interactions using mammalian cells. Similar to Y2H, in PCA, a fluorescent protein such as yellow fluorescent protein (YFP) (or an enzyme such as TEM-1 β-lactamase) is split into N- and C-terminal fragments, which are then fused to a bait protein X and a prey protein Y, respectively. If X and Y interact, YFP activity is reconstituted, which can be observed by fluorescence microscopy or in high-throughput by using a plate reader.29 Unlike Y2H, though, detectable protein–protein interactions are not limited to the nucleus. Thus, PCA can serve as a suitable assay for probing protein interactions at their native localizations in intact, living cells.

In luminescence-based mammalian interactome mapping (LUMIER) (Fig. 2C), a bait protein X is fused to Renilla or firefly luciferase enzyme and then co-expressed with a FLAG-tagged prey protein Y in mammalian HEK293T cells. Interaction between proteins X and Y can then be assayed by anti-FLAG immunoprecipitation of protein Y. Luciferase bioluminescence is then measured to detect whether the luciferase-fused protein X was pulled down with Y.30 Recent modifications allow LUMIER to be carried out in a high-throughput fashion using 96-well plates while also offering an improved quantitative readout.31

Well-based nucleic acid programmable protein array (wNAPPA) (Fig. 2D) is an in vitro assay that begins with two expression vectors encoding a glutathione-S-transferase (GST)-tagged protein X and a hemagglutinin (HA)-tagged protein Y, respectively, which are anchored in an anti-GST antibody-coated plate well. In vitro transcription and translation of chimeric proteins X and Y is then triggered by introducing rabbit reticulocyte lysate to the wells. Translated GST-tagged protein X will then bind to the anti-GST antibodies coating the well. A washing step then follows, in which protein Y will remain in the well post-wash only if it interacts with protein X. The presence of protein Y – and therefore an interaction between proteins X and Y – is then detected by attaching horseradish peroxidase (HRP)-conjugated secondary antibodies specific to HA-tagged protein Y and then measuring HRP-induced chemiluminescence.32

Mammalian protein–protein interaction trap (MAPPIT) (Fig. 2E) is based upon JAK-STAT signaling pathways. In JAK-STAT signaling, ligand-bound cytokine receptor complexes reorganize themselves, in turn activating tethered Janus kinases (JAKs). Activated JAKs then phosphorylate tyrosine residues along the tails of the receptor complex, which then serve as docking sites for signal transducer and activator of transcription (STAT) proteins. Receptor tail-docked STATs are next phosphorylated and activated by JAKs; the activated STATs then migrate to the nucleus to trigger STAT-dependent reporter gene activity. MAPPIT, in contrast, uses a modified receptor complex that is split into two fragments: (1) a membrane-bound receptor that still permits JAK2 activation but carries mutated tyrosine residues to prevent STAT3 docking and (2) a receptor tail fragment containing STAT3 binding sites. Fragments 1 and 2 are then fused to bait protein X and prey protein Y, respectively. If proteins X and Y interact, JAK2 will activate STAT3 in trans, leading to STAT3-dependent reporter gene activity.33

Numerous studies have also tried to predict protein interactions based on machine-learning approaches34 or known co-crystal structures.35–37 However, only those predictions that have been experimentally validated can be considered high quality. Thus, by combining data from several large-scale interactome mapping efforts24,28,38,39 (that use the above techniques) with thousands of small-scale studies, a high-quality reference human binary interactome comprising ~40 000 interactions (Fig. 2F) can be generated. This is the first step towards producing a structurally resolved network.

Structurally resolved interactome networks

The reference interactome network has been widely used to try and understand the molecular basis of human disease.8 Numerous methods have been used to predict disease-associated genes,40 most of which rely heavily on a global “guilt-by-association” principle.41 Thus, if a particular gene is associated with a disease, the assumption is that all the interacting partners of the protein encoded by that gene are also associated with that disease. Such an understanding can be quite simplistic as the reference interactome is merely a two-dimensional representation and does not take into account the 3D structures of interacting proteins. Consequently, the percentage of successful predictions using such approaches is quite low.42 Since most interacting proteins share only a few of their associated disorders, it is essential to incorporate structural information regarding the location of disease mutations to make the predictions more accurate. This necessitates the construction of a structurally resolved interactome network.
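The global "guilt-by-association" principle described above can be written down as a very small procedure, which also makes its simplicity apparent. The sketch below is a hedged illustration of the principle as stated in the text, not a reimplementation of any cited method; the function name, the toy network, and the gene and disease labels are all hypothetical.

```python
def guilt_by_association(network, known_disease_genes):
    """Naively assign every disease of a gene to all of its interaction partners."""
    predictions = {}
    for gene, diseases in known_disease_genes.items():
        for partner in network.get(gene, set()):
            if partner == gene:
                continue  # a self-interaction adds no new candidate
            predictions.setdefault(partner, set()).update(diseases)
    return predictions


# Hypothetical toy network (adjacency sets) and annotations.
toy_network = {
    "GENE_A": {"GENE_B", "GENE_C"},
    "GENE_B": {"GENE_A"},
    "GENE_C": {"GENE_A"},
}
known = {"GENE_A": {"disorder 1", "disorder 2"}}
predicted = guilt_by_association(toy_network, known)
```

Every partner of `GENE_A` inherits both disorders here, regardless of which part of the protein each interaction uses. That is the over-prediction the paragraph attributes to a purely two-dimensional view of the network.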

Over the last two decades, there have been systematic efforts to structurally classify proteins into families43,44 based on domain architecture.45 This has been used to identify domain–domain interactions of known three-dimensional complexes of interacting proteins.46,47 However, the biggest challenge in constructing a structurally resolved network from these domain–domain interactions is posed by the relatively low number of available co-crystal structures compared to the amount of available proteomic network data. Co-crystal structures are not available for >90% of available binary protein–protein interactions. Moreover, complete individual structures are available for only about 10% of interacting proteins. Mosca et al. present a comprehensive analysis that highlights the paucity of experimentally determined crystal structures compared to the number of known binary interactions.48 Thus, it is essential to build structural models both to model individual proteins49 and infer interaction interfaces.

Gerstein and colleagues took the first step in this direction and used sequence similarity to compare interacting proteins with known co-crystal complexes. The authors constructed a structurally resolved yeast protein interactome to gain insight into evolutionary rates of network hubs with distinct types of interaction interfaces.50 Schuster-Bockler and Bateman focused on using a sequence-based homology approach to analyze the sites of disease-associated mutations with respect to protein interaction interfaces. Their work indicated that only about 4% of these mutations could interfere with protein–protein interactions.51 Prieto et al. built a repository of unified structural domain–domain interactions by systematically comparing six main structural domain–domain interaction sources that are based on Protein Data Bank (PDB) structures.52 The first structurally resolved human-virus protein–protein interaction network, constructed by Franzosa and Xia, showed that it is common for viruses to mimic host binding interfaces even without structural similarity to the human counterparts.53

Recently, we constructed a high-quality structurally resolved human binary protein interactome network using either co-crystal structures in the PDB or a homology-based interaction interface domain inference method.54 A comprehensive list of 62 663 Mendelian mutations in 3949 protein-coding genes associated with 3453 clinically distinct disorders was curated from Online Mendelian Inheritance in Man (OMIM) and Human Gene Mutation Database (HGMD), and then mapped to the structurally resolved interactome (Fig. 3A). We found that in-frame mutations are significantly enriched within interacting domains of disease-associated proteins. Furthermore, we observed that the likelihood of two in-frame mutations on the corresponding interacting domains of interacting proteins to cause the same disorder is significantly higher than that of corresponding pairs on non-interacting domains (Fig. 3B). In addition, we saw that in-frame mutation pairs on different interaction interfaces tend to cause different disorders than those on the same interface (Fig. 3C).54 These results help explain locus heterogeneity and gene pleiotropy, respectively – the alteration of specific interactions by mutations at the corresponding interface plays an important role in the pathogenesis of many disease genes. This also helps us refine the traditional guilt-by-association principle – mutations at different structural loci on the same protein can cause different diseases through disruption of separate interactions (Fig. 3D). We also used our interface inference approach to generate structurally resolved interactome networks for several other model organisms and established a database of high-quality structurally resolved protein–protein interactions, INstruct.55 Mosca et al. also used a similarly motivated structural alignment approach to infer interaction interfaces for networks in humans and eight other model organisms. Their results also suggested that structural annotation of pathways could help rationalize the mechanism of action of disease mutations.48 Thus, using structurally resolved interactome networks, it is now possible to gain insights at the molecular level into protein function and its alteration.

Fig. 3

Structurally resolved interactome networks and human disease. (A) Construction of a structurally resolved interactome network onto which disease mutations are mapped. (B) Percentage of mutation pairs on two proteins that cause the same disease. (C) Percentage of mutation pairs on the same protein that cause different diseases. (D) A higher resolution of the guilt-by-association principle – mutations at different structural loci on the same protein that cause different diseases [(B) and (C) are adapted from ref. 54].

Towards a mechanistic understanding of human disease

Human disease can be viewed as a rewiring of the reference interactome through loss or gain of interactions.8 Zhong et al. experimentally showed that disease mutations could alter the underlying interactome by edge-specific changes i.e., altering specific interactions or node-specific changes i.e., leading to complete loss of protein products.10 One example they demonstrated was the disruption of the homodimeric CBS interaction (i.e., interaction of CBS with itself) by a homocystinuria associated P145L mutation (Fig. 4A). On the other hand, a homocystinuria associated P49L mutation did not disrupt the interaction – the interaction was “pseudo wild-type” (Fig. 4A). Upon examining the interface of this protein–protein interaction (which can be obtained from the co-crystal with PDB id: 1JBQ56) using our structurally resolved network approach, we found that the P145L mutation is within the interface whereas the P49L mutation is outside the interface. We also showed that each of the three distinct colorectal cancer associated mutations on the interaction interface of MLH1 (I68N, I107R and Y293D) with PMS2 disrupted the interaction while any of the three other colorectal-cancer associated mutations (N338S, Y561H and R725C) outside the interface did not disrupt the interaction54 (Fig. 4B). In this case, the interface was inferred using our homology-based interaction interface inference method.54 These studies further establish the view that mutations at the interface can disrupt specific interactions leading to human disease.
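The reasoning in this paragraph, checking whether a missense mutation falls inside or outside an interaction interface, can be sketched as a simple residue-position lookup. The code below is an illustration under loudly stated assumptions: the interface residue range for MLH1 is a made-up placeholder chosen only so that the toy output mirrors the classification reported in the text, and it is not the interface inferred in ref. 54. The function names are likewise hypothetical.

```python
import re


def parse_substitution(mutation):
    """Split a label such as 'P145L' into (wild-type, position, mutant)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"not a simple substitution: {mutation!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def in_interface(mutation, interface_ranges):
    """True if the mutated residue lies in any (start, end) interface range."""
    _, position, _ = parse_substitution(mutation)
    return any(start <= position <= end for start, end in interface_ranges)


# PLACEHOLDER range, NOT the real MLH1-PMS2 interface: an assumption for illustration.
hypothetical_mlh1_interface = [(50, 300)]

mutations = ["I68N", "I107R", "Y293D", "N338S", "Y561H", "R725C"]
labels = {m: in_interface(m, hypothetical_mlh1_interface) for m in mutations}
```

In a real analysis the residue ranges would come from a co-crystal structure or from a homology-based interface inference, and a mutation inside the interface would be a hypothesis about disruption to be tested experimentally, not a conclusion.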

Fig. 4

Functional consequences of human disease mutations. (A) Illustration of the interface of the CBS homodimer as obtained from a co-crystal and the location of mutations that do/do not disrupt the interaction. (B) Illustration of the predicted interface of the MLH1–PMS2 interaction and the location of mutations that do/do not disrupt the interaction. (C) Schematic representation of changes caused by disease mutations to the interactome network. (D) Summary of the pipeline used to construct 3D interactome networks to understand disease pathogenesis.

In general, there are three kinds of possible changes to the interactome network – loss of a protein (and all its interactions), loss of a specific interaction, and gain of a specific interaction (Fig. 4C).10 To be able to truly understand human disease, it is necessary to experimentally analyze relationships between the structural loci of mutations and each of these alteration types at a proteomic scale. Since the vast majority of interactions do not have corresponding co-crystal structures, it is also necessary to develop better computational models that help us accurately determine the structural locations of mutations. Combining co-crystal structures with these computational models will help generate a comprehensive atlas of protein–protein interactions that is of ubiquitous importance in understanding pathogenic processes.57,58 Such an atlas can be generated by integrative methods that incorporate both experimental and computational approaches and is likely to be highly successful in elucidating the mechanistic basis of human disease caused by rewiring of the underlying protein interactome network.
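The three kinds of change listed above (loss of a protein, loss of a specific interaction, gain of a specific interaction) correspond to three elementary operations on a network stored as adjacency sets. The following is a minimal sketch with hypothetical function names and a toy network of placeholder proteins. Each operation returns a new network and leaves the reference network untouched.

```python
def _copy(network):
    """Copy a network stored as {protein: set of partners}."""
    return {protein: set(partners) for protein, partners in network.items()}


def lose_protein(network, protein):
    """Node-specific change: remove a protein and all of its interactions."""
    new = _copy(network)
    new.pop(protein, None)
    for partners in new.values():
        partners.discard(protein)
    return new


def lose_interaction(network, a, b):
    """Edge-specific change: remove one interaction, keep both proteins."""
    new = _copy(network)
    new.get(a, set()).discard(b)
    new.get(b, set()).discard(a)
    return new


def gain_interaction(network, a, b):
    """Edge-specific change: add an interaction absent from the reference."""
    new = _copy(network)
    new.setdefault(a, set()).add(b)
    new.setdefault(b, set()).add(a)
    return new


# Hypothetical reference network: A interacts with B and with C.
reference = {"A": {"B", "C"}, "B": {"A"}, "C": {"A"}}
without_a = lose_protein(reference, "A")
without_ab = lose_interaction(reference, "A", "B")
with_bc = gain_interaction(reference, "B", "C")
```

Comparing `without_a` with `without_ab` shows why the distinction matters: removing the node erases both of A's interactions, whereas the edge-specific change leaves the A–C interaction intact.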

Conclusion

A key bottleneck in translational medicine has been the sharp imbalance between the number of available genomic variants and the number of well-understood disease mechanisms. The complex nature of genotype-to-phenotype relationships has made functional annotation of variants an extremely challenging problem. Analyzing alterations at the proteomic level promises to offer possible solutions to these problems as human disease can be viewed as altered protein function. Since proteins mediate cellular functions by interacting with other proteins, it is necessary to examine these changes in the context of the underlying network of protein–protein interactions. A combination of high-throughput experiments and literature curation is being used to generate the reference human protein interactome network. By incorporating structural details of proteins involved in these interactions, it is possible to generate a structurally resolved network. Within the context of this network, it is possible to examine structural details of disease-causing mutations and generate mechanistic hypotheses regarding pathogenesis (Fig. 4D). Follow-up of these hypotheses is likely to uncover key functional principles underlying human disease and identify more reliable drug targets.

References

  • 1.Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blocker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey 
JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
  • 2.Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD, Zheng XH, Chen L, Skupski M, Subramanian G, Thomas PD, Zhang J, Miklos GLG, Nelson C, Broder S, Clark AG, Nadeau J, McKusick VA, Zinder N, Levine AJ, Roberts RJ, Simon M, Slayman C, Hunkapiller M, Bolanos R, Delcher A, Dew I, Fasulo D, Flanigan M, Florea L, Halpern A, Hannenhalli S, Kravitz S, Levy S, Mobarry C, Reinert K, Remington K, Abu-Threideh J, Beasley E, Biddick K, Bonazzi V, Brandon R, Cargill M, Chandramouliswaran I, Charlab R, Chaturvedi K, Deng Z, Di Francesco V, Dunn P, Eilbeck K, Evangelista C, Gabrielian AE, Gan W, Ge W, Gong F, Gu Z, Guan P, Heiman TJ, Higgins ME, Ji RR, Ke Z, Ketchum KA, Lai Z, Lei Y, Li Z, Li J, Liang Y, Lin X, Lu F, Merkulov GV, Milshina N, Moore HM, Naik AK, Narayan VA, Neelam B, Nusskern D, Rusch DB, Salzberg S, Shao W, Shue B, Sun J, Wang Z, Wang A, Wang X, Wang J, Wei M, Wides R, Xiao C, Yan C, Yao A, Ye J, Zhan M, Zhang W, Zhang H, Zhao Q, Zheng L, Zhong F, Zhong W, Zhu S, Zhao S, Gilbert D, Baumhueter S, Spier G, Carter C, Cravchik A, Woodage T, Ali F, An H, Awe A, Baldwin D, Baden H, Barnstead M, Barrow I, Beeson K, Busam D, Carver A, Center A, Cheng ML, Curry L, Danaher S, Davenport L, Desilets R, Dietz S, Dodson K, Doup L, Ferriera S, Garg N, Gluecksmann A, Hart B, Haynes J, Haynes C, Heiner C, Hladun S, Hostin D, Houck J, Howland T, Ibegwam C, Johnson J, Kalush F, Kline L, Koduru S, Love A, Mann F, May D, McCawley S, McIntosh T, McMullen I, Moy M, Moy L, Murphy B, Nelson K, Pfannkoch C, Pratts E, Puri V, Qureshi H, Reardon M, Rodriguez R, Rogers YH, Romblad D, Ruhfel B, Scott R, Sitter C, Smallwood M, Stewart E, Strong R, Suh E, Thomas R, Tint NN, Tse S, Vech C, Wang G, Wetter J, Williams S, Williams M, Windsor S, Winn-Deen E, Wolfe K, Zaveri J, Zaveri K, Abril JF, Guigo R, Campbell MJ, Sjolander KV, Karlak B, Kejariwal A, Mi H, Lazareva 
B, Hatton T, Narechania A, Diemer K, Muruganujan A, Guo N, Sato S, Bafna V, Istrail S, Lippert R, Schwartz R, Walenz B, Yooseph S, Allen D, Basu A, Baxendale J, Blick L, Caminha M, Carnes-Stine J, Caulk P, Chiang YH, Coyne M, Dahlke C, Mays A, Dombroski M, Donnelly M, Ely D, Esparham S, Fosler C, Gire H, Glanowski S, Glasser K, Glodek A, Gorokhov M, Graham K, Gropman B, Harris M, Heil J, Henderson S, Hoover J, Jennings D, Jordan C, Jordan J, Kasha J, Kagan L, Kraft C, Levitsky A, Lewis M, Liu X, Lopez J, Ma D, Majoros W, McDaniel J, Murphy S, Newman M, Nguyen T, Nguyen N, Nodell M, Pan S, Peck J, Peterson M, Rowe W, Sanders R, Scott J, Simpson M, Smith T, Sprague A, Stockwell T, Turner R, Venter E, Wang M, Wen M, Wu D, Wu M, Xia A, Zandieh A, Zhu X. Science. 2001;291:1304–1351. doi: 10.1126/science.1058040. [DOI] [PubMed] [Google Scholar]
  • 3.Smigielski EM, Sirotkin K, Ward M, Sherry ST. Nucleic Acids Res. 2000;28:352–355. doi: 10.1093/nar/28.1.352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stenson PD, Mort M, Ball EV, Howells K, Phillips AD, Thomas NS, Cooper DN. Genome Med. 2009;1:13. doi: 10.1186/gm13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Cordell HJ. Hum. Mol. Genet. 2002;11:2463–2468. doi: 10.1093/hmg/11.20.2463. [DOI] [PubMed] [Google Scholar]
  • 6.Dermitzakis ET, Clark AG. Science. 2009;326:239–240. doi: 10.1126/science.1182009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA. Proc. Natl. Acad. Sci. U. S. A. 2009;106:9362–9367. doi: 10.1073/pnas.0903103106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Vidal M, Cusick ME, Barabasi AL. Cell. 2011;144:986–998. doi: 10.1016/j.cell.2011.02.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Barabasi AL, Gulbahce N, Loscalzo J. Nat. Rev. Genet. 2011;12:56–68. doi: 10.1038/nrg2918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Zhong Q, Simonis N, Li QR, Charloteaux B, Heuze F, Klitgord N, Tam S, Yu H, Venkatesan K, Mou D, Swearingen V, Yildirim MA, Yan H, Dricot A, Szeto D, Lin C, Hao T, Fan C, Milstein S, Dupuy D, Brasseur R, Hill DE, Cusick ME, Vidal M. Mol. Syst. Biol. 2009;5:321. doi: 10.1038/msb.2009.80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chuang HY, Lee E, Liu YT, Lee D, Ideker T. Mol. Syst. Biol. 2007;3:140. doi: 10.1038/msb4100180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Taylor IW, Linding R, Warde-Farley D, Liu Y, Pesquita C, Faria D, Bull S, Pawson T, Morris Q, Wrana JL. Nat. Biotechnol. 2009;27:199–204. doi: 10.1038/nbt.1522. [DOI] [PubMed] [Google Scholar]
  • 13.Vidal M. FEBS Lett. 2005;579:1834–1838. doi: 10.1016/j.febslet.2005.02.030. [DOI] [PubMed] [Google Scholar]
  • 14.Yu H, Braun P, Yildirim MA, Lemmens I, Venkatesan K, Sahalie J, Hirozane-Kishikawa T, Gebreab F, Li N, Simonis N, Hao T, Rual JF, Dricot A, Vazquez A, Murray RR, Simon C, Tardivo L, Tam S, Svrzikapa N, Fan C, de Smet AS, Motyl A, Hudson ME, Park J, Xin X, Cusick ME, Moore T, Boone C, Snyder M, Roth FP, Barabasi AL, Tavernier J, Hill DE, Vidal M. Science. 2008;322:104–110. doi: 10.1126/science.1158684.
  • 15.Das J, Yu H. BMC Syst. Biol. 2012;6:92. doi: 10.1186/1752-0509-6-92.
  • 16.Stark C, Breitkreutz BJ, Chatr-Aryamontri A, Boucher L, Oughtred R, Livstone MS, Nixon J, Van Auken K, Wang X, Shi X, Reguly T, Rust JM, Winter A, Dolinski K, Tyers M. Nucleic Acids Res. 2011;39:D698–D704. doi: 10.1093/nar/gkq1116.
  • 17.Kerrien S, Aranda B, Breuza L, Bridge A, Broackes-Carter F, Chen C, Duesbury M, Dumousseau M, Feuermann M, Hinz U, Jandrasits C, Jimenez RC, Khadake J, Mahadevan U, Masson P, Pedruzzi I, Pfeiffenberger E, Porras P, Raghunath A, Roechert B, Orchard S, Hermjakob H. Nucleic Acids Res. 2012;40:D841–D846. doi: 10.1093/nar/gkr1088.
  • 18.Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, Balakrishnan L, Marimuthu A, Banerjee S, Somanathan DS, Sebastian A, Rani S, Ray S, Harrys Kishore CJ, Kanth S, Ahmed M, Kashyap MK, Mohmood R, Ramachandra YL, Krishna V, Rahiman BA, Mohan S, Ranganathan P, Ramabadran S, Chaerkady R, Pandey A. Nucleic Acids Res. 2009;37:D767–D772. doi: 10.1093/nar/gkn892.
  • 19.Turner B, Razick S, Turinsky AL, Vlasblom J, Crowdy EK, Cho E, Morrison K, Donaldson IM, Wodak SJ. Database. 2010;2010:baq023. doi: 10.1093/database/baq023.
  • 20.Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D. Nucleic Acids Res. 2004;32:D449–D451. doi: 10.1093/nar/gkh086.
  • 21.Licata L, Briganti L, Peluso D, Perfetto L, Iannuccelli M, Galeota E, Sacco F, Palma A, Nardozza AP, Santonico E, Castagnoli L, Cesareni G. Nucleic Acids Res. 2012;40:D857–D861. doi: 10.1093/nar/gkr930.
  • 22.Mewes HW, Ruepp A, Theis F, Rattei T, Walter M, Frishman D, Suhre K, Spannagl M, Mayer KF, Stumpflen V, Antonov A. Nucleic Acids Res. 2011;39:D220–D224. doi: 10.1093/nar/gkq1157.
  • 23.Hu Z, Hung JH, Wang Y, Chang YC, Huang CL, Huyck M, DeLisi C. Nucleic Acids Res. 2009;37:W115–W121. doi: 10.1093/nar/gkp406.
  • 24.Venkatesan K, Rual JF, Vazquez A, Stelzl U, Lemmens I, Hirozane-Kishikawa T, Hao T, Zenkner M, Xin X, Goh KI, Yildirim MA, Simonis N, Heinzmann K, Gebreab F, Sahalie JM, Cevik S, Simon C, de Smet AS, Dann E, Smolyar A, Vinayagam A, Yu H, Szeto D, Borick H, Dricot A, Klitgord N, Murray RR, Lin C, Lalowski M, Timm J, Rau K, Boone C, Braun P, Cusick ME, Roth FP, Hill DE, Tavernier J, Wanker EE, Barabasi AL, Vidal M. Nat. Methods. 2009;6:83–90. doi: 10.1038/nmeth.1280.
  • 25.Cusick ME, Yu H, Smolyar A, Venkatesan K, Carvunis AR, Simonis N, Rual JF, Borick H, Braun P, Dreze M, Vandenhaute J, Galli M, Yazaki J, Hill DE, Ecker JR, Roth FP, Vidal M. Nat. Methods. 2009;6:39–46. doi: 10.1038/nmeth.1284.
  • 26.Fields S, Song O. Nature. 1989;340:245–246. doi: 10.1038/340245a0.
  • 27.Walhout AJ, Vidal M. Methods. 2001;24:297–306. doi: 10.1006/meth.2001.1190.
  • 28.Yu H, Tardivo L, Tam S, Weiner E, Gebreab F, Fan C, Svrzikapa N, Hirozane-Kishikawa T, Rietman E, Yang X, Sahalie J, Salehi-Ashtiani K, Hao T, Cusick ME, Hill DE, Roth FP, Braun P, Vidal M. Nat. Methods. 2011;8:478–480. doi: 10.1038/nmeth.1597.
  • 29.Remy I, Michnick SW. Nat. Methods. 2006;3:977–979. doi: 10.1038/nmeth979.
  • 30.Barrios-Rodiles M, Brown KR, Ozdamar B, Bose R, Liu Z, Donovan RS, Shinjo F, Liu Y, Dembowy J, Taylor IW, Luga V, Przulj N, Robinson M, Suzuki H, Hayashizaki Y, Jurisica I, Wrana JL. Science. 2005;307:1621–1625. doi: 10.1126/science.1105776.
  • 31.Taipale M, Krykbaeva I, Koeva M, Kayatekin C, Westover KD, Karras GI, Lindquist S. Cell. 2012;150:987–1001. doi: 10.1016/j.cell.2012.06.047.
  • 32.Ramachandran N, Hainsworth E, Bhullar B, Eisenstein S, Rosen B, Lau AY, Walter JC, LaBaer J. Science. 2004;305:86–90. doi: 10.1126/science.1097639.
  • 33.Ulrichts P, Lemmens I, Lavens D, Beyaert R, Tavernier J. Methods Mol. Biol. 2009;517:133–144. doi: 10.1007/978-1-59745-541-1_9.
  • 34.Jansen R, Yu H, Greenbaum D, Kluger Y, Krogan NJ, Chung S, Emili A, Snyder M, Greenblatt JF, Gerstein M. Science. 2003;302:449–453. doi: 10.1126/science.1087361.
  • 35.Aloy P, Russell RB. Proc. Natl. Acad. Sci. U. S. A. 2002;99:5896–5901. doi: 10.1073/pnas.092147999.
  • 36.Tuncbag N, Gursoy A, Nussinov R, Keskin O. Nat. Protocols. 2011;6:1341–1354. doi: 10.1038/nprot.2011.367.
  • 37.Zhang QC, Petrey D, Deng L, Qiang L, Shi Y, Thu CA, Bisikirska B, Lefebvre C, Accili D, Hunter T, Maniatis T, Califano A, Honig B. Nature. 2012;490:556–560. doi: 10.1038/nature11503.
  • 38.Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, Klitgord N, Simon C, Boxem M, Milstein S, Rosenberg J, Goldberg DS, Zhang LV, Wong SL, Franklin G, Li S, Albala JS, Lim J, Fraughton C, Llamosas E, Cevik S, Bex C, Lamesch P, Sikorski RS, Vandenhaute J, Zoghbi HY, Smolyar A, Bosak S, Sequerra R, Doucette-Stamm L, Cusick ME, Hill DE, Roth FP, Vidal M. Nature. 2005;437:1173–1178. doi: 10.1038/nature04209.
  • 39.Stelzl U, Worm U, Lalowski M, Haenig C, Brembeck FH, Goehler H, Stroedicke M, Zenkner M, Schoenherr A, Koeppen S, Timm J, Mintzlaff S, Abraham C, Bock N, Kietzmann S, Goedde A, Toksoz E, Droege A, Krobitsch S, Korn B, Birchmeier W, Lehrach H, Wanker EE. Cell. 2005;122:957–968. doi: 10.1016/j.cell.2005.08.029.
  • 40.Wang X, Gulbahce N, Yu H. Briefings Funct. Genomics. 2011;10:280–293. doi: 10.1093/bfgp/elr024.
  • 41.Oliver S. Nature. 2000;403:601–603. doi: 10.1038/35001165.
  • 42.Oti M, Snel B, Huynen MA, Brunner HG. J. Med. Genet. 2006;43:691–698. doi: 10.1136/jmg.2006.041376.
  • 43.Andreeva A, Howorth D, Chandonia JM, Brenner SE, Hubbard TJ, Chothia C, Murzin AG. Nucleic Acids Res. 2008;36:D419–D425. doi: 10.1093/nar/gkm993.
  • 44.Sillitoe I, Cuff AL, Dessailly BH, Dawson NL, Furnham N, Lee D, Lees JG, Lewis TE, Studer RA, Rentzsch R, Yeats C, Thornton JM, Orengo CA. Nucleic Acids Res. 2013;41:D490–D498. doi: 10.1093/nar/gks1211.
  • 45.Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, Holm L, Sonnhammer EL, Eddy SR, Bateman A. Nucleic Acids Res. 2010;38:D211–D222. doi: 10.1093/nar/gkp985.
  • 46.Stein A, Ceol A, Aloy P. Nucleic Acids Res. 2011;39:D718–D723. doi: 10.1093/nar/gkq962.
  • 47.Finn RD, Marshall M, Bateman A. Bioinformatics. 2005;21:410–412. doi: 10.1093/bioinformatics/bti011.
  • 48.Mosca R, Ceol A, Aloy P. Nat. Methods. 2013;10:47–53. doi: 10.1038/nmeth.2289.
  • 49.Pieper U, Webb BM, Barkan DT, Schneidman-Duhovny D, Schlessinger A, Braberg H, Yang Z, Meng EC, Pettersen EF, Huang CC, Datta RS, Sampathkumar P, Madhusudhan MS, Sjolander K, Ferrin TE, Burley SK, Sali A. Nucleic Acids Res. 2011;39:D465–D474. doi: 10.1093/nar/gkq1091.
  • 50.Kim PM, Lu LJ, Xia Y, Gerstein MB. Science. 2006;314:1938–1941. doi: 10.1126/science.1136174.
  • 51.Schuster-Bockler B, Bateman A. Genome Biol. 2008;9:R9. doi: 10.1186/gb-2008-9-1-r9.
  • 52.Prieto C, De Las Rivas J. Proteins. 2010;78:109–117. doi: 10.1002/prot.22569.
  • 53.Franzosa EA, Xia Y. Proc. Natl. Acad. Sci. U. S. A. 2011;108:10538–10543. doi: 10.1073/pnas.1101440108.
  • 54.Wang X, Wei X, Thijssen B, Das J, Lipkin SM, Yu H. Nat. Biotechnol. 2012;30:159–164. doi: 10.1038/nbt.2106.
  • 55.Meyer MJ, Das J, Wang X, Yu H. Bioinformatics. 2013;29:1577–1579. doi: 10.1093/bioinformatics/btt181.
  • 56.Meier M, Janosik M, Kery V, Kraus JP, Burkhard P. EMBO J. 2001;20:3910–3916. doi: 10.1093/emboj/20.15.3910.
  • 57.Devos D, Russell RB. Curr. Opin. Struct. Biol. 2007;17:370–377. doi: 10.1016/j.sbi.2007.05.011.
  • 58.Mosca R, Pons T, Ceol A, Valencia A, Aloy P. Curr. Opin. Struct. Biol. 2013. doi: 10.1016/j.sbi.2013.07.005. [Epub ahead of print]
