Abstract
Virtual interactomics represents a rapidly developing scientific area on the boundary line of bioinformatics and interactomics. Protein-related virtual interactomics then comprises instrumental tools for prediction, simulation, and networking of the majority of interactions important for structural and individual reproduction, differentiation, recognition, signaling, regulation, and metabolic pathways of cells and organisms. Here, we describe the main areas of virtual protein interactomics, that is, structurally based comparative analysis and prediction of functionally important interacting sites, mimotope-assisted and combined epitope prediction, molecular (protein) docking studies, and investigation of protein interaction networks. Detailed information about some interesting methodological approaches and online accessible programs or databases is displayed in our tables. Considerable part of the text deals with the searches for common conserved or functionally convergent protein regions and subgraphs of conserved interaction networks, new outstanding trends and clinically interesting results. In agreement with the presented data and relationships, virtual interactomic tools improve our scientific knowledge, help us to formulate working hypotheses, and they frequently also mediate variously important in silico simulations.
1. General Remarks
Many important findings in pharmacology, cell biology, and pathobiology have been achieved with the aid of virtual interactomics including computer-aided structural analysis, prediction and in silico simulation of interacting sites, protein complexes, and interaction networks. Virtual interactomics has been developed in the last thirty years, and it is in fact based on gradual bioinformatic processing of experimental data. These data were usually obtained from individual studies of interactions, and various large-scale experimental methods such as the two-hybrid system, phage display library studies reverse interactomics, SPOT arrays or microarray studies, and extended sequence studies [1–7].
In addition to sequence data, three-dimensional (3D) structures are ever more frequently required for interactomic predictions. X-ray crystallography or nuclear magnetic resonance studies represent the most frequent sources of 3D structures, whereas combination of electron microscopy of molecular complexes with X-ray crystallography turns out to be interesting for the same purpose [8–11]. Alternatively, sophisticated 3D structure simulations such as homology modeling or combination of cryoelectron microscopy densities, and molecular dynamics appear to be also useful for approximating conventional 3D input at least in some cases [12–14]. In addition to 3D shape, solvent and surface accessibilities (or more likely actual dynamic accessibility following from accompanying interacting structures or proteolysis; cf. [15, 16]) were considered to be important criteria for reevaluation of possible interaction sites. Many experimentally investigated and predicted structural relationships were also stored in interactomic databases to be selectively found, processed and compared. Moreover, some interaction data differently stored in multiple databases have been searched with the aid of special data mining servers such as Dasmiweb and PINA v2.0 ([17, 18]; see also Table 1).
Table 1.
Some more recent online accessible bioinformatic tools.
| Programs | Purpose | Input | Evaluation tools and procedures | Compared structures | Internet accessibility | References |
|---|---|---|---|---|---|---|
| Program tools | ||||||
|
| ||||||
| 3D-BLAST (two methods) | ∗Identification of 23 states of the structural alphabet | PDB ACF | ∗SASM | ∗structural alphabet sequence databases | http://3d-blast.life.nctu.edu.tw/ | [41, 42] |
| ∗Comparison of protein folds using spherical polar Fourier basis functions. | ∗Carbo-like similarity score | ∗SPF shape density rep. | http://threedblast.loria.fr/ | |||
| DASMIweb | Online integration, analysis and assessment of distributed protein interaction data and predictions (interactomic mining) | some protein identifiers | literature curation, prediction, 3D analysis | records of interactions | http://www.dasmiweb.de | [17] |
| FFAS03 | Server accepts a user supplied protein sequence and automatically generates a profile, which is then compared with several sets of sequence profiles of proteins databases | QS | PPA | DS | http://ffas.godziklab.org/ | [36, 37] |
| KinasePhos 2.0 | Prediction of kinase-specific phosphorylation sites based on the site sequences from databases | QS | k-fold + Jackknife cross-validations | phosphorylation sites from Phospho.ELM Swiss-Prot | http://KinasePhos2.mbc.nctu.edu.tw/ | [62] |
| Phos3D | Method of phosphorylation-site prediction based on 3D structural information associated with 530 phys-chem-pr | QS (+3D context) | SVM, spatial amino acid propensities | PDB coordinates | http://phos3d.mpimp-golm.mpg.de/ | [63] |
| SeSAW | Identification of functionally or evolutionarily conserved motifs based on balancing between sequence and structural similarities | ∗PDB QF, ID; ∗PDB QF, ID, IDt | P value SeSAW score | conserved domains templates | http://sysimm.ifrec.osaka-u.ac.jp/SeSAW/ | [61] |
|
| ||||||
| Databases | ||||||
|
| ||||||
| ADAN | Prediction of protein-protein interactions of different modular protein domains mediated by linear motifs | PDB ACF | PSSM_PI | integrated DaSt | http://adan-embl.ibmc.umh.es/ | [64] |
| CDD (RPS BLAST) | Conserved domain relationships, domain location of frequent binding sites | QS | model MSA derived PSSM | Conserved domains (PSSM) |
http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml
http://www.ncbi.nlm.nih.gov/BLAST |
[65–67] |
| GWIDD | Docking database; integrated resource for studies of protein-protein interactions on the genome scale | search interface | docking techniques | DaSt | http://gwidd.bioinformatics.ku.edu | [68] |
| MegaMotifBase (structural database) | 3D motif orientation, intermotif distances, solvent accessibility, secondary structure content, hydrogen bonding, residual packaging, familiar/superfamiliar/none motif relationship | motif or sequence pattern | complex evaluation including projection | DS + DaSt | http://caps.ncbs.res.in/MegaMotifbase/index.html | [69] |
| PTGL | Database for secondary structure-based protein topologies; visualization of topology diagrams and 3D structures | PDB QF | ULN4D_graph theory | DS + DaSt | http://ptgl.uni-frankfurt.de/ | [70] |
| RsiteDB | Database of protein binding pockets which interact with single strand RNA | PDB code, UPS | 3D arrangement of phys-chem-pr | 3D-CBP | http://bioinfo3d.cs.tau.ac.il/RsiteDB/ | [71] |
∗Independent alternatives; 3D-CBP: 3D consensus binding patterns important for protein-nucleotide recognition; DS: database sequences; DSA: double sequence alignment; DaSt: database structures; CS: list of compared sequences; ID, IDt: chain or template ID, respectively; MSA: multiple sequence alignment; QS: query sequence(s) (instead of QS, clonal names or gi numbers can be used in majority of given approaches); phys-chem-pr: physicochemical properties; PDB: Protein data bank; PDB QF: PDB-formatted query file; PDB ACF: PDB-derived atom coordinate file; PPA: profile-profile alignment; PSSM: position-specific scoring matrices; PSSM_PI: PSSM for protein-protein interactions calculated by FoldX; rep.: representations; SASM: structural alphabet substitution matrix; SPF: spherical polar Fourier; SVM: support vector machines; ULN4D_graph theory: unique linear notations of four descriptions for protein structures on different abstraction levels based on graph theory; UPS: unbound protein structure.
Virtual interactomic studies are mostly realized via computer programs including numerous online accessible tools. Three types of computer data processing are important for online interactomic predictions on molecular level, that is, (i) structure comparisons, (ii) molecular docking studies, and (iii) reevaluation of current database data, and accessible or proposed protein interaction networks. Similarly, three types of interacting structures can be distinguished with respect to their different molecular origin. This concerns (i) conserved structures, (ii) randomly/quasi-randomly in vitro generated or rapidly evolved structures (e.g., mimotopes and disordered regions), and (iii) binding sites of antigen receptors expressed in specific immune cell clones, which are usually developed during the regular recombination process, and later also at the time of immune response.
In contrast to reactomics, interactomics described here deals exclusively with interactions, and thus concerns only interactions of enzyme active sites but not their subsequent reaction mechanism. Consequently, modeling of enzyme reactions exceeds the topic of this review. In addition, since we dealt here with protein interactomics, this paper also does not contain information about interactions of DNA with nonpeptide ligands.
2. Structural Similarities of Interacting Sites
By using various structurally based programs, many protein interactions can be predicted based on the occurrence of phylogenetically conserved or convergently developed functionally important common structural features (motifs, sequence patterns, consensi, constructs of conserved domain sequences, supersecondary structures, supersecondary motifs, 3D-arranged structural patterns and pockets). However, diversification within protein families, and superfamilies causes losses of interacting structures or disables their accessibility. On the other hand, new interactive pairs frequently appear in cases of disordered protein regions, that is, peptide segments naturally occurring in multiple conformation variants [19–21]. The attendant stability problems, as well as some additional problems with molecular analysis, were diminished by the development of databases enabling reevaluation of selected structural relationships. The databases inform us about similar or common structural features, frequent locations of binding sites in related domains, solvent accessibility, and location of investigated segments in 3D structures of proteins (see Table 1).
During the last twenty years, conventional sequence-based search for conserved structures frequently combined different evaluations of sequence similarities. The corresponding protocols usually combined double sequence, and multiple sequence comparisons like BLASTP, PSI-BLAST, RPS-BLAST, Clustal W and MUSCLE [22, 23]. In addition, highly selective PHI-BLAST or PSI-BLAST searches with specifically restricted representative query sequences such as consensi (and also sequence patterns) made it possible to locate the corresponding potentially interacting sites in extended sequence sets [23–26]. Except for current conserved sequences, extremely variable but still defined structures such as heptad repeats were investigated for purposes of molecular topology [27, 28]. These repeats can be written as an alphabet with generalized characters representing aa groups instead of individual aa (hydrophobic, charged, polar, etc.). Together with usual (motif-related sequence block derived) patterns an important part of heptad repeats can be searched in database sequences and partially reevaluated by means of PROSITE programs [29–32]. Apart from regions with highly conserved sequences, additional conserved structures were found when combining evaluation of primary and secondary structures [33], PSI-BLAST and secondary structure [34], or when using fold recognition [34, 35]. The compared sequence queries were moreover evaluated on the sophisticated widely used FFAS03 server (about 250 references) providing the third generation of the profile-profile alignment and fold-recognition algorithm mediated by program FFAS (fold, and function assignment system; [36, 37]; Table 1). The sensitivity FFAS-related profile-profile comparison is now widely recognized and many Web servers implementing such algorithms are available, for example, HHPRED, COMPASS, COMA, PHYRE, GenThreader, FORTE and webPRC [37]. More recent multiple sequence alignment program BCL::Align includes also combined evaluation of structural similarities [38] (for applications, see, e.g., [39, 40]). The corresponding scoring function is a weighted sum of scores derived based on (i) the traditional PAM, and BLOSUM scoring matrices, (ii) position-specific scoring matrices by PSI-BLAST, (iii) secondary structure predicted by a variety of methods, (iv) chemical properties, and (v) gap penalties. Monte Carlo algorithm was then used to determine necessary optimized weights in cases of sequence alignment and fold recognition.
Input of 3D coordinates or their transformed representations was necessary for other structural studies predicting also functional interaction sites. The corresponding research yielded two servers with different 3D-BLAST programs enabling us to compare folds and fold families [41, 42] (for details see also Table 1). Alternative structural comparison substituting 3D relationships was performed when searching for the maximum contact map overlaps [43]. The extended contact map comparison appeared very early [44]. Contact map is in fact determined by the matrix of distances between individual amino acids, contact threshold and specification of contact types [45, 46]. This map can be visualized by CMView software [46]. Similarly to sequence motifs, conserved patterns of 3D peptide arrangements (CP-3D-A) were considered as an additional type of structure-function motifs. These structures were recorded by sequence independent 3D-templates and can be searched on Evolutionary Trace Annotation server [47, 48]. Frequent functionally important CP-3D-A occurred, for instance, in active sites of proteins containing porphyrin rings including members of cytochrome P450 superfamily [49]. Another recent interesting approach consisted in generalized motif search in large alphabet (cumulative occurrence of several alphabets in this case) inputs including simultaneously evaluation of DNA sequence, protein sequence and supersecondary structure motifs [50]. According to the authors, large alphabets are important in cases when structures share little similarity at primary level.
Similarly to multiple sequence alignment, various forms of multiple structural alignments (MSTA) have been generated. Older attempts at MSTA were based on secondary structural alignment [51–53]. On the other hand, recent MSTA approaches determined common spatial (3D) structures. These approaches include new strategies employing “molecular sieving” of protein structures, minimization of an energy function over the low-dimensional space of the relative rotations, and translations of the molecules, geometric hashing and contact-window-derived motif library [54–58].
In spite of the increasing possibility of database reevaluation, some structurally based programs predicting interacting sites became at least relatively autonomous. A large number of specific and relatively autonomous programs concerned the prediction of epitopes (for details see Section 3). Similarly, various possibilities of prediction of phosphorylation sites were frequently investigated (two examples in Table 1), since phosphorylation reactions represent an important signaling and regulatory network in cell biology and pathobiology. The interest followed also from the extended building of databases related to phosphorylation sites, which brought many interesting insights. For instance, in accordance with the linear motif atlas for phosphorylation-dependent signaling, tyrosine kinases mutated in cancer exhibited lower specificity than their non-oncogenic relatives [59]. Similarly, collection and motif-based prediction of phosphorylation sites in human virus proteins suggested a substantial role for human kinases in regulation/mediation of viral protein functions [60]. In contrast to programs predicting specific types of interacting sites, SeSAW represents an example of general online accessible program allowing prediction of possible functionally important structures [61] (see also Table 1). The corresponding balance between different combined structural evaluations concerned, among others, data present in position specific scoring matrix (PSSM), template-derived PSSM and template functional annotations.
3. Mimotope and Epitope Interactions
Epitopes are defined as the structures responsible for interaction of antigens with binding sites of antigen receptors. On the other hand, mimotopes belong to artificially prepared peptides, which interact with natural templates (most likely proteins), and thus mimic other peptides or organic compounds in their functionally important interactions. Mimotope development is based on synthetic peptide or phage display libraries, whereas natural development of specific cell clones is necessary for epitope recognition by specific antigen receptors. This means that both epitopes and mimotopes can sometimes considerably differ from the usual conserved structures mentioned above. In spite of the described difference, a unifying point between mimotopes and epitopes exists, because mimotopes were originally defined as peptides mimicking epitopes [72], forming thus only a subset of the later current mimotope repertoire. In addition to this historical linkage, rapidly developing (diverging) structures such as molecular mimicry enabling parasitic attack and adaptation of pathogenic viruses and bacteria, disordered regions of proteins, and protein loops appear to be good candidates for extended investigation of mimotope similarities (cf. [21, 73–75]) and mimotope based prediction (see below).
Mimotopes were originally derived in studies with a phage display library. This phage technology was discovered in the eighties [76]. The corresponding boom in the nineties then comprised novel random, partially randomized or gene-fragment-derived oligopeptide libraries able to functionally mimic epitopes, autoepitopes, short peptide ligands, protein kinase or proteinase substrates, as well as peptides mimicking organic substances such as biotin when interacting with steptavidin [77–85]. Mimotope similarities were frequently defined using sequence patterns, whereas additional types of nonsequence structural similarities were also described (see, e.g., [81, 83]). More recent biotechnological research of mimotopes yielded potential peptide drugs [86–89], peptide vaccines [90–92], and peptides suitable for specific (mostly nanoparticle mediated) drug delivery to tumor cells, brain, atherosclerotic plaques, and other therapeutically important sites of human or animal organisms [93–97]. In spite of this considerable progress, virtual interactomic tools are not still able to compare or predict organic drugs based on their effective spatial similarity with functionally active mimotopes.
Database registration and authentication of mimotopes have been performed for more than ten years [98–100]. Special programs comparing primary structures of epitopes were simultaneously developed (programs FINDMAP, and EPIMAP [101, 102]). Some of them employed also multiple-sequence alignment evaluation (program MIMOP [103]). Similarly, coexisting peptide databases were established to process the accumulated information. These databases (i) recognized sequence subsets classified after in vitro evolution of phage display libraries, (ii) offered many integrating programs, and (iii) made it possible to find all mimotope sets that have the 3D structure of a target-template complex (databases ASPD, RELIC and MimoDB [98, 99, 104, 105]). In addition, novel mimotope-assisted computer-aided epitope prediction was discovered. This prediction came from both the 3D structure of an interacting partner and sequences of similarly interacting mimotopes, and has also concerned some interacting partners different from specific antigen receptors and antigens (programs PepSurf, Pep-3D-Search, and MimoPro [106–108]). Based on this approach, improved specificity, and extended the repertoire of predicted epitopes or other interacting partners were achieved. Further progress in programming then resulted in accelerated computation in spite of more complicated, and precise strategies of data processing. In addition, pattern recognition algorithm was developed, which can effectively be employed to screen a mixture of antibodies, and define the breadth of epitopes recognized by polyserum directed against specific proteins [109]. This possibility appears to be interesting with respect to future vaccine design.
The recent status of epitope prediction is still far from resolved due to insufficient extent of the datasets, and still requires continuous improvement of database organization [110, 111]. In such state, mimotope-assisted epitope prediction mentioned above and combined approaches represent a certain improvement in the quality of epitope prediction. Online accessible combined approaches of B-cell epitope prediction mostly evaluate 3D structures together with solvent accessibility (programs CEP, ElliPro, PEPOP and 3D alternative of Epitopia [112–115]; Table 2). Similarly to many other combined approaches, combined restriction diminishes the number of false positivities but simultaneously can cause increased number of false negativities. For instance, the widely used requirement of solvent accessibility mentioned above would in fact eliminate at least part of conserved autoepitopes, which contain hydrophobic patterns (cf. [116–118]). To diminish losses following from employed combinations or effects of too strict (sure) thresholds, a metaserver was developed that sums up the results from six epitope-predicting servers [119]. In addition, some new types of online accessible prediction of linear epitopes appeared to complete the preceding results [120, 121].
Table 2.
Epitope prediction or reevaluation on accessible servers.
| Programs | Purpose | Tools or procedures of evaluation | Input | Output | Internet accessibility | References |
|---|---|---|---|---|---|---|
| Epitope prediction | ||||||
|
| ||||||
| MimoDB 2.0 | Mimotope database | MySQL relational database | according to menu | Structures visualization, alignments, and so forth | http://immunet.cn/mimodb/ | [105] |
| MimoPro | Maps a group of mimotopes back to a source antigen so as to locate the interacting epitope on the antigen | Branch and bound optimization (analysis of overlapping patches on the surface of a protein) | PDB identifier mimotope sequences | Score + 3D location in antigen | http://informatics.nenu.edu.cn/MimoPro/ | [108] |
| ViPR | Virus pathogen database | Integration of various resources | according to menu | Structures, annotations, and so forth | http://www.viprbrc.org/ | [140] |
| MetaMHC | Prediction of MHC binding epitopes | meta-approach | QS | Four metapredictor scores including MetaSVMp score | http://www.biokdd.fudan.edu.cn/Service/MetaMHC.html | [141] |
| Epitopia | Prediction of B-cell epitopes |
∗MLA ∗MLA + 3D + SA |
∗QS ∗PDB ACF |
Immunogenicity score + probability score + color scale record on QS | http://epitopia.tau.ac.il | [115] |
| ElliPro | New structure-based tool for the prediction of antibody epitopes | 3D + SA + flx + antigenicity |
∗QS ∗PDB ACF |
Score + visualized epitope 3D structure and 3D location | http://tools.immuneepitope.org/tools/ElliPro | [114] |
| MHCPred | Prediction of class II mouse MHC peptide binding affinity | ISC-PLS, SYBYL software package | QS | Binding affinity (pIC50) | http://www.jenner.ac.uk/MHCPred | [129] |
|
| ||||||
| Stable epitopes and vaccines | ||||||
|
| ||||||
| BayesB | SVM-based prediction of linear B-cell epitopes using Bayes Feature Extraction | Residue conservation + position-specific residue propensities | QS | Residue epitope propensity score | http://www.immunopred.org/bayesb/index.html | [120] |
| OptiTope | Selection of optimal peptides for epitope-based vaccines | Multistep parallel evaluation | MSA of Ag list of ES table: ES, ES related MHC |
Fraction of overall immunogenicity covered MHC | http://www.epitoolkit.org/optitope | [142] |
| PVS | PVS returns a variability-masked sequence, which can be submitted to the RANKPEP server to predict conserved T-cell epitopes. | 3D visualization of MSA derived sequence variability “per site” | PDB ACF | 3D map of variability fragments with no variable residues and their 3D location in antigen | http://imed.med.ucm.es/PVS/ | [143] |
| PEPVAC | Prediction of MHC I; server can also identify conserved and promiscuous MHC I ligands | PSSM, distance matrix, phylogenic clustering algorithm | Genome, HLA-supertypes | Selected peptide sequences score | http://immunax.dfci.harvard.edu/PEPVAC/ | [144] |
| RANKPEP | Prediction of MHC I and MHC II ligands | Profile comparison | Genome, HLA-supertypes, QS | Selected peptide sequences score | http://www.mifoundation.org/Tools/rankpep.html | [145, 146] |
∗Independent alternatives; Ag: antigen; BFE: Bayes Feature Extraction; flx: flexibility; ES: epitope sequences; HLA: human leukocyte antigens (human MHC); ISC-PLS: iterative self-consistent partial-least-squares based additive method; MHC: alleles of major histocompatibility antigens; MLA: machine learning algorithm; MSA: multiple sequence alignment; PDB: protein data bank; PDB ACF: PDB-derived atom coordinate file; QS: query sequence(s); SA: solvent accessibility; SVM: support vector machines.
An interesting input simplification has been achieved with a novel web server CBTOBE. This server uses learning of support vector machines based on physicochemical profiles, and makes it possible to predict conformation epitopes based on a sole sequence input [122]. A recent private alternative of CBTOBE was based on older evaluation of secondary structure and solvent accessibility (server COBEpro [123]) further complemented by evolutionary information and machine-learning-derived evaluation [124].
Combined predictions of epitopes presented to T-cell receptors have integrated class I MHC (major histocompatibility complex) peptide binding affinity, TAP transport efficiency and prediction of proteasome cleavage (programs EpiJen, NetCTL-1.2, FRED [125–128]). Nevertheless, independent simulations of peptide-binding affinity of various MHC molecules have been also proposed [129] as well as the corresponding neural network-based learning [130]. Both combined, and sole approaches then represent starting steps for further reevaluation with respect to T-cell receptor interactions, for example, using learning of support vector machines, and strict kernels based on 531 physicochemical properties (POPISK [131]). Important information about existing T-cell, and B-cell epitopes can be also obtained from the Immune Epitope Database (IEDB [132, 133]). Among others, EpitopeViewer of the 3D structural subcomponent of IEDB (IEDB-3D) allows the user to visualize, render and analyze the structure, and save structural and contact views as high-quality pictures for publication [133].
Since production of various vaccines, and detection kits with mononoclonal antibodies requires high efficiency of preparations, special searches for conserved epitopes have been developed for this purpose. Though the meaning of crucial term “conserved epitope” exceeds primary structure relationships, the repertoires of the corresponding structures appear to be sometimes considerably limited. In fact, structural or regulatory adaptations of pathogenic microogranisms and viruses cause less stability of immunologically important, and promising promiscuous (cytotoxic or helper) T-cell epitopes broadly cross-reacting with different MHC antigens, different frequencies of T-cell epitopes specific usually only for certain unique MHC molecule, as well as losses of immunogenicity or even absence of immune response. In addition to current evaluation, drug effects (cf. effects in docking studies described below) and accompanying structural variability (e.g., enzyme polymorphisms or special pathogenetical effects) have to be considered with respect to possible peptide epitope modifications, possible bias, or improvement of therapeutical design. It is a question whether conserved spatial structures following from multiple structural alignments mentioned above can be also interesting for prediction of conserved conformation of B-cell epitopes. The second part of Table 2 contains some web servers interesting with respect to prediction of conserved epitopes, whereas selected examples related to AIDS research follow.
The first more complex prediction of HIV epitopes was based on sequence conservativeness, secondary structure, solvent accessibility, hydrophilicity and flexibility [134]. The study pointed to an unfavorably frequent occurrence of changes in secondary structures predicted as antigenic. Certain progress in the research of conserved HIV-1 epitopes was achieved when assessing their possible interactions with MHC antigens. This comprised peptide prediction based on EpiMatrix score [135], construction of the peptide property model from a training dataset [136, 137] and combined T-cell epitope evaluation mentioned above [126, 127]. Lately, phylogenetic hidden Markov models allowed to predict HIV-1-related T-cell epitopes based on contiguous aa positions that evolve under immune pressure dependent among others on host HLA alleles [138]. In a recent paper, structure-function analysis based on a specifically devised mathematical model revealed that protection from neutralization (shielding of neutralization-sensitive domains) is enforced by intersubunit contact between the variable loops 1 and 2 (gp120 V1V2) of HIV-1, and domains of neighboring gp120 subunits in the trimer encompassing the V3 loop [139].
4. Protein Docking
Molecular docking is a method, which predicts the preferred reciprocal orientation of two molecules when they bound to each other to form a stable complex [147]. In case of simulated protein interactions, the authors currently speak about protein docking rather than about molecular docking, since molecular docking represents a term comprising also nucleic acid interactions with nonpeptide molecules. Formerly, molecular docking simulated “lock-and-key” type of protein interactions. Its original variants appeared in the eighties and reassumed interpretations of older molecular graphics [148–152]. These approaches were restricted by complementarity demands or simplified requirements for energy minimization. Lately, “hand-in-glove” analogy was found to be more appropriate for molecular docking than the “lock-and-key” one [153]. In addition to this conventional model, three new models have been recently developed for docking studies of protein-protein interactions [154], that is, (i) conformer selection model, using a novel ensemble docking algorithm, (ii) induced fit model, employing energy-gradient-based backbone minimization, and (iii) combined conformer selection/induced fit model. Physics-based molecular mechanics allowed the development of the force fields, which enabled the assessment of relative binding strength [155]. Alternative knowledge-based approaches were evolved to derive a statistical potential for interactions from a large database of protein-ligand complexes [156]. Affinity evaluation, statistical methods and combined procedures represent frequent alternatives of evaluation in recent protein docking approaches (Table 3). In addition to usual full 3D models simulating molecular complexes, contact docking was also proposed. This docking is based on contact map representations of molecules (cf. chapter 3). According to the authors contact docking appears feasible, and is able to complement other computational methods for the prediction of protein-protein interactions [157].
Table 3.
Examples of protein docking.
| Approach/combination | Investigated molecules | Predicted/investigated partners | Evaluation tools and results | Employed internet addresses | References |
|---|---|---|---|---|---|
| Protein-ligand | |||||
|
| |||||
| SwissDock | protein | small molecule (ligand) | FullFitness + RMSD | http://www.swissdock.ch | [183] |
| Glide (version 5.6) | P-glycoprotein | 24 P-gp binders + 102 endogenous molecules | Glide XP score + MM-GB/SA rescoring function | http://pubchem.ncbi.nlm.nih.gov/ | [184] |
| VSDocker (AutoGrid4, AutoDock4, etc.) | protein | small molecule (example with 86 775 ligands) | ΔG bind | http://www.bio.nnov.ru/projects/vsdocker2/ | [185] |
| Opal (Autodock tools) | protein | small molecule (ligand) | ΔG bind | http://kryptonite.ucsd.edu/opal2/dashboard?command=serviceList | [186] |
| FlexX module of SYBYL 7.0 and FMO | neuraminidase (from N1 subtype of influenza virus) | carboxyhexenyl derivatives or analogues | FlexX energy scores |
http://www.rcsb.org/pdb
http://grid.ccnu.edu.cn/ |
[187] |
| TarFisDock | protein | drugs | interaction energy | http://www.dddc.ac.cn/tarfisdock/ | [188] |
| KinDOCK | ATP binding sites of PK | focused ligand library | theoretical affinity | http://abcis.cbs.cnrs.fr/kindock | [189] |
| AutoDock + Gold | Cdc25 dual specificity phosphatases | library of sulfonylated aminothiazoles as inhibitors | ΔG bind + overall Gold fitness function | none | [190] |
| Molsoft module + ACD database | thyroid hormone receptor | 250 000 compounds | EQS score | none | [191] |
|
| |||||
| Protein-protein | |||||
|
| |||||
| ZDOCK | protein | flexible/shape complementary protein | combined evaluation using Fast Fourier Transform |
http://zlab.umassmed.edu/zdock/
http://www.fftw.org |
[192] [159, 193] |
| KBDOCK | protein | protein with hetero-binding site | combined knowledge-based approach | http://kbdock.loria.fr/ | [194] |
| RosettaDock | protein | protein | CAPRI CS | http://rosettadock.graylab.jhu.edu | [195] |
| ClusPro | protein | protein | combined evaluation | http://cluspro.bu.edu/login.php | [196, 197] |
|
| |||||
| Protein-nucleic acid | |||||
|
| |||||
| DARS-RNP QUASI-RNP |
protein | RNA | knowledge-based potentials for scoring protein-RNA models | http://iimcb.genesilico.pl/RNP/ | [198] |
| protein-DNA docking approaches | transcription factor/protein | DNA | RMSD + knowledge based energy |
http://pqs.ebi.ac.uk/
http://sparks.informatics.iupui.edu/czhang/ddna2.html |
[199] [200] |
|
| |||||
| Antigen receptors | |||||
|
| |||||
| DS-QM based peptide binding prediction | HLA-DP2 | HLA-DP2 interacting potential TCR ligands | CS following from two different DS-QM | none | [201] |
| SnugDock | antibody | antigen | paratope based structural optimization + CLEP | http://www.rosettacommons.org/ | [202] |
| pDOCK | MHC I and II | potential TCR ligands | RMSD | http://biolinfo.org/mpid-t2 | [203] |
| MHCsim | MHC I and II | potential TCR ligands | molecular dynamic simulation | http://igrid-ext.cryst.bbk.ac.uk/MHCsim/ | [204] |
ACD: available chemicals directory (database); CAPRI: critical assessment of predicted interactions; CLEP: combined lowest energy prediction; CS: combined scoring; DS-QM: docking score-based quantitative matrices; EQS score: score combining grid energy, electrostatic and entropy terms; FMO: fragment molecular orbitals; HLA-DP: a subset of human MHC II; ΔG bind: free energy of binding; PK: protein kinases; MM-GB/SA molecular mechanics scoring function with generalized Born implicit solvent effects; P-gp: P-glycoprotein; RMSD: root mean square displacement statistics; TCR: T-cell receptor.
The number of molecules whose interactions can be scanned by current protein docking depends usually on the complexity of evaluated interactions and manner of simulation. For instance, interaction of 142 drugs inhibiting Poly-(ADP-ribose)-polymerase was tested in a large-scale virtual docking screening together with 300 000 added organic compounds with the aid of the program Lead Finder [158], whereas only 176 protein-protein interactions were analyzed by efficient program ZDOCK during extended simulation [159]. In addition to current output, some docking studies even verified the results of simulation using in silico experiments. For instance, aa involvement in enzyme interactions was proved with simulated mutation (program SYBYL 6.7 and Internal Coordinate Mechanics method [160, 161]) whereas other authors compared separate simulated effects of inhibitors and substrates (program BioDock [162]).
In addition to current molecular docking simulations (Table 3), certain docking studies analyzed modulation effects of additional molecules on the evaluated interaction [163–165]. Homology modeling followed by extensive molecular dynamics simulation was used to identify nonpeptidic small organic compounds (among 150 000 compounds) that bind to a human leukocyte antigen HLA-DR1301, and block the presentation of myelin basic protein peptide (aa positions 152–165) to T-cells. This peptide represents one of the epitopes critical for multiple sclerosis. In silico selection resulted in a set of 106 small molecules, two lead compounds were confirmed to specifically block IL-2 secretion by DR1301-restricted T-cells in a dose-dependent and reversible manner [163].
Pockets opening to protein surface represents potential sites for ligand binding or protein-protein interactions that were indeed identified in some cases [166–168]. Consequently, an algorithm BDOCK facilitating pocket-based prediction of protein binding site was proposed to improve protein docking [169], and moreover new algorithms predicting pockets are still proposed [170]. Many pockets were only transiently present on simulated surfaces of investigated proteins (e.g., Bcl-xl, interleukin 2, MDM2), and were all opened only 2.5 picosecond when using model intervals corresponding to ten nanosecond range [171].
Some docking studies comprised phylogenic aspects of protein interactions. It was observed that except for antibody-antigen complexes [172], the surface density of conserved residue positions at the interface regions of interacting protein surfaces is high. The corresponding combination of the residue conservation information with a widely used shape complementarity algorithm improved the ability of protein docking to predict the native structure of protein-protein complexes. Efficient comparative docking consists in selection based on conservation in terms of chain positions and primary structure in the first step, and the following high-throughput structure-based docking evaluation [173] (see also Table 3). Recent combined strategy including multiple sequence alignment and fold recognition analysis has been proposed to perform more precise docking, and predict also protein function [174].
Bioinspired algorithms were also applied to molecular docking simulations including neural networks, evolutionary computing and swarm intelligence [175]. Though neural networks participated in some older evaluations working in conventional programs of molecular docking [176–179], their independent scoring functions for docking have appeared lately [180, 181]. An extreme neural network approach required only sequence input to perform a docking-like procedure [182]. Predictions followed from a trained model that simulated binding or nonbinding stages.
5. Protein Interaction Networks
Protein interaction networks (PINs) are usually represented by graphs. Nodes of these graphs denote interacting molecules, whereas each edge linking two nodes indicates the corresponding interaction. The prevailing part of PIN is usually constituted by protein-protein interaction network (P-PIN). In addition to P-PIN, we can observe a record of protein interactions with (i) nonpeptidic hormones or mediators and drugs, (ii) processed, targeted, and functional complexes forming RNA, and (iii) recombining, hypermutating, repairing, replicating, twisting/untwisting, and transcribed DNA. These nonpeptidic compounds represent in fact inputs, outputs, interaction-stabilizing agents, or relay batons of P-PIN. In cases of DNA, and RNA, some papers appeared dealing with special protein-RNA, and protein-DNA interaction networks [205–208]. Protein-DNA networks moreover combined both the protein-centric and the DNA-centric points of view [205].
Like in other related networks, clusters are recognized in P-PIN. Two main types of P-PIN-related clusters can be distinguished, that is, protein complexes and functional modules [209, 210]. Protein complexes are groups of proteins that interact with each other at the same time and place, forming a single multimolecular machine (e.g., metabolic multienzyme complexes). Functional modules consist of proteins that participate in a particular cellular process while binding to each other at a different time and place (e.g., multiple signaling cascades are functional modules sometimes including also protein complexes). An important type of functional modules, that is, responsive functional modules (RFMs), includes protein interactions activated under specific conditions. These RFMs appear to be interesting with respect to prediction of potential biomarkers [211].
Much attention has been paid to the identification of small conserved subgraphs, particularly those occurring significantly often within the biological networks. These conserved subgraphs were referred to as network motifs or simply as motifs (similarly to primary structure motifs), and were considered as basic building blocks of complex networks including P-PIN [212, 213]. Ancestral pathways or functional modules including conserved interactions were derived when comparing P-PIN of phylogenetically distant species such as human, Saccharomyces cerevisiae, Drosophila melanogaster and Caenorhabditis elegans, and various bacterial species [214, 215]. Proteins necessary for proteasome function, transcription, RNA processing and translation were frequent in conserved subgraphs of compared Eukaryota, whereas DNA repair proteins prevailed in conserved prokaryotic subgraphs. Incorporation of literature-curated and evolutionarily conserved interactions also allowed to develop an interaction network for 54 proteins providing a tool for understanding Purkinje cell degeneration [216]. This result made it possible to find experimentally 770 mostly novel protein-protein interactions using a stringent yeast two-hybrid screen.
Recent integrative approaches combined at least transcriptional data with P-PIN analyses, and at most five layers including phenotype association with single-nucleotide polymorphism, disease-tissue, drug-tissue and drug-gene relationships, in addition to the topical P-PIN record [217] (see also Table 4). Two-layer comparison of disease-related mRNA expression, and human P-PIN was for instance used together with hierarchical clustering of networks to elucidate human disease similarities. The results led to a hypothesis considering common usage of some drugs in the diseases, which exhibited close relationship with respect to given clustering [218]. More extended analysis of gene expression overlays with protein-protein interaction, transcriptional-regulatory and signaling networks identified distinct driver-networks for each of the three common clinical breast cancer subtypes, that is, oestrogen receptor positive, human epidermal growth factor receptor 2 positive and triple receptor-negative breast cancers [219]. Integrative online accessible tools reevaluating P-PIN relationships were developed to predict candidate genes critical for the occurrence of different diseases (cf. network-based disease gene prioritization; [220, 221]).
Table 4.
Online accessible tools for PPI network assembly, reevaluation, and comparison.
| Purpose | Input | Evaluation results | Network content | Internet accessibility | References | |
|---|---|---|---|---|---|---|
| Program tools | ||||||
|
| ||||||
| PINTA | Resource for the prioritization of disease related candidate genes based on the differential expression of their neighborhood in a genome-wide protein-protein interaction network. | Files with disease-specific expression data | Internal scores, P values, Bonferroni–Holm P-values | menu: human, mouse, rat, worm (C. elegans) and yeast P-PIN |
http://www.esat.kuleuven.be/pinta/ | [221] |
| PCFamily | Finds homologous structure complexes of the query using BLASTP to search the structural template database | QS or QS set in FASTA format | Z-value agreement ratio | 941 protein complexes | http://pcfamily.life.nctu.edu.tw | [236] |
| BisoGenet | Build and visualize biological networks in a fast and user-friendly manner including P-PIN | List of identifiers | Node degrees, cluster coefficients | 5365 human genes | http://bio.cigb.edu.cu/bisogenet-cytoscape/ | [245] |
| DynaMod | Identifies significant functional modules reflecting the change of modularity and differential expressions that are correlated with gene expression profiles under different conditions | Genome-wide expression profile | Z-score, Bonferroni or FDR P-values | In frame of GSEA | http://biocondor6.kaist.ac.kr/dynamod/ | [241] |
| TORQUE | Cross species querying allows users to run topology-free queries on predefined or user-provided target networks resulting in subnetwork of the target network most similar to query subset | Seq of compared sets, query list, target P-PIN | Thickness of the edge | 5430 proteins, 39 936 interactions | http://www.cs.tau.ac.il/~bnet/torque.html | [246] |
| NetworkBLAST | Identifies protein complexes within and across species, forming html page with links to schemes of interactions within complexes and additional graph data files | Files with ∗PPI, BLPE ∗PPI | likelihood based density score | species related P-PIN ∗two species ∗single species |
http://www.cs.tau.ac.il/~roded/networkblast.htm | [234] |
|
| ||||||
| Databases | ||||||
|
| ||||||
| STRING | A database of functional interaction networks of proteins, globally integrated and scored | Search based on gene annotations | probabilistic confidence score | >1100 completely sequenced organisms | http://string-db.org | [251] |
| ChemProt | Disease chemical biology database including 7 × 105 chemicals and two millions chemical-protein interactions; indicates a possible formation of disease-associated protein complexes | Compound and protein identifiers | compilation of multiple chemical-protein annotations | 30 578 proteins 2227 disease-related proteins, 428 429 PPI |
http://www.cbs.dtu.dk/services/ChemProt/ | [252] |
| NetAge | Database and network analysis tools for biogerontological research (integrity and functionality of P-PIN is under a tight epigenetic control) | Tools for searching and browsing | experimental evidence about interactions | miRNA-regulated P-PIN in age-related diseases | http://www.netage-project.org | [253] |
| BioGRID | All interactions in BioGRID are available through the Osprey visualization system, which can be used to query network organization in a user-defined fashion | Intuitive graphical interface | experimental evidence about interactions | Over 198 000 genetic + protein interactions from six species | http://www.thebiogrid.org | [254] |
| EHCO | Encyclopedia of Hepatocellular Carcinoma genes Online collect, organize and compare unsorted HCC-related studies | CMS | NL processing and softbots | 97 proteins, 47 highly interactive, 18 hubs | http://ehco.iis.sinica.edu.tw | [255] |
| IntNetDB v1.0 | A database providing automatic prediction and visualization of PPI network among genes of interest; analysis includes domain-domain interactions, known gene contexts, crossvalidation, and so forth | list of query gene identifiers | likelihood ratio following from Bayesian analysis | concerns 27 species contains GSP and GSN from HPRD | http://hanlab.genetics.ac.cn/IntNetDB.htm | [256] |
∗Independent alternatives; BLPE: BLASTP E values between pairs of proteins from each of the compared species; CMS: content management system; GSEA: gene set enrichment analyses; GSP, GSN: gold standard positive and a gold standard negative dataset of HPRD, respectively; HCC: hepatocellular carcinoma; HPRD: human protein reference database; NL: natural language; PE: probabilistic evaluation; PPI: protein-protein interactions; QS: query sequences; seq: sequences.
The majority of proteins reciprocally interact via one or two interactions. This number substantially increases to tens, hundreds and more, when considering multi-interactive proteins denoted in PIN as hubs [222]. In agreement with the sense of the term, hubs are the principal agents in the interaction network and affect its function and stability [223]. Hubs were enriched in kinase and adaptor proteins including those interacting with frequently disordered phosphorylated protein regions [224, 225]. Similarly, many pathogenetically important proteins belonged to hubs, for example, the widely known tumor suppressor p53, alpha-synuclein involved in Parkinson disease and small multifunctional core protein necessary for orchestration of viral progeny in Flaviviridae [226, 227]. In principle, two types of hubs were distinguished, that is, static “party hubs” and dynamic “date hubs.” Party hubs are markedly more phylogenetically conserved, and their expression is highly correlated with their interacting partners in contrast to less conserved date hubs more frequently containing disordered regions and participating in cell signaling [228–230]. Hubs with two or more domains are more likely to connect distinct functional modules than single-domain hubs [224]. In addition to studies of biochemically interesting hubs, docking procedures (mediated by program ClusPro [231]) were recently employed together with comparative modeling to construct P-PIN [232], whereas domain-domain, domain-motif and motif-motif interactions were distinguished in some other P-PIN [233].
Extended P-PIN research yielded many tools allowing P-PIN-based prediction of protein complex formation [234–237], identification of hubs and multifunctional proteins [238, 239], identification of functional modules [240–242], network-based disease gene prioritization [217, 221, 243], as well as network building [244, 245], cross-species querying [246] or comparison [234, 237]. Some of these approaches as well as several examples of P-PIN-related databases are described in Table 4. The instrumental progress in PIN research also led to an increased number of the corresponding methodological reviews, for example, [247–250].
6. Accuracy as an Important Parameter
A detailed evaluation of accuracy of all the above approaches would require a separate review. The obstacles consist first of all in presence of complementary information in articles inaccessible for biologists and on web pages. Such state complicates mining of accuracy data first of all in the case, when server-related papers represent only the final step of author's efforts. Accuracy evaluation or even the corresponding references are also rarely present in the papers concerning new online databases. In addition, some authors also use other criterions related to accuracy to evaluate the corresponding performance value, whereas only some of such evaluations are widely known as valid accuracy substitution, for example, AUC (area under the ROC curve) discussed below.
In spite of the obstacles, we can mention several examples of high accuracy concerning the above approaches. Excellent accuracy values higher then 0.90 were found in the cases of Conserved Domain Database (or its RPS BLAST server; [257]; Table 1) and older 3D-BLAST variant under a broad range of conditions (Table 1), whereas similarly interesting AUC values (higher than 0.90) were mostly found also in the cases of recent 3D BLAST variant (Table 1), and the selected strategy of SVM- and machine-learning-based epitope prediction known as CBTOBE (chapter 3; [122]). The same accuracy levels were rarely achieved in certain subsets when employing ADAN, KinasePhos 2.0 (Table 1), ElliPro, RANKPEP (Table 2), SVM and machine-learning-based epitope prediction improved by evolutionary information BEOracle (chapter 3; [124]) and docking tool HADDOCK (chapter 4; [178]). Similarly, restricted subset-related AUC values of 0.90 and 0.93 indicated a good predicting ability when using docking-like Partner-aware prediction (chapter 4; [182]), and combined evaluation based on Glide X and molecular mechanics (Table 3) to simulate interactions of 25 antibodies and P-glycoprotein, respectively. Frequent subset accuracies of higher values than the gold standard value of 0.80 were observed during runs of programs Phos3D, KinasePhos 2.0 (Table 1), ElliPro (Table 2) and CBTOBE mentioned above (chapter 3; [122]), and two docking tools Glide XP and Gold (Table 2; [258]). Comparable AUC values then concerned RANKPEP (Table 2), BEOracle mentioned above (chapter 3; [124]) and network-based disease gene prioritization called DADA (chapter 5; [236]). The low occurrence of P-PIN in our overview of accuracy evaluation followed from the fact that the performance of P-PIN was alternatively evaluated with the aid of precision or recall. An interesting approach—Lead Finder—was developed to improve the accuracy of protein-ligand docking, binding energy estimation and virtual screening. High enrichment factors were obtained for almost all of the targets and seven different docking programs resulting in an excellent average AUC value of 0.92 [258].
7. Tendencies, Improvements, and Interplays
In summary, virtual interactomics represents a highly sophisticated, and relatively autonomous subarea of both interactomics and bioinformatics. Its development can be characterized by several independent and integrative tendencies. The independent tendencies include (i) more rapid and precise molecular modeling of protein complexes (including quantum mechanic methods and rapid pharmacological screening based on docking), (ii) more complex evaluation of structural similarities (e.g., large alphabets and combined prediction of functional sites), (iii) extensive assembly of large-scale PIN and assembly of PIN or PIN subgraphs with dynamical properties, and (iv) restriction of conserved PIN subgraphs and clinically interesting responsive functional modules. The integrative tendencies then comprise (i) establishment of structurally functionally, and purposefully oriented databases and associated data mining servers, (ii) questions concerning phylogenic relationships between structural differences in PIN subgraphs, and the differences in protein structures of subgraph forming molecules, (iii) PIN prediction and revision/reevaluation of empirically proposed PIN using comparative molecular modeling, docking and bio-inspired algorithms, (iv) comparative differential multilayer network approaches including clinical treatment cases, and (v) attempts to construct global networks within a single cell, and hopefully, within the system of different reciprocally communicating cell types (e.g., immune system). Since interactomics simplifies reactomic simulations, it can also markedly and rapidly complete reactomic results, for example, in prediction of enzyme-inhibitor effects and specificity of protein kinase interactions mentioned above. In the end, we also dealt here with accuracy, which represents an important parameter of tool choice together with reasonable tool accessibility, and possibilities to test several alternatives before prediction, simulation and network analysis or to use two or several independent methods. In conclusion, many interesting and important interplays occur inside the virtual interactomic area as well as in its boundary lines and performance evaluation.
Abbreviations
- 3D:
Three-dimensional
- aa:
Amino acid residues in protein sequence
- PIN:
Protein interaction network(s)
- P-PIN:
Protein-protein interaction network(s).
References
- 1.Espejo A, Côté J, Bednarek A, Richard S, Bedford MT. A protein-domain microarray identifies novel protein-protein interactions. Biochemical Journal. 2002;367(3):697–702. doi: 10.1042/BJ20020860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lo SH. Reverse interactomics: from peptides to proteins and to functions. ACS Chemical Biology. 2007;2(2):93–95. doi: 10.1021/cb700013q. [DOI] [PubMed] [Google Scholar]
- 3.Suter B, Kittanakom S, Stagljar I. Two-hybrid technologies in proteomics research. Current Opinion in Biotechnology. 2008;19(4):316–323. doi: 10.1016/j.copbio.2008.06.005. [DOI] [PubMed] [Google Scholar]
- 4.Briant DJ, Murphy JM, Leung GC, Sicheri F. Rapid identification of linear protein domain binding motifs using peptide SPOT arrays. Methods in Molecular Biology. 2009;570:175–185. doi: 10.1007/978-1-60327-394-7_6. [DOI] [PubMed] [Google Scholar]
- 5.Hui S, Bader GD. Proteome scanning to predict PDZ domain interactions using support vector machines. BMC Bioinformatics. 2010;11, article 507 doi: 10.1186/1471-2105-11-507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Daemen A, Signoretto M, Gevaert O, Suykens JAK, De Moor B. Improved microarray-based decision support with graph encoded interactome data. PLoS ONE. 2010;5(4) doi: 10.1371/journal.pone.0010225.e10225 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Huang J, Ru B, Dai P. Bioinformatics resources and tools for phage display. Molecules. 2011;16(1):694–709. doi: 10.3390/molecules16010694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.van Dijk ADJ, Kaptein R, Boelens R, Bonvin AMJJ. Combining NMR relaxation with chemical shift perturbation data to drive protein-protein docking. Journal of Biomolecular NMR. 2006;34(4):237–244. doi: 10.1007/s10858-006-0024-8. [DOI] [PubMed] [Google Scholar]
- 9.Klinge S, Ñez-Ramírez R, Llorca O, Pellegrini L. 3D architecture of DNA Pol α reveals the functional core of multi-subunit replicative polymerases. The EMBO Journal. 2009;28(13):1978–1987. doi: 10.1038/emboj.2009.150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Stark JL, Powers R. Application of NMR and molecular docking in structure-based drug discovery. Topics in Current Chemistry. 2012;326:1–34. doi: 10.1007/128_2011_213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Orts J, Bartoschek S, Griesinger C, Monecke P, Carlomagno T. An NMR-based scoring function improves the accuracy of binding pose predictions by docking by two orders of magnitude. Journal of Biomolecular NMR. 2012;52(1):23–30. doi: 10.1007/s10858-011-9590-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Diller DJ, Li R. Kinases, homology models, and high throughput docking. Journal of Medicinal Chemistry. 2003;46(22):4638–4647. doi: 10.1021/jm020503a. [DOI] [PubMed] [Google Scholar]
- 13.Förster F, Villa E. Integration of cryo-EM with atomic and protein-protein interaction data. Methods in Enzymology. 2010;483:47–72. doi: 10.1016/S0076-6879(10)83003-4. [DOI] [PubMed] [Google Scholar]
- 14.Cavasotto CN. Homology models in docking and high-throughput docking. Current Topics in Medicinal Chemistry. 2011;11(12):1528–1534. doi: 10.2174/156802611795860951. [DOI] [PubMed] [Google Scholar]
- 15.Tsfadia Y, Friedman R, Kadmon J, Selzer A, Nachliel E, Gutman M. Molecular dynamics simulations of palmitate entry into the hydrophobic pocket of the fatty acid binding protein. FEBS Letters. 2007;581(6):1243–1247. doi: 10.1016/j.febslet.2007.02.033. [DOI] [PubMed] [Google Scholar]
- 16.Gochin M, Zhou G, Phillips AH. Paramagnetic relaxation assisted docking of a small indole compound in the HIV-1 gp41 hydrophobic pocket. ACS Chemical Biology. 2011;6(3):267–274. doi: 10.1021/cb100368d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Blankenburg H, Ramírez F, Büch J, Albrecht M. DASMIweb: online integration, analysis and assessment of distributed protein interaction data. Nucleic Acids Research. 2009;37(2):W122–W128. doi: 10.1093/nar/gkp438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cowley MJ, Pinese M, Kassahn KS, et al. PINA v2.0: mining interactome modules. Nucleic Acids Research. 2011;40:D862–D865. doi: 10.1093/nar/gkr967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Sickmeier M, Hamilton JA, LeGall T, et al. DisProt: the database of disordered proteins. Nucleic Acids Research. 2007;35(1):D786–D793. doi: 10.1093/nar/gkl893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Meszaros B, Simon I, Dosztanyi, Z, et al. Prediction of protein binding regions in disordered proteins. PLoS Computational Biology. 2009;5(5) doi: 10.1371/journal.pcbi.1000376.e1000376 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Mészáros B, Simon I, Dosztányi Z. The expanding view of protein-protein interactions: complexes involving intrinsically disordered proteins. Physical Biology. 2011;8(3) doi: 10.1088/1478-3975/8/3/035003.035003 [DOI] [PubMed] [Google Scholar]
- 22.Biagini A, Puigserver A. Sequence analysis of the aminoacylase-1 family. A new proposed signature for metalloexopeptidases. Comparative Biochemistry and Physiology B. 2001;128(3):469–481. doi: 10.1016/s1096-4959(00)00341-9. [DOI] [PubMed] [Google Scholar]
- 23.Kubrycht J, Sigler K, Růžička M, Souček P, Borecký J, Ježek P. Ancient phylogenetic beginnings of immunoglobulin hypermutation. Journal of Molecular Evolution. 2006;63(5):691–706. doi: 10.1007/s00239-006-0051-9. [DOI] [PubMed] [Google Scholar]
- 24.Przybylski D, Rost B. Consensus sequences improve PSI-BLAST through mimicking profile-profile alignments. Nucleic Acids Research. 2007;35(7):2238–2246. doi: 10.1093/nar/gkm107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Przybylski D, Rost B. Powerful fusion: PSI-BLAST and consensus sequences. Bioinformatics. 2008;24(18):1987–1993. doi: 10.1093/bioinformatics/btn384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wrzeszczynski KO, Rost B. Cell cycle kinases predicted from conserved biophysical properties. Proteins. 2009;74(3):655–668. doi: 10.1002/prot.22181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Beck K, Hunter I, Engel J. Structure and function of laminin: anatomy of a multidomain glycoprotein. The FASEB Journal. 1990;4(2):148–160. doi: 10.1096/fasebj.4.2.2404817. [DOI] [PubMed] [Google Scholar]
- 28.Dong X-N, Xiao Y, Dierich MP, Chen YH. N- and C-domains of HIV-1 gp41: mutation, structure and functions. Immunology Letters. 2001;75(3):215–220. doi: 10.1016/s0165-2478(00)00302-3. [DOI] [PubMed] [Google Scholar]
- 29.Sibbald PR, Sommerfeldt H, Argos P. Automated protein sequence pattern handling and PROSITE searching. Computer Applications in the Biosciences. 1991;7(4):535–536. doi: 10.1093/bioinformatics/7.4.535. [DOI] [PubMed] [Google Scholar]
- 30.Gattiker A, Gasteiger E, Bairoch A. ScanProsite: a reference implementation of a PROSITE scanning tool. Appl Bioinformatics. 2002;1(2):107–108. [PubMed] [Google Scholar]
- 31.Sigrist CJA, De Castro E, Langendijk-Genevaux PS, Le Saux V, Bairoch A, Hulo N. ProRule: a new database containing functional and structural information on PROSITE profiles. Bioinformatics. 2005;21(21):4060–4066. doi: 10.1093/bioinformatics/bti614. [DOI] [PubMed] [Google Scholar]
- 32.Sigrist CJA, Cerutti L, De Castro E, et al. PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Research. 2009;38(1):D161–D166. doi: 10.1093/nar/gkp885.gkp885 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dunbrack RL., Jr. Comparative modeling of CASP3 targets using PSI-BLAST and SCWRL. Proteins. 1999;(supplement 3):81–87. doi: 10.1002/(sici)1097-0134(1999)37:3+<81::aid-prot12>3.3.co;2-i. [DOI] [PubMed] [Google Scholar]
- 34.Elofsson A. A study on protein sequence alignment quality. Proteins. 2002;46(3):330–339. doi: 10.1002/prot.10043. [DOI] [PubMed] [Google Scholar]
- 35.Zhou H, Zhou Y. Single-body residue-level knowledge-based energy score combined with sequence-profile and secondary structure information for fold recognition. Proteins. 2004;55(4):1005–1013. doi: 10.1002/prot.20007. [DOI] [PubMed] [Google Scholar]
- 36.Jaroszewski L, Rychlewski L, Li Z, Li W, Godzik A. FFAS03: a server for profile-profile sequence alignments. Nucleic Acids Research. 2005;33(2):W284–W288. doi: 10.1093/nar/gki418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Jaroszewski L, Li Z, Cai XH, Weber C, Godzik A. FFAS server: novel features and applications. Nucleic Acids Research. 2011;39(2):W38–W44. doi: 10.1093/nar/gkr441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dong E, Smith J, Heinze S, Alexander N, Meiler J. BCL::Align-Sequence alignment and fold recognition with a custom scoring function online. Gene. 2008;422(1-2):41–46. doi: 10.1016/j.gene.2008.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Fallen K, Banerjee S, Sheehan J, et al. The Kir channel immunoglobulin domain is essential for Kir1.1 (ROMK) thermodynamic stability, trafficking and gating. Channels. 2009;3(1):57–68. doi: 10.4161/chan.3.1.7817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Vishnepolsky B, Pirtskhalava M. CONTSOR—a new knowledge-based fold recognition potential, based on side chain orientation and contacts between residue terminal groups. Protein Science. 2012;21(1):134–141. doi: 10.1002/pro.763. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Yang J-M, Tung CH. Protein structure database search and evolutionary classification. Nucleic Acids Research. 2006;34(13):3646–3659. doi: 10.1093/nar/gkl395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Mavridis L, Ghoorah AW, Venkatraman V, Ritchie DW. Representing and comparing protein folds and fold families using three-dimensional shape-density representations. Proteins. 2012;80(2):530–545. doi: 10.1002/prot.23218. [DOI] [PubMed] [Google Scholar]
- 43.Di Lena P, Fariselli P, Margara L, Vassura M, Casadio R. Fast overlapping of protein contact maps by alignment of eigenvectors. Bioinformatics. 2010;26(18):2250–2258. doi: 10.1093/bioinformatics/btq402. [DOI] [PubMed] [Google Scholar]
- 44.Rodionov MA, Johnson MS. Residue-residue contact substitution probabilities derived from aligned three-dimensional structures and the identification of common folds. Protein Science. 1994;3(12):2366–2377. doi: 10.1002/pro.5560031221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Godzik A, Skolnick J, Kolinski A. Regularities interaction patterns of globular proteins. Protein Engineering. 1993;6(8):801–810. doi: 10.1093/protein/6.8.801. [DOI] [PubMed] [Google Scholar]
- 46.Vehlow C, Stehr H, Winkelmann M, et al. CMView: interactive contact map visualization and analysis. Bioinformatics. 2011;27(11):1573–1574. doi: 10.1093/bioinformatics/btr163.btr163 [DOI] [PubMed] [Google Scholar]
- 47.Barker JA, Thornton JM. An algorithm for constraint-based structural template matching: application to 3D templates with statistical analysis. Bioinformatics. 2003;19(13):1644–1649. doi: 10.1093/bioinformatics/btg226. [DOI] [PubMed] [Google Scholar]
- 48.Ward RM, Venner E, Daines B, et al. Evolutionary trace annotation server: automated enzyme function prediction in protein structures using 3D templates. Bioinformatics. 2009;25(11):1426–1427. doi: 10.1093/bioinformatics/btp160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Nebel JC. Generation of 3D templates of active sites of proteins with rigid prosthetic groups. Bioinformatics. 2006;22(10):1183–1189. doi: 10.1093/bioinformatics/btl040. [DOI] [PubMed] [Google Scholar]
- 50.Kuksa PP, Pavlovic V. Efficient motif finding algorithms for large-alphabet inputs. BMC Bioinformatics. 2010;11(supplement 8, article S1) doi: 10.1186/1471-2105-11-S8-S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Dror O, Benyamini H, Nussinov R, Wolfson H. MASS: multiple structural alignment by secondary structures. Bioinformatics. 2003;19(1):i95–i104. doi: 10.1093/bioinformatics/btg1012. [DOI] [PubMed] [Google Scholar]
- 52.Dror O, Benyamini H, Nussinov R, Wolfson HJ. Multiple structural alignment by secondary structures: algorithm and applications. Protein Science. 2003;12(11):2492–2507. doi: 10.1110/ps.03200603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Williams A, Gilbert DR, Westhead DR. Multiple structural alignment for distantly related all β structures using TOPS pattern discovery and simulated annealing. Protein Engineering. 2003;16(12):913–923. doi: 10.1093/protein/gzg116. [DOI] [PubMed] [Google Scholar]
- 54.Konagurthu AS, Whisstock JC, Stuckey PJ, Lesk AM. MUSTANG: a multiple structural alignment algorithm. Proteins. 2006;64(3):559–574. doi: 10.1002/prot.20921. [DOI] [PubMed] [Google Scholar]
- 55.Micheletti C, Orland H. MISTRAL: a tool for energy-based multiple structural alignment of proteins. Bioinformatics. 2009;25(20):2663–2669. doi: 10.1093/bioinformatics/btp506. [DOI] [PubMed] [Google Scholar]
- 56.Konagurthu AS, Reboul CF, Schmidberger JW, et al. MUSTANG-MR structural sieving server: applications in protein structural analysis and crystallography. PLoS ONE. 2010;5(4) doi: 10.1371/journal.pone.0010048.e10048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Siu WY, Mamoulis N, Yiu SM, Chan HL. A data-mining approach for multiple structural alignment of proteins. Bioinformation. 2010;4(8):366–370. doi: 10.6026/97320630004366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Sun H, Sacan A, Ferhatosmanoglu H, Wang Y. Smolign: a spatial motifs based protein multiple structural alignment method. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2012;9(1):14 pages. doi: 10.1109/TCBB.2011.67. [DOI] [PubMed] [Google Scholar]
- 59.Miller ML, Jensen LJ, Diella F, et al. Linear motif atlas for phosphorylation-dependent signaling. Science Signaling. 2008;1(35):p. ra2. doi: 10.1126/scisignal.1159433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Schwartz D, Church GM. Collection and motif-based prediction of phosphorylation sites in human viruses. Science Signaling. 2010;3(137):p. rs2. doi: 10.1126/scisignal.2001099. [DOI] [PubMed] [Google Scholar]
- 61.Standley DM, Yamashita R, Kinjo AR, Toh H, Nakamura H. SeSAW: balancing sequence and structural information in protein functional mapping. Bioinformatics. 2010;26(9):1258–1259. doi: 10.1093/bioinformatics/btq116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Wong YH, Lee TY, Liang HK, et al. KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns. Nucleic acids research. 2007;35:W588–W594. doi: 10.1093/nar/gkm322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Durek P, Schudoma C, Weckwerth W, Selbig J, Walther D. Detection and characterization of 3D-signature phosphorylation site motifs and their contribution towards improved phosphorylation site prediction in proteins. BMC Bioinformatics. 2009;10, article 117 doi: 10.1186/1471-2105-10-117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Encinar JA, Fernandez-Ballester G, Sánchez IE, et al. ADAN: a database for prediction of protein-protein interaction of modular domains mediated by linear motifs. Bioinformatics. 2009;25(18):2418–2424. doi: 10.1093/bioinformatics/btp424. [DOI] [PubMed] [Google Scholar]
- 65.Marchler-Bauer A, Panchenko AR, Shoemarker BA, Thiessen PA, Geer LY, Bryant SH. CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Research. 2002;30(1):281–283. doi: 10.1093/nar/30.1.281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Marchler-Bauer A, Anderson JB, Chitsaz F, et al. CDD: specific functional annotation with the Conserved Domain Database. Nucleic Acids Research. 2009;37(1):D205–D210. doi: 10.1093/nar/gkn845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Marchler-Bauer A, Lu S, Anderson JB, et al. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Research. 2011;39(1):D225–D229. doi: 10.1093/nar/gkq1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Kundrotas PJ, Zhu Z, Vakser IA. GWIDD: genome-wide protein docking database. Nucleic Acids Research. 2009;38(1):D513–D517. doi: 10.1093/nar/gkp944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Pugalenthi G, Suganthan PN, Sowdhamini R, Chakrabarti S. MegaMotifBase: a database of structural motifs in protein families and superfamilies. Nucleic Acids Research. 2008;36(1):D218–D221. doi: 10.1093/nar/gkm794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.May P, Kreuchwig A, Steinke T, Koch I. PTGL: a database for secondary structure-based protein topologies. Nucleic Acids Research. 2010;38(1):D326–D330. doi: 10.1093/nar/gkp980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Shulman-Peleg A, Nussinov R, Wolfson HJ. RsiteDB: a database of protein binding pockets that interact with RNA nucleotide bases. Nucleic Acids Research. 2009;37(1):D369–D373. doi: 10.1093/nar/gkn759. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Geysen HM, Rodda SJ, Mason TJ. A priori delineation of a peptide which mimics a discontinuous antigenic determinant. Molecular Immunology. 1986;23(7):709–715. doi: 10.1016/0161-5890(86)90081-7. [DOI] [PubMed] [Google Scholar]
- 73.Penninger JM, Bachmaier K. Review of microbial infections and the immune response to cardiac antigens. Journal of Infectious Diseases. 2000;181(6):S498–S504. doi: 10.1086/315613. [DOI] [PubMed] [Google Scholar]
- 74.Fernandez-Fuentes N, Oliva B, Fiser A. A supersecondary structure library and search algorithm for modeling loops in protein structures. Nucleic Acids Research. 2006;34(7):2085–2097. doi: 10.1093/nar/gkl156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Libbey JE, McCoy LL, Fujinami RS. Molecular mimicry in multiple sclerosis. International Review of Neurobiology. 2007;79:127–147. doi: 10.1016/S0074-7742(07)79006-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Smith GP. Filamentous fusion phage: novel expression vectors that display cloned antigens on the virion surface. Science. 1985;228(4705):1315–1317. doi: 10.1126/science.4001944. [DOI] [PubMed] [Google Scholar]
- 77.Devlin JJ, Panganiban LC, Devlin PE. Random peptide libraries: a source of specific protein binding molecules. Science. 1990;249(4967):404–406. doi: 10.1126/science.2143033. [DOI] [PubMed] [Google Scholar]
- 78.Scott JK, Smith GP. Searching for peptide ligands with an epitope library. Science. 1990;249(4967):386–390. doi: 10.1126/science.1696028. [DOI] [PubMed] [Google Scholar]
- 79.Jellis CL, Cradick TJ, Rennert P, et al. Defining critical residues in the epitope for a HIV-neutralizing monoclonal antibody using phage display and peptide array technologies. Gene. 1993;137(1):63–68. doi: 10.1016/0378-1119(93)90252-x. [DOI] [PubMed] [Google Scholar]
- 80.Matthews DJ, Wells JA. Substrate phage: selection of protease substrates by monovalent phage display. Science. 1993;260(5111):1113–1117. doi: 10.1126/science.8493554. [DOI] [PubMed] [Google Scholar]
- 81.Koivunen E, Wang B, Ruoslahti E. Isolation of a highly specific ligand for the α5β1 integrin from a phage display library. Journal of Cell Biology. 1994;124(3):373–380. doi: 10.1083/jcb.124.3.373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Pasqualini R, Koivunen E, Ruoslahti E. A peptide isolated from phage display libraries is a structural and functional mimic of an RGD-binding site on integrins. Journal of Cell Biology. 1995;130(5):1189–1196. doi: 10.1083/jcb.130.5.1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Schmitz R, Baumann G, Gram H. Catalytic specificity of phosphotyrosine kinases Blk, Lyn, c-Src and Syk as assessed by phage display. Journal of Molecular Biology. 1996;260(5):664–677. doi: 10.1006/jmbi.1996.0429. [DOI] [PubMed] [Google Scholar]
- 84.Fack F, Hügle-Dörr B, Song D, Queitsch I, Petersen G, Bautz EKF. Epitope mapping by phage display: random versus gene-fragment libraries. Journal of Immunological Methods. 1997;206(1-2):43–52. doi: 10.1016/s0022-1759(97)00083-5. [DOI] [PubMed] [Google Scholar]
- 85.Blüthner M, Schäfer C, Schneider C, Bautz FA. Identification of major linear epitopes on the sp100 nuclear PBC autoantigen by the gene-fragment phage-display technology. Autoimmunity. 1999;29(1):33–42. doi: 10.3109/08916939908995970. [DOI] [PubMed] [Google Scholar]
- 86.Ladner RC, Sato AK, Gorzelany J, de Souza M. Phage display-derived peptides as therapeutic alternatives to antibodies. Drug Discovery Today. 2004;9(12):525–529. doi: 10.1016/S1359-6446(04)03104-6. [DOI] [PubMed] [Google Scholar]
- 87.Brissette R, Prendergast JKA, Goldstein NI. Identification of cancer targets and therapeutics using phage display. Current Opinion in Drug Discovery and Development. 2006;9(3):363–369. [PubMed] [Google Scholar]
- 88.Hohm T, Limbourg P, Hoffmann D. A multiobjective evolutionary method for the design of peptidic mimotopes. Journal of Computational Biology. 2006;13(1):113–125. doi: 10.1089/cmb.2006.13.113. [DOI] [PubMed] [Google Scholar]
- 89.Castel G, Chtéoui M, Heyd B, Tordo N. Phage display of combinatorial peptide libraries: application to antiviral research. Molecules. 2011;16(5):3499–3518. doi: 10.3390/molecules16053499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Sherev T, Wiesmüller KH, Walden P. Mimotopes of tumor-associated T-cell epitopes for cancer vaccines determined with combinatorial peptide libraries. Applied Biochemistry and Biotechnology B. 2003;25(1):53–61. doi: 10.1385/MB:25:1:53. [DOI] [PubMed] [Google Scholar]
- 91.Zhong YW, Xu DP, Li XD, Dai JZ, Xu B, Li L. Epitope screening of influenza A (H3N2) by using phage display library. Zhonghua Shi Yan He Lin Chuang Bing Du Xue Za Zhi. 2009;23(4):272–274. [PubMed] [Google Scholar]
- 92.Sun E-C, Zhao J, Yang T, et al. Identification of a conserved JEV serocomplex B-cell epitope by screening a phage-display peptide library with a mAb generated against West Nile virus capsid protein. Virology Journal. 2011;8(100):1–10. doi: 10.1186/1743-422X-8-100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Rothenfluh DA, Bermudez H, O’Neil CP, Hubbell JA. Biofunctional polymer nanoparticles for intra-articular targeting and retention in cartilage. Nature Materials. 2008;7(3):248–254. doi: 10.1038/nmat2116. [DOI] [PubMed] [Google Scholar]
- 94.Giordano RJ, Edwards JK, Tuder RM, Arap W, Pasqualini R. Combinatorial ligand-directed lung targeting. Proceedings of the American Thoracic Society. 2009;6(5):411–415. doi: 10.1513/pats.200903-014AW. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Passarella RJ, Spratt DE, van der Ende AE, et al. Targeted nanoparticles that deliver a sustained, specific release of paclitaxel to irradiated tumors. Cancer Research. 2010;70(11):4550–4559. doi: 10.1158/0008-5472.CAN-10-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Lee GY, Kim JH, Oh GT, Lee BH, Kwon IC, Kim IS. Molecular targeting of atherosclerotic plaques by a stabilin-2-specific peptide ligand. Journal of Controlled Release. 2011;155(2):211–217. doi: 10.1016/j.jconrel.2011.07.010. [DOI] [PubMed] [Google Scholar]
- 97.Li J, Feng L, Fan L, et al. Targeting the brain with PEG-PLGA nanoparticles modified with phage-displayed peptides. Biomaterials. 2011;32(21):4943–4950. doi: 10.1016/j.biomaterials.2011.03.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Valuev VP, Afonnikov DA, Ponomarenko MP, Milanesi L, Kolchanov NA. ASPD (Artificially Selected Proteins/Peptides Database): a database of proteins and peptides evolved in vitro. Nucleic Acids Research. 2002;30(1):200–202. doi: 10.1093/nar/30.1.200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Mandava S, Makowski L, Devarapalli S, Uzubell J, Rodi DJ. RELIC—a bioinformatics server for combinatorial peptide analysis and identification of protein-ligand interaction sites. Proteomics. 2004;4(5):1439–1460. doi: 10.1002/pmic.200300680. [DOI] [PubMed] [Google Scholar]
- 100.Casey JL, Coley AM, Foley M. Phage display of peptides in ligand selection for use in affinity chromatography. Methods in Molecular Biology. 2007;421:111–124. doi: 10.1007/978-1-59745-582-4_8. [DOI] [PubMed] [Google Scholar]
- 101.Mumey BM, Bailey BW, Kirkpatrick B, Jesaitis AJ, Angel T, Dratz EA. A new method for mapping discontinuous antibody epitopes to reveal structural features of proteins. Journal of Computational Biology. 2003;10(3-4):555–567. doi: 10.1089/10665270360688183. [DOI] [PubMed] [Google Scholar]
- 102.Mumey B, Ohler N, Angel T, Jesaitis A, Dratz E. Filtering epitope alignments to improve protein surface prediction. Frontiers of High Performance Computing and Networking. 2006;4331:648–657. [Google Scholar]
- 103.Moreau V, Granier C, Villard S, Laune D, Molina F. Discontinuous epitope prediction based on mimotope analysis. Bioinformatics. 2006;22(9):1088–1095. doi: 10.1093/bioinformatics/btl012. [DOI] [PubMed] [Google Scholar]
- 104.Ru B, Huang J, Dai P, et al. MimoDB: a new repository for mimotope data derived from phage display technology. Molecules. 2010;15(11):8279–8288. doi: 10.3390/molecules15118279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Huang J, Ru B, Zhu P, et al. MimoDB 2.0: a mimotope database and beyond. Nucleic Acids Research. 2012;40:D271–D277. doi: 10.1093/nar/gkr922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Mayrose I, Shlomi T, Rubinstein ND, et al. Epitope mapping using combinatorial phage-display libraries: a graph-based algorithm. Nucleic Acids Research. 2007;35(1):69–78. doi: 10.1093/nar/gkl975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Huang YX, Bao YL, Guo SY, Wang Y, Zhou CG, Li YX. Pep-3D-Search: a method for B-cell epitope prediction based on mimotope analysis. BMC Bioinformatics. 2008;9, article 538 doi: 10.1186/1471-2105-9-538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Chen WH, Sun PP, Lu Y, Guo WW, Huang YX, Ma ZQ. MimoPro: a more efficient Web-based tool for epitope prediction using phage display libraries. BMC Bioinformatics. 2011;12, article 199 doi: 10.1186/1471-2105-12-199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Denisova GF, Denisov DA, Bramson JL. Applying bioinformatics for antibody epitope prediction using affinity-selected mimotopes—relevance for vaccine design. Immunome Research. 2010;6(supplement 2, article S6) doi: 10.1186/1745-7580-6-S2-S6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Bryson CJ, Jones TD, Baker MP. Prediction of immunogenicity of therapeutic proteins: validity of computational tools. BioDrugs. 2010;24(1):1–8. doi: 10.2165/11318560-000000000-00000. [DOI] [PubMed] [Google Scholar]
- 111.Sun P, Chen W, Huang Y, Wang H, Ma Z, Lv Y. Epitope prediction based on random peptide library screening: benchmark dataset and prediction tools evaluation. Molecules. 2011;16(6):4971–4993. doi: 10.3390/molecules16064971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Kulkarni-Kale U, Bhosle S, Kolaskar AS. CEP: a conformational epitope prediction server. Nucleic Acids Research. 2005;33(2):W168–W171. doi: 10.1093/nar/gki460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Moreau V, Fleury C, Piquer D, et al. PEPOP: computational design of immunogenic peptides. BMC Bioinformatics. 2008;9, article 71 doi: 10.1186/1471-2105-9-71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Ponomarenko J, Bui H-H, Li W, et al. ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics. 2008;9, article 514 doi: 10.1186/1471-2105-9-514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Rubinstein ND, Mayrose I, Martz E, Pupko T. Epitopia: a web-server for predicting B-cell epitopes. BMC Bioinformatics. 2009;10, article 287 doi: 10.1186/1471-2105-10-287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Van de Water J, Gershwin ME, Leung P, Ansari A, Coppel RL. The autoepitope of the 74-kD mitochondrial autoantigen of primary biliary cirrhosis corresponds to the functional site of dihydrolipoamide acetyltransferase. Journal of Experimental Medicine. 1988;167(6):1791–1799. doi: 10.1084/jem.167.6.1791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.El-Kased RF, Koy C, Deierling T, et al. Mass spectrometric and peptide chip epitope mapping of rheumatoid arthritis autoantigen RA33. European Journal of Mass Spectrometry. 2009;15(6):747–759. doi: 10.1255/ejms.1040. [DOI] [PubMed] [Google Scholar]
- 118.Burkart V, Siegenthaler RK, Blasius E, et al. High affinity binding of hydrophobic and autoantigenic regions of proinsulin to the 70 kDa chaperone DnaK. BMC Biochemistry. 2010;11(1, article 44) doi: 10.1186/1471-2091-11-44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Liang S, Zheng D, Standley DM, Yao B, Zacharias M, Zhang C. EPSVR and EPMeta: prediction of antigenic epitopes using support vector regression and multiple server results. BMC Bioinformatics. 2010;11, article 381 doi: 10.1186/1471-2105-11-381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Wee LJK, Simarmata D, Kam YW, Ng LFP, Tong JC. SVM-based prediction of linear B-cell epitopes using Bayes Feature Extraction. BMC Genomics. 2010;11(supplement 4, article S21) doi: 10.1186/1471-2164-11-S4-S21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Su C-H, Pal NR, Lin K-L, Chung I-F. Identification of amino acid propensities that are strong determinants of linear B-cell epitope using neural networks. PLoS ONE. 2012;7(2) doi: 10.1371/journal.pone.0030617.e30617 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Ansari HR, Raghava GP. Identification of conformational B-cell Epitopes in an antigen from its primary sequence. Immunome Research. 2010;6(1, article 6) doi: 10.1186/1745-7580-6-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Sweredoski MJ, Baldi P. COBEpro: a novel system for predicting continuous B-cell epitopes. Protein Engineering, Design and Selection. 2009;22(3):113–120. doi: 10.1093/protein/gzn075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Wang Y, Wu W, Negre NN, White KP, Li C, Shah PK. Determinants of antigenicity and specificity in immune response for protein sequences. BMC Bioinformatics. 2011;12, article 251 doi: 10.1186/1471-2105-12-251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Larsen MV, Lundegaard C, Lamberth K, et al. An integrative approach to CTL epitope prediction: a combined algorithm integrating MHC class I binding, TAP transport efficiency, and proteasomal cleavage predictions. European Journal of Immunology. 2005;35(8):2295–2303. doi: 10.1002/eji.200425811. [DOI] [PubMed] [Google Scholar]
- 126.Larsen MV, Lundegaard C, Lamberth K, Buus S, Lund O, Nielsen M. Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction. BMC Bioinformatics. 2007;8, article 424 doi: 10.1186/1471-2105-8-424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Doytchinova IA, Guan P, Flower DR. EpiJen: a server for multistep T cell epitope prediction. BMC Bioinformatics. 2006;7, article 131 doi: 10.1186/1471-2105-7-131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Feldhahn M, Dönnes P, Thiel P, Kohlbacher O. FRED—a framework for T-cell epitope detection. Bioinformatics. 2009;25(20):2758–2759. doi: 10.1093/bioinformatics/btp409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Hattotuwagama CK, Toseland CP, Guan P, et al. Toward prediction of class II mouse major histocompatibility complex peptide binding affinity: in silico bioinformatic evaluation using partial least squares, a robust multivariate statistical technique. Journal of Chemical Information and Modeling. 2006;46(3):1491–1502. doi: 10.1021/ci050380d. [DOI] [PubMed] [Google Scholar]
- 130.Lundegaard C, Lund O, Nielsen M. Prediction of epitopes using neural network based methods. Journal of Immunological Methods. 2011;374(1-2):26–34. doi: 10.1016/j.jim.2010.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Tung C-W, Ziehm M, Kämper A, Kohlbacher O, Ho S-Y. POPISK: T-cell reactivity prediction using support vector machines and string kernels. BMC Bioinformatics. 2011;12, article 446 doi: 10.1186/1471-2105-12-446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Kim Y, Sette A, Peters B. Applications for T-cell epitope queries and tools in the Immune Epitope Database and Analysis Resource. Journal of Immunological Methods. 2011;374(1-2):62–69. doi: 10.1016/j.jim.2010.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Ponomarenko J, Papangelopoulos N, Zajonc DM, Peters B, Sette A, Bourne PE. IEDB-3D: structural data within the immune epitope database. Nucleic Acids Research. 2011;39(1):D1164–D1170. doi: 10.1093/nar/gkq888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Modrow S, Hahn BH, Shaw GM. Computer-assisted analysis of envelope protein sequences of seven human immunodeficiency virus isolates: prediction of antigenic epitopes in conserved and variable regions. Journal of Virology. 1987;61(2):570–578. doi: 10.1128/jvi.61.2.570-578.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Schafer JRA, Jesdale BM, George JA, Kouttab NM, De Groot AS. Prediction of well-conserved HIV-1 ligands using a matrix-based algorithm, EpiMatrix. Vaccine. 1998;16(19):1880–1884. doi: 10.1016/s0264-410x(98)00173-x. [DOI] [PubMed] [Google Scholar]
- 136.Sung MH, Simon R. Genomewide conserved epitope profiles of HIV-1 predicted by biophysical properties of MHC binding peptides. Journal of Computational Biology. 2004;11(1):125–145. doi: 10.1089/106652704773416920. [DOI] [PubMed] [Google Scholar]
- 137.Xiao Y, Segal MR. Prediction of genomewide conserved epitope profiles of HIV-1: classifier choice and peptide representation. Statistical Applications in Genetics and Molecular Biology. 2005;4(1, article 25) doi: 10.2202/1544-6115.1158. [DOI] [PubMed] [Google Scholar]
- 138.Lacerda M, Scheffler K, Seoighe C. Epitope discovery with phylogenetic hidden markov models. Molecular Biology and Evolution. 2010;27(5):1212–1220. doi: 10.1093/molbev/msq008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Rusert P, Krarup A, Magnus C, et al. Interaction of the gp120 V1V2 loop with a neighboring gp120 unit shields the HIV envelope trimer against cross-neutralizing antibodies. Journal of Experimental Medicine. 2011;208(7):1419–1433. doi: 10.1084/jem.20110196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 140.Pickett BE, Sadat EL, Zhang Y, et al. ViPR: an open bioinformatics database and analysis resource for virology research. Nucleic Acids Research. 2012;40:D593–D598. doi: 10.1093/nar/gkr859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Hu X, Zhou W, Udaka K, Mamitsuka H, Zhu S. MetaMHC: a meta approach to predict peptides binding to MHC molecules. Nucleic Acids Research. 2010;38(2):W474–W479. doi: 10.1093/nar/gkq407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Toussaint NC, Kohlbacher O. OptiTope—a web server for the selection of an optimal set of peptides for epitope-based vaccines. Nucleic Acids Research. 2009;37(2):W617–W622. doi: 10.1093/nar/gkp293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Garcia-Boronat M, Diez-Rivero CM, Reinherz EL, Reche PA. PVS: a web server for protein sequence variability analysis tuned to facilitate conserved epitope discovery. Nucleic Acids Research. 2008;36:W35–W41. doi: 10.1093/nar/gkn211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Reche PA, Reinherz EL. PEPVAC: a web server for multi-epitope vaccine development based on the prediction of supertypic MHC ligands. Nucleic Acids Research. 2005;33(2):W138–W142. doi: 10.1093/nar/gki357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Reche PA, Glutting JP, Reinherz EL. Prediction of MHC class I binding peptides using profile motifs. Human Immunology. 2002;63(9):701–709. doi: 10.1016/s0198-8859(02)00432-9. [DOI] [PubMed] [Google Scholar]
- 146.Reche PA, Glutting JP, Zhang H, Reinherz EL. Enhancement to the RANKPEP resource for the prediction of peptide binding to MHC molecules using profiles. Immunogenetics. 2004;56(6):405–419. doi: 10.1007/s00251-004-0709-7. [DOI] [PubMed] [Google Scholar]
- 147.Lengauer T, Rarey M. Computational methods for biomolecular docking. Current Opinion in Structural Biology. 1996;6(3):402–406. doi: 10.1016/s0959-440x(96)80061-3. [DOI] [PubMed] [Google Scholar]
- 148.Hansch C. Dihydrofolate reductase inhibition. A study in the use of X-ray crystallography, molecular graphics, and quantitative structure-activity relations in drug design. Drug Intelligence and Clinical Pharmacy. 1982;16(5):391–395. doi: 10.1177/106002808201600506. [DOI] [PubMed] [Google Scholar]
- 149.Kuntz ID, Blaney JM, Oatley SJ, Langridge R, Ferrin TE. A geometric approach to macromolecule-ligand interactions. Journal of Molecular Biology. 1982;161(2):269–288. doi: 10.1016/0022-2836(82)90153-x. [DOI] [PubMed] [Google Scholar]
- 150.Horjales E, Branden CI. Docking of cyclohexanol-derivatives into the active site of liver alcohol dehydrogenase. Using computer graphics and energy minimization. Journal of Biological Chemistry. 1985;260(29):15445–15451. [PubMed] [Google Scholar]
- 151.DesJarlais RL, Sheridan RP, Dixon JS, Kuntz ID, Venkataraghavan R. Docking flexible ligands to macromolecular receptors by molecular shape. Journal of Medicinal Chemistry. 1986;29(11):2149–2153. doi: 10.1021/jm00161a004. [DOI] [PubMed] [Google Scholar]
- 152.Stamato FMLG, Longo E, Ferreira R, Tapia O. The catalytic mechanism of serine proteases. III. An Indo-ISCRF study of the methylacetate docking in α-chymotrypsin. Journal of Theoretical Biology. 1986;118(1):45–59. doi: 10.1016/s0022-5193(86)80007-8. [DOI] [PubMed] [Google Scholar]
- 153.Jorgensen WL. Rusting of the lock and key model for protein-ligand binding. Science. 1991;254(5034):954–955. doi: 10.1126/science.1719636. [DOI] [PubMed] [Google Scholar]
- 154.Chaudhury S, Gray JJ. Conformer selection and induced fit in flexible backbone protein-protein docking using computational and NMR ensembles. Journal of Molecular Biology. 2008;381(4):1068–1087. doi: 10.1016/j.jmb.2008.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 155.Olson AJ, Goodsell DS. Automated docking and the search for HIV protease inhibitors. SAR and QSAR in environmental research. 1998;8(3-4):273–285. doi: 10.1080/10629369808039144. [DOI] [PubMed] [Google Scholar]
- 156.Muegge I, Martin YC, Hajduk PJ, Fesik SW. Evaluation of PMF scoring in docking weak ligands to the FK506 binding protein. Journal of Medicinal Chemistry. 1999;42(14):2498–2503. doi: 10.1021/jm990073x. [DOI] [PubMed] [Google Scholar]
- 157.Lise S, Walker-Taylor A, Jones DT. Docking protein domains in contact space. BMC Bioinformatics. 2006;7, article 310 doi: 10.1186/1471-2105-7-310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 158.Novikov FN, Stroylov VS, Stroganov OV, Kulkov V, Chilov GG. Developing novel approaches to improve binding energy estimation and virtual screening: a PARP case study. Journal of Molecular Modeling. 2009;15(11):1337–1347. doi: 10.1007/s00894-009-0497-y. [DOI] [PubMed] [Google Scholar]
- 159.Pierce BG, Hourai Y, Weng Z. Accelerating protein docking in ZDOCK using an advanced 3D convolution library. PLoS ONE. 2011;6(9) doi: 10.1371/journal.pone.0024657.e24657 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 160.Cho H, Yun CW, Park WK, et al. Modulation of the activity of pro-inflammatory enzymes, COX-2 and iNOS, by chrysin derivatives. Pharmacological Research. 2004;49(1):37–43. doi: 10.1016/s1043-6618(03)00248-2. [DOI] [PubMed] [Google Scholar]
- 161.Bai S, Du T, Khosravi E. Applying internal coordinate mechanics to model the interactions between 8R-lipoxygenase and its substrate. BMC Bioinformatics. 2010;11(supplement 6, article S2) doi: 10.1186/1471-2105-11-S6-S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 162.Pedretti A, Villa L, Vistoli G. Modeling of binding modes and inhibition mechanism of some natural ligands of farnesyl transferase using molecular docking. Journal of Medicinal Chemistry. 2002;45(7):1460–1465. doi: 10.1021/jm011075w. [DOI] [PubMed] [Google Scholar]
- 163.Koehler NKU, Yang CY, Varady J, et al. Structure-based discovery of nonpeptidic small organic compounds to block the T cell response to myelin basic protein. Journal of Medicinal Chemistry. 2004;47(21):4989–4997. doi: 10.1021/jm030362s. [DOI] [PubMed] [Google Scholar]
- 164.Michels AW, Ostrov DA, Zhang L, et al. Structure-based selection of small molecules to alter allele-specific MHC class II antigen presentation. Journal of Immunology. 2011;187(11):5921–5930. doi: 10.4049/jimmunol.1100746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 165.Zaheer-Ul-Haq, Khan W. Molecular and structural determinants of adamantyl susceptibility to HLA-DRs allelic variants: an in silico approach to understand the mechanism of MLEs. Journal of Computer-Aided Molecular Design. 2011;25(1):81–101. doi: 10.1007/s10822-010-9404-y. [DOI] [PubMed] [Google Scholar]
- 166.Bhinge A, Chakrabarti P, Uthanumallian K, Bajaj K, Chakraborty K, Varadarajan R. Accurate detection of protein:ligand binding sites using molecular dynamics simulations. Structure. 2004;12(11):1989–1999. doi: 10.1016/j.str.2004.09.005. [DOI] [PubMed] [Google Scholar]
- 167.Tang XN, Lo CW, Chuang YC, et al. Prediction of the binding mode between GSK3β and a peptide derived from GSKIP using molecular dynamics simulation. Biopolymers. 2011;95(7):461–471. doi: 10.1002/bip.21603. [DOI] [PubMed] [Google Scholar]
- 168.Li X, Keskin O, Ma B, Nussinov R, Liang J. Protein-protein interactions: hot spots and structurally conserved residues often locate in complemented pockets that pre-organized in the unbound states: implications for docking. Journal of Molecular Biology. 2004;344(3):781–795. doi: 10.1016/j.jmb.2004.09.051. [DOI] [PubMed] [Google Scholar]
- 169.Huang B, Schroeder M. Using protein binding site prediction to improve protein docking. Gene. 2008;422(1-2):14–21. doi: 10.1016/j.gene.2008.06.014. [DOI] [PubMed] [Google Scholar]
- 170.Hetényi C, van der Spoel D. Toward prediction of functional protein pockets using blind docking and pocket search algorithms. Protein Science. 2011;20(5):880–893. doi: 10.1002/pro.618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 171.Eyrisch S, Helms V. Transient pockets on protein surfaces involved in protein-protein interaction. Journal of Medicinal Chemistry. 2007;50(15):3457–3464. doi: 10.1021/jm070095g. [DOI] [PubMed] [Google Scholar]
- 172.Duan Y, Reddy BVB, Kaznessis YN. Residue conservation information for generating near-native structures in protein-protein docking. Journal of Bioinformatics and Computational Biology. 2006;4(4):793–806. doi: 10.1142/s0219720006002223. [DOI] [PubMed] [Google Scholar]
- 173.Cockell SJ, Oliva B, Jackson RM. Structure-based evaluation of in silico predictions of protein-protein interactions using comparative docking. Bioinformatics. 2007;23(5):573–581. doi: 10.1093/bioinformatics/btl661. [DOI] [PubMed] [Google Scholar]
- 174.Pierri CL, Parisi G, Porcelli V. Computational approaches for protein function prediction: a combined strategy from multiple sequence alignment to molecular docking-based virtual screening. Biochimica et Biophysica Acta. 2010;1804(9):1695–1712. doi: 10.1016/j.bbapap.2010.04.008. [DOI] [PubMed] [Google Scholar]
- 175.Heberlé G, de Azevedo WF. Bio-inspired algorithms applied to molecular docking simulations. Current Medicinal Chemistry. 2011;18(9):1339–1352. doi: 10.2174/092986711795029573. [DOI] [PubMed] [Google Scholar]
- 176.Palma PN, Krippahl L, Wampler JE, Moura JJG. Bigger: a new (soft) docking algorithm for predicting protein interactions. Proteins. 2000;39(4):372–384. [PubMed] [Google Scholar]
- 177.Giordanetto F, Cotesta S, Catana C, et al. Novel scoring functions comprising QXP, SASA, and protein side-chain entropy terms. Journal of Chemical Information and Computer Sciences. 2004;44(3):882–893. doi: 10.1021/ci0499626. [DOI] [PubMed] [Google Scholar]
- 178.van Dijk ADJ, De Vries SJ, Dominguez C, Chen H, Zhou H-X, Bonvin AMJJ. Data-driven docking: HADDOCK’S adventures in CAPRI. Proteins. 2005;60(2):232–238. doi: 10.1002/prot.20563. [DOI] [PubMed] [Google Scholar]
- 179.Betzi S, Suhre K, Chétrit B, Guerlesquin F, Morelli X. GFscore: a general nonlinear consensus scoring function for high-throughput docking. Journal of Chemical Information and Modeling. 2006;46(4):1704–1712. doi: 10.1021/ci0600758. [DOI] [PubMed] [Google Scholar]
- 180.Durrant JD, McCammon JA. NNScore: a neural-network-based scoring function for the characterization of protein-ligand complexes. Journal of Chemical Information and Modeling. 2010;50(10):1865–1871. doi: 10.1021/ci100244v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 181.Durrant JD, McCammon JA. NNScore 2.0: a neural-network receptor-ligand scoring function. Journal of Chemical Information and Modeling. 2011;51(11):2897–2903. doi: 10.1021/ci2003889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 182.Ahmad S, Mizuguchi K. Partner-aware prediction of interacting residues in protein-protein complexes from sequence data. PLoS ONE. 2011;6(12) doi: 10.1371/journal.pone.0029104.e29104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 183.Grosdidier A, Zoete V, Michielin O. SwissDock, a protein-small molecule docking web service based on EADock DSS. Nucleic Acids Research. 2011;39(2):W270–W277. doi: 10.1093/nar/gkr366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 184.Dolghih E, Bryant C, Renslo AR, Jacobson MP. Predicting binding to P-glycoprotein by flexible receptor docking. PLoS Computational Biology. 2011;7(6) doi: 10.1371/journal.pcbi.1002083.e1002083 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 185.Prakhov ND, Chernorudskiy AL, Gainullin MR. VSDocker: a tool for parallel high-throughput virtual screening using AutoDock on Windows-based computer clusters. Bioinformatics. 2010;26(10):1374–1375. doi: 10.1093/bioinformatics/btq149. [DOI] [PubMed] [Google Scholar]
- 186.Ren J, Williams N, Clementi L, Krishnan S, Li WW. Opal web services for biomedical applications. Nucleic Acids Research. 2010;38(2):W724–W731. doi: 10.1093/nar/gkq503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 187.Zhang Q, Yang J, Liang K, et al. Binding interaction analysis of the active site and its inhibitors for neuraminidase (N1 Subtype) of human influenza virus by the integration of molecular ducking, FMO calculation and 3D-QSAR CoMFA modeling. Journal of Chemical Information and Modeling. 2008;48(9):1802–1812. doi: 10.1021/ci800041k. [DOI] [PubMed] [Google Scholar]
- 188.Li H, Gao Z, Kang L, et al. TarFisDock: a web server for identifying drug targets with docking approach. Nucleic Acids Research. 2006;34:W219–W224. doi: 10.1093/nar/gkl114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 189.Martin L, Catherinot V, Labesse G. kinDOCK: a tool for comparative docking of protein kinase ligands. Nucleic Acids Research. 2006;34:W325–W329. doi: 10.1093/nar/gkl211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 190.Lavecchia A, Cosconati S, Limongelli V, Novellino E. Modeling of Cdc25B dual specifity protein phosphatase inhibitors: docking of ligands and enzymatic inhibition mechanism. ChemMedChem. 2006;1(5):540–550. doi: 10.1002/cmdc.200500092. [DOI] [PubMed] [Google Scholar]
- 191.Schapira M, Raaka BM, Das S, et al. Discovery of diverse thyroid hormone receptor antagonists by high-throughput docking. Proceedings of the National Academy of Sciences of the United States of America. 2003;100(12):7354–7359. doi: 10.1073/pnas.1131854100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 192.Chen R, Li L, Weng Z. ZDOCK: an initial-stage protein-docking algorithm. Proteins. 2003;52(1):80–87. doi: 10.1002/prot.10389. [DOI] [PubMed] [Google Scholar]
- 193.Pierce B, Tong W, Weng Z. M-ZDOCK: a grid-based approach for Cn symmetric multimer docking. Bioinformatics. 2005;21(8):1472–1478. doi: 10.1093/bioinformatics/bti229. [DOI] [PubMed] [Google Scholar]
- 194.Ghoorah AW, Devignes MD, Smaïl-Tabbone M, Ritchie DW. Spatial clustering of protein binding sites for template based protein docking. Bioinformatics. 2011;27(20):2820–2827. doi: 10.1093/bioinformatics/btr493. [DOI] [PubMed] [Google Scholar]
- 195.Lyskov S, Gray JJ. The RosettaDock server for local protein-protein docking. Nucleic acids research. 2008;36:W233–238. doi: 10.1093/nar/gkn216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 196.Comeau SR, Gatchell DW, Vajda S, Camacho CJ. ClusPro: an automated docking and discrimination method for the prediction of protein complexes. Bioinformatics. 2004;20(1):45–50. doi: 10.1093/bioinformatics/btg371. [DOI] [PubMed] [Google Scholar]
- 197.Comeau SR, Gatchell DW, Vajda S, Camacho CJ. ClusPro: a fully automated algorithm for protein-protein docking. Nucleic Acids Research. 2004;32:W96–W99. doi: 10.1093/nar/gkh354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 198.Tuszynska I, Bujnicki JM. DARS-RNP and QUASI-RNP: new statistical potentials for protein-RNA docking. BMC Bioinformatics. 2011;12, article 348 doi: 10.1186/1471-2105-12-348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 199.Liu Z, Guo JT, Li T, Xu Y. Structure-based prediction of transcription factor binding sites using a protein-DNA docking approach. Proteins. 2008;72(4):1114–1124. doi: 10.1002/prot.22002. [DOI] [PubMed] [Google Scholar]
- 200.Xu B, Yang Y, Liang H, Zhou Y. An all-atom knowledge-based energy function for protein-DNA threading, docking decoy discrimination, and prediction of transcription-factor binding profiles. Proteins. 2009;76(3):718–730. doi: 10.1002/prot.22384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 201.Patronov A, Dimitrov I, Flower DR, Doytchinova I. Peptide binding prediction for the human class II MHC allele HLA-DP2: a molecular docking approach. BMC Structural Biology. 2011;11, article 32 doi: 10.1186/1472-6807-11-32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 202.Sircar A, Gray JJ. SnugDock: paratope structural optimization during antibody-antigen docking compensates for errors in antibody homology models. PLoS Computational Biology. 2010;6(1) doi: 10.1371/journal.pcbi.1000644.e1000644 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 203.Khan JM, Ranganathan S. PDOCK: a new technique for rapid and accurate docking of peptide ligands to Major Histocompatibility Complexes. Immunome Research. 2010;6(supplement 1, article S2) doi: 10.1186/1745-7580-6-S1-S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 204.Todman SJ, Halling-Brown MD, Davies MN, Flower DR, Kayikci M, Moss DS. Toward the atomistic simulation of T cell epitopes. Automated construction of MHC: peptide structures for free energy calculations. Journal of Molecular Graphics and Modelling. 2008;26(6):957–961. doi: 10.1016/j.jmgm.2007.07.005. [DOI] [PubMed] [Google Scholar]
- 205.Sathyapriya R, Vijayabaskar MS, Vishveshwara S. Insights into protein-DNA interactions through structure network analysis. PLoS Computational Biology. 2008;4(9) doi: 10.1371/journal.pcbi.1000170.e1000170 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 206.Saiz L, Vilar JMG. Protein-protein/DNA interaction networks: versatile macromolecular structures for the control of gene expression. IET Systems Biology. 2008;2(5):247–255. doi: 10.1049/iet-syb:20080091. [DOI] [PubMed] [Google Scholar]
- 207.Mata J. Genome-wide mapping of myosin protein-RNA networks suggests the existence of specialized protein production sites. The FASEB Journal. 2010;24(2):479–484. doi: 10.1096/fj.09-140335. [DOI] [PubMed] [Google Scholar]
- 208.Ascano M, Hafner M, Cekan P, Gerstberger S, Tuschl T. Identification of RNA-protein interaction networks using PAR-CLIP. Wiley Interdisciplinary Reviews: RNA. 2011;3(2):159–177. doi: 10.1002/wrna.1103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 209.Spirin V, Mirny LA. Protein complexes and functional modules in molecular networks. Proceedings of the National Academy of Sciences of the United States of America. 2003;100(21):12123–12128. doi: 10.1073/pnas.2032324100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 210.Wang J, Li M, Deng Y, Pan Y. Recent advances in clustering methods for protein interaction networks. BMC Genomics. 2010;11(3, article S10) doi: 10.1186/1471-2164-11-S3-S10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 211.Wu Z, Zhao X, Chen L. Identifying responsive functional modules from protein-protein interaction network. Molecules and Cells. 2009;27(3):271–277. doi: 10.1007/s10059-009-0035-x. [DOI] [PubMed] [Google Scholar]
- 212.Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U. Network motifs: simple building blocks of complex networks. Science. 2002;298(5594):824–827. doi: 10.1126/science.298.5594.824. [DOI] [PubMed] [Google Scholar]
- 213.Ciriello G, Guerra C. A review on models and algorithms for motif discovery in protein-protein interaction networks. Briefings in Functional Genomics and Proteomics. 2008;7(2):147–156. doi: 10.1093/bfgp/eln015. [DOI] [PubMed] [Google Scholar]
- 214.Dutkowski J, Tiuryn J. Identification of functional modules from conserved ancestral protein-protein interactions. Bioinformatics. 2007;23(13):i149–i158. doi: 10.1093/bioinformatics/btm194. [DOI] [PubMed] [Google Scholar]
- 215.Qian X, Yoon BJ. Effective identification of conserved pathways in biological networks using hidden Markov models. PLoS ONE. 2009;4(12) doi: 10.1371/journal.pone.0008070.e8070 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 216.Lim J, Hao T, Shaw C, et al. A protein-protein interaction network for human inherited ataxias and disorders of purkinje cell degeneration. Cell. 2006;125(4):801–814. doi: 10.1016/j.cell.2006.03.032. [DOI] [PubMed] [Google Scholar]
- 217.Wang L, Khankhanian P, Baranzini SE, Mousavi P. iCTNet: a Cytoscape plugin to produce and analyze integrative complex traits networks. BMC Bioinformatics. 2011;12, article 380 doi: 10.1186/1471-2105-12-380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 218.Suthram S, Dudley JT, Chiang AP, Chen R, Hastie TJ, Butte AJ. Network-based elucidation of human disease similarities reveals common functional modules enriched for pluripotent drug targets. PLoS Computational Biology. 2010;6(2) doi: 10.1371/journal.pcbi.1000662.e1000662 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 219.Dutta B, Pusztai L, Qi Y, et al. A network-based, integrative study to identify core biological pathways that drive breast cancer clinical subtypes. British Journal of Cancer. 2012;106(6):1107–1116. doi: 10.1038/bjc.2011.584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 220.Erten S, Bebek G, Koyutürk M. Vavien: an algorithm for prioritizing candidate disease genes based on topological similarity of proteins in interaction networks. Journal of Computational Biology. 2011;18(11):1561–1574. doi: 10.1089/cmb.2011.0154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 221.Nitsch D, Tranchevent LC, Gonalves JP, Vogt JK, Madeira SC, Moreau Y. PINTA: a web server for network-based gene prioritization from expression data. Nucleic Acids Research. 2011;39(2):W334–W338. doi: 10.1093/nar/gkr289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 222.Dunker AK, Cortese MS, Romero P, Iakoucheva LM, Uversky VN. Flexible nets: the roles of intrinsic disorder in protein interaction networks. The FEBS Journal. 2005;272(20):5129–5148. doi: 10.1111/j.1742-4658.2005.04948.x. [DOI] [PubMed] [Google Scholar]
- 223.Patil A, Kinoshita K, Nakamura H. Hub promiscuity in protein-protein interaction networks. International Journal of Molecular Sciences. 2010;11(4):1930–1943. doi: 10.3390/ijms11041930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 224.Patil A, Kinoshita K, Nakamura H. Domain distribution and intrinsic disorder in hubs in the human protein-protein interaction network. Protein Science. 2010;19(8):1461–1468. doi: 10.1002/pro.425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 225.Bustos DM. The role of protein disorder in the 14-3-3 interaction network. Molecular Biosystems. 2012;8(1):178–184. doi: 10.1039/c1mb05216k. [DOI] [PubMed] [Google Scholar]
- 226.Ivanyi-Nagy R, Darlix JL. Intrinsic disorder in the core proteins of flaviviruses. Protein and Peptide Letters. 2010;17(8):1019–1025. doi: 10.2174/092986610791498911. [DOI] [PubMed] [Google Scholar]
- 227.Wang J, Cao Z, Zhao L, Li S. Novel strategies for drug discovery based on intrinsically disordered proteins (IDPs) International Journal of Molecular Sciences. 2011;12(5):3205–3219. doi: 10.3390/ijms12053205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 228.Ekman D, Light S, Björklund AK, Elofsson A. What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biology. 2006;7(6, article R45) doi: 10.1186/gb-2006-7-6-r45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 229.Singh GP, Ganapathi M, Dash D. Role of intrinsic disorder in transient interactions of hub proteins. Proteins. 2007;66(4):761–765. doi: 10.1002/prot.21281. [DOI] [PubMed] [Google Scholar]
- 230.Kahali B, Ahmad S, Ghosh TC. Exploring the evolutionary rate differences of party hub and date hub proteins in Saccharomyces cerevisiae protein-protein interaction network. Gene. 2009;429(1-2):18–22. doi: 10.1016/j.gene.2008.09.032. [DOI] [PubMed] [Google Scholar]
- 231.Katebi AR, Kloczkowski A, Jernigan RL. Structural interpretation of protein-protein interaction network. BMC Structural Biology. 2010;10(supplement 1, article S4) doi: 10.1186/1472-6807-10-S1-S4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 232.Wang F, Liu M, Song B, et al. Prediction and characterization of protein-protein interaction networks in swine. Proteome Science. 2012;10, article 2 doi: 10.1186/1477-5956-10-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 233.Pang E, Lin K. Yeast protein-protein interaction binding sites: prediction from the motif-motif, motif-domain and domain-domain levels. Molecular BioSystems. 2010;6(11):2164–2173. doi: 10.1039/c0mb00038h. [DOI] [PubMed] [Google Scholar]
- 234.Kalaev M, Smoot M, Ideker T, Sharan R. NetworkBLAST: comparative analysis of protein networks. Bioinformatics. 2008;24(4):594–596. doi: 10.1093/bioinformatics/btm630. [DOI] [PubMed] [Google Scholar]
- 235.Jung SH, Hyun B, Jang WH, Hur HY, Han DS. Protein complex prediction based on simultaneous protein interaction network. Bioinformatics. 2009;26(3):385–391. doi: 10.1093/bioinformatics/btp668. [DOI] [PubMed] [Google Scholar]
- 236.Lo Y-S, Lin CY, Yang JM. PCFamily: a web server for searching homologous protein complexes. Nucleic Acids Research. 2010;38(2):W516–W522. doi: 10.1093/nar/gkq464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 237.Phan HT, Sternberg MJ. PINALOG: a novel approach to align protein interaction networks—implications for complex detection and function prediction. Bioinformatics. 2012;28:1239–1245. doi: 10.1093/bioinformatics/bts119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 238.Fox AD, Hescott BJ, Blumer AC, Slonim DK. Connectedness of PPI network neighborhoods identifies regulatory hub proteins. Bioinformatics. 2011;27(8):1135–1142. doi: 10.1093/bioinformatics/btr099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 239.Becker E, Robisson B, Chapple CE, Guénoche A, Brun C. Multifunctional proteins revealed by overlapping clustering in protein interaction network. Bioinformatics. 2012;28(1):84–90. doi: 10.1093/bioinformatics/btr621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 240.Dittrich MT, Klau GW, Rosenwald A, Dandekar T, Müller T. Identifying functional modules in protein-protein interaction networks: an integrated exact approach. Bioinformatics. 2008;24(13):i223–i231. doi: 10.1093/bioinformatics/btn161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 241.Sun CH, Hwang T, Oh K, Yi GS. DynaMod: dynamic functional modularity analysis. Nucleic Acids Research. 2010;38(2):W103–W108. doi: 10.1093/nar/gkq362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 242.Cui G, Shrestha R, Han K. ModuleSearch: finding functional modules in a protein-protein interaction network. Computer Methods in Biomechanics and Biomedical Engineering. 2012;15(7):691–699. doi: 10.1080/10255842.2011.555404. [DOI] [PubMed] [Google Scholar]
- 243.Erten S, Bebek G, Ewing RM, Koyutürk M. DADA: degree-aware algorithms for network-based disease gene prioritization. BioData Mining. 2011;4(1, article 19) doi: 10.1186/1756-0381-4-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 244.Shannon P, Markiel A, Ozier O, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 2003;13(11):2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 245.Martin A, Ochagavia ME, Rabasa LC, Miranda J, Fernandez-de-Cossio J, Bringas R. BisoGenet: a new tool for gene network building, visualization and analysis. BMC Bioinformatics. 2010;11, article 91 doi: 10.1186/1471-2105-11-91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 246.Bruckner S, Hüffner F, Karp RM, Shamir R, Sharan R. Torque: topology-free querying of protein interaction networks. Nucleic Acids Research. 2009;37(2):W106–W108. doi: 10.1093/nar/gkp474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 247.Franzosa E, Linghu B, Xia Y. Computational reconstruction of protein-protein interaction networks: algorithms and issues. Methods in Molecular Biology. 2009;541:89–100. doi: 10.1007/978-1-59745-243-4_5. [DOI] [PubMed] [Google Scholar]
- 248.Li SS, Wu C. Using peptide array to identify binding motifs and interaction networks for modular domains. Methods in Molecular Biology. 2009;570:67–76. doi: 10.1007/978-1-60327-394-7_3. [DOI] [PubMed] [Google Scholar]
- 249.Sardiu ME, Washburn MP. Building protein-protein interaction networks with proteomics and informatics tools. Journal of Biological Chemistry. 2011;286(27):23645–23651. doi: 10.1074/jbc.R110.174052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 250.Sanz-Pamplona R, Berenguer A, Sole X, et al. Tools for protein-protein interaction network analysis in cancer research. Clinical and Translational Oncology. 2012;14(1):3–14. doi: 10.1007/s12094-012-0755-9. [DOI] [PubMed] [Google Scholar]
- 251.Szklarczyk D, Franceschini A, Kuhn M, et al. The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Research. 2011;39(1):D561–D568. doi: 10.1093/nar/gkq973. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 252.Taboureau O, Nielsen SK, Audouze K, et al. ChemProt: a disease chemical biology database. Nucleic Acids Research. 2011;39(1):D367–D372. doi: 10.1093/nar/gkq906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 253.Tacutu R, Budovsky A, Fraifeld VE. The NetAge database: a compendium of networks for longevity, age-related diseases and associated processes. Biogerontology. 2010;11(4):513–522. doi: 10.1007/s10522-010-9265-8. [DOI] [PubMed] [Google Scholar]
- 254.Breitkreutz B-J, Stark C, Reguly T, et al. The BioGRID interaction Database: 2008 update. Nucleic Acids Research. 2008;36(1):D637–D640. doi: 10.1093/nar/gkm1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 255.Hsu CN, Lai JM, Liu CH, et al. Detection of the inferred interaction network in hepatocellular carcinoma from EHCO (Encyclopedia of Hepatocellular Carcinoma genes Online) BMC Bioinformatics. 2007;8, article 66 doi: 10.1186/1471-2105-8-66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 256.Xia K, Dong D, Han JDJ. IntNetDB v1.0: an integrated protein-protein interaction network database generated by a probabilistic model. BMC Bioinformatics. 2006;7, article 508 doi: 10.1186/1471-2105-7-508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 257.Fong JH, Marchler-Bauer A. Protein subfamily assignment using the Conserved Domain Database. BMC Research Notes. 2008;1, article 114 doi: 10.1186/1756-0500-1-114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 258.Stroganov OV, Novikov FN, Stroylov VS, Kulkov V, Chilov GG. Lead finder: an approach to improve accuracy of protein-ligand docking, binding energy estimation, and virtual screening. Journal of Chemical Information and Modeling. 2008;48(12):2371–2385. doi: 10.1021/ci800166p. [DOI] [PubMed] [Google Scholar]
