Abstract
Bacterial and viral pathogens affect their eukaryotic host partly by interacting with proteins of the host cell. Hence, to investigate infection from a systems’ perspective we need to construct complete and accurate host–pathogen protein–protein interaction networks. Because of the paucity of available data and the cost associated with experimental approaches, any construction and analysis of such a network in the near future has to rely on computational predictions. Specifically, this challenge consists of a number of sub-problems: First, prediction of possible pathogen interactors (e.g. effector proteins) is necessary for bacteria and protozoa. Second, the prospective host binding partners have to be determined and finally, the impact on the host cell analyzed. This review gives an overview of current bioinformatics approaches to obtain and understand host–pathogen interactions. As an application example of the methods covered, we predict host–pathogen interactions of Salmonella and discuss the value of these predictions as a prospective for further research.
Keywords: Host-Pathogen, Protein Interaction, Computational Biology, Interaction Prediction
1. Introduction
Despite tremendous advances in biomedical science, infectious diseases are still a major health problem due to the rise of novel variants of pathogens that are resistant to available drugs just as the multi-resistant Staphyloccocus aureus [1] or pathogenic Escherichia coli strains [2]. The approval of the oxazolidinone antibiotic Linezolid in 2000 was preceded by four decades without discovering any new structural class of antibacterial drugs [3], illustrating that there is an urgent need for new antiviral, antibacterial and antiprotozoal compounds. An understanding of host–pathogen interactions on the molecular level is crucial for the development of such therapeutics. While some experimental data for host–pathogen interactions exist (reviewed below), their relative paucity emphasizes the need for computational predictions. Modern technologies such as genome sequencing and high throughput methods for protein–protein interaction (PPI) detection provide a source of information that can be exploited by computational methods to provide short-cuts to laborious wet lab research [4]. These methods can support many different stages of knowledge generation, covering the generation of target lists for the investigation of unknown virulence factors, prediction of the points of attack in the host cell and enlightenment of the modes of molecular interactions during the infection process.
On a molecular level, infection can be viewed as the interference of pathogenic proteins (most of the microbial toxins are proteins) with the hosts’ interaction network. Viral proteins use the host protein synthesis machinery, whereas bacterial ones and proteins from parasitic protozoa are secreted or injected into host cells. Many pathogenic proteins follow general protein architectural rules and contain distinct functional and structural features, e.g., signal peptides that lead to targeted transport and pathogenic effector domains, which interact with the host system. A common pathogen strategy is to mimic eukaryotic PPI domains or binding motifs in order to remodel the host PPI network [5]. These structural mimicries are often the result of horizontal gene transfer (HGT) [6] in combination with rapidly evolving pathogen genomes as part of host–pathogen coevolution [7]. Some well-known host–pathogen PPI interactions are mediated by host SH2, SH3 and PDZ domains. For example, the Influenza A non-structural protein 1 (NS1) contains regions that bind the SH2 and SH3 domains of the regulatory subunits of PI3K [8], [9] and the class 1 PDZ-binding motifs of NS1 binds Scribble [10]. Moreover, these PPI motifs and domains can determine the virulence of the pathogen. For example, NS1 of avian and Spanish Influenza A has a SH3 domain-binding motif that preferably interacts with Crk-like protein, and this interaction increased Akt phosphorylation downstream [11]. Bacterial molecular mimicries include the effector espF of the enteropathogenic E. coli which interacts with human sorting nexin 9 in order to achieve membrane re-modeling [12]. The effector IpgB2 in Shigella as well as some of its homologs in other bacteria are functional mimicries of Rho family GTPases and interfere with host signaling pathways [13], whereas the chlamydial IncA protein contains SNARE like motifs used to interfere with intra-cellular trafficking in order to establish the inclusion membrane of Chlamydia (an obligate intracellular parasite) [14]. An important feature of pathogenic proteins is their propensity to target proteins from pathways that aid infection and counteract the hosts defense systems (“anti-immunology”), e.g. by apoptosis evasion [15], cell cycle activation [16], proteasomal and ubiquitination alteration [17], actin and cytoskeleton reorganization [18] and innate immunity inhibition [19]. To conclude, interactions of pathogen proteins with host proteins are often mediated by domains and motifs and these interactions are not random but target specific proteins of the host. We can exploit this fact in computational approaches to detect host–pathogen interactions. The methods for doing so will be discussed in this review.
This review is structured as follows: First, we will summarize resources of host–pathogen interactions (HP-PPI) currently available and we will discuss text-mining methods that are capable of extracting large-scale interaction data from the literature. Second, we will review bioinformatics methods to predict bacterial effector proteins (i.e., proteins that are secreted by bacterial pathogens in order to interact with the host system). Third, we will discuss the prediction of HP-PPIs. We will focus on methods utilizing domain–domain interactions (DDIs) and short linear motifs (SLMs) since comprehensive databases of these types of interactions are available [20], [21], [22], [23] and can be exploited for the prediction of HP-PPIs. Currently available HP-PPI networks are reviewed in the fourth part. The fifth part of the article provides an example analysis that shows how the discussed methods can contribute to the relatively under-explored field of host–pathogen interactions. Methods to increase the reliability and the pathogen class specificity of HP-PPI prediction methods are discussed last.
2. Databases and text-mining
Currently, there are several databases available that cover various aspects of HP-PPIs: broadly, these can be divided into databases of virulence related genes and of known HP-PPIs as detected by small or large scale screens. Some databases go beyond the scope of listing data and allow query-driven analyses by the user. We will also discuss HP-PPI specific text-mining approaches, as these approaches are valuable for mining HP-PPIs literature that is not contained in databases and can complement database searches.
2.1. Bacterial virulence related genes
For bacterial pathogens, the VFDB [24] and the MvirDB [25] databases comprise a collection of known bacterial virulence related genes such as toxins, resistance genes, and effector proteins which are delivered into the host cell by active transport. VFDB contains hand-curated data of experimentally validated virulence factors and covers currently data from 8 major bacterial pathogens. MvirDB is a meta-database that collects data from different sources including VFDB as well as databases dedicated to toxins, pathogenicity islands and antibiotic resistance and covers in addition to bacteria, eukaryotic and viral sequences. PATRIC (short for Pathosystems Resource Integration Center) is a meta-database that collects genomic data from bacterial and viral sequences and processes the data by a unified, automated annotation pipeline. The resource is comprehensive in terms of the covered species (>5000 partially or completely sequenced bacteria, >2600 viral species with several thousand strains) and offers a variety of comparative analysis tools [26].
Several databases are dedicated to effector proteins secreted by the Type III secretion system and provide curated and predicted sets of bacterial effectors and cognate proteins as the Effective database [27], the T3SE database [28], and the T3DB database [29].
2.2. HP-PPI databases
PHI-base [30] is a database covering around 1800 verified bacterial, fungal, and oomycete host–pathogen interactions. HPIDB [31] is a meta-database that collects host–pathogen interactions from different interaction resources such as the DIP [32] or MINT [33] PPI collections, and from specialized resources as the PIG (pathogen interaction gateway) database [34]. The latter one collects host–pathogen PPI information between human and several viral and bacterial pathogens as well as additional data from GO annotations and Interpro assignments. It is integrated into the PATRIC resource [26]. iRefWeb is a very comprehensive meta-database that, despite not being especially designed for HP-PPIs, allows to search for interaction pairs measured between different species [35]. A number of databases are especially dedicated to protein–protein interactions between viral proteins and host proteins such as the Virusmint database [36]. This database consists of more than 5000 interactions as well as literature annotations. Virhostnet is a similar database, but contains, aside from host–pathogen interactions, the interactomes of the host (human) and of the respective viruses. CAPIH [37] is a database for analyzing HIV related interactions. In addition, it provides genetic variation data and predictions of phosphorylation and methylation sites. Another database, GPS-prot, is a meta-database that allows graphical browsing of HIV–human interaction networks [38]. Where some of the aforementioned databases just collect the interaction data-sets, others curate data by hand like PHI-base or PIG. A summary of these databases can be found in Table 1 .
Table 1.
Overview of HP-PPI databases.
| Database | Species | Curated or meta-database | Special features | Size | Last update |
|---|---|---|---|---|---|
| PHI-base | 97 pathogens, 76 host species | Yes | Search by phenotype possible | 1335 interactions | 2009 |
| HPIDB | 389 pathogens 58 host species | Meta-database | BLAST | 22223 interactions | 2012 |
| PIG | 21 bacteria in the host–pathogen section | Meta-database | BLAST & domain information, graphical representation of HP-PPI network | 7180 interactions | 2012 |
| Virusmint | 10 viral species, host: human | Yes | Graphical representation of HP-PPI network | 5140 interactions | ? |
| Virhostnet | 180 viral species, host: human | Yes | – | 2671 interactions | ? |
| CAPIH | HIV/human | Meta-database | Resource for comparative analysis of HIV–host interactions with model organisms | 1447 interactions | ? |
| GPS-prot | HIV/human | Meta-database | Allows the integration of genetic interactions (RNAi screens) | ? | 2011 |
Host–Pathogen PPI databases. “Database”: the database name, “Species”: the amount and phyla of the species covered in the database, “Special features”: special technical aspects, “Size”: the amount of interactions, “Last update:” last update according to the prospective web-site.
2.3. Integrated database and analysis systems
Several databases do not only list interaction data but also allow further exploration of the data. Most of the databases allow graphical exploration of the interaction networks (all mentioned databases except HPIDB). HPIDB allows BLASTing against the interaction sets implementing to detect homologs as well as conserved interacting pairs (Interologs, described below). CAPIH visualizes differences between orthologs of human, macaque, chimp, and mouse in terms of genetic variation and protein modifications. This information can potentially allow interpreting possible differences of HIV–human interactions in their model organism equivalents which can be very useful for translating animal based drug discovery to humans.
2.4. Text-mining
A promising approach to supplement the knowledge on host–pathogen interactions is automated mining of the literature (e.g. PubMed abstracts); such procedures can uncover functional relationships and interactions not deposited in the primary interaction databases and can aid the identification of relevant literature to improve the curation process of host–pathogen related databases. Many approaches exist to accomplish this for intra-species interactions, however, only quite recently the value of text-mining has been assessed for host–pathogen interactions: Yin and co-workers evaluated the usage of a certain group of classification algorithms, namely support vector machines (SVM) to retrieve documents related to host–pathogen interactions. They used cross-validation experiments to show that this can be accomplished with a rate of 50% relevant articles in the top 10% of the ranked documents, where a random guess would yield 10% [39]. In a different study, Thieu et al. [40] employed two approaches: An SVM based approach similar as the one used by Yin et al. and an approach based on semantic analysis, which allows the computational description of dependencies between words reflecting the logical structure of a sentence. In addition to Yin et al., who only discriminate abstracts which are relevant to host–pathogen interaction from abstracts of arbitrary articles, Thieu et al. formulate two additional problems: find relevant sentences that describe the mode of interaction, and find the actual interacting entities. It could be shown that the semantic approach has a higher performance on the latter two tasks compared to the SVM but still only received a precision (fraction of correct predictions) of 0.16 and a recall (fraction of found known positive instances) of 0.11 in the identification of actual HP-PPIs on the given test-set, compared to 0.03 and 0.05 in case of the SVM. Both studies have been performed on an initial, hand-curated set of articles and show the general feasibility of data mining in the domain of host–pathogen interactions. In theory, such algorithms could be applied to the complete set of available literature to generate a comprehensive knowledge-base for host–pathogen interactions.
3. Identification of secreted virulence factors
Before predicting interaction partners, the proteins that are actually employed by the pathogen for this purpose (the effectors) have to be identified. In case of viruses, all proteins are candidates since they are all expressed in the host system. In case of bacteria and protozoa, the detection of suitable candidates must be performed prior to further analysis. This prediction may be specific for the employed transport system. Three of the seven known bacterial transport-systems (Types III, IV and VI) penetrate the host cell membranes and inject proteins into the cytosol. Other bacterial transport systems can be used if effector entry into the hosts’ cytosol is not required or mediated by alternative pathways: for example, the Type II secretion system has been shown to be relevant for bacterial virulence [41], [42] but does not provide a direct way for translocation into the host’s cytosol. Several of the bacterial transport systems can be employed by the same species. An example is Pseudomonas syringae, which delivers effectors by the Type III as well as by the Twin-Arginine system [43]. In the case of protozoa, the knowledge of the exact transport routes into the host are largely unknown [44] but effector specific motifs have been identified in some species which can be exploited for prediction. There is a need for general prediction methods that do not refer to a certain transporter since not all routes to deliver effectors are known. In this paragraph, we describe methods to detect effectors based on sequence homology or on the presence of a signal peptide, a sequence that leads to active transport into the host. Moreover, we discuss detection methods by genomic and functional properties as domain signatures that hint to a role as effector protein.
3.1. Homology-based methods
With a list of known effectors at hand, homology searches can be applied to detect effector candidates. On the one hand, this approach has the drawback of being restricted to known families of effector proteins. This issue becomes especially evident when effectors are evolutionary “invented” by genome re-arrangements, as it has been shown for Type III secreted effector proteins in gram(−) bacteria [45]. On the other hand, effectors from closely related species or effectors that are transmitted by horizontal gene transfer (HGT) are detectable by homology searches. A good example for this method has been the Shigella toxin, which renders EHEC strains of E. coli especially harmful [46]. In this manuscript, 300 known and predicted effectors were used to detect homologous candidates in the proteome of an E. coli EHEC strain. This strategy identified 65 candidates of which 60% could be shown to be secreted. The sensitivity of homology searches in identifying effectors can be roughly estimated by the number of orthologous effectors known to date. The T3DB database [29] provides an up-to-date set of effectors with their respective orthologous relationships between different species. Where 90 effectors do not have well defined orthologs, 138 could potentially be detected by their evolutionary relationships to other effectors (resulting in a sensitivity of 60%). Since in this analysis genes from different strains are unified into one entry, the amount of successfully identified effectors between different strains might be higher.
3.2. Signal based detection
A more straightforward way to detect secreted proteins is the prediction of the secretion signal. This is a peptide sequence that is recognized by the respective secretion apparatus. This prediction is possible if basic information on the secretion signal is available. These data can be used to train a Hidden Markov Model or a binary classification algorithm. Substrates of two of the bacterial transport systems (namely, the Type II and V secretion system, commonly known as the sec-dependent pathways) can be detected by both their cleavage signal and a hydrophobic N-terminal signal sequence. The latter is crucial for the transport into the periplasmatic space [47]. Even if these systems do not deliver directly into the host cell, substrates of these secretion systems have been shown to play a role in infection, as can be shown for Legionella [48]. Computationally, this signal can be encoded in an Hidden Markov Model (HMM) which can detect transported proteins with high accuracy (sensitivity and selectivity >88%), as implemented in the SignalP detection software [49]. This software is also capable of predicting proteins that are secreted by the exocytosis pathway in Eukaryotes. In gram(−) bacteria, the transport of effectors is often mediated by the Type III secretion apparatus (TTSS), a molecular syringe spanning through the bacterial and eukaryotic host cells [50]. The recognition of substrates by the TTSS is mediated by the N-terminal end of the effector proteins [51], [52], [53] and is in some cases supported by TTSS specific chaperones [54], [55]. Different studies applied binary classification algorithms (neural networks, Naive Bayes classifier, and SVMs) [56] for TTSS specific effector prediction resulting in predictions that are far better than random. This was determined in cross-validation experiments with a sensitivity up to 90% and a specificity of 88–97% [53], [56], [57]. Since the exact signal that leads to transport is unknown, these approaches use representations of amino-acid frequencies or frequencies of short amino-acid stretches from the proteins’ N-termini as input features. The first 25 to the first 100 amino-acids are sufficient to initiate Type III secretion [52]. Samudrala et al. complemented this approach by implementing additional effector specific features such as their low sequence conservation and unusual phylogenetic distribution [53]. These additional features are discriminative between effectors and non-effectors and substantially contribute to the prediction performance by increasing the ROC-AUC (used to measure the general classification performance, which is 0.5 for a random prediction and 1.0 for a perfect classification) from 0.80 to 0.95. An interesting case is the agent of malaria, the eukaryote Plasmodium falciparum that infects erythrocytes and stays intracellular in a vacuole. The effectors of Plasmodium have one of two similar short motifs in common, called PEXEL and vacuolar transport signal (VTS) that lead to transport out of the pathogen, across the vacuole and into the host cell [58], [59]. This motif can be utilized to detect novel virulence factors using a HMM or a sequence pattern. Using these motifs alone to detect secreted proteins would produce a large amount of false positives due to the shortness of the motifs, covering up to 28% of the Plasmodiums’ proteome [60]. Hiss et al. reported that these motifs are flanked by patterns of amino acids with similar physico-chemical properties. These patterns can be used to classify the predicted effectors further, hereby reducing the amount of candidates to ∼7% of the proteome [60]. Microarray data can be used to identify which effectors are active in certain stages of the life cycle, since malaria depends on the infection of erythrocytes and has a tissue specific life-cycle [59]. Another motif (the RxLR motif and flanking regions, similar to the PEXEL motif) has been described in several plant pathogens such as Phytophthora infestans, an oomycete. Remarkably, this signal is interchangeable with the PEXEL signal [61]. This finding might indicate a common mechanism used to deliver eukaryotic effectors. There is evidence that this mechanism works in the absence of the pathogen and therefore comprises a utilized host mechanism, but the topic is highly disputed [44], [62].
3.3. “Genomic” and “function based” approaches
To detect effectors delivered by transport systems for which the signal of substrate recognition is completely unknown, more general features of effectors must be employed. For some bacterial species, effectors tend to be organized in pathogenicity islands on the chromosome and their genes exhibit a deviating CG content [63]. Furthermore, their orthologs have an unusual phylogenetic distribution since they have been acquired by horizontal gene transfer [63]. They may therefore be more similar to eukaryotic domains as to other bacterial sequences [64]. A promising approach is based on machine learning that integrates several of such features. Such a method has been successfully applied for substrates of the Type IV secretion system in Legionella pneumophila [54], achieving a cross-validation performance of ROC-AUC 0.95 and a >90% success rate in wet-lab tests while testing for translocation. An interesting way to detect effectors is to identify domains suspected to interact with the host system as indicated by their binding mode or phylogenetic pattern. The latter case refers to so called “eukaryotic-like” domains unusual in bacteria but frequent in the eukaryotic host cell as ankyrin repeats [65], [66], [67]. In addition to serve as a basis for detection, these domains also indicate a function of the effector in the host and might mediate the interaction with the host system. Effector candidates can be detected by screening for these domains using HMM representations of such domains as they can be found in Pfam or SMART [68]. A convenient way to detect domain signatures is the use of Interpro-scan [69] which provides a comprehensive ‘umbrella-system’ for several domain signature databases. Many of the eukaryotic-like domains in effectors have been identified by expert knowledge in a case-to-case manner. Thousands of different domain signatures exist and comprehensively scanning these can provide additional domains of interest. The Effective database [27] provides a systematic screen for eukaryotic-like domains based on an evaluation of their taxonomic distribution between eukaryotes and pathogenic bacterial species. This database lists a frequency based Z-score for each domain in a pathogen which describes how more frequent the domain appears in the pathogenic species as in non-pathogenic ones. However, the reliability of this classification of eukaryotic domains has not been systematically assessed yet.
4. Prediction of HP-PPIs
The prediction of protein–protein interactions is a well-investigated problem for the intra-species case, and it is the next logical step to adapt these approaches for host–pathogen interactions. However, not all methods suitable for intra-species interaction prediction can be applied here. In this section, the most important methods for the predictions of HP-PPIs are discussed. These methods are based on mapping of known interactions, methods based on machine learning on sequence information, and methods based on domain–domain interactions. A special and very promising variant of the latter is the detection of short linear motifs (SLMs) that are the binding motifs for a number of eukaryotic interaction domains.
4.1. Interologs
The most evident approach is the mapping of known interactions onto homologous or orthologous pairs of sequences, based on the rationale of conserved interactions between the same or similar genes in different species. If an interaction of two genes is known in A, this interaction is likely to occur in B as well to maintain this common function. The transferred PPI is then called ‘Interolog’. The interolog method becomes weaker with larger evolutionary distances since genes may neo-functionalize during evolution and orthologous assignments will become more error-prone due to the lack of sequence similarity. In addition, many HP-PPIs will be unique inventions from the host–pathogen arms race that are not existent between orthologs of other species. Nevertheless, host–pathogen interactions may reveal known interactions, for instance if a gene acquired by HGT from the host could be re-utilized to attack the host system. Since reliable mappings can only be achieved between sequences with very high sequence similarity (>30%) or very significant E-Values as shown in the studies by Aloy et al. [70] and Yu et al. [71], the original approach may miss many fast evolving effectors. In the case of HP-PPIs, several authors therefore combined a sensitive version of the interolog approach with pruning steps to remove false positives that are unlikely to interact due to spatial or temporal constraints. Using interologs, Wuchty [72] and Lee et al. [73] predicted HP-PPIs between P. falciparum and human, uncovering interactions that are meaningful for the maintenance of the parasites intracellular life-style. Using an additional machine learning step, Wuchty reports a 79% true positive rate while picking up only 4.7% false positives on a structurally inferred test-set from a previous study [74]. Lee et al. show that their predictions are significantly enriched with relevant Gene Ontology terms dealing with Calcium related processes which are relevant to the Plasmodiums life cycle. Krishnadev et al. predicted interactions between human and three bacterial pathogens using a combined approach of stringent interologs and domain-based information (discussed below) hereby identifying several known as well as unknown relationships [75]. Tyagi et al. utilized the same approach to detect a small set of human–Helicobacter interactions [76]. Franzosa et al. implemented a structure based approach to detect human–virus interactions by first extracting human interacting pairs from the PDB and then mapping virus proteins to these structures by sequence similarity [77].
4.2. Genomic context based methods
Where text-mining as well as Interolog based approaches rely on known interactions, approaches to predict physically or at least functionally interacting proteins might uncover completely novel relationships. While genomic-context methods have been very successful to predict physical and functional interactions for species specific interactomes [78], [79], [80], direct application of genomic context methods on HP-PPIs lacks their underlying rationale (as genes should be neighbors on the chromosome to detect operons and conserved neighborhoods, or are under the same evolutionary constraints detectable by similar phylogenetic profiles, named the co-occurrence method). Nonetheless, interactions predicted using genomic context methods can be reliably mapped by orthology to other species, even if the genomes of said species did not contribute to the detection of the functional or physical interaction [79]. An application to a host–pathogen system would be an analogous procedure to the Interolog approach and is as such subjected to the same restrictions. In addition, the fraction of only functionally or indirectly interacting proteins in the predicted sets might be relatively big: in an early study of Huynen et al. [81], the fraction of predicted direct interactions has been reported to be between maximal ∼50% (for tracking of gene fusion events) and minimal 34% for the co-occurrence method. In general, the usefulness of a genomic context based approach for HP-PPI prediction has not been determined but might provide an interesting source of information complementary to known physical interactions.
4.3. Sequence based methods
A promising way is the utilization of prediction methods that work on general properties of the amino-acid sequence. For example, machine learning techniques on representations of short stretches of amino-acids and their chemical–physical properties can be employed for a very general PPI prediction. Dyer and co-workers integrated such an approach for the prediction of human–HIV interactions [82], [83]. In this work, the authors extended the sequence derived feature set with information on domains and information on the local network properties of the human interaction partner. In contrast to the studies done on bacterial and eukaryotic pathogens, the high amount of experimental data for HIV allows the creation of a rigid test-set of reliable interactions. Dyer et al. evaluated several combinations of their features used for prediction and a combination of all (sequence, domain, and network information) achieved the best performance. Depending on the chosen cut-off, high precision and recall can be achieved, e.g. 80% precision at 50% recall.
4.4. Domain based prediction
Dyer et al. reported that a strong contributor to the successful prediction is the domain information [83]. This observation motivates the use of domain based methods to infer HP-PPI interactions. For the intra-species case, several methods to predict domain–domain interactions have been described. These methods are based on the rationale that domains are the mediators of interactions and, more importantly, are universally used by different proteins. The Protein Structural Interactome map (PSIMAP) and its cognate database, Psibase, is a resource that predicts interactions between folds as classified in the SCOP database by analyzing PDB structures [84]. A general resource for domain–domain interactions based on domain definitions given by Pfam models is the Domain Interaction Map (DIMA) [22]. It integrates several methods for predicting domain–domain interactions which rely on different principles of detection: in the Domain Pair Exclusion Method (DPEA, [85]) the frequency of known interactions of co-occurring domain pairs are analyzed. The DIPD [86] approach uses machine learning techniques to determine domain interaction pairs that are predictive for protein-interactions. Other integrated methods are based on correlated mutations and search for domain pairs that contain co-evolving residues [87]. The Domain Profile method (DPROF) is based on the same idea as the co-occurrence method to predict interactions between complete proteins: based on the assumption that interacting domains are under evolutionary pressure to be maintained concerted, the phylogenetic distributions of a pair of domains are compared and an interaction is reported if these are sufficiently high and informative [88]. IPfam [89] and 3DiD [90] analyze PDB structures and extract domain pairs that are in close contact in these structures. Where all these methods have been reported to result in meaningful predictions, their overall performance is difficult to assess due to the incompleteness of the domain and structural databases. However, they have been used in several host–pathogen screens since they allow to predict interactions over wider evolutionary distances as simple homology. Dyer et al. developed an own scoring system between domains in interactions using Bayesian statistics and predicted interactions between H. sapiens and P. falciparum. He reported over 500 PPIs which tend to significantly attack human proteins that are in close proximity to each other in the human interactome, that significantly share GO annotations, and that are co-expressed in the pathogen [91]. Tyagi et al. used, besides interologs, iPfam based predictions to increase the amount of detected Heliobacter pylori–human interactions. Kim et al. [92] developed XooNET, a pre-calculated resource for Xanthomonas oryzae, an important rice pathogen, comprising beside the intra-cellular interactome, predictions for 3400 host–pathogen interactions based on stringent cut-offs on PSIMAP, iPfam and interologs. One of the objectives for the utilization of domain-interactions to predict HP-PPIs is that an effector protein might have been acquired evolutionary by re-using a common interaction domain. The principle mechanism to achieve such a fusion is a mechanism of flexible re-shuffling of genes allowing to fuse a functional domain with a signal peptide for transport into the host. Such evolutionary re-shuffling events have been described for 18–32% of bacterial effectors, depending on the species of interest [45]. Some domains might not be part of the pathogens genetic repertoire and have been reinvented as analogous molecular mimicry. Interesting candidates for such a process are short linear motifs, which might be easily gained by few mutations.
4.5. Short linear motifs
A wide range of protein domains recognize their substrates by short linear motifs (such as kinase domains and peptide recognition modules (PRMs), which include SH3 (Src homology 3), SH2 (Src homology 2), and PDZ (PSD-95/Discs-large/ZO-1) domains). Many play an important role in eukaryotic specific signal-transduction and regulation processes, making them an interesting target for pathogens. A general resource for these short linear motifs is the ELM database [93]. A recent survey of Davey et al. [94] identified at least 50 different short linear motifs that exist in viral proteins and that are used to manipulate their host system by acting as molecular mimicry. Examples from bacterial effector proteins have been described as well [79], [94], [95], [96]. Dampier et al. reported that the outcome of HIV infection treatments depend on the existence of linear motifs in the respective HIV strain indicating that these are functional and interact with the host [97]. Methods for predicting short linear moftis are based on experimentally derived data on binding specificity from synthetic peptide arrays (SPOT), oriented peptide array libraries (OPAL), protein domain microarrays, and phage display based techniques [98], [99], [100], [101], [102], [103]. The results of these experiments provide data on binding specificities which are captured in position weight matrices (PWMs), alternatively called position specific specificity profiles (PSSMs). These matrices describe the short linear motifs based on the frequencies of each measured amino-acid for each position [99]. Computational methods such as Scansite use PWMs to predict peptide binding motifs for a range of domain families such as kinase, SH3, SH2, PDZ, and 14–3–3 domains [104]. The scoring system of the PWMs can be exploited according to this rationale since higher binding specificities are reflected in higher scores assigned by the detection algorithm. In order to detect candidate molecular mimicry, scanning for short linear motifs in e.g. viral proteins and comparing them to the highest scoring human instances is needed, as done for PDZ domain recognition sites by Tonikian et al. [21]. They were able to detect known viral targets (such as SCRIB, whose functional disruption causes hyperproliferation) and the occurrence of binding motifs to the PDZ domain of this protein correlated with the oncogenic potential of HPV strains. Protein structure information such as protein disorder and solvent accessibility can be used in conjunction with PWMs to increase the accuracy of the prediction. MOTIPS is such a computational method that incorporates protein structure and domain specificity information in a Bayesian framework to predict binding partners for SH3 and S/T kinase domains [105].
5. Example: bacterial effector PPI prediction based on domain–domain interactions
In this section, we discuss an example analysis to deliver candidate HP-PPIs based on domain–domain interactions as a possible novel way to predict and rank bacteria–human interactions. We used publicly available data to predict human interaction partners of known effectors from Salmonella sp. Since the resulting lists of putative targets of a certain domain might be very large, we ranked the interactor candidates by the degree of the target proteins in the human PPI network. Due to the effectors tendency to interact with hubs [106], the actual interactors should appear enriched in the top of these lists.
For this approach, domain–domain interaction data have been downloaded from the DIMA 3.0 database [22]. A scan of all Pfam domains in proteins from human and the respective pathogens was downloaded from SIMAP (a database that provides pre-calculated data on all Pfam models [107]). Data on the human interactome were downloaded from iRefWeb [35]. For each human protein, we computed the degree as the number of reported physical interactions from high-throughput data (filtering experiments with <100 interactions). The resulting data contain the interacting Pfam domains and the human and bacterial proteins that contain one of the corresponding domains, with the degree of the proteins in decreasing order. The data were searched for effectors that have been experimentally verified or are homologous to a verified effector in another bacteria. This list has been obtained from the literature [108], [109]. In principle, this list could be extended by predicted effectors to get a more extended host–pathogen interaction network. Table 2 contains a summary of the possible interactions between Salmonella effectors and host proteins. It lists the human protein with the highest degree together with their function. The overall work-flow is pictured in Fig. 1 .
Table 2.
Effector–host PPI predictions for Salmonella typhimurium. A short description of the effector function is depicted beside their name. For each effector, the Pfam names of the interacting domains are depicted. The bacterial domain is named first. For the human domain, the protein with the highest degree, together with their function are given.
| SipA interferes with host cell actin cytoskeleton | ||
| PF09052–PF01044 | VINC_HUMAN | Actin filament binding protein |
| sopE–sopE2 activator of CDC42 and RAC1 | ||
| PF07487–PF00071 | CDC42_HUMAN | Cytoskeleton organiser |
| sptP tyrosine phosphatase and GTPase activating protein (GAP) activities | ||
| PF00102–PF00017 | GRB2_HUMAN | Adapter protein linking growth factor receptors and the Ras signaling pathway |
| PF00102–PF07714 | EGFR_HUMAN | EGF receptor |
| PF00102–PF00244 | 1433G_HUMAN | Adapter protein that regulates signaling pathways |
| PF00102–PF02262 | CBL_HUMAN | Adapter protein that negatively regulatesr signaling pathways |
| PF00102–PF00102 | PTN12_HUMAN | Dephosphorylates cellular tyrosine kinases |
| PF00102–PF00647 | EF1G_HUMAN | Molecular anchor |
| PF00102–PF01846 | PR40A_HUMAN | Binds to WASP (actin cytoskeleton organiser) |
| PF00102–PF00493 | MCM2_HUMAN | Component replicative helicase complex |
| PF00102–PF00041 | INSR_HUMAN | Receptor tyrosine kinase which mediates the actions of insulin |
| PF00102–PF01049 | CADH2_HUMAN | Calcium-dependent cell adhesion protein |
| PF00102–PF03810 | IMB1_HUMAN | Nuclear protein import |
| PF00102–PF02933 | NSF_HUMAN | Required for vesicle mediated transport |
| PF00102–PF00568 | WASP_HUMAN | WASP (actin cytoskeleton organiser) |
| PF00102–PF00091 | TBA3C_HUMAN | Tubulin forms microtubules |
| PF03545–PF00071 | CDC42_HUMAN | Cytoskeleton organiser |
| sspH2–slrP E3 ubiquitin ligase that interferes with host’s ubiquitination pathway | ||
| PF00560–PF02344 | MYC_HUMAN | Activates the transcription of growth-related genes |
| PF00560–PF08919 | ABL1_HUMAN | Non-receptor tyrosine-protein kinase involved in cell growth and survival, cytoskeleton remodeling etc. |
| PF00560–PF00076 | EWS_HUMAN | May act as arepressor in the tumorigenic process |
| PF00560–PF00071 | CDC42_HUMAN | Cytoskeleton organiser |
| PF00560–PF00020 | TNR1A_HUMAN | Receptor for TNFSF2/TNF-alpha (apoptosis) |
| PF00560–PF06068 | RUVB2_HUMAN | Helicase activity |
| PF00560–PF00012 | HSP7C_HUMAN | Repressor of transcriptional activation |
| PF00560–PF00149 | PP2AA_HUMAN | Modulatesactivity MAP-2 kinase, dephosphorylation of p53 |
| PF00560–PF00769 | EZRI_HUMAN | Probably involved in connections of major cytoskeletal structures to the plasma membrane |
| PF00560–PF07679 | RB_HUMAN | Regulator of cell division that acts as a tumor suppressor |
| PF00560–PF00028 | CADH2_HUMAN | Calcium-dependent cell adhesion protein |
| PF00560–PF03931 | SKP1_HUMAN | Essential component of the SCF ubiquitin ligase complex |
| PF00560–PF00134 | CCNK_HUMAN | Cyclin-K: possible role in transcriptional regulation |
| PF00560–PF01749 | IMA2_HUMAN | Functions in nuclear protein import |
| PF00560–PF03145 | SIAH1_HUMAN | E3 ubiquitin-protein ligase |
| sifA–sifB promotes bacterial survival, formation of Salmonella-induced filaments | ||
| PF06767–PF00169 | PLCG1_HUMAN | Regulation of intracellular signaling cascades involving IP3 |
| sseJ promotes bacterial survival, formation of Salmonella-induced filaments | ||
| PF06767–PF00169 | PLCG1_HUMAN | Regulation of intracellular signaling cascades involving IP3 |
| PF00657–PF00400 | 2ABB_HUMAN | Promotes proapoptotic activity within the PP2A complex |
| PF00657–PF00240 | UBQL4_HUMAN | Plays a role in the regulation of proteosomal protein degradation |
| PF00657–PF00097 | TRAF6_HUMAN | E3 ubiquitin ligase |
| PF00657–PF02037 | XRCC6_HUMAN | ATP-dependent helicase |
| SpvB actin cytoskeleton reorganiser | ||
| PF03496–PF00071 | CDC42_HUMAN | Cytoskeleton organiser |
| PF03496–PF00022 | ACTG_HUMAN | Actin filaments |
Figure 1.

Work-flow of the example analysis. (1) Detection of host-pathogen interaction candidates by a domain-domain based approach, (2) ranking of the candidates according to the degree in the host interaction network.
SopE and SopE2 of Salmonella typhimurium and the SopE Salmonella enterica ortholog are predicted and known to interact with Cdc42. In the case of the S. enterica, it was the only interaction predicted. A 3D structure is available for this well documented interaction and the physiological effects are also known: SopE activates Cdc42 resulting in actin cytoskeletal rearrangements that promote bacterial uptake in non-phagocytotic cells [110]. The SipA effector from S. typhimurium is known to inhibit F-actin destabilizing proteins [111], [112], such as vinculin, an F-actin binding protein. The SlrP and SspH2 effectors have a leucine rich repeat (PF00560) and are also known Cdc42 targeting proteins and inhibitor of interleukin expression. Correspondingly, they are predicted to bind the TNFα receptor and Cdc42. Moreover, SlrP and SspH2 are involved in the ubiquitin pathway [109], [113] and are here predicted interaction partners of E3 ubiquitin–protein ligase SIAH1 and S-phase kinase-associated protein 1, which regulates the ubiquitination of NFκB inhibitor alpha. SptP is a phosphatase and an inhibitor of Cdc42, helping the host cell to recover after bacterial invasion [114]. Its predicted targets are adapter proteins (most notably from the Ras pathway), the ubiquitin and proteasome pathway and with multiple cytoskeletal components. Of these cytoskeletal components, SptP is predicted to interact with WASP, a well-known actin filament stabilizing protein and target for bacterial effectors [5], with a WASP binding protein (Pre-mRNA-processing factor 40 homolog A), and with tubulin and with a protein involved in vesicular transport (Vesicle-fusing ATPase). SpvB is an enzyme that ribosylates actin and is predicted to interact with actin and Cdc42. SifA, SifB and SseJ all share a domain that mediates the interaction with Phosphoinositide phospholipase C-gamma-1, an interaction that is structurally resolved [115]. SifA and SifB are involved in the formation of Salmonella induced filaments (Sifs). SseJ has an additional lipase/acylhydrolase domain and is a negative regulator of Sif formation. Its activity is a virulence determinant [116]. Our prediction indicates a strong role in ubiquitin pathway alteration.
To conclude, the Salmonella data are consistent with previous work on effector protein interactions by predicting interactions with all known pathways that are targeted upon infection.
6. Increasing the precision of HP-PPI predictions
Many of the discussed approaches tend to over-predict the amount of interaction partners. For example, domain–domain interaction based methods will return many wrong positive predictions for frequent domain signatures. In general, the optimal trade-off between sensitivity and selectivity could be found with benchmark sets of HP-PPIs. These sets only exist for a couple of host–pathogen pairs (as HIV–human). Therefore, the challenge is to prune over-predicted interactions in a biological meaningful way to get a realistic picture of the interactome and to prioritize interesting targets for follow-up experiments (such as structural modeling). So, the precision of the HP-PPI predictions must be increased by removing biologically unlikely results. On the host side, this can be achieved by integrating data on tissue specific expression (if the pathogen is known to attack a certain tissue), and on the pathogen side by removing genes which are not expressed in the infection cycle. In addition, gene expression analysis of cells during infection may hint to genes activated due to the hosts defense mechanisms. Dyer et al. showed that genes of P. falciparum whose proteins are predicted to interact with human proteins tend to be co-expressed in the different life-cycle states of the parasite [91]. Wuchty used gene expression information on the Plasmodium’s life cycle, prediction of exported proteins, and human tissue specificity to prune candidate lists for the prediction of a P. falciparum–human PPI network, breaking down an initial network of >100 000 candidates to a set of 2244 interacting pairs [72]. Davis et al. refined predictions for 10 different bacterial pathogens by the use of host and pathogen expression data reducing the amount of predictions from several thousands to a few hundred high confidence candidates [74].
7. Different biology of viruses and bacteria – possible implications for HP-PPI predictions
Several discussed rationales behind the HP-PPI prediction may be limited to certain pathogen classes. Molecular mimicries are found in viruses and bacteria, but they can originate by different ways and be subjected to different selection pressures.
Bacteria exchange genetic material with the host [117] and HGTs leading to bacterial virulence related genes have some well documented cases, e.g. some effectors in Legionella (RalF effector in particular [118], [64]; for a review see Stebbins et al. [119]). However, not all mimicry originate by HGT, and several reported cases of functional or structural bacterial mimicry are probably de novo evolutionary inventions, like the Shigella IpgB2 [13] or the E. coli (EPEC) EspF effector [12]. Detecting these effectors by Interolog or genomic context approaches is difficult. A possible solution for these cases would be the individual modeling of the three-dimensional structure of putative effector–target complexes, which is, for now, intractable for a large set of sequences. On the other hand, while focusing on domains, short eukaryotic interaction scaffold and signaling domains have been reported for several known or suspected effector proteins [65], [120], [121]. Evidently, these eukaryotic-like domains can also be detected in effectors with clearly eukaryotic origin (acquired by HGT), such as the Legionella LeS2 effector and others [121], [122]. These examples were detected by the use of domain-signatures or sensitive sequence similarity searches. Bacteria exchange their genetic material frequently by conjugation and the exchange of plasmids, often exchanging virulence factors [117], [123], [124]. Some obligate intra-cellular bacteria as Chlamydia are excluded from this source of rapid genetic exchange as long as they do not share their habitat with other bacteria, as reported for bacterial symbionts in amoebae [125]. The exchange of genetic material results in a phylogenetic signal that can be used to predict effectors.
Viruses mutate very rapidly but have strong constraints on their genomic size. Such constraints may lead to a preference for mimicking short linear motifs. These SLMs can be easily achieved by mutations in viral effector proteins (e.g. the generation of a PxxP binding motif to interact with SH3 domains). The mutational potential and evolutionary pressure to limit the genome size of viruses makes the search for clear examples of HGT difficult. It is thus not clear to what extent the large overlap of host–pathogen and host–host interaction interfaces [77] is caused by HGT (instead of being new evolutionary inventions). In a recent work, Rappoport et al. systematically investigated HGT events from eukaryotes to viruses and could identify several instances of dsDNA viruses that clearly acquired genetic material from the host. In addition, they report that these are often streamlined into single domain proteins without linker regions, a finding in congruence with the viruses’ genomic size constraints [126].
To conclude, Interolog and domain signature based methods may be more applicable to non-viral pathogens, whereas detection of short linear motifs might be more successful for viral genomes for which many of these have been reported [127], [128]. Genomic context and Interolog methods are most successful in clear cases of HGT. The structural modeling can be applied to all, if 3D structures of related host species are available. Domain–domain based interaction methods are applicable to detect interactions based on HGT from host to pathogen and might be fruitful for viruses considering the described streamlining effect.
8. What do the HP-PPI networks tell us?
The first studies of host pathogen PPI networks were based on high-throughput screens such as yeast-two-hybrid and more recently, genome-wide RNAi scans to screen for functional or genetic interactions. Their inherent potential makes them indispensable to understand the host–pathogen interaction on a genome-wide and systems biology scale. Y2H based networks, sometimes backed up by literature mining (for HIV type 1 virus text mining see [129], [130]), are the primal source of recent day physical HP-PPI networks of viruses, bacteria and parasitic protozoa. These networks are available for HIV type 1 virus [131], [132], [133], Hepatitis C virus [134], [135], Influenza [136], SARS-coronavirus [137], Herpes simplex virus [138], [139], Dengue virus [140], Epstein–Barr virus [141], Yersinia pestis, Bacillus anthracis and Francisella tularensis [142]. These studies have major implications for our understanding of infection, as the extent of the HP-PPI network and the effectiveness of pathogen interference can only be appreciated by genome-wide assays. Systems biology studies show that host pathways are targeted (i) specifically, (ii) by multiple effectors and (iii) preferably at hubs (highly connected proteins in the interaction networks). This in accordance with the observation, that effector proteins are rich in binding motifs and interacting domains, a fact that is seen in effectors of pathogens of animals as well as plants [106].
For example, acetylated HIV integrase has thirteen known binding partners and effects key proteins in several processes, ranging from chromatin remodeling to cytoskeleton rearrangement [133]. HCV and influenza HP-PPI networks also confirm the aforementioned view, whereby pathways are altered by direct PPI interactions from receptor scaffolds down to the level of transcription factors [136]. Navratil et al. showed that 22% of the interferon pathway, a major antiviral player, is directly interacting with viral proteins [143]. As mentioned in the introduction, the targeted pathways are related to cell cycle regulation, nuclear transport, ubiquitin, proteasomal degradation and immune response regulation [79], [82], [142], [144], [145]. These pathways are targeted in order to effectively manipulate the host system and circumvent the immune system. It could be shown that two evolutionary distinct pathogens of Arabidopsis, the bacterium P. syringae and the eukaryote Hyaloperonospora arabidopsidis target the same pathways as a result of convergent evolution [106]. The other recurring theme in HP-PPI network analysis, the targeting of hubs and bottlenecks by pathogen proteins, is usually explained as an evolutionary method of the pathogen to try to maximize their influence on the host to redirect the network priorities to increase pathogen fitness. Franzosa and Xia [77] reported that the viral proteins structurally mimic the interface of host regulatory protein domains that are involved in transient contacts. This way, pathogen proteins can out-compete host proteins for interactions [146].
For now, the most feasible structural approaches are investigating domain–domain interactions where other approaches such as 3-dimensional modeling of host–pathogen networks are still too computational demanding to be easily automated for a large scale prediction. Known domain–domain interactions were the starting point for the PPI mapping of P. falciparum [147]. Likewise, the DIP and iPfam databases were used for the construction of the H. pylori–human interaction network [76]. We used a similar approach in this article to point out the merit of combining a domain based interaction prediction method and general network characteristics to get probable interaction partners of known bacterial effector proteins. The present-day challenge is the combination of structural biology and network biology to increase the reliability and completeness of HP-PPI networks. Another difficult task is the implementation of non-domain related interactions such as short linear motifs; this will complete our view on host–pathogen interactions as the motifs are known to be enriched in viral proteins [148].
9. Biological implications of HP interactomics and future perspectives
In this review, we focused on the prediction of HP-PPI and the analysis of their impact on the host. One important prospective of this research is the development of novel treatment strategies. Structure-based predictions of domain–domain interactions and motif binding allow the development of specific compounds that interfere with HP-PPIs as it has been done with small molecules that interfere with the p53 [149], TNFα [150] and Wnt [151] pathways as possible cancer treatments. Interestingly, these pathways may also be targets of viruses and bacteria: the p53, TNFα and Wnt pathways are altered by pathogens to avoid immune induced apoptosis. This indicates that the pool of possible targets is large and to a degree overlapping with diseases other than infection.
The search for new antibiotics and infection-related drugs will be greatly supported by structural modeling of inhibitor–protein complexes to block elementary steps in the pathogens interaction with the host. An example of this approach is the inhibition of neuraminidase inhibitors for treatment of influenza. Since resistance of influenza to neuraminidase inhibitors such as Tamiflu® is already widespread [152], additional drugs are urgently needed. Every confirmed interaction in HP-PPI can be potentially used for the development of novel antiviral drugs.
Until recently, treatments focused on targeting a few key players in the etiology of the disease. Systems biology provides another treatment paradigm: can we use the information that HP-PPI networks provide us to design strategies to combat infection at the network level? The HP-PPI or host PPI networks can both be targeted for this purpose [153]. Systems biology approaches may help to identify the impact of virulence factors on the host system based on computational models of signal transduction and pathway analysis. Key in this approach is the application of small molecules against several nodes that are altered upon infection [154]. Both treatment viewpoints are yet to be explored and both are critically dependent on the ongoing optimization of the techniques presented in this review.
References
- 1.Parer S., Lotthé A., Chardon P., Poncet R., Jean-Pierre H., Jumas-Bilak E. Infect. Control Hosp. Epidemiol. 2012;33:167–174. doi: 10.1086/663703. [DOI] [PubMed] [Google Scholar]
- 2.Buchholz U., Bernard H., Werber D., Böhmer M.M., Remschmidt C., Wilking H., Deleré Y., Van der Heiden M., Adlhoch C., Dreesman J., Ehlers J., Ethelberg S., Faber M., Frank C., Fricke G., Greiner M., Höhle M., Ivarsson S., Jark U., Kirchner M., Koch J., Krause G., Luber P., Rosner B., Stark K., Kühne M. N. Engl. J. Med. 2011;365:1763–1770. doi: 10.1056/NEJMoa1106482. [DOI] [PubMed] [Google Scholar]
- 3.Walsh C. Nat. Rev. Microbiol. 2003;1:65–70. doi: 10.1038/nrmicro727. [DOI] [PubMed] [Google Scholar]
- 4.Magliani W., Conti S., Salati A., Arseni S., Ravanetti L., Frazzi R., Polonelli L. Curr. Med. Chem. 2004;11:1793–1800. doi: 10.2174/0929867043364964. [DOI] [PubMed] [Google Scholar]
- 5.Elde N.C., Malik H.S. Nat. Rev. Microbiol. 2009;7:787–797. doi: 10.1038/nrmicro2222. [DOI] [PubMed] [Google Scholar]
- 6.Lurie-Weinberger M.N., Gomez-Valero L., Merault N., Glöckner G., Buchrieser C., Gophna U. Int. J. Med. Microbiol. 2010;300:470–481. doi: 10.1016/j.ijmm.2010.04.016. [DOI] [PubMed] [Google Scholar]
- 7.Aguileta G., Refrégier G., Yockteng R., Fournier E., Giraud T. Infect. Genet. Evol. 2009;9:656–670. doi: 10.1016/j.meegid.2009.03.010. [DOI] [PubMed] [Google Scholar]
- 8.Shin Y.-K., Li Y., Liu Q., Anderson D.H., Babiuk L.A., Zhou Y. J. Virol. 2007;81:12730–12739. doi: 10.1128/JVI.01427-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hale B.G., Batty I.H., Downes C.P., Randall R.E. J. Biol. Chem. 2008;283:1372–1380. doi: 10.1074/jbc.M708862200. [DOI] [PubMed] [Google Scholar]
- 10.Thomas M., Kranjec C., Nagasaka K., Matlashewski G., Banks L. Virol J. 2011;8:25. doi: 10.1186/1743-422X-8-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Heikkinen L.S., Kazlauskas A., Melén K., Wagner R., Ziegler T., Julkunen I., Saksela K. J Biol Chem. 2008;283:5719–5727. doi: 10.1074/jbc.M707195200. [DOI] [PubMed] [Google Scholar]
- 12.Alto N.M., Weflen A.W., Rardin M.J., Yarar D., Lazar C.S., Tonikian R., Koller A., Taylor S.S., Boone C., Sidhu S.S., Schmid S.L., Hecht G.A., Dixon J.E. J. Cell Biol. 2007;178:1265–1278. doi: 10.1083/jcb.200705021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Alto N.M., Shao F., Lazar C.S., Brost R.L., Chua G., Mattoo S., McMahon S.A., Ghosh P., Hughes T.R., Boone C., Dixon J.E. Cell. 2006;124:133–145. doi: 10.1016/j.cell.2005.10.031. [DOI] [PubMed] [Google Scholar]
- 14.Delevoye C., Nilges M., Dehoux P., Paumet F., Perrinet S., Dautry-Varsat A., Subtil A. PLoS Pathog. 2008;4:e1000022. doi: 10.1371/journal.ppat.1000022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Böhme L., Rudel T. Microbes Infect. 2009;11:1063–1070. doi: 10.1016/j.micinf.2009.08.014. [DOI] [PubMed] [Google Scholar]
- 16.Swanton C., Card G.L., Mann D., McDonald N., Jones N. Trends Biochem. Sci. 1999;24:116–120. doi: 10.1016/s0968-0004(99)01354-7. [DOI] [PubMed] [Google Scholar]
- 17.Collins C.A., Brown E.J. Trends Cell Biol. 2010;20:205–213. doi: 10.1016/j.tcb.2010.01.002. [DOI] [PubMed] [Google Scholar]
- 18.Rottner K., Stradal T.E.B., Wehland J. Dev. Cell. 2005;9:3–17. doi: 10.1016/j.devcel.2005.06.002. [DOI] [PubMed] [Google Scholar]
- 19.Diacovich L., Gorvel J.-P. Nat. Rev. Microbiol. 2010;8:117–128. doi: 10.1038/nrmicro2295. [DOI] [PubMed] [Google Scholar]
- 20.Raghavachari B., Tasneem A., Przytycka T.M., Jothi R. Nucleic Acids Res. 2008;36:D656–D661. doi: 10.1093/nar/gkm761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tonikian R., Zhang Y., Sazinsky S.L., Currell B., Yeh J.-H., Reva B., Held H.A., Appleton B.A., Evangelista M., Wu Y., Xin X., Chan A.C., Seshagiri S., Lasky L.A., Sander C., Boone C., Bader G.D., Sidhu S.S. PLoS Biol. 2008;6:e239. doi: 10.1371/journal.pbio.0060239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Luo Q., Pagel P., Vilne B., Frishman D. Nucleic Acids Res. 2011;39:D724–D729. doi: 10.1093/nar/gkq1200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gfeller D., Butty F., Wierzbicka M., Verschueren E., Vanhee P., Huang H., Ernst A., Dar N., Stagljar I., Serrano L., Sidhu S.S., Bader G.D., Kim P.M. Mol. Syst. Biol. 2011;7:484. doi: 10.1038/msb.2011.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Chen L., Yang J., Yu J., Yao Z., Sun L., Shen Y., Jin Q. Nucleic Acids Res. 2005;33:D325–D328. doi: 10.1093/nar/gki008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zhou C.E., Smith J., Lam M., Zemla A., Dyer M.D., Slezak T. Nucleic Acids Res. 2007;35:D391–D394. doi: 10.1093/nar/gkl791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gillespie J.J., Wattam A.R., Cammer S.A., Gabbard J.L., Shukla M.P., Dalay O., Driscoll T., Hix D., Mane S.P., Mao C., Nordberg E.K., Scott M., Schulman J.R., Snyder E.E., Sullivan D.E., Wang C., Warren A., Williams K.P., Xue T., Yoo H.S., Zhang C., Zhang Y., Will R., Kenyon R.W., Sobral B.W. Infect. Immun. 2011;79:4286–4298. doi: 10.1128/IAI.00207-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Jehl M.-A., Arnold R., Rattei T. Nucleic Acids Res. 2011;39:D591–D595. doi: 10.1093/nar/gkq1154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tay D.M.M., Govindarajan K.R., Khan A.M., Ong T.Y.R., Samad H.M., Soh W.W., Tong M., Zhang F., Tan T.W. BMC Bioinformatics. 2010;11(Suppl 7):S4. doi: 10.1186/1471-2105-11-S7-S4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Wang Y., Huang H., Sun M., Zhang Q., Guo D. BMC Bioinformatics. 2012;13:66. doi: 10.1186/1471-2105-13-66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Winnenburg R., Urban M., Beacham A., Baldwin T.K., Holland S., Lindeberg M., Hansen H., Rawlings C., Hammond-Kosack K.E., Köhler J. Nucleic Acids Res. 2008;36:D572–D576. doi: 10.1093/nar/gkm858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kumar R., Nanduri B. BMC Bioinformatics. 2010;11(Suppl 6):S16. doi: 10.1186/1471-2105-11-S6-S16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Xenarios I., Salwínski L., Duan X.J., Higney P., Kim S.-M., Eisenberg D. Nucleic Acids Res. 2002;30:303–305. doi: 10.1093/nar/30.1.303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Licata L., Briganti L., Peluso D., Perfetto L., Iannuccelli M., Galeota E., Sacco F., Palma A., Nardozza A.P., Santonico E., Castagnoli L., Cesareni G. Nucleic Acids Res. 2012;40:D857–D861. doi: 10.1093/nar/gkr930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Driscoll T., Dyer M.D., Murali T.M., Sobral B.W. Nucleic Acids Res. 2009;37:D647–D650. doi: 10.1093/nar/gkn799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.B. Turner, S. Razick, A.L. Turinsky, J. Vlasblom, E.K. Crowdy, E. Cho, K. Morrison, I.M. Donaldson, S.J. Wodak, Database (Oxford), 2010, baq023. [DOI] [PMC free article] [PubMed]
- 36.Chatr-aryamontri A., Ceol A., Peluso D., Nardozza A., Panni S., Sacco F., Tinti M., Smolyar A., Castagnoli L., Vidal M., Cusick M.E., Cesareni G. Nucleic Acids Res. 2009;37:D669–D673. doi: 10.1093/nar/gkn739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Lin F.-K., Pan C.-L., Yang J.-M., Chuang T.-J., Chen F.-C. BMC Microbiol. 2009;9:164. doi: 10.1186/1471-2180-9-164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Fahey M.E., Bennett M.J., Mahon C., Jäger S., Pache L., Kumar D., Shapiro A., Rao K., Chanda S.K., Craik C.S., Frankel A.D., Krogan N.J. BMC Bioinformatics. 2011;12:298. doi: 10.1186/1471-2105-12-298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Yin L., Xu G., Torii M., Niu Z., Maisog J.M., Wu C., Hu Z., Liu H. Artif. Intell. Med. 2010;49:155–160. doi: 10.1016/j.artmed.2010.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Thieu T., Joshi S., Warren S., Korkin D. Bioinformatics. 2012;28:867–875. doi: 10.1093/bioinformatics/bts042. [DOI] [PubMed] [Google Scholar]
- 41.Rossier O., Starkenburg S.R., Cianciotto N.P. Infect. Immun. 2004;72:310–321. doi: 10.1128/IAI.72.1.310-321.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Cianciotto N.P. Trends Microbiol. 2005;13:581–588. doi: 10.1016/j.tim.2005.09.005. [DOI] [PubMed] [Google Scholar]
- 43.Bronstein P.A., Marrichi M., Cartinhour S., Schneider D.J., DeLisa M.P. J. Bacteriol. 2005;187:8450–8461. doi: 10.1128/JB.187.24.8450-8461.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ellis J.G., Dodds P.N. Proc. Natl. Acad. Sci. USA. 2011;108:14381–14382. doi: 10.1073/pnas.1111668108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Stavrinides J., Ma W., Guttman D.S. PLoS Pathog. 2006;2:e104. doi: 10.1371/journal.ppat.0020104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Tobe T., Beatson S.A., Taniguchi H., Abe H., Bailey C.M., Fivian A., Younis R., Matthews S., Marches O., Frankel G., Hayashi T., Pallen M.J. Proc. Natl. Acad. Sci. USA. 2006;103:14941–14946. doi: 10.1073/pnas.0604891103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Pugsley A.P. Microbiol. Rev. 1993;57:50–108. doi: 10.1128/mr.57.1.50-108.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hales L.M., Shuman H.A. Infect. Immun. 1999;67:3662–3666. doi: 10.1128/iai.67.7.3662-3666.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Nielsen H., Krogh A. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1998;6:122–130. [PubMed] [Google Scholar]
- 50.Beeckman D.S.A., Vanrompay D.C.G. Curr. Issues Mol. Biol. 2009;12:17–42. [PubMed] [Google Scholar]
- 51.Cheng L.W., Anderson D.M., Schneewind O. Mol. Microbiol. 1997;24:757–765. doi: 10.1046/j.1365-2958.1997.3831750.x. [DOI] [PubMed] [Google Scholar]
- 52.Subtil A., Delevoye C., Balañá M.-E., Tastevin L., Perrinet S., Dautry-Varsat A. Mol. Microbiol. 2005;56:1636–1647. doi: 10.1111/j.1365-2958.2005.04647.x. [DOI] [PubMed] [Google Scholar]
- 53.Samudrala R., Heffron F., McDermott J.E. PLoS Pathog. 2009;5:e1000375. doi: 10.1371/journal.ppat.1000375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Burstein D., Zusman T., Degtyar E., Viner R., Segal G., Pupko T. PLoS Pathog. 2009;5:e1000508. doi: 10.1371/journal.ppat.1000508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Guo M., Chancey S.T., Tian F., Ge Z., Jamir Y., Alfano J.R. J. Bacteriol. 2005;187:4257–4269. doi: 10.1128/JB.187.12.4257-4269.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wang Y., Zhang Q., Sun M.-A., Guo D. Bioinformatics. 2011;27:777–784. doi: 10.1093/bioinformatics/btr021. [DOI] [PubMed] [Google Scholar]
- 57.Arnold R., Brandmaier S., Kleine F., Tischler P., Heinz E., Behrens S., Niinikoski A., Mewes H.-W., Horn M., Rattei T. PLoS Pathog. 2009;5:e1000376. doi: 10.1371/journal.ppat.1000376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Hiller N.L., Bhattacharjee S., van Ooij C., Liolios K., Harrison T., Lopez-Estraño C., Haldar K. Science. 2004;306:1934–1937. doi: 10.1126/science.1102737. [DOI] [PubMed] [Google Scholar]
- 59.Marti M., Good R.T., Rug M., Knuepfer E., Cowman A.F. Science. 2004;306:1930–1933. doi: 10.1126/science.1102452. [DOI] [PubMed] [Google Scholar]
- 60.Hiss J.A., Przyborski J.M., Schwarte F., Lingelbach K., Schneider G. PLoS One. 2008;3:e1560. doi: 10.1371/journal.pone.0001560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bhattacharjee S., Hiller N.L., Liolios K., Win J., Kanneganti T.-D., Young C., Kamoun S., Haldar K. PLoS Pathog. 2006;2:e50. doi: 10.1371/journal.ppat.0020050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Dou D., Kale S.D., Wang X., Jiang R.H.Y., Bruce N.A., Arredondo F.D., Zhang X., Tyler B.M. Plant Cell. 2008;20:1930–1947. doi: 10.1105/tpc.107.056093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Gal-Mor O., Finlay B.B. Cell. Microbiol. 2006;8:1707–1719. doi: 10.1111/j.1462-5822.2006.00794.x. [DOI] [PubMed] [Google Scholar]
- 64.de Felipe K.S., Pampou S., Jovanovic O.S., Pericone C.D., Ye S.F., Kalachikov S., Shuman H.A. J. Bacteriol. 2005;187:7716–7726. doi: 10.1128/JB.187.22.7716-7726.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Caturegli P., Asanovich K.M., Walls J.J., Bakken J.S., Madigan J.E., Popov V.L., Dumler J.S. Infect. Immun. 2000;68:5277–5283. doi: 10.1128/iai.68.9.5277-5283.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Al-Khodor S., Price C.T., Habyarimana F., Kalia A., Kwaik Y.A. Mol. Microbiol. 2008;70:908–923. doi: 10.1111/j.1365-2958.2008.06453.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Blanié S., Gelfi J., Bertagnoli S., Camus-Bouclainville C. Virol J. 2010;7:56. doi: 10.1186/1743-422X-7-56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Letunic I., Doerks T., Bork P. Nucleic Acids Res. 2009;37:D229–D232. doi: 10.1093/nar/gkn808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Quevillon E., Silventoinen V., Pillai S., Harte N., Mulder N., Apweiler R., Lopez R. Nucleic Acids Res. 2005;33:W116–W120. doi: 10.1093/nar/gki442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Aloy P., Ceulemans H., Stark A., Russell R.B. J. Mol. Biol. 2003;332:989–998. doi: 10.1016/j.jmb.2003.07.006. [DOI] [PubMed] [Google Scholar]
- 71.Yu H., Luscombe N.M., Lu H.X., Zhu X., Xia Y., Han J.-D.J., Bertin N., Chung S., Vidal M., Gerstein M. Genome Res. 2004;14:1107–1118. doi: 10.1101/gr.1774904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Wuchty S. PLoS One. 2011;6:e26960. doi: 10.1371/journal.pone.0026960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Lee S.-A., Chan C.-h., Tsai C.-H., Lai J.-M., Wang F.-S., Kao C.-Y., Huang C.-Y.F. BMC Bioinformatics. 2008;9(Suppl. 12):S11. doi: 10.1186/1471-2105-9-S12-S11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Davis F.P., Barkan D.T., Eswar N., McKerrow J.H., Sali A. Protein Sci. 2007;16:2585–2596. doi: 10.1110/ps.073228407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Krishnadev O., Srinivasan N. Int. J. Biol. Macromol. 2011;48:613–619. doi: 10.1016/j.ijbiomac.2011.01.030. [DOI] [PubMed] [Google Scholar]
- 76.Tyagi N., Krishnadev O., Srinivasan N. Mol. BioSyst. 2009;5:1630–1635. doi: 10.1039/b906543c. [DOI] [PubMed] [Google Scholar]
- 77.Franzosa E.A., Xia Y. Proc. Natl. Acad. Sci. USA. 2011;108:10538–10543. doi: 10.1073/pnas.1101440108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Kolesov G., Mewes H.W., Frishman D. J. Mol. Biol. 2001;311:639–656. doi: 10.1006/jmbi.2001.4701. [DOI] [PubMed] [Google Scholar]
- 79.von Mering C., Huynen M., Jaeggi D., Schmidt S., Bork P., Snel B. Nucleic Acids Res. 2003;31:258–261. doi: 10.1093/nar/gkg034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Bowers P.M., Pellegrini M., Thompson M.J., Fierro J., Yeates T.O., Eisenberg D. Genome Biol. 2004;5:R35. doi: 10.1186/gb-2004-5-5-r35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Huynen M., Snel B., Lathe W., Bork P. Genome Res. 2000;10:1204–1210. doi: 10.1101/gr.10.8.1204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Dyer M.D., Murali T.M., Sobral B.W. PLoS Pathog. 2008;4:e32. doi: 10.1371/journal.ppat.0040032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Dyer M.D., Murali T.M., Sobral B.W. Infect. Genet. Evol. 2011;11:917–923. doi: 10.1016/j.meegid.2011.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Gong S., Yoon G., Jang I., Bolser D., Dafas P., Schroeder M., Choi H., Cho Y., Han K., Lee S., Choi H., Lappe M., Holm L., Kim S., Oh D., Bhak J. Bioinformatics. 2005;21:2541–2543. doi: 10.1093/bioinformatics/bti366. [DOI] [PubMed] [Google Scholar]
- 85.Riley R., Lee C., Sabatti C., Eisenberg D. Genome Biol. 2005;6:R89. doi: 10.1186/gb-2005-6-10-r89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Zhao X.-M., Chen L., Aihara K. Proteins. 2010;78:1243–1253. doi: 10.1002/prot.22643. [DOI] [PubMed] [Google Scholar]
- 87.Kass I., Horovitz A. Proteins. 2002;48:611–617. doi: 10.1002/prot.10180. [DOI] [PubMed] [Google Scholar]
- 88.Pagel P., Wong P., Frishman D. J. Mol. Biol. 2004;344:1331–1346. doi: 10.1016/j.jmb.2004.10.019. [DOI] [PubMed] [Google Scholar]
- 89.Finn R.D., Marshall M., Bateman A. Bioinformatics. 2005;21:410–412. doi: 10.1093/bioinformatics/bti011. [DOI] [PubMed] [Google Scholar]
- 90.Stein A., Panjkovich A., Aloy P. Nucleic Acids Res. 2009;37:D300–D304. doi: 10.1093/nar/gkn690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Dyer M.D., Murali T.M., Sobral B.W. Bioinformatics. 2007;23:i159–i166. doi: 10.1093/bioinformatics/btm208. [DOI] [PubMed] [Google Scholar]
- 92.Kim J.-G., Park D., Kim B.-C., Cho S.-W., Kim Y.T., Park Y.-J., Cho H.J., Park H., Kim K.-B., Yoon K.-O., Park S.-J., Lee B.-M., Bhak J. BMC Bioinformatics. 2008;9:41. doi: 10.1186/1471-2105-9-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Dinkel H., Michael S., Weatheritt R.J., Davey N.E., Roey K.V., Altenberg B., Toedt G., Uyar B., Seiler M., Budd A., Jödicke L., Dammert M.A., Schroeter C., Hammer M., Schmidt T., Jehl P., McGuigan C., Dymecka M., Chica C., Luck K., Via A., Chatr-Aryamontri A., Haslam N., Grebnev G., Edwards R.J., Steinmetz M.O., Meiselbach H., Diella F., Gibson T.J. Nucleic Acids Res. 2012;40:D242–D251. doi: 10.1093/nar/gkr1064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Davey N.E., Travé G., Gibson T.J. Trends Biochem. Sci. 2011;36:159–169. doi: 10.1016/j.tibs.2010.10.002. [DOI] [PubMed] [Google Scholar]
- 95.Ravi Chandra B., Gowthaman R., Raj Akhouri R., Gupta D., Sharma A. Protein Eng Des Sel. 2004;17:175–182. doi: 10.1093/protein/gzh024. [DOI] [PubMed] [Google Scholar]
- 96.Aitio O., Hellman M., Kazlauskas A., Vingadassalom D.F., Leong J.M., Saksela K., Permi P. Proc. Natl. Acad. Sci. USA. 2010;107:21743–21748. doi: 10.1073/pnas.1010243107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Dampier W., Evans P., Ungar L., Tozeren A. BMC Med. Genomics. 2009;2:47. doi: 10.1186/1755-8794-2-47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Pawson T., Nash P. Science. 2003;300:445–452. doi: 10.1126/science.1083653. [DOI] [PubMed] [Google Scholar]
- 99.Tonikian R., Xin X., Toret C.P., Gfeller D., Landgraf C., Panni S., Paoluzi S., Castagnoli L., Currell B., Seshagiri S., Yu H., Winsor B., Vidal M., Gerstein M.B., Bader G.D., Volkmer R., Cesareni G., Drubin D.G., Kim P.M., Sidhu S.S., Boone C. PLoS Biol. 2009;7:e1000218. doi: 10.1371/journal.pbio.1000218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Wu C., Ma M.H., Brown K.R., Geisler M., Li L., Tzeng E., Jia C.Y.H., Jurisica I., Li S.S.-C. Proteomics. 2007;7:1775–1785. doi: 10.1002/pmic.200601006. [DOI] [PubMed] [Google Scholar]
- 101.Rodriguez M., Li S.S.-C., Harper J.W., Songyang Z. J. Biol. Chem. 2004;279:8802–8807. doi: 10.1074/jbc.M311886200. [DOI] [PubMed] [Google Scholar]
- 102.Espejo A., Côté J., Bednarek A., Richard S., Bedford M.T. Biochem J. 2002;367:697–702. doi: 10.1042/BJ20020860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Landgraf C., Panni S., Montecchi-Palazzi L., Castagnoli L., Schneider-Mergener J., Volkmer-Engert R., Cesareni G. PLoS Biol. 2004;2:E14. doi: 10.1371/journal.pbio.0020014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Obenauer J.C., Cantley L.C., Yaffe M.B. Nucleic Acids Res. 2003;31:3635–3641. doi: 10.1093/nar/gkg584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Lam H.Y.K., Kim P.M., Mok J., Tonikian R., Sidhu S.S., Turk B.E., Snyder M., Gerstein M.B. BMC Bioinformatics. 2010;11:243. doi: 10.1186/1471-2105-11-243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Mukhtar M.S., Carvunis A.-R., Dreze M., Epple P., Steinbrenner J., Moore J., Tasan M., Galli M., Hao T., Nishimura M.T., Pevzner S.J., Donovan S.E., Ghamsari L., Santhanam B., Romero V., Poulin M.M., Gebreab F., Gutierrez B.J., Tam S., Monachello D., Boxem M., Harbort C.J., McDonald N., Gai L., Chen H., He Y., E.U.E.C. Vaaaaandenhaute J., Roth F.P., Hill D.E., Ecker J.R., Vidal M., Beynon J., Braun P., Dangl J.L. Science. 2011;333:596–601. doi: 10.1126/science.1203659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Rattei T., Tischler P., Gotz S., Jehl M.A., Hoser J., Arnold R., Conesa A., Mewes H.W. Nucleic Acids Res. 2010;38:D223–D226. doi: 10.1093/nar/gkp949. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Dean P. FEMS Microbiol. Rev. 2011;35:1100–1125. doi: 10.1111/j.1574-6976.2011.00271.x. [DOI] [PubMed] [Google Scholar]
- 109.Srikanth C.V., Mercado-Lubo R., Hallstrom K., McCormick B.A. Cell. Mol. Life Sci. 2011;68:3687–3697. doi: 10.1007/s00018-011-0841-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Buchwald G., Friebel A., Galán J.E., Hardt W.-D., Wittinghofer A., Scheffzek K. EMBO J. 2002;21:3286–3295. doi: 10.1093/emboj/cdf329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.McGhie E.J., Hayward R.D., Koronakis V. Mol. Cell. 2004;13:497–510. doi: 10.1016/s1097-2765(04)00053-x. [DOI] [PubMed] [Google Scholar]
- 112.Zhou D., Mooseker M.S., Galán J.E. Science. 1999;283:2092–2095. doi: 10.1126/science.283.5410.2092. [DOI] [PubMed] [Google Scholar]
- 113.Levin I., Eakin C., Blanc M.-P., Klevit R.E., Miller S.I., Brzovic P.S. Proc. Natl. Acad. Sci. USA. 2010;107:2848–2853. doi: 10.1073/pnas.0914821107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Fu Y., Galán J.E. Nature. 1999;401:293–297. doi: 10.1038/45829. [DOI] [PubMed] [Google Scholar]
- 115.Ohlson M.B., Huang Z., Alto N.M., Blanc M.-P., Dixon J.E., Chai J., Miller S.I. Cell Host Microbe. 2008;4:434–446. doi: 10.1016/j.chom.2008.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Ohlson M.B., Fluhr K., Birmingham C.L., Brumell J.H., Miller S.I. Infect. Immun. 2005;73:6249–6259. doi: 10.1128/IAI.73.10.6249-6259.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Koonin E.V., Makarova K.S., Aravind L. Annu. Rev. Microbiol. 2001;55:709–742. doi: 10.1146/annurev.micro.55.1.709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Nagai H., Kagan J.C., Zhu X., Kahn R.A., Roy C.R. Science. 2002;295:679–682. doi: 10.1126/science.1067025. [DOI] [PubMed] [Google Scholar]
- 119.Stebbins C.E., Galán J.E. Nature. 2001;412:701–705. doi: 10.1038/35089000. [DOI] [PubMed] [Google Scholar]
- 120.Eugster M., Roten C.-A.H., Greub G. BMC Evol. Biol. 2007;7:231. doi: 10.1186/1471-2148-7-231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Ponting C.P., Aravind L., Schultz J., Bork P., Koonin E.V. J. Mol. Biol. 1999;289:729–745. doi: 10.1006/jmbi.1999.2827. [DOI] [PubMed] [Google Scholar]
- 122.Degtyar E., Zusman T., Ehrlich M., Segal G. Cell. Microbiol. 2009;11:1219–1235. doi: 10.1111/j.1462-5822.2009.01328.x. [DOI] [PubMed] [Google Scholar]
- 123.Mazodier P., Davies J. Annu. Rev. Genet. 1991;25:147–171. doi: 10.1146/annurev.ge.25.120191.001051. [DOI] [PubMed] [Google Scholar]
- 124.Thomas C.M., Nielsen K.M. Nat. Rev. Microbiol. 2005;3:711–721. doi: 10.1038/nrmicro1234. [DOI] [PubMed] [Google Scholar]
- 125.Gimenez G., Bertelli C., Moliner C., Robert C., Raoult D., Fournier P.-E., Greub G. BMC Genomics. 2011;12:542. doi: 10.1186/1471-2164-12-542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Rappoport N., Linial M. PLoS Comput. Biol. 2012;8:e1002364. doi: 10.1371/journal.pcbi.1002364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Dutartre H., Harris M., Olive D., Collette Y. Virology. 1998;247:200–211. doi: 10.1006/viro.1998.9244. [DOI] [PubMed] [Google Scholar]
- 128.Javier R.T., Rice A.P. J. Virol. 2011;85:11544–11556. doi: 10.1128/JVI.05410-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Ptak R.G., Fu W., Sanders-Beer B.E., Dickerson J.E., Pinney J.W., Robertson D.L., Rozanov M.N., Katz K.S., Maglott D.R., Pruitt K.D., Dieffenbach C.W. AIDS Res. Hum. Retroviruses. 2008;24:1497–1502. doi: 10.1089/aid.2008.0113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Fu W., Sanders-Beer B.E., Katz K.S., Maglott D.R., Pruitt K.D., Ptak R.G. Nucleic Acids Res. 2009;37:D417–D422. doi: 10.1093/nar/gkn708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.König R., Zhou Y., Elleder D., Diamond T.L., Bonamy G.M.C., Irelan J.T., Chiang C.-Y., Tu B.P., De Jesus P.D., Lilley C.E., Seidel S., Opaluch A.M., Caldwell J.S., Weitzman M.D., Kuhen K.L., Bandyopadhyay S., Ideker T., Orth A.P., Miraglia L.J., Bushman F.D., Young J.A., Chanda S.K. Cell. 2008;135:49–60. doi: 10.1016/j.cell.2008.07.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Rain J.-C., Cribier A., Gérard A., Emiliani S., Benarous R. Methods. 2009;47:291–297. doi: 10.1016/j.ymeth.2009.02.002. [DOI] [PubMed] [Google Scholar]
- 133.Allouch A., Cereseto A. Amino Acids. 2011;41:1137–1145. doi: 10.1007/s00726-009-0444-3. [DOI] [PubMed] [Google Scholar]
- 134.de Chassey B., Navratil V., Tafforeau L., Hiet M.S., Aublin-Gex A., Agaugué S., Meiffren G., Pradezynski F., Faria B.F., Chantier T., Le Breton M., Pellet J., Davoust N., Mangeot P.E., Chaboud A., Penin F., Jacob Y., Vidalain P.O., Vidal M., André P., Rabourdin-Combe C., Lotteau V. Mol. Syst. Biol. 2008;4:230. doi: 10.1038/msb.2008.66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Tripathi L.P., Kataoka C., Taguwa S., Moriishi K., Mori Y., Matsuura Y., Mizuguchi K. Mol. BioSyst. 2010;6:2539–2553. doi: 10.1039/c0mb00103a. [DOI] [PubMed] [Google Scholar]
- 136.Shapira S.D., Gat-Viks I., Shum B.O.V., Dricot A., de Grace M.M., Wu L., Gupta P.B., Hao T., Silver S.J., Root D.E., Hill D.E., Regev A., Hacohen N. Cell. 2009;139:1255–1267. doi: 10.1016/j.cell.2009.12.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Pfefferle S., Schöpf J., Kögl M., Friedel C.C., Müller M.A., Carbajo-Lozoya J., Stellberger T., von Dall’Armi E., Herzog P., Kallies S., Niemeyer D., Ditt V., Kuri T., Züst R., Pumpor K., Hilgenfeld R., Schwarz F., Zimmer R., Steffen I., Weber F., Thiel V., Herrler G., Thiel H.-J., Schwegmann-Wessels C., Pöhlmann S., Haas J., Drosten C., von Brunn A. PLoS Pathog. 2011;7:e1002331. doi: 10.1371/journal.ppat.1002331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Uetz P., Dong Y.-A., Zeretzke C., Atzler C., Baiker A., Berger B., Rajagopala S.V., Roupelieva M., Rose D., Fossum E., Haas J. Science. 2006;311:239–242. doi: 10.1126/science.1116804. [DOI] [PubMed] [Google Scholar]
- 139.Fossum E., Friedel C.C., Rajagopala S.V., Titz B., Baiker A., Schmidt T., Kraus T., Stellberger T., Rutenberg C., Suthram S., Bandyopadhyay S., Rose D., von Brunn A., Uhlmann M., Zeretzke C., Dong Y.-A., Boulet H., Koegl M., Bailer S.M., Koszinowski U., Ideker T., Uetz P., Zimmer R., Haas J. PLoS Pathog. 2009;5:e1000570. doi: 10.1371/journal.ppat.1000570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 140.Khadka, S., Vangeloff, A.D., Zhang, C., Siddavatam, P., Heaton, N.S., Wang, L., Sengupta, R., Sahasrabudhe, S., Randall, G., Gribskov, M., Kuhn, R.J., Perera, R., and LaCount, D.J. Mol Cell Proteomics, 10, (2011) M111.012187. [DOI] [PMC free article] [PubMed]
- 141.Calderwood M.A., Venkatesan K., Xing L., Chase M.R., Vazquez A., Holthaus A.M., Ewence A.E., Li N., Hirozane-Kishikawa T., Hill D.E., Vidal M., Kieff E., Johannsen E. Proc. Natl. Acad. Sci. USA. 2007;104:7606–7611. doi: 10.1073/pnas.0702332104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Dyer M.D., Neff C., Dufford M., Rivera C.G., Shattuck D., Bassaganya-Riera J., Murali T.M., Sobral B.W. PLoS One. 2010;5:e12089. doi: 10.1371/journal.pone.0012089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Navratil V., de Chassey B., Meyniel L., Pradezynski F., André P., Rabourdin-Combe C., Lotteau V. J. Proteome Res. 2010;9:3527–3536. doi: 10.1021/pr100326j. [DOI] [PubMed] [Google Scholar]
- 144.van Dijk D., Ertaylan G., Boucher C.A., Sloot P.M. BMC Syst. Biol. 2010;4:96. doi: 10.1186/1752-0509-4-96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.MacPherson J.I., Dickerson J.E., Pinney J.W., Robertson D.L. PLoS Comput. Biol. 2010;6:e1000863. doi: 10.1371/journal.pcbi.1000863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Sarmady M., Dampier W., Tozeren A. PLoS One. 2011;6:e23293. doi: 10.1371/journal.pone.0023293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 147.Wuchty S. BMC Genomics. 2006;7:122. doi: 10.1186/1471-2164-7-122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 148.Sarmady M., Dampier W., Tozeren A. PLoS One. 2011;6:e20735. doi: 10.1371/journal.pone.0020735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 149.Vassilev L.T. Cell Cycle. 2004;3:419–421. [PubMed] [Google Scholar]
- 150.Li L., Thomas R.M., Suzuki H., De Brabander J.K., Wang X., Harran P.G. Science. 2004;305:1471–1474. doi: 10.1126/science.1098231. [DOI] [PubMed] [Google Scholar]
- 151.Zhang Y., Appleton B.A., Wiesmann C., Lau T., Costa M., Hannoush R.N., Sidhu S.S. Nat. Chem. Biol. 2009;5:217–219. doi: 10.1038/nchembio.152. [DOI] [PubMed] [Google Scholar]
- 152.Thorlund K., Awad T., Boivin G., Thabane L. BMC Infect. Dis. 2011;11:134. doi: 10.1186/1471-2334-11-134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 153.Tan S.-L., Ganji G., Paeper B., Proll S., Katze M.G. Nat. Biotechnol. 2007;25:1383–1389. doi: 10.1038/nbt1207-1383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 154.Pawson T., Linding R. FEBS Lett. 2008;582:1266–1270. doi: 10.1016/j.febslet.2008.02.011. [DOI] [PubMed] [Google Scholar]
