Abstract
Motivation
In order to function, proteins frequently bind to one another and form 3D assemblies. Knowledge of the atomic details of these assemblies furthers our understanding of how proteins work together and how mutations can lead to disease, and facilitates the design of drugs that prevent or mimic the interaction.
Results
Atomic modeling of protein–protein interactions requires the selection of near-native structures from a set of docked poses based on their calculable properties. By considering this as an information retrieval problem, we have adapted methods developed for Internet search ranking and electoral voting into IRaPPA, a pipeline integrating biophysical properties. The approach enhances the identification of near-native structures when applied to four docking methods, resulting in a near-native appearing in the top 10 solutions for up to 50% of complexes benchmarked, and up to 70% in the top 100.
Availability and Implementation
IRaPPA has been implemented in the SwarmDock server (http://bmm.crick.ac.uk/∼SwarmDock/), pyDock server (http://life.bsc.es/pid/pydockrescoring/) and ZDOCK server (http://zdock.umassmed.edu/), with code available on request.
Supplementary information
Supplementary data are available at Bioinformatics online.
Introduction
Interactions between proteins are central to biology, from intracellular and intercellular signal transduction to the mesoscopic assemblies which regulate cell morphology. Although a full understanding of these interactions can only be gained through consideration of their atomic details, the rate at which structures are determined lags far behind the rate at which interactions are discovered, with only 7% of the known human interactome structurally characterized (Mosca et al., 2013). Protein–protein docking can close this gap and potentially increase coverage to 50% (Mosca et al., 2013). A major impediment, however, is the ability to pick out the correct solution from a set of decoys generated by a docking program.
The task of ranking docked poses is achieved by a function whose arguments are a set of predicted structures and whose output is an ordered list of the same structures. For some complexes this can be achieved successfully using functions based on physical models for which docked poses were not used in their formulation (Camacho et al., 2000; Gabb et al., 1997; Moal and Fernández-Recio, 2013; Moont et al., 1999; Moal et al., 2015b; Wodak et al., 1987; Zacharias, 2003). However, with the growing availability of structural data, functions that are trained using docking decoys tend to have better success rates. These ad-hoc functions range from weighted linear combinations of metrics such as electrostatic and desolvation energy (Cheng et al., 2007; Pierce and Weng, 2007), to pair potentials calculated directly from structure (Chuang et al., 2008; Tobi, 2010). The form of the scoring function, the objective function used to parameterize it, and the method used to solve the objective function vary: Tobi (2010) uses linear programming to derive a potential function satisfying the constraints that non-native decoys should be higher in energy than the native structure of the same complex, while Cheng et al. (2007) and Pierce and Weng (2007) employ the simplex method to optimize term weights so as to minimize the sum of the ranks of the top-ranked near-native decoys, with Cheng et al. (2007) optimizing the logarithm of the ranks to shift the focus away from poorly ranked complexes and towards cases for which near-native structures could be highly ranked. The common factor in these approaches is the use of domain knowledge to tailor the method to the docking problem: the implicit acknowledgement that the scores of decoys only matter relative to those of other decoys of the same complex, so that the score of a near-native for one complex need not be better than that of a non-native decoy of a different complex; the recognition that a low-ranking near-native is not detrimental to the performance of the method as long as a different near-native solution for the same complex is high-ranking; and the fact that an improvement of, for example, ten positions is far more significant when going from rank 11 to rank 1 than from rank 411 to rank 401.
The sorts of idiosyncrasies outlined above are not addressed by 'black box' machine learning approaches. A standard binary classifier for distinguishing near-natives from non-natives may struggle with class imbalance, given the large number of non-natives relative to near-natives. Further, a hard threshold between the two classes is coarse, ignoring the fact that while non-native structures may be categorically incorrect, near-natives have a gradation of accuracy. A standard regression model against quality measures such as RMSD or DockQ score (Basu and Wallner, 2016) would be unaware that only the ordering of top-scoring decoys matters, and would unduly attempt to order structures far from the binding site. Both regression and classification would also require lumping together decoys from different complexes. However, a procedure with many parallels to the ranking of docked poses is undertaken by Internet search engines. When searching for a query, the engine identifies many documents which match the query and characterizes them using metrics such as the number of times the query appears in the document. Similarly, docking a complex also undertakes a search, in this case a conformational search, and the returned conformations are characterized by metrics such as statistical potentials. Search engines use these metrics to return a ranked list of documents, of which only the first page of hits, the topmost ranked items, is usually considered by the user, just as only the top-ranked poses are generally considered by the users of docking software. Web-search algorithms are trained from a set of queries and their corresponding document lists, partially ordered by document relevance. Consequently, the same algorithms can be used to train ranking functions from sets of complexes and their corresponding decoy sets, partially ordered by the evaluation criteria developed by the docking community over the early rounds of the CAPRI experiment (Lensink et al., 2007). These algorithms have received much attention due to their ubiquity and importance, with the field stimulated by competitions such as the Internet Mathematics contest in 2009, the Yahoo! Learning to Rank Challenge in 2010 and the ICDM Expedia Challenge in 2013. However, the protein–protein docking community has not yet exploited this equivalence and embraced the tools and conceptual frameworks used in the design of information retrieval systems (Moal et al., 2013).
Here, we present IRaPPA (Integrative Ranking of Protein–Protein Assemblies), a method to select and combine physicochemical descriptors for ranking docked poses (Fig. 1 and Methods). Decoys are characterized with a large selection of metrics (Moal et al., 2015a), including biophysical models, statistical potentials and composite energy functions. These are combined using ranking support vector machines (R-SVMs), an efficient method that is popular for information retrieval (Joachims, 2002). As the ranking of decoys does not need to be performed instantaneously, as would be required for an Internet search engine, we take advantage of the accuracy and stability gained by aggregating an ensemble of learners (Breiman, 1996). To combine the individual rankings of the R-SVM ensemble, we generate a consensus ranking using the Schulze voting method, which is popular due to its ease of implementation and desirable rank aggregation properties (Schulze, 2011). Our approach was applied independently to decoy structures from four state-of-the-art docking programs: SwarmDock (Moal and Bates, 2010), pyDock (Cheng et al., 2007), ZDOCK (Chen et al., 2003) and SDOCK (Zhang and Lai, 2011).
2 Materials and methods
2.1 Biophysical functions
Physicochemical descriptors were calculated in an identical manner to the CCharPPI server (Moal et al., 2015a). Descriptors suffixed with _MIN or _CB were omitted due to computational cost, and the pyDock and SIPPER terms were omitted due to difficulties with the computing cluster used. The NSC and NIPacking descriptors were also discarded due to a large number of failures, leaving an overall failure rate of 0.16%; the remaining missing descriptor values were imputed as the per-descriptor mean within each docking method. We also included cluster sizes as descriptors, calculated with the g_cluster tool in GROMACS (Pronk et al., 2013), using single-linkage clustering of ligand Cα positions after superposition on the receptor, with cut-offs at 0.5 Å intervals in the 3–7 Å range. This gave 91 features in total, the values of which were normalized as z-scores. See Supplementary Table S2 for a complete feature list and Moal et al. (2013) for a more detailed description of the features.
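For illustration, the imputation and z-score normalization step can be sketched as follows, assuming the descriptors for one docking method are gathered into a decoys × descriptors matrix with NaN marking failed calculations (the function name and array layout are ours, not part of the published pipeline):

```python
import numpy as np

def normalize_descriptors(X):
    """Impute failed descriptor calculations with the per-descriptor mean,
    then rescale each descriptor to a z-score.

    X : (n_decoys, n_descriptors) array for one docking method, with
        np.nan marking the small fraction of failed calculations.
    Returns the normalized matrix together with the means and standard
    deviations needed to transform new decoys consistently (Section 2.6).
    """
    mean = np.nanmean(X, axis=0)          # per-descriptor mean, ignoring failures
    X = np.where(np.isnan(X), mean, X)    # mean imputation
    std = X.std(axis=0)
    std[std == 0] = 1.0                   # guard against constant descriptors
    return (X - mean) / std, mean, std
```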
2.2 Decoy sets
The interactions used in this study were taken from docking benchmark 5.0 (Vreven et al., 2015). SwarmDock decoys were obtained from the SwarmDock server (Torchala et al., 2013), generating around 480 poses per complex. ZDOCK 3.0.1 was run using dense angular sampling (Chen et al., 2003), pyDock was run using a grid size of 0.7 Å, a surface thickness of 1.3 Å and the rotational step described in Cheng et al. (2007), and SDOCK poses were generated using default settings (Zhang and Lai, 2011). To reduce the computational cost of training, which scales linearly with the number of complexes but quadratically with the number of decoys within a complex, we only consider the top 500 structures from the latter three programs, and thus discard the 41, 51 and 55 complexes for SDOCK, pyDock and ZDOCK, respectively, for which a near-native can be found but is ranked below 500; of these, 8, 15 and 14 are from the benchmark 5.0 update. Poses were classified as incorrect, acceptable, medium quality or high quality, using the CAPRI criteria outlined in Lensink et al. (2007), as sketched below. For the 55 new complexes in the benchmark 5.0 update (Vreven et al., 2015), the number of complexes for which a near-native could be found was 41, 33, 25 and 32 for SwarmDock, pyDock, ZDOCK and SDOCK, respectively; for the 176 benchmark 4 complexes (Hwang et al., 2010), the corresponding numbers are 126, 103, 114 and 109. Affinity, flexibility and complex-type data were obtained from the benchmark and from Kastritis et al. (2011).
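As a guide to how poses are classified, the sketch below applies the commonly quoted CAPRI thresholds on the fraction of native contacts, interface RMSD and ligand RMSD; Lensink et al. (2007) remains the authoritative statement of the criteria:

```python
def capri_class(fnat, irmsd, lrmsd):
    """Assign a CAPRI quality class from the fraction of native contacts
    (fnat), interface RMSD (irmsd, in Angstroms) and ligand RMSD (lrmsd,
    in Angstroms), using the commonly quoted thresholds."""
    if fnat >= 0.5 and (lrmsd <= 1.0 or irmsd <= 1.0):
        return 'high'
    if fnat >= 0.3 and (lrmsd <= 5.0 or irmsd <= 2.0):
        return 'medium'
    if fnat >= 0.1 and (lrmsd <= 10.0 or irmsd <= 4.0):
        return 'acceptable'
    return 'incorrect'
```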
2.3 Overview of training, validation and metaparameter optimization
Once the decoys were classified according to the CAPRI criteria and characterized using the descriptors, the process for training and validating the models consisted of several steps. These steps were performed independently for the four docking methods, using the decoys generated by that docking method. Additionally, two different validation procedures (denoted A and B) were followed, such that the steps below were repeated eight times.
1. The complexes were split into multiple sets:
   - A: The BM4 complexes were randomly partitioned into a training set and a model selection set in a 2:1 ratio, with the new complexes in the BM5 set forming the test set. This was repeated 200 times.
   - B: All complexes were randomly partitioned into training, model selection and test sets in a 2:1:1 ratio. This was repeated 800 times, so that each complex appears in the test set of approximately 200 splits.
2. The training sets were used to train ranking support vector machines (R-SVMs, see Section 2.4).
3. Each R-SVM was scored by applying it to its corresponding model selection set (see Section 2.5).
4. Model selection and application to the test set:
   - A: All but the top scoring n R-SVM models were discarded, and the remainder applied to the BM5 test set to produce n rankings for each complex.
   - B: For each BM4 complex, only the R-SVM models having that complex in their test set (and thus not using it for training or model selection) were considered; all but the top scoring n of these were discarded and the remainder applied to that complex, again producing n rankings per complex.
5. The rankings for each complex were combined using the Schulze method (Section 2.6) to produce a consensus ranking, from which the retrieval rates and success rates were calculated.
During the training of the method there are two adjustable metaparameters: c (see Section 2.4) and n, the number of R-SVM models kept in step 4 above. As c is multiplicative, it was sampled logarithmically at 50 values in the $10^{-4}$ to $10^{3}$ range inclusive, with steps 2 to 5 performed concurrently for each sample. The value of n was also sampled 50 times (from n = 1 to n = 50) and tested concurrently in steps 4 and 5, giving results for the entire 50 × 50 metaparameter grid. This systematic evaluation of parameter space reveals that the method is insensitive to small changes and robust across a wide range of c and n values; for instance, the SwarmDock benchmark 4 top 10 retrieval rate varies only between 67% and 74% (σ = 1.0%) within the $c \in [1, 10^{3}]$, $n \in [5, 40]$ region of the grid. The success and retrieval rates in Figures 2 and 3 and Supplementary Table S1 correspond to c and n metaparameter values found by leave-one-out cross-validation: for each complex in the test set, the sum of the scores (Section 2.5) of the remaining complexes was calculated at each point in the metaparameter grid, and the parameters corresponding to the highest score were applied to that complex (Supplementary Tables S3 and S4). The versions implemented in the SwarmDock, ZDOCK and pyDock servers were re-trained on the entire benchmark 5 data using the modal cross-validated n and c metaparameter pair obtained with benchmark 4.0, reported in Supplementary Table S3. For the SwarmDock and pyDock servers, the decoys are the same as reported here; for the ZDOCK server, the decoys were re-calculated using the most recent version of ZDOCK (3.0.2), for which tests using the BM4 data for training and the new BM5 complexes for testing yielded similar results in terms of top 1/10/100 success rates and the identities of the complexes with high-ranking near-native decoys.
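The leave-one-out selection over the metaparameter grid can be illustrated as follows, assuming the per-complex scores (Section 2.5) have already been computed at every grid point; the array layout and function name are illustrative:

```python
import numpy as np

def loo_metaparameters(scores):
    """Leave-one-out metaparameter selection.

    scores : (n_complexes, 50, 50) array, where scores[i, a, b] is the
    score of complex i for the a-th sampled value of c and the b-th
    sampled value of n. For each held-out complex, the (c, n) grid point
    maximizing the summed score of the remaining complexes is chosen.
    """
    total = scores.sum(axis=0)                 # grid-wise sum over all complexes
    chosen = []
    for i in range(scores.shape[0]):
        grid = total - scores[i]               # exclude the held-out complex
        a, b = np.unravel_index(grid.argmax(), grid.shape)
        chosen.append((a, b))                  # indices into the c and n samples
    return chosen
```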
2.4 The R-SVM method
The R-SVM method adopted here constructs a model which is trained to minimize the fraction of swapped pairs relative to a perfect ranking (Joachims, 2002). This is achieved by finding a weight vector in descriptor space, $\mathbf{w}$, such that when the descriptor values ($\mathbf{x}_{ik}$) for the decoys ($i$) of an interaction ($k$) are projected upon it to yield positions along that vector ($\mathbf{w} \cdot \mathbf{x}_{ik}$), the decoys can be ranked according to their positions. A perfect ranking would have the projected positions of all the decoys in the high quality CAPRI category greater than those in the medium category, which in turn are greater than those in the acceptable category, which are greater than those in the incorrect category. Formally, for all pairs of decoys ($i$, $j$) not in the same CAPRI category, where decoy $i$ is closer to the native structure than $j$, the inequality $\mathbf{w} \cdot \mathbf{x}_{ik} > \mathbf{w} \cdot \mathbf{x}_{jk}$ would be satisfied. While in theory many weight vectors may achieve a perfect ranking, in which case the $\mathbf{w}$ that maximizes the margin by which the CAPRI categories are separated can be chosen by minimizing $\|\mathbf{w}\|^{2}$ (Vapnik, 1998), in reality the different CAPRI categories are not perfectly separable for all complexes. To account for this, non-negative slack variables ($\xi_{ijk}$) can be introduced into each inequality, the sum of which can be minimized to bring the ranking as close as possible to a perfect ranking:
$$\mathbf{w} \cdot \mathbf{x}_{ik} \;\geq\; \mathbf{w} \cdot \mathbf{x}_{jk} + 1 - \xi_{ijk}, \qquad \xi_{ijk} \geq 0 \qquad (1)$$
Thus, the problem of getting as close as possible to a perfect ranking becomes one of simultaneously minimizing the squared norm of $\mathbf{w}$, in order to maximize the soft margin, and minimizing the slack variables $\xi_{ijk}$:

$$\min_{\mathbf{w},\,\boldsymbol{\xi}} \;\; \frac{1}{2}\|\mathbf{w}\|^{2} + c \sum_{i,j,k} \xi_{ijk}, \qquad \text{subject to the constraints of Equation 1} \qquad (2)$$
The c parameter controls the trade-off between the two terms being minimized in Equation 2. A large c puts the focus onto getting closest to perfect rankings on the training examples, while a small c puts more emphasis on maximizing the separation between CAPRI categories, even at the expense of misranking some decoys. The optimal choice of c, and the range across which it produces models that generalize well on unseen data, depends upon the training data. For instance proteins can often form long-lived associations via alternative binding sites or binding modes that are not evident in the crystal structure, and thus a correct binding mode may be misclassified as incorrect in the training data. By balancing c, it is possible for the R-SVM to find a weight vector that permits this misclassification rather than force it to be ranked highly, to the detriment of the confidence in which the majority of correctly classified poses are ranked.
The first constraint in Equation 1 can be rearranged to $\mathbf{w} \cdot (\mathbf{x}_{ik} - \mathbf{x}_{jk}) \geq 1 - \xi_{ijk}$, which shows that the above ranking procedure is equivalent to an SVM classifier trained on the pairs of decoys for which an inequality is specified, with features corresponding to the difference between the individual features of the two decoys. Thus, the weight vector from such a classifier can be used to rank the decoys in the model selection and test sets. The optimization of Equation 2 was performed using the n-slack algorithm with shrinking heuristics described by Joachims (2005), through the SVMrank program (Joachims, 2006). The average components of the weight vectors of the top scoring models are shown in Supplementary Table S5.
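To make the pairwise-difference equivalence concrete, the sketch below trains a linear soft-margin classifier on difference vectors. This is the naive reduction that SVMrank's n-slack algorithm solves far more efficiently; scikit-learn is used here purely for illustration, and all names are ours:

```python
import numpy as np
from sklearn.svm import LinearSVC

def rank_svm_weights(X, y, complex_ids, c=1.0):
    """Train a ranking SVM by the pairwise-difference reduction.

    X : (n_decoys, n_features) z-scored descriptors
    y : integer CAPRI category per decoy (0=incorrect .. 3=high quality)
    complex_ids : complex label per decoy; pairs are only formed within
        the same complex, mirroring the per-complex constraints of Eq. 1.
    """
    diffs, labels = [], []
    for k in np.unique(complex_ids):
        idx = np.where(complex_ids == k)[0]
        for i in idx:
            for j in idx:
                if y[i] > y[j]:                # decoy i should outrank decoy j
                    diffs.append(X[i] - X[j]); labels.append(+1)
                    diffs.append(X[j] - X[i]); labels.append(-1)
    clf = LinearSVC(C=c, loss='hinge', fit_intercept=False)  # hinge loss, as in Eq. 2
    clf.fit(np.array(diffs), np.array(labels))
    return clf.coef_.ravel()                   # w; decoys are ranked by w . x
```

Materializing every pair in this way is quadratic in the number of decoys per complex, which is the scaling noted in Section 2.2.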
2.5 Scoring R-SVM models
Each R-SVM model is evaluated by its total score $S$, the sum over each of the $n_t$ complexes in the model selection set of the individual score, $s_i$, relative to the mean score for that complex across the R-SVM ensemble, $\bar{s}_i$:

$$S = \sum_{i=1}^{n_t} \left( s_i - \bar{s}_i \right) \qquad (3)$$
By taking the score relative to the mean, the total score reduces biases in the model selection set: it preferentially favours R-SVM models which perform well on difficult complexes, those on which the other models struggle, and disfavours models which perform poorly on easy complexes, those on which the other models do well.
For calculating the individual score of a complex, the decoys are first clustered at 3.5 Å (see Section 2.1). The clusters, the number of which we denote $n_c$, are ranked by the score of their top scoring decoy. The overall rank for the complex, $r$, is the rank of the best ranked cluster whose top scoring decoy is of acceptable quality or better. The score is then calculated as:

$$s_i = 1 - \frac{\log r}{\log n_c} \qquad (4)$$
This score ranges from 0, when only the last-ranked cluster ($r = n_c$) is headed by a decoy that is not incorrect, to 1, when the top scoring decoy of the top ranked cluster ($r = 1$) is not incorrect. If no acceptable or better solutions appear as top ranked decoys within any cluster, $s_i$ is set to zero. The logarithmic form gives greater importance to higher ranks, such that the increase in $s_i$ in going from rank 11 to rank 1 is far greater than in going from, say, rank 411 to rank 401.
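Equations 3 and 4 transcribe directly into code; in the sketch below the input conventions are hypothetical (each complex is summarized by the CAPRI class of the top-scoring decoy in each ranked cluster):

```python
import math

def complex_score(cluster_classes):
    """s_i of Equation 4. cluster_classes holds the CAPRI class of each
    cluster's top-scoring decoy, ordered by cluster rank (best first)."""
    n_c = len(cluster_classes)
    for r, quality in enumerate(cluster_classes, start=1):
        if quality != 'incorrect':        # acceptable quality or better
            return 1.0 if r == 1 else 1.0 - math.log(r) / math.log(n_c)
    return 0.0                            # no cluster is headed by a near-native

def total_score(scores, ensemble_means):
    """S of Equation 3: per-complex scores relative to the ensemble mean."""
    return sum(s - m for s, m in zip(scores, ensemble_means))
```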
2.6 Applying the method with Schulze ranking
To apply the model to the test sets or to new docking cases, the physicochemical features are calculated for each decoy and transformed to z-scores using the previously determined mean and standard deviation values. The decoys are ranked using each of the $n$ selected models in the R-SVM ensemble, by their order when projected onto the R-SVM weight vector in descriptor space (Section 2.4). These rankings are then combined using the Schulze electoral voting system (Schulze, 2011). Firstly, a complete digraph is constructed in which each node corresponds to a decoy, and each edge weight indicates the number of times the tail node decoy is ranked higher than the head node decoy. This graph is then used to find the strongest paths between all ordered pairs of decoys, $(a, b)$, where the strength of a path is the minimum edge weight along a directed path originating at $a$ and terminating at $b$, and the strongest path is the one, out of all possible paths, with the highest strength. The strongest paths are identified using a dynamic programming approach (Schulze, 2011). Using this information, decoy $a$ is ranked higher than decoy $b$ if the strength of the strongest path of $(a, b)$ is greater than that of $(b, a)$. As these preferences are transitive, a consensus ranking follows directly from the pairwise rankings.
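A compact sketch of this aggregation, assuming each of the n models supplies a full ranking of the decoys (lower rank is better); the Floyd–Warshall-style strongest-path loop is the dynamic programming step referred to above, and the function name is ours:

```python
import numpy as np

def schulze_consensus(rankings):
    """Consensus ordering of decoys by the Schulze method (Schulze, 2011).

    rankings : (n_models, n_decoys) array; rankings[m, a] is the position
        model m assigns to decoy a (lower is better).
    """
    n = rankings.shape[1]
    # d[a, b] = number of models ranking decoy a above decoy b
    d = np.zeros((n, n), dtype=int)
    for a in range(n):
        for b in range(n):
            if a != b:
                d[a, b] = np.sum(rankings[:, a] < rankings[:, b])
    # strongest-path strengths, initialized with winning pairwise counts
    p = np.where(d > d.T, d, 0)
    for k in range(n):                    # widest-path dynamic programming
        for a in range(n):
            if a != k:
                p[a] = np.maximum(p[a], np.minimum(p[a, k], p[k]))
    # decoy a beats b if p[a, b] > p[b, a]; order decoys by number of wins
    wins = (p > p.T).sum(axis=1)
    return np.argsort(-wins)              # decoy indices, best first
```

Counting pairwise wins is a simple way to serialize the transitive Schulze relation into a total order; ties would need an explicit tie-breaking rule in a production implementation.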
3 Results and discussion
3.1 Validation and benchmarking
To validate the method, we trained the models using complexes in the protein–protein docking benchmark 4 (Hwang et al., 2010) and evaluated the ability to retrieve near-native solutions using the new complexes added in benchmark 5 (Vreven et al., 2015) as an external validation set (Fig. 3a–d). Of the complexes for which a near-native solution could be found, a near-native structure was identified as the top ranked solution in 12–22% of the interactions prior to re-ranking, which increased to 16–44% using our approach. Similarly, retrieval in the top 10 increased from 33–51% to 50–67%, and retrieval in the top 100 improved from 70–90% to 91–100%, indicating that sampling becomes the limiting factor in obtaining a top 100 near-native solution within our scoring scheme. Considering this improvement in the context of the whole docking pipeline, which includes complexes for which no near-natives are sampled (Fig. 2a and Supplementary Table S1), this corresponds to top 1 success rates of up to 22% for all 55 complexes in the benchmark 5 update, top 10 success rates of up to 45%, and top 100 success rates of up to 69%.
We also applied the method to the original complexes in docking benchmark 4 using multiple leave-many-out cross-validation (Fig. 3e–h). A quarter of the complexes were left out at random from the training set for each of the R-SVM models, and for each complex the Schulze re-ranking only combined the models for which that complex was omitted from the training. We see improvements from 10–20% to 24–42%, from 31–50% to 51–68%, and from 69–90% to 89–100% for the top 1/10/100 retrieval rates, respectively. For SwarmDock, this corresponds to top 1/10/100 success rates of 31%, 52% and 68%, respectively, when considering all 176 complexes in benchmark 4 (Fig. 2b and Supplementary Table S1). On both benchmarks, a large part of the improvement can be attributed to the R-SVMs which, when combined using Schulze ranking, typically perform as well as or better than the average individual R-SVM model. For all four docking protocols, the method yields a significantly better ranking of near-native solutions (Wilcoxon signed-rank test), and large improvements compared to other methods reported in the literature (Fig. 2a,b and Supplementary Table S1).
The combined results for the two benchmarks, and across the different CAPRI quality categories, are shown in Figure 2c. A strong improvement in success rates is seen for the acceptable and medium categories, although there are too few data to draw conclusions regarding the high quality category. Further, the improvements apply across all categories of flexibility, binding affinity and biological role (Fig. 2d–f). As well as increasing the rank of near-native decoys, the method also improves the quality of the decoys at the top of the ranking, even when a high-ranking near-native could be found in the initial rankings: considering the complexes for which a near-native could be found in the top 100 before and after re-ranking, the top-ranked near-native was of a higher CAPRI class for 20 complexes in the re-ranked list using the SwarmDock structures, compared with 13 that were higher in the initial rankings. For pyDock, ZDOCK and SDOCK these figures are 14 and 3, 22 and 7, and 22 and 6, respectively. This also holds when considering only the top 10 decoys, with values of 8 and 5, 6 and 2, 10 and 4, and 7 and 3, respectively.
3.2 Application to LCP2/FLNA interaction
To further validate the method, we investigated the interaction between the sterile alpha motif (SAM) domain of lymphocyte cytosolic protein 2 (LCP2, PDB 2EAP) and filamin A (FLNA, PDB 4M9P). This interaction was initially predicted on the basis of co-expression, co-location, shared binding partners and co-occurrence of post-translational modifications (McDowall et al., 2009), and was recently confirmed by cross-linking mass spectrometry (Liu et al., 2015). Using SwarmDock without distance restraints, we generated a single structure consistent with the length and steric requirements of the observed K81-K498 disuccinimidyl sulfoxide cross-link at the periphery of the binding site (Kao et al., 2011), with a K81-K498 Cα–Cα distance of 6.9 Å (Fig. 2g–h). This structure is ranked 3rd using the presented method, compared to a rank of 7 using the standard protocol of the SwarmDock server. FLNA deactivation is associated with periventricular heterotopia, an X-linked dominant disorder in females that is hemizygous lethal in males, and which can also arise sporadically from partially deactivating missense mutations in FLNA. These mutations appear either in the first calponin homology domain, which binds actin, or in the functionally uncharacterized 3rd (V528M) (Kakita et al., 2002) or 4th (L656F) (Sheen et al., 2001) Ig-like domains. Despite V528 and L656 being 18 Å apart, both residues participate in the predicted interaction: L656 packs directly against LCP2 T90 and I88, while V528 supports its adjacent residues R527 and L529, the former forming a salt bridge with LCP2 E78 and the latter forming a cation-π interaction with F15 and hydrogen bonds with N82 and V13. This suggests that both mutations may interfere with communication between the cytoskeleton and the phosphotyrosine signalling network via this interaction. Thus, our approach has allowed us to predict a low-energy structure of a previously uncharacterized complex, validated by chemical cross-linking and by its ability to reconcile the common pathology of spatially separated FLNA mutants, at the same time establishing a link between the LCP2/FLNA interaction and periventricular heterotopia.
3.3 Web server implementations
We have implemented the method in the following web servers:
SwarmDock: As an option available at http://bmm.crick.ac.uk/∼SwarmDock/ (Torchala et al., 2013)
pyDockWeb: As a separate server to re-rank results generated with the pyDockWeb server, available at http://life.bsc.es/pid/pydockrescoring/ (Jiménez-García et al., 2013)
ZDOCK: As an option available at http://zdock.umassmed.edu/
Computation times are considerably longer than for a standard docking run due to the cost of the descriptor calculations; on the SwarmDock server, a run between two proteins of around 250 residues typically takes 11 h from submission. On all three servers we aim to return results within a day, depending upon server load.
4 Conclusion
In conclusion, integrating biophysical functions using methods originally developed for information retrieval and electoral voting provides a powerful means of enhancing the atomic modelling of protein complexes, in a way that is tailored to the technique used to generate the models, resulting both in a better ranking of near-native decoys and in an improvement in the quality of high-ranking near-native decoys.
Acknowledgements
The authors thank Sarah Teichmann, Pedro Beltrao, Alexandre Bonvin, Roberto Mosca, Patrick Aloy, Nick Goldman and Rudi Agius for helpful comments.
Funding
This work was supported by the European Molecular Biology Laboratory [I.H.M.]; the European Commission [Marie Curie Actions PIEF-GA-2012-327899 to I.H.M.]; the Biotechnology and Biological Sciences Research Council [Future Leader Fellowship BB/N011600/1 to I.H.M.]; Consejo Nacional de Ciencia y Tecnología [217686 to D.B.]; The Francis Crick Institute, which receives its core funding from Cancer Research UK (FC001003), the UK Medical Research Council (FC001003) and the Wellcome Trust (FC001003) [M.T., P.A.B.]; Ministerio de Economía y Competitividad [FPI fellowship to B.J.G., I+D+i Research Project BIO2013-48213-R to J.F.R.]; and National Institutes of Health [R01 GM116960 to Z.W.].
Conflict of Interest: none declared.
References
- Basu S., Wallner B. (2016) DockQ: a quality measure for protein–protein docking models. PLoS ONE, 11, e0161879.
- Breiman L. (1996) Bagging predictors. Mach. Learn., 24, 123–140.
- Camacho C.J. et al. (2000) Scoring docked conformations generated by rigid-body protein–protein docking. Proteins, 40, 525–537.
- Chen R. et al. (2003) ZDOCK: an initial-stage protein-docking algorithm. Proteins, 52, 80–87.
- Cheng T.M. et al. (2007) pyDock: electrostatics and desolvation for effective scoring of rigid-body protein–protein docking. Proteins, 68, 503–515.
- Chowdhury R. et al. (2013) Protein–protein docking with F(2)Dock 2.0 and GB-rerank. PLoS ONE, 8, e51307.
- Chuang G.Y. et al. (2008) DARS (Decoys As the Reference State) potentials for protein–protein docking. Biophys. J., 95, 4217–4227.
- Gabb H.A. et al. (1997) Modelling protein docking using shape complementarity, electrostatics and biochemical information. J. Mol. Biol., 272, 106–120.
- Huang S.Y. (2015) Exploring the potential of global protein–protein docking: an overview and critical assessment of current programs for automatic ab initio docking. Drug Discov. Today, 20, 969–977.
- Hwang H. et al. (2010) Protein–protein docking benchmark version 4.0. Proteins, 78, 3111–3114.
- Jiménez-García B. et al. (2013) pyDockWEB: a web server for rigid-body protein–protein docking using electrostatics and desolvation scoring. Bioinformatics, 29, 1698–1699.
- Joachims T. (2002) Optimizing search engines using clickthrough data. In: KDD '02: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 133–142. ACM Press, New York.
- Joachims T. (2005) A support vector method for multivariate performance measures. In: ICML '05: Proceedings of the 22nd International Conference on Machine Learning, pp. 377–384. ACM Press, New York.
- Joachims T. (2006) Training linear SVMs in linear time. In: KDD '06: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 217–226. ACM Press, New York.
- Kakita A. et al. (2002) Bilateral periventricular nodular heterotopia due to filamin 1 gene mutation: widespread glomeruloid microvascular anomaly and dysplastic cytoarchitecture in the cerebral cortex. Acta Neuropathol., 104, 649–657.
- Kao A. et al. (2011) Development of a novel cross-linking strategy for fast and accurate identification of cross-linked peptides of protein complexes. Mol. Cell. Proteomics, 10, M110.002212.
- Kastritis P.L. et al. (2011) A structure-based benchmark for protein–protein binding affinity. Protein Sci., 20, 482–491.
- Lensink M.F. et al. (2007) Docking and scoring protein complexes: CAPRI 3rd edition. Proteins, 69, 704–718.
- Liu F. et al. (2015) Proteome-wide profiling of protein assemblies by cross-linking mass spectrometry. Nat. Methods, 12, 1179–1184.
- McDowall M.D. et al. (2009) PIPs: human protein–protein interaction prediction database. Nucleic Acids Res., 37, D651–D656.
- Moal I.H., Bates P.A. (2010) SwarmDock and the use of normal modes in protein–protein docking. Int. J. Mol. Sci., 11, 3623–3648.
- Moal I.H., Fernández-Recio J. (2013) Intermolecular contact potentials for protein–protein interactions extracted from binding free energy changes upon mutation. J. Chem. Theory Comput., 9, 3715–3727.
- Moal I.H. et al. (2013) The scoring of poses in protein–protein docking: current capabilities and future directions. BMC Bioinformatics, 14, 286.
- Moal I.H. et al. (2015a) CCharPPI web server: computational characterization of protein–protein interactions from structure. Bioinformatics, 31, 123–125.
- Moal I.H. et al. (2015b) Inferring the microscopic surface energy of protein–protein interfaces from mutation data. Proteins, 83, 640–650.
- Moont G. et al. (1999) Use of pair potentials across protein interfaces in screening predicted docked complexes. Proteins, 35, 364–373.
- Mosca R. et al. (2013) Interactome3D: adding structural details to protein networks. Nat. Methods, 10, 47–53.
- Ohue M. et al. (2014) MEGADOCK: an all-to-all protein–protein interaction prediction system using tertiary structure data. Protein Pept. Lett., 21, 766–778.
- Pierce B., Weng Z. (2007) ZRANK: reranking protein docking predictions with an optimized energy function. Proteins, 67, 1078–1086.
- Pronk S. et al. (2013) GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics, 29, 845–854.
- Schneidman-Duhovny D. et al. (2012) A method for integrative structure determination of protein–protein complexes. Bioinformatics, 28, 3282–3289.
- Schulze M. (2011) A new monotonic, clone-independent, reversal symmetric, and Condorcet-consistent single-winner election method. Soc. Choice Welf., 36, 267–303.
- Sheen V.L. et al. (2001) Mutations in the X-linked filamin 1 gene cause periventricular nodular heterotopia in males as well as in females. Hum. Mol. Genet., 10, 1775–1783.
- Tobi D. (2010) Designing coarse grained- and atom based-potentials for protein–protein docking. BMC Struct. Biol., 10, 40.
- Torchala M. et al. (2013) SwarmDock: a server for flexible protein–protein docking. Bioinformatics, 29, 807–809.
- Vapnik V.N. (1998) Statistical Learning Theory. Adaptive and Learning Systems for Signal Processing, Communications, and Control. Wiley, New York.
- Vreven T. et al. (2015) Updates to the integrated protein–protein interaction benchmarks: docking benchmark version 5 and affinity benchmark version 2. J. Mol. Biol., 427, 3031–3041.
- Wodak S.J. et al. (1987) Computer studies of interactions between macromolecules. Prog. Biophys. Mol. Biol., 49, 29–63.
- Zacharias M. (2003) Protein–protein docking with a reduced protein model accounting for side-chain flexibility. Protein Sci., 12, 1271–1282.
- Zhang C., Lai L. (2011) SDOCK: a global protein–protein docking program using stepwise force-field potentials. J. Comput. Chem., 32, 2598–2612.