DaReUS-Loop: a web server to model multiple loops in homology models

Yasaman Karami; Julien Rey; Guillaume Postic; Samuel Murail; Pierre Tufféry; Sjoerd J de Vries

doi:10.1093/nar/gkz403

. 2019 May 22;47(W1):W423–W428. doi: 10.1093/nar/gkz403

DaReUS-Loop: a web server to model multiple loops in homology models

Yasaman Karami ^1,², Julien Rey ^1,², Guillaume Postic ^1,^2,³, Samuel Murail ¹, Pierre Tufféry ^1,^2,^✉, Sjoerd J de Vries ^1,^2,^✉

PMCID: PMC6602439 PMID: 31114872

Abstract

Loop regions in protein structures often have crucial roles, and they are much more variable in sequence and structure than other regions. In homology modeling, this leads to larger deviations from the homologous templates, and loop modeling of homology models remains an open problem. To address this issue, we have previously developed the DaReUS-Loop protocol, leading to significant improvement over existing methods. Here, a DaReUS-Loop web server is presented, providing an automated platform for modeling or remodeling loops in the context of homology models. This is the first web server accepting a protein with up to 20 loop regions, and modeling them all in parallel. It also provides a prediction confidence level that corresponds to the expected accuracy of the loops. DaReUS-Loop facilitates the analysis of the results through its interactive graphical interface and is freely available at http://bioserv.rpbs.univ-paris-diderot.fr/services/DaReUS-Loop/.

INTRODUCTION

Prediction of protein structures is one of the most challenging problems in biology (1). This is reflected by the large number of protein sequences known today (∼109 million) in UniProt versus the number of known protein structures (about 139 thousand) in Protein Data Bank, PDB (2). This means homology modeling is a crucial technique to obtain structural insight (3), and homology modeling methods keep improving significantly (4,5). Loops are regions with often crucial roles in protein-protein interactions, protein function, drug design and docking of small molecules (6–8). Successful loop modeling can lead towards accurate design and engineering of proteins, large peptides, antibodies, drugs or synthetic vaccines, to name a few (9). Despite the development of dedicated loop modeling methods, the overall accuracy of homology models tends to be considerably lower in loop regions, and loop modeling of homology models remains an open problem (10–13).

Loop modeling approaches can be divided into ab initio (14–19), data-based (20–24) and the combination of both methods (25–27). Ab initio methods explore the conformational space to find loop conformations computationally, while data-based approaches mine a database using the geometry of flanks (a few residues before and after the loop of interest), to search for possible candidates. Many of these methods achieve successful loop predictions in exact environments (i.e. missing loops in crystal structures) (16–18,21,23,24,27). However, few methods have been applied to the prediction of loops in perturbed situations (i.e. homology models) (16,21,23). The difficulty of those cases is reflected by the much lower accuracy of the resulting loop models. Moreover, few methods are available as web servers: principally GalaxyLoopPS2 (16), LoopIng (23), Sphinx (27). In addition to these methods, there are servers for ModLoop (28), ArchPRED (29), FALC-Loop (30) and RCD+ (18), which have only been tested on crystal structures, and there is SuperLooper2 (20), which is an interactive web application rather than an automated web server. Moreover, there are MODELLER (31), Loopy (32), OSCAR-loop (33), Rosetta-NGK (15), LEAP (17) and M-DiSGro (34), available only as tools that have to be installed locally. Finally, there are several web servers that are specific for the prediction of loops in antibodies (27,35–38).

Previously we have proposed DaReUS-Loop, a data-based approach using remote or unrelated structures for loop modeling (39). The method has been validated on benchmarks of loops extracted from CASP11 and CASP12 targets and shown to improve the accuracy of loop modeling, with respect to the state-of-the-art approaches (considering both ab initio and data-based methods). In addition, significant improvements have been obtained to predict long loops with at least 15 and at most 30 residues. Importantly, DaReUS-Loop tackles the practical application of loop modeling in non-ideal conditions (homology models) (39).

Here, we describe the DaReUS-Loop web server, an automated platform for modeling or remodeling loops in the context of homology models. The web server uses the same protocol as in the original publication (39), except that in the final minimization, MODELLER (31) (which is not free software) has been replaced by GROMACS (40). For the convenience of the user, the web server accepts a protein with up to 20 loop regions defined, and models them all in parallel. The server assigns a confidence value to every modelled loop, that correlates well with the accuracy of predictions.

MATERIALS AND METHODS

Description of the web server

The details of the DaReUS-Loop protocol are explained in (39). The only difference is that final minimisation with MODELLER (31), which is not free software, has been replaced by GROMACS (40) (see Supplementary data for the details). In the current study, MODELLER was still used to generate initial homology models that were subsequently re-modelled using the DaReUS-Loop server.

The minimum loop length is 2, and the maximum loop length is 30 residues. In addition, for the convenience of the user, the web server accepts a protein with up to 20 loop regions defined, and models them in parallel. This has implications on the valid combinations of loop candidates, which is why the server can run in three different modes: remodeling mode, modeling mode, and advanced modeling mode. These options are briefly explained below (see Supplementary data for a more detailed definition).

In all cases, the DaReUS-Loop web server takes as input: (i) atomic coordinates of a protein in PDB format and (ii) a protein sequence in FASTA format.

Remodeling: The server accepts an initial homology model and remodels the loops as indicated by the user in a gapped sequence. In this mode, each loop is being modelled separately, while the other loops are kept in their initial configuration (from the input structure file).
Modeling: The server takes a gapped PDB and completes the missing loops using the full protein sequence provided by the user, in parallel. In addition, the server builds a consensus model, choosing the top candidate of each loop. Then final models for every loop are built using this consensus structure.
Advanced modeling: In this mode, the inputs are similar to modeling mode (a gapped PDB and full protein sequence). Each loop is being modelled independently, while the other loops are omitted as gaps. This mode slightly improves the loop accuracy at the cost of introducing gaps in the final models.

Note that all three modes produce the same results if the input protein has only one loop to be modelled.

It is possible to define a PDB code that will be excluded from the search, for the purpose of benchmarking. In this case, close homologs (those with at least 70% sequence identity) are ignored within the search dataset.

For every loop region, the server returns a maximum of 10 candidate models and a confidence score. In addition, to facilitate the quality assessment of loop candidates, the server returns a table reporting the final GROMACS energy values (40) and another energy measure that is the KORP score (41) for every loop candidate. In case of multiple loops, a general clash report is generated, showing possible clashes between candidates of different loops. The report is useful in the advanced and remodeling modes, since there is a possibility that candidates of one loop have clashes with candidates of other loops. Therefore, the clash report guides the user how to choose different combinations of candidates for each loop to avoid possible clashes. A brief description of the protocol is shown in Figure 1.

Figure 1. — The work flow of DaReUS-Loop web server.

The server provides a visualisation facility using the NGL Viewer (42). User can select the modelled loops one at a time and all the final loop candidates will be shown on the structure using different colours. This options facilitates the visual inspection of final models.

All DaReUS-Loop web server results presented in the manuscript are for remodeling mode and all predictions were evaluated using the flank RMSD, as defined in (39).

RESULTS

Performance and comparison with other approaches

The DaReUS-Loop web server has been validated on the same test sets as the original DaReUS-Loop protocol, namely the targets of the CASP11 (http://predictioncenter.org/casp11/) and CASP12 (http://predictioncenter.org/casp12/) experiments (43,44). The server results were compared with those of the original DaReUS-Loop protocol and with GalaxyLoop-PS2 (16), Rosetta Next-generation KIC (NGK) (15), RCD+ (18), LoopIng (23) and Sphinx (27).Consequently, the comparisons are grouped by the type of the method: ab initio (GalaxyLoop-PS2 and Rosetta) and data-based (LoopIng and Sphinx). However, most of these methods have some limitations, for instance GalaxyLoopPS2 can model loops of maximum 20 amino acids that belong to proteins with <300 residues. In this context, the same subsets as in the original DaReUS-Loop paper were used. Here, set_ai is the subset of loops where all ab initio methods (GalaxyLoopPS2, RCD+, and Rosetta NGK) gave a result. Likewise, set_db is the subset of loops where both data-based methods (LoopIng and Sphinx) gave a result. This subset was evaluated using 2-residue flanks, since LoopIng does not return more. Each subset was limited to those loops that were classified as high-confidence by DaReUS-Loop. Finally, the loops of the original homology model, as generated by MODELLER (31), were evaluated as reference.

Overall statistics on the best of top 10 models are shown in Table 1. The average performance of the DaReUS-Loop web server is within 0.1 Å of the published DaReUS-Loop protocol. Average performance is better than NGK, GalaxyLoopPS2, RCD+ and MODELLER by at least 0.59, 0.34, 0.80 and 0.94 Å, for the CASP11 and CASP12 test sets, respectively. The remodeling protocol outperforms LoopIng for all sets, with a gain of at least 1.28 Å and outperforms Sphinx by at least 0.89 Å.

Table 1.

Comparison with state-of-the-art methods

		CASP11	CASP12	<1 Å (%)	<2 Å (%)
set_ai	DaReUS-Loop server	2.00	2.35	20	53
	DaReUS-Loop	1.91	2.30	23	58
	NGK	2.59	2.99	15	41
	GalaxyLoopPS2	2.34	2.88	16	45
	RCD+	2.71	3.11	8	41
	MODELLER	2.94	3.52	12	40
	size	40	46
set_db	DaReUS-Loop server	2.01	2.25	19	60
	DaReUS-Loop	2.05	2.25	19	58
	LoopIng	3.66	3.53	12	23
	Sphinx	2.90	3.19	15	43
	size	51	55

Open in a new tab

Average flanked RMSD (Å) are reported for the CASP11 and CASP12 test sets. Comparison is between the DaReUS-Loop web server and the published version, as well as various ab initio methods (Rosetta NGK, GalaxyLoop-PS2, RCD+ and MODELLER) and data-based methods (LoopIng and Sphinx). Since Sphinx is a hybrid method (combination of ab initio and knowledge-based methods), we reported its results along with LoopIng. Results are reported on the common high confidence sub-set of loops that could be predicted by all the methods of the same class (setai and setdb, respectively). All the values reported in this table correspond to the best flanked RMSD (Å) over top 10 models. The percentage of highly accurate predictions (<1 and <2 Å) is also reported. Bold values correspond to the best values among all the methods.

DaReUS-Loop generates high-accuracy loop models (<1 Å) for 20% and medium-accuracy models (<2 Å) for 53% of the cases in the set_ai subset (Table 1). The results for high accuracy constitute an improvement by 5, 4, 8 and 8% over Rosetta NGK, GalaxyLoop-PS2, RCD+ and MODELLER, respectively. For the set_db subset, the improvements are of 13% and 10%, respectively, over LoopIng and Sphinx.

A detailed comparison of the methods with respect to different loop sizes is reported in Supplementary Tables S1 and S2, for ab initio and data-based methods, respectively.

The simultaneous modeling and remodeling of multiple loops

The CASP11 and CASP12 benchmarks contain multiple loops per homology model in most of the cases (see Supplementary Figure S1). In the original DaReUS-Loop publication, each loop was re-modelled independently and one at a time. For the server, three different modes were tested: loop remodeling, loop modeling and advanced loop modeling. Detailed results are in Supplementary Table S3. Briefly, it was found that remodeling usually gives the best results, but that advanced modeling is better in some cases. Note that the three modes give the same result if only a single loop is being modelled. The performance is also reported for modeling loops that are connecting different secondary structures. For that, all the loops in the benchmark were divided into three main groups, according to the secondary structures of their flanks: (i) α−α, (ii) α−β and (iii) β−β. The results are shown in Supplementary Table S4), suggesting the performance is the best for modelling loops connecting two different α-helices, and is better for the loops joining one α-helix to a β-strand compared to the loops connecting two different β-strands.

Among the existing tools for loop modeling, Rosetta NGK is the only one that can deal with arbitrary multiple loops. While M-DiSGro is a tool for modeling multiple loops, they must be interacting, i.e. within spatial proximity. Consequently, we are only able to compare the modeling and advanced modeling modes with Rosetta NGK, and with the initial loops from MODELLER (Table 2). In advanced modeling scenario, the average flanked RMSDs for the two test sets are 2.10 and 2.18 Å, respectively. The results are better than Rosetta NGK with average flanked RMSD of 2.61 and 2.63 Å and drastically better than MODELLER (2.97 and 3.15 Å). Also the percentage of high resolution predictions is higher in both modeling modes (16%) compared to Rosetta NGK (14%) and MODELLER (9%).

Table 2.

Prediction results over the best of top10 models

	CASP11	CASP12	<1 Å (%)	<2 Å (%)
DaReUS-Loop server	2.10	2.18	17	58
NGK	2.61	2.63	14	40
MODELLER	2.97	3.15	9	43
size	48	50

Open in a new tab

Average flanked RMSD ( Å) are reported for the CASP11 and CASP12 test sets. Comparison is between DaReUS-Loop web server and other methods (Rosetta NGK and MODELLER). All the values reported in this table correspond to the best flanked RMSD ( Å) over top 10 models. The percentage of highly accurate predictions (<1 and <2 Å) is also reported. Bold values correspond to the best values among all the methods.

To the best of our knowledge, among all existing loop modeling web servers, only the GalaxyLoop server accepts multiple loops at the same time. Still, the maximum number of loops is limited to three, and in the original publication (39), GalaxyLoop was only validated on single-loop test cases. This makes DaReUS-Loop the first web server to be validated on the simultaneous modeling of multiple loop regions in homology models.

Results on a concrete example

One of the CASP11 targets, T0807 is selected as an illustrative example to demonstrate the performance of our web server. From the homology model, nine loop regions are identified: 16–30, 45–58, 69–75, 82–87, 113–119, 154–162, 168–178, 234–245 and 258–262. First, the server was run in remodeling mode, providing the initial homology model and a gapped sequence to re-model the loops. In addition, the gapped model along with full sequence was given to the web server selecting first “modeling mode” and then ‘advanced modeling mode’. Final predictions for loop 2 (45–59) in modeling mode and the confidence report for all the loops are shown in Figure 2A). Next we measured the backbone RMSD between the best loop candidate (out of top 10) and the native loop conformation (PDB id: 4WGH), after superimposing the flanks. The results for all the modes and the native conformation are reported on Figure 2B) for every loop.

Figure 2. — Illustrative example of DaReUS-Loop performance. Nine loops from the homology model of T0807 in CASP11 test set are modelled separately, in each of the three modes (remodeling, modeling and advanced modeling). (A) Example of result page provided by the web server for modeling loop number 2 (residues 45–58), using the modeling scenario. Top: final top 10 candidates predicted by the server are visualised on the structure using different colors. Bottom: the confidence values and levels for every loop are reported. (B) For every loop, the best predictions of each model are shown (remodeling: blue, modeling: pink and advanced modeling: purple) and the native loop conformations are depicted in green. The loops are spanning the following residues, respectively: 16–30, 45–58, 69–75, 82–87, 113–119, 154–162, 168–178, 234–245 and 258–262. The RMSD of the top candidate is reported with respect to the native loop conformation, after fitting on the flanks.

Computational time

The performance of DaReUS-Loop is not dependent on the protein size, but it depends on the number of loops to be modeled, as well as their sizes. The computational time needed for modeling a single loop might be within the range of 20–30 min, whereas for a protein with 10 loops or more the runtime may vary between 40 and 120 min, depending on the loop sizes. On average, the running time is between 40 and 50 min, however it highly depends on the traffic load of the cluster.

CONCLUSION

The DaReUS-loop web server relies on a data-based approach for loop modeling. Compared to previous web servers, it comes with two main advancements that are (i) improved modeling of loops in homology models, and (ii) a demonstrated ability to model several loops simultaneously. Our results show that for >50% of the loops in the test benchmark, loops can be modeled with <2 Å RMSD from the native loop conformations, taking the lowest value among the 10 predicted loop candidates.

An interesting perspective for future research is the combination of DaReUS-Loop with template-based docking methods (45–47). Template based docking provides a prediction of an entire protein-protein complex as a low-resolution model that needs to be refined (48). This is complementary to loop (re)modeling, the high-resolution refinement of loops in a single protein. This must be balanced with other forms of refinement, in particular that of the relative orientations of the protein chain.

We stress that DaReUS-Loop is a consensus method that does not rank the 10 predicted candidates. In case of multiple loops, the Cartesian combination of all loop candidates must be considered. Therefore, one direction for future research is an investigation into effective scoring function able to identify the best combination of candidates among those arising from multiple loops. However, the prediction of multiple candidates could also be considered as an advantage. For example, a flexibility score could be conceived that reflects the predicted degree of conformational flexibility/diversity for every loop. We expect it could provide a starting point to assess the impact of loop conformation on the generation of models for protein-protein complexes. In future research, knowledge about disulfide bridges will be integrated into the protocol. Finally, DaReUS-Loop could be integrated with experimental data or with residue contact prediction, which could be used as effective filters of combinations of predicted loop candidates.

DATA AVAILABILITY

The set of all models for CASP11 and CASP12 generated and analysed during the current study, as well as the results obtained from the web server in all the three modes (remodeling, modeling and advance modeling) are provided as a dataset (Harvard Dataverse, https://doi.org/10.7910/DVN/ZI6ROT). The results from other methods were reported in the previous manuscript (39).

Supplementary Material

gkz403_Supplemental_Files

Click here for additional data file.^{(158.9KB, pdf)}

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

ANR-10-BINF-0003 (BipBip); ANR-14-2011-IFB; INSERM [UMR-S 973]; Ressource Parisienne en Bioinformatique Structurale (RPBS). Funding for open access charge: public funding.

Conflict of interest statement. None declared.

REFERENCES

1. Kennedy D., Norman C.. What don’t we know. Science. 2005; 309:75. [DOI] [PubMed] [Google Scholar]
2. Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., Shindyalov I.N., Bourne P.E.. The Protein Data Bank. Nucleic Acids Res. 2000; 28:235–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
3. Marti-Renom M.A., Stuart A.C., Fiser A., Sanchez R., Melo F., Sali A.. Comparative protein structure modeling of genes and genomes. Annu. Rev. Biophys. Biomol. Struct. 2000; 29:291–325. [DOI] [PubMed] [Google Scholar]
4. Roy A., Kucukural A., Zhang Y.. I-TASSER: a unified platform for automated protein structure and function prediction. Nat. Protoc. 2010; 5:725–738. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Remmert M., Biegert A., Hauser A., Soding J.. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods. 2011; 9:173–175. [DOI] [PubMed] [Google Scholar]
6. Wu S.J., Dean D.H.. Functional significance of loops in the receptor binding domain of Bacillus thuringiensis CryIIIA delta-endotoxin. J. Mol. Biol. 1996; 255:628–640. [DOI] [PubMed] [Google Scholar]
7. Jones S., Thornton J.M.. Prediction of protein-protein interaction sites using patch analysis. J. Mol. Biol. 1997; 272:133–143. [DOI] [PubMed] [Google Scholar]
8. Shi L., Javitch J.A.. The second extracellular loop of the dopamine D2 receptor lines the binding-site crevice. Proc. Natl. Acad. Sci. U.S.A. 2004; 101:440–445. [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Alvim-Gaston M., Grese T., Mahoui A., Palkowitz A.D., Pineiro-Nunez M., Watson I.. Open Innovation Drug Discovery (OIDD): a potential path to novel therapeutic chemical space. Curr. Top. Med. Chem. 2014; 14:294–303. [DOI] [PubMed] [Google Scholar]
10. Fiser A., Do R.K., Sali A.. Modeling of loops in protein structures. Protein Sci. 2000; 9:1753–1773. [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Goldfeld D.A., Zhu K., Beuming T., Friesner R.A.. Loop prediction for a GPCR homology model: algorithms and results. Proteins. 2013; 81:214–228. [DOI] [PubMed] [Google Scholar]
12. Lee G.R., Heo L., Seok C.. Effective protein model structure refinement by loop modeling and overall relaxation. Proteins. 2016; 84:293–301. [DOI] [PubMed] [Google Scholar]
13. Feig M. Computational protein structure refinement: almost there, yet still so far to go. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2017; 7:e1307. [DOI] [PMC free article] [PubMed] [Google Scholar]
14. Mandell D.J., Coutsias E.A., Kortemme T.. Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling. Nat. Methods. 2009; 6:551–552. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Stein A., Kortemme T.. Improvements to robotics-inspired conformational sampling in rosetta. PLoS ONE. 2013; 8:e63090. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Park H., Lee G.R., Heo L., Seok C.. Protein loop modeling using a new hybrid energy function and its application to modeling in inaccurate structural environments. PLoS ONE. 2014; 9:e113811. [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Liang S., Zhang C., Zhou Y.. LEAP: highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains. J. Comput. Chem. 2014; 35:335–341. [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Lopez-Blanco J.R., Canosa-Valls A.J., Li Y., Chacon P.. RCD+: Fast loop modeling server. Nucleic Acids Res. 2016; 44:395–400. [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Wong S.W.K., Liu J.S., Kou S.C.. Fast de novo discovery of low-energy protein loop conformations. Proteins. 2017; 85:1402–1412. [DOI] [PubMed] [Google Scholar]
20. Hildebrand P.W., Goede A., Bauer R.A., Gruening B., Ismer J., Michalsky E., Preissner R.. SuperLooper–a prediction server for the modeling of loops in globular and membrane proteins. Nucleic Acids Res. 2009; 37:W571–W574. [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Choi Y., Deane C.M.. FREAD revisited: Accurate loop structure prediction using a database search algorithm. Proteins. 2010; 78:1431–1440. [DOI] [PubMed] [Google Scholar]
22. Holtby D., Li S.C., Li M.. LoopWeaver: loop modeling by the weighted scaling of verified proteins. J. Comput. Biol. 2013; 20:212–223. [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Messih M.A., Lepore R., Tramontano A.. LoopIng: a template-based tool for predicting the structure of protein loops. Bioinformatics. 2015; 31:3767–3772. [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Ismer J., Rose A.S., Tiemann J.K., Goede A., Preissner R., Hildebrand P.W.. SL2: an interactive webtool for modeling of missing segments in proteins. Nucleic Acids Res. 2016; 44:W390–W394. [DOI] [PMC free article] [PubMed] [Google Scholar]
25. van Vlijmen H.W., Karplus M.. PDB-based protein loop prediction: parameters for selection and methods for optimization. J. Mol. Biol. 1997; 267:975–1001. [DOI] [PubMed] [Google Scholar]
26. Deane C.M., Blundell T.L.. CODA: a combined algorithm for predicting the structurally variable regions of protein models. Protein Sci. 2001; 10:599–612. [DOI] [PMC free article] [PubMed] [Google Scholar]
27. Marks C., Nowak J., Klostermann S., Georges G., Dunbar J., Shi J., Kelm S., Deane C.M.. Sphinx: merging knowledge-based and ab initio approaches to improve protein loop prediction. Bioinformatics. 2017; 33:1346–1353. [DOI] [PMC free article] [PubMed] [Google Scholar]
28. Fiser A., Sali A.. ModLoop: automated modeling of loops in protein structures. Bioinformatics. 2003; 19:2500–2501. [DOI] [PubMed] [Google Scholar]
29. Fernandez-Fuentes N., Zhai J., Fiser A.. ArchPRED: a template based loop structure prediction server. Nucleic Acids Res. 2006; 34:W173–W176. [DOI] [PMC free article] [PubMed] [Google Scholar]
30. Ko J., Lee D., Park H., Coutsias E.A., Lee J., Seok C.. The FALC-Loop web server for protein loop modeling. Nucleic Acids Res. 2011; 39:W210–W214. [DOI] [PMC free article] [PubMed] [Google Scholar]
31. Marti-Renom M.A., Stuart A.C., Fiser A., Sanchez R., Melo F., Sali A.. Comparative protein structure modeling of genes and genomes. Annu. Rev. Biophys. Biomol. Struct. 2000; 29:291–325. [DOI] [PubMed] [Google Scholar]
32. Xiang Z., Soto C.S., Honig B.. Evaluating conformational free energies: the colony energy and its application to the problem of loop prediction. Proc. Natl. Acad. Sci. U.S.A. 2002; 99:7432–7437. [DOI] [PMC free article] [PubMed] [Google Scholar]
33. Liang S., Zhang C., Sarmiento J., Standley D.M.. Protein loop modeling with optimized backbone potential functions. J. Chem. Theory Comput. 2012; 8:1820–1827. [DOI] [PubMed] [Google Scholar]
34. Tang K., Wong S.W., Liu J.S., Zhang J., Liang J.. Conformational sampling and structure prediction of multiple interacting loops in soluble and β-barrel membrane proteins using multi-loop distance-guided chain-growth Monte Carlo method. Bioinformatics. 2015; 31:2646–2652. [DOI] [PMC free article] [PubMed] [Google Scholar]
35. Sircar A., Kim E.T., Gray J.J.. RosettaAntibody: antibody variable region homology modeling server. Nucleic Acids Res. 2009; 37:W474–W479. [DOI] [PMC free article] [PubMed] [Google Scholar]
36. Yamashita K., Ikeda K., Amada K., Liang S., Tsuchiya Y., Nakamura H., Shirai H., Standley D.M.. Kotai Antibody Builder: automated high-resolution structural modeling of antibodies. Bioinformatics. 2014; 30:3279–3280. [DOI] [PubMed] [Google Scholar]
37. Lepore R., Olimpieri P.P., Messih M.A., Tramontano A.. PIGSPro: prediction of immunoGlobulin structures v2. Nucleic Acids Res. 2017; 45:W17–W23. [DOI] [PMC free article] [PubMed] [Google Scholar]
38. Ki Wong W., Georges G., Ros F., Kelm S., Lewis A.P., Taddese B., Leem J., Deane C.M.. SCALOP: sequence-based antibody canonical loop structure annotation. Bioinformatics. 2018; 7:e6179. [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Karami Y., Guyon F., De Vries S., Tuffery P.. DaReUS-Loop: accurate loop modeling using fragments from remote or unrelated proteins. Sci Rep. 2018; 8:13673. [DOI] [PMC free article] [PubMed] [Google Scholar]
40. Pall S., Abraham M.J., Kutzner C., Hess B., Lindahl E.. Tackling exascale software challenges in molecular dynamics simulations with GROMACS. International Conference on Exascale Applications and Software. 2014; Springer; 3–27. [Google Scholar]
41. Ramon Lopez-Blanco J., Chacon P.. KORP: knowledge-based 6D potential for fast protein and loop modeling. Bioinformatics. 2019; doi:10.1093/bioinformatics/btz026. [DOI] [PubMed] [Google Scholar]
42. Rose A.S., Bradley A.R., Valasatava Y., Duarte J.M., Prlic A., Rose P.W.. NGL viewer: web-based molecular graphics for large complexes. Bioinformatics. 2018; 34:3755–3758. [DOI] [PMC free article] [PubMed] [Google Scholar]
43. Moult J., Fidelis K., Kryshtafovych A., Schwede T., Tramontano A.. Critical assessment of methods of protein structure prediction: progress and new directions in round XI. Proteins. 2016; 84:4–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
44. Moult J., Fidelis K., Kryshtafovych A., Schwede T., Tramontano A.. Critical assessment of methods of protein structure prediction (CASP)-Round XII. Proteins. 2018; 86:7–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
45. Szilagyi A., Zhang Y.. Template-based structure modeling of protein-protein interactions. Curr. Opin. Struct. Biol. 2014; 24:10–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
46. Dapkunas J., Timinskas A., Olechnovic K., Margelevicius M., Diciunas R., Venclovas C.. The PPI3D web server for searching, analyzing and modeling protein-protein interactions in the context of 3D structures. Bioinformatics. 2017; 33:935–937. [DOI] [PubMed] [Google Scholar]
47. Ghoorah A.W., Devignes M.D., Smail-Tabbone M., Ritchie D.W.. Spatial clustering of protein binding sites for template based protein docking. Bioinformatics. 2011; 27:2820–2827. [DOI] [PubMed] [Google Scholar]
48. Vakser I.A. Low-resolution structural modeling of protein interactome. Curr. Opin. Struct. Biol. 2013; 23:198–205. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkz403_Supplemental_Files

Click here for additional data file.^{(158.9KB, pdf)}

Data Availability Statement

[B1] 1. Kennedy D., Norman C.. What don’t we know. Science. 2005; 309:75. [DOI] [PubMed] [Google Scholar]

[B2] 2. Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., Shindyalov I.N., Bourne P.E.. The Protein Data Bank. Nucleic Acids Res. 2000; 28:235–242. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] 3. Marti-Renom M.A., Stuart A.C., Fiser A., Sanchez R., Melo F., Sali A.. Comparative protein structure modeling of genes and genomes. Annu. Rev. Biophys. Biomol. Struct. 2000; 29:291–325. [DOI] [PubMed] [Google Scholar]

[B4] 4. Roy A., Kucukural A., Zhang Y.. I-TASSER: a unified platform for automated protein structure and function prediction. Nat. Protoc. 2010; 5:725–738. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5. Remmert M., Biegert A., Hauser A., Soding J.. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods. 2011; 9:173–175. [DOI] [PubMed] [Google Scholar]

[B6] 6. Wu S.J., Dean D.H.. Functional significance of loops in the receptor binding domain of Bacillus thuringiensis CryIIIA delta-endotoxin. J. Mol. Biol. 1996; 255:628–640. [DOI] [PubMed] [Google Scholar]

[B7] 7. Jones S., Thornton J.M.. Prediction of protein-protein interaction sites using patch analysis. J. Mol. Biol. 1997; 272:133–143. [DOI] [PubMed] [Google Scholar]

[B8] 8. Shi L., Javitch J.A.. The second extracellular loop of the dopamine D2 receptor lines the binding-site crevice. Proc. Natl. Acad. Sci. U.S.A. 2004; 101:440–445. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] 9. Alvim-Gaston M., Grese T., Mahoui A., Palkowitz A.D., Pineiro-Nunez M., Watson I.. Open Innovation Drug Discovery (OIDD): a potential path to novel therapeutic chemical space. Curr. Top. Med. Chem. 2014; 14:294–303. [DOI] [PubMed] [Google Scholar]

[B10] 10. Fiser A., Do R.K., Sali A.. Modeling of loops in protein structures. Protein Sci. 2000; 9:1753–1773. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11. Goldfeld D.A., Zhu K., Beuming T., Friesner R.A.. Loop prediction for a GPCR homology model: algorithms and results. Proteins. 2013; 81:214–228. [DOI] [PubMed] [Google Scholar]

[B12] 12. Lee G.R., Heo L., Seok C.. Effective protein model structure refinement by loop modeling and overall relaxation. Proteins. 2016; 84:293–301. [DOI] [PubMed] [Google Scholar]

[B13] 13. Feig M. Computational protein structure refinement: almost there, yet still so far to go. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2017; 7:e1307. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14] 14. Mandell D.J., Coutsias E.A., Kortemme T.. Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling. Nat. Methods. 2009; 6:551–552. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B15] 15. Stein A., Kortemme T.. Improvements to robotics-inspired conformational sampling in rosetta. PLoS ONE. 2013; 8:e63090. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] 16. Park H., Lee G.R., Heo L., Seok C.. Protein loop modeling using a new hybrid energy function and its application to modeling in inaccurate structural environments. PLoS ONE. 2014; 9:e113811. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17] 17. Liang S., Zhang C., Zhou Y.. LEAP: highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains. J. Comput. Chem. 2014; 35:335–341. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] 18. Lopez-Blanco J.R., Canosa-Valls A.J., Li Y., Chacon P.. RCD+: Fast loop modeling server. Nucleic Acids Res. 2016; 44:395–400. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B19] 19. Wong S.W.K., Liu J.S., Kou S.C.. Fast de novo discovery of low-energy protein loop conformations. Proteins. 2017; 85:1402–1412. [DOI] [PubMed] [Google Scholar]

[B20] 20. Hildebrand P.W., Goede A., Bauer R.A., Gruening B., Ismer J., Michalsky E., Preissner R.. SuperLooper–a prediction server for the modeling of loops in globular and membrane proteins. Nucleic Acids Res. 2009; 37:W571–W574. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B21] 21. Choi Y., Deane C.M.. FREAD revisited: Accurate loop structure prediction using a database search algorithm. Proteins. 2010; 78:1431–1440. [DOI] [PubMed] [Google Scholar]

[B22] 22. Holtby D., Li S.C., Li M.. LoopWeaver: loop modeling by the weighted scaling of verified proteins. J. Comput. Biol. 2013; 20:212–223. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B23] 23. Messih M.A., Lepore R., Tramontano A.. LoopIng: a template-based tool for predicting the structure of protein loops. Bioinformatics. 2015; 31:3767–3772. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B24] 24. Ismer J., Rose A.S., Tiemann J.K., Goede A., Preissner R., Hildebrand P.W.. SL2: an interactive webtool for modeling of missing segments in proteins. Nucleic Acids Res. 2016; 44:W390–W394. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B25] 25. van Vlijmen H.W., Karplus M.. PDB-based protein loop prediction: parameters for selection and methods for optimization. J. Mol. Biol. 1997; 267:975–1001. [DOI] [PubMed] [Google Scholar]

[B26] 26. Deane C.M., Blundell T.L.. CODA: a combined algorithm for predicting the structurally variable regions of protein models. Protein Sci. 2001; 10:599–612. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B27] 27. Marks C., Nowak J., Klostermann S., Georges G., Dunbar J., Shi J., Kelm S., Deane C.M.. Sphinx: merging knowledge-based and ab initio approaches to improve protein loop prediction. Bioinformatics. 2017; 33:1346–1353. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B28] 28. Fiser A., Sali A.. ModLoop: automated modeling of loops in protein structures. Bioinformatics. 2003; 19:2500–2501. [DOI] [PubMed] [Google Scholar]

[B29] 29. Fernandez-Fuentes N., Zhai J., Fiser A.. ArchPRED: a template based loop structure prediction server. Nucleic Acids Res. 2006; 34:W173–W176. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B30] 30. Ko J., Lee D., Park H., Coutsias E.A., Lee J., Seok C.. The FALC-Loop web server for protein loop modeling. Nucleic Acids Res. 2011; 39:W210–W214. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B31] 31. Marti-Renom M.A., Stuart A.C., Fiser A., Sanchez R., Melo F., Sali A.. Comparative protein structure modeling of genes and genomes. Annu. Rev. Biophys. Biomol. Struct. 2000; 29:291–325. [DOI] [PubMed] [Google Scholar]

[B32] 32. Xiang Z., Soto C.S., Honig B.. Evaluating conformational free energies: the colony energy and its application to the problem of loop prediction. Proc. Natl. Acad. Sci. U.S.A. 2002; 99:7432–7437. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B33] 33. Liang S., Zhang C., Sarmiento J., Standley D.M.. Protein loop modeling with optimized backbone potential functions. J. Chem. Theory Comput. 2012; 8:1820–1827. [DOI] [PubMed] [Google Scholar]

[B34] 34. Tang K., Wong S.W., Liu J.S., Zhang J., Liang J.. Conformational sampling and structure prediction of multiple interacting loops in soluble and β-barrel membrane proteins using multi-loop distance-guided chain-growth Monte Carlo method. Bioinformatics. 2015; 31:2646–2652. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B35] 35. Sircar A., Kim E.T., Gray J.J.. RosettaAntibody: antibody variable region homology modeling server. Nucleic Acids Res. 2009; 37:W474–W479. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B36] 36. Yamashita K., Ikeda K., Amada K., Liang S., Tsuchiya Y., Nakamura H., Shirai H., Standley D.M.. Kotai Antibody Builder: automated high-resolution structural modeling of antibodies. Bioinformatics. 2014; 30:3279–3280. [DOI] [PubMed] [Google Scholar]

[B37] 37. Lepore R., Olimpieri P.P., Messih M.A., Tramontano A.. PIGSPro: prediction of immunoGlobulin structures v2. Nucleic Acids Res. 2017; 45:W17–W23. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B38] 38. Ki Wong W., Georges G., Ros F., Kelm S., Lewis A.P., Taddese B., Leem J., Deane C.M.. SCALOP: sequence-based antibody canonical loop structure annotation. Bioinformatics. 2018; 7:e6179. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B39] 39. Karami Y., Guyon F., De Vries S., Tuffery P.. DaReUS-Loop: accurate loop modeling using fragments from remote or unrelated proteins. Sci Rep. 2018; 8:13673. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B40] 40. Pall S., Abraham M.J., Kutzner C., Hess B., Lindahl E.. Tackling exascale software challenges in molecular dynamics simulations with GROMACS. International Conference on Exascale Applications and Software. 2014; Springer; 3–27. [Google Scholar]

[B41] 41. Ramon Lopez-Blanco J., Chacon P.. KORP: knowledge-based 6D potential for fast protein and loop modeling. Bioinformatics. 2019; doi:10.1093/bioinformatics/btz026. [DOI] [PubMed] [Google Scholar]

[B42] 42. Rose A.S., Bradley A.R., Valasatava Y., Duarte J.M., Prlic A., Rose P.W.. NGL viewer: web-based molecular graphics for large complexes. Bioinformatics. 2018; 34:3755–3758. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B43] 43. Moult J., Fidelis K., Kryshtafovych A., Schwede T., Tramontano A.. Critical assessment of methods of protein structure prediction: progress and new directions in round XI. Proteins. 2016; 84:4–14. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B44] 44. Moult J., Fidelis K., Kryshtafovych A., Schwede T., Tramontano A.. Critical assessment of methods of protein structure prediction (CASP)-Round XII. Proteins. 2018; 86:7–15. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B45] 45. Szilagyi A., Zhang Y.. Template-based structure modeling of protein-protein interactions. Curr. Opin. Struct. Biol. 2014; 24:10–23. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B46] 46. Dapkunas J., Timinskas A., Olechnovic K., Margelevicius M., Diciunas R., Venclovas C.. The PPI3D web server for searching, analyzing and modeling protein-protein interactions in the context of 3D structures. Bioinformatics. 2017; 33:935–937. [DOI] [PubMed] [Google Scholar]

[B47] 47. Ghoorah A.W., Devignes M.D., Smail-Tabbone M., Ritchie D.W.. Spatial clustering of protein binding sites for template based protein docking. Bioinformatics. 2011; 27:2820–2827. [DOI] [PubMed] [Google Scholar]

[B48] 48. Vakser I.A. Low-resolution structural modeling of protein interactome. Curr. Opin. Struct. Biol. 2013; 23:198–205. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

DaReUS-Loop: a web server to model multiple loops in homology models

Yasaman Karami

Julien Rey

Guillaume Postic

Samuel Murail

Pierre Tufféry

Sjoerd J de Vries

Abstract

INTRODUCTION

MATERIALS AND METHODS

Description of the web server

Figure 1.

RESULTS

Performance and comparison with other approaches

Table 1.

The simultaneous modeling and remodeling of multiple loops

Table 2.

Results on a concrete example

Figure 2.

Computational time

CONCLUSION

DATA AVAILABILITY

Supplementary Material

SUPPLEMENTARY DATA

FUNDING

REFERENCES

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

DaReUS-Loop: a web server to model multiple loops in homology models

Yasaman Karami

Julien Rey

Guillaume Postic

Samuel Murail

Pierre Tufféry

Sjoerd J de Vries

Abstract

INTRODUCTION

MATERIALS AND METHODS

Description of the web server

Figure 1.

RESULTS

Performance and comparison with other approaches

Table 1.

The simultaneous modeling and remodeling of multiple loops

Table 2.

Results on a concrete example

Figure 2.

Computational time

CONCLUSION

DATA AVAILABILITY

Supplementary Material

SUPPLEMENTARY DATA

FUNDING

REFERENCES

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases