Abstract
Despite the immense progress recently witnessed in protein structure prediction, the modeling accuracy for proteins that lack sequence and/or structure homologs remains to be improved. We developed an open-source program, DeepFold, which integrates spatial restraints predicted by multi-task deep residual neural-networks along with a knowledge-based energy function to guide its gradient-descent folding simulations. The results on large-scale benchmark tests showed that DeepFold creates full-length models with accuracy significantly beyond classical folding approaches and other leading deep learning methods. Of particular interest is the modeling performance on the most difficult targets with very few homologous sequences, where DeepFold achieved an average TM-score that was 40.3% higher than trRosetta and 44.9% higher than DMPfold. Furthermore, the folding simulations for DeepFold were 262 times faster than traditional fragment assembly simulations. These results demonstrate the power of accurately predicted deep learning potentials to improve both the accuracy and speed of ab initio protein structure prediction.
Author summary
Template-free protein structure prediction remains an important unsolved problem. We proposed a new pipeline to construct full-length protein structures by coupling multi-level deep learning potentials with fast gradient-based folding simulations. Large-scale benchmark tests demonstrated significant advantages in both accuracy and speed over other fragment-assembly and deep learning-based approaches. The results revealed that the key factor in the success of the deep learning approach is its ability to provide an abundant set of accurate spatial restraints (~93*L, where L is the protein length), which help smooth the energy landscape and make gradient-based search a feasible optimization tool. Nevertheless, extensive folding simulations are still needed in cases where only sparse restraints are available, as provided by threading alignments and low-resolution structural biology experiments.
This is a PLOS Computational Biology Methods paper.
Introduction
The goal of protein structure prediction is to determine the spatial location of every atom in a protein from its primary sequence. Depending on whether reliable structural templates are available in the PDB, protein structure prediction methods have been divided into template-based modeling (TBM) and template-free (FM) approaches, the latter of which is also called ab initio modeling [1]. For many years, TBM has been the most reliable method for modeling protein structures; however, its accuracy is essentially determined by the availability of close homologous templates and the quality of the query-template alignments. Conversely, ab initio methods are designed to use advanced energy functions and sampling techniques to improve the folding performance for proteins that lack homologous templates in the PDB. However, due to the inaccuracy in force field design and the limitations of conformational search engines, the performance of the physics-based FM methods for non-homologous targets has remained significantly worse than that of the TBM methods for targets with readily identifiable homologous templates [2, 3].
Throughout the last few years, the use of deep learning techniques to predict spatial restraints from sequence and/or multiple sequence alignments (MSAs) has dramatically improved the accuracy of ab initio structure prediction [4]. For example, in CASP11 and CASP12, predictors primarily used direct coupling analysis from MSAs and shallow neural networks to predict contact maps, where the prediction accuracy largely relied on the identification of abundant sequence homologs in order to accurately predict contacts based on the information from correlated mutation patterns [5]. In the CASP13 experiment, however, the top-ranked server groups, Zhang-Server and QUARK, used contact maps predicted by deep convolutional residual networks (ResNets) [6] to guide the I-TASSER [7] and QUARK [8] folding simulations, respectively, which greatly improved the contact prediction and folding accuracies for the physics- and knowledge-based modeling approaches. This was especially apparent for targets that lacked homologous templates and high-quality MSAs [5]. Gains can also be obtained by improving protein sequence matching itself, which yields substantially more of the native structure contacts [9]. Here, a contact map is specified by a binary L×L matrix, where L is the protein length and each entry indicates whether the Cβ atoms (or Cα atoms for glycine) of two residues are <8Å apart from each other. In the most recent CASP experiment, CASP14, multiple deep learning constraints, including distance maps, which are conceptually similar to contact maps but include inter-residue distance information [10, 11], inter-residue dihedral angles [12] and hydrogen-bonding networks [13], were integrated with the folding simulations. The results demonstrated significant improvements over the contact-based structure assembly approaches, due to the introduction of more precise spatial information to guide the folding simulations [13].
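To make the contact map definition concrete, the minimal sketch below computes such a binary L×L map from an array of Cβ coordinates; the function name and array layout are illustrative choices of this sketch, not part of any of the cited programs.

```python
import numpy as np

def contact_map(cb_coords: np.ndarray, cutoff: float = 8.0) -> np.ndarray:
    """Binary L x L contact map: entry (i, j) is True when the Cb-Cb
    distance is below the cutoff (Ca would be substituted for Cb at
    glycine positions upstream of this function).

    cb_coords: (L, 3) array of Cb coordinates in Angstroms.
    """
    diff = cb_coords[:, None, :] - cb_coords[None, :, :]  # (L, L, 3) displacements
    dist = np.linalg.norm(diff, axis=-1)                  # (L, L) pairwise distances
    return dist < cutoff
```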
Despite the improvement in modeling accuracy, the approaches built on traditional fragment/template assembly folding techniques, such as I-TASSER [7], Rosetta [14] and QUARK [8], often require lengthy simulation times, especially for longer proteins, which hinders them from large-scale modeling applications. In fact, the necessity of extensive conformational sampling required for ab initio modeling is due to the immense structure space and complex energy landscape associated with protein folding. Although this may still be required when integrated with sparse spatial constraints (e.g., around n*L restraints where n<1) from threading alignments and low-resolution experiments [15–17], the advanced deep learning techniques can now provide abundant (>20*L) high-quality restraints. These abundant and accurate restraints can smooth the rough protein folding energy landscape to a large degree. In this regard, extensive folding simulations may no longer be needed, which partially explains the remarkable success enjoyed by other teams in the CASP experiments such as AlphaFold [11] in CASP13 and trRosetta [12] in CASP14, which constructed structural models using local gradient-descent based conformational searching procedures.
Inspired by these advances, we have developed a fast open-source protein folding pipeline, DeepFold, which combines a general knowledge-based statistical force field with a deep learning-based potential produced by the new DeepPotential program to improve the speed and accuracy of ab initio protein structure prediction. The pipeline was carefully benchmarked on large-scale datasets and showed superiority over other leading structure prediction approaches, all with greatly reduced simulation times compared to traditional folding simulation methods. Each component of the program, including the deep learning models and L-BFGS structure optimization pipeline, is integrated into an easy-to-use, stand-alone package available at both https://zhanggroup.org/DeepFold and https://github.com/robpearc/DeepFold. Meanwhile, an online webserver for DeepFold is available at https://zhanggroup.org/DeepFold, where users can apply the method to generate structure models for their own protein sequences.
Results and discussion
Distance and orientation restraints have the dominant impact on global fold accuracy
As shown in Fig 1, DeepFold starts by searching the query sequence through multiple whole-genome and metagenomic databases using DeepMSA2 [18] to create an MSA. Next, the co-evolutionary coupling matrices are extracted from the resulting MSA and used as input features by the deep ResNet architecture of DeepPotential to predict spatial restraints, including distance/contact maps and inter-residue torsion angle orientations. These restraints are then converted into a deep learning-based potential, which is used along with a general knowledge-based physical potential to guide the L-BFGS folding simulations for full-length model generation (see Methods).
To test DeepFold, we collected a set of 221 non-redundant (<30% sequence identity to each other) protein domains from the SCOPe 2.06 database and FM targets from CASP9-12. These proteins were non-homologous (with a sequence identity <30%) to the training dataset of DeepFold and were all defined as Hard threading targets by LOMETS [19] after excluding homologous templates with >30% sequence identity to the query. Here, a Hard target is a protein for which LOMETS could not identify a significant template, allowing for a systematic evaluation of the developed method on ab initio modeling targets. To examine the importance of the different components of the DeepFold energy function, we ran DeepFold using different combinations of spatial restraints from DeepPotential for the 221 test proteins, where the modeling results are summarized in Fig 2 and S1 Table in the Supporting Information (SI).
Overall, the baseline potential using just the general physical energy function (GE in S1 Table and Fig 2) achieved an average TM-score of only 0.184. Furthermore, when considering a cutoff TM-score ≥0.5 to indicate a correctly folded model, which would mean the predicted model and native structure share the same global fold [8, 20], the baseline energy function was unable to correctly fold any of the test proteins (S1 Table). Given that the coupling of a similar force field with replica-exchange Monte Carlo simulations in QUARK could fold substantially more proteins with a much higher average TM-score [8], this result suggests that one major reason for the failure here is the frustration of the baseline energy landscape, which cannot be quickly explored by gradient-based searching methods. The further inclusion of Cα and Cβ contact restraints improved the TM-score to 0.263, where 4 of the 221 test proteins, or 1.8%, were successfully folded with TM-scores ≥0.5. The addition of the Cα and Cβ distance restraints dramatically improved the average TM-score on the test dataset to 0.677, representing an increase of 157.4%, where 76.0% of the test proteins were correctly folded. Lastly, the inclusion of the inter-residue orientations further improved the average TM-score to 0.751 and the percent of successfully folded proteins to 92.3%. Overall, as the level of detail in the restraints increased, the energy landscape became increasingly smooth, and thus the L-BFGS folding simulations resulted in increased average TM-scores across the test proteins.
Although the addition of inter-residue distances to the energy function brought about the highest increase in accuracy, one interesting observation is the synergistic effect observed when combining different components of the restraints. For example, the addition of inter-residue orientations improved DeepFold’s ability to find structures that optimally satisfied the distance restraints. As evidence of this, in S2 Table we present the mean absolute errors (MAEs) for the top n*L long-range distance restraints which were calculated between the DeepPotential predicted distance maps and the final DeepFold models with and without the use of the orientation restraints. The data in S2 Table shows that the introduction of inter-residue orientations helped to significantly decrease the MAE between the predicted distance maps and the structure models. For example, when considering the top 2*L distance restraints, which were sorted by their DeepPotential distance prediction confidence scores, the MAE was 0.74 Å when DeepFold was run only using the GE and contact/distance restraints, whereas the MAE was reduced by 17.6% to 0.61 Å when the orientation restraints were added. Therefore, not only do orientations provide useful geometric information on their own, they also help further smooth the energy landscape and facilitate the L-BFGS search to identify energy basins that satisfy the ensemble of spatial restraints.
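For reference, the MAE statistic used above can be computed as in the following sketch, where the confidence-sorted selection of the top n*L long-range pairs mirrors the description in the text; the array names and the |i−j|>23 long-range convention (taken from the Methods) are assumptions of this illustration.

```python
import numpy as np

def top_nL_mae(pred_dist, confidence, model_dist, L, n=2, min_sep=24):
    """Mean absolute error over the top n*L most confident long-range pairs.

    pred_dist, confidence, model_dist: (L, L) arrays holding the predicted
    distances, their DeepPotential-style confidence scores, and the
    distances measured in the final model. Long-range means |i - j| > 23.
    """
    i, j = np.triu_indices(L, k=min_sep)      # long-range residue pairs
    order = np.argsort(-confidence[i, j])     # most confident pairs first
    top = order[: n * L]
    return float(np.mean(np.abs(pred_dist[i[top], j[top]]
                                - model_dist[i[top], j[top]])))
```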
Furthermore, inter-residue orientations were particularly useful for folding β-proteins. As seen in S3 Table, the inclusion of orientations increased the average TM-score for β-proteins from 0.590 to 0.706, corresponding to a 19.7% improvement, which was significantly higher than the 10.9% improvement observed on the overall dataset (S1 Table); this makes sense intuitively given the intricate hydrogen bonding patterns present in β-proteins that would require more detailed local inter-residue dihedral angle restraint information to properly recapitulate. Fig 3A presents an illustrative example from SCOPe protein d1jqpa1, which adopts a β-barrel fold. The model built without orientations had a low TM-score of 0.313 and an RMSD of 11.43 Å, where the MAE between the top 2*L DeepPotential distances and the model without orientations was 0.87 Å. In contrast, the model built using the orientation restraints had a drastically improved TM-score of 0.800 and an RMSD of 2.74 Å. Additionally, the MAE between the top 2*L DeepPotential distances and the model improved to 0.61 Å. Thus, the orientation restraints provide complementary information to the distance maps and had a particularly important role for folding β-proteins.
The general knowledge-based energy function improves local physical structure quality
The rapid improvement in the accuracy of deep learning-based restraint prediction has called into question the role of the physical energy function in the era of deep learning. Indeed, we saw that the major contributor to DeepFold’s accuracy is the high number of accurately predicted restraints generated by DeepPotential, where their addition dramatically improved the average TM-score from 0.184 to 0.751 (Fig 2). Nevertheless, the physical energy function, which accounts for fundamental forces that drive protein folding, such as hydrogen bonding interactions and van der Waals clashes, plays an important role in improving the physical quality of the predicted models; this is especially true when the model quality is poor. As evidence, S4 Table lists several model quality metrics for models generated with and without the use of the GE function. On the overall test set of 221 hard protein targets, the inclusion of the GE potential provided a modest yet consistent enhancement in the physical model quality, as reflected in the improvement of the MolProbity score [21] from 1.735 to 1.692 with the addition of the GE function (S4 Table). Similar trends were observed for the secondary structure quality (SOV score [22]), the number of Ramachandran outliers, and the steric clash score (S4 Table), all of which improved with the inclusion of the GE. The most notable improvement was observed in the clash score, which improved by 13.3% on the overall dataset.
More significant improvements were witnessed for the 16 targets with poor physical quality, as measured by a MolProbity score in the 50th percentile or lower relative to PDB structures. For these targets, the physical energy function improved the average MolProbity score from 2.882 to 2.308, representing an improvement of 19.9% compared to 2.5% on the overall dataset. Similarly, these improvements were consistent across the SOV score, number of Ramachandran outliers, and the clash score for these targets. Again, the most dramatic improvement occurred for the clash score, which decreased from 17.5 to 8.6, representing an improvement of 50.9%. Fig 3B illustrates a case study from SCOPe protein d1xsza2, where models were generated with and without the inclusion of the general physical energy function. In the model built without the GE function, there are several residues that directly overlap each other, leading to severe steric clashes, as shown in the inset. These clashes, among other factors, led to a model with a very high, and thus unfavorable, MolProbity score of 3.908 (3rd percentile) along with a very high clash score of 212.8. As shown in the inset in Fig 3B, these clashes were resolved with the inclusion of the GE potential and its term for van der Waals clashes, where the resulting model had a reduced MolProbity score of 1.624 (92nd percentile) and a low clash score of 1.2. Clearly, simply satisfying the geometric restraints provided by deep learning may lead to models that are physically unrealistic, where the introduction of physical energy terms may partially alleviate this problem.
Comparison of DeepFold with other leading modeling methods
To further evaluate the performance of DeepFold, we compared the modeling results on the 221 test proteins with a leading contact map-based folding program (C-I-TASSER [23]), two top distance (DMPfold [24]) and distance/orientation-based (trRosetta [12]) methods, and the classic I-TASSER pipeline [7]. To provide a fair comparison, we used the same MSAs that DeepFold used, which were produced by DeepMSA2 [18] (see S1 Fig), for the deep learning restraint prediction by DMPfold, trRosetta and C-I-TASSER, as well as for template identification by LOMETS in I-TASSER and C-I-TASSER. Furthermore, templates with ≥30% sequence identity to the query were excluded from I-TASSER and C-I-TASSER.
As shown in Tables 1 and S5, the average/median TM-scores of the DeepFold models for the 221 test proteins were significantly higher than all the control methods. For instance, the average TM-score for the models produced by I-TASSER was only 0.383, where DeepFold achieved an average TM-score (0.751) that was 96.1% higher than I-TASSER with a p-value of 9.4E-80 as determined by a paired, two-sided Student’s t-test (Table 1). This result is understandable as I-TASSER does not use any deep learning spatial restraints, making the modeling accuracy more reliant on the templates, while, by design, all homologous templates were excluded for the Hard threading targets. The inclusion of deep learning contact maps into C-I-TASSER greatly increased the TM-score to 0.584. Nevertheless, DeepFold still achieved an average TM-score that was 28.6% higher than C-I-TASSER with a p-value of 1.8E-55. This is mainly due to the fact that DeepFold utilizes both distance and orientation restraints, which contain more detailed information than the contact maps used in C-I-TASSER [5]. The results for the median values were similar to the averages, where DeepFold achieved a median TM-score of 0.800, while I-TASSER and C-I-TASSER obtained median TM-scores of 0.357 and 0.607, respectively, which were significantly lower than DeepFold with p-values of 3.1E-37 and 1.9E-35 as determined by two-sided, non-parametric Wilcoxon signed-rank tests (S5 Table).
Table 1. Summary of the structure modeling results by DeepFold and the control methods on the 221 test proteins.
| Method | TM-score (p-value) | RMSD, Å (p-value) | Correct Folds* | TM(DeepFold) > TM(Method)‡ |
| --- | --- | --- | --- | --- |
| I-TASSER | 0.383 (9.4E-80) | 15.10 (7.1E-25) | 24.0% | 95.9% |
| C-I-TASSER | 0.584 (1.8E-55) | 8.89 (4.0E-26) | 67.0% | 95.9% |
| DMPfold | 0.657 (5.6E-37) | 7.81 (2.0E-18) | 79.6% | 92.3% |
| trRosetta | 0.694 (8.3E-24) | 6.81 (4.7E-09) | 85.5% | 87.8% |
| DeepFold | 0.751 | 5.61 | 92.3% | – |
* This column represents the percent of proteins with TM-scores ≥0.5.
‡ This column indicates the percent of test proteins for which DeepFold generated a model with a higher TM-score than the control method.
Interestingly, there were two targets (d1ltrd and d1nova) for which I-TASSER and C-I-TASSER produced models that were significantly more accurate than DeepFold. To examine the reason for the discrepancy in performance, S2 Fig depicts the models generated by I-TASSER, C-I-TASSER, and DeepFold superposed with the native structures along with the top templates used by I-TASSER and C-I-TASSER for these proteins. For d1ltrd, despite the fact that it was a hard threading target, LOMETS was able to identify a reliable template from the PDB (1prtI) with a coverage of 92.6% and a TM-score of 0.553; thus, both I-TASSER and C-I-TASSER constructed accurate models with TM-scores of 0.663 and 0.637, respectively. Conversely for DeepFold, the generated MSA contained few homologous sequences with a normalized number of effective sequences (or Neff, defined in S1 Text) of 0.42, resulting in inaccurate predicted restraints with an MAE of 2.60 Å for the top 2*L distances. This ultimately led DeepFold to produce a poor model with a TM-score of 0.326. Additionally, the contact precision for the top L/2 contacts used by C-I-TASSER was only 50.0%, which is largely why the C-I-TASSER model was worse than the I-TASSER model. Similarly, for d1nova, LOMETS was able to identify a reliable template (PDB ID 1hofC) with a coverage of 100% and a TM-score of 0.544, which resulted in accurate I-TASSER and C-I-TASSER models with TM-scores of 0.631 and 0.713 for the two methods, respectively. Again, for DeepFold, the generated MSA was shallow with a normalized Neff value of 9.40. Nevertheless, the predicted distance restraints were still accurate with an MAE of 0.90 Å for the top 2*L distances; however, the predicted orientations were inaccurate, particularly the Ω orientation, which had an MAE of 31.3° for the top 2*L restraints. This resulted in a model with a TM-score of 0.546, which still possessed a correct fold, but was worse than the models generated by I-TASSER and C-I-TASSER. Unlike the previous example, the C-I-TASSER model was closer to the native structure than the I-TASSER model for d1nova as the predicted contacts were accurate with a precision of 98.7% for the top L/2 contacts. These two examples highlight that even with the advances in deep learning methods, template-based modeling still remains important, particularly given the reliance of deep learning techniques on the generated MSAs, which may be of lower quality than the identified templates for numerous targets.
DeepFold also outperformed two other leading distance (DMPfold) and distance/orientation-based (trRosetta) methods, where DMPfold achieved average/median TM-scores of 0.657/0.710 and trRosetta obtained average/median TM-scores of 0.694/0.749. Therefore, DeepFold’s average/median TM-scores were 14.3%/12.7% higher than DMPfold and 8.2%/6.8% higher than trRosetta, where the differences were statistically significant with p-values of 5.6E-37/2.0E-34 and 8.3E-24/1.6E-26, respectively (see Tables 1 and S5). Furthermore, Fig 4 presents a head-to-head comparison of DeepFold with the control methods, where DeepFold outperformed trRosetta and DMPfold on 194 and 204 of the 221 test proteins, respectively. Compared to DMPfold, an obvious advantage of DeepFold is the use of inter-residue dihedral angle orientations, which resulted in a substantial TM-score increase for DeepFold as shown in Fig 2. Compared to trRosetta, since both methods use distance and orientation restraints, the major advantage of DeepFold is the high accuracy of the restraints generated by DeepPotential. To verify this, in S6 Table we provide an accuracy comparison for the Cβ distance predictions by the different programs, where the distance maps from DeepPotential had a significantly lower MAE relative to the native structures than those produced by both trRosetta and DMPfold across all cutoff values. In S7 Table, we also list the modeling results of trRosetta using the DeepPotential restraints. Although trRosetta+DeepPotential resulted in a higher average TM-score (0.735) than trRosetta alone, due to the use of the more accurate restraints from DeepPotential, the average TM-score of DeepFold was still significantly higher than that of trRosetta+DeepPotential with a p-value of 3.9E-9. This is probably due to the unique DeepFold knowledge-based force field and the utilization of the additional Cα distance maps that are not used by trRosetta. In addition, the simultaneous optimization of the DeepFold force field with the L-BFGS search engine (see Methods) helped enhance the structure construction process.
Here, of particular interest is the modeling performance for those hard targets with very few effective sequences in their MSAs, which are the most difficult targets to fold using deep learning approaches. For this purpose, we collected a set of 16 targets with normalized Neff values less than 1.0 and calculated the TM-scores for the models produced by DeepFold, trRosetta, and DMPfold. On these targets, DeepFold achieved an average TM-score of 0.494, which was 40.3% higher than trRosetta (0.352) and 44.9% higher than DMPfold (0.341). In S3 Fig, we present a scatter plot of TM-score vs. the logarithm of the normalized MSA Neff value for the three methods on all 221 test proteins, where DeepFold demonstrated a lower correlation between the TM-score and Neff value than trRosetta and DMPfold, which partially explains its superior performance.
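The normalized Neff statistic used throughout is defined in S1 Text; as a rough illustration, the sketch below follows the common DeepMSA-style definition (the sum of inverse cluster sizes at an 80% sequence-identity threshold, divided by the square root of the protein length), which may differ in detail from the exact formula used in this work.

```python
import numpy as np

def normalized_neff(msa: np.ndarray, id_cut: float = 0.8) -> float:
    """Approximate normalized number of effective sequences.

    msa: (N, L) integer-encoded alignment. Note: the pairwise identity
    computation below uses O(N^2 * L) memory, fine for a sketch but not
    for very deep alignments.
    """
    N, L = msa.shape
    # pairwise fractional sequence identity between aligned sequences
    ident = (msa[:, None, :] == msa[None, :, :]).mean(axis=-1)  # (N, N)
    cluster_size = (ident >= id_cut).sum(axis=1)  # includes the sequence itself
    return float((1.0 / cluster_size).sum() / np.sqrt(L))
```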
It is of note that some of the proteins in the benchmark dataset may be homologous to the proteins that DeepFold and the other methods were trained on, as these deep learning methods often require a comprehensive set of training proteins to properly generalize. Thus, in S8 Table we report the results for the selected methods on the 90 proteins in the benchmark dataset that shared <30% sequence identity with any of the training proteins used by DeepPotential. From the data in Table 1 and S8 Table, it can be seen that the performance of each method, including DeepFold, was quite similar to that on the overall benchmark dataset, where the accuracy of each deep learning method on the 90 proteins was only slightly lower (~0.7–2.8% lower average TM-score) than on the 221 benchmark targets, largely due to the lower Neff values of the MSAs in the pruned dataset. Nevertheless, DeepFold still significantly outperformed each of the control methods on these targets.
Lastly, we compared the modeling accuracy of DeepFold with AlphaFold on the 31 CASP13 FM targets that the AlphaFold human group submitted models for (Fig 4E). Note, we could not benchmark the performance of AlphaFold on the 221 test proteins as the feature generation scripts and folding pipelines were not publicly available when this work was performed. It can be seen from Fig 4E that DeepFold outperformed AlphaFold on 20 of the 31 FM targets, where, on average, the TM-score of DeepFold was 0.636 compared to 0.589 for AlphaFold (p-value = 0.025, S9 Table). It is also important to note that the AlphaFold human group performed thousands of different optimization runs for the CASP13 targets as reported [11], while DeepFold only used a single optimization run in this study.
Comparison of DeepFold with the most recently developed methods: AlphaFold2 and RosettaFold
Since DeepFold uses restraints from DeepPotential, which was developed before the advances made by AlphaFold2 [25] in CASP14, it is also of interest to compare the results against the most recent self-attention-based neural network methods, namely, AlphaFold2 and RosettaFold [26]. Thus, in S4A–S4C Fig, we provide a head-to-head comparison of the DeepFold modeling results utilizing the restraints from DeepPotential with RosettaFold and AlphaFold2 on the 221 test proteins in terms of the model TM-scores, where the results are summarized in S10 Table. Overall, the average TM-score of the RosettaFold end-to-end pipeline was 0.812 and the average TM-score of the Pyrosetta version was 0.838, which were higher than the results by DeepFold (TM-score = 0.751) with p-values of 3.6E-10 and 8.0E-22, respectively. Similarly, the average TM-score of AlphaFold2 was 0.903, which was higher than DeepFold with a p-value of 1.4E-49. These results were expected given that the advances in deep self-attention neural networks and end-to-end training by AlphaFold2 and, subsequently, RosettaFold showed greatly improved modeling accuracy over previously introduced convolutional ResNet architectures, such as DeepPotential.
Notably, there were 7 targets for which DeepFold outperformed AlphaFold2. In S5 Fig, we illustrate two examples where DeepFold generated models that were significantly more accurate than AlphaFold2. The first example is from SCOPe protein d1a34a, for which DeepFold generated a model with a TM-score of 0.613, while AlphaFold2 generated a model with a TM-score of 0.242. For this target, DeepMSA2 was not able to identify any sequence homologs, resulting in an MSA composed of only the query sequence and an extremely low normalized Neff value of 0.08. Nevertheless, DeepPotential generated accurate restraints with an MAE of 1.10 Å for the top 2*L distances, resulting in a higher quality model than that produced by AlphaFold2. The second example is from SCOPe protein d1s2xa, for which DeepFold generated a model with a TM-score of 0.590, while AlphaFold2 generated a model with a TM-score of 0.369. Again, for this target, DeepMSA2 was only able to identify two sequence homologs, which resulted in a very low normalized Neff value of 0.15. Additionally, the DeepPotential restraints were fairly inaccurate with an MAE of 2.54 Å for the top 2*L distances and 59.29° for the 2*L Ω orientations. Interestingly, even though the orientation restraints were inaccurate, their inclusion greatly improved the modeling accuracy, as the model built using only the contact and distance restraints possessed a low TM-score of 0.268, while the model built using the full set of contact/distance and orientation restraints had a TM-score of 0.514. Moreover, the addition of the general knowledge-based energy function further improved the TM-score to 0.590. This suggests that even when inaccurate, the combination of various restraints with a general energy function may act synergistically to filter out inaccuracies in the predictions. It is also noteworthy that the two preceding examples were from proteins with few to no homologous sequences. In fact, if we consider the 5 proteins in the benchmark dataset with the least homologous sequence information (<3 sequence homologs) and normalized Neff values <0.20, DeepFold generated more accurate models than AlphaFold2 for 4 of these targets, where the average TM-score of DeepFold was 0.528 compared to 0.398 for AlphaFold2. This suggests that, while deep self-attention-based protein structure prediction approaches have demonstrated an improved ability to fold proteins with few sequence homologs, the performance on the most extreme cases with few to no sequence homologs remains to be improved.
Lastly, given the importance of the most recent advances in protein structure prediction, we sought to determine whether or not they could be incorporated into DeepFold to further improve its performance. To answer this question, we utilized the restraints taken from RosettaFold, including the Cβ distances and orientations, as well as the Cα distances/contacts and Cβ contacts from DeepPotential to guide the DeepFold simulations. The results of this analysis are depicted in S11 Table and S4D–S4F Fig, which present head-to-head comparisons between DeepFold utilizing the combined restraints with RosettaFold and AlphaFold2 in terms of the model TM-scores on the 221 benchmark proteins. The results show that with the combined RosettaFold and DeepPotential restraints, DeepFold achieved an average TM-score of 0.844, which was higher than that attained by the end-to-end (TM-score = 0.812) and Pyrosetta (TM-score = 0.838) versions of RosettaFold with p-values of 2.4E-11 and 1.2E-2, respectively. These data demonstrate that the DeepFold knowledge-based force field and DeepPotential contact and Cα distance restraints may improve the results obtained by RosettaFold. Additionally, they show that DeepFold is a versatile platform that can be easily adapted for any future advances in state-of-the-art deep learning restraint predictors.
DeepFold greatly improves the accuracy and speed of protein folding over classical ab initio methods
Rosetta [14] and QUARK [8] are two of the most well-known fragment-assembly methods and have been consistently ranked as the top methods for ab initio protein structure prediction in previous CASP experiments [3, 27, 28]. However, a major drawback of the traditional ab initio folding approaches is that their modeling performance drops as the protein length increases, making them significantly less reliable for modeling larger protein structures composed of more than 150 residues [1]. To examine the impact of deep learning on ab initio structure prediction for long protein sequences, we compared DeepFold to both Rosetta and QUARK, where Fig 5C depicts the TM-scores of DeepFold, QUARK, and Rosetta vs protein length. The data show that the performance of DeepFold remained consistent as the protein length increased, where the average TM-score for large proteins composed of 350–450 residues was in fact higher than that for the small proteins in the test set with lengths <150 residues (0.809 vs. 0.742), mostly due to the more favorable MSAs collected for the set of larger proteins. However, the performance of both QUARK and Rosetta noticeably decreased as the protein length increased; the average TM-score for proteins with lengths less than 150 residues was 0.329 for QUARK and 0.304 for Rosetta but was only 0.190 and 0.196 for QUARK and Rosetta, respectively, on proteins with lengths between 350 and 450 residues. From these results, DeepFold outperformed QUARK and Rosetta remarkably on the overall dataset and especially on the longest proteins in the dataset, for which the average TM-score of DeepFold was 325.8% higher than QUARK and 312.8% higher than Rosetta.
Another major limitation of fragment-assembly approaches is that they require lengthy simulations to adequately explore the immense structure space available. In Fig 5A and 5B, we compare the folding simulation times required by DeepFold and the QUARK fragment assembly approach for different protein lengths. The results show that DeepFold is orders of magnitude faster than QUARK, especially for large proteins. Note that we ran QUARK using 5 separate trajectories in parallel and the run times shown in Fig 5A are the average run time across all 5 simulation trajectories. Thus, if the simulations were run sequentially, the run time would be 5 times longer, which further accentuates the cost of fragment assembly. Therefore, while fragment assembly requires hours to days to fold a protein, DeepFold requires only seconds to minutes. Overall, the average run time of DeepFold on the test set was 6.98 minutes, while the average for QUARK was 1830.82 minutes at an average protein length of 188.1 residues. This indicates that QUARK requires 262.3 times the computing time of DeepFold for one simulation trajectory, and the difference grew even larger as the sequence length increased. Overall, the run time of DeepFold was similar to that of trRosetta, which required 5.48 minutes on average to construct models on the test dataset. Of particular importance is that the greatly reduced folding times did not cause the model quality to deteriorate for larger proteins, demonstrating the ability of deep learning restraints to effectively smooth the energy landscape, thereby allowing rapid and accurate optimization across protein lengths.
Gradient-based protein folding requires a high number of deep learning restraints
The success of rapid L-BFGS-based protein folding approaches raises the question of what role fragment assembly plays in protein structure prediction. As L-BFGS and other gradient-based methods are essentially local optimization techniques that may be prone to becoming trapped in local energy minima, the more extensive conformational sampling performed by fragment assembly may still be necessary in the absence of a high number of deep learning spatial restraints.
To examine this hypothesis, Fig 6A depicts the TM-score for L-BFGS-based protein folding simulations using different numbers of spatial restraints. Consistent with the data in Fig 2, Fig 6A shows that only using the GE function to guide the L-BFGS simulations resulted in a poor average TM-score of 0.184, which was significantly lower than that obtained by QUARK (TM-score = 0.274), which uses a knowledge-based energy function without deep learning restraints [8]. This indicates the frustration of the baseline energy force field of DeepFold, which cannot be quickly explored with gradient-based methods. Inclusion of the top L all-range Cβ distances slightly improved the TM-score to 0.186, and at least the top 5*L distances were required to improve the TM-score to a significant degree. In order to achieve a performance that was better than QUARK, the L-BFGS simulations required 10*L Cβ distance restraints, where the average TM-score using this number of restraints was 0.323. The inclusion of more distance restraints, such as the top 15*L and 20*L restraints, steadily improved the average TM-score to 0.392 and 0.453, respectively.
However, our tests showed that setting a specific probability cutoff for the selection of distance restraints allowed the method to achieve the best result. In DeepFold, all distances with a probability >0.55 were selected for inclusion in the L-BFGS optimization procedure, which corresponded to an average of ~93*L distance restraints on the test set, increasing the TM-score to 0.668. Overall, the addition of the full set of DeepPotential restraints (including contacts, Cα distance and orientations in addition to the Cβ distances) increased the accuracy by an additional 12.4%, resulting in a TM-score of 0.751 for the full pipeline. Thus, it is clear that L-BFGS requires a high number of spatial restraints in order to adequately smooth the energy landscape and make gradient-based protein folding feasible.
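A minimal sketch of this confidence-based selection is given below, assuming that the pairwise probability is the histogram mass below 20 Å (consistent with the binning described in Methods); the exact confidence definition used by DeepFold may differ.

```python
import numpy as np

def select_restraints(bin_probs: np.ndarray, cutoff: float = 0.55,
                      min_sep: int = 2):
    """Pick residue pairs whose summed probability of d < 20 A exceeds
    the cutoff, mimicking the >0.55 selection rule described above.

    bin_probs: (L, L, 38) distance histogram from the predictor, where
    the last bin is d >= 20 A (an assumption matching the Methods).
    Returns index arrays (i, j) with i < j for the selected pairs.
    """
    p_in_range = 1.0 - bin_probs[:, :, -1]   # probability mass below 20 A
    L = bin_probs.shape[0]
    i, j = np.triu_indices(L, k=min_sep)
    keep = p_in_range[i, j] > cutoff
    return i[keep], j[keep]
```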
Case study reveals drastically different dynamics in Monte Carlo and L-BFGS folding simulations
To further illustrate the differences in the sampling procedures for the fragment assembly method, QUARK, and the L-BFGS optimization approach, DeepFold, we present in Fig 6B–6D a case study from the amino terminal domain of enzyme I from E. coli (SCOPe ID: d1zyma1). Both DeepFold and QUARK generated a correct fold for this target, where the TM-score of the model produced by QUARK was 0.547 and the TM-score for the DeepFold model was very high at 0.923 with an RMSD of 1.29 Å, indicating a close atomic match to the experimental structure.
To show the conformational changes during the QUARK folding simulations, Fig 6B depicts the TM-score of the conformation for the last replica at REMC cycle i relative to the conformation of the previous decoy at cycle i-1. From the figure, it can be seen that large changes in the conformation occur throughout the simulation due to the global conformational searching and replica exchange steps. On the other hand, the opposite trend was observed for the L-BFGS folding simulations shown in Fig 6C, during which large conformational changes occurred early on in the simulation, and the global fold of the protein was largely determined by the 100th L-BFGS step. After that, only small fluctuations in the conformation occurred, where the L-BFGS optimization quickly converged and did not extensively sample the structure space due to the nature of the local optimization of the smooth energy landscape produced by the large number of deep learning restraints.
Moreover, Fig 6D depicts the DeepFold models at L-BFGS steps 100 and 1100 superposed with the experimental structure. While the global fold of the model was determined by the 100th L-BFGS step, substantial conformational changes occurred during the later L-BFGS steps in two regions, namely the highlighted terminal coil and core helix regions, which were poorly formed at step 100 due to inconsistencies in the spatial restraints for these sections. For the helix region in particular, the model at step 100 had poorly formed secondary structure as well as severely clashing segments. These errors were gradually corrected over the remaining 1,000 L-BFGS steps. Therefore, while the global folds of proteins may quickly be determined by the consensus DeepPotential restraints during the L-BFGS simulations, additional steps are often needed to precisely fine-tune the model quality under the guidance of the atomic force field.
Conclusions
We developed an open-source program (DeepFold) to quickly construct accurate protein structure models from deep learning-based potentials. DeepFold significantly outperformed other ab initio structure prediction methods such as Rosetta, QUARK, I-TASSER, C-I-TASSER, DMPfold, and trRosetta on the test set of 221 Hard threading targets, and AlphaFold on the CASP13 FM targets. The impact of deep learning on DeepFold was best highlighted by the benchmark test with Rosetta, QUARK and I-TASSER, which represent the top traditional FM and TBM methods. On the benchmark dataset, Rosetta, QUARK and I-TASSER were only able to generate correctly folded models for 0.9%, 2.7% and 24.0% of the proteins, respectively, while DeepFold successfully folded 92.3% of the test proteins with an average TM-score of 0.751, compared to 0.260, 0.274, and 0.383 for Rosetta, QUARK and I-TASSER, respectively.
Furthermore, the average TM-score of DeepFold was 7.8% and 13.9% higher than the other leading deep learning-based methods, DMPfold and trRosetta, respectively, starting from the same MSAs. It was also 8.0% higher than AlphaFold on the 31 CASP13 FM targets. Of particular interest is the performance on the hardest targets in the dataset with very shallow MSAs (i.e., with normalized Neff values less than 1.0), where the average TM-score of DeepFold was 40.3% higher than trRosetta and 44.9% higher than DMPfold. On top of the improved accuracy, DeepFold had a similar running time as other gradient descent-based approaches such as trRosetta, but was more than 200 times faster than the traditional fragment-assembly-based approaches. The success of DeepFold is mainly due to the effective combination of the inherent knowledge-based potential with the high number of accurately predicted spatial restraints that help smooth the energy landscape, making L-BFGS optimization tractable.
Despite the success, significant improvements may still be made. For example, the use of attention-based networks [25, 29, 30], especially an end-to-end learning protocol [25], should help further improve the prediction accuracy of DeepFold. Given that the main input features to DeepPotential are derived from co-evolutionary analyses, DeepFold often requires that the input MSAs contain a sufficient number of effective sequences to enable determination of the co-evolutionary relationships between protein residues. Despite the fact that the quality of the DeepFold models was considerably less dependent on the MSA quality than other methods such as DMPfold and trRosetta, the use of a transformer architecture should help further enhance the performance of DeepPotential for those targets with poor MSA quality and few homologous sequences by self-attention based, iterative MSA refinement. This can be illustrated by the comparison of DeepFold with the most recent methods, RosettaFold and AlphaFold2, which achieved higher TM-scores on the benchmark targets. Nevertheless, when utilizing the combined RosettaFold and DeepPotential restraints, DeepFold was able to outperform both the end-to-end and distance-based versions of RosettaFold, demonstrating that it is a versatile platform that can be easily adapted for advances in the state of the art. Meanwhile, DeepFold outperformed AlphaFold2 on 4 out of the 5 targets with the least homologous sequence information (normalized Neff <0.2), revealing that there is significant room for improvement on very difficult modeling targets.
Furthermore, more efficient and precise MSA construction strategies should be developed to improve the MSA quality and reduce the time required to search the various sequence databases. The need to increase the searching efficiency is particularly important as the increase in the size of the sequence databases, mainly the metagenomics databases, is a double-edged sword. While it enables the collection of more sequences, it also greatly increases the time and computational resources necessary to search the sequence databases and the potential for false negative sequence samples due to the increase in noise. For example, searching a 150-residue protein through MetaClust, which is approximately 100 GB, using DeepMSA2 requires around 1 hour with 1 CPU; however, searching the same protein through the 5TB JGI metagenome database is dramatically more expensive, requiring approximately 4 hours using 50 CPUs. This issue is particularly important for hard modeling targets, which often require extensive homologous sequence detection. As evidence of this, in S6 Fig, we plot the number of times each of the 7 MSAs produced by DeepMSA2 were selected for the 221 benchmark targets. From the figure, it can be seen that ~55% of the targets required searching beyond the MetaClust database, while only ~15% did not require searching through any metagenomics database. Meanwhile, incorrectly collected MSAs, despite having a high number of homologous sequences, can negatively impact the modeling results as witnessed in the CASP experiments [31]. The use of a targeted MSA generation protocol that focuses on searching sequences related to the target protein’s biome represents a promising strategy for improving the speed and quality of the MSA generation and the accuracy of the final 3D structure modeling [32].
Methods
DeepFold is an algorithm that can quickly construct accurate full-length protein structure models from deep learning restraints and consists of three main steps: MSA generation by DeepMSA2, spatial restraint prediction by DeepPotential, and L-BFGS folding simulations, as depicted in Fig 1.
MSA generation by DeepMSA2
DeepMSA2 is an extension of DeepMSA [33] for iterative MSA collection, where the new components include an additional pipeline to search larger sequence databases and a novel MSA selection method based on predicted contact maps (see S1 Fig). Briefly, DeepMSA2 collects 7 candidate MSAs by iteratively searching whole-genome (Uniclust30 and UniRef90) and metagenome (Metaclust, BFD, and Mgnify) sequence databases. The first 3 MSAs are generated using the same procedure as DeepMSA (i.e., dMSA in S1 Fig), where the query sequence is first searched through Uniclust30 (2017_04) by HHblits2 to create MSA-1. Next, the sequences identified by Jackhmmer and HMMsearch are used to construct a custom HHblits database, against which HHblits2 is run starting from the MSA generated in the previous stage to generate MSA-2 and MSA-3, respectively. The four remaining MSAs are generated using a procedure called quadruple MSA (qMSA in S1 Fig), which uses HHblits2 to search the original query sequence against the Uniclust30 database (version 2020_01) to create MSA-4. Next, the sequences detected by Jackhmmer, HHblits3, and HMMsearch through the UniRef90, BFD, and Mgnify databases are used to construct custom HHblits-style databases, against which HHblits2 is employed to search starting from the MSAs generated by the previous stages to create MSA-5, MSA-6, and MSA-7, respectively. To select the final MSA, a quick TripletRes contact map prediction [34] is run starting from each of the 7 MSAs, where the MSA with the highest cumulative probability for the top 10*L all-range contacts is selected as the final MSA.
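The final selection criterion can be summarized with the following sketch, where the contact maps are assumed to come from a quick TripletRes run over each of the 7 candidate MSAs; the function and variable names are illustrative.

```python
import numpy as np

def select_best_msa(contact_prob_maps, L):
    """Score each candidate MSA by the cumulative probability of the top
    10*L all-range contacts predicted from it, and return the index of
    the winning MSA (the criterion described above).

    contact_prob_maps: list of (L, L) contact probability maps, one per
    candidate MSA.
    """
    iu, ju = np.triu_indices(L, k=1)          # all residue pairs i < j
    scores = []
    for cmap in contact_prob_maps:
        probs = np.sort(cmap[iu, ju])[::-1]   # descending probabilities
        scores.append(probs[: 10 * L].sum())  # cumulative top-10L mass
    return int(np.argmax(scores))
```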
Spatial restraint prediction by DeepPotential
Starting from the selected MSAs, two sets of 1D and 2D features are extracted. The 2D features include the raw coupling parameters from the pseudo-likelihood maximized (PLM) 22-state Potts model and the raw mutual information (MI) matrix, where the 22 states of the Potts model represent the 20 standard amino acids, a non-standard amino acid type, and a gap state. Here, a Potts model is a specific type of Markov Random Field (MRF) model that is widely used in protein structure prediction [35–38]. Briefly, an MRF is a graphical model that represents each column of an MSA as a node that describes the distribution of amino acids at a given position (Potts model field parameters), where the edges between nodes indicate the joint distributions of amino acids at each pair of positions. The 2D coupling parameters can then be determined from the edge weights, where residue pairs that exhibit correlated mutation patterns will possess greater edge weights, which can be used to infer positions that should be closer together in 3D space. This is based on the intuition that if two residues are in contact with each other, then when one residue mutates, the contacting residue should also mutate in order to preserve the interaction. In DeepPotential, CCMpred [38] is used to fit the Potts model. The corresponding parameters for each residue pair in the PLM and MI matrices are extracted as additional features that measure query-specific co-evolutionary information in an MSA. The 1D features contain the Potts model field parameters, Hidden Markov Model (HMM) features, and the self-mutual information, along with the one-hot representation of the MSA and other descriptors, such as the number of sequences in the MSA.
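As a simplified illustration of one of these 2D features, the sketch below computes a raw mutual information matrix from an integer-encoded MSA over the 22 states described above; DeepPotential's actual feature extraction (e.g., the CCMpred-fitted Potts couplings) is considerably more involved.

```python
import numpy as np

def mutual_information(msa: np.ndarray, n_states: int = 22) -> np.ndarray:
    """Raw mutual information between MSA columns (no APC correction),
    a simplified stand-in for one of the 2D co-evolutionary features.

    msa: (N, L) integer array; each value encodes one of 22 states
    (20 amino acids, one non-standard type, one gap).
    """
    N, L = msa.shape
    eps = 1e-9
    # per-column amino acid frequencies, shape (L, 22)
    f_i = np.stack([np.bincount(msa[:, i], minlength=n_states) / N
                    for i in range(L)])
    mi = np.zeros((L, L))
    for i in range(L):
        for j in range(i + 1, L):
            f_ij = np.zeros((n_states, n_states))     # joint frequencies
            np.add.at(f_ij, (msa[:, i], msa[:, j]), 1.0 / N)
            ratio = f_ij / (np.outer(f_i[i], f_i[j]) + eps)
            mi[i, j] = mi[j, i] = np.sum(f_ij * np.log(ratio + eps))
    return mi
```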
Next, these 1D and 2D features are fed separately into deep convolutional residual neural networks, where each is passed through a set of one-dimensional or two-dimensional residual blocks, respectively, and the outputs are subsequently tiled together. The tiled feature representations serve as the input to another residual neural network, which outputs the inter-residue interaction terms, including Cα-Cα distances, Cβ-Cβ distances, and the inter-residue orientations (Fig 1). Here, the predicted spatial restraints are represented using bins that correspond to specific distance/angle values, where DeepPotential predicts the probability that each spatial restraint falls within a specific bin. For example, for the Cα and Cβ distances, the predictions are divided into 38 bins, where the first bin represents the probability that the distance is <2Å and the final bin represents the probability that the distance is ≥20Å. The remaining 36 bins tile the range [2Å, 20Å), where each bin has a width of 0.5Å. On the other hand, the 3 orientation features, as defined in S7 Fig, are predicted using a bin width of 15° with an additional bin to indicate that there is no interaction between the two residues (i.e., Cβ-Cβ distance ≥20Å). The DeepPotential models were trained on a set of 26,151 non-redundant proteins collected from the PDB at a pairwise sequence identity cutoff of 35%.
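The distance binning scheme can be expressed directly in code; the following sketch maps distances to the 38 bins described above (the edge-bin "centers" are nominal placeholders of our choosing, since the first and last bins are open-ended).

```python
import numpy as np

def distance_bin_centers() -> np.ndarray:
    """Bin centers for the 38-bin distance histograms: bin 0 covers
    d < 2 A, bins 1-36 tile [2, 20) A in 0.5 A steps, and bin 37
    covers d >= 20 A."""
    centers = np.empty(38)
    centers[0] = 1.75                            # nominal center, <2 A bin
    centers[1:37] = 2.25 + 0.5 * np.arange(36)   # 2.25, 2.75, ..., 19.75
    centers[37] = 20.25                          # nominal center, >=20 A bin
    return centers

def dist_to_bin(d: float) -> int:
    """Map a distance in Angstroms to its histogram bin index."""
    if d < 2.0:
        return 0
    if d >= 20.0:
        return 37
    return 1 + int((d - 2.0) / 0.5)
```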
DeepFold Force Field
The DeepFold energy function is a linear combination of the following terms:
$$E_{\mathrm{DeepFold}} = E_{C_{\beta}\mathrm{dist}} + E_{C_{\alpha}\mathrm{dist}} + E_{C_{\beta}\mathrm{cont}} + E_{C_{\alpha}\mathrm{cont}} + E_{\Omega} + E_{\theta} + E_{\varphi} + E_{\mathrm{hb}} + E_{\mathrm{vdw}} + E_{\mathrm{tor}} \tag{1}$$

where the first seven terms ($E_{C_{\beta}\mathrm{dist}}$, $E_{C_{\alpha}\mathrm{dist}}$, $E_{C_{\beta}\mathrm{cont}}$, $E_{C_{\alpha}\mathrm{cont}}$, $E_{\Omega}$, $E_{\theta}$, and $E_{\varphi}$) account for the Cβ–Cβ distances, Cα–Cα distances, Cβ–Cβ contacts, Cα–Cα contacts, and three inter-residue orientation angles predicted by DeepPotential; and the last three terms ($E_{\mathrm{hb}}$, $E_{\mathrm{vdw}}$, and $E_{\mathrm{tor}}$) denote the generic energy terms for hydrogen bonding, van der Waals clashes, and backbone torsion angles, respectively.
Overall, the DeepFold force field consists of 24 weighting parameters, where the weights given to the deep learning restraints were separated into short-range (1<|i−j|≤11), medium-range (11<|i−j|≤23), and long-range (|i−j|>23) weights, which were determined by maximizing the TM-score on a training set of 257 non-redundant, Hard threading targets collected from the PDB that shared <30% sequence identity with the test proteins. Briefly, all weights were initialized to 0; the weight for each individual energy term was then varied one at a time in increments of 0.25 over the range [0, 25], and the DeepFold folding simulations were run using the new weights. The weight for each term that resulted in the highest average TM-score on the training set was accepted. After the initial weighting parameters were determined, 3 more optimization runs were carried out, where the weight for each energy term was again varied over the range [0, 25] in increments of 0.1 and the weighting parameters that resulted in the highest average TM-score on the training set were accepted. A final optimization run was carried out, where the weights were perturbed within [−2, 2] of their previously accepted values in increments of 0.02 to precisely fine-tune their values. The details of each energy term are further explained in S2 Text in the SI. Since DeepPotential provides the bin-wise histogram probability of the spatial descriptors, these terms are further fit with cubic spline interpolation to facilitate the implementation of the L-BFGS optimization, which requires a continuously differentiable energy function.
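To illustrate the final step, the sketch below converts one residue pair's binned distance probabilities into a smooth, differentiable potential with scipy's cubic splines; normalizing against the last in-range bin as a reference state is our assumption, not necessarily the exact functional form detailed in S2 Text.

```python
import numpy as np
from scipy.interpolate import CubicSpline

def spline_potential(bin_probs: np.ndarray, centers: np.ndarray):
    """Turn one pair's 36 in-range bin probabilities into a smooth
    distance potential via cubic-spline interpolation of the negative
    log probabilities (higher probability -> lower energy).

    Returns the spline E(d) and its analytic derivative dE/dd, both of
    which L-BFGS needs for energy and gradient evaluations.
    """
    eps = 1e-8
    energy = -np.log(bin_probs + eps) + np.log(bin_probs[-1] + eps)
    spline = CubicSpline(centers, energy)
    return spline, spline.derivative()

# usage sketch: probabilities for bins centered on 2.25 ... 19.75 A
centers = 2.25 + 0.5 * np.arange(36)
probs = np.random.dirichlet(np.ones(36))   # placeholder prediction
E, dE = spline_potential(probs, centers)
print(E(8.0), dE(8.0))                     # energy and gradient at 8 A
```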
L-BFGS Folding Simulations
A protein structure in DeepFold is specified by its backbone atoms (N, H, Cα, C, and O), Cβ atoms, and the side-chain centers of mass (S8 Fig). The initial conformations are generated from the backbone torsion angles (ϕ, ψ) predicted by ANGLOR through a small, fully-connected neural network [39], where the Cartesian coordinates of the backbone atoms are determined using simple geometric relationships, assuming ideal bond length and angle values. The conformational search simulations are performed using L-BFGS, with bond lengths and bond angles fixed at their ideal values, so that the optimization is carried out over the backbone torsion angles.
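A standard way to realize such torsion-to-Cartesian conversion is the natural extension reference frame (NeRF) construction sketched below; this is a generic textbook routine under the stated ideal-geometry assumption, not DeepFold's actual implementation.

```python
import numpy as np

def place_atom(a, b, c, bond_len, bond_ang, torsion):
    """NeRF-style placement: position atom d given the three preceding
    atoms a-b-c, the c-d bond length, the b-c-d bond angle, and the
    a-b-c-d torsion (angles in radians, lengths in Angstroms)."""
    bc = c - b
    bc /= np.linalg.norm(bc)
    n = np.cross(b - a, bc)
    n /= np.linalg.norm(n)
    m = np.cross(n, bc)                 # completes the local orthonormal frame
    d_local = np.array([-bond_len * np.cos(bond_ang),
                        bond_len * np.sin(bond_ang) * np.cos(torsion),
                        bond_len * np.sin(bond_ang) * np.sin(torsion)])
    return c + d_local[0] * bc + d_local[1] * m + d_local[2] * n
```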
Here, L-BFGS is a gradient-descent based optimization method that is a limited-memory variant of the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm. At each step $k$, the search direction $d_k$ of the simulation is calculated by

$$d_k = -H_k \nabla E_{\mathrm{DeepFold}}(x_k) \tag{2}$$

where $H_k$ is an estimate of the inverse Hessian matrix and $\nabla E_{\mathrm{DeepFold}}(x)$ represents the gradient of $E_{\mathrm{DeepFold}}(x)$ with respect to the backbone torsion angles $x = (\phi, \psi)$. The value of $H_k$ at step $k = 0$ is set to the identity matrix, $I$, and the value of $H_{k+1}$ is obtained following the BFGS formulation

$$H_{k+1} = \left(I - \rho_k s_k y_k^{\top}\right) H_k \left(I - \rho_k y_k s_k^{\top}\right) + \rho_k s_k s_k^{\top} \tag{3}$$

where $s_k = x_{k+1} - x_k$, $y_k = \nabla E_{\mathrm{DeepFold}}(x_{k+1}) - \nabla E_{\mathrm{DeepFold}}(x_k)$, and $\rho_k = 1/(y_k^{\top} s_k)$. $H_{k+1}$ can be computed recursively by storing the previously calculated values of $s_k$ and $y_k$. To preserve memory, L-BFGS only stores the last $m$ values of $s_k$ and $y_k$. Thus, $H_{k+1}$ is calculated by

$$H_{k+1} = \left(V_k^{\top} \cdots V_{k-m+1}^{\top}\right) H^{0} \left(V_{k-m+1} \cdots V_k\right) + \rho_{k-m+1} \left(V_k^{\top} \cdots V_{k-m+2}^{\top}\right) s_{k-m+1} s_{k-m+1}^{\top} \left(V_{k-m+2} \cdots V_k\right) + \cdots + \rho_k s_k s_k^{\top} \tag{4}$$

where $V_k = I - \rho_k y_k s_k^{\top}$, $H^{0}$ is the initial inverse Hessian estimate, and $m$ is set to 256 in DeepFold. Once the search direction $d_k$ is decided, the torsion angles for the next step are updated according to

$$x_{k+1} = x_k + \alpha_k d_k \tag{5}$$

The value of $\alpha_k$ is determined using the Armijo line search technique [40] and dictates the extent to move along the given search direction. In DeepFold, a maximum of 10 L-BFGS iterations are performed with 2,000 steps each, or until the simulations converge. The final model is selected as the one with the lowest energy produced during the folding simulations.
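For intuition, the driver below reproduces this schedule using scipy's off-the-shelf L-BFGS-B optimizer as a stand-in for DeepFold's in-house L-BFGS ($m$ = 256, Armijo line search); the energy callable and all names are placeholders of this sketch.

```python
import numpy as np
from scipy.optimize import minimize

def fold(energy_and_grad, phi_psi0, max_rounds=10, steps_per_round=2000):
    """Run up to 10 L-BFGS rounds of 2,000 steps each over the backbone
    torsion angles, keeping the lowest-energy conformation seen.

    energy_and_grad: callable x -> (E, dE/dx), standing in for the full
    DeepFold force field. scipy's L-BFGS-B (with its own line search)
    approximates, but is not identical to, the in-house optimizer.
    """
    x, best_x = phi_psi0.copy(), phi_psi0.copy()
    best_e = np.inf
    for _ in range(max_rounds):
        res = minimize(energy_and_grad, x, jac=True, method="L-BFGS-B",
                       options={"maxiter": steps_per_round, "maxcor": 256})
        x = res.x
        if res.fun < best_e:
            best_e, best_x = res.fun, x.copy()
        if res.success:          # converged before exhausting the rounds
            break
    return best_x, best_e
```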
Supporting information
Acknowledgments
We thank Dr. Wei Zheng for a portion of the design of Fig 1 and S1 Fig.
Data Availability
All relevant data are within the manuscript and its Supporting Information files.
Funding Statement
This work is supported in part by the National Institute of General Medical Sciences (GM136422, S10OD026825 to YZ), the National Institute of Allergy and Infectious Diseases (AI134678 to YZ), the National Science Foundation (IIS1901191, DBI2030790, MTM2025426 to YZ), and the National Institutes of Health (U24CA210967, P30ES017885 to GSO). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Zhang Y. Progress and challenges in protein structure prediction. Curr Opin Struct Biol. 2008;18(3):342–8. doi: 10.1016/j.sbi.2008.02.004
- 2. Dunbrack R, editor. Template-based modeling assessment in CASP11. 11th Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction; 2014; Riviera Maya, Mexico.
- 3. Kinch LN, Li W, Monastyrskyy B, Kryshtafovych A, Grishin NV. Evaluation of free modeling targets in CASP11 and ROLL. Proteins. 2016;84 Suppl 1:51–66. doi: 10.1002/prot.24973
- 4. Pearce R, Zhang Y. Deep learning techniques have significantly impacted protein structure prediction and protein design. Curr Opin Struct Biol. 2021;68:194–207. doi: 10.1016/j.sbi.2021.01.007
- 5. Pearce R, Zhang Y. Toward the solution of the protein structure prediction problem. J Biol Chem. 2021:100870. doi: 10.1016/j.jbc.2021.100870
- 6. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016:770–778.
- 7. Yang J, Yan R, Roy A, Xu D, Poisson J, Zhang Y. The I-TASSER Suite: protein structure and function prediction. Nat Methods. 2015;12(1):7–8. doi: 10.1038/nmeth.3213
- 8. Xu D, Zhang Y. Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field. Proteins. 2012;80(7):1715–35. doi: 10.1002/prot.24065
- 9. Jia K, Jernigan RL. New amino acid substitution matrix brings sequence alignments into agreement with structure matches. Proteins. 2021;89(6):671–82. doi: 10.1002/prot.26050
- 10. Xu J. Distance-based protein folding powered by deep learning. Proc Natl Acad Sci U S A. 2019;116(34):16856–65. doi: 10.1073/pnas.1821309116
- 11. Senior AW, Evans R, Jumper J, Kirkpatrick J, Sifre L, Green T, et al. Improved protein structure prediction using potentials from deep learning. Nature. 2020;577(7792):706–10. doi: 10.1038/s41586-019-1923-7
- 12. Yang J, Anishchenko I, Park H, Peng Z, Ovchinnikov S, Baker D. Improved protein structure prediction using predicted interresidue orientations. Proc Natl Acad Sci U S A. 2020;117(3):1496–503. doi: 10.1073/pnas.1914677117
- 13. Zheng W, Li Y, Zhang C, Zhou X, Pearce R, Bell EW, et al. Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14. Proteins. 2021. doi: 10.1002/prot.26193
- 14. Simons KT, Kooperberg C, Huang E, Baker D. Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J Mol Biol. 1997;268(1):209–25. doi: 10.1006/jmbi.1997.0959
- 15. Li W, Zhang Y, Kihara D, Huang YJ, Zheng D, Montelione GT, et al. TOUCHSTONEX: protein structure prediction with sparse NMR data. Proteins. 2003;53(2):290–306. doi: 10.1002/prot.10499
- 16. Barth P, Wallner B, Baker D. Prediction of membrane protein structures with complex topologies using limited constraints. Proc Natl Acad Sci U S A. 2009;106(5):1409–14. doi: 10.1073/pnas.0808323106
- 17. Wu S, Zhang Y. A comprehensive assessment of sequence-based and template-based methods for protein contact prediction. Bioinformatics. 2008;24(7):924–31. doi: 10.1093/bioinformatics/btn069
- 18. Zheng W, Li Y, Zhang C, Zhou X, Pearce R, Bell EW, et al. Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14. Proteins. 2021. doi: 10.1002/prot.26193
- 19.Zheng W, Zhang CX, Wuyun QQG, Pearce R, Li Y, Zhang Y. LOMETS2: improved meta-threading server for fold-recognition and structure-based function annotation for distant-homology proteins. Nucleic Acids Res. 2019;47(W1):W429–W36. doi: 10.1093/nar/gkz384 WOS:000475901600062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhang Y, Skolnick J. Scoring function for automated assessment of protein structure template quality. Proteins. 2004;57(4):702–10. Epub 2004/10/12. doi: 10.1002/prot.20264 . [DOI] [PubMed] [Google Scholar]
- 21.Williams CJ, Headd JJ, Moriarty NW, Prisant MG, Videau LL, Deis LN, et al. MolProbity: More and better reference data for improved all-atom structure validation. Protein Sci. 2018;27(1):293–315. Epub 2017/10/27. doi: 10.1002/pro.3330 ; PubMed Central PMCID: PMC5734394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rost B, Sander C, Schneider R. Redefining the goals of protein secondary structure prediction. Journal of molecular biology. 1994;235(1):13–26. Epub 1994/01/07. doi: 10.1016/s0022-2836(05)80007-5 . [DOI] [PubMed] [Google Scholar]
- 23.Zheng W, Zhang C, Li Y, Pearce R, Bell EW, Zhang Y. Folding non-homology proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. Cell Reports Methods. 2021;1:100014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Greener JG, Kandathil SM, Jones DT. Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints. Nat Commun. 2019;10. doi: ARTN 3977 10.1038/s41467-019-11994-0 WOS:000483716100003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021. Epub 2021/07/16. doi: 10.1038/s41586-021-03819-2 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Baek M, DiMaio F, Anishchenko I, Dauparas J, Ovchinnikov S, Lee GR, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science. 2021;373(6557):871–6. Epub 20210715. doi: 10.1126/science.abj8754 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kinch L, Yong Shi S, Cong Q, Cheng H, Liao Y, Grishin NV. CASP9 assessment of free modeling target predictions. Proteins. 2011;79 Suppl 10:59–73. Epub 2011/10/15. doi: 10.1002/prot.23181 ; PubMed Central PMCID: PMC3226891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tai CH, Bai H, Taylor TJ, Lee B. Assessment of template-free modeling in CASP10 and ROLL. Proteins. 2014;82 Suppl 2:57–83. doi: 10.1002/prot.24470 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Rives A, Meier J, Sercu T, Goyal S, Lin Z, Guo D, et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. bioRxiv. 2020:622803. doi: 10.1101/622803 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Rao R, Liu J, Verkuil R, Meier J, Canny JF, Abbeel P, et al. MSA Transformer. bioRxiv. 2021:2021.02.12.430858. doi: 10.1101/2021.02.12.430858 [DOI] [Google Scholar]
- 31.Zheng W, Li Y, Zhang C, Pearce R, Mortuza SM, Zhang Y. Deep-learning contact-map guided protein structure prediction in CASP13. Proteins. 2019. Epub 2019/08/01. doi: 10.1002/prot.25792 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Yang P, Zheng W, Ning K, Zhang Y. Decoding microbiome and protein family linkage to improve protein structure prediction. bioRxiv. 2021:2021.04.15.440088. doi: 10.1101/2021.04.15.440088 [DOI] [Google Scholar]
- 33.Zhang C, Zheng W, Mortuza SM, Li Y, Zhang Y. DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins. Bioinformatics. 2020;36(7):2105–12. Epub 2019/11/19. doi: 10.1093/bioinformatics/btz863 ; PubMed Central PMCID: PMC7141871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Li Y, Zhang C, Bell EW, Zheng W, Zhou X, Yu DJ, et al. Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks. PLoS computational biology. 2021;17(3):e1008865. Epub 2021/03/27. doi: 10.1371/journal.pcbi.1008865 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Weigt M, White RA, Szurmant H, Hoch JA, Hwa T. Identification of direct residue contacts in protein-protein interaction by message passing. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(1):67–72. Epub 2009/01/01. doi: 10.1073/pnas.0805923106 ; PubMed Central PMCID: PMC2629192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Morcos F, Pagnani A, Lunt B, Bertolino A, Marks DS, Sander C, et al. Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(49):E1293–301. doi: 10.1073/pnas.1111471108 ; PubMed Central PMCID: PMC3241805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kamisetty H, Ovchinnikov S, Baker D. Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era. Proceedings of the National Academy of Sciences of the United States of America. 2013;110(39):15674–9. doi: 10.1073/pnas.1314045110 ; PubMed Central PMCID: PMC3785744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Seemayer S, Gruber M, Soding J. CCMpred—fast and precise prediction of protein residue-residue contacts from correlated mutations. Bioinformatics. 2014;30(21):3128–30. doi: 10.1093/bioinformatics/btu500 ; PubMed Central PMCID: PMC4201158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wu ST, Zhang Y. ANGLOR: A Composite Machine-Learning Algorithm for Protein Backbone Torsion Angle Prediction. Plos One. 2008;3(10). doi: 10.1371/journal.pone.0003400 WOS:000265121800002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Armijo L. Minimization of Functions Having Lipschitz Continuous First Partial Derivatives. Pac J Math. 1966;16(1):1–&. doi: 10.2140/pjm.1966.16.1 WOS:A19667408000001. [DOI] [Google Scholar]