Skip to main content
Protein Science : A Publication of the Protein Society logoLink to Protein Science : A Publication of the Protein Society
. 1998 Jan;7(1):112–122. doi: 10.1002/pro.5560070112

How do potentials derived from structural databases relate to "true" potentials?

L Zhang 1, J Skolnick 1
PMCID: PMC2143818  PMID: 9514266

Abstract

Knowledge-based potentials are used widely in protein folding and inverse folding algorithms. Two kinds of derivation methods are used. (1) The interactions in a database of known protein structures are assumed to obey a Boltzmann distribution. (2) The stability of the native folds relative to a manifold of misfolded structures is optimized. Here, a set of previously derived contact and secondary structure propensity potentials, taken as the "true" potentials, are employed to construct an artificial protein structural database from protein fragments. Then, new sets of potentials are derived to see how they are related to the true potentials. Using the Boltzmann distribution method, when the stability of the structures in the database lies within a certain range, both contact potentials and secondary structure propensities can be derived separately with remarkable accuracy. In general, the optimization method was found to be less accurate due to errors in the "excess energy" contribution. When the excess energy terms are kept as a constraint, the true potentials are recovered exactly.

Full Text

The Full Text of this article is available as a PDF (967.3 KB).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Chou P. Y., Fasman G. D. Prediction of protein conformation. Biochemistry. 1974 Jan 15;13(2):222–245. doi: 10.1021/bi00699a002. [DOI] [PubMed] [Google Scholar]
  2. Finkelstein A. V., Badretdinov AYa, Gutin A. M. Why do protein architectures have Boltzmann-like statistics? Proteins. 1995 Oct;23(2):142–150. doi: 10.1002/prot.340230204. [DOI] [PubMed] [Google Scholar]
  3. Finkelstein A. V., Reva B. A. Search for the most stable folds of protein chains: I. Application of a self-consistent molecular field theory to a problem of protein three-dimensional structure prediction. Protein Eng. 1996 May;9(5):387–397. doi: 10.1093/protein/9.5.387. [DOI] [PubMed] [Google Scholar]
  4. Godzik A. Knowledge-based potentials for protein folding: what can we learn from known protein structures? Structure. 1996 Apr 15;4(4):363–366. doi: 10.1016/s0969-2126(96)00041-x. [DOI] [PubMed] [Google Scholar]
  5. Jones D. T., Thornton J. M. Potential energy functions for threading. Curr Opin Struct Biol. 1996 Apr;6(2):210–216. doi: 10.1016/s0959-440x(96)80076-5. [DOI] [PubMed] [Google Scholar]
  6. Kocher J. P., Rooman M. J., Wodak S. J. Factors influencing the ability of knowledge-based potentials to identify native sequence-structure matches. J Mol Biol. 1994 Feb 4;235(5):1598–1613. doi: 10.1006/jmbi.1994.1109. [DOI] [PubMed] [Google Scholar]
  7. Kolinski A., Skolnick J. Monte Carlo simulations of protein folding. I. Lattice model and interaction scheme. Proteins. 1994 Apr;18(4):338–352. doi: 10.1002/prot.340180405. [DOI] [PubMed] [Google Scholar]
  8. Maiorov V. N., Crippen G. M. Contact potential that recognizes the correct folding of globular proteins. J Mol Biol. 1992 Oct 5;227(3):876–888. doi: 10.1016/0022-2836(92)90228-c. [DOI] [PubMed] [Google Scholar]
  9. Mirny L., Domany E. Protein fold recognition and dynamics in the space of contact maps. Proteins. 1996 Dec;26(4):391–410. doi: 10.1002/(SICI)1097-0134(199612)26:4<391::AID-PROT3>3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]
  10. Miyazawa S., Jernigan R. L. Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol. 1996 Mar 1;256(3):623–644. doi: 10.1006/jmbi.1996.0114. [DOI] [PubMed] [Google Scholar]
  11. Serrano L., Day A. G., Fersht A. R. Step-wise mutation of barnase to binase. A procedure for engineering increased stability of proteins and an experimental analysis of the evolution of protein stability. J Mol Biol. 1993 Sep 20;233(2):305–312. doi: 10.1006/jmbi.1993.1508. [DOI] [PubMed] [Google Scholar]
  12. Sippl M. J. Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures. J Comput Aided Mol Des. 1993 Aug;7(4):473–501. doi: 10.1007/BF02337562. [DOI] [PubMed] [Google Scholar]
  13. Sippl M. J. Knowledge-based potentials for proteins. Curr Opin Struct Biol. 1995 Apr;5(2):229–235. doi: 10.1016/0959-440x(95)80081-6. [DOI] [PubMed] [Google Scholar]
  14. Skolnick J., Jaroszewski L., Kolinski A., Godzik A. Derivation and testing of pair potentials for protein folding. When is the quasichemical approximation correct? Protein Sci. 1997 Mar;6(3):676–688. doi: 10.1002/pro.5560060317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Thomas P. D., Dill K. A. Statistical potentials extracted from protein structures: how accurate are they? J Mol Biol. 1996 Mar 29;257(2):457–469. doi: 10.1006/jmbi.1996.0175. [DOI] [PubMed] [Google Scholar]
  16. Wilson C., Doniach S. A computer model to dynamically simulate protein folding: studies with crambin. Proteins. 1989;6(2):193–209. doi: 10.1002/prot.340060208. [DOI] [PubMed] [Google Scholar]

Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society

RESOURCES