Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1999 May 11;96(10):5482–5485. doi: 10.1073/pnas.96.10.5482

Protein structure prediction by global optimization of a potential energy function

Adam Liwo *,†, Jooyoung Lee *, Daniel R Ripoll , Jaroslaw Pillardy *, Harold A Scheraga *,§
PMCID: PMC21885  PMID: 10318909

Abstract

An approach based exclusively on finding the global minimum of an appropriate potential energy function has been used to predict the unknown structures of five globular proteins with sizes ranging from 89 to 140 amino acid residues. Comparison of the computed lowest-energy structures of two of them (HDEA and MarA) with the crystal structures, released by the Protein Data Bank after the predictions were made, shows that large fragments (61 residues) of both proteins were predicted with rms deviations of 4.2 and 6.0 Å for the Cα atoms, for HDEA and MarA, respectively. This represents 80% and 53% of the observed structures of HDEA and MarA, respectively. Similar rms deviations were obtained for ∼60-residue fragments of the other three proteins. These results constitute an important step toward the prediction of protein structure based solely on global optimization of a potential energy function for a given amino acid sequence.


Prediction of protein structure based on sequence information alone is one of the challenges of contemporary structural biology. There are three classes of approach to the structure-prediction problem: sequence-homology methods, methods based on energetic criteria, and threading methods. In the first method, the unknown structure is constructed based on known structural motifs whose amino acid sequences are similar to the sequence studied, taking advantage of the empirical relationship between sequence and the three-dimensional structure (16). The methods of the second group (79) are based on the thermodynamic hypothesis formulated by Anfinsen (10), according to which the native structure of a protein corresponds to the global minimum of its free energy under given conditions. Structure prediction is therefore achieved by a search for the global minimum of an appropriate potential energy function; this is often called the ab initio or de novo approach. Throughout this paper, the ab initio approach to the protein-folding problem is meant to refer to methods based solely on global optimization of a potential energy function. The threading methods can be placed between these two approaches: they use the energy (or energy-like) functions to distinguish the native structure from alternative structures, but the unknown sequence is superposed on structural motifs chosen from a database of known protein structures (11).

Although sequence homology and threading methods are thus far the most successful tools for protein-structure prediction, their success depends on the presence of sequence- or structural-homologous proteins in the databases. On the other hand, global optimization of a potential energy function is based on physical grounds, but thus far has had little success. Protein structure prediction based solely on the thermodynamic hypothesis has been considered to be unfeasible (12, 13). The reason for this is both the inaccuracy of the potential energy functions devised to represent the protein energy landscape and the lack of powerful methods for global optimization. Thus, some researchers have introduced variants of ab initio methods that include, as a major part of the procedure, secondary-structure predictions and multiple-sequence alignments that are used as constraints in subsequent conformational searches. These methods (1417) have achieved an important degree of success in predicting the structures of a number of proteins. Here, we describe a method for protein structure prediction that is based solely on global optimization of a potential energy function.

Methodology.

Reduced representations of proteins (in which each amino acid residue is represented by one or a few interaction sites) have recently been used with great success in theoretical studies of the physics of protein folding (1823), as well as in designing methods for protein-structure prediction (1417). Recently, we devised a hierarchical approach to protein-structure prediction (24, 25), whose key stage is global optimization of off-lattice simplified polypeptide chains. In subsequent stages, the low-energy conformations of a protein obtained at the coarse-grain level are gradually transformed to all-atom chains. As in related approaches (3, 1417, 2628), the use of a reduced representation of polypeptide chains allows one to explore the conformational space in reasonable time, provided that adequately good global-search techniques are used. An off-lattice approach provides the possibility of using efficient optimization methods that require a continuous representation of the conformational space.

In the last few years, we have developed a united-residue (UNRES) force field (24, 25, 2931) that proved successful in predicting the native-like structures of small globular proteins with simple topology (such as a fragment of the B domain of staphylococcal protein A) as the lowest-energy conformations. In our model (24, 25, 29), a polypeptide chain is represented initially by a sequence of α-carbon (Cα) atoms linked by virtual bonds with attached united side chains (SC) and united peptide groups (p) located in the middle between the consecutive α-carbons. Only the united peptide groups and united side chains serve as interaction sites, the α-carbons assisting in the definition of the geometry (see Fig. 1 of ref. 29). All the virtual bond lengths (i.e., Cα–Cα and Cα–SC) are fixed; the Cα–Cα distance is taken as 3.8 Å, which corresponds to trans peptide groups, while the side-chain angles (αSC and βSC), as well as the virtual-bond angles (θ) and the dihedral angles (γ) can vary.

The energy of the virtual-bond chain is expressed by Eq. 1.

graphic file with name M1.gif
graphic file with name M2.gif 1

The term USCiSCj consists of the mean free energy of the hydrophobic (hydrophilic) interactions between the side chains. It therefore implicitly contains the contributions arising from the interactions with the solvent. The terms USCipj denote the excluded-volume potential of the side-chain–peptide-group interactions. The peptide-group interaction potential (Upipj) accounts mainly for the electrostatic interactions between them or, in other words, for their tendency to form backbone hydrogen bonds. Utor, Ub, and Urot denote the energies of virtual-dihedral angle torsions, virtual-angle bending, and side-chain rotamers; these terms reflect the local propensities of the polypeptide chain. Finally, the multibody (or cooperative) term Ucorr arises from the fact that details of the all-atom chain are lost when converting it into the simplified chain. For the functional forms of the energy terms, the reader is referred to the original papers (24, 25, 2931). The ws denote relative weights of the respective energy terms. The energy expressions were parameterized based on distribution and correlation functions determined from a set of 195 high-resolution nonhomologous structures from the Protein Data Bank (PDB) (32) or by averaging the all-atom energy functions. The relative weights of the energy terms were calculated so as to maximize the ratio of the gap between the energy of the native structure to the average energy of the nonnative structures of the phosphocarrier protein (1PTF) chosen to calibrate the force field (30). This was accomplished by developing a method (30) based on the approach of Wolynes and coworkers (19), Shakhnovich and coworkers (20), and Hao and Scheraga (21, 23, 33, 34).

The effectiveness of the approach relies on the global optimization method. Recently, we developed a very efficient method called Conformational Space Annealing (CSA) (3538) for this purpose. The CSA method searches the whole conformational space in its early stages and then narrows the search to smaller regions with low energy as the distance cutoff, Dcut, which defines the similarity of two conformations, is reduced. As in genetic algorithms (39), CSA starts with a preassigned number (usually 50) of randomly generated and subsequently energy-minimized conformations. This pool of conformations is called the bank. At the beginning, the bank is a sparse representation of the entire conformational space. A number of dissimilar conformations (usually 20) are then selected from the bank, excluding those that have already been used; they are called seeds. Each seed conformation is modified by changing from one to one-third of the total number of variables pertaining to a contiguous portion of the chain; the new variables are selected from one of the remaining bank conformations, rather than being picked at random. Each conformation is energy-minimized to give a trial conformation. Thirty trial conformations are generated for each seed (a total of 600 conformations). This is the most time-consuming part of the computation, but it is highly suitable for parallel computing (38). For each trial conformation, α, the closest conformation A from the bank (in terms of the distance DαA) is determined. If DαA<Dcut (Dcut being the current cutoff criterion), α is considered similar to A; in this case α replaces A in the bank, if it is also lower in energy. If α is not similar to A, but its energy is lower than that of the highest-energy conformation in the bank, B, α replaces B. If neither of the above conditions holds, α is rejected. The narrowing of the search regions is accomplished by setting Dcut to a large value initially (usually one-half of the average pair distance in the bank), and gradually reducing it as the search progresses. Special attention is paid to selecting seeds that are far from each other. One round of the procedure is completed when there is no seed to select (i.e., all conformations from the bank have already been used). The round is repeated a predetermined number of times. The greatest advantage of the CSA method is that it always finds distinct families of low-energy conformations.

The low-energy united-residue structures are subsequently converted to all-atom chains in the following steps (25): (i) positioning the peptide groups between consecutive Cαs so as to achieve optimal alignment of the peptide group dipoles (the dipole-path method) (24); (ii) further optimization of the backbone conformations by using the Electrostatically Driven Monte Carlo (EDMC) method (4042); (iii) adding the side chains with partial optimization of their degrees of freedom; (iv) final refinement of the all-atom chains by using the EDMC method and exploration of the flexible loop regions with the use of the Gō-Scheraga algorithm (43, 44). In the all-atom calculations, the empirical conformational energy program for peptides (ECEPP)/3 (4547) force field with the solvation parameters, optimized from thermodynamic data with fixed radii (SRFOPT) solvation free energy contribution (48), is used.

RESULTS AND DISCUSSION

With the UNRES force field, the CSA method successfully located the native-like conformations of two helical proteins (the 10–55 residue fragment of protein A, and apo calbindin D9K—a 75-residue protein) among the low-energy ones. Alternative structures (which were mirror images of the native folds) were also found. Details of the implementation of the CSA procedure with the UNRES force field are given elsewhere (37). It should be noted that the structures of protein A or apo calbindin D9K or the class of fold that they represent were not used in the optimization of the force field.

These successful predictions encouraged us to try blind predictions of some of the target proteins provided for the Third Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP3). The amino acid sequences of these targets had been volunteered by experimental structural biologists who were in the process of determining their three-dimensional structures by nuclear magnetic resonance (NMR) spectroscopy or x-ray crystallography. Our group submitted predictions for seven of these targets. For all five globular proteins, large portions of contiguous fragments (∼60) were predicted within the range of 4.2–6.8 Å rmsd (rms deviation) of Cα atoms from the experimental structures. For the two remaining targets, corresponding to small synthetic constructs, we had a similar degree of success. The details of these predictions are discussed elsewhere (49). At the time of writing this report, the structures of most of the targets have been made available only to the participants in CASP3. Additionally, experimental structures of three of them have been publicly released: T0056 (NMR structure of the N-terminal domain of Escherichia coli Dnab helicase; PDB entry: 1JWE), T0061 [x-ray structure of HDEA, E. coli (50); 89 amino acid residues; PDB entry: 1BG8] and T0079 [x-ray structure of MarA (51); 129 amino acid residues; PDB entry: 1BL0]. For target T0056, some secondary structure information was provided to the predictors. A single model was submitted for T0056, and analysis of this prediction is reported elsewhere (49). Here, we discuss the performance of our method on the other two proteins in some detail. For proteins HDEA and MarA, we submitted four models ranked according to the UNRES energy, each of them representing a low-energy family of conformations. The discussion refers to the lowest-energy conformations (submitted as model 1 for each protein).

The native structure of HDEA is a five-helix bundle with a long loop between the third (H-3) and the fourth (H-4) helix (50) (Fig. 1). The rmsd between the crystal and our predicted structure (see Table 1) is 4.2 Å for the Cα atoms, when the structures are superposed from residues D25 to I85 (80% of the experimentally observed structure), which corresponds to native helices H-3, H-4, and H-5 (the first nine and the last four residues are missing in the crystal structure). Our model is different from the native structure in the packing of the 27-residue N-terminal part (residues 1 to 9 are not observed in the crystal structure). This N-terminal fragment, which contains H-2, is rotated by approximately 180°, resulting in an overall rmsd (residues 10–85) of 9.0 Å for the Cα atoms. Superposing helices H-2 and H-3 (residues W16 to K42), we obtain an rmsd for the Cα atoms of 2.9 Å (Figure 2), which means that our predicted structure can be transformed into the native topology by only a single rotation of the N-terminal helix—turn–helix motif. It should be noted that the native protein forms a dimer in the crystal state and has a disulfide bond linking helices H-2 and H-4, which is missing in our model. Information about the existence of a disulfide bond was not provided with the target. HDEA was assessed by the CASP3 referees as one of the most difficult targets for either homology modeling or threading methods because of its rare fold. Our prediction was evaluated (51) as the most successful one for this particular target.

Figure 1.

Figure 1

Superposition of the crystal (red) and predicted (yellow) structures of HDEA. The Cα atoms of the fragment included between residues D25 and I85 were superposed. The rmsd is 4.2 Å. Helices 3, 4, and 5 are indicated as H-3, H-4 and H-5, respectively.

Table 1.

Summary of results

Protein (no. of aa) No. of aa in x-ray structure No. of aa in a fragment Cαrmsd, Å Percentage (predicted/observed)
HDEA (89) 76 76 (10–85) 9.0 100
61  (25–85) 4.2 80
27  (16–42) 2.9 36
MarA (129) 116 116  (9–124) 11.0 100
61  (9–69) 6.0 53

Figure 2.

Figure 2

Superposition of the crystal (red) and predicted (yellow) structures of the 27-residue fragment (W16 to K42) of HDEA. The Cα atoms were superposed with an rmsd of 2.9 Å. Helices 2 and 3 are indicated as H-2, and H-3, respectively.

Our second prediction pertains to the protein MarA. In the crystal structure of a complex with DNA (PDB code: 1BL0), MarA appears as a bipartite helix—turn–helix protein (51). It contains two domains composed of four helices each (our simulations resulted in low-energy conformations containing two domains, a feature that was not known a priori). Our predicted structure has the same topology of the N-terminal domain as in the crystal (52) (residues 1–8 are not observed in the x-ray structure). The rmsd between the predicted and experimental structures for the fragment containing residues D9–A69 is 6.0 Å for the Cα atoms (Figure 3). The major difference between our prediction and the crystal structure is that the C-terminal domain in the predicted structure is folded as a mirror image of that part of the native structure. Furthermore, the long helix, H-4, connecting the two domains in the x-ray structure is significantly kinked in the middle in our predicted structure. The latter features result in tight packing of the two domains in our model whereas, in the native structure, the N- and the C-terminal domains each bind separately to DNA (and hence are not packed against each other). This leads to an rmsd between the predicted and native structures of 11.0 Å for all the observed Cα atoms. Our calculations were carried out on the MarA protein only in the absence of the DNA.

Figure 3.

Figure 3

Superposition of the crystal (red) and predicted (yellow) structures of the MarA N-terminal domain. The Cα atoms of residues D9 to A69 were superposed with an rmsd of 6.0 Å. Helices 1, 2, 3, and 4 are indicated as H-1, H-2, H-3, and H-4, respectively.

CONCLUSIONS

Our results demonstrate that it is possible to predict a significant portion of protein structure by using only a potential energy function and a powerful method of conformational search, without the aid of knowledge-based information provided by multiple-sequence alignment, secondary-structure prediction, or fold recognition. The probability of obtaining a structure within an rmsd of 6 Å for 60–80 residue proteins has been discussed recently, and a prediction with such an rmsd was considered to be quite successful, i.e., with a very low probability of having been obtained by chance (17).

In our view, there are two reasons for this success. First, although our UNRES force field is approximate, it is based on physical grounds, both in deriving the energy terms and in its parameterization (25, 29, 30). Second, the CSA method of conformational search appears to locate all significant low-energy structures in real time. This is very important because, given the approximations inherent in the force field, the lowest-energy structure may not be considered as the only possible candidate for the prediction, but a few distinct low-energy structures should be taken into account. Further improvement of our approach can be accomplished by systematic interplay between CSA and UNRES. For example, by introducing cooperativity between local and backbone electrostatic interactions, it may be possible to extend the applicability of our approach from the largely α-helical proteins treated here to those including segments of β-structure.

It should also be emphasized that, at the present stage of development, our approach can treat proteins of up to 140 residues with presently available computer resources in less than a week of computations. For MarA and HDEA, the calculations consumed a total of 100 and 70 hr of wall clock time, respectively, by using 64 processors of the IBM SP2 supercomputer at the Cornell Theory Center.

Acknowledgments

We thank K. D. Gibson for helpful discussions and D. Sverdlik and D. Dwyer of the Cornell Theory Center for technical assistance. This research was supported by grants from the National Science Foundation (MCB95-13167), the National Institutes of Health (NIH) (GM-14312), the NIH National Center for Research Resources (P41RR-04293), and the Polish State Committee for Scientific Research, KBN (DS 8245-4-0124-9). Support was also received from the National Foundation for Cancer Research. The computations in this work were carried out at the Cornell Theory Center, which receives funding from Cornell University, New York State, the National Center for Research Resources at the National Institutes of Health, the National Science Foundation, the Department of Defense Modernization Program, and members of the Corporate Partnership Program.

ABBREVIATIONS

UNRES

united-residue force field

CSA

conformational space annealing

rmsd

rms deviation

PDB

Protein Data Bank

Footnotes

Bystroff, C., Simons, K. & Baker, D. Third Meeting on the Critical Assessment of Techniques for Protein Structure Prediction, Dec. 13–17, 1998, Pacific Grove, CA, p. 23.

Third Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction, Dec. 13–17, 1998, Pacific Grove, CA. http://predictioncenter.llnl.gov/casp3/Casp3.html.

References

  • 1.Lewis P N, Scheraga H A. Arch Biochem Biophys. 1971;144:584–588. doi: 10.1016/0003-9861(71)90364-x. [DOI] [PubMed] [Google Scholar]
  • 2.Warme P K, Momany F A, Rumball S V, Tuttle R W, Scheraga H A. Biochemistry. 1974;13:768–782. doi: 10.1021/bi00701a020. [DOI] [PubMed] [Google Scholar]
  • 3.Jones T A, Thirup S. EMBO J. 1986;5:819–822. doi: 10.1002/j.1460-2075.1986.tb04287.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Clark D A, Shirazi J, Rawlings C J. Prot Eng. 1991;4:751–760. doi: 10.1093/protein/4.7.751. [DOI] [PubMed] [Google Scholar]
  • 5.Rooman M J, Wodak S J. Biochemistry. 1992;31:10239–10249. doi: 10.1021/bi00157a010. [DOI] [PubMed] [Google Scholar]
  • 6.Johnson M S, Overington J P, Blundell T L. J Mol Biol. 1993;231:735–752. doi: 10.1006/jmbi.1993.1323. [DOI] [PubMed] [Google Scholar]
  • 7.Scheraga H A. Int J Quant Chem. 1992;42:1529–1536. [Google Scholar]
  • 8.Scheraga H A. Biophys Chem. 1996;59:329–339. doi: 10.1016/0301-4622(95)00126-3. [DOI] [PubMed] [Google Scholar]
  • 9.Vásquez M, Némethy G, Scheraga H A. Chem Rev. 1994;94:2183–2239. [Google Scholar]
  • 10.Anfinsen C B. Science. 1973;181:223–230. doi: 10.1126/science.181.4096.223. [DOI] [PubMed] [Google Scholar]
  • 11.Fischer D, Rice D, Bowie J U, Eisenberg D. FASEB J. 1996;10:126–136. doi: 10.1096/fasebj.10.1.8566533. [DOI] [PubMed] [Google Scholar]
  • 12.Jones D T. Curr Opin Struct Biol. 1997;7:377–387. doi: 10.1016/s0959-440x(97)80055-3. [DOI] [PubMed] [Google Scholar]
  • 13.Mirny L A, Shakhnovich E I. J Mol Biol. 1998;283:507–526. doi: 10.1006/jmbi.1998.2092. [DOI] [PubMed] [Google Scholar]
  • 14.Skolnick J, Koliński A, Brooks III C L, Godzik A, Rey A. Curr Biol. 1993;3:414–424. doi: 10.1016/0960-9822(93)90348-r. [DOI] [PubMed] [Google Scholar]
  • 15.Koliński A, Skolnick J. Proteins Struct Funct Genet. 1994;18:338–352. doi: 10.1002/prot.340180405. [DOI] [PubMed] [Google Scholar]
  • 16.Skolnick J, Koliński A, Ortiz A R. J Mol Biol. 1997;265:217–241. doi: 10.1006/jmbi.1996.0720. [DOI] [PubMed] [Google Scholar]
  • 17.Reva B A, Finkelstein A V, Skolnick J. Fold Des. 1998;3:141–147. doi: 10.1016/s1359-0278(98)00019-4. [DOI] [PubMed] [Google Scholar]
  • 18.Dill K A. Biochemistry. 1990;29:7133–7155. doi: 10.1021/bi00483a001. [DOI] [PubMed] [Google Scholar]
  • 19.Goldstein R A, Luthey-Schulten Z A, Wolynes P G. Proc Natl Acad Sci USA. 1992;89:9029–9033. doi: 10.1073/pnas.89.19.9029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sǎli A, Shakhnovich E, Karplus M. Nature (London) 1994;369:248–251. doi: 10.1038/369248a0. [DOI] [PubMed] [Google Scholar]
  • 21.Hao M H, Scheraga H A. J Phys. Chem. 1994;98:9882–9893. [Google Scholar]
  • 22.Wolynes P G, Onuchic J N, Thirumalai D. Science. 1995;267:1619–1620. doi: 10.1126/science.7886447. [DOI] [PubMed] [Google Scholar]
  • 23.Hao M H, Scheraga H A. J Chem Phys. 1995;102:1334–1348. [Google Scholar]
  • 24.Liwo A, Pincus M R, Wawak R J, Rackovsky S, Scheraga H A. Protein Sci. 1993;2:1697–1714. doi: 10.1002/pro.5560021015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Liwo A, Pincus M R, Wawak R J, Rackovsky S, Scheraga H A. Protein Sci. 1993;2:1715–1731. doi: 10.1002/pro.5560021016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Levitt M, Warshel A. Nature (London) 1975;253:694–698. doi: 10.1038/253694a0. [DOI] [PubMed] [Google Scholar]
  • 27.Seetharamulu P, Crippen G M. J Math Chem. 1991;6:91–110. [Google Scholar]
  • 28.Maiorov V N, Crippen G M. J Mol. Biol. 1992;227:876–888. doi: 10.1016/0022-2836(92)90228-c. [DOI] [PubMed] [Google Scholar]
  • 29.Liwo A, Oldziej S, Pincus M R, Wawak R J, Rackovsky S, Scheraga H A. J Comput Chem. 1997;18:849–873. [Google Scholar]
  • 30.Liwo A, Pincus M R, Wawak R J, Rackovsky S, Oldziej S, Scheraga H A. J Comput Chem. 1997;18:874–887. [Google Scholar]
  • 31.Liwo A, Kaźmierkiewicz R, Czaplewski C, Groth M, Oldziej S, Wawak R J, Rackovsky S, Pincus M R, Scheraga H A. J Comput Chem. 1998;19:259–276. [Google Scholar]
  • 32.Bernstein F C, Koetzle T F, Williams G J B, Meyer E F, Jr, Brice M D, Rodgers J R, Kennard O, Shimanouchi T, Tasumi M. J Mol Biol. 1977;112:535–542. doi: 10.1016/s0022-2836(77)80200-3. [DOI] [PubMed] [Google Scholar]
  • 33.Hao M H, Scheraga H A. Proc Natl Acad Sci USA. 1996;93:4984–4989. doi: 10.1073/pnas.93.10.4984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Hao M H, Scheraga H A. J Phys Chem. 1996;100:14540–14584. [Google Scholar]
  • 35.Lee J, Scheraga H A, Rackovsky S. J Comput Chem. 1997;18:1222–1232. [Google Scholar]
  • 36.Lee J, Scheraga H A, Rackovsky S. Biopolymers. 1998;46:103–115. doi: 10.1002/(SICI)1097-0282(199808)46:2<103::AID-BIP5>3.0.CO;2-Q. [DOI] [PubMed] [Google Scholar]
  • 37.Lee J, Liwo A, Scheraga H A. Proc Natl Acad Sci USA. 1999;96:2025–2030. doi: 10.1073/pnas.96.5.2025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lee, J. & Scheraga, H. A. (1999) Int. J. Quant. Chem., in press.
  • 39.Goldberg D E. Genetic Algorithms in Search, Optimization and Machine Learning. Reading, MA: Addison–Wesley; 1989. [Google Scholar]
  • 40.Ripoll D R, Scheraga H A. Biopolymers. 1988;27:1283–1303. doi: 10.1002/bip.360270808. [DOI] [PubMed] [Google Scholar]
  • 41.Ripoll D R, Scheraga H A. Biopolymers. 1990;30:165–176. doi: 10.1002/bip.360300116. [DOI] [PubMed] [Google Scholar]
  • 42.Ripoll D R, Liwo A, Scheraga H A. Biopolymers. 1998;46:117–126. doi: 10.1002/(SICI)1097-0282(199808)46:2<117::AID-BIP6>3.0.CO;2-P. [DOI] [PubMed] [Google Scholar]
  • 43.Gō N, Scheraga H A. Macromolecules. 1970;3:178–187. [Google Scholar]
  • 44.Palmer K A, Scheraga H A. J Comput Chem. 1991;12:505–526. [Google Scholar]
  • 45.Momany F A, McGuire R F, Burgess A W, Scheraga H A. J Phys Chem. 1975;79:2361–2381. [Google Scholar]
  • 46.Némethy G, Pottle M S, Scheraga H A. J Phys Chem. 1983;87:1883–1887. [Google Scholar]
  • 47.Némethy G, Gibson K D, Palmer K A, Yoon C N, Paterlini G, Zagari A, Rumsey S, Scheraga H A. J Phys Chem. 1992;96:6472–6484. [Google Scholar]
  • 48.Vila J, Williams R L, Vásquez M, Scheraga H A. Proteins Struct Funct Genet. 1991;10:199–218. doi: 10.1002/prot.340100305. [DOI] [PubMed] [Google Scholar]
  • 49.Lee, J., Liwo, A., Ripoll, D. R., Pillardy, J. & Scheraga, H. A. (1999) Proteins Struct. Funct. Genet., submitted.
  • 50.Yang F, Gustafson K R, Boyd M R, Wlodawer A. Nat Struct Biol. 1998;5:763–764. doi: 10.1038/1796. [DOI] [PubMed] [Google Scholar]
  • 51.Koehl P, Levitt M. Nat Struct Biol. 1999;6:108–111. doi: 10.1038/5794. [DOI] [PubMed] [Google Scholar]
  • 52.Rhee S, Martin R G, Rosner J L, Davies D R. Proc Natl Acad Sci USA. 1998;95:10413–10418. doi: 10.1073/pnas.95.18.10413. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES