Skip to main content
Protein Science : A Publication of the Protein Society logoLink to Protein Science : A Publication of the Protein Society
editorial
. 2005 Aug;14(8):1943–1944. doi: 10.1110/ps.051581805

The implications of higher (or lower) success in secondary structure prediction of chain fragments

Chung-Jung Tsai 1, Ruth Nussinov 1,2
PMCID: PMC2279305  PMID: 16046621

The accompanying paper by Kihara (2005) provides clear data illustrating what has long been believed: Local secondary structure formation is affected by long-range interactions. To predict secondary structures more accurately, long-range tertiary information should be taken into account. This is an important contribution, as it substantiates what was commonly expected. And the investigator performs this job in a systematic, comprehensive way, providing an excellent reference for future papers on the limitations in all current schemes for secondary structure prediction.

As the paper notes, secondary structure prediction is one of the oldest problems addressed through what is now called bioinformatics research (Chuo and Fasman 1978; Garnier et al. 1978). Today it is generally accepted that formation and subsequent association of secondary structures do not necessarily constitute the first steps in protein folding. Nevertheless, accurate prediction of the secondary structure elements is a tremendous asset in correct tertiary prediction. Current work on secondary structure prediction uses methods such as neural networks (Rost and Sander 1994), support vector machines (Hua and Sun 2001; Ward et al. 2003), machine learning algorithms, and hidden Markov models. Older methods are also enhanced (Kloczkowski et al. 2002), and new servers are constructed. Yet, despite considerable effort, current prediction success does not exceed the upper 70%–80%. This value is calculated with respect to the secondary structure assignment, which also suffers from inconsistencies between the different methods. This barrier has been attributed to long-range effects. Kihara (2005) demonstrates these effects by calculating the residue contact order (RCO) (Plaxco et al. 1998) on a large data set of nonhomologous proteins. This allows him to probe a potential relationship between the residue separation in terms of the RCO and the accuracy of the secondary structure prediction.

The results presented in this paper reinforce several interesting conclusions relating to prediction accuracy. As is broadly accepted, folding is not a random search; rather, the landscape is funnel-shaped, which implies that building-block fragments of the protein have preferred conformations (Tsai et al. 2000). The conformations of the building blocks are stabilized by mutual interactions, short- or long-range, and their population times are related to the success of the prediction of the secondary structures. Thus, (1) if the residue has long-range interactions, then according to Figure 5 in the Kihara paper (2005), its population time in a given secondary structure is variable, as implied by low accuracy. In contrast, if the residue has short-range interactions, its population time is high; (2) if the prediction success of the secondary structure is high, the folding rate is fast, since it has higher population time (if the RCO is unchanged). The probability of success or confidence in the prediction can be used in scoring.

The results provided by Kihara (2005) convincingly illustrate that to achieve higher accuracy in the prediction, the (~20) residue windows frequently used in the prediction of the secondary structures are not good enough. They are too short to account for long-range interactions. And consideration of long-range interactions is essential for a higher rate of success in the prediction. Yet, longer windows are difficult to implement in the calculation since they require a much larger data set. Nevertheless, the Kihara paper provides a real impetus to proceed in this direction.

Acknowledgments

The projects described here have been funded in whole or in part with Federal funds from the National Cancer Institute, NIH, under contract number NO1-CO-12400. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government.

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.051581805.

[This paper is commentary on the paper On the effect of long-range interactions on the secondary structure formation of proteins by Daisuke Kihara in this issue.]

References

  1. Chuo, P.Y. and Fasman, G.D. 1978. Empirical predictions of protein conformation. Annu. Rev. Biochem. 47 251–276. [DOI] [PubMed] [Google Scholar]
  2. Garnier, J., Osguthorpe, D.J., and Robson, B. 1978. Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. J. Mol. Biol. 120 97–120. [DOI] [PubMed] [Google Scholar]
  3. Hua, S. and Sun, Z. 2001. A novel method of protein structure prediction with high segment overlap measure: Support vector machine approach. J. Mol. Biol. 308 397–407. [DOI] [PubMed] [Google Scholar]
  4. Kihara, D. 2005. On the effect of long range interactions on secondary structure formation in proteins. Protein Sci. (this issue). [DOI] [PMC free article] [PubMed]
  5. Kloczkowski, A., Ting,K.L., Jernigan, R.L., and Garnier, J. 2002. Combining the GOR V algorithm with evolutionary information for protein secondary structure prediction from amino acid sequence. Proteins 49 154–166. [DOI] [PubMed] [Google Scholar]
  6. Plaxco, K., Simons, K.T., and Baker, D. 1998. Contact order, transition state placement and the refolding rates of single domain proteins. J. Mol. Biol. 277 985–994. [DOI] [PubMed] [Google Scholar]
  7. Rost, B. and Sander, C. 1994. Combining evolutionary information and neural networks to predict protein secondary structure. Proteins 19 55–72. [DOI] [PubMed] [Google Scholar]
  8. Tsai, C.J., Maizel Jr., J.V., and Nussinov, R. 2000. Anatomy of protein structures: Visualizing how a one-dimensional protein chain folds into a three-dimensional shape. Proc. Natl. Acad. Sci. 97 12038–12043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Ward, J.J., McGuffin, L.J., Buxton, B.F., and Jones, D.T. 2003. Secondary structure prediction with support vector machines. Bioinformatics 19 1650–1655. [DOI] [PubMed] [Google Scholar]

Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society

RESOURCES