Abstract
Two computational methods widely used in time series analysis were applied to protein sequences, and their ability to derive structural information not directly accessible through classical sequence comparisons methods was assessed. The primary structures of 19 rubredoxins of both mesophilic and thermophilic bacteria, coded with hydrophobicity values of amino acid residues, were considered as time series and were analyzed by 1) recurrence quantification analysis and 2) spectral analysis of the sequence major eigenfunctions. The results of the two methods agreed to a large extent and generated a classification consistent with known 3D structural characteristics of the studied proteins. This classification separated in a clearcut manner a thermophilic protein from mesophilic proteins. The classification of primary structures given by the two dynamical methods was demonstrated to be basically different from classification stemming from classical sequence homology metrics. Moreover, on a more detailed scale, the method was able to discriminate between thermophilic and mesophilic proteins from a set of chimeric sequences generated from the mixing of a mesophilic (Rubr Clopa) and a thermophilic (Rubr Pyrfu) protein. Overall, our results point to a new way of looking at protein sequence comparisons.
Full Text
The Full Text of this article is available as a PDF (560.1 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Anfinsen C. B. Principles that govern the folding of protein chains. Science. 1973 Jul 20;181(4096):223–230. doi: 10.1126/science.181.4096.223. [DOI] [PubMed] [Google Scholar]
- Eidsness M. K., Richie K. A., Burden A. E., Kurtz D. M., Jr, Scott R. A. Dissecting contributions to the thermostability of Pyrococcus furiosus rubredoxin: beta-sheet chimeras. Biochemistry. 1997 Aug 26;36(34):10406–10413. doi: 10.1021/bi970110r. [DOI] [PubMed] [Google Scholar]
- Faure P., Korn H. A nonrandom dynamic component in the synaptic noise of a central neuron. Proc Natl Acad Sci U S A. 1997 Jun 10;94(12):6506–6511. doi: 10.1073/pnas.94.12.6506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giuliani A., Piccirillo G., Marigliano V., Colosimo A. A nonlinear explanation of aging-induced changes in heartbeat dynamics. Am J Physiol. 1998 Oct;275(4 Pt 2):H1455–H1461. doi: 10.1152/ajpheart.1998.275.4.H1455. [DOI] [PubMed] [Google Scholar]
- Giuliani A, Manetti C. Hidden peculiarities in the potential energy time series of a tripeptide highlighted by a recurrence plot analysis: A molecular dynamics simulation. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 1996 Jun;53(6):6336–6340. doi: 10.1103/physreve.53.6336. [DOI] [PubMed] [Google Scholar]
- Hansch Corwin, Hoekman David, Gao Hua. Comparative QSAR: Toward a Deeper Understanding of Chemicobiological Interactions. Chem Rev. 1996 May 9;96(3):1045–1076. doi: 10.1021/cr9400976. [DOI] [PubMed] [Google Scholar]
- Higgins D. G., Sharp P. M. Fast and sensitive multiple sequence alignments on a microcomputer. Comput Appl Biosci. 1989 Apr;5(2):151–153. doi: 10.1093/bioinformatics/5.2.151. [DOI] [PubMed] [Google Scholar]
- Kaspar F, Schuster HG. Easily calculable measure for the complexity of spatiotemporal patterns. Phys Rev A Gen Phys. 1987 Jul 15;36(2):842–848. doi: 10.1103/physreva.36.842. [DOI] [PubMed] [Google Scholar]
- Mandell A. J., Owens M. J., Selz K. A., Morgan W. N., Shlesinger M. F., Nemeroff C. B. Mode matches in hydrophobic free energy eigenfunctions predict peptide-protein interactions. Biopolymers. 1998 Aug;46(2):89–101. doi: 10.1002/(SICI)1097-0282(199808)46:2<89::AID-BIP4>3.0.CO;2-T. [DOI] [PubMed] [Google Scholar]
- Mandell A. J., Selz K. A., Shlesinger M. F. Mode matches and their locations in the hydrophobic free energy sequences of peptide ligands and their receptor eigenfunctions. Proc Natl Acad Sci U S A. 1997 Dec 9;94(25):13576–13581. doi: 10.1073/pnas.94.25.13576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin Y. C. A practitioner's perspective of the role of quantitative structure-activity analysis in medicinal chemistry. J Med Chem. 1981 Mar;24(3):229–237. doi: 10.1021/jm00135a001. [DOI] [PubMed] [Google Scholar]
- Page R. D. TreeView: an application to display phylogenetic trees on personal computers. Comput Appl Biosci. 1996 Aug;12(4):357–358. doi: 10.1093/bioinformatics/12.4.357. [DOI] [PubMed] [Google Scholar]
- Pearson W. R., Lipman D. J. Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444–2448. doi: 10.1073/pnas.85.8.2444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saitou N., Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987 Jul;4(4):406–425. doi: 10.1093/oxfordjournals.molbev.a040454. [DOI] [PubMed] [Google Scholar]
- Selz K. A., Mandell A. J., Shlesinger M. F. Hydrophobic free energy eigenfunctions of pore, channel, and transporter proteins contain beta-burst patterns. Biophys J. 1998 Nov;75(5):2332–2342. doi: 10.1016/S0006-3495(98)77677-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sieker L. C., Stenkamp R. E., LeGall J. Rubredoxin in crystalline state. Methods Enzymol. 1994;243:203–216. doi: 10.1016/0076-6879(94)43016-0. [DOI] [PubMed] [Google Scholar]
- Strait B. J., Dewey T. G. The Shannon information entropy of protein sequences. Biophys J. 1996 Jul;71(1):148–155. doi: 10.1016/S0006-3495(96)79210-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sweet R. M., Eisenberg D. Correlation of sequence hydrophobicities measures similarity in three-dimensional protein structure. J Mol Biol. 1983 Dec 25;171(4):479–488. doi: 10.1016/0022-2836(83)90041-4. [DOI] [PubMed] [Google Scholar]
- Thompson J. D., Gibson T. J., Plewniak F., Jeanmougin F., Higgins D. G. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997 Dec 15;25(24):4876–4882. doi: 10.1093/nar/25.24.4876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Webber C. L., Jr, Zbilut J. P. Dynamical assessment of physiological systems and states using recurrence plot strategies. J Appl Physiol (1985) 1994 Feb;76(2):965–973. doi: 10.1152/jappl.1994.76.2.965. [DOI] [PubMed] [Google Scholar]
- Weiss O., Herzel H. Correlations in protein sequences and property codes. J Theor Biol. 1998 Feb 21;190(4):341–353. doi: 10.1006/jtbi.1997.0560. [DOI] [PubMed] [Google Scholar]
- Wilbur W. J., Lipman D. J. Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci U S A. 1983 Feb;80(3):726–730. doi: 10.1073/pnas.80.3.726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zbilut J. P., Giuliani A., Webber C. L., Jr, Colosimo A. Recurrence quantification analysis in structure-function relationships of proteins: an overview of a general methodology applied to the case of TEM-1 beta-lactamase. Protein Eng. 1998 Feb;11(2):87–93. doi: 10.1093/protein/11.2.87. [DOI] [PubMed] [Google Scholar]
- von Heijne G. Signal sequences are not uniformly hydrophobic. J Mol Biol. 1982 Aug 15;159(3):537–541. doi: 10.1016/0022-2836(82)90300-x. [DOI] [PubMed] [Google Scholar]
