Abstract
An algorithm to determine the probability that a reading frame codifies for a protein is presented. It is based on the results of our previous studies on the thermodynamic characteristics of a translated reading frame. We also develop a prediction procedure to distinguish between coding and non-coding reading frames. The procedure is based on the characteristics of the putative product of the DNA sequence and not on periodicity characteristics of the sequence, so the prediction is not biased by the presence of overlapping translated reading frames or by the presence of translated reading frames on the complementary DNA strand.
Full text
PDF








Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Breathnach R., Chambon P. Organization and expression of eucaryotic split genes coding for proteins. Annu Rev Biochem. 1981;50:349–383. doi: 10.1146/annurev.bi.50.070181.002025. [DOI] [PubMed] [Google Scholar]
- Eigen M. Selforganization of matter and the evolution of biological macromolecules. Naturwissenschaften. 1971 Oct;58(10):465–523. doi: 10.1007/BF00623322. [DOI] [PubMed] [Google Scholar]
- Eigen M., Winkler-Oswatitsch R. Transfer-RNA, an early gene? Naturwissenschaften. 1981 Jun;68(6):282–292. doi: 10.1007/BF01047470. [DOI] [PubMed] [Google Scholar]
- Fickett J. W. Recognition of protein coding regions in DNA sequences. Nucleic Acids Res. 1982 Sep 11;10(17):5303–5318. doi: 10.1093/nar/10.17.5303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gingeras T. R., Sciaky D., Gelinas R. E., Bing-Dong J., Yen C. E., Kelly M. M., Bullock P. A., Parsons B. L., O'Neill K. E., Roberts R. J. Nucleotide sequences from the adenovirus-2 genome. J Biol Chem. 1982 Nov 25;257(22):13475–13491. [PubMed] [Google Scholar]
- Gold L., Pribnow D., Schneider T., Shinedling S., Singer B. S., Stormo G. Translational initiation in prokaryotes. Annu Rev Microbiol. 1981;35:365–403. doi: 10.1146/annurev.mi.35.100181.002053. [DOI] [PubMed] [Google Scholar]
- Grantham R., Gautier C., Gouy M. Codon frequencies in 119 individual genes confirm consistent choices of degenerate bases according to genome type. Nucleic Acids Res. 1980 May 10;8(9):1893–1912. doi: 10.1093/nar/8.9.1893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grantham R., Gautier C., Gouy M., Jacobzone M., Mercier R. Codon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res. 1981 Jan 10;9(1):r43–r74. doi: 10.1093/nar/9.1.213-b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikemura T., Ozeki H. Codon usage and transfer RNA contents: organism-specific codon-choice patterns in reference to the isoacceptor contents. Cold Spring Harb Symp Quant Biol. 1983;47(Pt 2):1087–1097. doi: 10.1101/sqb.1983.047.01.123. [DOI] [PubMed] [Google Scholar]
- Macchiato M. F., Tramontano A. Thermodynamic approach to a possible theory of the evolution of a genetic code. Z Naturforsch C. 1982 Oct;37(10):1031–1037. doi: 10.1515/znc-1982-1025. [DOI] [PubMed] [Google Scholar]
- Pierno G., Barni N., Candurro M., Cipollaro M., Franzè A., Juliano L., Macchiato M. F., Mastrocinque G., Moscatelli C., Scarlato V. Computer programs for the characterization of protein coding genes. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):281–285. doi: 10.1093/nar/12.1part1.281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rak B., Lusky M., Hable M. Expression of two proteins from overlapping and oppositely oriented genes on transposable DNA insertion element IS5. Nature. 1982 May 13;297(5862):124–128. doi: 10.1038/297124a0. [DOI] [PubMed] [Google Scholar]
- Shepherd J. C. Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justification. Proc Natl Acad Sci U S A. 1981 Mar;78(3):1596–1600. doi: 10.1073/pnas.78.3.1596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shepherd J. C. Periodic correlations in DNA sequences and evidence suggesting their evolutionary origin in a comma-less genetic code. J Mol Evol. 1981;17(2):94–102. doi: 10.1007/BF01732679. [DOI] [PubMed] [Google Scholar]
- Shulman M. J., Steinberg C. M., Westmoreland N. The coding function of nucleotide sequences can be discerned by statistical analysis. J Theor Biol. 1981 Feb 7;88(3):409–420. doi: 10.1016/0022-5193(81)90274-5. [DOI] [PubMed] [Google Scholar]
- Staden R. Computer methods to locate signals in nucleic acid sequences. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):505–519. doi: 10.1093/nar/12.1part2.505. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Staden R., McLachlan A. D. Codon preference and its use in identifying protein coding regions in long DNA sequences. Nucleic Acids Res. 1982 Jan 11;10(1):141–156. doi: 10.1093/nar/10.1.141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tramontano A., Scarlato V., Barni N., Cipollaro M., Franzè A., Macchiato M. F., Cascino A. Statistical evaluation of the coding capacity of complementary DNA strands. Nucleic Acids Res. 1984 Jun 25;12(12):5049–5059. doi: 10.1093/nar/12.12.5049. [DOI] [PMC free article] [PubMed] [Google Scholar]
