Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 1991 Nov 25;19(22):6313–6317. doi: 10.1093/nar/19.22.6313

Analysis of distribution of bases in the coding sequences by a diagrammatic technique.

C T Zhang 1, R Zhang 1
PMCID: PMC329145  PMID: 1956790

Abstract

The frequencies of occurrence of four bases in the first, second and third codon positions and in the total coding sequences have been calculated by the codon usage table published in 1990 by Ikemura et al. The distribution of frequencies are further analysed in detail by a graphic technique presented recently by us. Formulas expressing the frequencies of four bases in the first and second codon positions in terms of frequencies of amino acids have been given. It is shown by the graphic analysis that for 90 species, in the first codon position the purine bases are dominant and in most cases G is the most dominant base. In the second codon position A is the most dominant base, while G is the least dominant base. In the third codon position the G + C content varies from 0.1 to 0.9, keeping the A + C content equal to 1/2 and G content equal to that of C, approximately. If the frequencies for bases A, C, G and U in the total coding sequences are denoted by a, c, g and u, respectively, it is found that the unequal formula: a2 + c2 + g2 + u2 less than 1/3, is valid for each of the 90 species including the human and E.coli etc.

Full text

PDF
6313

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Aota S., Gojobori T., Ishibashi F., Maruyama T., Ikemura T. Codon usage tabulated from the GenBank Genetic Sequence Data. Nucleic Acids Res. 1988;16 (Suppl):r315–r402. doi: 10.1093/nar/16.suppl.r315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Grantham R., Gautier C., Gouy M., Jacobzone M., Mercier R. Codon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res. 1981 Jan 10;9(1):r43–r74. doi: 10.1093/nar/9.1.213-b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Grantham R., Gautier C., Gouy M., Mercier R., Pavé A. Codon catalog usage and the genome hypothesis. Nucleic Acids Res. 1980 Jan 11;8(1):r49–r62. doi: 10.1093/nar/8.1.197-c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ikemura T. Codon usage and tRNA content in unicellular and multicellular organisms. Mol Biol Evol. 1985 Jan;2(1):13–34. doi: 10.1093/oxfordjournals.molbev.a040335. [DOI] [PubMed] [Google Scholar]
  5. Maruyama T., Gojobori T., Aota S., Ikemura T. Codon usage tabulated from the GenBank genetic sequence data. Nucleic Acids Res. 1986;14 (Suppl):r151–r197. doi: 10.1093/nar/14.suppl.r151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Murray E. E., Lotzer J., Eberle M. Codon usage in plant genes. Nucleic Acids Res. 1989 Jan 25;17(2):477–498. doi: 10.1093/nar/17.2.477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Trifonov E. N. Translation framing code and frame-monitoring mechanism as suggested by the analysis of mRNA and 16 S rRNA nucleotide sequences. J Mol Biol. 1987 Apr 20;194(4):643–652. doi: 10.1016/0022-2836(87)90241-5. [DOI] [PubMed] [Google Scholar]
  8. Zhang C. T., Zhang R. Diagrammatic representation of the distribution of DNA bases and its applications. Int J Biol Macromol. 1991 Feb;13(1):45–49. doi: 10.1016/0141-8130(91)90009-j. [DOI] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES