Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 1994 Aug 25;22(16):3412–3417. doi: 10.1093/nar/22.16.3412

Tandemly repeated pentanucleotides in DNA sequences of eucaryotes.

B Borstnik 1, D Pumpernik 1, D Lukman 1, D Ugarković 1, M Plohl 1
PMCID: PMC523737  PMID: 8078778

Abstract

Genetic sequence data banks were scanned in order to retrieve tandemly repeated pentanucleotides (pnts). It was found that among 102 (=(1024-4)/2/5) possible distinct pnts roughly each fourth is involved in tandem repeats. It is shown that tandemly repeated pnts are composed of frequently occurring di- and trinucleotides and that those pnts which occur frequently in the form of mono- or di-pnts form also tandem repeats either in the form of satellites or in the form of shorter tandem repeats. Human satellite III is taken as a specific example. It is shown that the first guanine within GG-AAT pnt exhibits the highest mutability. Sequential distribution of base changes gives evidence that the mutations do not occur at random positions but in a correlated fashion so that long stretches of original pnts remain intact. It is found that pnts related to the satellite III are present in introns and flanking regions of some structural genes, but are not preserved between orthologous genes of related species. The results corroborate the most plausible mechanism of their evolution--rapid amplification followed by successive divergence of repeat units by various mutational processes.

Full text

PDF
3412

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  2. Benson D., Lipman D. J., Ostell J. GenBank. Nucleic Acids Res. 1993 Jul 1;21(13):2963–2965. doi: 10.1093/nar/21.13.2963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bernardi G., Olofsson B., Filipski J., Zerial M., Salinas J., Cuny G., Meunier-Rotival M., Rodier F. The mosaic genome of warm-blooded vertebrates. Science. 1985 May 24;228(4702):953–958. doi: 10.1126/science.4001930. [DOI] [PubMed] [Google Scholar]
  4. Bernardi G. The isochore organization of the human genome. Annu Rev Genet. 1989;23:637–661. doi: 10.1146/annurev.ge.23.120189.003225. [DOI] [PubMed] [Google Scholar]
  5. Bird A. P. DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res. 1980 Apr 11;8(7):1499–1504. doi: 10.1093/nar/8.7.1499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Choo K. H., Earle E., Vissel B., Kalitsis P. A chromosome 14-specific human satellite III DNA subfamily that shows variable presence on different chromosomes 14. Am J Hum Genet. 1992 Apr;50(4):706–716. [PMC free article] [PubMed] [Google Scholar]
  7. Cross S. H., Charlton J. A., Nan X., Bird A. P. Purification of CpG islands using a methylated DNA binding column. Nat Genet. 1994 Mar;6(3):236–244. doi: 10.1038/ng0394-236. [DOI] [PubMed] [Google Scholar]
  8. Frommer M., Prosser J., Tkachuk D., Reisner A. H., Vincent P. C. Simple repeated sequences in human satellite DNA. Nucleic Acids Res. 1982 Jan 22;10(2):547–563. doi: 10.1093/nar/10.2.547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Gentleman J. F., Mullin R. C. The distribution of the frequency of occurrence of nucleotide subsequences, based on their overlap capability. Biometrics. 1989 Mar;45(1):35–52. [PubMed] [Google Scholar]
  10. Grady D. L., Ratliff R. L., Robinson D. L., McCanlies E. C., Meyne J., Moyzis R. K. Highly conserved repetitive DNA sequences are present at human centromeres. Proc Natl Acad Sci U S A. 1992 Mar 1;89(5):1695–1699. doi: 10.1073/pnas.89.5.1695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Gutiérrez G., Oliver J. L., Marín A. Dinucleotides and G+C content in human genes: opposite behavior of GpG, GpC, and TpC at II-III codon positions and in introns. J Mol Evol. 1993 Aug;37(2):131–136. doi: 10.1007/BF02407348. [DOI] [PubMed] [Google Scholar]
  12. Hanai R., Wada A. The effects of guanine and cytosine variation on dinucleotide frequency and amino acid composition in the human genome. J Mol Evol. 1988;27(4):321–325. doi: 10.1007/BF02101194. [DOI] [PubMed] [Google Scholar]
  13. Hasegawa M., Di Rienzo A., Kocher T. D., Wilson A. C. Toward a more accurate time scale for the human mitochondrial DNA tree. J Mol Evol. 1993 Oct;37(4):347–354. doi: 10.1007/BF00178865. [DOI] [PubMed] [Google Scholar]
  14. Jackson M. S., Mole S. E., Ponder B. A. Characterisation of a boundary between satellite III and alphoid sequences on human chromosome 10. Nucleic Acids Res. 1992 Sep 25;20(18):4781–4787. doi: 10.1093/nar/20.18.4781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Jeffreys A. J., Wilson V., Thein S. L. Individual-specific 'fingerprints' of human DNA. Nature. 1985 Jul 4;316(6023):76–79. doi: 10.1038/316076a0. [DOI] [PubMed] [Google Scholar]
  16. Jørgensen A. L., Laursen H. B., Jones C., Bak A. L. Evolutionarily different alphoid repeat DNA on homologous chromosomes in human and chimpanzee. Proc Natl Acad Sci U S A. 1992 Apr 15;89(8):3310–3314. doi: 10.1073/pnas.89.8.3310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Kremer E. J., Pritchard M., Lynch M., Yu S., Holman K., Baker E., Warren S. T., Schlessinger D., Sutherland G. R., Richards R. I. Mapping of DNA instability at the fragile X to a trinucleotide repeat sequence p(CCG)n. Science. 1991 Jun 21;252(5013):1711–1714. doi: 10.1126/science.1675488. [DOI] [PubMed] [Google Scholar]
  18. Larsen F., Gundersen G., Lopez R., Prydz H. CpG islands as gene markers in the human genome. Genomics. 1992 Aug;13(4):1095–1107. doi: 10.1016/0888-7543(92)90024-m. [DOI] [PubMed] [Google Scholar]
  19. Love J. M., Knight A. M., McAleer M. A., Todd J. A. Towards construction of a high resolution map of the mouse genome using PCR-analysed microsatellites. Nucleic Acids Res. 1990 Jul 25;18(14):4123–4130. doi: 10.1093/nar/18.14.4123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Mahadevan M., Tsilfidis C., Sabourin L., Shutler G., Amemiya C., Jansen G., Neville C., Narang M., Barceló J., O'Hoy K. Myotonic dystrophy mutation: an unstable CTG repeat in the 3' untranslated region of the gene. Science. 1992 Mar 6;255(5049):1253–1255. doi: 10.1126/science.1546325. [DOI] [PubMed] [Google Scholar]
  21. Maroteaux L., Heilig R., Dupret D., Mandel J. L. Repetitive satellite-like sequences are present within or upstream from 3 avian protein-coding genes. Nucleic Acids Res. 1983 Mar 11;11(5):1227–1243. doi: 10.1093/nar/11.5.1227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Meijers J. C., Chung D. W. Organization of the gene coding for human protein C inhibitor (plasminogen activator inhibitor-3). Assignment of the gene to chromosome 14. J Biol Chem. 1991 Aug 15;266(23):15028–15034. [PubMed] [Google Scholar]
  23. Moyzis R. K., Buckingham J. M., Cram L. S., Dani M., Deaven L. L., Jones M. D., Meyne J., Ratliff R. L., Wu J. R. A highly conserved repetitive DNA sequence, (TTAGGG)n, present at the telomeres of human chromosomes. Proc Natl Acad Sci U S A. 1988 Sep;85(18):6622–6626. doi: 10.1073/pnas.85.18.6622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Nussinov R. Eukaryotic dinucleotide preference rules and their implications for degenerate codon usage. J Mol Biol. 1981 Jun 15;149(1):125–131. doi: 10.1016/0022-2836(81)90264-3. [DOI] [PubMed] [Google Scholar]
  25. Nussinov R. Strong doublet preferences in nucleotide sequences and DNA geometry. J Mol Evol. 1984;20(2):111–119. doi: 10.1007/BF02257371. [DOI] [PubMed] [Google Scholar]
  26. Ohno S., Yomo T. The grammatical rule for all DNA: junk and coding sequences. Electrophoresis. 1991 Feb-Mar;12(2-3):103–108. doi: 10.1002/elps.1150120203. [DOI] [PubMed] [Google Scholar]
  27. Peng C. K., Buldyrev S. V., Goldberger A. L., Havlin S., Sciortino F., Simons M., Stanley H. E. Long-range correlations in nucleotide sequences. Nature. 1992 Mar 12;356(6365):168–170. doi: 10.1038/356168a0. [DOI] [PubMed] [Google Scholar]
  28. Pevzner P. A., Borodovsky MYu, Mironov A. A. Linguistics of nucleotide sequences. I: The significance of deviations from mean statistical characteristics and prediction of the frequencies of occurrence of words. J Biomol Struct Dyn. 1989 Apr;6(5):1013–1026. doi: 10.1080/07391102.1989.10506528. [DOI] [PubMed] [Google Scholar]
  29. Plohl M., Borstnik B., Lucijanić-Justić V., Ugarković D. Evidence for random distribution of sequence variants in Tenebrio molitor satellite DNA. Genet Res. 1992 Aug;60(1):7–13. doi: 10.1017/s0016672300030615. [DOI] [PubMed] [Google Scholar]
  30. Sol K., DuBow M. S. Sequence analysis of cloned human satellite DNAs reveals nonrandom variations. Genome. 1993 Apr;36(2):334–342. doi: 10.1139/g93-046. [DOI] [PubMed] [Google Scholar]
  31. Stoehr P. J., Cameron G. N. The EMBL data library. Nucleic Acids Res. 1991 Apr 25;19 (Suppl):2227–2230. doi: 10.1093/nar/19.suppl.2227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Stückle E. E., Nielsen P. J., Grob U. Probability of occurrence of specific oligomers. J Theor Biol. 1992 Dec 7;159(3):299–306. doi: 10.1016/s0022-5193(05)80726-x. [DOI] [PubMed] [Google Scholar]
  33. Volinia S., Bernardi F., Gambari R., Barrai I. Co-localization of rare oligonucleotides and regulatory elements in mammalian upstream gene regions. J Mol Biol. 1988 Sep 20;203(2):385–390. doi: 10.1016/0022-2836(88)90006-x. [DOI] [PubMed] [Google Scholar]
  34. Williamson J. R., Raghuraman M. K., Cech T. R. Monovalent cation-induced structure of telomeric DNA: the G-quartet model. Cell. 1989 Dec 1;59(5):871–880. doi: 10.1016/0092-8674(89)90610-7. [DOI] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES