Abstract
The CpG dinucleotide is present at approximately 20% of its expected frequency in vertebrate genomes, a deficiency thought due to a high mutation rate from the methylated form of CpG to TpG and CpA. We examine the hypothesis that the 20% frequency represents an equilibrium between rate of creation of new CpGs and accelerated rate of CpG loss from methylation. Using this model, we calculate the expected reduction in the equilibrium frequency of the CpG dinucleotide and find that the observed CpG deficiency can be explained by mutation from methylated CpG to TpG/CpA at approximately 12 times the normal transition rate, the exact rate depending on the ratio of transitions to transversions. The observed rate of CpG dinucleotide loss in a human alpha-globin nonprocessed pseudogene, psi alpha 1, and the apparent replenishment of the CpG pool in this sequence by new mutations, agree with the above parameters. These calculations indicate that it would take 25 million years or less, a small fraction of the time for vertebrate evolution, for CpG frequency to be reduced from undepleted levels to the current depleted levels.
Full text
PDFImages in this article
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Barker D., Schafer M., White R. Restriction sites containing CpG show a higher frequency of polymorphism in human DNA. Cell. 1984 Jan;36(1):131–138. doi: 10.1016/0092-8674(84)90081-3. [DOI] [PubMed] [Google Scholar]
- Bird A. P. CpG-rich islands and the function of DNA methylation. Nature. 1986 May 15;321(6067):209–213. doi: 10.1038/321209a0. [DOI] [PubMed] [Google Scholar]
- Bird A. P. DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res. 1980 Apr 11;8(7):1499–1504. doi: 10.1093/nar/8.7.1499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bird A. P., Taggart M. H., Nicholls R. D., Higgs D. R. Non-methylated CpG-rich islands at the human alpha-globin locus: implications for evolution of the alpha-globin pseudogene. EMBO J. 1987 Apr;6(4):999–1004. doi: 10.1002/j.1460-2075.1987.tb04851.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown T. C., Jiricny J. Different base/base mispairs are corrected with different efficiencies and specificities in monkey kidney cells. Cell. 1988 Aug 26;54(5):705–711. doi: 10.1016/s0092-8674(88)80015-1. [DOI] [PubMed] [Google Scholar]
- Bulmer M. A statistical analysis of nucleotide sequences of introns and exons in human genes. Mol Biol Evol. 1987 Jul;4(4):395–405. doi: 10.1093/oxfordjournals.molbev.a040453. [DOI] [PubMed] [Google Scholar]
- Bulmer M. Neighboring base effects on substitution rates in pseudogenes. Mol Biol Evol. 1986 Jul;3(4):322–329. doi: 10.1093/oxfordjournals.molbev.a040401. [DOI] [PubMed] [Google Scholar]
- Cooper D. N., Krawczak M. Cytosine methylation and the fate of CpG dinucleotides in vertebrate genomes. Hum Genet. 1989 Sep;83(2):181–188. doi: 10.1007/BF00286715. [DOI] [PubMed] [Google Scholar]
- Cooper D. N., Youssoufian H. The CpG dinucleotide and human genetic disease. Hum Genet. 1988 Feb;78(2):151–155. doi: 10.1007/BF00278187. [DOI] [PubMed] [Google Scholar]
- Coulondre C., Miller J. H., Farabaugh P. J., Gilbert W. Molecular basis of base substitution hotspots in Escherichia coli. Nature. 1978 Aug 24;274(5673):775–780. doi: 10.1038/274775a0. [DOI] [PubMed] [Google Scholar]
- Felsenstein J. Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol. 1981;17(6):368–376. doi: 10.1007/BF01734359. [DOI] [PubMed] [Google Scholar]
- Gardiner-Garden M., Frommer M. CpG islands in vertebrate genomes. J Mol Biol. 1987 Jul 20;196(2):261–282. doi: 10.1016/0022-2836(87)90689-9. [DOI] [PubMed] [Google Scholar]
- Green P. M., Bentley D. R., Mibashan R. S., Nilsson I. M., Giannelli F. Molecular pathology of haemophilia B. EMBO J. 1989 Apr;8(4):1067–1072. doi: 10.1002/j.1460-2075.1989.tb03474.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimura M. Evolutionary rate at the molecular level. Nature. 1968 Feb 17;217(5129):624–626. doi: 10.1038/217624a0. [DOI] [PubMed] [Google Scholar]
- Lindahl T. DNA repair enzymes. Annu Rev Biochem. 1982;51:61–87. doi: 10.1146/annurev.bi.51.070182.000425. [DOI] [PubMed] [Google Scholar]
- Russell G. J., Walker P. M., Elton R. A., Subak-Sharpe J. H. Doublet frequency analysis of fractionated vertebrate nuclear DNA. J Mol Biol. 1976 Nov;108(1):1–23. doi: 10.1016/s0022-2836(76)80090-3. [DOI] [PubMed] [Google Scholar]
- SWARTZ M. N., TRAUTNER T. A., KORNBERG A. Enzymatic synthesis of deoxyribonucleic acid. XI. Further studies on nearest neighbor base sequences in deoxyribonucleic acids. J Biol Chem. 1962 Jun;237:1961–1967. [PubMed] [Google Scholar]
- Tykocinski M. L., Max E. E. CG dinucleotide clusters in MHC genes and in 5' demethylated genes. Nucleic Acids Res. 1984 May 25;12(10):4385–4396. doi: 10.1093/nar/12.10.4385. [DOI] [PMC free article] [PubMed] [Google Scholar]