Abstract
The frequency and distribution of the rare dinucleotide CpG was examined in 15 mammalian genes. CpG is highly methylated at cytosine in mammalian DNA (1,2) and 5-methylcytosine (5mC) is thought to undergo a transition mutation via deamination to produce thymine (3). This would result in the accumulation of TpG and CpA and depletion of CpG during evolution (4). Consistent with this hypothesis, the gene sample of 26,541 dinucleotides contained CpG at 40% the frequency expected by base composition and the CpG transition products, TpG+CpA, were significantly elevated at 124% of expected random frequency. However, because CpG occurs at only 25% of expected random frequency in the genome, the sampled genes were considerably enriched in this dinucleotide. CpGs were asymmetrically distributed in sequences flanking the genes. 5'-flanking sequences were enriched in CpG at 135% of the frequency expected assuming a symmetrical distribution of all the CpGs in the sampled genes (p less than 0.01), while 3'-flanking regions were depleted in CpG at 40% of expected values (p less than 0.0001). This asymmetry may reflect the role of 5-methylcytosine in gene expression. In contrast the frequencies of GpC and GpT+ ApC did not differ significantly from that predicted by base composition and these dinucleotides were not asymmetrically distributed.
Full text
PDFSelected References
These references are in PubMed. This may not be the complete list of references from this article.
- Bell G. I., Pictet R. L., Rutter W. J., Cordell B., Tischer E., Goodman H. M. Sequence of the human insulin gene. Nature. 1980 Mar 6;284(5751):26–32. doi: 10.1038/284026a0. [DOI] [PubMed] [Google Scholar]
- Bell G. I., Pictet R., Rutter W. J. Analysis of the regions flanking the human insulin gene and sequence of an Alu family member. Nucleic Acids Res. 1980 Sep 25;8(18):4091–4109. doi: 10.1093/nar/8.18.4091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bewley T. A., Dixon J. S., Li C. H. Sequence comparison of human pituitary growth hormone, human chorionic somatomammotropin, and ovine pituitary growth and lactogenic hormones. Int J Pept Protein Res. 1972;4(4):281–287. doi: 10.1111/j.1399-3011.1972.tb03430.x. [DOI] [PubMed] [Google Scholar]
- Bird A. P. DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res. 1980 Apr 11;8(7):1499–1504. doi: 10.1093/nar/8.7.1499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bird A. P., Taggart M. H., Smith B. A. Methylated and unmethylated DNA compartments in the sea urchin genome. Cell. 1979 Aug;17(4):889–901. doi: 10.1016/0092-8674(79)90329-5. [DOI] [PubMed] [Google Scholar]
- Chang A. C., Cochet M., Cohen S. N. Structural organization of human genomic DNA encoding the pro-opiomelanocortin peptide. Proc Natl Acad Sci U S A. 1980 Aug;77(8):4890–4894. doi: 10.1073/pnas.77.8.4890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cooke N. E., Coit D., Weiner R. I., Baxter J. D., Martial J. A. Structure of cloned DNA complementary to rat prolactin messenger RNA. J Biol Chem. 1980 Jul 10;255(13):6502–6510. [PubMed] [Google Scholar]
- Efstratiadis A., Posakony J. W., Maniatis T., Lawn R. M., O'Connell C., Spritz R. A., DeRiel J. K., Forget B. G., Weissman S. M., Slightom J. L. The structure and evolution of the human beta-globin gene family. Cell. 1980 Oct;21(3):653–668. doi: 10.1016/0092-8674(80)90429-8. [DOI] [PubMed] [Google Scholar]
- Ehrlich M., Gama-Sosa M. A., Huang L. H., Midgett R. M., Kuo K. C., McCune R. A., Gehrke C. Amount and distribution of 5-methylcytosine in human DNA from different types of tissues of cells. Nucleic Acids Res. 1982 Apr 24;10(8):2709–2721. doi: 10.1093/nar/10.8.2709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ehrlich M., Wang R. Y. 5-Methylcytosine in eukaryotic DNA. Science. 1981 Jun 19;212(4501):1350–1357. doi: 10.1126/science.6262918. [DOI] [PubMed] [Google Scholar]
- Fiddes J. C., Goodman H. M. Isolation, cloning and sequence analysis of the cDNA for the alpha-subunit of human chorionic gonadotropin. Nature. 1979 Oct 4;281(5730):351–356. doi: 10.1038/281351a0. [DOI] [PubMed] [Google Scholar]
- Glanville N., Durnam D. M., Palmiter R. D. Structure of mouse metallothionein-I gene and its mRNA. Nature. 1981 Jul 16;292(5820):267–269. doi: 10.1038/292267a0. [DOI] [PubMed] [Google Scholar]
- Grantham R., Gautier C., Gouy M., Jacobzone M., Mercier R. Codon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res. 1981 Jan 10;9(1):r43–r74. doi: 10.1093/nar/9.1.213-b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gruenbaum Y., Naveh-Many T., Cedar H., Razin A. Sequence specificity of methylation in higher plant DNA. Nature. 1981 Aug 27;292(5826):860–862. doi: 10.1038/292860a0. [DOI] [PubMed] [Google Scholar]
- Honjo T., Obata M., Yamawaki-Katoaka Y., Kataoka T., Kawakami T., Takahashi N., Mano Y. Cloning and complete nucleotide sequence of mouse immunoglobulin gamma 1 chain gene. Cell. 1979 Oct;18(2):559–568. doi: 10.1016/0092-8674(79)90072-2. [DOI] [PubMed] [Google Scholar]
- Ivarie R. D., Morris J. A. Induction of prolactin-deficient variants of GH3 rat pituitary tumor cells by ethyl methanesulfonate: reversion by 5-azacytidine, a DNA methylation inhibitor. Proc Natl Acad Sci U S A. 1982 May;79(9):2967–2970. doi: 10.1073/pnas.79.9.2967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ivarie R. D., Morris J. A., Martial J. A. Prolactin-deficient variants of GH3 rat pituitary tumor cells: linked expression of prolactin and another hormonally responsive protein in GH3 cells. Mol Cell Biol. 1982 Feb;2(2):179–189. doi: 10.1128/mcb.2.2.179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Law S. W., Dugaiczyk A. Homology between the primary structure of alpha-fetoprotein, deduced from a complete cDNA sequence, and serum albumin. Nature. 1981 May 21;291(5812):201–205. doi: 10.1038/291201a0. [DOI] [PubMed] [Google Scholar]
- Lawn R. M., Adelman J., Franke A. E., Houck C. M., Gross M., Najarian R., Goeddel D. V. Human fibroblast interferon gene lacks introns. Nucleic Acids Res. 1981 Mar 11;9(5):1045–1052. doi: 10.1093/nar/9.5.1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawn R. M., Efstratiadis A., O'Connell C., Maniatis T. The nucleotide sequence of the human beta-globin gene. Cell. 1980 Oct;21(3):647–651. doi: 10.1016/0092-8674(80)90428-6. [DOI] [PubMed] [Google Scholar]
- Malissen M., Malissen B., Jordan B. R. Exon/intron organization and complete nucleotide sequence of an HLA gene. Proc Natl Acad Sci U S A. 1982 Feb;79(3):893–897. doi: 10.1073/pnas.79.3.893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McGhee J. D., Ginder G. D. Specific DNA methylation sites in the vicinity of the chicken beta-globin genes. Nature. 1979 Aug 2;280(5721):419–420. doi: 10.1038/280419a0. [DOI] [PubMed] [Google Scholar]
- McGhee J. D., Wood W. I., Dolan M., Engel J. D., Felsenfeld G. A 200 base pair region at the 5' end of the chicken adult beta-globin gene is accessible to nuclease digestion. Cell. 1981 Nov;27(1 Pt 2):45–55. doi: 10.1016/0092-8674(81)90359-7. [DOI] [PubMed] [Google Scholar]
- Nagata S., Mantei N., Weissmann C. The structure of one of the eight or more distinct chromosomal genes for human interferon-alpha. Nature. 1980 Oct 2;287(5781):401–408. doi: 10.1038/287401a0. [DOI] [PubMed] [Google Scholar]
- Niall H. D., Hogan M. L., Sauer R., Rosenblum I. Y., Greenwood F. C. Sequences of pituitary and placental lactogenic and growth hormones: evolution from a primordial peptide by gene reduplication. Proc Natl Acad Sci U S A. 1971 Apr;68(4):866–870. doi: 10.1073/pnas.68.4.866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nishioka Y., Leder P. The complete sequence of a chromosomal mouse alpha--globin gene reveals elements conserved throughout vertebrate evolution. Cell. 1979 Nov;18(3):875–882. doi: 10.1016/0092-8674(79)90139-9. [DOI] [PubMed] [Google Scholar]
- Nussinov R. The universal dinucleotide asymmetry rules in DNA and the amino acid codon choice. J Mol Evol. 1981;17(4):237–244. doi: 10.1007/BF01732761. [DOI] [PubMed] [Google Scholar]
- Ohno S., Taniguchi T. Structure of a chromosomal gene for human interferon beta. Proc Natl Acad Sci U S A. 1981 Sep;78(9):5305–5309. doi: 10.1073/pnas.78.9.5305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Page G. S., Smith S., Goodman H. M. DNA sequence of the rat growth hormone gene: location of the 5' terminus of the growth hormone mRNA and identification of an internal transposon-like element. Nucleic Acids Res. 1981 May 11;9(9):2087–2104. doi: 10.1093/nar/9.9.2087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Proudfoot N. J., Maniatis T. The structure of a human alpha-globin pseudogene and its relationship to alpha-globin gene duplication. Cell. 1980 Sep;21(2):537–544. doi: 10.1016/0092-8674(80)90491-2. [DOI] [PubMed] [Google Scholar]
- Quinto C., Quiroga M., Swain W. F., Nikovits W. C., Jr, Standring D. N., Pictet R. L., Valenzuela P., Rutter W. J. Rat preprocarboxypeptidase A: cDNA sequence and preliminary characterization of the gene. Proc Natl Acad Sci U S A. 1982 Jan;79(1):31–35. doi: 10.1073/pnas.79.1.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Razin A., Riggs A. D. DNA methylation and gene function. Science. 1980 Nov 7;210(4470):604–610. doi: 10.1126/science.6254144. [DOI] [PubMed] [Google Scholar]
- Salser W. Globin mRNA sequences: analysis of base pairing and evolutionary implications. Cold Spring Harb Symp Quant Biol. 1978;42(Pt 2):985–1002. doi: 10.1101/sqb.1978.042.01.099. [DOI] [PubMed] [Google Scholar]
- Sano H., Sager R. Tissue specificity and clustering of methylated cystosines in bovine satellite I DNA. Proc Natl Acad Sci U S A. 1982 Jun;79(11):3584–3588. doi: 10.1073/pnas.79.11.3584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seiler-Tuyns A., Birnstiel M. L. Structure and expression in L-cells of a cloned H4 histone gene of the mouse. J Mol Biol. 1981 Oct 5;151(4):607–625. doi: 10.1016/0022-2836(81)90426-5. [DOI] [PubMed] [Google Scholar]
- Shine J., Seeburg P. H., Martial J. A., Baxter J. D., Goodman H. M. Construction and analysis of recombinant DNA for human chorionic somatomammotropin. Nature. 1977 Dec 8;270(5637):494–499. doi: 10.1038/270494a0. [DOI] [PubMed] [Google Scholar]
- Spritz R. A., DeRiel J. K., Forget B. G., Weissman S. M. Complete nucleotide sequence of the human delta-globin gene. Cell. 1980 Oct;21(3):639–646. doi: 10.1016/0092-8674(80)90427-4. [DOI] [PubMed] [Google Scholar]
- Spritz R. A., Jagadeeswaran P., Choudary P. V., Biro P. A., Elder J. T., deRiel J. K., Manley J. L., Gefter M. L., Forget B. G., Weissman S. M. Base substitution in an intervening sequence of a beta+-thalassemic human globin gene. Proc Natl Acad Sci U S A. 1981 Apr;78(4):2455–2459. doi: 10.1073/pnas.78.4.2455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ullrich A., Dull T. J., Gray A., Brosius J., Sures I. Genetic variation in the human insulin gene. Science. 1980 Aug 1;209(4456):612–615. doi: 10.1126/science.6248962. [DOI] [PubMed] [Google Scholar]
- van der Ploeg L. H., Flavell R. A. DNA methylation in the human gamma delta beta-globin locus in erythroid and nonerythroid tissues. Cell. 1980 Apr;19(4):947–958. doi: 10.1016/0092-8674(80)90086-0. [DOI] [PubMed] [Google Scholar]