Abstract
Plant DNA methylation is its own language, interpreted by the cell to maintain silencing of transposons, facilitate chromatin structure, and to ensure proper expression of some genes. Just as in any language, context is important. Rather than being a simple “on-off switch”, DNA methylation has a range of “meanings” dependent upon the underlying sequence and its location in the genome. Differences in the sequence context of individual sites are established, maintained, and interpreted by differing molecular pathways. Varying patterns of methylation within genes and surrounding sequences are associated with a continuous range of expression differences, from silencing to constitutive expression. These often-subtle differences have been pieced together from years of effort, but have taken off with the advent of methods for assessing methylation across entire genomes. Recognizing these patterns and identifying underlying causes is essential for understanding the function of DNA methylation and its systems-wide contribution to a range of processes in plant genomes.
Keywords: DNA methylation, Epigenetics, Epigenomics, Gene expression, Chromatin
Introduction
DNA is often called the “language of life”, but it does not operate in isolation. Genetic information first passes through multiple layers of regulation before being expressed as phenotype. These layers of regulation range from protein-protein interactions and protein modifications, to modifications made to RNA and even DNA itself. Within all this is the epigenome; consisting of histones [1], histone modifications [2], non-coding RNAs, and the methylation of DNA [3]. These directly shape the structure of the genome, defining regions of euchromatin and heterochromatin, and helping to facilitate proper gene expression. DNA methylation, the addition of a methyl group to cytosine bases, is a key part of plant epigenomes. There is growing interest not only in the possible regulatory role of DNA methylation, but also its potential as an untapped source of diversity for plant improvement. Variants in DNA methylation, or “epialleles”, can have phenotypic consequences and are in some cases “epigenetic”, resulting in heritable phenotypic variation independent from any differences in DNA sequence [4]. The affects and associations of DNA methylation are context-dependent and recognizing this is critical to a proper understanding and utilization of DNA methylation.
Multiple techniques have been developed to assess DNA methylation [5]. However, the greatest advances has come with the ability to quantify methylation at the scale of whole genomes [6]. This first became possible with the advent of array-based methods [7]. Now with whole genome bisulfite sequencing (WGBS), the production of single-base resolution maps of DNA methylation across entire genomes is possible [8, 9]. This is not without challenges in analysis and interpretation. For example, these techniques are typically applied to whole tissues consisting of mixed populations of plant cells. As a result, despite the fact that at any single base a cytosine is either methylated or not, these data represent an average of these mixed cell populations [10]. This requires careful consideration of how to determine methylation status and how to measure methylation levels. Other reviews have further discussion of the nuances of measuring and interpreting WGBS data [10].
Plants are a powerful system for studying DNA methylation and epigenetic phenomena. Although plant DNA methylation shares similarities and some of the same mechanisms as mammals [11, 12], they have also evolved their own distinct pathways [12]. In plants, as in mammals, changes in DNA methylation are important during reproduction and in reproductive tissues, a topic that has been reviewed elsewhere [13]. However, whereas the epigenome of mammals is typically reset and re-established each generation [14]; in flowering plants there is not an analogous systematic genome-wide erasure of DNA methylation like there is in mammals. Instead, the parental state is typically reinforced and stably inherited [15]. As a consequence, the spontaneous formation of epialleles can be faithfully propagated between generations and there are multiple documented cases of epigenetic inheritance in plants [16–19]. This not only makes plants ideal for studying epigenetics, but also opens the possibility for utilizing or inducing epigenetic differences for purposes of plant breeding and improvement [20, 21]. Much of what we now know about DNA methylation in plants comes from the study of one species, Arabidopsis thaliana, but this is rapidly expanding to other species (23–30). This has opened up the field of comparative epigenomics, which applies the toolkit of comparative genomics to the study of the epigenome [22]. With it has come deeper insight into the evolution, mechanisms, and context-specific effects of DNA methylation. This review will focus on recent findings from epigenomics into the context-specific patterns of DNA methylation across the genome and within genes with discussion of sources of variation, their origins, potential functions, and potential applications.
Maintenance of DNA methylation is context dependent
At the most basic level, contextual differences exist between individual cytosines based on the sequence of neighboring bases. Plants methylate DNA in three different sequence contexts: dinucleotide CpG or CG (hereafter mCG) sites and trinucleotide CHG and CHH (H = A, T, or C) sites (hereafter mCHG and mCHH) [23, 24]. This is in contrast to mammals where methylation is primarily found in the CpG context [25] and in specific cell types, the CpH context [26, 27]. Additional base modifications to cytosines, such as 5-hydroxymethylation, also exist in mammals, but these have not been identified in plants despite extensive efforts [28]. This division of plant DNA methylation into various sequence contexts has a mechanistic basis, as differing enzymes and pathways are involved in the establishment and maintenance of DNA methylation for each context [12]. This makes the appropriate choice of methods and analysis critical for studying plant DNA methylation, as methods that cannot distinguish between sequence context obscure mechanistic and functional differences.
Perhaps the most familiar type of methylation is mCG, as it is found across eukaryotes [25, 29]. Not only is mCG found across kingdoms, it is also the predominant type of methylation found in plants and animals [8, 9, 27]. This is due in part to the mechanisms that ensure its maintenance after DNA replication. CpG and CHG sites are symmetrical, as there is a mirrored cytosine on the opposing strand [23]. This is in contrast to CHH sites, which are asymmetrical [24]. The symmetrical nature of CpG sites is critical to the mechanism of its maintenance [30]. This is done by DNA (cytosine-5-)-METHYLTRANSFERASE 1 (DNMT1) in mammals [31] and its ortholog in plants METHYLTRANSFERASE 1 (MET1) [11, 32, 33]. In mammals, DNMT1 is associated with the replication machinery during DNA replication [34]. Due to the semiconservative replication of DNA, newly replicated DNA is hemi-methylated, consisting of one original methylated strand and one newly synthesized unmethylated strand. The mammalian Ubiquitin-like containing PHD and RING finger domains 1 (UHRF1) [30] and their plant orthologs, VARIANT IN METHYLATION 1 (VIM1-5) [35, 36], recognize these hemi-methylated mCG sites and recruit DNMT1/MET1 to methylate the symmetrical site on the opposing strand. In this way, mCG is maintained across cell divisions. This pathway is essential to the continued maintenance of mCG, as met1 mutants lose the majority of mCG in their genome [9, 37]. Even after MET1 is restored by outcrossing the mutant to a wild-type plant and the mutant allele segregated away, CG methylation does not fully return [38, 39].
CHG methylation, like mCG, is symmetrical [23], but it is maintained by a different pathway that involves CHROMOMETHYLASE 3 (CMT3) [40, 41]. This enzyme is part of the plant-specific CMT family of methyltransferases, characterized by a conserved CHROMO domain [42, 43]. The CHROMO domain, along with a BROMO ADJACENT HOMOLOGY (BAH) domain, facilitate the binding of the CMTs to dimethylated lysine 9 residues of histone 3 tails (H3K9me2) [44]. H3K9me2 is found primarily in regions of constitutive heterochromatin and CMT3 binds to H3K9me2, where it methylates CHG sites [41]. H3K9 methylation itself, is dependent upon mCHG [45]. The histone methyltransferases KRYPTONITE (KYP), Su(var)3-9 homologue 5 (SUVH5), and SUVH6 possess SRA domains that bind to methylated DNA, leading to the establishment of H3K9me2 [45–47]. This creates a self-reinforcing loop by which mCHG and H3K9me2 maintain their localization in the genome [45]. Indeed, loss of H3K9me2 in kyp suvh5 suvh6 triple mutants leads to a loss of mCHG throughout those regions and mCHG-deficient mutants lead to reduced H3K9me2 in heterochromatin [48].
CHH methylation is asymmetrical and as such there is no mirrored cytosine on the opposite strand to serve as a guide for maintaining methylation after DNA replication [24]. Rather it is established de novo each round by one of two mechanisms. The first involves the activity of another member of the CMT family, CMT2 [49]. Like CMT3, it possesses a CHROMO and BAH domain and interacts with H3K9me2, leading to the methylation of CHH in H3K9me2 regions [44, 49, 50]. There is also some crossover between mCHH and mCHG. Although CMT2 preferentially methylates mCHH, it does to some extent methylate mCHG [50]. The bulk of CMT2 methylation is found within transposon bodies and within the pericentromere [49]. The ability of CMT2, CMT3, and MET1 to access and methylate heterochromatic regions is dependent in part on the chromatin remodeler DECREASED DNA METHYLATION 1 (DDM1) [49, 51] and mutation of DDM1 leads to a progressive loss of methylation in all three contexts over generations [52]. There also exists a third member of the family, CMT1, however, to date its function has not been identified [42]. Indeed, CMT1 is truncated in many A. thaliana accessions, suggesting that it may be non-functional [42].
Although all three contexts have their own distinct mechanisms, they are united by one common pathway, RNA-directed DNA Methylation (RdDM) [53]. In this pathway, 24-nucleotide (24-nt) short-interfering RNAs (siRNAs) [54] are dependent on the plant-specific POLYMERASE IV (POL IV) [55–58]. These siRNAs guide the de novo methyltransferase DOMAINS REARRANGED 2 (DRM2) to methylate target regions in all three contexts [59, 60] (For an in-depth review see [12, 53]). RdDM is in particular associated with high levels of mCHH and appears to target regions like the edges of transposons [49] and euchromatin-heterochromatin boundaries, like mCHH-islands in Zea mays [61, 62].
There is also cross-talk between all these pathways. For example, in met1 mutants, there is a redistribution of H3K9me2 and histone 3 lysine 27 tri-methylation (H3K27me3) to certain loci, leading to the accumulation of mCHG at new H3K9me2 sites [63]. Crosses between met1 and wild-type plants can result in epigenetic “shock” in the F1 progeny, inducing novel epialleles that do not exist in either parent [39]. The F1 progeny of these crosses have widespread differences in small RNAs [39]. Many of the changes observed are in part due to demethylation of the seventh intron of INCREASED IN BONSAI METHYLATION 1 (IBM1), a histone demethylase responsible for removing H3K9me2 from actively transcribed genes [39, 64]. This causes a decrease in expression of IBM1 and altered H3K9me2 distributions genome-wide, which subsequently alters targeting of CMT3 and CMT2 [39, 64]. These results illustrate in part the complex interplay amongst the various DNA methylation pathways.
Genome-wide patterns of methylation in flowering plant genomes
Genomes are dynamic and vary in size [65, 66], ploidy [67], gene content [68, 69], repetitive elements [66] and more. DNA methylation is shaped by all these factors and vary considerably within species [70–72] and between species [29, 73–76]. Within A. thaliana, methylation levels in all three contexts varied among accessions, differing as much as ~2× for mCG, ~6× for mCHG, and ~11× for mCHH [72]. Methylation levels differ by even greater degrees between plant species, with mCG varying ~3×, mCHG ~9×, and mCHH ~16× [74, 75, 77, 78]. Underlying these genome-wide variations are dissimilarities in both the percentage of sites methylated and how highly those sites are methylated [75, 76]. In Poaceae species (grasses), mCHH methylation is found primarily in small regions of highly methylated sites, whereas in some of the Fabaceae (legumes), like Glycine max, it is predominated by large regions of low methylation [75]. Genome-wide association analysis (GWAS) and quantitative trait loci (QTL) analysis have implied both cis and trans acting factors in explaining methylation differences [71, 79] and many of these factors have now been identified.
The molecular pathways underlying DNA methylation are a major source of natural variation at all scales. This remains a source of variation within kingdoms as well. For instance, the roundworm Caenorhabditis elegans and the yeast Saccharomyces cerevisiae have no DNA methyltransferases in their genome and also no cytosine DNA methylation [80, 81]. Although methylation has been found in all plant genomes to date, certain pathways have been found to predominate and others to be defective. Several examples come from the Brassicaceae. In Brassica rapa a mutation in NRPD1, the largest subunit of Pol IV, leads to reduced production of Pol IV-dependent siRNAs [82]. Across the Brassicaceae methylomes sequenced to date, per-site levels of mCHG are typically lower than in other species, with the lowest being in the species Eutrema salsugineum and Conringia planisiliqua [75, 83]. Analysis of these species genomes shows that they have both independently lost CMT3, while phylogenetics studies show that CMT3 in other Brassicaceae are under relaxed selection, accumulating non-synonymous mutations at a higher rate [83, 84]. GWAS within A. thaliana revealed natural variation in methylation levels of transposons was due to a premature stop codon in CMT2 in many wild accessions [72]. Similarly, in the Poaceae, CMT2 is absent in Z. mays [49], where the majority of mCHH is due to the activity of RdDM [85]. So while plants have multiple unique methylation enzymes and pathways, these continue to evolve in a lineage specific manner, leading to increasing epigenomic diversity.
There is a strong tie between DNA methylation and the silencing of repetitive DNA, which has a strong impact on the epigenome [86–88]. Methylation levels can vary within repeats and transposons themselves depending on the particular transposon family [89] and potentially also its age [90]. Although polyploidy can and does contribute to genome size variation, it is the expansion and reduction of repetitive elements that are the primary drivers [65, 66]. As the number of repeats and transposons in a genome increase, so does the intergenic spaces and overall genome size. It is expected that overall methylation levels will vary both with genome size and repeat content. Indeed, among the Brassicaceae, much of the variation of DNA methylation observed between species is driven by differences in repetitive elements, this is especially true in conserved syntenic regions [74]. When viewed across the angiosperms, total mC (methylation in all contexts) is correlated to increasing genome size [91]. Broken down methylation contexts, both mCG and mCHG are correlated to increases in genome size and the number of repeats in the genome, surprisingly though mCHH does not [75]. This could be driven in part by differences in mCHH in the Poaceae, which have low mCHH levels, yet typically large genomes.
Functionality of nonCG methylated genes and hypothetical origins
Within plant species there are genes that are consistently methylated in all cytosine sequences (Figure 1A) [71]. In A. thaliana many of these genes are targets of RdDM and these show tissue-specific expression, being reactivated in pollen [71]. These genes share common functions based on gene ontology (GO); being enriched for hydrolases, cell wall modification, and translation machinery. These processes are essential for pollen tube tip growth, which suggests a possible developmental regulation of these RdDM-targeted loci [71]. This reactivation of silenced genes coincides with the activation of many transposons from the vegetative nucleus [71]. In A. thaliana, DDM1 is not expressed in the pollen vegetative nucleus and this is associated with the expression of transposons as well as the production of epigenetically activated small RNAs (easiRNAs) that are capable of moving to the germline nuclei [15]. It is thought that these easiRNAs are one mechanism by which epigenome states are reinforced each generation in flowering plants [15]. Some of these silenced genes are also enriched for defense responses [79]. Methylation and gene expression changes do occur in newly dividing cells upon bacterial infection [92, 93]. It is possible that these methylated genes are in fact pseudogenes and accumulate mutations due to the mutagenic effects of DNA methylation [94]. However, many of these silenced genes show no evidence of accumulating loss-of-function mutations in A. thaliana [71], which further supports a functional role for many of these genes.
Figure 1.

A) Genes with nonCG methylation (mCHG and mCHH) within gene bodies are typically targets of RNA-directed DNA methylation (RdDM) and silenced. B) Gene-body methylated genes are expressed and methylated in the CG context only. They are characterized by a depletion of mCG around the transcriptional start site (TSS), increasing mCG through the gene-body, and a depletion of mCG over the transcriptional termination site (TTS). C) Methylation of CG over the TSS is associated with silencing of expression.
Although within species comparisons imply a functionality of nonCG methylation of genes in development or as part of environmental responses, nonCG methylated states are not conserved in orthologous genes between species [74, 75]. This suggests that it largely arises de novo within interbreeding populations, diverging with increasing genetic distance. Indeed, clustering of A. thaliana accessions based on differentially methylated regions largely reconstitutes the genetic and geographic relationship of these lines [71]. Many possible hypotheses could explain this. For example, nonCG methylation might target lineage-specific or orphan genes as in the case of the Qua-Quine Startch (QQS) gene in A. thaliana [95]. However, this is insufficient to explain the number of nonCG methylated genes in some species, which can constitute up to a third of all genes [75].
In highly conserved regions between A. thaliana, A. lyrata, and C. rubella, transposon gains and losses alone explain much of methylation differences between species [74]. In many species there is an enrichment of transposons upstream and downstream of nonCG methylated genes [75].[96] At a genome-wide level, correlations are also observed for methylation levels within the coding sequence (CDS) and genome size for nonCG methylation [75, 76]. Spreading of methylation from transposons into gene bodies is known to result in silencing [97], with specific examples being the case of CmWIP1 in C. melo [98]. In Z. mays, spreading of heterochromatic states from transposons into neighboring regions and genes is limited to specific transposon families [89]. Presence-absence variation of transposons resulted in gene-expression differences between lines [89]. Between species there is a correlation of increasing CDS methylation levels with increasing repeat content in genes and for many species, an enrichment of transposons upstream and downstream of nonCG genes [75]. GWAS in A. thaliana indicates that ~35% of differentially methylated regions are associated with local genetic variants and this is likely an underestimate [71] as QTL analysis of differentially methylated regions in G. max indicates a much higher proportion segregates with the parental genotype [79]. Two examples from A. thaliana demonstrate how copy-number variation and gene structure can lead to methylation and silencing. There are three PHOSPHORIBOSYLANTHRANILATE ISOMERASE (PAI1-3) genes in most A. thaliana accessions and in some accessions, these genes are highly methylated and silenced [99, 100]. It was discovered that in lines where PAI1-3 are silenced, there is a fourth copy, PAI4, that is tandemly duplicated on the opposite strand of PAI1 resulting in a PAI1–PAI4 inverted repeat, which is sufficient to induce silencing of all PAI genes in the genome [99–101]. In another example, a truncated and rearranged copy of FOLT2 in certain A. thaliana accessions leads to silencing and methylation of FOLT1 [102]. Methylation and silencing of FOLT1 is stable and inherited even after the inducing FOLT2 copy is segregated away [102].
Despite the lack of conservation of nonCG methylation between species, there is a degree of similarity at a functional level for nonCG genes when gene ontology is taken into account, as the same GO terms are enriched even across distantly related species [75]. These include processes such as proteolysis, cell death, and defense responses and could reflect the strict regulatory control that such processes are under. Many species also show enrichment for processes like photosynthesis, electron transport chain, and primary metabolism [75]. At first glance, this is unexpected, as these are essential functions and not expected to be silenced. Closer investigation of these loci shows that typically these nuclear genes are orthologous to mitochondrial and chloroplast genes and are likely insertions from organellar genomes into the nuclear genome [75]. Indeed, the movement of DNA from plant organelles into the nucleus is a constant and ongoing process and bear the marks of accumulated mutations due to methylation [103–106]. No single hypothesis for how genes become the targets of nonCG methylation and silencing exists. Rather a combination of multiple mechanisms is most likely involved in the origins of these silenced loci.
Gene-body methylation in flowering plants
DNA methylation is typically thought of as a repressive chromatin modification, however, this is not always the case. In both plants and animals, methylation of gene bodies in only the mCG context is associated with higher gene expression and more specifically, constitutively expressed genes (Figure 1B) [29, 37, 107–109]. These gene-body methylated (gbM) genes characterized by depletion of mCG in and around the transcriptional start site (TSS) and transcriptional termination sites (TTS) [108]. The pattern of methylation appears to be a critical aspect, as methylation across the TSS by mCG alone is sufficient for transcriptional silencing (Figure 1C) [75].
The widespread nature of gbM across species [29] and its unexpected association with gene expression begs the question as to the functional role of gbM and the origin of gbM genes [110]. The methylation status of gbM genes is often conserved across orthologs between species, as in the case of O. sativa and B. distachyon [111]. These results having been generalized more recently across a greater range of species [74–76]. Although conservation is observed, it breaks down over evolutionary distance. Within a species, there is natural variation in which genes are gbM, as demonstrated in A. thaliana [72, 112]. Both gains and losses of gbM are observed between closely related species like A. lyrata and A. thaliana or O. sativa and B. distachyon [75, 111]. In one accession of M. guttatus, ~60.7% of genes are gbM [75], whereas at the other extreme there is a depletion of gbM genes in many Brassicaceae [75, 83]. Outside of the angiosperms, gbM is known to be absent in more basal plant species such as Marchantia polymorpha [76] and Selaginella moellendorffii [29]. In other non-angiosperms, such as ferns, both mCG and mCHG are found within gene-bodies and do not show typical patterns of mCG (a depletion at the TSS and TTS) [76, 84]. These results suggest that within plants, gbM as defined in this review article, is specific to angiosperms [84]. Within angiosperms, gbM has been completely lost at least twice independently in two Brassicaceae: E. salsugineum and C. planisiliqua [75, 83]. Comparison of gene expression of gbM loci from A. thaliana and orthologs in E. salsugineum indicates no overt effects on gene expression even though one group has gbM and the other group does not [83]. Many hypotheses have been put forward as to the possible functions of gbM. These include the exclusion of the histone variant H2A.Z, alternative splicing, the suppression of antisense transcription, and as a dampener on gene expression [108, 113–115]. Analysis of the transcriptome, histone variants, and histone modifications in E. salsugineum failed to support any of these hypotheses [83]. Collectively, these observations indicate that gbM is dispensable over evolutionary time.
A second line of evidence that gbM is dispensable comes from studies of epigenetic recombinant inbred lines (epiRILs) [38, 83]. The epiRILs were generated by crossing A. thaliana met1 mutants, which lose nearly all mCG in the genome, to genetically identical wild-type plants [38]. In the F2 generation, plants homozygous for wild-type MET1 were retained and selfed for eight generations. This created a population of genetically identical plants, which varied in their methylation status, being a mosaic of methylated regions from the wild-type parent and unmethylated regions from the met1 mutant [38]. This epiRIL population enabled the effects of gbM and its loss to be tested in genes that previously possessed it [83]. As in the case of E. salsugineum, there was no evidence that loss of gbM resulted in changes in gene expression, alternative splicing, or in the distribution of H2A.Z [83]. Results from gbM loss in E. salsugineum and C. planisiliqua, as well as the epiRILs, indicate no functional consequences of gbM, at least for current hypotheses, however it is possible there are yet undiscovered functions of gbM.
Origins of gene-body methylation in flowering plants
Given the evidence showing gbM is dispensable, how are additional associations explained? Beyond gene expression, many other structural and evolutionary features of gbM genes have been identified. Structurally, gbM genes in plants are typically longer and have more exons when compared to non-gbM genes [83, 111, 116]. They also have been reported to be more slowly evolving, having a lower ratio of non-synonymous to synonymous mutations (dN/dS) [83, 111, 116]. The slower rate of evolution, however, seems to break down with increasing evolutionary distance [75]. These may be accidental associations, however, united by a common link of transcription. For example, in plants, longer genes are typically more highly expressed [117] and expression levels are known to contribute to evolutionary rates [118, 119]. Associations between gbM and these features could arise as a result that these genes are more actively transcribed.
The independent loss of gbM and cmt3 in both E. salsugineum and C. planisiliqua supports the hypothesis that they are connected (Figure 2). One possible explanation suggest the involvement of IBM1, a histone demethylase that removes H3K9me2 from actively transcribed genes [120]. Mutations in ibm1 result in H3K9me2 accumulation specifically in gbM loci [120]. This leads not only to increased mCHG through the recruitment of CMT3 within these gene bodies, but also to increased mCG and mCHH through yet unknown means (Figure 2) [83]. That IBM1 is necessary for the active removal of H3K9me2 in gbM genes, suggests that H3K9me2 is found at these loci at low levels in wild-type plants. Once mCG is established in the gene-body, as long as it does not impede normal gene function, it would be faithfully maintained by MET1 (Figure 2) [83]. In the absence of mechanisms to re-establish new mCG, gbM would decay slowly due to the fidelity of maintenance dependent on MET1 and with the epimutation rate [17, 83, 121]. This would explain why loss of gbM has not been observed in A. thaliana cmt3 mutants and poses a challenge for proving such a hypothesis as it may require many generations to observe. The example of gbM in angiosperms and its association with CMT3 demonstrates the power of comparative epigenomics for hypothesis generation and for hypothesis testing.
Figure 2.

Proposed model for the establishment of gene-body methylation. IBM1 actively removes H3K9me2 from transcribed genes, indicating that H3K9me2 at some level is incorporated in these genes. In ibm1 mutants there is an accumulation of CG, CHG, and CHH methylation in transcribed genes. CMT3 is associated with H3K9me2 and methylates CHG sites. Unknown factors (X?) mediate methylation of CG and CHH sites. After DNA replication, MET1 maintains CG methylation in transcribed genes, while CHG and CHH methylation is lost.
Methylation of regulatory regions and gene expression
Methylation of regulatory elements of genes also affects transcription factor (TF) binding and gene expression. In A. thaliana, the genome-wide cis-element binding (cistrome) has been assessed for 529 different TFs using DNA affinity purification sequencing (DAP-seq) [122]. Modified versions of this technique were also used to assess the impact of DNA methylation on TF occupancy. The majority (72%) of TFs were inhibited by methylation, whereas 24% showed no inhibition. Interestingly, 4.3% of TFs preferentially bound methylated DNA, further illustrating the context-specific effects of DNA methylation. A fascinating example of this involves the regulation of a DNA glycosylase, REPRESSOR OF SILENCING 1 (ROS1), which leads to demethylation of DNA [123, 124]. In this example, methylation of a neighboring TE in the promoter region of ROS1 promotes gene expression, whereas loss of methylation suppresses it. It is proposed that this serves as a “rheostat”, fine tuning the expression of ROS1 to variations in the methylation level of the genome [123, 124].
Transcription factor occupancy or gene expression itself may drive changes in DNA methylation [125]. One of the best examples to date in plants examined the relationship between DNA methylation and gene expression of Oryza sativa and A. thaliana under phosphate starvation [126]. By sampling from multiple time points, Secco and colleagues (2015) found that gene expression changes preceded changes in methylation in upstream or downstream regions in O. sativa. Most of the methylation differences observed in O. sativa were associated with TEs, in particular MINITURE INVERTED REPEATS (MITEs). Upon resupply of phosphate, gene expression returned to levels found in plants that were not subject to phosphate-starvation. Methylation profiles lagged behind gene-expression changes, being more similar to phosphate-starved samples even after 31 days of recovery [126]. Meanwhile, few methylation differences where associated with phosphate-starvation in A. thaliana [126]. These results strongly suggest that in certain cases, it is changes in gene expression that drive methylation differences in regulatory regions, rather than the other way around.
In Z. mays the presence of high levels of mCHH upstream and downstream of genes has been observed and this is associated with genes that are more highly expressed [61]. These regions have been termed “CHH islands” and they were originally found to be enriched among MITE transposons [61] and other terminal inverted repeat (TIR) sequences [62]. Methylated CHH (mCHH) islands are primarily the result of RdDM pathways, as they are lost in the mediator of paramutation 1 (mop1) and mop3 mutants, known components of RdDM in Z. mays [61, 62]. Like so many aspects of DNA methylation, their association with gene expression begs the question of which came first. In mop1 and mop3 mutants, loss of mCHH islands has no affect on neighboring gene expression and presence-absence variation between accessions was found indicating that mCHH islands do not drive gene expression [62]. Furthermore, although mCHH islands could be identified across species [75], many showed no association with differences in gene expression. Rather, the primary factor in determining their abundance appears to be the presence of repetitive elements upstream or downstream of genes, as there is a correlation between the two [61, 62, 75]. Indeed, differences in the distribution of repeats near genes explains the depletion and enrichment of mCHH islands between species [75]. The presence of higher mCG and mCHG levels upstream or downstream of mCHH islands in the direction away from the gene and changes in histone modifications indicates that mCHH islands may mark a boundary between euchromatin and localized heterochromatin in plant genomes [62]. The examples of ROS1 and mCHH islands, shows that there is much we still do not know about DNA methylation and its impact on the regulatory regions of genes.
Epialleles as a source of phenotypic variation in nature and in agriculture
As plants do not reset their epigenome each generation in a manner similar to mammals, changes to methylated states that occur in the germline can be passed on to offspring, resulting in epigenetic variation. One classic example is the peloric mutant in Linaria vulgaris. In wild-type plants, flowers have dorsoventral asymmetry, however, in the peloric mutant, the flowers display a radially symmetrical floral phenotype [127]. These mutants have a storied history, having been described in 1749 by Carl Linnaeus. 250 year later, the basis of this mutation was identified as the hypermethylation and silencing of Lcyc, the L. vulgaris ortholog of the cycloidea gene in Antirrhinum majus [127]. Many other examples have been identified since: the Colorless non-ripening (Cnr) locus in S. lycopersicum (tomato) [128, 129], CmWIP1 in C. melo (melon) [98], and the PAI [100, 101, 130], FOLT1/FOLT2 [102], and QQS genes in A. thaliana [95, 131]. Epialleles like Cnr and CmWIP1 directly impact traits of agricultural production and importance. The Cnr epialle in tomato leads to silencing of a SQUAMOSA promoter binding protein–like essential to fruit ripening [129]. This results in an inhibition of ripening, producing instead a colorless fruit with a loss of cell adhesion and unfavorable characteristics [128, 129]. The epiallele at CmWIP1, as previously mentioned, results in spreading of DNA methylation from a neighboring transposon [98]. CmWIP1 is involved in flower sex determination in melons and in the silenced epiallele, produces female flowers. With microarrays and whole-genome bisulfite sequencing, many more such epialleles have and are being identified [16, 17, 79, 132].
The ability of epialleles to impact gene expression and subsequently agriculturally important phenotypes has sparked interest in their use for agriculture [21]. An excellent example is the MANTLED locus of Elaeis guineensis (African oil palm) [133]. Micropropagation of E. guineensis leads to a high frequency of plants producing sterile floral organs, greatly reducing yield. In an epigenome-wide association study, this was mapped to loss of methylation at a transposable element called KARMA [133]. KARMA, is located in the intron of the gene DEFICIENS and loss of methylation leads to missplicing of DEFICIENS and improper flower formation [133]. Having identified the underlying epiallele, it will be possible to test micropropagated plants for this epiallele prior to planting. Epialleles could also potentially be used in breeding programs, either as a source of novel phenotypic variation [38, 134–136] or by using epialleles as molecular markers for creating genetic maps and mapping quantitative traits [135]. Indeed, modern breeding methods like genomic selection are already being adapted for use of epigenetic markers [137]
Conclusion
As part of the epigenome, DNA methylation is an additional layer through which the language of the genome is interpreted. However, context is important and meanings can change with subtle variations in patterns of the epigenome. DNA methylation is not a simple on-off switch. In plants, DNA can be methylated at three sequence contexts: CG, CHG, and CHH. These are established and maintained by distinct molecular mechanisms, and in the cases of mCHG and mCHH, processes that are unique to plants. The effect of methylation on gene expression is highly dependent upon the type of methylation as well as the pattern of that methylation within or outside of the gene. Decades of work have gone into understanding the nature of DNA methylation, which is now being accelerated by whole genome approaches and a flood of new data. While much progress has been made there remains much that is unknown, in particular the effects of methylation outside of gene bodies upon gene expression and how targeting of certain genes by DNA methylation originates. The full extent of epigenetic variation is also not understood, despite its potential to impact phenotypic variation and traits of agricultural importance. Wider studies that take from multiple populations, species, and mutants will be needed for a deeper understanding of DNA methylation and its integration into how it affects gene expression.
Highlights.
Plants methylate DNA in three sequence contexts: CG, CHG, and CHH (H = A, T, C).
Methylation levels and distribution differ widely between species.
Distinct gene methylation patterns associate with different transcriptional results.
Gene body methylation is dispensable over evolutionary time.
Acknowledgments
This work was supported by the National Science Foundation (NSF) (MCB – 1339194, IOS – 1546867) and the National Institutes of Health (R00GM100000) to R.J.S. C.E.N was supported by a NSF postdoctoral fellowship (IOS – 1402183).
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Talbert PB, Henikoff S. Histone variants–ancient wrap artists of the epigenome. Nat Rev Mol Cell Biol. 2010;11(4):264–75. doi: 10.1038/nrm2861. [DOI] [PubMed] [Google Scholar]
- 2.Zhang K, et al. Distinctive core histone post-translational modification patterns in Arabidopsis thaliana. PLoS One. 2007;2(11):e1210. doi: 10.1371/journal.pone.0001210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Finnegan EJ, et al. DNA Methylation in Plants. Annu Rev Plant Physiol Plant Mol Biol. 1998;49:223–247. doi: 10.1146/annurev.arplant.49.1.223. [DOI] [PubMed] [Google Scholar]
- 4.Richards EJ. Inherited epigenetic variation–revisiting soft inheritance. Nat Rev Genet. 2006;7(5):395–401. doi: 10.1038/nrg1834. [DOI] [PubMed] [Google Scholar]
- 5.Shen L, Waterland RA. Methods of DNA methylation analysis. Curr Opin Clin Nutr Metab Care. 2007;10(5):576–81. doi: 10.1097/MCO.0b013e3282bf6f43. [DOI] [PubMed] [Google Scholar]
- 6.Lister R, Ecker JR. Finding the fifth base: genome-wide sequencing of cytosine methylation. Genome Res. 2009;19(6):959–66. doi: 10.1101/gr.083451.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Schumacher A, et al. Microarray-based DNA methylation profiling: technology and applications. Nucleic Acids Res. 2006;34(2):528–42. doi: 10.1093/nar/gkj461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cokus SJ, et al. Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature. 2008;452(7184):215–9. doi: 10.1038/nature06745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lister R, et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 2008;133(3):523–36. doi: 10.1016/j.cell.2008.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Schultz MD, Schmitz RJ, Ecker JR. ‘Leveling’ the playing field for analyses of single-base resolution DNA methylomes. Trends Genet. 2012;28(12):583–5. doi: 10.1016/j.tig.2012.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Finnegan EJ, Dennis ES. Isolation and identification by sequence homology of a putative cytosine methyltransferase from Arabidopsis thaliana. Nucleic Acids Res. 1993;21(10):2383–8. doi: 10.1093/nar/21.10.2383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Law JA, Jacobsen SE. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet. 2010;11(3):204–20. doi: 10.1038/nrg2719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kawashima T, Berger F. Epigenetic reprogramming in plant sexual reproduction. Nat Rev Genet. 2014;15(9):613–24. doi: 10.1038/nrg3685. [DOI] [PubMed] [Google Scholar]
- 14.Seisenberger S, et al. Reprogramming DNA methylation in the mammalian life cycle: building and breaking epigenetic barriers. Philos Trans R Soc Lond B Biol Sci. 2013;368(1609):20110330. doi: 10.1098/rstb.2011.0330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Slotkin RK, et al. Epigenetic reprogramming and small RNA silencing of transposable elements in pollen. Cell. 2009;136(3):461–72. doi: 10.1016/j.cell.2008.12.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Becker C, et al. Spontaneous epigenetic variation in the Arabidopsis thaliana methylome. Nature. 2011;480(7376):245–9. doi: 10.1038/nature10555. [DOI] [PubMed] [Google Scholar]
- 17.Schmitz RJ, et al. Transgenerational epigenetic instability is a source of novel methylation variants. Science. 2011;334(6054):369–73. doi: 10.1126/science.1212959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Heard E, Martienssen RA. Transgenerational epigenetic inheritance: myths and mechanisms. Cell. 2014;157(1):95–109. doi: 10.1016/j.cell.2014.02.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hagmann J, et al. Century-scale methylome stability in a recently diverged Arabidopsis thaliana lineage. PLoS Genet. 2015;11(1):e1004920. doi: 10.1371/journal.pgen.1004920. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Jaligot E, Rival A. Applying Epigenetics in Plant Breeding. Balancing Genome Stability and Phenotypic Plasticity. 2015:159–192. [Google Scholar]
- 21.Ji L, Neumann DA, Schmitz RJ. Crop Epigenomics: Identifying, Unlocking, and Harnessing Cryptic Variation in Crop Genomes. Mol Plant. 2015;8(6):860–70. doi: 10.1016/j.molp.2015.01.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhong X. Comparative epigenomics: a powerful tool to understand the evolution of DNA methylation. New Phytol. 2015 doi: 10.1111/nph.13540. [DOI] [PubMed] [Google Scholar]
- 23.Gruenbaum Y. Sequence specificity of methylation in higher plant DNA. Nature. 1981;292(5826):860–862. doi: 10.1038/292860a0. [DOI] [PubMed] [Google Scholar]
- 24.Meyer P, Niedenhof I, ten Lohuis M. Evidence for cytosine methylation of nonsymmetrical sequences in transgenic Petunia hybrida. EMBO J. 1994;13(9):2084–8. doi: 10.1002/j.1460-2075.1994.tb06483.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bird AP. DNA methylation patterns and epigenetic memory. Genes Dev. 2002;16:6–21. doi: 10.1101/gad.947102. [DOI] [PubMed] [Google Scholar]
- 26.Ramsahoye BH, et al. Non-CpG methylation is prevalent in embryonic stem cells and may be mediated by DNA methyltransferase 3a. Proceedings of the National Academy of Sciences. 97(10):5237–5242. doi: 10.1073/pnas.97.10.5237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Lister R, et al. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature. 2009;462(7271):315–22. doi: 10.1038/nature08514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Erdmann RM, et al. 5-hydroxymethylcytosine is not present in appreciable quantities in Arabidopsis DNA. G3 (Bethesda) 2014;5(1):1–8. doi: 10.1534/g3.114.014670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Zemach A, et al. Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science. 2010;328(5980):916–9. doi: 10.1126/science.1186366. [DOI] [PubMed] [Google Scholar]
- 30.Bostick M, et al. UHRF1 plays a role in maintaining DNA methylation in mammalian cells. Science. 2007;317(5845):1760–4. doi: 10.1126/science.1147939. [DOI] [PubMed] [Google Scholar]
- 31.Bestor T, et al. Cloning and sequencing of a cDNA encoding DNA methyltransferase of mouse cells. The carboxyl-terminal domain of the mammalian enzymes is related to bacterial restriction methyltransferases. J Mol Biol. 1988;203(4):971–83. doi: 10.1016/0022-2836(88)90122-2. [DOI] [PubMed] [Google Scholar]
- 32.Finnegan EJ, Peacock WJ, Dennis ES. Reduced DNA methylation in Arabidopsis thaliana results in abnormal plant development. Proc Natl Acad Sci U S A. 1996;93(16):8449–54. doi: 10.1073/pnas.93.16.8449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ronemus MJ, et al. Demethylation-induced developmental pleiotropy in Arabidopsis. Science. 1996;273(5275):654–7. doi: 10.1126/science.273.5275.654. [DOI] [PubMed] [Google Scholar]
- 34.Vertino PM, et al. DNMT1 is a component of a multiprotein DNA replication complex. Cell Cycle. 2002;1(6):416–23. doi: 10.4161/cc.1.6.270. [DOI] [PubMed] [Google Scholar]
- 35.Woo HR, et al. VIM1, a methylcytosine-binding protein required for centromeric heterochromatinization. Genes Dev. 2007;21(3):267–77. doi: 10.1101/gad.1512007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Woo HR, Dittmer TA, Richards EJ. Three SRA-domain methylcytosine-binding proteins cooperate to maintain global CpG methylation and epigenetic silencing in Arabidopsis. PLoS Genet. 2008;4(8):e1000156. doi: 10.1371/journal.pgen.1000156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhang X, et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis. Cell. 2006;126(6):1189–201. doi: 10.1016/j.cell.2006.08.003. [DOI] [PubMed] [Google Scholar]
- 38.Reinders J, et al. Compromised stability of DNA methylation and transposon immobilization in mosaic Arabidopsis epigenomes. Genes Dev. 2009;23(8):939–50. doi: 10.1101/gad.524609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Rigal M, et al. Epigenome confrontation triggers immediate reprogramming of DNA methylation and transposon silencing in Arabidopsis thaliana F1 epihybrids. Proc Natl Acad Sci U S A. 2016 doi: 10.1073/pnas.1600672113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lindroth AM, et al. Requirement of CHROMOMETHYLASE3 for maintenance of CpXpG methylation. Science. 2001;292(5524):2077–80. doi: 10.1126/science.1059745. [DOI] [PubMed] [Google Scholar]
- 41.Jackson JP, et al. Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature. 2002;416(6880):556–60. doi: 10.1038/nature731. [DOI] [PubMed] [Google Scholar]
- 42.Henikoff S, Comai L. A DNA methyltransferase homolog with a chromodomain exists in multiple polymorphic forms in Arabidopsis. Genetics. 1998;149(1):307–18. doi: 10.1093/genetics/149.1.307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Pavlopoulou A, Kossida S. Plant cytosine-5 DNA methyltransferases: structure, function, and molecular evolution. Genomics. 2007;90(4):530–41. doi: 10.1016/j.ygeno.2007.06.011. [DOI] [PubMed] [Google Scholar]
- 44.Du J, et al. Dual binding of chromomethylase domains to H3K9me2-containing nucleosomes directs DNA methylation in plants. Cell. 2012;151(1):167–80. doi: 10.1016/j.cell.2012.07.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Johnson LM, Cao XF, Jacobsen SE. Interplay between two epigenetic marks: DNA methylation and histone H3 lysine 9 methylation. Current Biology. 2002;12(16):1360–1367. doi: 10.1016/s0960-9822(02)00976-4. [DOI] [PubMed] [Google Scholar]
- 46.Ebbs ML, Bartee L, Bender J. H3 lysine 9 methylation is maintained on a transcribed inverted repeat by combined action of SUVH6 and SUVH4 methyltransferases. Mol Cell Biol. 2005;25(23):10507–15. doi: 10.1128/MCB.25.23.10507-10515.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ebbs ML, Bender J. Locus-specific control of DNA methylation by the Arabidopsis SUVH5 histone methyltransferase. Plant Cell. 2006;18(5):1166–76. doi: 10.1105/tpc.106.041400. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Stroud H, et al. Comprehensive analysis of silencing mutants reveals complex regulation of the Arabidopsis methylome. Cell. 2013;152(1–2):352–64. doi: 10.1016/j.cell.2012.10.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Zemach A, et al. The Arabidopsis nucleosome remodeler DDM1 allows DNA methyltransferases to access H1-containing heterochromatin. Cell. 2013;153(1):193–205. doi: 10.1016/j.cell.2013.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Stroud H, et al. Non-CG methylation patterns shape the epigenetic landscape in Arabidopsis. Nat Struct Mol Biol. 2014;21(1):64–72. doi: 10.1038/nsmb.2735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Vongs A, et al. Arabidopsis thaliana DNA methylation mutants. Science. 1993;260(5116):1926–8. doi: 10.1126/science.8316832. [DOI] [PubMed] [Google Scholar]
- 52.Kakutani T, Jeddeloh JA, Richards EJ. Characterization of an Arabidopsis thaliana DNA hypomethylation mutant. Nucleic Acids Res. 1995;23(1):130–7. doi: 10.1093/nar/23.1.130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Matzke MA, Mosher RA. RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat Rev Genet. 2014;15(6):394–408. doi: 10.1038/nrg3683. [DOI] [PubMed] [Google Scholar]
- 54.Chan SW, et al. RNA silencing genes control de novo DNA methylation. Science. 2004;303(5662):1336. doi: 10.1126/science.1095989. [DOI] [PubMed] [Google Scholar]
- 55.Herr AJ, et al. RNA polymerase IV directs silencing of endogenous DNA. Science. 2005;308(5718):118–20. doi: 10.1126/science.1106910. [DOI] [PubMed] [Google Scholar]
- 56.Onodera Y, et al. Plant nuclear RNA polymerase IV mediates siRNA and DNA methylation-dependent heterochromatin formation. Cell. 2005;120(5):613–22. doi: 10.1016/j.cell.2005.02.007. [DOI] [PubMed] [Google Scholar]
- 57.Luo J, Hall BD. A multistep process gave rise to RNA polymerase IV of land plants. J Mol Evol. 2007;64(1):101–12. doi: 10.1007/s00239-006-0093-z. [DOI] [PubMed] [Google Scholar]
- 58.Ream TS, et al. Subunit compositions of the RNA-silencing enzymes Pol IV and Pol V reveal their origins as specialized forms of RNA polymerase II. Mol Cell. 2009;33(2):192–203. doi: 10.1016/j.molcel.2008.12.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Cao X, Jacobsen SE. Locus-specific control of asymmetric and CpNpG methylation by the DRM and CMT3 methyltransferase genes. Proc Natl Acad Sci U S A. 2002;99(Suppl 4):16491–8. doi: 10.1073/pnas.162371599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Cao X, Jacobsen SE. Role of the arabidopsis DRM methyltransferases in de novo DNA methylation and gene silencing. Curr Biol. 2002;12(13):1138–44. doi: 10.1016/s0960-9822(02)00925-9. [DOI] [PubMed] [Google Scholar]
- 61.Gent JI, et al. CHH islands: de novo DNA methylation in near-gene chromatin regulation in maize. Genome Res. 2013;23(4):628–37. doi: 10.1101/gr.146985.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Li Q, et al. RNA-directed DNA methylation enforces boundaries between heterochromatin and euchromatin in the maize genome. Proc Natl Acad Sci U S A. 2015 doi: 10.1073/pnas.1514680112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Deleris A, et al. Loss of the DNA methyltransferase MET1 Induces H3K9 hypermethylation at PcG target genes and redistribution of H3K27 trimethylation to transposons in Arabidopsis thaliana. PLoS Genet. 2012;8(11):e1003062. doi: 10.1371/journal.pgen.1003062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Rigal M, et al. DNA methylation in an intron of the IBM1 histone demethylase gene stabilizes chromatin modification patterns. EMBO J. 2012;31(13):2981–93. doi: 10.1038/emboj.2012.141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Bennetzen JL, Ma J, Devos KM. Mechanisms of recent genome size variation in flowering plants. Ann Bot. 2005;95(1):127–32. doi: 10.1093/aob/mci008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Bennetzen JL, Wang H. The contributions of transposable elements to the structure, function, and evolution of plant genomes. Annu Rev Plant Biol. 2014;65:505–30. doi: 10.1146/annurev-arplant-050213-035811. [DOI] [PubMed] [Google Scholar]
- 67.Soltis PS, et al. Polyploidy and genome evolution in plants. Curr Opin Genet Dev. 2015;35:119–125. doi: 10.1016/j.gde.2015.11.003. [DOI] [PubMed] [Google Scholar]
- 68.Sebat J, et al. Large-scale copy number polymorphism in the human genome. Science. 2004;305(5683):525–8. doi: 10.1126/science.1098918. [DOI] [PubMed] [Google Scholar]
- 69.Swanson-Wagner RA, et al. Pervasive gene content variation and copy number variation in maize and its undomesticated progenitor. Genome Res. 2010;20(12):1689–99. doi: 10.1101/gr.109165.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Eichten SR, et al. Epigenetic and genetic influences on DNA methylation variation in maize populations. Plant Cell. 2013;25(8):2783–97. doi: 10.1105/tpc.113.114793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Schmitz RJ, et al. Patterns of population epigenomic diversity. Nature. 2013;495(7440):193–8. doi: 10.1038/nature11968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Dubin MJ, et al. DNA methylation in Arabidopsis has a genetic basis and shows evidence of local adaptation. Elife. 2015;4:e05255. doi: 10.7554/eLife.05255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Feng S, et al. Conservation and divergence of methylation patterning in plants and animals. Proc Natl Acad Sci U S A. 2010;107(19):8689–94. doi: 10.1073/pnas.1002720107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Seymour DK, et al. Evolution of DNA methylation patterns in the Brassicaceae is driven by differences in genome organization. PLoS Genet. 2014;10(11):e1004785. doi: 10.1371/journal.pgen.1004785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Niederhuth CE, et al. Widespread natural variation of DNA methylation within angiosperms. bioRxiv. 2016 doi: 10.1186/s13059-016-1059-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Takuno S, Ran J-H, Gaut BS. Evolutionary patterns of genic DNA methylation vary across land plants. Nature Plants. 2016;2(2):15222. doi: 10.1038/nplants.2015.222. [DOI] [PubMed] [Google Scholar]
- 77.Colicchio JM, et al. DNA methylation and gene expression in Mimulus guttatus. BMC Genomics. 2015;16:507. doi: 10.1186/s12864-015-1668-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Wang H, et al. CG gene body DNA methylation changes and evolution of duplicated genes in cassava. Proc Natl Acad Sci U S A. 2015 doi: 10.1073/pnas.1519067112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Schmitz RJ, et al. Epigenome-wide inheritance of cytosine methylation variants in a recombinant inbred population. Genome Res. 2013;23(10):1663–74. doi: 10.1101/gr.152538.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.PROFFITT JH, et al. 5-Methylcytosine Is Not Detectable in Saccharomyces cerevisiae DNA. MOLECULAR AND CELLULAR BIOLOGY. 1984;4(5):985–988. doi: 10.1128/mcb.4.5.985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Yi S. Birds do it, bees do it, worms and ciliates do it too: DNA methylation from unexpected corners of the tree of life. Genome Biol. 2012;13(10):174. doi: 10.1186/gb-2012-13-10-174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Huang Y, Kendall T, Mosher RA. Pol IV-Dependent siRNA Production is Reduced in Brassica rapa. Biology (Basel) 2013;2(4):1210–23. doi: 10.3390/biology2041210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Bewick AJ, et al. On the origin and evolutionary consequences of gene body DNA methylation. Proc Natl Acad Sci U S A. 2016;113(32):9111–6. doi: 10.1073/pnas.1604666113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Bewick AJ, et al. The evolution of CHROMOMETHYLTRANSFERASES and gene body DNA methylation in plants. bioRxiv. 2016 [Google Scholar]
- 85.Li Q, et al. Genetic perturbation of the maize methylome. Plant Cell. 2014;26(12):4602–16. doi: 10.1105/tpc.114.133140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Chandler VL, Walbot V. DNA modification of a maize transposable element correlates with loss of activity. Proc Natl Acad Sci U S A. 1986;83(6):1767–71. doi: 10.1073/pnas.83.6.1767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Chomet PS, Wessler S, Dellaporta SL. Inactivation of the Maize Transposable Element Activator (Ac) Is Associated with Its DNA Modification. Embo Journal. 1987;6(2):295–302. doi: 10.1002/j.1460-2075.1987.tb04753.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Rabinowicz PD, et al. Differential methylation of genes and repeats in land plants. Genome Res. 2005;15(10):1431–40. doi: 10.1101/gr.4100405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Eichten SR, et al. Spreading of heterochromatin is limited to specific families of maize retrotransposons. PLoS Genet. 2012;8(12):e1003127. doi: 10.1371/journal.pgen.1003127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Maumus F, Quesneville H. Ancestral repeats have shaped epigenome and genome composition for millions of years in Arabidopsis thaliana. Nat Commun. 2014;5:4104. doi: 10.1038/ncomms5104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Alonso C, et al. Global DNA cytosine methylation as an evolving trait: phylogenetic signal and correlated evolution with genome size in angiosperms. Front Genet. 2015;6:4. doi: 10.3389/fgene.2015.00004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Dowen RH, et al. Widespread dynamic DNA methylation in response to biotic stress. Proceedings of the National Academy of Sciences of the United States of America. 2012;109(32):E2183–E2191. doi: 10.1073/pnas.1209329109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Yu A, et al. Dynamics and biological relevance of DNA demethylation in Arabidopsis antibacterial defense. Proc Natl Acad Sci U S A. 2013;110(6):2389–94. doi: 10.1073/pnas.1211757110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Shen JC, Rideout WM, 3rd, Jones PA. The rate of hydrolytic deamination of 5-methylcytosine in double-stranded DNA. Nucleic Acids Res. 1994;22(6):972–6. doi: 10.1093/nar/22.6.972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Silveira AB, et al. Extensive natural epigenetic variation at a de novo originated gene. PLoS Genet. 2013;9(4):e1003437. doi: 10.1371/journal.pgen.1003437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Niederhuth CE, et al. Widespread natural variation of DNA methylation within angiosperms. bioRxiv. 2016 doi: 10.1186/s13059-016-1059-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Weil C, Martienssen R. Epigenetic interactions between transposons and genes: lessons from plants. Curr Opin Genet Dev. 2008;18(2):188–92. doi: 10.1016/j.gde.2008.01.015. [DOI] [PubMed] [Google Scholar]
- 98.Martin A, et al. A transposon-induced epigenetic change leads to sex determination in melon. Nature. 2009;461(7267):1135–8. doi: 10.1038/nature08498. [DOI] [PubMed] [Google Scholar]
- 99.Bender J, Fink GR. Epigenetic Control of an Endogenous Gene Family Is Revealed by a Novel Blue Fluorescent Mutant of Arabidopsis. Cell. 1995;83(5):725–734. doi: 10.1016/0092-8674(95)90185-x. [DOI] [PubMed] [Google Scholar]
- 100.Melquist S, Luff B, Bender J. Arabidopsis PAI gene arrangements, cytosine methylation and expression. Genetics. 1999;153(1):401–13. doi: 10.1093/genetics/153.1.401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Luff B, Pawlowski L, Bender J. An inverted repeat triggers cytosine methylation of identical sequences in Arabidopsis. Mol Cell. 1999;3(4):505–11. doi: 10.1016/s1097-2765(00)80478-5. [DOI] [PubMed] [Google Scholar]
- 102.Durand S, et al. Rapid establishment of genetic incompatibility through natural epigenetic variation. Curr Biol. 2012;22(4):326–31. doi: 10.1016/j.cub.2011.12.054. [DOI] [PubMed] [Google Scholar]
- 103.Stegemann S, et al. High-frequency gene transfer from the chloroplast genome to the nucleus. Proc Natl Acad Sci U S A. 2003;100(15):8828–33. doi: 10.1073/pnas.1430924100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Huang CY, et al. Mutational decay and age of chloroplast and mitochondrial genomes transferred recently to angiosperm nuclear chromosomes. Plant Physiol. 2005;138(3):1723–33. doi: 10.1104/pp.105.060327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Kleine T, Maier UG, Leister D. DNA transfer from organelles to the nucleus: the idiosyncratic genetics of endosymbiosis. Annu Rev Plant Biol. 2009;60:115–38. doi: 10.1146/annurev.arplant.043008.092119. [DOI] [PubMed] [Google Scholar]
- 106.Roark LM, et al. Recent and frequent insertions of chloroplast DNA into maize nuclear chromosomes. Cytogenet Genome Res. 2010;129(1–3):17–23. doi: 10.1159/000312724. [DOI] [PubMed] [Google Scholar]
- 107.Tran RK, et al. DNA methylation profiling identifies CG methylation clusters in Arabidopsis genes. Curr Biol. 2005;15(2):154–9. doi: 10.1016/j.cub.2005.01.008. [DOI] [PubMed] [Google Scholar]
- 108.Zilberman D, et al. Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nat Genet. 2007;39(1):61–9. doi: 10.1038/ng1929. [DOI] [PubMed] [Google Scholar]
- 109.Baubec T, et al. Genomic profiling of DNA methyltransferases reveals a role for DNMT3B in genic methylation. Nature. 2015;520(7546):243–7. doi: 10.1038/nature14176. [DOI] [PubMed] [Google Scholar]
- 110.Teixeira FK, Colot V. Gene body DNA methylation in plants: a means to an end or an end to a means? EMBO J. 2009;28(8):997–8. doi: 10.1038/emboj.2009.87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Takuno S, Gaut BS. Gene body methylation is conserved between plant orthologs and is of evolutionary consequence. Proc Natl Acad Sci U S A. 2013;110(5):1797–802. doi: 10.1073/pnas.1215380110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Kawakatsu T, et al. Epigenomic Diversity in a Global Collection of Arabidopsis thaliana Accessions. Cell. 2016;166(2):492–505. doi: 10.1016/j.cell.2016.06.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Zilberman D, et al. Histone H2A.Z and DNA methylation are mutually antagonistic chromatin marks. Nature. 2008;456(7218):125–9. doi: 10.1038/nature07324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Coleman-Derr D, Zilberman D. Deposition of histone variant H2A.Z within gene bodies regulates responsive genes. PLoS Genet. 2012;8(10):e1002988. doi: 10.1371/journal.pgen.1002988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Regulski M, et al. The maize methylome influences mRNA splice sites and reveals widespread paramutation-like switches guided by small RNA. Genome Res. 2013;23(10):1651–62. doi: 10.1101/gr.153510.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Takuno S, Gaut BS. Body-methylated genes in Arabidopsis thaliana are functionally important and evolve slowly. Mol Biol Evol. 2012;29(1):219–27. doi: 10.1093/molbev/msr188. [DOI] [PubMed] [Google Scholar]
- 117.Ren XY, et al. In plants, highly expressed genes are the least compact. Trends Genet. 2006;22(10):528–32. doi: 10.1016/j.tig.2006.08.008. [DOI] [PubMed] [Google Scholar]
- 118.Pal C, Papp B, Hurst LD. Highly expressed genes in yeast evolve slowly. Genetics. 2001;158(2):927–31. doi: 10.1093/genetics/158.2.927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Yang L, Gaut BS. Factors that contribute to variation in evolutionary rate among Arabidopsis genes. Mol Biol Evol. 2011;28(8):2359–69. doi: 10.1093/molbev/msr058. [DOI] [PubMed] [Google Scholar]
- 120.Miura A, et al. An Arabidopsis jmjC domain protein protects transcribed genes from DNA methylation at CHG sites. EMBO J. 2009;28(8):1078–86. doi: 10.1038/emboj.2009.59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.van der Graaf A, et al. Rate, spectrum, and evolutionary dynamics of spontaneous epimutations. Proc Natl Acad Sci U S A. 2015 doi: 10.1073/pnas.1424254112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.O’Malley RC, et al. Cistrome and Epicistrome Features Shape the Regulatory DNA Landscape. Cell. 2016;165(5):1280–92. doi: 10.1016/j.cell.2016.04.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Lei M, et al. Regulatory link between DNA methylation and active demethylation in Arabidopsis. Proc Natl Acad Sci U S A. 2015 doi: 10.1073/pnas.1502279112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Williams BP, et al. Methylation-sensitive expression of a DNA demethylase gene serves as an epigenetic rheostat. PLoS Genet. 2015;11(3):e1005142. doi: 10.1371/journal.pgen.1005142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Feldmann A, et al. Transcription factor occupancy can mediate active turnover of DNA methylation at regulatory regions. PLoS Genet. 2013;9(12):e1003994. doi: 10.1371/journal.pgen.1003994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Secco D, et al. Stress induced gene expression drives transient DNA methylation changes at adjacent repetitive elements. Elife. 2015;4 doi: 10.7554/eLife.09343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Cubas P, Vincent C, Coen E. An epigenetic mutation responsible for natural variation in floral symmetry. Nature. 1999;401(6749):157–61. doi: 10.1038/43657. [DOI] [PubMed] [Google Scholar]
- 128.Thompson AJ, et al. Molecular and genetic characterization of a novel pleiotropic tomato-ripening mutant. Plant Physiol. 1999;120(2):383–90. doi: 10.1104/pp.120.2.383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Manning K, et al. A naturally occurring epigenetic mutation in a gene encoding an SBP-box transcription factor inhibits tomato fruit ripening. Nat Genet. 2006;38(8):948–52. doi: 10.1038/ng1841. [DOI] [PubMed] [Google Scholar]
- 130.Bender J, Fink GR. Epigenetic control of an endogenous gene family is revealed by a novel blue fluorescent mutant of Arabidopsis. Cell. 1995;83(5):725–34. doi: 10.1016/0092-8674(95)90185-x. [DOI] [PubMed] [Google Scholar]
- 131.Li L, et al. Identification of the novel protein QQS as a component of the starch metabolic network in Arabidopsis leaves. Plant J. 2009;58(3):485–98. doi: 10.1111/j.1365-313X.2009.03793.x. [DOI] [PubMed] [Google Scholar]
- 132.Eichten SR, et al. Heritable Epigenetic Variation among Maize Inbreds. PLoS Genet. 2011;7(11):e1002372. doi: 10.1371/journal.pgen.1002372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Ong-Abdullah M, et al. Loss of Karma transposon methylation underlies the mantled somaclonal variant of oil palm. Nature. 2015 doi: 10.1038/nature15365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Johannes F, et al. Assessing the impact of transgenerational epigenetic variation on complex traits. PLoS Genet. 2009;5(6):e1000530. doi: 10.1371/journal.pgen.1000530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Cortijo S, et al. Mapping the epigenetic basis of complex traits. Science. 2014;343(6175):1145–8. doi: 10.1126/science.1248127. [DOI] [PubMed] [Google Scholar]
- 136.Dapp M, et al. Heterosis and inbreeding depression of epigenetic Arabidopsis hybrids. Nature Plants. 2015;1(7):15092. doi: 10.1038/nplants.2015.92. [DOI] [PubMed] [Google Scholar]
- 137.Hu Y, et al. Prediction of Plant Height in Arabidopsis thaliana Using DNA Methylation. Data Genetics. 2015;201(2):779–93. doi: 10.1534/genetics.115.177204. [DOI] [PMC free article] [PubMed] [Google Scholar]
