Skip to main content
Plant Physiology logoLink to Plant Physiology
. 2015 Apr 13;168(4):1262–1274. doi: 10.1104/pp.15.00052

Examining the Causes and Consequences of Context-Specific Differential DNA Methylation in Maize1,[OPEN]

Qing Li 1,2,2, Jawon Song 1,2,2, Patrick T West 1,2, Greg Zynda 1,2, Steven R Eichten 1,2, Matthew W Vaughn 1,2, Nathan M Springer 1,2,*
PMCID: PMC4528731  PMID: 25869653

Natural variation of DNA methylation in maize is evident among five diverse maize inbred lines.

Abstract

DNA methylation is a stable modification of chromatin that can contribute to epigenetic variation through the regulation of genes or transposons. Profiling of DNA methylation in five maize (Zea mays) inbred lines found that while DNA methylation levels for more than 99% of the analyzed genomic regions are similar, there are still 5,000 to 20,000 context-specific differentially methylated regions (DMRs) between any two genotypes. The analysis of identical-by-state genomic regions that have limited genetic variation provided evidence that DMRs can occur without local sequence variation, but they are less common than in regions with genetic variation. Characterization of the sequence specificity of DMRs, location of DMRs relative to genes and transposons, and patterns of DNA methylation in regions flanking DMRs reveals a distinct subset of DMRs. Transcriptome profiling of the same tissue revealed that only approximately 20% of genes with qualitative (on-off) differences in gene expression are associated with DMRs, and there is little evidence for association of DMRs with genes that show quantitative differences in gene expression. We also identify a set of genes that may represent cryptic information that is silenced by DNA methylation in the reference B73 genome. Many of these genes exhibit natural variation in other genotypes, suggesting the potential for selection to act upon existing epigenetic natural variation. This study provides insights into the origin and influences of DMRs in a crop species with a complex genome organization.


DNA methylation occurs in both animals and plants and has been reported to play an important role in a number of processes, including the establishment of imprinting in animal and plant species (Jirtle and Skinner, 2007; Gehring et al., 2009; Waters et al., 2011; Rodrigues et al., 2013; Zhang et al., 2014), the whole-chromosome inactivation in animals (Wutz, 2011), and the silencing of transposons (Chan et al., 2005). DNA methylation has also been suggested to be involved in development, response to environmental stresses, human disease, and other processes (Jones and Baylin, 2002; Jirtle and Skinner, 2007; Dowen et al., 2012; Yu et al., 2013). Although DNA methylation can play a role in epigenetic regulation that does not require alterations in DNA sequence, there is also evidence that DNA methylation can be triggered by structural rearrangements (Bender and Fink, 1995) or transposon insertions (Eichten et al., 2012). Differences in DNA methylation among individuals in a population are likely to include both epigenetic and genetic causes.

The establishment and maintenance and DNA methylation are complex processes involving a variety of enzymes (Law and Jacobsen, 2010; Matzke and Mosher, 2014). Establishment of DNA methylation is generally guided by the RNA-directed DNA methylation pathway, while there are distinct maintenance pathways for cytosines at different sequence contexts. CG methylation is maintained by METHYLTRANSFERASE1 during DNA replication process and requires the VARIANT IN METHYLATION1/UBIQUITIN-LIKE WITH PHD AND RING FINGER DOMAINS1 factors to target methylation to hemimethylated sites. CHG (in which H is any base except G) methylation is maintained by the plant-specific chromomethylases, such as the Arabidopsis (Arabidopsis thaliana) CHROMOMETHYLASE3 (CMT3; Law and Jacobsen, 2010). The maintenance of CHG methylation involves a self-enforcing loop with dimethylation of Lys-9 of histone H3 (Johnson et al., 2007; Du et al., 2012). CHH methylation can be maintained by either the RNA-directed DNA methylation or CMT2 pathway (Law and Jacobsen, 2010; Zemach et al., 2013; Stroud et al., 2014). In plants, DNA methylation can be removed by a family of DNA glycosylase enzymes that include DEMETER and REPRESSOR OF SILENCING1 (Zhu, 2009; Zhang and Zhu, 2012). The specific targeting of methylation/demethylation activities and the fidelity of these processes can influence the occurrence of DNA methylation variation. While we have focused on describing the key genes/enzymes influencing DNA methylation in Arabidopsis, there are also homologs of most of these genes in maize (Zea mays; Li et al., 2014), and it is likely that the majority of these pathways are conserved and performing similar functions in maize.

The analysis of Arabidopsis mutant accumulation lines that were grown for 30 generations in the absence of specific selective pressures finds that the methylation levels at specific cytosines can vary at a rate substantially higher than the observed single nucleotide polymorphism rate (Becker et al., 2011; Schmitz et al., 2011). However, differentially methylated regions (DMRs) are quite rare and occur at frequencies similar to genetic changes (Becker et al., 2011; Schmitz et al., 2011). Studies of identical-by-state (IBS) regions of the maize genome also provide evidence for stochastic changes in DNA methylation patterns in the absence of local sequence changes (Eichten et al., 2011). However, structural rearrangements and transposon insertions can also influence the DNA methylation patterns for specific alleles (Bender and Fink, 1995; Durand et al., 2012; Castelletti et al., 2014). The analysis of genetically diverse populations of Arabidopsis (Schmitz et al., 2013b), rice (Oryza sativa; Chodavarapu et al., 2012), maize (Eichten et al., 2013; Regulski et al., 2013), or soybean (Glycine max; Schmitz et al., 2013a) finds numerous DMRs. While there are examples in which these DMRs are caused by local sequence rearrangements (Eichten et al., 2012; Schmitz et al., 2013b) it is likely that these include epigenetic changes as well.

There is evidence that some targets of epigenetic silencing acquire DNA methylation, and therefore, DNA methylation is often considered to be related to gene expression. However, various studies of differential methylation in plant populations find that only a subset of DMRs are associated with altered expression for nearby genes (Eichten et al., 2013; Schmitz et al., 2013b). The relationships between gene expression and DNA methylation differ depending upon the sequence context for DNA methylation and location of the DMR relative to the gene. Gene body DMRs that occur within the central transcribed portion of the gene usually include CG methylation only (Saze and Kakutani, 2011; Schmitz et al., 2013b). There is little evidence that these gene body CG DMRs are associated with gene expression differences, and this gene body methylation is commonly found in highly expressed genes. In maize, there is evidence that many highly expressed genes contain high levels of CHH methylation within several hundred base pair of the transcription start site (TSS; Gent et al., 2013). By contrast, high levels of CG or CHG methylation near the TSS are often associated with reduced gene expression. However, it remains unclear what portion of differential gene expression in plant populations might be attributed to altered levels of DNA methylation.

In this study, we profiled context-specific DNA methylation patterns in five diverse maize inbred lines to identify and characterize context-specific DMRs. We were able to develop methods to robustly identify context-specific DMRs in the maize genome. DMRs are found even in IBS portions of the maize genome but are more common in regions of the genome with genetic variation. The characterization of the overlap in differences for different sequence contexts, location relative to genes and transposons, and methylation patterns for flanking regions allowed us to identify discrete types of DMRs. The contribution of DMRs to differential gene expression was also evaluated by assessing gene expression patterns in the same tissue. DMRs are associated with approximately 20% of genes with qualitative (on-off) differences in gene expression but rarely contribute to quantitative differences in gene expression. In addition, we identified a set of genes with potential cryptic information in B73 reference genome, some of which were already accessible to breeders in other genotypes.

RESULTS AND DISCUSSION

Whole-genome bisulfite sequencing (WGBS) was performed using DNA isolated from seedling leaf tissue of five maize inbred lines: B73, Mo17, Oh43, CML322, and Tx303. B73 is the genotype used to produce the reference genome sequence of maize (Schnable et al., 2009) and represents the stiff-stalk heterotic group. Mo17 and Oh43 are classified as nonstiff-stalk temperate lines, while CML322 and Tx303 are tropical inbred lines (Flint-Garcia et al., 2005). These represent three of the major groups of domesticated maize germplasm (Flint-Garcia et al., 2005). These genotypes are all parents within the nested association mapping population developed for quantitative trait mapping in maize (McMullen et al., 2009). For each genotype, 140 to 185 million reads were obtained, providing 3.5 to 6 times coverage with greater than 99% conversion rates (Supplemental Table S1). The genomewide DNA methylation levels were quite similar for each of the five genotypes. B73 had slightly higher levels of DNA methylation, but this may be due to the fact that repetitive regions will map at the highest frequency within the reference genome and may lead to slightly higher overall estimates of DNA methylation. The genome was grouped into 100-bp nonoverlapping sliding tiles. The coverage and context-specific DNA methylation levels were calculated for each 100-bp tile in the maize genome. The use of 100-bp tiles simplifies comparisons among genotypes and also likely captures the biologically meaningful units of chromatin, such as nucleosomes.

Discovery of Context-Specific DMRs

A primary goal was to identify a robust set of DMRs between genotypes. There are biological differences in the abundance and variation of CHH methylation compared with CG/CHG methylation, and therefore different criteria were employed to discover these distinct types of DMRs. CG and CHG DMRs were discovered by comparing each of the four nonreference genotypes with B73 for 100-bp tiles that had at least three sites (CG or CHG) and at least 2 times coverage in the two lines being contrasted. After applying these criteria, we were able to compare DNA methylation levels for 5.6 to 6.6 million 100-bp tiles, representing 27% to 32% of the maize genome (Table I). It is worth noting that the analysis of DNA methylation differences will be focused on single-copy regions of similarity between inbred lines, and we cannot compare polymorphic transposon insertions using this approach. It is likely that there is DNA methylation at many of these unalignable regions (Seymour et al., 2014). A tile was classified as differentially methylated if the difference in DNA methylation levels between the two lines was greater than 60% for CG or CHG contexts. If two tiles within 100 bp of each other were both classified as differentially methylated, then they were merged together to create a larger DMR. In each of the four genotype contrasts, greater than 99% of the 100-bp tiles do not exhibit differential methylation (Table I). However, there are 15,684 to 22,204 CG or CHG DMRs in each of the four contrasts (Table I). While over 50% of the DMRs encompass a single 100 bp, there are many examples of multiple adjacent 100-bp tiles that exhibit differential methylation, resulting in average DMR sizes of 199 to 218 bp (Supplemental Fig. S1A). The CG and CHG DMRs are distributed throughout the maize genome, with a slight enrichment on chromosome arms (Fig. 1A; Supplemental Fig. S1B).

Table I. DMRs found in this study.

BvC, B73 versus CML322; BvM, B73 versus Mo17; BvO, B73 versus Oh43; BvT, B73 versus Tx303; NA, not applied.

Features BvC BvM BvO BvT Merged
No. CG tiles with 2× coverage (% all tiles) 5,711,274 (27.7%) 5,867,123 (28.5%) 5,850,207 (28.4%) 5,646,804 (27.4%) NA
No. CHG tiles with 2× coverage (% all tiles) 5,697,086 (27.7%) 5,798,435 (28.2%) 5,835,932 (28.3%) 5,638,371 (27.4%) NA
No. CHH tiles with 2× coverage (% all tiles) 6,462,820 (31.1%) 6,217,268 (30.2%) 6,621,837 (32.2%) 6,408,902 (31.1%) NA
No. CG DMRs 22,204 16,387 20,163 20,434 53,816
No. CHG DMRs 20,742 15,684 17,714 19,036 50,899
No. CHH DMRs 5,272 6,721 4,410 4,469 16,736
Percentage CG DMRs 0.78 0.57 0.67 0.72 NA
Percentage CHG DMRs 0.74 0.50 0.61 0.67 NA
Percentage CHH DMRs 0.09 0.12 0.07 0.08 NA
Average CG DMR size (bp) 210 211 204 207 210
Average CHG DMR size (bp) 218 199 218 214 200
Average CHH DMR size (bp) 110 108 109 111 110

Figure 1.

Figure 1.

Identification and validation of context-specific DMRs among four maize genotype contrasts. A, The genomewide distribution of DMRs, genes, TEs, and coverage is shown using Integrative Genomics Viewer (IGV; Robinson et al., 2011). Each track displays the number of features per 1-Mbp window of the maize genome. Only data from chromosome 1 (Chr 1) is shown. B, DMR validation and support rates for each genotype contrast (BvC, B73 versus CML322; BvM, B73 versus Mo17; BvO, B73 versus Oh43; BvT, B73 versus Tx303) using an independent sequence capture bisulfite sequencing data set. The subset of the DMR tiles (numbers indicated at the base of each bar) that also have coverage in a sequence capture bisulfite sequencing (SeqCap Epi) data set are analyzed. Each DMR is classified as validated (green), supported (yellow), or not supported (gray). Validated DMRs represent DMRs that show greater than 60% (for CG and CHG) or greater than 20% (for CHH) difference in the same direction in the SeqCap Epi assay. Supported DMRs are the DMRs that show 40% to 60% (for CG and CHG) or 10% to 20% (for CHH) difference in the same direction in the SeqCap Epi assay.

CHH methylation tends to be present at lower levels compared with CG or CHG methylation, both on a genomic scale and at specific loci (Gent et al., 2013; West et al., 2014). Often regions with elevated CHH methylation will only exhibit 20% to 40% methylation of CHH sites. Given the lower levels of this mark, we used a different set of criteria to identify CHH DMRs. We identified 100-bp tiles in which one genotype has less than 5% CHH methylation and the other genotype exhibits greater than 25% CHH methylation. Even using these relaxed criteria, the number of CHH DMRs was 4- to 9-fold lower than the number of CG or CHG DMRs in each contrast (Table I). It was less common to identify two adjacent 100-bp tiles that both exhibit CHH differences resulting in smaller average size of CHH DMRs (Supplemental Fig. S1A).

Validation of Context-Specific DMRs

Several approaches were used to validate the DMRs. The most robust validation utilized a sequence-capture bisulfite sequencing approach (Li et al., 2014) to perform targeted bisulfite sequencing for a set of approximately 4,000 genomic regions in independent samples of each of these genotypes. This provided coverage for a subset of the DMRs in each of the four contrasts (Fig. 1B), and there is quite strong agreement (R = 0.96–0.99) in the DNA methylation levels measured in the two approaches for CG and CHG DMRs (Supplemental Fig. S2). DMRs were classified as validated, supported, or not supported based on the observed methylation levels in the SeqCap Epi (Roche NimbleGen) data set (Fig. 1B; Supplemental Fig. S2). The majority (>90%) of CG and CHG DMRs are validated or supported by analysis of DNA methylation in independent DNA samples from the same genotypes using a different method (Fig. 1B). By contrast, only 39% to 62% of CHH DMRs are supported or validated. The correlation between WGBS and SeqCap Epi was not improved when we used 50- or 200-bp tiles for DMR discovery (Supplemental Table S2). The proportion of DMRs that are validated is only slightly improved when we only assessed regions with higher (3 or 5 times) coverage (Supplemental Table S3), suggesting that our use of a low-coverage cutoff is not strongly reducing our validation rate. Many of the CG and CHG DMRs can also be supported by analysis of a methylated DNA immunoprecipitation data set (Supplemental Fig. S3).

We compared our 100-bp tile approach that used simple cutoffs for coverage and methylation difference with the eDMR approach (Li et al., 2013) that utilized statistical approaches to discover DMRs. This approach does not require predefined windows but instead will search for context-specific DMRs across the genome. Implementing the eDMR algorithm identified a smaller number of DMRs than the tile-based method described above for CG and CHG DMRs (Supplemental Fig. S4A). A comparison of the DMRs identified using the tile-based method and eDMR reveals that most (>75%) of the CG or CHG DMRs identified by eDMR are also identified using the tile-based method (Supplemental Fig. S4A). However, only less than 10% of the CHH eDMRs are also found using the tile-based method (Supplemental Fig. S4A). For most of the contrasts, the DMRs found by eDMR did not have higher validation rates than the DMRs identified by the tile-based approach (Supplemental Fig. S4B). We elected to focus on the tile-based DMRs for the remainder of the study, as they had validation rates that were similar to, or better than, the validation rate for the eDMR approach and the tile-based DMRs also provided a common coordinate system for cross-sample comparisons. The lower validation rates for CHH DMRs found using either the tile-based or eDMR method may reflect biological variation in CHH patterns in different plants or technical issues in the discovery. The remaining analyses focus on the CG and CHG DMRs that could be validated at high rates.

Characterization of CG and CHG DMRs

While the DMRs were identified for specific sequence contexts, it was clear that, in some cases, the same genomic region exhibits changes in multiple sequence contexts (Fig. 2A). Many (approximately 40%) of the CG DMRs also have over 60% differences in the level of CHG methylation (Supplemental Fig. S5A). Across the four genotypes, approximately 25% of the CG DMRs show similar levels of CHG methylation in both genotypes. In these cases, the CHG methylation levels are usually low in both lines (Supplemental Fig. S5A, pie charts). The CG DMRs rarely exhibit differences in CHH methylation, and the CHH methylation levels are quite low for both genotypes (Supplemental Fig. S5B). A similar analysis of CHG DMRs reveals that many CHG DMRs also exhibit differences in CG methylation (Supplemental Fig. S5C). However, when CHG DMRs have similar levels of CG methylation in both genotypes, the level of CG methylation tends to be high in both genotypes the majority of the time (Supplemental Fig. S5C, pie charts). Similar to CG DMRs, the CHG DMRs generally don’t show a difference in CHH methylation, and the CHH level is very low for both genotypes (Supplemental Fig. S5D). The analyses in Figure 2A and Supplemental Figure S5 suggest that CG and CHG DMRs can be divided into three types, CG only, CG/CHG, and CHG only. For each genotype contrast, we merged the CG and CHG DMRs into a single list and classified each DMR as one of the three types or unclassified. In each contrast, about 50% of the DMRs that could be classified are CG/CHG, and roughly 25% are CG only or CHG only (Supplemental Table S4). The locations of the three types of DMRs relative to genes and transposons were assessed relative to randomly selected regions (Fig. 2, B and C). All three types of DMRs are enriched for being located within genes or both genes and transposable elements (TEs) and depleted for being located within TEs. The depletion of being located near TEs is less pronounced in CG/CHG DMRs. Many (approximately 70%) of the CG/CHG DMRs are still near or in TEs, although this percentage is less than the random set (Fig. 2B). Similar to findings in Arabidopsis and soybean (Schmitz et al., 2013a, 2013b), the CG-only DMRs show the most enrichment for being near filtered genes sets genes, and the CHG-only DMRs show the most enrichment for being near both genes and TEs (Fig. 2B). These patterns are similar across the four contrasts (Supplemental Fig. S5). The enrichment of CG-only DMRs near or within genes, which generally have low CHG methylation, explains the observation that CHG levels are generally low at CG-only DMRs. Similarly, the enrichment of CHG-only DMRs near both genes and TEs, which generally have both high CG and CHG methylation, explain the observation that CHG-only DMRs usually have high CG levels. The type of nearby TEs for the CG/CHG and CHG-only DMRs are different (Fig. 2C). Though they are both less likely to be near long terminal repeat (LTR)-spreading elements compared with the random set, the CHG-only DMRs are enriched for being near TEs classified as other, while CG/CHG DMRs are often near LTR-spreading or terminal inverted repeat elements (Fig. 2C).

Figure 2.

Figure 2.

Context-specific DMRs and their proximity to genomic features. A, Context-specific methylation levels (0%–100%) are plotted for several B73-Mo17 DMRs using IGV (Robinson et al., 2011). The colored region in the center shows context-specific methylation within the DMR, while the black shading for the edges shows methylation levels outside of the DMR. B, Pie charts are used to show the proportion of DMRs that are located near (less than 500 bp) or in genes (red), TEs (blue), both genes and TEs (yellow), or DMRs located more than 500 bp from either a gene or a TE (black). Separate pie charts are shown for CG/CHG DMRs (greater than 60% difference at both CG and CHG contexts), CG-only DMRs (greater than 60% CG difference and less than 20% CHG difference), and CHG-only DMRs (greater than 60% CHG difference and less than 20% CG difference). The pie chart labeled random shows the proportions of each location type for 20,000 randomly selected 100-bp tiles that had coverage in our data set. The size of the pie chart is scaled to the number of DMRs in the specific category, except the random set. C, For the DMRs that are located within or near a TE, we classified the type of TE as terminal inverted repeat (TIR; blue), LTR spreading (red), LTR nonspreading (yellow), or other (black). The spreading/nonspreading classifications are based on Eichten et al. (2012). B and C show the average proportions for the four genotype contrasts; the values for each contrast are available in Supplemental Figure S5. In B and C, an asterisk is used to indicate significant enrichment of the feature relative to 100 random sets at P < 0.01, and a downward arrow indicates significant underrepresentation.

To understand how DMRs might arise, we assessed the DNA methylation profiles in the regions that flank DMRs. We envisioned three scenarios for the creation of DMRs, including (1) a local gain of methylation in one genotype in a region of low methylation (low-low [LL]), (2) a local loss of DNA methylation in one genotype in a region that is highly methylated (high-high [HH]), or (3) a difference in the position of the boundary between methylated and unmethylated DNA in two genotypes (high-low [HL]). Examples of each of these three types of DMRs are provided in Figure 3, A to C. The DNA methylation levels for the second and third tiles on each side of the DMR were averaged to determine the flanking DNA methylation level. The closest tile was not used, as we expect that the actual edge of the DMR may be located in the middle of this tile. This analysis was restricted to the DMRs that had coverage for both flanking regions in both genotypes so only a subset of the DMRs was assessed, ranging from 296 to 2,324 in the four genotype contrasts. Clustering of the DNA methylation levels of the left and right flanks for DMRs (Fig. 3, D and E) reveals that the HH and HL patterns are more commonly observed than the LL patterns for all contrasts (Fig. 3F; Supplemental Fig. S6). The relatively rare LL pattern may simply be due to the fact that the majority of the maize genome is methylated. Highly methylated tiles (CG and CHG) are approximately 10-fold more common than tiles with low CG/CHG methylation.

Figure 3.

Figure 3.

Characterization of methylation at DMR flanking regions. A to C, Three representative types of DNA methylation at DMR flanking regions are shown for CG methylation in B73-Mo17 using IGV (Robinson et al., 2011). In A, the flanking regions both show low methylation (LL), while in B, the flanking regions both show high DNA methylation (HH) and, in C, high methylation in one flanking region and low methylation for the other flanking region (HL). D and E, Hierarchical clustering (Ward’s method) of methylation levels at the left and right flanking regions of CG (D) or CHG (E) DMRs. The B73 versus Mo17 contrast was shown as an example, and clustering for other genotypes provides similar patterns. F, The flanking pattern characterization of B73-Mo17 DMRs was compared with context specificity of DNA methylation changes and genomic location for B73-Mo17 DMRs (similar plots for the other contrasts are available in Supplemental Fig. S6). The DMR sequence contexts and flanking region types are cross tabulated to give a total of nine different DMR categories. The size of each pie chart is scaled to the number of DMRs in each category. The number above each pie chart shows the DMR number in that specific category. Gene (red) indicates that a DMR is within 500 bp of a filtered gene set gene, TE (blue) indicates that a DMR is within 500 bp of a TE, both (yellow) indicates that a DMR is within 500 bp of both a gene and a TE, and neither (black) indicates that a DMR is more than 500 bp from both annotated genes and TEs.

By combining the flanking DNA methylation patterns and the context specificity of DNA methylation differences with the proximity to genes or TEs, we found that the LL patterns are enriched for CG-only DMRs near genes (Fig. 3F; Supplemental Fig. S6). This agrees with the fact that genic regions were generally not methylated and the main methylation type in genic region is CG methylation (West et al., 2014). The HL patterns are often found for CG/CHG DMRs and in the region between genes and TEs, suggesting that there might be different stable boundaries between heterochromatin and euchromatin for the different alleles. The HH flanking patterns are enriched for CG/CHG and CHG-only DMRs and are usually near TEs (Fig. 3F; Supplemental Fig. S6). The HH DMRs are enriched for having the high methylation state in four of the five genotypes (Supplemental Fig. S7) and likely reflect a rare loss of DNA methylation in TE regions.

Frequency of High and Low Methylation States

The CG/CHG, CG-only, or CHG-only DMRs identified in each contrast were merged to form a nonredundant list of DMRs across all genotypes to investigate the frequency of the high and low DNA methylation states within the five genotypes. The number of genotypes with coverage for each DMR was determined (Supplemental Fig. S8A). The 18,632 CG/CHG DMRs, 12,789 CG-only DMRs, and 10,671 CHG-only DMRs with coverage in all five genotypes were used to perform hierarchical clustering of DNA methylation levels (Supplemental Fig. S8B). This reveals that there are more examples in which B73 exhibits high levels of DNA methylation relative to at least one other genotype than examples in which B73 has low methylation. A linear discriminant analysis was used to classify each genotype as high or low methylated, and then the ratio of high to low methylated alleles was used to generate a histogram (Supplemental Fig. S8C). It was more common to have three or four genotypes with high methylation levels than to have only one or two genotypes that were highly methylated.

Role of Genetic Variation in Contributing to DMRs

Differential methylation could occur due to purely epigenetic variation, or it may be driven by local or remote sequence variation. One way to assess the role of local sequence variation is to study the rate of DMRs in IBS genomic regions relative to the genomewide rate of DMRs. IBS regions in each of the four genotype contrasts were identified using Beagle (Browning and Browning, 2011). We then identified genomic regions greater than 1 Mbp in which greater than 90% of the regions were classified as being IBS, resulting in a range of 1 to 17.75 Mbp of IBS regions in the four contrasts (Supplemental Fig. S9A). The presence of DMRs in these IBS regions suggests that DMRs can occur even in regions without substantial genetic variation. The DMRs located within IBS regions are not enriched for specific contexts, flanking patterns of DNA methylation, or location relative to genes and TEs (Supplemental Fig. S9, B–D). However, the frequency of DMRs in the IBS regions is usually lower than the genomewide proportion of DMRs (Supplemental Fig. S9B). This suggests that local sequence variation might account for a portion of the DMRs, while either epigenetic variation or transacting remote sequence variation would account for the remaining DMRs.

Contribution of DMRs to Differentially Expressed Genes

Many studies have reported that there is a limited negative correlation between gene expression levels and DNA methylation. Previous studies have also found that a small proportion of DMRs are negatively correlated with expression levels of nearby genes (Eichten et al., 2013; Schmitz et al., 2013b). In this study, we decided to assess the frequency of differentially expressed genes that contain DMRs to better understand how frequently differential gene expression might involve DNA methylation changes. RNA sequencing was performed on three biological replicates of RNA isolated from the same tissues used for WGBS for each of the five genotypes. Differentially expressed genes were identified using DEseq (Anders and Huber, 2010), and then the differentially expressed genes were divided according to the fold change between the two genotypes (Fig. 4, A–C; Supplemental Table S5). Plots of the difference in CG or CHG methylation for the two genotypes reveal that there are some differences in CG or CHG methylation for genes that exhibit very strong (>10-fold) differential expression, with the high expressed genotype containing lower levels of CG and CHG methylation near the TSS but higher levels of CHH methylation 5′ of the TSS (Fig. 4, A–C). Plots of the actual DNA methylation level for differentially expressed genes with qualitative change show that both alleles have similar methylation levels in flanking regions, but the average CG/CHG level is low and the CHH is high for the high-expressing genotype (Fig. 4, D–F). One interesting finding is that the regions that show the most association between DNA methylation and gene expression are located around the TSS, corresponding to the region that shows the lowest DNA methylation in the metamethylation profiles of genic and surrounding regions. This suggests an important role of DNA methylation in this region for influencing transcription. Methylation in different contexts associates differently with gene expression. CG methylation differences for on-off genes occur only in the first approximately 1 kb of the gene. By contrast, differences in CHG methylation persist further into the gene. While CG and CHG methylation show a negative correlation with gene expression, CHH methylation shows a positive correlation, and the region that shows the highest correlation is 200- to 500-bp upstream of the gene, corresponding to the CHH island described in previous studies (Gent et al., 2013; West et al., 2014). Interestingly, the same genes that exhibit elevated CG and CHG methylation in the region downstream of the TSS also exhibit reduced CHH methylation in the region upstream of the TSS, providing further evidence for a role of elevated CHH methylation in promoter in gene expression.

Figure 4.

Figure 4.

Association of DNA methylation variation with differential expression. A to C, The difference in DNA methylation levels were calculated for differentially expressed genes (methylation at high-expressed gene [H)] minus methylation at lower expressed gene [L]) for CG (A), CHG (B), and CHH (C) contexts. Differentially expressed genes were divided into six categories (color code indicated within the plot) based on the fold change between the highest (H) and the lowest (L) genotypes. Methylation levels were calculated for each category as the average of all the genes in that category. The two vertical lines represent the –200 bp upstream of a gene and 600 bp into a gene from 5′ end. The arrow indicates the direction of transcription. D to F, The actual methylation levels for the on-off category at CG (D), CHG (E), and CHH (F) in the genotypes with the highest or the lowest gene expression level. G and H, Percentage of genes in each expression category with CG (G) and CHG (H) DMRs. I, Distribution of CG/CHG difference between off and on genotypes at –200- to 600-bp regions. nDEG, Not differentially expressed genes; DEG, differentially expressed genes.

The number of genes with DMRs was assessed for genes with different expression patterns (Fig. 4, G and H). In general, the frequency of genes with DMRs is similar for nondifferentially expressed genes and genes with less than 5-fold change in expression. Genes with greater than 10-fold change or on-off genes show significantly higher levels of DMRs. In greater than 95% of these cases, the DMR exhibits higher methylation for the genotype with low, or no, expression. While the genes with on-off change in expression are enriched for DMRs, it is worth noting that the majority (approximately 80%) of the differentially expressed genes with on-off expression differences do not have DMRs and exhibit similar methylation in both genotypes (Fig. 4I). The remaining approximately 20% of genes exhibit large differences in DNA methylation level between the two alleles, with lower CG/CHG methylation for the higher expressed allele (Fig. 4I). These findings provide evidence that expression variation for most of the genes is not associated with DNA methylation variation.

Potential for Cryptic Information in the Maize Genome

Several studies have created epigenetic recombinant inbred line populations in Arabidopsis that segregate for DNA methylation levels in chromosomal regions without genetic variation (Johannes et al., 2009; Reinders et al., 2009). These populations have been used to provide evidence for cryptic information in a particular ecotype that can be released by removing DNA methylation. There is some evidence that this cryptic information may represent genes with natural variation for silencing (Johannes et al., 2009; Reinders et al., 2009). We were interested in characterizing a set of genes that may be subject to silencing by DNA methylation in the reference genotype and determining whether there was natural variation for this silenced state. There are 38,726 maize genes that have coverage for the –200- to +600-bp (relative to the TSS) region in B73. Many (approximately 85%) of these genes are expressed in seedling leaf tissue of B73 or have low levels of CG/CHG methylation surrounding the TSS (Fig. 5A, group 1 genes). The tissue-specific expression for the remaining 6,451 genes was assessed using the B73 expression atlas (Sekhon et al., 2013) to identify 2,602 genes that are expressed in other tissues of B73 (Fig. 5A, group 2 genes). The final 3,849 genes that have high levels of CG/CHG methylation and are not expressed in B73 were divided into two groups based on whether they exhibit expression in seedling leaf tissue of a panel of 51 diverse maize genotypes (Eichten et al., 2013). These two groups of genes represent potentially epigenetically silenced genes that exhibit natural variation (1,578 group 3 genes expressed in some genotypes) or fixed silencing in fairly diverse germplasm (2,271 group 4 genes; Fig. 5A). We assessed whether our data for Oh43, Mo17, CML322, and Tx303 could be used to assess whether the genes in group 3 were related to differences in DNA methylation. There are 119 of the group 3 genes that are expressed in at least one of these four genotypes (but not in B73) and had DNA methylation data for the –200- to +600-bp region surrounding the TSS. We found that 28 of the 119 (23.5%) of these genes have very low (less than 20%) CG and CHG methylation for the genotype with expression, suggesting that at least a subset of the group 3 genes are associated with altered methylation of the promoter region.

Figure 5.

Figure 5.

Characterizing the potential for cryptic information within the B73 genome. A, The genes with methylcytosine (mC) coverage at the –200 to +600 region (relative to the TSS) were assessed to identify genes that are expressed or have low CG/CHG methylation (group 1), genes that are expressed in other tissues of the B73 expression atlas (group 2), genes that are expressed in seedling tissue of other maize genotypes (group 3), or genes with no detectable expression (group 4). B and C, The distribution of lengths for mRNA transcripts (B) and introns (C) for genes from groups 1 to 4 is shown using a boxplot. D, The proportion of genes within each group located within subgenome 1 (Sub1) or subgenome 2 (Sub2) is shown. E, The proportion of genes in each group that are syntenic with rice is shown. F, The closest TEs for the genes in each group were classified. G, The proportion of genes with Gene Ontology (GO) annotation is shown for each group of genes. H, The proportion of genes located within the low-recombining central portion of each chromosome is assessed for each group of genes. TIR, Terminal inverted repeat.

A number of properties were examined for the genes in the four different groups. The intron length and transcript length are shorter for the group 3 and 4 genes (Fig. 5, B and C). The proportion of genes located within subgenomes 1 and 2 (Schnable et al., 2011) was similar for all four groups of genes (Fig. 5D). However, the proportion of genes located in syntenic positions relative to rice or sorghum (Sorghum bicolor) was much higher for the group 1 genes (Fig. 5E). The genes in groups 2 to 4 are enriched for being near LTR-spreading transposons and are much less likely to have a Gene Ontology annotation (Fig. 5, F and G). The proportion of the genes located in the recombination-poor central region of the chromosome was assessed for each group (Fig. 5H). The group 1 genes are similar to all genes, but the group 2 to 4 genes are enriched for being located in the low-recombination central portion of maize chromosomes. This analysis provides evidence that there is potential cryptic information in the maize genome. Nearly one-half of the highly methylated genes exhibit natural variation for expression levels, suggesting that these alleles have been accessible to plant breeders. However, given the location of many of these genes within the recombination-poor regions of the maize genome, there may be challenges in isolating favorable linkages of the expressed alleles with other allelic variation. The other half of the genes that are highly methylated and not expressed in B73 do not show expression variation in this tissue of other maize genotypes. Perturbation of DNA methylation levels may allow the expression of these genes to reveal if there is important cryptic information in the maize genome that could allow for agronomic improvement. One hurdle to this approach is the apparent lethality of strong perturbation of DNA methylation in maize (Li et al., 2014).

CONCLUSION

DNA methylation has the potential to provide heritable information beyond DNA sequence. In this study, we profile the context-specific DNA methylation patterns for five maize genotypes to identify regions with differential methylation. The fact that genomic regions with nearly identical sequence still contain DMRs suggests that at least a portion of the variation for DNA methylation is purely epigenetic. There are distinct subtypes of DMRs in terms of the context specificity of DNA methylation and the patterns of methylation in regions flanking the DMRs. By assessing gene expression in the same tissue used for methylation profiling, we were able to carefully assess the relationship between altered expression and DNA methylation. DNA methylation differences are associated with gene expression changes only for a subset of the genes that show major differences in expression. Genes with quantitative differences in expression (less than 10-fold changes) rarely exhibit differential DNA methylation. This study also provided an opportunity to study the genes that might provide cryptic information in maize. A set of silent, methylated genes was identified within the reference genome. Some of these genes are expressed in other tissues or other maize genotypes, while others are silenced in all samples assessed. These genes may provide a reservoir of cryptic information that could influence traits if reactivated in populations similar to the epigenetic recombinant inbred lines utilized in Arabidopsis.

MATERIALS AND METHODS

Plant Materials

The maize (Zea mays) inbred lines B73, Mo17, CML322, Oh43, and Tx303 were grown to the third leaf stage. The third leaf was harvested, frozen in liquid nitrogen, and used for DNA and RNA isolation. Genomic DNA was prepared using the standard cetyl-trimethyl-ammonium bromide method. Total RNA was prepared using TRIzol reagent following manufacturers’ instructions.

WGBS and Data Analysis

WGBS libraries were prepared as previously described in Eichten et al. (2013). Libraries that passed quality control (concentration of greater than 2 nm and size range between 200 to 500 bp) were sequenced on a HiSEquation 2000 using 100 cycles and paired-end mode. Adapters were trimmed using Trim_Galore. Reads were mapped to the maize B73 reference genome version 2 using Bismark (0.10.1; Krueger and Andrews, 2011), allowing one mismatch in the alignment (-N 1). Methylated cytosines were extracted from aligned reads using the Bismark methylation extractor under standard parameters. The proportion of CG, CHG, and CHH methylation was determined as weighted methylation levels (Schultz et al., 2012) in 100-bp nonoverlapping windows across the genome.

Identification of DMR: Tile and eDMR Methods

To identify DMRs using the 100-bp tiles, the coverage for each tile was determined by dividing the total number of times cytosines of a specific sequence context were covered by a read by the total number of cytosines of a specific sequence context within the tile. CG and CHG DMRs were required to have a minimum of three symmetrical CG or CHG sites, at least 2× coverage, and a minimum methylation difference between two genotypes of 60%. CHH sequence context DMRs were required to have a minimum of six asymmetrical CHH sites, a minimum of 2× coverage, and one genotype with less than 5% methylation and the other genotype with greater 25% methylation. DMRs that were within 100 bp of one another had their methylation values averaged and were merged into a single DMR.

To identify DMRs using eDMR (Li et al., 2013), the output file from Bismark’s methylation extractor was used to identify differentially methylated cytosines using a R package methylKit using the default parameter (Akalin et al., 2012). These are then used for implementing the eDMR package (Li et al., 2013), which identifies DMRs using default distance cutoff determined automatically by bimodal normal distributions of distances between cytosines. Overlap between the tile DMR and eDMR data sets was determined using the intersectBed tool from the BEDTools genomic toolkit (Quinlan and Hall, 2010).

DMR Validation

The SeqCap Epi platform was used to validate some of the DMRs identified in this study using independent biological samples (third leaf tissue) for each genotype. Three technical replicates for B73, two technical replicates for Mo17, two biological replicates for Oh43, and one sample each for CML322 and Tx303 were used to generate bisulfite converted sequencing libraries that were used to perform sequence capture. The capture pool targets approximately 5,000 regions of the maize genome (Li et al., 2014). The samples were sequenced using an Illumina MiSeq with 100 cycles (paired end) for B73 and Mo17 and with 150 cycles (paired end) for Oh43, CML322, and Tx303. The methylation levels from replicates were averaged and used to compare with the methylation levels from the WGBS data set.

Methylation profiling using methylated DNA immunoprecipitation followed by microarray hybridization had been performed on these five genotypes in a previous study (Eichten et al., 2013). The methylation values for array probes located with 300 bp of a DMR identified in this study were extracted from National Center for Biotechnology Information (NCBI) Gene Expression Omnibus accession number GSE46949. The differential DNA methylation estimates were compared with the differences in WGBS levels determined in this study for each genotype contrast, and the correlation of the values was computed using Pearson correlation.

IBS Regions

IBS regions between each genotype were initially identified using the BEAGLE algorithm (Browning and Browning, 2011). These IBS regions were then intersected with 250-kb-sliding 1-Mbp tiles. One-megabase pair tiles that were composed of at least 90% BEAGLE-identified IBS regions were considered as IBS regions for further analyses.

DMR Merging

The DMRs were discovered using a pairwise comparison strategy by comparing B73 with each of the four other inbred lines. To get a nonredundant list of DMRs found in all contrasts, the DMRs from the four contrasts for each cytosine context were merged using BEDTools (Quinlan and Hall, 2010). The number of inbred lines that have data for each DMR was determined, and further analyses were performed on DMRs that have data in all five genotypes. At each DMR, the five lines were assigned to two groups such that the intragroup variance was minimized and the intergroup variance was maximized.

The CG and CHG DMRs from each genotype contrast were merged using the outermost boundaries. DNA methylation differences for CG and CHG contrasts were used to classify each DMR into one of the four categories: CG only, CHG only, CG/CHG, and unclassified. CG-only DMRs show greater than 60% CG and less than 20% CHG difference, CHG-only DMRs show less than 20% CG and greater than 60% CHG difference, CG/CHG DMRs are those that have greater than 60% difference for both CG and CHG, and the other DMRs that can’t satisfy any of these criteria were classified as unclassified DMRs.

Methylation Levels at DMR Flanking Regions

To determine the methylation level surrounding DMRs, the methylation levels for the three 100-bp tiles to the left and to the right of every DMR were extracted. These tiles were then filtered for the minimum 2× coverage cutoff and the minimum sequence context-specific sites. The average value of the second and third windows, if they contained data, was determined as the surrounding methylation value. These values were used to perform a hierarchical clustering analysis using the Ward’s method. Based on the clustering, the DMRs were grouped into four types: DMRs that show high (or low) methylation on both sides in both contrasting lines for either CG or CHG methylation (HH or LL), DMRs that show high methylation on one side and low methylation on the other side in both contrasting lines for either CG or CHG methylation (HL), and undefined DMRs that cannot be assigned to any of the three groups.

DMR Proximity to Genes and Transposons

The DMR proximity to genes and TEs was determined using the closestBed tool from the BEDTools genomic toolkit (Quinlan and Hall, 2010). TE annotations were obtained from the Maize TE Consortium (ZmB73_5b), and gene annotations were obtained from the maize Filtered Gene Set (ZmB73_5a). The random subsets of tiles used as a permutation control for the tile DMRs were generated using a custom Perl script. The script obtains a specified amount of random tiles that meet the minimum coverage and minimum sequence-specific site count of CG, CHG, or CHH cutoffs.

RNA Sequencing and Analysis

We performed RNA sequencing using the same tissue that was used for methylation analysis. Three biological replicates were done for each of the five lines. RNA libraries were prepared using Illumina TruSeq kit following manufacturers’ instructions. We did sequencing on a HiSEquation 2000 using paired-end 50 cycles. Raw sequencing reads were analyzed using FASTQC to remove bases with poor quality or adapter sequences. Trimmed reads were then mapped to the maize reference genome version 2, allowing for one mismatch using TopHat (Trapnell et al., 2009). Uniquely mapped reads were then filtered using SAMtools (-q 50; Li et al., 2009). Only reads that are properly paired and are both mapped in the right orientation and in the same place were used. Read counts per gene model were then summarized using BAM2COUNT, which can be found on iPlant.

Differentially Expressed Genes and Their Correlation with Methylation

The data set with raw read counts per gene per genotype was imported into R environment. Read counts were normalized using DESeq (Anders and Huber, 2010) and were subjected to a pairwise comparison between B73 and each of the other four lines to identify differentially expressed genes. The differentially expressed genes from all four contrasts were then combined and were divided into one of the five groups based on the fold change between the genotypes with the highest and the lowest expression levels: less than 2, 2 to 5, 5 to 10, greater than 10, and on-off genes. On-off genes are the genes that have detectable mRNA expression in at least one of the five lines but have no reads (0) in at least one other line. Methylation levels at each sequence context were calculated for each gene at a region that spans 2,000 bp upstream to 3,000 bp (or the actual gene size if gene size is less than 3,000 bp) into the gene, divided into 100 bins. Methylation difference between the genotypes that show the highest and the lowest expression was calculated for each bin for each gene.

Identifying Genes with Cryptic Information

Genes that had methylation data for the region from –200 to 600 bp (relative to the TSS) were identified. The methylation levels for CG/CHG were then averaged over this region for each gene. Genes with at least 60% CG and CHG methylation were identified and were further filtered using the expression data to find these that have no detectable expression (normalized read count per kilobase pair, <0.5) in B73. Genes that meet these criteria were further filtered using the B73 expression atlas data from Sekhon et al. (2013) to find the genes that are not developmentally regulated. These genes were further classified into two groups: one group showing expression in some other inbred lines (Eichten et al., 2013) and another group that was not expressed and was considered to be genes with potential cryptic information.

The WGBS data set has been deposited into NCBI under accession numbers SRR850328, SRR850332, SRX731432, SRX731433, and SRX731434. The RNAseq data set has been deposited into NCBI under accession number SRP052226.

Supplemental Data

The following supplemental materials are available.

Supplementary Material

Supplemental Data

Acknowledgments

We thank Tyler Kent and Jeffrey Ross-Ibarra for assistance with the discovery of IBS regions, iPlant Collaborative for providing the tools and resources for data analysis, the Texas Advanced Computing Center at the University of Texas at Austin for computational support and data storage, and the Minnesota Supercomputing Institute for access to software and user support for data analyses.

Glossary

DMR

differentially methylated region

IBS

identical-by-state

TSS

transcription start site

WGBS

whole-genome bisulfite sequencing

TE

transposable element

LTR

long terminal repeat

LL

low-low

HH

high-high

HL

high-low

NCBI

National Center for Biotechnology Information

Footnotes

1

This work was supported by the National Science Foundation (grant no. DBI–1237931 to M.W.V. and N.M.S.).

[OPEN]

Articles can be viewed without a subscription.

References

  1. Akalin A, Kormaksson M, Li S, Garrett-Bakelman FE, Figueroa ME, Melnick A, Mason CE (2012) methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol 13: R87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Anders S, Huber W (2010) Differential expression analysis for sequence count data. Genome Biol 11: R106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Becker C, Hagmann J, Müller J, Koenig D, Stegle O, Borgwardt K, Weigel D (2011) Spontaneous epigenetic variation in the Arabidopsis thaliana methylome. Nature 480: 245–249 [DOI] [PubMed] [Google Scholar]
  4. Bender J, Fink GR (1995) Epigenetic control of an endogenous gene family is revealed by a novel blue fluorescent mutant of Arabidopsis. Cell 83: 725–734 [DOI] [PubMed] [Google Scholar]
  5. Browning BL, Browning SR (2011) A fast, powerful method for detecting identity by descent. Am J Hum Genet 88: 173–182 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Castelletti S, Tuberosa R, Pindo M, Salvi S (2014) A MITE transposon insertion is associated with differential methylation at the maize flowering time QTL Vgt1. G3 4: 805–812 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chan SW, Henderson IR, Jacobsen SE (2005) Gardening the genome: DNA methylation in Arabidopsis thaliana. Nat Rev Genet 6: 351–360 [DOI] [PubMed] [Google Scholar]
  8. Chodavarapu RK, Feng S, Ding B, Simon SA, Lopez D, Jia Y, Wang GL, Meyers BC, Jacobsen SE, Pellegrini M (2012) Transcriptome and methylome interactions in rice hybrids. Proc Natl Acad Sci USA 109: 12040–12045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Dowen RH, Pelizzola M, Schmitz RJ, Lister R, Dowen JM, Nery JR, Dixon JE, Ecker JR (2012) Widespread dynamic DNA methylation in response to biotic stress. Proc Natl Acad Sci USA 109: E2183–E2191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Du J, Zhong X, Bernatavichute YV, Stroud H, Feng S, Caro E, Vashisht AA, Terragni J, Chin HG, Tu A, et al. (2012) Dual binding of chromomethylase domains to H3K9me2-containing nucleosomes directs DNA methylation in plants. Cell 151: 167–180 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Durand S, Bouché N, Perez Strand E, Loudet O, Camilleri C (2012) Rapid establishment of genetic incompatibility through natural epigenetic variation. Curr Biol 22: 326–331 [DOI] [PubMed] [Google Scholar]
  12. Eichten SR, Briskine R, Song J, Li Q, Swanson-Wagner R, Hermanson PJ, Waters AJ, Starr E, West PT, Tiffin P, et al. (2013) Epigenetic and genetic influences on DNA methylation variation in maize populations. Plant Cell 25: 2783–2797 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Eichten SR, Ellis NA, Makarevitch I, Yeh CT, Gent JI, Guo L, McGinnis KM, Zhang X, Schnable PS, Vaughn MW (2012) Spreading of heterochromatin is limited to specific families of maize retrotransposons. PLoS Genet 8: e1003127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Eichten SR, Swanson-Wagner RA, Schnable JC, Waters AJ, Hermanson PJ, Liu S, Yeh CT, Jia Y, Gendler K, Freeling M, et al. (2011) Heritable epigenetic variation among maize inbreds. PLoS Genet 7: e1002372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Flint-Garcia SA, Thuillet AC, Yu J, Pressoir G, Romero SM, Mitchell SE, Doebley J, Kresovich S, Goodman MM, Buckler ES (2005) Maize association population: a high-resolution platform for quantitative trait locus dissection. Plant J 44: 1054–1064 [DOI] [PubMed] [Google Scholar]
  16. Gehring M, Bubb KL, Henikoff S (2009) Extensive demethylation of repetitive elements during seed development underlies gene imprinting. Science 324: 1447–1451 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gent JI, Ellis NA, Guo L, Harkess AE, Yao Y, Zhang X, Dawe RK (2013) CHH islands: de novo DNA methylation in near-gene chromatin regulation in maize. Genome Res 23: 628–637 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Jirtle RL, Skinner MK (2007) Environmental epigenomics and disease susceptibility. Nat Rev Genet 8: 253–262 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Johannes F, Porcher E, Teixeira FK, Saliba-Colombani V, Simon M, Agier N, Bulski A, Albuisson J, Heredia F, Audigier P, et al. (2009) Assessing the impact of transgenerational epigenetic variation on complex traits. PLoS Genet 5: e1000530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Johnson LM, Bostick M, Zhang X, Kraft E, Henderson I, Callis J, Jacobsen SE (2007) The SRA methyl-cytosine-binding domain links DNA and histone methylation. Curr Biol 17: 379–384 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Jones PA, Baylin SB (2002) The fundamental role of epigenetic events in cancer. Nat Rev Genet 3: 415–428 [DOI] [PubMed] [Google Scholar]
  22. Krueger F, Andrews SR (2011) Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27: 1571–1572 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Law JA, Jacobsen SE (2010) Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet 11: 204–220 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Li Q, Eichten SR, Hermanson PJ, Zaunbrecher VM, Song J, Wendt J, Rosenbaum H, Madzima TF, Sloan AE, Huang J, et al. (2014) Genetic perturbation of the maize methylome. Plant Cell 26: 4602–4616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25: 2078–2079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Li S, Garrett-Bakelman FE, Akalin A, Zumbo P, Levine R, To BL, Lewis ID, Brown AL, D’Andrea RJ, Melnick A, et al. (2013) An optimized algorithm for detecting and annotating regional differential methylation. BMC Bioinformatics (Suppl 5) 14: S10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Matzke MA, Mosher RA (2014) RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat Rev Genet 15: 394–408 [DOI] [PubMed] [Google Scholar]
  28. McMullen MD, Kresovich S, Villeda HS, Bradbury P, Li H, Sun Q, Flint-Garcia S, Thornsberry J, Acharya C, Bottoms C, et al. (2009) Genetic properties of the maize nested association mapping population. Science 325: 737–740 [DOI] [PubMed] [Google Scholar]
  29. Quinlan AR, Hall IM (2010) BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26: 841–842 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Regulski M, Lu Z, Kendall J, Donoghue MT, Reinders J, Llaca V, Deschamps S, Smith A, Levy D, McCombie WR, et al. (2013) The maize methylome influences mRNA splice sites and reveals widespread paramutation-like switches guided by small RNA. Genome Res 23: 1651–1662 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Reinders J, Wulff BB, Mirouze M, Marí-Ordóñez A, Dapp M, Rozhon W, Bucher E, Theiler G, Paszkowski J (2009) Compromised stability of DNA methylation and transposon immobilization in mosaic Arabidopsis epigenomes. Genes Dev 23: 939–950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative genomics viewer. Nat Biotechnol 29: 24–26 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Rodrigues JA, Ruan R, Nishimura T, Sharma MK, Sharma R, Ronald PC, Fischer RL, Zilberman D (2013) Imprinted expression of genes and small RNA is associated with localized hypomethylation of the maternal genome in rice endosperm. Proc Natl Acad Sci USA 110: 7934–7939 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Saze H, Kakutani T (2011) Differentiation of epigenetic modifications between transposons and genes. Curr Opin Plant Biol 14: 81–87 [DOI] [PubMed] [Google Scholar]
  35. Schmitz RJ, He Y, Valdés-López O, Khan SM, Joshi T, Urich MA, Nery JR, Diers B, Xu D, Stacey G, et al. (2013a) Epigenome-wide inheritance of cytosine methylation variants in a recombinant inbred population. Genome Res 23: 1663–1674 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Schmitz RJ, Schultz MD, Lewsey MG, O’Malley RC, Urich MA, Libiger O, Schork NJ, Ecker JR (2011) Transgenerational epigenetic instability is a source of novel methylation variants. Science 334: 369–373 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Schmitz RJ, Schultz MD, Urich MA, Nery JR, Pelizzola M, Libiger O, Alix A, McCosh RB, Chen H, Schork NJ, et al. (2013b) Patterns of population epigenomic diversity. Nature 495: 193–198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Schnable JC, Springer NM, Freeling M (2011) Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss. Proc Natl Acad Sci USA 108: 4069–4074 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA, et al. (2009) The B73 maize genome: complexity, diversity, and dynamics. Science 326: 1112–1115 [DOI] [PubMed] [Google Scholar]
  40. Schultz MD, Schmitz RJ, Ecker JR (2012) ‘Leveling’ the playing field for analyses of single-base resolution DNA methylomes. Trends Genet 28: 583–585 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Sekhon RS, Briskine R, Hirsch CN, Myers CL, Springer NM, Buell CR, de Leon N, Kaeppler SM (2013) Maize gene atlas developed by RNA sequencing and comparative evaluation of transcriptomes based on RNA sequencing and microarrays. PLoS ONE 8: e61005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Seymour DK, Koenig D, Hagmann J, Becker C, Weigel D (2014) Evolution of DNA methylation patterns in the Brassicaceae is driven by differences in genome organization. PLoS Genet 10: e1004785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Stroud H, Do T, Du J, Zhong X, Feng S, Johnson L, Patel DJ, Jacobsen SE (2014) Non-CG methylation patterns shape the epigenetic landscape in Arabidopsis. Nat Struct Mol Biol 21: 64–72 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Trapnell C, Pachter L, Salzberg SL (2009) TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25: 1105–1111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Waters AJ, Makarevitch I, Eichten SR, Swanson-Wagner RA, Yeh CT, Xu W, Schnable PS, Vaughn MW, Gehring M, Springer NM (2011) Parent-of-origin effects on gene expression and DNA methylation in the maize endosperm. Plant Cell 23: 4221–4233 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. West PT, Li Q, Ji L, Eichten SR, Song J, Vaughn MW, Schmitz RJ, Springer NM (2014) Genomic distribution of H3K9me2 and DNA methylation in a maize genome. PLoS ONE 9: e105267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Wutz A. (2011) Gene silencing in X-chromosome inactivation: advances in understanding facultative heterochromatin formation. Nat Rev Genet 12: 542–553 [DOI] [PubMed] [Google Scholar]
  48. Yu A, Lepère G, Jay F, Wang J, Bapaume L, Wang Y, Abraham AL, Penterman J, Fischer RL, Voinnet O, et al. (2013) Dynamics and biological relevance of DNA demethylation in Arabidopsis antibacterial defense. Proc Natl Acad Sci USA 110: 2389–2394 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Zemach A, Kim MY, Hsieh PH, Coleman-Derr D, Eshed-Williams L, Thao K, Harmer SL, Zilberman D (2013) The Arabidopsis nucleosome remodeler DDM1 allows DNA methyltransferases to access H1-containing heterochromatin. Cell 153: 193–205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Zhang H, Zhu JK (2012) Active DNA demethylation in plants and animals. Cold Spring Harb Symp Quant Biol 77: 161–173 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Zhang M, Xie S, Dong X, Zhao X, Zeng B, Chen J, Li H, Yang W, Zhao H, Wang G, et al. (2014) Genome-wide high resolution parental-specific DNA and histone methylation maps uncover patterns of imprinting regulation in maize. Genome Res 24: 167–176 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Zhu JK. (2009) Active DNA demethylation mediated by DNA glycosylases. Annu Rev Genet 43: 143–166 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data

Articles from Plant Physiology are provided here courtesy of Oxford University Press

RESOURCES