Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2019 Nov 16;48(1):460–471. doi: 10.1093/nar/gkz1081

Structural insights into target DNA recognition by R2R3-MYB transcription factors

Baihui Wang 1,3, Qiang Luo 1,3, Yingping Li 1, Liufan Yin 1, Nana Zhou 1, Xiangnan Li 2, Jianhua Gan 3,, Aiwu Dong 1,
PMCID: PMC7145699  PMID: 31733060

Abstract

As the largest group of MYB family transcription factors, R2R3-MYB proteins play essential roles during plant growth and development. However, the structural basis underlying how R2R3-MYBs recognize the target DNA remains elusive. Here, we report the crystal structure of Arabidopsis WEREWOLF (WER), an R2R3-MYB protein, in complex with its target DNA. Structural analysis showed that the third α-helices in both the R2 and R3 repeats of WER fit in the major groove of the DNA, specifically recognizing the DNA motif 5′-AACNGC-3′. In combination with mutagenesis, in vitro binding and in vivo luciferase assays, we showed that K55, N106, K109 and N110 are critical for the function of WER. Although L59 of WER is not involved in DNA binding in the structure, ITC analysis suggested that L59 plays an important role in sensing DNA methylation at the fifth position of cytosine (5mC). Like 5mC, methylation at the sixth position of adenine (6mA) in the AAC element also inhibits the interaction between WER and its target DNA. Our study not only unravels the molecular basis of how WER recognizes its target DNA, but also suggests that 5mC and 6mA modifications may block the interaction between R2R3-MYB transcription factors and their target genes.

INTRODUCTION

Transcription factors control many essential biological processes by regulating the expression of genes. According to the characteristics of their DNA-binding domains, transcription factors can be divided into different families (1), among which MYB domain-containing transcription factors constitute a large family and perform diverse functions in eukaryotes (2). MYB family transcription factors share a conserved domain architecture, with a DNA-binding domain (MYB domain) consisting of 1–4 imperfect repeats at the N-terminus and transcription activation/repression domains at the C-terminus. Each repeat of the MYB domain is ∼52 amino acids in length and forms a helix-turn-helix (HTH) architecture (3). The three imperfect repeats (R) in c-myb proto-oncogene product (c-Myb), a key regulator for proliferation and differentiation of hematopoietic cells (4), are referred to as R1, R2 and R3, and other MYB proteins are classified according to their sequence similarity to these c-Myb repeats (5). Based on their sequence characteristics, MYB transcription factors are grouped into 1R, R2R3, 3R and 4R classes, which contain one to four repeats, respectively (3).

R2R3-MYB proteins only exist in terrestrial plants, but they form the largest subfamily of MYB transcription factors (2). As an example, more than 100 R2R3-MYB proteins have been identified in Arabidopsis (3). R2R3-MYB proteins play important roles in metabolism, cell fate determination, growth and development, and responses to biotic and abiotic stress (3). For example, many R2R3-MYB genes are involved in lignin or flavonoid synthesis in different species, including Arabidopsis AtMYB32 (6), Triticum aestivum TaMYB1D (7), Prunus persica PpMYB18 (8), Chrysanthemum morifolium CmMYB1 (9) and Pinus taeda PtMYB14 (10). In Arabidopsis, AtMYB0 (GLABRA1, GL1) and AtMYB23 function in trichome initiation in shoots (11) and AtMYB66 (WEREWOLF, WER) determines root hair patterning (12). AtMYB59 (13) and AtMYB77 (14) modulate root growth, AtMYB91/AS1 regulates leaf development (15), and AtMYB21, AtMYB24 and AtMYB57 control anther development (16). There are also many R2R3-MYB genes involved in stress responses: AtMYB96 contributes to drought stress response and pathogen resistance (17,18), AtMYB102 functions in insect defense (19), AtMYB15 is related to cold tolerance (20), and AtMYB62 regulates phosphate starvation responses (21).

Despite the broad range of studies on the biological functions of R2R3-MYBs in plants, the molecular basis of their recognition of target DNAs is poorly understood. Structural studies of MYB proteins have been performed mainly in animals and viruses. The structure of c-Myb was first analyzed in solution by nuclear magnetic resonance (NMR) spectroscopy, which indicated that R2 and R3 are involved in target DNA recognition (22,23). The crystal structures of mouse c-Myb (MsMyb), avian myeloblastosis virus (AMV) v-Myb and Trichomonas vaginalis MYB3 (TvMyb3) further revealed that the third helices of R2 and R3 recognize the target DNAs in the major groove (24,25). Although there are many more MYB members in plants than in animals, only very limited structural information is available for plant MYB proteins. The first reported plant MYB protein structure was the crystal structure of Antirrhinum majus RADIALIS (AmRAD) (26). Recently, the crystal structure of Arabidopsis phosphate starvation response 1 (AtPHR1) in complex with its target DNA was also determined, showing that two copies of PHR1 MYB domains bind to the major groove of DNA (27). Both AmRAD and AtPHR1 are 1R-type MYB transcription factors. However, despite being the largest group of MYB transcription factors, no structures of plant R2R3-MYB proteins have been reported so far.

To better understand how plant R2R3-MYB proteins recognize their target DNAs, we used Arabidopsis WER (AtMYB66) as a model to perform structural analysis. In Arabidopsis roots, WER is specifically expressed in non-hair cells (N-cells) and activates GLABRA2 (GL2), a central regulatory gene for epidermal cell fate determination (12). Previous in vitro and in vivo studies showed that WER specifically recognizes the cis-element WER Binding Site (WBS) located within the GL2 promoter and inhibits root hair formation in N-cells (28,29). Here, we report the structure of the WER–DNA complex, showing that R3 specifically recognizes the AAC element and R2 associates with the GC element in a redundant manner. The R3 residues responsible for DNA binding are conserved in both plants and animals, indicating that plant R2R3-MYBs and animal R1R2R3-MYBs recognize the AAC element in a conserved manner. In contrast, the R2 residues involved in DNA binding are variable, which probably contributes to the diversity of target DNA sequences between plants and animals. Similar to other transcription factors (30), DNA methylation at the fifth position of cytosine (5mC) in the AAC element weakened the interaction between WER and DNA. Interestingly, we found that DNA methylation at the sixth position of adenine (6mA) within the AAC element also impaired the interaction between WER and its target DNA, suggesting that 6mA modification is involved in target DNA recognition by transcription factors.

MATERIALS AND METHODS

Protein expression and purification

DNAs encoding 6 × His-SUMO-tagged full-length and truncated WER proteins were obtained by PCR and subcloned into the pET28a vector (Novagen, Madison, WI, USA). Constructs for WER mutants were generated by the overlapping PCR method. The primers used in plasmid construction are listed in Supplementary Table S1. A DNA encoding MsMyb (77–193) was synthesized by the GENEWIZ company (https://www.genewiz.com.cn/) and was subcloned into the pET28a vector. The resulting constructs were transferred into Escherichia coli BL21 (DE3) competent cells for protein expression. Protein expression was induced by adding isopropyl β-d-1-thiogalacto-pyranoside (IPTG) to a final concentration of 0.2 mM. The induced cultures were then grown at 18°C for 18 h. The cells were harvested by centrifugation and lysed by a high pressure disruptor. The homogenate was clarified by centrifugation (22 000 g) at 4°C for 1 h. The supernatant was loaded onto a Ni-NTA column. The eluted sample was dialyzed and treated with Ulp1 protease to remove the 6× His-SUMO tag. Then, the proteins were purified with a Superdex 75 16/60 preparation grade column (GE Healthcare, Milwaukee, WI, USA). The proteins were concentrated (∼30 mg/ml) and stored in a buffer of 300 mM NaCl, and 20 mM Tris–HCl, pH 8.0.

EMSA experiment

Different quantities of WER 12–130 (from 0 to 0.8 μM) were mixed with 0.1 μM DNA in buffer (150 mM NaCl and 20 mM Tris–HCl, pH 8.0). The total volume of the reaction system was 20 μl. To investigate the effect of redox, either 2 mM DTT or 5 mM H2O2 were included in the WER 12–130 (from 0 to 0.6 μM) and DNA (0.1 μM) reaction. The samples were incubated on ice for 1 h and then analyzed on 6% native PAGE gels with 0.5× TBE buffer. The gel was stained with GelRed and imaged using a UV system.

ITC experiments

ITC experiments were performed with a MicroCal iTC200 (GE Healthcare) by injecting WER 12–130 (400 μM), MsMyb (400 μM) or mutant protein solutions into 20 μM DNA duplex solutions. Before reaction, all protein and DNA samples were dialyzed against a buffer composed of 150 mM NaCl and 20 mM HEPES, pH 7.5. For all reactions, 24 injections (each 1.6 μl) were performed in the experiment at 25°C. Binding curves were generated by plotting the heat change of the binding reaction, and the data were fitted using one set of binding site model with Origin 7.0.

Crystallization, data collection and structural refinement

Six truncated WER proteins (which contained amino acids 12–120, 12–130, 12–135, 9–120, 9–130 and 9–135 of WER) were expressed, purified and used in co-crystallization trials with various DNAs. The crystallization samples were prepared by mixing 15 mg/ml protein (in 150 mM NaCl, 20 mM Tris–HCl pH 8.0 buffer) with DNA at a molar ratio of 1:1.2. High quality crystals were obtained in the presence of WER 12–130 (named WER-R2R3) and dsDNA (5′-AAATTCTCCA10A11C12C13G14C15ATTTTC-3′, 5′-CGAAAATG8*C9*G10*G11*T12*T13*GGAGAATT-3′) containing an overhang. The crystals were grown by the hanging drop vapor diffusion method at 18°C, the well solution is composed of 0.1 M MES pH 6.0, 30% (v/v) PEG600, 5% (w/v) PEG1000 and 10% (v/v) glycerol.

All crystals were cryo-protected using their mother liquor supplemented with 25% glycerol and snap-frozen in liquid nitrogen. The X-ray diffraction data were collected on beamline BL17U at the Shanghai Synchrotron Radiation Facility (SSRF) at cryogenic temperatures and maintained with a cryogenic system. The complex structure was solved by the molecular replacement method using the TvMyb3 structure (PDB_ID: 3ZQC) as a search model. The structure was refined using the Refmac5 program of CCP4i (31) or the phenix.refine program of Phenix (32). The data collection and refinement statistics are summarized in Supplementary Table S2. All structure images were created with PyMOL.

Dual luciferase assay

Arabidopsis protoplasts were isolated as previously described (33). Four-week-old rosette leaves were cut into 1 mm slices and fully immersed in protoplast enzymatic hydrolysate (0.15% (w/v) cellulase R10, 0.035% (w/v) pectolyase Y-23, 0.4 M mannitol, 20 mM KCl, 20 mM MES, 10 mM CaCl2, 0.02 mg/ml BSA). The protoplasts were lightly shaken for 3–4 h at room temperature while protected from light. For the dual luciferase assay, the promoter of GL2 was amplified and inserted into the pGreenII-0800-LUC vector. The full-length CDSs of WER and its mutants fused with 3 × FLAG were amplified by nested PCR and inserted into the p1300 vector. Different combinations of the two types of plasmids were co-transferred into protoplasts transiently. After 12–16 h incubation in the dark, the protoplasts were harvested by low-speed centrifugation and quantified with a dual-luciferase assay kit (E1910.Promega). The renilla luciferase activity was used as an internal control to normalize the vector and then to normalize the luciferase activity. The relative activity values are shown as means ± SD of three independent biological replicates.

RESULTS

The target DNA sequences of Arabidopsis R2R3-MYB proteins

The cis-elements of R2R3-MYB proteins can be grouped into two distinctive motifs: 5′-(C/T)AACNG-3′ and 5′-ACC(A/T)A(A/C)-3′ (Figure 1A), according to the previous verification in vitro and/or in vivo (34). As revealed in the structures of both MsMyb and TvMyb3 in complex with DNA, cis-elements are recognized by the third helices of R2 and R3 repeats. Interestingly, though their cis-elements are different, plant R2R3-MYB proteins share high sequence similarity in their third and sixth helices, corresponding to the third helices of R2 and R3 (Figure 1B). These observations indicate that plant R2R3-MYB proteins may be able to adopt subtle different conformation to coordinate with the corresponding cis-elements.

Figure 1.

Figure 1.

DNA motifs recognized by R2R3-MYB proteins in Arabidopsis. (A) Sequences of R2R3-MYB target motifs that have been verified by in vitro and in vivo assays. (B) Sequence alignment of typical R2R3-MYB proteins. The secondary structure of WER was predicted by Phyre2 program (http://www.sbg.bio.ic.ac.uk/phyre2). (C) EMSA (Electrophoretic Mobility Shift Assay) assay showing the interaction between WER and its target DNA (5′-AAATTCTCCAACCGCATTTTC-3′, 5′-GAAAATGCGGTTGGAGAATTT-3′). The DNA concentration was fixed at 0.1 μM and the protein concentration was increased from 0 to 0.8 μM. (D) ITC (Isothermal Titration Calorimetry) experiment measuring the binding affinity between wild-type WER and its target DNA (5′-AAATTCTCCAACCGCATTTTC-3′, 5′-GAAAATGCGGTTGGAGAATTT-3′).

To unravel the molecular basis of target gene recognition by plant R2R3-MYB proteins, we selected WER as a model. Consistent with previous studies (28,29,35), our in vitro EMSA experiment (Figure 1C) confirmed that WER binds to its target DNA containing the 5′-(C/T)AACNG-3′ motif within the promoter of GL2. As determined by the ITC analysis (Figure 1D), the equilibrium dissociation constant (Kd) value between WER and its target DNA is approximately 51 nM.

The overall structure of the WER–DNA complex

To gain more insight into DNA recognition by WER, we purified different truncated WER proteins and performed co-crystallization trials with various DNAs. Though crystals were only obtained in the presence of WER 12–130 (named WER-R2R3) and dsDNA, they diffracted well (up to 2.15 Å). The structure was solved by the molecular replacement method and is referred to as the WER–DNA complex hereafter. The data collection and refinement statistics are summarized in Supplementary Table S2. As depicted in Figure 2A, WER-R2R3 is composed of six α-helices, H1 (V23–H36), H2 (W41–K47), H3 (G54–L65), H4 (E76–L89), H5 (W93–R99) and H6 (D105–L116). Helices H1–H3 and H4–H6 belong to the R2 and R3 repeats, respectively. R2 and R3 are connected by a 7-residue linker (67PNVKRGN73, Supplementary Figure S1A). In addition to other hydrophobic residues, each helix contains a conserved aromatic residue (Phe or Trp) forming the inner hydrophobic core of R2 (Supplementary Figure S1B) and R3 (Supplementary Figure S1C), respectively. Owing to the presence of R52, R58, and R60, the outer surface of WER-R2R3 is highly positive (Figure 2A and Supplementary Figure S2A), and the side chains of these positively charged residues form extensive H-bond interactions with DNA phosphate backbones. Several other residues such as G19, W21, S56, H40, N42, N91, W93 and S94 are also involved in DNA backbone recognition (Supplementary Figure S2B–E).

Figure 2.

Figure 2.

Structure of the WER–DNA complex. (A) The overall folding of WER–DNA complex. DNAs are shown as sticks. In the left and right panels, WER-R2R3 is shown in cartoon and as an electrostatic surface potential map, respectively. The 2Fo-Fc electron density map was contoured at the 1.0 σ level. (B) Close-up view showing the relative orientations of the H3 and H6 helices of WER-R2R3 and DNA. The helices are shown in cartoon-and-stick form. The DNA is shown as spheres. (C–G) Sequence-specific recognition of A10:T13*, A11:T12*, C12:G11*, G14:C9* and C15:G8*, which correspond to the first, second, third, fifth, and sixth base pairs of the DNA, respectively. DNA base pairs and WER residues responsible for base recognition are shown as sticks. The C-atoms of R2 and R3 residues are magenta and cyan, respectively. Water molecules are shown as red spheres. The distances of the direct or water-mediated H-bond interactions are indicated by numbers.

WER-R2R3 contains two Cys residues (C53 and C57), which are highly conserved in plant R2R3-MYB proteins (Figure 1B). Previous studies suggested that, via change of their redox state, these Cys residues can affect DNA binding activity of plant R2R3-MYB proteins (36). Consistent with the previous report, our EMSA analysis showed that no effect was observed on WER–DNA interaction by adding reducing agent DTT, whereas oxidizing agent H2O2 significantly blocked the interaction between WER and its target DNA (Supplementary Figure S3). In the WER–DNA complex, the CB-SG bond of C53 points toward the DNA duplex, but does not directly interact with the DNA (Supplementary Figure S4A). C57 of WER corresponds to the C130 residue of MsMyb (Supplementary Figure S4A). The CB-SG bond of C57 points toward the hydrophobic core of R2 and inserts into the pocket formed by the side chains of W21, L29, I44, L50 and R52 (Supplementary Figure S4B). Like C130 in MsMyb (24), the reduced C57 residue might also stabilize the conformation of the R2 repeat of WER.

Sequence-specific recognition between WER and its target DNA

The WER–DNA complex structure showed that the third helices (H3 and H6) of R2 and R3 are inserted into the major groove of the DNA, producing a sequence-specific interaction with the 5′-AACCGC-3′ motif (Figure 2BG and Supplementary Figure S2E). A10:T13* (the first base pair of the 5′-AACCGC-3′ motif) forms two H-bonds with N110 of R3; one (2.9 Å) is between the N7 atom of A10 and the ND2 atom of N110 and the other (3.0 Å) is between the N6 atom of A10 and the OD1 atom of N110 (Figure 2C). A11:T12* (the second base pair of the 5′-AACCGC-3′ motif) is also recognized by one Asn residue (N106), through two similar H-bonds between the nucleobase of A11 and the side chain of N106 (Figure 2D). To stabilize the conformation of A11:T12* base pair, the nucleobase of T12* forms two water-mediated H-bonds: one with N110 and the other with N113 (Figure 2D).

The third base pair of the 5′-AACCGC-3′ motif, C12:G11*, is recognized by K109 of R3 (Figure 2E). Instead of C12, K109 interacts with the nucleobase of the pairing G11*. The NZ atom of K109 forms H-bond interactions with both the O6 and the N7 atoms of G11*; the average distance is around 2.9 Å (Figure 2E). The side chain of L59 points toward the nucleobase of C12, and the distance between the CD2 atom of L59 and the C5 atom of C12 is 3.7 Å (Figure 2E). The nucleobases of C13:G10* do not have any direct interaction with WER. G14:C9* and C15:G8* (the fifth and sixth base pairs of the 5′-AACCGC-3′ motif) are both recognized by K55 of R2. The NZ atom of K55 forms one H-bond (2.7 Å) with the O6 atom of G14, and mediated by one water molecule, the NZ atom of K55 also interacts with the N7 atom of G14 (Figure 2F). Instead of C15, K55 forms a direct H-bond interaction with the pairing G8* of the sixth base pair (Figure 2G).

Verification of WER–DNA interaction in vitro and in vivo

To verify the specific interactions observed in the WER–DNA complex, we constructed and purified five WER single-point mutants, in which K55, L59, N106, K109, or N110 were substituted by an Ala (A) amino acid, and performed ITC analysis (Figure 3A). Compared to that of wild-type WER, DNA binding affinities of N106A, K109A, and N10A mutants were approximately 20∼30-fold decreased, and K55A mutant caused more dramatic reduction (>40-fold). Different from other mutants, the DNA binding affinity of L59A mutant (Kd: 47 nM) is comparable to that of wild-type WER (Kd: 51 nM) (Figure 3A), probably because of the long distance (3.7 Å) between the CD2 atom of L59 and the C5 atom of C12 (Figure 2E).

Figure 3.

Figure 3.

Verification of sequence-specific interactions. (A) ITC experiments showing the impact of mutation of WER residues involved in dsDNA (5′-AAATTCTCCAACCGCATTTTC-3′, 5′-GAAAATGCGGTTGGAGAATTT-3′) binding. (B) ITC experiments showing the impact of DNA core motif (5′-AACCGC-3′) mutation on WER binding. The detailed sequence of the target DNA is shown on the top of the table. For mutated DNAs, only the mutated base pairs are listed in the table for clarity. (C) Dual luciferase assay of the GL2 promoter activation activity of wild-type and mutated WER proteins. Values are means ± SD of three independent biological replicates. * and *** indicate statistically significant differences between the wild type (WT) and mutant at P < 0.05 and P < 0.001, respectively. (D) ITC experiments showing the binding affinities of WER L59A mutant to the A12:T11*, G12:C11* and T12:A11* mutants of the 5′-AACCGC-3′ motif, respectively.

In addition to WER, we also made systematic mutations of the DNA cis-element 5′-AACCGC-3′, in which the five WER-interacting base pairs (A10:T13*, A11:T12*, C12:G11*, G14:C9* and C15:G8*) were replaced by other Watson-Crick base pairs (Figure 3B and Supplementary Figure S5). As revealed by ITC analysis, any substitution within the A10:T13*, A11:T12*, or C12:G11* base pairs significantly decreased (∼20–50-fold) the binding affinity between WER and DNA. The most dramatic reduction was observed for the C10:G13* and A12:T11* mutants, Kd values of which are 2004 and 2590 nM, respectively. Replacement of G14:C9* base pair with A14:T9* or T14:A9* has no obvious impact on WER–DNA interaction, but C14:G9* mutation caused a 40-fold decrease in the DNA binding affinity of WER. Substitution of C15:G8* by all other Watson-Crick base pairs did not affect the interaction of WER–DNA (Figure 3B and Supplementary Figure S5). G14:C9* and C15:G8* are both recognized by K55 of R2. Different from many other residues, the side chain of lysine is long and flexible. We speculate that K55 of WER may undergo certain conformational changes to interact with different base pairs at either 14th or 15th position of the target DNA, such as A14:T9* and T14:A9*, and maintain the binding affinity between WER and the mutated DNAs. Taken together, these results indicated that WER specifically recognizes the 5′-AACNDN-3′ (D: A or T or G) motif.

To further verify the sequence-specific interactions identified in the structure, we used a dual-luciferase reporter assay to compare the GL2 promoter activation activities of the wild-type and mutated WER proteins in planta. The GL2 promoter containing the 5′-AACCGC-3′ motif was transferred into a luciferase reporter vector linked with the firefly luciferase reporter (Figure 3C). When the effector construct P35S::WER and the reporter construct PGL2::Firefly luciferase were co-transferred into Arabidopsis protoplasts, the activity of firefly luciferase was enhanced to 7-fold compared with the negative control (Figure 3C). Compared with wild-type WER, the GL2 promoter activities of the WER K55A, N106A, K109A and N110A mutants (Figure 3C) were dramatically decreased, indicating that K55, N106, K109 and N110 are all essential for WER to associate with its target DNA in planta.

We noticed that although the in vitro DNA binding affinity of WER L59A mutant was comparable to that of wild-type WER (Figures 1D and 3A), the GL2 promoter activation activity of WER L59A was weaker than that of wild-type WER in planta (Figure 3C). We were puzzled by this observation, so we measured the binding affinities of WER L59A to the A12:T11*, G12:C11* or T12:A11* mutants of the 5′-AACCGC-3′ DNA motif (Figure 3D). Although they were poor substrates for wild-type WER, the A12:T11*, G12:C11* and T12:A11* mutants were all good substrates to WER L59A (Figure 3D). Besides the 5′-AACCGC-3′ motif within the GL2 promoter, we speculate that WER L59A might bind to other genes containing the sequences of 5′-AANCGC-3′, probably leading to its weak GL2 promoter activation activity in planta.

5mC modification of the DNA blocks its interaction with R2 repeat of WER

As revealed by structural superposition (Supplementary Figure S6A), the overall folding of WER-R2R3 is similar to that of MsMyb (PDB code: 1H8A), TvMyb1 (PDB code: 2KDZ) and TvMyb3 (PDB code: 3ZQC) with root-mean-square derivations of 0.900, 1.861 and 0.673 Å, respectively. WER-R2R3 shares up to 40% sequence similarity with MsMyb, TvMyb1 and TvMyb3 (Supplementary Figure S6B). Of the six DNA-recognizing residues (K55, L59, N106, K109, N110 and N113) of WER, five are conserved in MsMyb, TvMyb1 and TvMyb3; whereas L59 of WER is substituted by a Glu (E) amino acid in all the other proteins (Supplementary Figure S6B).

The side chain of L59 points toward the nucleobase of C12 (the 12th cytosine of 5′-AAATTCTCCA10A11C12C13G14C15ATTTTC-3′) in the WER–DNA complex (Figure 4A). In eukaryotes, cytosine can be methylated at the fifth position (5mC), which acts as an important epigenetic mark (37). As confirmed by complex structures, 5mC is recognized by one Ile (I) residue in many proteins (Supplementary Figure S6C). Thus, we wonder whether MYB-type transcription factors can interact with 5mC-modified DNA. To answer this question, we first performed ITC analysis using WER-R2R3 and DNA with or without methyl modification at the C5 position of the nucleotide C12. Compared to the unmodified DNA (Figure 1D), the binding affinity between the 5mC modified DNA (5′-AA5mC-3′) and WER decreased to more than 45-fold with a Kd value of 2336 nM (Figure 4B). Different from wild-type WER, ITC analysis showed that 5mC modification has no obvious impact on DNA binding to the WER L59A mutant (Figure 4B). Both Leu and Ile residues contain two methyl groups and are hydrophobic in nature. However, unlike Ile in which the methyl groups are attached to its CB and CG atoms, both methyl groups of Leu are attached to the CG atom, which may cause close contact with the methyl group of 5mC as remodeling in Supplementary Figure S6D, leading to decreased binding affinity between WER and 5mC-modified DNA.

Figure 4.

Figure 4.

R2 repeat is incompatible with DNA 5mC modification. (A) The interactions between L59/E132 and the C5 atom of cytosine in WER–DNA/MsMyb–DNA (PDB code: 1H8A) complexes. (B) ITC analysis showing the impact of 5mC modification on DNA binding by WER and its mutants. (C) Consensus sequence and conservation analysis of the R2 motif involved in DNA recognition of MYB family proteins. Plant and animal MYB proteins are shown in the upper and lower panels, respectively.

Although the percentage is low, some plant R2R3-MYB members (Figure 4C upper panel) have a Glu (E) residue in the position corresponding to L59 of WER. The Leu residue is also substituted by Glu in MsMyb, TvMyb1 and TvMyb3 (Supplementary Figure S6E). In fact, evolutionary analysis revealed that this Glu residue is highly conserved in animals (Figure 4C, lower panel). In the MsMyb structure, the Glu residue (E132) forms one H-bond with the N4 atom of C20 (Figure 4A). To test whether the Glu residue can tolerate 5mC modification in the target DNA, we constructed one L→E mutant of WER (WER L59E) and performed ITC analysis. Compared with wild-type WER (Figure 1D), the WER L59E mutant showed a similar binding affinity (47.8 nM) to the unmodified DNA (Figure 4B). 5mC modification caused an ∼4-fold decrease in the DNA binding affinity for WER L59E (Figure 4B), and similar results were also observed in a previous MsMyb study (24). The L59A mutation did not affect the interaction between WER and its target DNA (Figures 1D and 3A), indicating that L59 of WER is not critical for DNA binding, which can explain why L59E mutation has no obvious impact on DNA binding by WER. Compared to Leu, the side chain of Glu is more flexible. As shown in the modeling figure (Supplementary Figure S6D), the side chain of Glu59 can easily undergo conformational change to accommodate the methyl group of 5mC, which may explain why L59E is less efficient in blocking 5mC binding. Together, our results suggested that DNA 5mC modification may block the binding of MYB transcription factors to their target DNAs in both plants and animals.

6mA modification of the DNA inhibits its interaction with R3 repeat of WER

In the structure of the WER–DNA complex, A10 and A11 (the 10th and 11th adenines of 5′-AAATTCTCCA10A11C12C13G14C15ATTTTC-3′) were bound by two Asn residues (N106 and N110) of WER R3 repeat. The amino groups at the 6 position in the two adenines both form direct H-bond interactions with the side chains of N106 and N110 (Figure 5A), and similar interactions were also observed in the MsMyb-DNA and TvMyb-DNA structures (Supplementary Figure S7A–C). Similar to cytosine, adenine can also be methylated. Like 5mC, methylation at the sixth position of adenine (6mA) can also function as an epigenetic mark in eukaryotes (38). To investigate whether 6mA modification can affect DNA binding by MYB proteins, we synthesized 6mA-modified DNA and performed ITC analysis (Figure 5B). Compared with the unmodified DNA, methylation of the first adenine (5′-6mAAC-3′) decreased the binding affinity between WER and its target DNA by 12-fold, and methylation of the second adenine (5′-A6mAC-3′) reduced the affinity of WER for its DNA site by 5-fold. Double methylation (5′-6mA6mAC-3′) caused more serious (>20-fold) inhibition on WER–DNA interaction. These results indicated that WER is incompatible with DNA 6mA modification.

Figure 5.

Figure 5.

R3 repeat is incompatible with DNA 6mA modification. (A) The interactions between R3 Asn residues and the AA element in WER–DNA and MsMyb–DNA (PDB code: 1H8A) complexes. ITC analysis shows the impact of 6mA modification on DNA binding by (B) WER and (C) MsMyb, respectively. (D) Consensus sequence and conservation analysis of the R3 motif involved in DNA recognition of MYB domain proteins. Plant and animal MYB proteins are shown in the upper and lower panels, respectively. (E) The potential target motif AACNDN (D: A or T or G) recognized by WER can be methylated in vivo by searching the Arabidopsis DNA methylation databases (GSM2807190 for 5mC, GSM2157793 for 6mA).

In addition to WER, we also measured the binding affinities between MsMyb and 6mA-modified DNAs. As shown in Figure 5C, 6mA modification of either of the two adenines caused a dramatic reduction of the DNA binding affinity to MsMyb. Besides WER and MsMyb, the two adenine-interacting Asn residues are also highly conserved in other plant and animal MYB proteins (Figure 5D), suggesting that 6mA modification might be a conserved mechanism for regulating the expression of genes targeted by MYB proteins in both plants and animals.

Our WER–DNA complex structure and ITC analysis showed that WER specifically recognizes the 5′-AACNDN-3′ (D: A or T or G) motif (Figures 2 and 3). To analyze if the potential target elements of WER are methylated in vivo, we analyzed the AACNDN motifs genome-wide within the promoter regions by searching the Arabidopsis DNA methylation databases (GSM2807190 for 5mC, GSM2157793 for 6mA), considering that WER is a transcription factor. We found 1467251 AACNDN motifs located at gene promoter regions (0–3 kb upstream of transcription start site) of 33322 genes in Arabidopsis. Among them, 111984 AACNDN motifs are methylated including 110021 motifs with 5mC modification at the third cytosine, 605 and 1941 motifs with 6mA modification at the first and second adenine, respectively. Some motifs contain both types of methylation, for example 272 motifs with 5mC modification at the third cytosine and 6mA modification at the first adenine, 310 motifs with 5mC modification at the third cytosine and 6mA modification at the second adenine, and 1 motif with 5mC modification at the third cytosine and 6mA modifications at both the first and second adenines. Together, our results indicated that AACNDN motifs are subject to methylation in planta (Figure 5E).

DISCUSSION

In this study, we determined the crystal structure of WER in complex with its target DNA, representing the first structure of R2R3-type MYB proteins. Although most of the residues involved in DNA binding are conserved, one residue (Leu versus Glu) of the R2 repeat is the major difference between plant and animal MYB proteins, which probably contributes to the variety in their target DNAs. Interestingly, we found that not only 5mC but also 6mA modifications inhibit the interactions between MYB transcription factors and their target DNAs.

The structure of the WER–DNA complex showed that the R2 and R3 repeats of WER specifically recognize the 5′-AACCGC-3′ motif. The first adenine, second adenine and the third cytosine interact with R3, and the fifth guanine and sixth cytosine are recognized by R2. However, compared to DNA containing 5′-AACCGC-3′ motif (Figure 1D), the 5′-ACCCGC-3′ containing DNA binding affinity of WER is much weaker (Figure 3B). Interestingly, though sequence alignment showed that the key residues responsible for DNA recognition are highly conserved in all R2R3-MYB proteins (Figure 1B), nearly half R2R3-MYB members recognize DNA with a 5′-ACC-3′ element (Figure 1A), suggesting that these R2R3-MYB members may undergo subtle conformational changes or possesses some unidentified features to favor 5′-ACC-3′ element.

DNA cytosine methylation (DNA 5mC) is a conserved epigenetic modification in eukaryotes. Specifically, DNA 5mC is often associated with transcription factor binding sites. For example, the transcription factor NRF1 selectively binds to unmethylated target DNA, and 5mC methylation disrupts the protein-DNA interaction in vitro (39). In vivo, NRF1 can efficiently recognize the Asz1 promoter to active the luciferase expression. When the Asz1 promoter is methylated by a CpG methyltransferase, the activation activity of NRF1 is significantly reduced (39). Consistently, deletion of three DNA methyltransferases (Dnmt3a, Dnmt3b and Dnmt1) creates a large number of novel binding sites for NRF1 in mouse ES cells (30), suggesting that DNA 5mC modifications have widely hidden NRF1 binding sites across the genome. In Arabidopsis, investigation of the global impact of DNA 5mC modification by calculating the ratio of DAP-seq or ChIP-seq binding strength at cis-elements showed that 72% (234 out of the total 327 analyzed members) of transcription factors are sensitive to the DNA 5mC modification and 24% (79 members) are weakly impacted, while only 4% (14 members) preferentially bind to methylated motifs (40). Our data showed that the L59 residue of the WER R2 repeat is incompatible with DNA 5mC modification in the core AAC element (Figure 4B). Indeed, DNA 5mC modifications and MYB transcription factors display opposite functions in many plant-specific processes, especially fruit ripening (41,42). Apple fruit skin anthocyanin accumulation is negatively related to the DNA methylation level but positively correlated with MYB transcription factors (43). Many R2R3-MYB proteins have been shown to promote anthocyanin accumulation (44–48), but DNA hypermethylation resulted in colorless or non-ripening fruits (49). Importantly, the L→A mutation (WER L59A) resulted in slightly stronger binding affinity than that of wild-type WER and the mutated protein was not sensitive to DNA 5mC modification (Figure 4B). This observation may provide a potential way to improve the agronomic traits of flowers or fruits.

In addition to DNA 5mC, DNA 6mA modification has been discovered in various eukaryotes, including vertebrates (frog, fish, pig, mouse and human) (50–53), plants (Arabidopsis and rice) (54–56) and fungi (57). Compared to mammals, the DNA 6mA modification level is higher in plants. In the rice genome, approximately 0.2% of adenines are modified by 6mA (56) and a similar level was also observed in Arabidopsis (54). DNA 6mA is not randomly distributed but rather located around transcription start sites in Arabidopsis (54). Additionally, increasing evidence has shown that DNA 6mA modification acts as a gene expression-associated epigenetic maker that participates in multiple cellular processes including stress responses (55,58,59), tumorigenesis (53), and neuronal development (60). However, how DNA 6mA modification affects gene expression regulation is still unclear. As revealed by the structures of both WER and MsMyb, the R3 repeat specifically recognizes the AAC element, and our ITC analysis showed that DNA 6mA modification on AAC element significantly weakens the interaction between WER/MsMyb and their target DNAs (Figure 5B and C). Searching the Arabidopsis DNA methylation databases, we find that AACNDN (D: A or T or G) motifs recognized by WER can be methylated in planta (Figure 5E). Taken together, our study suggests that both DNA 5mC and 6mA modifications may regulate gene expression by impairing the interaction between MYB transcription factors and their target DNAs during plant growth and development.

DATA AVAILABILITY

Structural factors and coordinates have been deposited in the Protein Data Bank under accession code 6KKS for the WER–DNA complex.

Supplementary Material

gkz1081_Supplemental_File

ACKNOWLEDGEMENTS

We thank Dr Jianxiang Liu for pGreenII-0800-LUC vector, and the staff of beamlines BL17U1 and BL19U1 at the Shanghai Synchrotron Radiation Facility for assistance during data collection.

Author contributions: A.D. conceived and designed the research. J.G. supervised the experiments. B.W., Q.L., Y.L. L.Y., N.Z. and X.L. performed the experiments. B.W., Q.L. and J.G. analyzed the data. B.W., Q.L., J.G. and A.D. wrote the manuscript. All authors read, revised and approved the manuscript.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

National Natural Science Foundation of China [NSFC31570315, 31871218, 91519308]; National Basic Research Program of China [2012CB910500]. Funding for open access charge: National Natural Science Foundation of China [NSFC31570315].

Conflict of interest statement. None declared.

REFERENCES

  • 1. Riechmann J.L., Heard J., Martin G., Reuber L., Jiang C.Z., Keddie J., Adam L., Pineda O., Ratcliffe O.J., Samaha R.R. et al.. Arabidopsis transcription factors: Genome-wide comparative analysis among eukaryotes. Science. 2000; 290:2105–2110. [DOI] [PubMed] [Google Scholar]
  • 2. Feller A., Machemer K., Braun E.L., Grotewold E.. Evolutionary and comparative analysis of MYB and bHLH plant transcription factors. Plant J. 2011; 66:94–116. [DOI] [PubMed] [Google Scholar]
  • 3. Dubos C., Stracke R., Grotewold E., Weisshaar B., Martin C., Lepiniec L.. MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010; 15:573–581. [DOI] [PubMed] [Google Scholar]
  • 4. Lipsick J.S., Wang D.M.. Transformation by v-Myb. Oncogene. 1999; 18:3047–3055. [DOI] [PubMed] [Google Scholar]
  • 5. Stracke R., Werber M., Weisshaar B.. The R2R3-MYB gene family in Arabidopsis thaliana. Curr. Opin. Plant Biol. 2001; 4:447–456. [DOI] [PubMed] [Google Scholar]
  • 6. Preston J., Wheeler J., Heazlewood J., Li S.F., Parish R.W.. AtMYB32 is required for normal pollen development in Arabidopsis thaliana. Plant J. 2004; 40:979–995. [DOI] [PubMed] [Google Scholar]
  • 7. Wei Q.H., Zhang F., Sun F.S., Luo Q.C., Wang R.B., Hu R., Chen M.J., Chang J.L., Yang G.X., He G.Y.. A wheat MYB transcriptional repressor TaMyb1D regulates phenylpropanoid metabolism and enhances tolerance to drought and oxidative stresses in transgenic tobacco plants. Plant Science. 2017; 265:112–123. [DOI] [PubMed] [Google Scholar]
  • 8. Zhou H., Lin-Wang K., Wang F., Espley R.V., Ren F., Zhao J., Ogutu C., He H., Jiang Q., Allan A.C. et al.. Activator-type R2R3-MYB genes induce a repressor-type R2R3-MYB gene to balance anthocyanin and proanthocyanidin accumulation. New Phytol. 2019; 221:1919–1934. [DOI] [PubMed] [Google Scholar]
  • 9. Zhu L., Shan H., Chen S., Jiang J., Gu C., Zhou G., Chen Y., Song A., Chen F.. The heterologous expression of the chrysanthemum R2R3-MYB Transcription factor CmMYB1 Alters lignin composition and represses flavonoid synthesis in arabidopsis thaliana. PLoS One. 2013; 8:e65680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Bedon F., Bomal C., Caron S., Levasseur C., Boyle B., Mansfield S.D., Schmidt A., Gershenzon J., Grima-Pettenati J., Seguin A. et al.. Subgroup 4 R2R3-MYBs in conifer trees: gene family expansion and contribution to the isoprenoid- and flavonoid-oriented responses. J. Exp. Bot. 2010; 61:3847–3864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kirik V., Lee M.M., Wester K., Herrmann U., Zheng Z.G., Oppenheimer D., Schiefelbein J., Hulskamp M.. Functional diversification of MYB23 and GL1 genes in trichome morphogenesis and initiation (vol 132, pg 1477, 2005). Development. 2005; 132:1749–1749. [DOI] [PubMed] [Google Scholar]
  • 12. Lee M.M., Schiefelbein J.. WEREWOLF, a MYB-related protein in Arabidopsis, is a position-dependent regulator of epidermal cell patterning. Cell. 1999; 99:473–483. [DOI] [PubMed] [Google Scholar]
  • 13. Mu R.L., Cao Y.R., Liu Y.F., Lei G., Zou H.F., Liao Y., Wang H.W., Zhang W.K., Ma B., Du J.Z. et al.. An R2R3-type transcription factor gene AtMYB59 regulates root growth and cell cycle progression in Arabidopsis. Cell Res. 2009; 19:1291–1304. [DOI] [PubMed] [Google Scholar]
  • 14. Shin R., Burch A.Y., Huppert K.A., Tiwari S.B., Murphy A.S., Guilfoyle T.J., Schachtman D.P.. The Arabidopsis transcription factor MYB77 modulates auxin signal transduction. Plant Cell. 2007; 19:2440–2453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Theodoris G., Inada N., Freeling M.. Conservation and molecular dissection of ROUGH SHEATH2 and ASYMMETRIC LEAVES1 function in leaf development. PNAS. 2003; 100:6837–6842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Cheng H., Song S.S., Xiao L.T., Soo H.M., Cheng Z.W., Xie D.X., Peng J.R.. Gibberellin acts through jasmonate to control the expression of MYB21, MYB24, and MYB57 to promote stamen filament growth in arabidopsis. PLos Genet. 2009; 5:e1000440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Seo P.J., Xiang F.N., Qiao M., Park J.Y., Lee Y.N., Kim S.G., Lee Y.H., Park W.J., Park C.M.. The MYB96 Transcription factor mediates abscisic acid signaling during drought stress response in arabidopsis. Plant Physiol. 2009; 151:275–289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Seo P.J., Park C.M.. MYB96-mediated abscisic acid signals induce pathogen resistance response by promoting salicylic acid biosynthesis in Arabidopsis. New Phytol. 2010; 186:471–483. [DOI] [PubMed] [Google Scholar]
  • 19. De Vos M., Denekamp M., Dicke M., Vuylsteke M., Van Loon L., Smeekens S.C., Pieterse C.M.. The arabidopsis thaliana transcription factor AtMYB102 Functions in defense against the insect herbivore pieris rapae. Plant Signal Behav. 2006; 1:305–311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Agarwal M., Hao Y., Kapoor A., Dong C.H., Fujii H., Zheng X., Zhu J.K.. A R2R3 type MYB transcription factor is involved in the cold regulation of CBF genes and in acquired freezing tolerance. J. Biol. Chem. 2006; 281:37636–37645. [DOI] [PubMed] [Google Scholar]
  • 21. Devaiah B.N., Madhuvanthi R., Karthikeyan A.S., Raghothama K.G.. Phosphate starvation responses and gibberellic acid biosynthesis are regulated by the MYB62 transcription factor in Arabidopsis. Mol. Plant. 2009; 2:43–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Ogata K., Hojo H., Aimoto S., Nakai T., Nakamura H., Sarai A., Ishii S., Nishimura Y.. Solution structure of a DNA-binding unit of Myb: a helix-turn-helix-related motif with conserved tryptophans forming a hydrophobic core. PNAS. 1992; 89:6428–6432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Ogata K., Morikawa S., Nakamura H., Sekikawa A., Inoue T., Kanai H., Sarai A., Ishii S., Nishimura Y.. Solution structure of a specific DNA complex of the Myb DNA-binding domain with cooperative recognition helices. Cell. 1994; 79:639–648. [DOI] [PubMed] [Google Scholar]
  • 24. Tahirov T.H., Sato K., Ichikawa-Iwata E., Sasaki M., Inoue-Bungo T., Shiina M., Kimura K., Takata S., Fujikawa A., Morii H. et al.. Mechanism of c-Myb-C/EBP beta cooperation from separated sites on a promoter. Cell. 2002; 108:57–70. [DOI] [PubMed] [Google Scholar]
  • 25. Wei S.Y., Lou Y.C., Tsai J.Y., Ho M.R., Chou C.C., Rajasekaran M., Hsu H.M., Tai J.H., Hsiao C.D., Chen C.. Structure of the Trichomonas vaginalis Myb3 DNA-binding domain bound to a promoter sequence reveals a unique C-terminal beta-hairpin conformation. Nucleic Acids Res. 2012; 40:449–460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Stevenson C.E.M., Burton N., Costa M.M.R., Nath U., Dixon R.A., Coen E.S., Lawson D.M.. Crystal structure of the MYB domain of the RAD transcription factor from Antirrhinum majus. Proteins. 2006; 65:1041–1045. [DOI] [PubMed] [Google Scholar]
  • 27. Jiang M., Sun L., Isupov M.N., Littlechild J.A., Wu X., Wang Q., Yang W., Wu Y.. Structural basis for the Target DNA recognition and binding by the MYB domain of phosphate starvation response 1. FEBS J. 2019; 286:2809–2821. [DOI] [PubMed] [Google Scholar]
  • 28. Song S.K., Ryu K.H., Kang Y.H., Song J.H., Cho Y.H., Yoo S.D., Schiefelbein J., Lee M.M.. Cell fate in the arabidopsis root epidermis is determined by competition between WEREWOLF and CAPRICE. Plant Physiol. 2011; 157:1196–1208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Zhu Y., Rong L., Luo Q., Wang B., Zhou N., Yang Y., Zhang C., Feng H., Zheng L., Shen W.H. et al.. The histone chaperone NRP1 interacts with WEREWOLF to Activate GLABRA2 in arabidopsis root hair development. Plant Cell. 2017; 29:260–276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Domcke S., Bardet A.F., Ginno P.A., Hartl D., Burger L., Schubeler D.. Competition between DNA methylation and transcription factors determines binding of NRF1. Nature. 2015; 528:575–579. [DOI] [PubMed] [Google Scholar]
  • 31. Murshudov G.N., Skubak P., Lebedev A.A., Pannu N.S., Steiner R.A., Nicholls R.A., Winn M.D., Long F., Vagin A.A.. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D. 2011; 67:355–367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Adams P.D., Afonine P.V., Bunkoczi G., Chen V.B., Davis I.W., Echols N., Headd J.J., Hung L.W., Kapral G.J., Grosse-Kunstleve R.W. et al.. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D. 2010; 66:213–221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Yoo S.D., Cho Y.H., Sheen J.. Arabidopsis mesophyll protoplasts: a versatile cell system for transient gene expression analysis. Nat. Protoc. 2007; 2:1565–1572. [DOI] [PubMed] [Google Scholar]
  • 34. Prouse M.B., Campbell M.M.. The interaction between MYB proteins and their target DNA binding sites. BBA-Gene Regul. Mech. 2012; 1819:67–77. [DOI] [PubMed] [Google Scholar]
  • 35. Koshino-Kimura Y., Wada T., Tachibana T., Tsugeki R., Ishiguro S., Okada K.. Regulation of CAPRICE transcription by MYB proteins for root epidermis differentiation in Arabidopsis. Plant Cell Physiol. 2005; 46:817–826. [DOI] [PubMed] [Google Scholar]
  • 36. Heine G.F., Hernandez J.M., Grotewold E.. Two cysteines in plant R2R3 MYB domains participate in REDOX-dependent DNA binding. J. Biol. Chem. 2004; 279:37878–37885. [DOI] [PubMed] [Google Scholar]
  • 37. Zhang H., Lang Z., Zhu J.K.. Dynamics and function of DNA methylation in plants. Nat. Rev. Mol. Cell Biol. 2018; 19:489–506. [DOI] [PubMed] [Google Scholar]
  • 38. Luo G.Z., Blanco M.A., Greer E.L., He C., Shi Y.. DNA N(6)-methyladenine: a new epigenetic mark in eukaryotes. Nat. Rev. Mol. Cell Biol. 2015; 16:705–710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Wang J., Tang C., Wang Q., Su J., Ni T., Yang W., Wang Y., Chen W., Liu X., Wang S. et al.. NRF1 coordinates with DNA methylation to regulate spermatogenesis. FASEB J. 2017; 31:4959–4970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. O’Malley R.C., Huang S.C., Song L., Lewsey M.G., Bartlett A., Nery J.R., Galli M., Gallavotti A., Ecker J.R.. Cistrome and epicistrome features shape the regulatory DNA Landscape. Cell. 2016; 165:1280–1292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Liu R., How-Kit A., Stammitti L., Teyssier E., Rolin D., Mortain-Bertrand A., Halle S., Liu M., Kong J., Wu C. et al.. A DEMETER-like DNA demethylase governs tomato fruit ripening. PNAS. 2015; 112:10804–10809. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Lang Z., Wang Y., Tang K., Tang D., Datsenka T., Cheng J., Zhang Y., Handa A.K., Zhu J.K.. Critical roles of DNA demethylation in the activation of ripening-induced genes and inhibition of ripening-repressed genes in tomato fruit. PNAS. 2017; 114:E4511–E4519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. El-Sharkawy I., Liang D., Xu K.. Transcriptome analysis of an apple (Malus x domestica) yellow fruit somatic mutation identifies a gene network module highly associated with anthocyanin and epigenetic regulation. J. Exp. Bot. 2015; 66:7359–7376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Oglesby L., Ananga A., Obuya J., Ochieng J., Cebert E., Tsolova V.. Anthocyanin accumulation in muscadine berry skins is influenced by the expression of the MYB Transcription factors, MybA1, and MYBCS1. Antioxidants-Basel. 2016; 5:35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Dasgupta K., Thilmony R., Stover E., Oliveira M.L., Thomson J.. Novel R2R3-MYB transcription factors from Prunus americana regulate differential patterns of anthocyanin accumulation in tobacco and citrus. GM Crops Food. 2017; 8:85–105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Li S.N., Wang W.Y., Gao J.L., Yin K.Q., Wang R., Wang C.C., Petersen M., Mundy J., Qiu J.L.. MYB75 Phosphorylation by MPK4 is required for Light-Induced anthocyanin accumulation in arabidopsis. Plant Cell. 2016; 28:2866–2883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Zhou H., Peng Q., Zhao J., Owiti A., Ren F., Liao L., Wang L., Deng X., Jiang Q., Han Y.. Multiple R2R3-MYB Transcription factors involved in the regulation of anthocyanin accumulation in peach flower. Front. Plant Sci. 2016; 7:1557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Chen K., Liu H., Lou Q., Liu Y.. Ectopic expression of the grape hyacinth (Muscari armeniacum) R2R3-MYB Transcription Factor Gene, MaAN2, Induces anthocyanin accumulation in tobacco. Front Plant Sci. 2017; 8:965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Manning K., Tor M., Poole M., Hong Y., Thompson A.J., King G.J., Giovannoni J.J., Seymour G.B.. A naturally occurring epigenetic mutation in a gene encoding an SBP-box transcription factor inhibits tomato fruit ripening. Nat. Genet. 2006; 38:948–952. [DOI] [PubMed] [Google Scholar]
  • 50. Koziol M.J., Bradshaw C.R., Allen G.E., Costa A.S.H., Frezza C., Gurdon J.B.. Identification of methylated deoxyadenosines in vertebrates reveals diversity in DNA modifications. Nat Struct Mol Biol. 2016; 23:24–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Liu J., Zhu Y., Luo G.Z., Wang X., Yue Y., Zong X., Chen K., Yin H., Fu Y., Han D. et al.. Abundant DNA 6mA methylation during early embryogenesis of zebrafish and pig. Nat. Commun. 2016; 7:13052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Wu T.P., Wang T., Seetin M.G., Lai Y., Zhu S., Lin K., Liu Y., Byrum S.D., Mackintosh S.G., Zhong M. et al.. DNA methylation on N(6)-adenine in mammalian embryonic stem cells. Nature. 2016; 532:329–333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Xiao C.L., Zhu S., He M., Chen Zhang, Q., Chen Y., Yu G., Liu J., Xie S.Q., Luo F. et al.. N(6)-Methyladenine DNA Modification in the human genome. Mol. Cell. 2018; 71:306–318. [DOI] [PubMed] [Google Scholar]
  • 54. Liang Z., Shen L., Cui X., Bao S., Geng Y., Yu G., Liang F., Xie S., Lu T., Gu X. et al.. DNA N(6)-Adenine methylation in arabidopsis thaliana. Dev. Cell. 2018; 45:406–416. [DOI] [PubMed] [Google Scholar]
  • 55. Zhang Q., Liang Z., Cui X., Ji C., Li Y., Zhang P., Liu J., Riaz A., Yao P., Liu M. et al.. N(6)-Methyladenine DNA Methylation in japonica and indica rice genomes and its association with gene expression, plant development, and stress responses. Mol. Plant. 2018; 11:1492–1508. [DOI] [PubMed] [Google Scholar]
  • 56. Zhou C., Wang C., Liu H., Zhou Q., Liu Q., Guo Y., Peng T., Song J., Zhang J., Chen L. et al.. Identification and analysis of adenine N(6)-methylation sites in the rice genome. Nat. Plants. 2018; 4:554–563. [DOI] [PubMed] [Google Scholar]
  • 57. Mondo S.J., Dannebaum R.O., Kuo R.C., Louie K.B., Bewick A.J., LaButti K., Haridas S., Kuo A., Salamov A., Ahrendt S.R. et al.. Widespread adenine N6-methylation of active genes in fungi. Nat. Genet. 2017; 49:964–968. [DOI] [PubMed] [Google Scholar]
  • 58. Yao B., Cheng Y., Wang Z.Q., Li Y.J., Chen L., Huang L.X., Zhang W.X., Chen D.H., Wu H., Tang B.S. et al.. DNA N6-methyladenine is dynamically regulated in the mouse brain following environmental stress. Nat. Commun. 2017; 8:1122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Ma C.C., Niu R., Huang T.X., Shao L.W., Peng Y., Ding W.Q., Wang Y., Jia G.F., He C., Li C.Y. et al.. N6-methyldeoxyadenine is a transgenerational epigenetic signal for mitochondrial stress adaptation. Nat. Cell Biol. 2019; 21:319–327. [DOI] [PubMed] [Google Scholar]
  • 60. Yao B., Li Y., Wang Z., Chen L., Poidevin M., Zhang C., Lin L., Wang F., Bao H., Jiao B. et al.. Active N(6)-methyladenine demethylation by DMAD regulates gene expression by coordinating with polycomb protein in neurons. Mol. Cell. 2018; 71:848–857. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkz1081_Supplemental_File

Data Availability Statement

Structural factors and coordinates have been deposited in the Protein Data Bank under accession code 6KKS for the WER–DNA complex.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES