Skip to main content
iMeta logoLink to iMeta
. 2025 Mar 5;4(2):e70009. doi: 10.1002/imt2.70009

Specificity landscapes of 40 R2R3‐MYBs reveal how paralogs target different cis‐elements by homodimeric binding

Tian Li 1, Hao Chen 1, Nana Ma 1,2, Dingkun Jiang 1,2, Jiacheng Wu 1, Xinfeng Zhang 1, Hao Li 1,2, Jiaqing Su 3, Piaojuan Chen 1, Qing Liu 4, Yuefeng Guan 3, Xiaoyue Zhu 1, Juncheng Lin 1, Jilin Zhang 5,6,7, Qin Wang 1,, Honghong Guo 1,2,, Fangjie Zhu 1,
PMCID: PMC11995187  PMID: 40236784

Abstract

Paralogous transcription factors (TFs) frequently recognize highly similar DNA motifs. Homodimerization can help distinguish them according to their different dimeric configurations. Here, by studying R2R3‐MYB TFs, we show that homodimerization can also directly change the recognized DNA motifs to distinguish between similar TFs. By high‐throughput SELEX, we profiled the specificity landscape for 40 R2R3‐MYBs of subfamily VIII and curated 833 motif models. The dimeric models show that homodimeric binding has evoked specificity changes for AtMYBs. Focusing on AtMYB2 as an example, we show that homodimerization has modified its specificity and allowed it to recognize additional cis‐regulatory sequences that are different from the closely related CCWAA‐box AtMYBs and are unique among all AtMYBs. Genomic sites described by the modified dimeric specificities of AtMYB2 are conserved in evolution and involved in AtMYB2‐specific transcriptional activation. Collectively, this study provides rich data on sequence preferences of VIII R2R3‐MYBs and suggests an alternative mechanism that guides closely related TFs to respective cis‐regulatory sites.

Keywords: cis‐elements, DNA binding specificity, homodimerization, MYB family transcription factors, paralogs, regulatory noncoding genome, transcriptional regulation


Closely related transcription factor (TF) paralogs are facing the “specificity paradox”—they share similar binding motifs, but their cis‐regulatory targets and physiological roles can be different. By applying high‐throughput SELEX to 40 R2R3‐MYB TFs, this study currently generates the largest data set illustrating the homodimeric specificities of plant TFs, while also reveals a yet unrecognized mechanism to solve the “specificity paradox”—a TF's binding specificity can change drastically upon homodimerization, and become unique across the whole family.

graphic file with name IMT2-4-e70009-g005.jpg

Highlights

  • A homodimerization‐based mechanism enables eukaryotes to distinguish paralogous transcription factors.

  • The largest plant SELEX data set illustrates the sequence preferences of 40 VIII R2R3‐MYBs.

  • The high specificity AtMYBs were discovered and named CCWAA‐box.

INTRODUCTION

The spatial and temporal expression profile of genes is largely determined by the transcription factors (TFs) that bind to specific DNA sequences and activate the target genes. Specificity describes the degree of discrimination when a TF binds different DNA sequences, that is, how “picky” the TF is upon DNA binding. The specificity of a TF is typically measured by the information content (IC) of the TF's binding motif. By recognizing long binding motifs (~23 bits of IC) [1, 2, 3, 4], prokaryotic TFs have sufficient specificity to define ~10 targets in their ~106 bp genome. In contrast, eukaryotic TFs are facing a specificity paradox. This is because they recognize much shorter DNA motifs (~12.1 bits of IC) [1, 5, 6, 7] that would define ~105 binding sites in the ~109 bp genome, but practically, a eukaryotic TF has to specifically regulate only 10–1000 genes [8, 9]. Moreover, the specificity paradox also applies to TFs of the same family. Whereas closely related TFs tend to bind highly similar sequences, their binding sites and target genes, as well as their physiological functions, can be distinct from each other [10, 11]. While epigenetic modifications [12, 13], chromatin contexts [11, 14], and TFs' DNA shape preference [15, 16, 17, 18, 19] have partially resolved this paradox by restricting the suitable target sites, dimerization of TFs also represents a straightforward solution that increases both the specificity [20, 21] and affinity [22] of eukaryotic TFs.

Homodimerization occurs between monomers of the same TF. It is known to help discriminate between closely related paralogous TFs because the dimeric configurations (relative spacing and orientation) can be unique for each TF. For example, the homodimeric configuration has contributed to distinguishing between the closely related animal TFs of families like ETS, T‐box, forkhead, RHD, and homeodomain [23]. In plants, the importance of dimeric configuration was demonstrated for the auxin response factors (ARFs)‐family TFs. Although ARFs within the same clade share a similar sequence preference for monomers [24], their homodimers prefer different relative orientations and spacings [25]. Moreover, changes to homodimeric configurations also distinguish the same TF in different species in evolution [26]. Despite these findings, whether homodimerization affects the sequence specificity of TFs, and whether such effect is shared by all closely related TFs or can be specific to individual TFs, is yet to be explored.

MYB represents one of the largest families of plant TFs in higher plants [27, 28]. The size expansion of MYB TFs is mainly attributed to the rapid expansion of the R2R3‐MYBs during green plant evolution [29, 30], and especially to the subfamily VIII R2R3‐MYBs that emerged in the common ancestor of Zygnematophyceae and land plants [30]. The sequences of the R2R3‐MYB TFs feature two highly conserved MYB repeats that bind to DNA. Members of subfamily VIII R2R3‐MYB TFs regulate numerous plant‐specific physiological processes, such as the synthesis of secondary metabolites [31, 32], deposition of cell walls [33], responses to biotic and abiotic stresses [34, 35], formation of cuticle and trichomes [36], and development of stamens [37]. In stark contrast to the diverse physiological functions of VIII R2R3‐MYBs, the publicly available high‐throughput data of DNA affinity purification sequencing (DAP‐seq) and protein binding microarray (PBM) [7, 13, 38, 39, 40] indicate that their monomers bind to highly similar and short DNA motifs [30, 40] (Figure S1A). Therefore, how VIII R2R3‐MYB TFs solve the “specificity paradox” to target different downstream pathways is of biochemical interest and physiological importance.

Subfamily VIII AtMYBs often dimerize in transcriptional regulation [41]. Homodimers have also been extensively reported for VIII AtMYBs (e.g., AtMYB21, AtMYB24 [42], AtMYB3 [43], and AtMYB82 [44]). However, DAP‐seq and PBM libraries of VIII R2R3‐MYB TFs have provided limited insights into their dimeric binding. This is because for PBM, the length of the randomized region on the probes is ~10 bp, which, in general, is not adequate to accommodate the dimeric binding modes. For DAP‐seq, the complexity of the input library is restricted by the size of the genome. DAP‐seq usually results in thousands of peaks; the limited number of sequences from DAP‐seq peaks makes it difficult to derive dimeric binding models of TFs (e.g., very few dimeric models were found for VIII AtMYBs; Data S1) because these sequences represent an undersampling of the high‐information‐content dimeric binding modes.

Here, we systematically analyzed with high‐throughput systematic evolution of ligands by exponential enrichment (HT‐SELEX) the binding preferences of 40 R2R3‐MYBs of subfamily VIII. The complexity (1012 input sequences) and ligand length (101‐bp randomized region) of the SELEX library allow a comprehensive characterization of both the monomeric and dimeric specificities (Data S2). The monomeric specificities revealed that a group of “CCWAA‐box” AtMYBs feature a distinct specificity and tend to be more selective than other VIII AtMYBs. The homodimeric specificities suggest that when two molecules of VIII AtMYBs bind closely to each other, the binding specificity of the half‐sites can change drastically. This has allowed AtMYB2 to recognize cis‐regulatory codes that are unique among all AtMYBs, thereby representing an alternative solution to the “specificity paradox” faced by paralogous eukaryotic TFs.

RESULTS

Specificity landscapes reveal a high selectivity of CCWAA‐box MYBs

To profile the landscape of DNA‐binding specificity of VIII R2R3‐MYBs, 3–5 cycles of high‐throughput SELEX enrichment were carried out. The enrichment suggests that SELEX has evolved high‐affinity ligands (Figure S2A–C). Replicates of SELEX libraries agree well with each other (Figure S2C). The mutual‐information‐based analyses [45] (Figure S2D) also confirmed the enrichment of TF‐preferred DNA sequences in the SELEX libraries.

Figure 2.

Figure 2

Modified specificities of closely spaced AtMYB2 homodimers. (A) AtMYBs with similar monomeric motifs have different homodimeric preferences. Enrichments of dimeric AtMYB CREs with different spacings and three relative orientations in SELEX libraries. Two “ACCWAMY” strings are concatenated with the annotated spacing (x‐axis) and assessed for enrichment. (B) Homodimeric preferences of genomic AtMYB CREs. Enrichments of dimeric AtMYB CREs with different spacings and three relative orientations in DAP‐seq peaks. (C) Monomeric and homodimeric motifs of AtMYB2. De novo discovered motifs in the AtMYB2 SELEX library, with their information content (red squares) and enrichment (green squares) indicated to the left. Dimeric motifs were named according to their relative orientations and spacings. The spacing regions are colored gray (half‐site: ACCWAAY). Note that half‐sites in ER1s and ER0s are less similar to the monomeric motifs, especially for ER1a (red). (D) ER1a is dissimilar to reported plant motifs. The ER1a motif was compared to all reported plant motifs [6, 48, 49], and the three most similar known motifs are shown. (E) Distinct DNA‐shape preference of ER1a. Average minor groove widths are illustrated for each position of the sequences harboring respective motifs of AtMYB2. Positions are aligned with additional gaps (the light yellow zone between the vertical dashes) to separate the two half‐sites of ER0–2. Note that the profile is similar for half‐sites of all motifs except for ER1a. (F) Non‐DBD region affects AtMYB2 dimeric preferences. SELEXs with progressively truncated AtMYB2 in the non‐DBD region reveal that segments A and D determine the dimeric configurations of AtMYB2 because the dimeric preference changes dramatically when truncating A and D, but only slightly when truncating B and C. Accordingly, A and D are also evolutionarily more conserved (Figure S3H). The degree of conservation is also summarized across AtMYB2.

The SELEX libraries include 38 Arabidopsis VIII MYBs (AtMYBs), 14 of which were not covered by previous PBM and DAP‐seq datasets (Figure 1A). However, also for the overlapping datasets, the SELEX libraries can offer more binding models (Figure 1B), especially the dimeric binding models (Figure 1B,C). Moreover, the monomeric motifs generated by SELEX cover a larger specificity space compared to the DAP‐seq motifs (Figure 1D). This presumably is a consequence of the nonrandom nature of the DAP‐seq input sequences—the motifs from DAP‐seq reflect not only the biochemical affinity of the cis‐elements but also their genomic prevalence. Therefore, the SELEX data set complements the previous PBM/DAP datasets.

Figure 1.

Figure 1

The landscape of binding specificity of VIII R2R3‐MYBs. (A) Comparison of SELEX and previous datasets. (B) Successful rates of motif discovery. The fractions of libraries that yield TF motifs (with Autoseed) are shown. (C) Numbers of dimeric motifs derived from individual libraries. ***p < 0.001 in t‐test. (D) The specificity space of monomeric motifs. Multidimensional scaling results of the monomeric motifs. The normal data ellipses (95%) were also shown to facilitate comparison. (E) The monomeric binding preferences of VIII R2R3‐AtMYBs. Motif logos are discovered from the HT‐SELEX libraries, and also from the published DAP‐seq libraries [13]. PBM library is used for AtMYB46. The color of the bar to the left of the TF names indicates the clades of VIII R2R3‐AtMYBs [30]. The color of squares to the left of the motifs represents its relative enrichment compared to the primary motifs. The CCWAA‐box AtMYBs were boxed with blue dashes. (F) Sequence alignment of the DBDs of VIII‐E R2R3‐MYBs. The VIII‐E AtMYBs studied in this work are aligned with all VIII‐E MYBs in M. polymorpha and S. moellendorffii. DNA‐contacting residuals are annotated based on the structure of AtMYB66 (PDB: 6KKS [46], also included in alignment). The two mutations at positions 53 and 113 (red boxes) likely accounted for the unique specificity of the CCWAA‐box AtMYBs (blue box). (G) Contacts between AtMYB66 and DNA bases. PDB: 6KKS visualized by DNAproDB [47]. (H) Binding specificities of ancestral VIII‐E R2R3‐MYBs. SELEX motifs of MA0073s0038 and MN199019. Note that their motifs are more similar to most VIII AtMYBs (e.g., AtMYB10, 58) and are different from CCWAA‐box AtMYBs (e.g., AtMYB57, 2, 79; positions with large discrepancies are gray‐shaded).

Next, we examined the monomeric binding preferences of the VIII AtMYBs and clustered them according to their primary motifs (Figure 1E). De novo motif discovery reveals that the examined AtMYBs enriched primary, secondary, and tertiary motifs (Figure 1E). Consistent with the PBM studies [7], when multiple monomeric motifs exist for the same AtMYB, they are usually quite similar. This is different from TFs of bZIP [20], C2C2‐Dof, and DREB [7] families, for which the secondary motif could significantly diverge from the primary motif. In general, the monomeric motifs derived from SELEX agree well with the DAP‐seq motifs (Figure 1E and Figure S1A). Although with weaker enrichments, SELEX is capable of discovering secondary/tertiary motifs that are usually not detected in DAP‐seq.

While the majority of the VIII AtMYBs prefer to bind DNA sequences summarized by “AMCWAMY” (W: A/T; M: A/C; Y: C/T), many AtMYBs of the VIII‐E clade [30] prefer to bind “RCCWAAY” (CCWAA‐box) as the consensus (blue box; Figure 1E) and feature a higher specificity than other AtMYBs (Figure 1E), that is, motifs of CCWAA‐box AtMYBs are with higher ICs (Figure S1C). The CCWAA‐box AtMYBs likely originated from a common ancestor (within the same monophyletic clade; Figure S1B). To find potential amino acids that account for their distinct specificity, we next aligned the sequences of the VIII AtMYBs studied in this work, together with all the VIII MYBs in M. polymorpha and S. moellendorffii, whereby the VIII subfamily of MYBs has not expanded (Figure 1F and Figure S2E). We focused on the amino acid sites that are both conserved in evolution (thus higher probability to involve in DNA binding) and differ between CCWAA‐box AtMYBs and other MYBs. We found that mutations in positions 53 and 113 (red frames in Figure 1F and Figure S2E) presumably explained the distinct specificity. The base contacts by asparagine at position 113 have also been validated by the structural data of AtMYB66 [46] (Figure 1F,G), and by the AlphaFold structure of AtMYB2 (with R113; Figure S2F). Further examination of the sequences in M. polymorpha and S. moellendorffii suggests that consistent with the majority of VIII AtMYBs, ancestors of VIII‐E clade MYBs (Figure 1F) and other clades of VIII MYBs (Figure S2E) also contain C, N at positions 53, 113. In agreement with the sequence analyses, we performed SELEX for MA0073s0038 (M. polymorpha) and MN199019 (S. moellendorffii), and found that they bind to “ACCWAMM” (Figure 1H), which is more similar to the “AMCWAMY” (most other AtMYBs, e.g., AtMYB58/10 in Figure 1H) than the “RCCWAAY” (CCWAA‐box AtMYBs). Therefore, the distinct specificity of CCWAA‐box VIII‐E AtMYBs could have originated from their unique AA identities at positions 53 and 113.

Modified specificity of closely spaced AtMYB2 homodimers

The closely related AtMYBs are further discriminated by their homodimeric configurations (Figure 2A and Figure S3A). For example, the CCWAA‐box AtMYBs 2, 57, and 79 were neighbors in the clustering and feature highly similar motifs (Figure 1E); however, their preferred homodimeric spacings and orientations are distinct from each other (Figure 2A). The most preferred dimeric orientation for AtMYB2 is everted repeats (ER), while inverted repeats (IR) are most preferred by AtMYB57, and direct repeats (DR) by AtMYB79 (Figure 2A). The AtMYBs also prefer different dimeric spacings. For the DR configurations, while AtMYB2 specifically binds DR3 and DR4, AtMYBs 57 and 79 bind to almost all DR modes (Figure 2A). Although de novo discovery can hardly find dimeric motifs from DAP‐seq libraries (Figure 1B, Data S1), the enrichment analysis is still possible and gives insights into configurational preferences of homodimeric cis‐regulatory elements (CREs). Next, we generated DAP‐seq libraries for AtMYB2/57/79 and analyzed the homodimeric configurations of their genomic CREs (Figure 2B and Figure S3B). Consistent with previous analyses on dimeric WRKY CREs [50], the homodimeric configurations of the AtMYBs in DAP‐seq resemble those in SELEX but also bear discrepancies, likely because thermodynamic selection and drifts both contributed to shaping the cis‐regulatory codes in evolution [50].

Because the enrichment analysis with consensus cannot determine whether the sequence preferences of AtMYBs have changed upon homodimerization, we next systematically discovered and curated the homodimeric motifs for AtMYBs (Figure 2C, all motifs visualized in Data S2, data in Tables S4S41). In total, we report 743 homodimeric motifs of AtMYBs in this study. The dimeric motifs were named according to the relative orientations (DR, IR, and ER) and spacings between the monomers (Figure 2C). In public databases [5, 6, 48, 49], the majority of the reported plant motifs for AtMYBs were monomeric (Figure S1A). The identified dimeric motifs could add extra insights into the combinatorial regulatory codes of AtMYBs. Notably, we observed that AtMYBs can change their DNA‐binding specificity when two monomers bind closely to each other (Figure 2C and Figure S3C), that is, the half‐sites of the dimeric motifs diversify from the monomeric motif. Such specificity change is particularly prominent for AtMYB2 (Figure 2C).

For AtMYB2, the discovered dimeric motifs encompass all relative orientations of the monomers (Figure 2C), with ER8 being the strongest dimeric mode. The motif models more enriched in SELEX (green squares in Figure 2C) generally also feature higher information contents (IC, red squares in Figure 2C). The closely spaced homodimers in the ER conformation are associated with high ICs (thus sufficient affinity and specificity) and represent reliable binding modes (ER0s and ER1s). In contrast with the distantly spaced ER modes (ER7–10; Figure 2C) whereby the half‐sites are almost the same as the monomeric motif Ma, the specificity has changed upon proximal homodimerization (ER0s and ER1s) and is especially drastic for ER1a (name in red; Figure 2C). The half‐site on the right of ER1a is completely different from the monomeric modes Ma and Mb. ER0s and ER1s also align poorly with previously reported plant motifs (Figure 2D and Figure S3D), suggesting that these dimeric modes were not reported before. The two binding events in all the identified dimeric modes are cooperative (Figure S3E).

In addition to individual nucleotides, the shape of local DNA sequences also contributes to the specificity of TF binding [15, 16, 17] and aids in discriminating binding sites of closely related TFs [18, 19, 51]. We examined the DNA‐shape parameters for the SELEX reads that matched each ER dimeric motif with a high IC, and found that the half‐sites of most dimeric modes feature DNA‐shape patterns that resemble the shapes of the monomeric AtMYB2 motif (Figure 2E and Figure S3F). A narrowed minor grove is observed for the spacing regions between the two half‐sites (Figure 2E). However, ER1a differs from other ER modes for most of the shape parameters, for example, the minor grove width (Figure 2E), buckle, rise, shift, and opening (Figure S3F). Therefore, proximal homodimeric binding not only induces changes to the nucleotide specificity of AtMYB2 but also to its DNA‐shape specificity.

The non‐DBD (DNA binding domain) region was shown to affect the dimerization and DNA‐binding specificity of TFs [52, 53]. Similarly, we found that the non‐DBD region of AtMYB2 is critical for its dimerization (MYB2‐full vs. MYB2‐Δall; Figure 2F). Consistently, the AlphaFold structures of AtMYB2‐ER8 show that the non‐DBD region is indispensable for the contacts between the two AtMYB2 monomers (Figure S3G). In a higher resolution, the non‐DBD segments D and A are the major determinants of the dimeric configurations. Truncation of segment D has decreased the distantly spaced dimeric modes (ER7–9) of AtMYB2 (MYB2‐full vs. MYB2‐ΔD; Figure 2F), and truncation of segment A has further abolished almost all of the dimeric modes (DR3/4, IR0/‐1, and ER0/1s) while selectively enhanced ER2 and ER3 (MYB2‐ΔBCD vs. MYB2‐Δall; Figure 2F). In contrast, only slight changes were observed when truncating segments B (MYB2‐ΔCD vs. MYB2‐ΔBCD; Figure 2F) and C (MYB2‐ΔD vs. MYB2‐ΔCD; Figure 2F). Accordingly, segments A and D are also more conserved than B and C in evolution (Figure 2F and Table S3), and segment D likely also represents the activation domain (Figure S3H). These results underscore the importance of the non‐DBD region in shaping the dimeric binding specificities.

Because AtMYB2 drastically changes its specificity when binding as closely spaced ER homodimers, we next focused on ER0s and ER1s (especially ER1a) of AtMYB2 to explore their uniqueness, biochemical activity, and transcriptional regulatory relevance.

The modified specificities are unique to AtMYB2

To examine whether the modified specificity of ER0s and ER1s are unique to AtMYB2, we explored the enrichments of ER0s and ER1s in SELEX libraries of all AtMYBs (Figure 3A). Although ER0s and ER1s were not detected for the majority of the other AtMYBs, we did observe that AtMYB41/49/107 enriched ER0b, 1a, and 1b. Because motif enrichment analysis is subject to false positives, we next inspected the de novo motifs from SELEX and found that AtMYB41/49/107 libraries did not contain the exact ER0b, 1a, and 1b of AtMYB2 (Figure 3B), but instead enriched degenerated ER1s that represent a larger set of preferred DNA sequences. The positions with high IC in the ER1s of AtMYB41/49/107 align with the ER0b, 1a, and 1b of AtMYB2 (Figure 3B); we thus suspect it is the presence of these ER1 modes that have resulted in the observed motif enrichments of ER0b, 1a, and 1b for AtMYB41/49/107 (Figure 3A). To validate this hypothesis, we generated an in silico SELEX library (Figure 3A) that enriches only the ER1 of AtMYB49 and used the library for enrichment analysis of AtMYB2's motifs. As expected, the simulated SELEX library also enriched the ER0b, 1a, and 1b of AtMYB2. Therefore, among the examined VIII AtMYBs, the observed ER0s and ER1s are unique to AtMYB2, with the exception that a motif similar to AtMYB2's ER1b is indeed discovered from libraries of AtMYB21/57/62 (Figure 3B).

Figure 3.

Figure 3

The modified specificities are unique to AtMYB2. (A) AtMYB2 enriches unique ER0s and ER1s in SELEX. Enrichments of AtMYB2 ER0s and ER1s in SELEX libraries of AtMYBs (left). For a virtual TF that binds only the ER1 of AtMYB41/49/107 (right), its in silico SELEX also strongly enriched ER0b, 1a, 1b of AtMYB2, suggesting that the observed enrichments for AtMYB41/49/107 in the left panel are false positives and that AtMYB2 has unique ER0 and ER1 modes, except that ER1b is also bound by AtMYB21/57/62. (B) Degenerated AtMYB41/49/107 ER1s are aligned with AtMYB2 motifs. The ER1s of MYB41, 49, 107, 21, 57, and 62 are compared to ER0b, 1a, 1b of AtMYB2. The bar color indicates the clades of the VIII AtMYBs according to Figure 1E. (C) AtMYB2 enriches unique ER0s and ER1s in DAP‐seq peaks. Enrichments of AtMYB2 motifs are surveyed for DAP‐seq peaks. Note that ER0s and ER1s of AtMYB2 (dashed box) are uniquely enriched for AtMYB2. Similar to the observation for SELEX (A, B), the enrichments of ER0b, 1a, 1b for AtMYB49, 107, 93, and 80 (cyan boxes) are presumably from their degenerated ER1 motifs. Note that the specificity‐changed ER0/1s are only enriched for AtMYB2 here but not in (Figure 2B), whereby the use of fixed strings cannot capture the specificity changes.

To further validate the uniqueness of the ER0s and ER1s of AtMYB2, we exploited the highly abundant DAP‐seq datasets both for AtMYBs [13] and MYBs of other plants [54, 55, 56, 57, 58]; they also support enrichment analysis of available dimeric motifs. We examined the enrichment of AtMYB2 motifs within DAP‐seq peaks (Figure 3C and Figure S4). The results show that among the 54 DAP‐seq libraries of AtMYBs, almost all subfamily VIII R2R3‐AtMYBs (dots aside from TF name; Figure 3C) have enriched AtMYB2's monomeric motifs (Ma/Mb). This is in line with the similarity between monomeric motifs across the VIII R2R3‐AtMYBs (Figure S1A). AtMYB2's dimeric modes ER7–9 and DR2–4 also enriched in ~1/2 of the other VIII R2R3‐MYBs (Figure 3C). Among the homodimeric modes with modified specificities (ER0s and ER1s of AtMYB2; boxed with the dashed line in Figure 3C), ER0s and ER1a are uniquely and strongly enriched for AtMYB2, suggesting that these dimeric binding modes provide sufficient specificity to distinguish AtMYB2 from other AtMYBs. A few other AtMYBs also moderately enrich ER0s and ER1s, for example, the AtMYBs 49, 80, 93, and 107 (cyan boxes; Figure 3C). As already validated for AtMYB49 and 107 (Figure 3A,B), such enrichments presumably originate from sequences corresponding to more degenerated motifs.

Moreover, we analyzed the published DAP‐seq libraries of MYB‐family TFs for Eucalyptus grandis, Oryza sativa, Vitis vinifera, and Camelina sativa. Similar to AtMYBs (Figure 3C), many of these DAP‐seq libraries enriched the monomeric modes Ma/Mb, but strong enrichments of ER0s and ER1a were not observed (Figure S4), further confirming their uniqueness to AtMYB2.

The modified specificities allow AtMYB2 to recognize additional genomic sites

Because the specificity change is most drastic for ER1a, we next focused on ER1a as an example to validate its biochemical affinity to AtMYB2. With the recombinant AtMYB2 protein (Figure S5A,B), we performed EMSA with DNA ligands containing an ER1a consensus, together with ligands harboring Ma and ER8 for comparison (Figure 4A). We observed that all ligands are capable of binding a single molecule of AtMYB2. However, the shifted band according to dimeric binding of AtMYB2 is observed at high concentrations (36, 48 ng/µL) of AtMYB2 only for ER1a and ER8 ligands, but not for Ma ligands (Figure 4A), suggesting that although bearing a drastic specificity change, ER1a still represent a functional dimeric CRE of AtMYB2.

Figure 4.

Figure 4

The modified specificities allow AtMYB2 to recognize additional genomic sites. (A) ER1a is functional for the dimeric binding of AtMYB2. Electrophoretic mobility shift assay of AtMYB2 binding to Ma, ER1a, and ER8 consensus. The concentrations of AtMYB2 are 18, 24, 36, 48 ng/µL from left to right (ER1a and ER8) and 24, 36, 48 ng/µL for Ma. (B) ER1a is enriched in genomic CREs of AtMYB2. Enrichment of ER1a in DAP‐seq peaks of AtMYB2 is comparable to the enrichment of the monomeric motifs Ma/Mb (left), whereas the shuffled DAP‐seq peaks show little enrichment of ER1a (middle). Mutations to CAC in the right half‐site of ER1a also decrease its enrichment (right). (C–E) ER0s and ER1s improve the accuracy of the prediction of AtMYB2 genomic CREs. Precision recall (PR) curves (C, D) and area under the curve (AUPRC) values (E) for motif‐based prediction of DAP‐seq peaks. Note that when combined with the monomeric motifs (M), the ER modes most significantly increased the prediction accuracy (E); further separating individual ER modes shows that ER1s and ER0s (D, E) with modified specificities have contributed most of the additional prediction power. Wilcoxon signed‐rank test (for paired precision values at each recall level) is performed to evaluate if adding the dimeric modes to M has significantly increased the predictive power. (F) ER0s and ER1s define unique targets of AtMYB2. ER0/1 sites are specifically enriched in DAP peaks of AtMYB2 but not in peaks of AtMYB57/79. This contrasts with IR, DR, and ER2–10. (G, H) The modified specificities of ER1a contribute to the affinity of AtMYB2 CREs. DAP‐seq peaks are scored with either the Ma and ER1a motifs of AtMYB2 (G). Peaks with low Ma scores but high ER1a scores were identified. Red circles highlight the three peaks that are exemplified in (H), whereby CRE sequences similar to ER1a but not to Ma (D1 and D2) specifically colocalize with the DAP peaks of AtMYB2 (CRE in peak indicated by gray), and the CRE sequence (M1) similar to Ma is associated with DAP peaks of AtMYB2/57/79, emphasizing the affinity and specificity contributions from the modified right half‐site of ER1a.

Moreover, we asked whether genomic sites of ER1a are recognized by AtMYB2. For this purpose, we further analyzed the DAP‐seq data of AtMYB2. Among the top 3000 peaks, there are 11.7% peaks contain one or more ER1a binding sites (Figure 4B). This fraction is comparable to that of the peaks containing monomeric AtMYB2 sites (11.2%) and is the highest among all AtMYB2's dimeric sites (Figure S5C). The number of ER1a sites diminishes when the peak sequences are shuffled (0.35%) or upon mutating the ER1a motif at the CAC nucleotides (2.75%; Figure 4B); therefore, the enrichment of ER1a is not due to mono‐nucleotide bias or the left half‐site of ER1a with unchanged specificity. These results suggest that genomic ER1a sites are recognized by AtMYB2 and contribute to the DAP‐seq signals.

Because DAP‐seq uses gDNA that is free of DNA‐associated proteins such as histones, the binding affinity of AtMYB2 to the genomic sites depends purely on the underlying DNA sequence. Therefore, we examined to which extent the DAP‐seq signals can be modeled by the curated SELEX motifs of AtMYB2. Based on the highest motif score of each sequence, a binary classification was performed to discriminate genomic regions into bound and unbound. Sequences in the DAP‐seq peaks are treated as bound, and the sequences outside the peaks are treated as unbound. The precision‐recall (PR) curves were generated by varying the threshold (Figure 4C,D and Figure S5D). The results show that combining the two monomeric motifs has relatively low predictive power (curve M; Figure 4C), with an AUPRC (area under the PR curve) of 0.0301. Adding DR motifs (but not IR) into the prediction can improve the accuracy (Figure 4C,E). However, the most significant improvement in prediction accuracy is observed when incorporating the ER motifs (AUPRC = 0.0863; Figure 4C,E). The additional predictive power of ERs mainly comes from the closely‐spaced ER motifs (ER0s and ER1a; Figure 4E and Figure S5E), but not from the ER motifs with large spacings (e.g., ER8; Figure 4E) despite their higher affinities (Figure 2C). This is consistent with the fact that drastic specificity changes primarily occur within half‐sites of ER0 and ER1 motifs (Figure 2C). Notably, adding ER1a alone to monomeric motifs has increased AUPRC from 0.0301 to 0.0471 (Figure 4D,E). Moreover, the ER0/1 s of AtMYB2 cannot improve the prediction of the DAP signals of the closely related AtMYB57/79 (Figure S5D). Consistently, we found that in the generated DAP libraries of AtMYB57/79, significantly fewer peaks contain sites of ER0/1 s compared to AtMYB2, whereas more peaks contain sites of Ma/Mb, DR, IR, and ER2–10 (Figure 4F). These comparative analyses suggest that ER0/1s are uniquely bound by AtMYB2.

To identify individual sites that contributed to ER1a's additional predictive power, we scored the DAP‐seq peaks of AtMYB2 with both ER1a and Ma (Figure 4G). The weak correlation and the existence of high ER1a score, and low Ma score peaks (e.g., the red circles in Figure 4G, coverage tracks, and binding site sequences exemplified in Figure 4H) suggest that the modified specificities in the right half‐site of ER1a also contributes to AtMYB2 binding, and help distinguish AtMYB2 from the closely related AtMYB57/79 (Figure 4H). Although both DAP‐seq and SELEX reflect the intrinsic biochemical affinity of a TF, discrepancies between the two libraries are observed for the enrichment of the AtMYB2 motifs (Figure S5F). Specifically, ER1a and ER0s are more enriched in DAP‐seq peaks than in SELEX reads (Figure S5F). Such discrepancy presumably reflects the preferred usage of cis‐regulatory codes in the genome resulting from evolutional drifts [50].

The modified specificities of AtMYB2 are functional to activate transcription

We further explored the roles of ER0s and ER1s of AtMYB2 in transcriptional regulation. First, we found that ER0s and ER1s are enriched around TSSs to an extent comparable with the monomeric motifs (Figure 5A). As a control, when we mutate the right half‐site of ER1a, TSS enrichment of the motif decreases (CACtoTTA and CACtoATA; Figure 5A). In general, SELEX motifs with higher IC contents (yellow‐red squares in Figure 5A) are also more enriched around the TSSs, suggesting that the high‐affinity binding modes identified by SELEX are more likely to be transcriptionally functional.

Figure 5.

Figure 5

The modified specificities of AtMYB2 are functional to activate transcription. (A) ER0 and ER1 CREs enrich around promoters. Distributions of AtMYB2 CREs around transcription start sites (TSSs) of A. thaliana genes were evaluated by the density of motif matches. The yellow‐red squares indicate the information content of the motifs (same as in Figure 2C). Note that ER0s and ER1s are enriched around TSSs to an extent comparable with the monomeric motifs and that mutations to the right half‐site of ER1a (CACtoTTA/ATA) decreased its enrichment. (B) ER0s and ER1s of AtMYB2 activate transcription. (Top) Vectors used in the dual‐luciferase reporter assays. The consensus of each motif was repeated 4 times and inserted in front of a mini‐35S promoter. (Bottom) The reporter activity of ER0s and ER1a. ***p < 0.001 in t‐test. (C) Genes activated by AtMYB2 overlap with DNA affinity purification sequencing (DAP‐seq) peaks. DEGs were identified by comparing RNA‐seq data before and after overexpression of AtMYB2 in protoplasts. The DAP‐seq peaks of AtMYB2 significantly overlap with the upregulated DEGs but not with the downregulated DEGs, p values from Fisher's exact test. (D) Genes activated by AtMYB2 enrich modified dimeric specificities in promoters. Motif enrichments in the promoters of upregulated DEGs are compared to those in promoters of non‐DEGs. Note that mutations to the right half‐site of ER1a (CACtoTTA/ATA) decreased the enrichment. (E–G) AtMYB2 specifically binds to the promoter of AT2G16760 and activates its transcription. (E) The coverage tracks of RNA‐seq and DAP‐seq near the ER1a target AT2G16760. The sequence of the AtMYB2 CRE in the DAP‐seq peak (gray) is aligned with ER1a for comparison. (F) Expressions of both AtMYB2 and AT2G16760 are induced after ABA treatment of Arabidopsis seedlings (data from Expression Angler [59]). (G) Reporter assay comparing the activity of ER1a consensus to the AtMYB2 CRE sequence of AT2G16760. (H) The biological roles of AtMYB2. The GO enrichment of upregulated DEGs by AtMYB2 overexpression. Arrows indicate the GO terms related to ABA responses and metabolic pathways. (I) AtMYB2 binding is compatible with nucleosome occupancy. Nucleosome occupancy before and after overexpression of AtMYB2 in Arabidopsis protoplasts was measured by MNase‐seq. AtMYB2 has little effect on nucleosome occupancy before (gray line) and after (red line) overexpression near its binding sites altogether (DAP‐seq peaks) and its CREs of most individual motifs (Ma CREs and ER1a CREs). However, the DEGs upregulated by AtMYB2 overexpression show increased nucleosome occupancy near their TTSs (red line, Genes).

To experimentally validate whether ER0s and ER1s of AtMYB2 regulate transcription, we performed reporter assays using 4 × tandem repeats of these homodimeric motifs in Arabidopsis protoplasts (Figure 5B). In the presence of ectopic expression of AtMYB2, ER0s and ER1s can drive reporter gene expression to an extent higher than the constitutive 35S promoter and comparable to other high‐affinity dimeric modes (ER8, DR4, and IR0/‐1; Figure S6A). The activities of ER0/1s are also higher than two monomeric motifs (2 × Ma; Figure S6A). Mutating the right half‐site of ER1a decreases the promoter activity (Figure 5B); thus, it is the modified specificity of ER1a that allows dimeric binding of AtMYB2 and results in the observed high transcriptional regulatory activity. Consistent with the fact that AtMYB57 and 79 do not bind to ER0/1s (Figures 3A and 4F), when AtMYB57/79 is expressed, the transactivating capability of ER0a/ER1a is comparable or lower than two monomeric motifs (2 × Ma; Figure S6A), and much weaker than IR0, whereby half‐sites are with less specificity change. Therefore, the ER0/1s CREs are specifically functional for AtMYB2.

Next, we asked whether AtMYB2 can specifically activate downstream genes targeted by ER0s and ER1s. To this end, protoplasts from Arabidopsis leaves were measured by RNA‐seq before and after the ectopic overexpression (OE) of AtMYB2. In parallel, the OE of AtMYB57/79 was also performed for comparison. The OE has increased the transcript level of AtMYB2 by >100‐fold (Figure S6B) and resulted in 1349 differentially expressed genes (DEGs, |log2FoldChange | > 2, FDR < 0.05). We found that the upregulated genes significantly overlap with the DAP‐seq peaks of AtMYB2, whereas the downregulated genes do not (Figure 5C), supporting that AtMYB2 is a transcriptional activator rather than a repressor. Promoters of the upregulated genes have enriched 26 out of 27 AtMYB2 binding motifs identified in SELEX (Figure S6C), including the ER0s and ER1s (Figure 5D and S6D). Mutating the right half‐site of ER1a decreased the enrichment (CACtoTTA and CACtoATA; Figure 5A), suggesting that the changed specificity of ER1a has contributed to transcriptional activation. Multiple genes are upregulated by AtMYB2 through the ER1a CREs (Figure 5E and Figure S6E,F). For example, the most significantly upregulated ER1a target is AT2G16760 (Figure S6E), which encodes a calcium‐dependent phosphotriesterase superfamily protein. Accordingly, AtMYB2, but not AtMYB57/79, binds to the promoter of AT2G16760 and activates its transcription after overexpression (Figure 5E). Consistent with AtMYB2's role as an ABA‐responsive factor, the expression level of AT2G16760 also increases after ABA treatment (Figure 5F). However, the CRE targeting AT2G16760 is not a perfect ER1a site and harbors mismatches to the ER1a consensus (alignment in Figure 5E). We substituted all four mismatches back and observed an elevated expression in the reporter assay (Figure 5G), confirming the highest affinity and activity of ER1a consensus. Compared to DAP‐seq, the enrichment of AtMYB2 motifs in SELEX better correlates with the ability of these motifs to activate transcription (Figure S6G), potentially because DAP‐seq enrichment is additionally affected by the nonrandom genomic background aside from the affinity. Analyses of the DAP‐seq data of AtMYB2/57/79 have shown that ER1a CREs are specifically bound by AtMYB2 but not by AtMYB57/79 (Figure 4H); here, with the expression profiles, we further validated that genes harboring ER0/1 CREs in the promoter are specifically upregulated by OE of AtMYB2 but not OE of AtMYB57/79 (Figure S6H).

To explore the biological roles of AtMYB2, gene ontology (GO) analysis was performed for genes upregulated by AtMYB2 overexpression (Figure 5H). In agreement with the reported functions of AtMYB2 and MYB‐family TFs [34, 60, 61], biological processes of “abscisic acid‐activated signaling pathway” and those related to metabolism (arrows; Figure 5H) were enriched with the most significant p‐values. The genes upregulated by different dimeric modes have also enriched different GO terms (Figure S7A); the GO terms associated with ER0/1s could have contributed to the unique biological roles of AtMYB2.

In general, TF‐DNA bindings are incompatible with nucleosomes [45]. Upon binding to cognate CREs, TFs tend to evict or remodel nucleosomes and increase the nearby chromatin accessibility [62, 63]. However, when we measure the nucleosome occupancy by MNase‐seq for AtMYB2 binding sites, we found that AtMYB2 tends to bind nucleosomal DNA (DAP‐seq peaks; Figure 5I) and that OE of AtMYB2 has little effect on nucleosome occupancy near the AtMYB2 CREs. This is true both for the monomeric AtMYB2 CREs (Ma CREs; Figure 5I) and for the dimeric CREs that feature either the unchanged (ER8 CREs; Figure S7B) or changed specificities (ER1a CREs; Figure 5I). It is yet noteworthy that the genes upregulated by AtMYB2 OE have enriched nucleosomes at the end of the transcribing regions (Genes; Figure 5I). This enrichment of nucleosome is not observed for the downregulated genes, for the upregulated genes before AtMYB2 expression (Genes; Figure 5I), and for the non‐DEG genes (Figure S7C). Collectively, these observations suggest that DNA‐binding of AtMYB2 does not seem to interfere with nucleosome occupancy and that AtMYB2 is a potential “pioneer TF.” This is also supported by the observed short monomeric motifs of AtMYBs (Figure 1G), which may enable the shallow binding [63, 64] that minimizes the steric hindrance when TFs bind to nucleosomal DNA.

The modified specificities of AtMYB2 are conserved in evolution

The existence of purifying selection, as indicated by a high level of sequence conservation, in general, is a reliable sign of functional CREs [65, 66]. We reasoned that if the homodimeric AtMYB2 CREs with modified specificities are physiologically functional, they should also be conserved during evolution. Therefore, we inspected the distribution of PhastCons scores (across 63 angiosperms [67]) around ER0/ER1 CREs located in the promoter region (Figure 6). As a positive control, we first validated that CREs of the monomeric mode (Ma) are more conserved than their flanks (Figure 6A). After that, we also interrogated the dimeric CREs with changed specificities and found that they also colocalized with peaks of conservation scores (Figure 6B,C). Notably, the widths of the conservation peaks are in accord with the width of the homodimeric CRE motifs. To further validate if the modified specificities in these motifs are functional, as an example, we mutated the CAC positions of the ER1a mode and visualized the nearby PhastCons scores for the putative CREs (Figure 6B). Accordingly, we found that the conservation peaks narrowed to span only the left half‐site, a width resembling the peak of the monomeric motif (Figure 6A). Taken together, the evolutional evidence further supports that the homodimeric motifs are not biochemical artifacts observed in SELEX, but instead, they represent functional CREs of gene expression.

Figure 6.

Figure 6

The modified specificities of AtMYB2 are conserved in evolution. (A–C) The conservation scores around AtMYB2 CREs. PhastCons scores (across 63 angiosperms) near promoter CREs corresponding to the AtMYB2 motifs Ma (A), ER1a (B), and ER0s and ER1b (C). Note that peak widths of the conservation scores agree with the widths of the SELEX motifs of AtMYB2 and that mutating the right half‐site of ER1a narrows its conservation peaks (CACtoTTA/ATA in B). (D) Homodimerization helps distinguish closely related TFs. CCWAA‐box VIII MYBs share highly similar monomeric motifs. However, upon homodimerization, their unique dimeric specificities and dimeric configurations (spacing and relative orientation) help discriminate between these AtMYBs, defining unique CREs of AtMYB2.

DISCUSSION

More than 90% of extant angiosperms retain evidence of one or more ancient genome‐wide duplications [68]. How TFs diversify in their DNA‐binding specificity and regulatory roles after gene duplications is of primary importance for understanding the functional complexity and environmental plasticity that emerged during plant evolution. The remarkable expansion of MYB family TFs accompanying plant terrestrialization has led to a large number of family members and a great intrafamily specificity divergence [69]. To understand how closely related members of AtMYBs recognize distinguished cis‐regulatory codes, this work comprehensively profiled the DNA‐binding specificity of subfamily VIII R2R3‐AtMYBs and found that homodimerization allows AtMYB2 to recognize a set of cis‐regulatory codes that are not bound by other AtMYBs (Figure 3A,C), which in turn enables AtMYB2 to bind (Figure 4F,H) and activate (Figure 5E and Figure S6F,H) specific target genes.

This finding offers an alternative solution to the “specificity paradox” faced by eukaryotic TFs. Prior studies have already suggested multiple solutions to resolve TFs with a similar monomeric motif, including (a) diversity in DNA shape preferences [18, 19, 51]; (b) combinatorial binding with different partners [70, 71, 72, 73, 74]; (c) multiple DNA‐binding specificities [75, 76]; (d) latent specificities induced by co‐factors [77, 78]; (e) DNA modifications [12, 13]; (f) binding to weak but specific sites in transcriptional hubs whereby the TF concentration is elevated [11]; (g) the configurations of TF homodimers [23, 25, 79]. In addition to these, here we show that homodimerization can also directly change the specificity of TFs and establish their unique DNA‐recognition selectivity. We therefore propose a dual‐role model through which homodimerization assists in discriminating between similar TFs (Figure 6D). This model also provides insights into gene editing—if homodimeric CREs with specificity changes are targeted, the editing will affect transcriptional regulation by only one of several paralogous TFs, thus improving the accuracy.

Subfamily VIII R2R3‐AtMYBs form a monophyletic clade that potentially originates from the same ancestral gene through duplications [30]. This study suggests that multiple mechanisms have cooperated to guide paralogous VIII AtMYBs to different cis‐regulatory codes. First, groups can be defined according to the identity of their primary motifs (Figure 1E). Next, within each group where the primary motifs are similar, the AtMYBs are further distinguished by a) more subtle monomeric specificity differences, such as the observed differences with the secondary/tertiary motifs (Figure 1E), and by b) the homodimeric binding preferences (Figure 6D), including the unique dimeric configurations (Figure 2A and Figure S3A) and the TF‐specific modifications of binding specificity upon homodimerization (Figure 3). Different from the compulsory dimeric TFs (e.g., bZIP, bHLH, MADS, and LFY), which bind dimeric sites with restricted orientations and short spacings, AtMYBs are capable of binding dimeric sites with fewer constraints. Such abundant homodimeric configurations of VIII AtMYBs also facilitated the discrimination between them (Figure 2A and Figure S3A).

The specificity profiles generated in this study also supplemented the previous DAP‐seq datasets [13] to illustrate the biochemical sequence preferences of AtMYBs, especially for their dimeric sequence preferences. From the data, we identified that the CCWAA‐box VIII AtMYBs have a distinct specificity and a high selectivity. Accordingly, these AtMYBs also share common structural features (Figure 1) and partially overlap regarding the physiological processes they regulate, such as stamen development [37, 80] and responses to the plant hormone ABA [34, 81]. Moreover, the generated data also inspire the rational design of CREs to drive tissue‐specific gene expression by VIII MYBs. The monomeric motif models generated for 16 VIII MYBs were not reported before. The motifs suggest that the previously identified consensus binding sequences were not optimal. For example, AtMYB2 was reported to bind A/TAACCA, C/TAACG/TG [34], TAACTG [80, 82, 83], and TAACGG [80]; in contrast, the generated motif models suggest that ACCTAAY has the highest affinity and should drive gene expression with the highest efficiency. In addition, the complex SELEX libraries also illustrate the dimeric preferences of VIII MYBs and provide a rich resource of dimeric motif models (Data S2). This facilitates the design of even stronger promoters and enhancers by offering the combinatorial rules of AtMYB CREs.

Altogether, through profiling the intact specificity landscape of AtMYBs, this work unraveled the intricacies and multiple sources of DNA‐binding specificities for closely related paralogous TFs beyond their seemingly analogous monomeric motifs and illustrated the versatile roles of homodimerization in establishing the unique identity of TFs and in enriching the complexity of the cis‐regulatory circuits.

CONCLUSION

We investigated the specificity landscape of 40 R2R3‐MYBs of subfamily VIII using high‐throughput SELEX, curated 833 motif models and identified a cluster of “CCWAA‐box” AtMYBs that features a distinct specificity and a high selectivity. Moreover, with AtMYBs as an example, we demonstrated that homodimerization can evoke specificity changes—a yet unreported mechanism—to help distinguish between similar paralogous TFs. AtMYB2 has relied on this mechanism and developed a dimeric specificity that is distinct from other closely related “CCWAA‐box” AtMYBs and unique among all AtMYBs. The unique homodimeric cis‐regulatory codes of AtMYB2 are biochemically functional, conserved in evolution, and capable of activating transcription. Based on the facts, we proposed a dual‐role model through which homodimerization assists in discriminating between similar TFs.

METHODS

Cloning and protein expression

The full‐length CDS sequences of AtMYBs, the DBD sequences of AtMYB2/MA0073s0038 (M. polymorpha)/MN199019 (S. moellendorffii), and the progressive truncations of AtMYB2 (Table S1) were inserted into the pIX‐Halo with the In‐Fusion Kit (Vazyme; C112). Subsequently, the proteins with a Halo tag were expressed using the TnT® SP6 High‐Yield Wheat Germ Protein Expression System (Promega, L3261). For each AtMYB protein, 3 µg plasmids were used in a 50 µL reaction and incubated for 2 h at 25°C; the expressed proteins were then used for the subsequent experiments.

High‐throughput SELEX and data pre‐processing

High‐throughput SELEX (HT‐SELEX) was performed as previously described [23]. The input SELEX library was designed to include a central random region of 101‐bp and flanking regions that were compatible with Illumina sequencing (TruSeq adapter sequences). Two microliters fusion proteins were incubated with 6 µL of SELEX library (30 ng/µL) and 30 µL of TCAPT buffer (140 mM KCl, 5 mM NaCl, 1 mM MgCl2, 3 µM ZnSO4, 100 µM EGTA, 10 mM Tris, pH 8, 0.1% Tween) at 25°C for 1 h. The DNA‐protein complexes were then enriched with Magne Halo‐Tag Beads (Promega, G7282), and the unbound DNA ligands were washed away with the HydroSpeed plate washer (Tecan, 30190101). The bound DNA ligands were then amplified by PCR with the amplification primers (Table S1) and used as the input library for the next cycle. After 3–5 cycles, the enriched DNA ligands and the input ligands were amplified with the PE primers (Table S1) and Illumina sequenced. Raw sequences from Illumina NovaSeq. 6000 were demultiplexed according to the i7 indices of each sample. Fastp [84] was used in pre‐processing, the low‐quality reads, and PCR duplicates were removed, the adaptors were trimmed, and then the paired‐end reads were merged and segmented into 40‐bp sequences for motif discovery.

Data analyses of HT‐SELEX

Autoseed was used to de novo discover motifs from the pre‐processed SELEX reads with parameters “‐40N <Background sequence> <Signal sequence> 1 8 10 0.35–50 100” [85]. Autoseed constructs motifs based on the seed consensus and its neighboring kmers with Hamming distance ≤ 1, and therefore separates closely related models (Figure S1E). The motifs obtained from Autoseed were further filtered to remove duplicated motifs, single nucleotide runs, and motifs with total ICs <1. The spacing distance between two monomeric sites is defined by the distance between the two representative kmers [70]. For example, the spacing between two AtMYB2 monomers is defined by the distance intermediate of the two “RCCWAAY” sequences. The strength of TF signals in SELEX libraries was measured by E‐MI (enriched‐sequence‐based mutual information), which collectively captures all possible binding events of TFs [45]. The IC of the motifs was calculated by summing the IC values over all positions of a motif. Mean IC (per‐base IC) was calculated by dividing the IC of the motif by its length. The enrichment and frequency of all motifs in the SELEX library were evaluated by motif‐matching analyses. To compare the identified AtMYB motifs with previously reported plant motifs, all TF motifs in PlantPAN [6], PlantTFDB [48], and HOMER plants [49] were collected as the reference set. Classification and naming of MYB TFs were in accord with PlantTFDB [48]. Sequence alignment and phylogenetic analysis for MYB DBDs were performed respectively by mafft [86] and iqtree2 [87]. To calculate motif enrichments, motif matching analysis was performed using motifmatchr [88]. DNA shape analysis was performed using DNAshapeR [89]. ComplexHeatmap is used for heatmap visualization [90].

The dimeric preference of AtMYBs includes both the relative monomer orientation and spacing. The preference was analyzed by counting occurrences of ACCWAMY (a degenerated sequence representing the specificity of most AtMYBs) concatenated with different relative orientations and spacings. AlphaFold3 [91] was employed to build the structure models of AtMYB2‐DNA, using full‐length and DBD sequences of AtMYB2 and DNA sequences according to the consensus of each dimeric mode. Visualization of the structural models was achieved with PyMOL. The DNA contacts by AtMYB were visualized with DNAproDB [47]. To generate the in silico SELEX library, we first generated 107 random sequences with a 101‐bp length. Then, the ER1 motif of AtMYB49 was matched to each random sequence with a sliding window, and the highest matching score among all windows was taken as the score of the sequence. Subsequently, the sequences with a score greater than 10 were collected as the in silico‐enriched SELEX library.

Electrophoretic mobility shift assays (EMSA)

Double‐stranded DNA probes harboring the consensuses of Ma, ER1a, and ER8 of AtMYB2 were synthesized by annealing oligonucleotides and diluted to 10 ng/µL (DNA sequences in Table S2). Because the amount of protein expressed with the wheat germ system (Promega; L3261) is limited, here the AtMYB2 CDS sequence was inserted into pETG20A‐His and transformed into Rosetta 2 (DE3) pLysS strains [92]. The transformed E. coli was grown in LB medium (with auto‐induced ZYP5052 medium [23]) at 37°C for 4 h and 17°C for 14 h. The bacteria were then collected and lysed by lysis buffer (0.5 mg/mL lysozyme, 2 mg/mL DNase I, 1 mM PMSF), then incubated with His‐tag Ni Sepharose (Sangon, C600332) at 25°C for 2 h, and subsequently washed using Buffer A (50 mM Tris, 300 mM NaCl, 10 mM imidazole, pH 7.5) and Buffer B (Buffer A but with 50 mM imidazole) to remove nonspecifically bound proteins. Elution of the His‐tagged AtMYB2 protein was achieved using Buffer C (Buffer A with 500 mM imidazole). The purified AtMYB2 protein was concentrated using a 10 kD ultra‐centrifugal filter (Sigma; UFC8010). For EMSA, we prepared the binding buffer (10 mM Tris pH 7.5, 50 mM KCl) according to the commercial recipe (ThermoFisher, 20148) (Figure S1D). Subsequently, the AtMYB2 protein was diluted into 18, 24, 36, and 48 ng/µL gradient, and incubated with 10 ng DNA probes in a 20 µL reaction at 25°C for 1 h. The reaction mixture was loaded onto a 6% native PAGE and run in 0.5 × TBE at 4°C (100 min, 100 V). The gel was stained with gel blue (UElandy; S2019L) and imagined with a Bio‐Rad scanner (Bio‐Rad; 1708195EDU).

DAP‐seq and data pre‐processing

DAP‐seq is performed as described [93, 94]. Briefly, genomic DNA was extracted from 4 weeks of Arabidopsis thaliana leaves and sonicated into 200‐bp fragments by the Covaris S2 sonicator, then purified using DNA clean beads (Vazyme; N411) at a 1.5:1 bead: DNA ratio. Adapters were then ligated to the genomic DNA fragments using the ligation toolkit (Vazyme; N203, N204) to generate the input library of DAP‐seq. Subsequently, 12 µL of Halo‐tagged AtMYB2/57/79 protein and 200 ng of the input library were incubated within 30 µL TCAPT buffer for 1 h. DNA‐protein complexes were pulled down with Magne Halo‐Tag Beads (Promega; G7282) and washed with the HydroSpeed plate washer (Tecan; 30190101). The DNA ligands bound to the AtMYB2/57/79 were amplified by PCR (using amplification primers in Table S1) and sequenced on an Illumina NovaSeq. 6000. Two independent biological replicates were included. Raw reads were demultiplexed, removed for low‐quality and adapter sequences, and then aligned using BWA v.0.7.17 [95] to the TAIR10 genome. The Araport11 files were used to obtain annotations of the genome.

Data analyses of DAP‐seq

Peak calling was carried out using MACS3 v.3.0.0a7 [96] with parameters “‐f BAMPE ‐‐d‐min 5 ‐‐min‐length 15 ‐‐call‐summits ‐‐keep‐dup all ‐‐cutoff‐analysis.” The motif hits in DAP‐seq libraries were identified by motifmatchr [88] with the p‐value set to 1e‐5. To calculate enrichments, the motif hits in shuffled DAP‐seq peaks were used as the background. A PR analysis was conducted by utilizing motifmatchr to score the 200‐bp genomic fragments. The fragments overlapping with the top 3000 DAP‐seq peaks were used as the positive set, while the remaining fragments served as the negative set. The highest motif‐matching score across the 200‐bp region is taken as the score of a segment. By varying the threshold of the binary classifier, the PR curves were generated, and the AUPRC values were calculated. Analyses for the dimeric configurations in DAP‐seq libraries were performed the same as described for the SELEX libraries. To obtain less biased motifs, and to enable comparison with SELEX, here we also used Autoseed (with parameters “‐40N <Background sequence> <Signal sequence> 1 8 10 0.35 ‐ 50 100”) to discover motifs for DAP‐seq libraries.

Overexpression of AtMYBs in Arabidopsis protoplasts

The full‐length CDS sequence of AtMYB2/57/79 was inserted into the pHBT‐GFP‐NOS, which was then transferred into Arabidopsis (Col‐0 ecotype) protoplasts to overexpress the AtMYBs protein. Protoplasts were isolated from leaves of 4‐week‐old Arabidopsis and transfected [97]. The isolated protoplasts were resuspended in the MMG solution (0.4 M mannitol, 15 mM MgCl2, 4 mM MES, pH 5.7) to a concentration of 2 × 105 cells/mL. Transfection was performed by combining 1 mL protoplasts with 100 µL plasmid (1 µg/µL) and 1.1 mL PEG solution (40% PEG4000, 0.2 M mannitol, 0.1 M CaCl2). After 8 min, transfection was stopped by adding 4.4 mL of W5 solution (154 mM NaCl, 125 mM CaCl2, 5 mM KCl, and 2 mM MES, pH 5.7). The transfected protoplasts were incubated for 14 h at room temperature to allow protein expression, followed by centrifuging to collect 5 × 106 cells as the AtMYB2 overexpression sample for RNA‐seq and MNase‐seq.

RNA‐seq and data analyses

5 × 106 cells of AtMYB‐overexpressed protoplasts were collected for RNA‐seq. RNA isolation, library construction, and sequencing were performed by the Novogene Company. The obtained RNA‐seq raw reads were first removed for adapter sequences, and low‐quality reads by Trimmomatic v.0.39 [98]. The filtering parameters are “ILLUMINACLIP:TruSeq. 3‐PE‐2.fa:2:30:7:1 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:13 MINLEN:50.” Then, the clean reads were mapped to the TAIR10 genome using Hisat2 v.2.2.1 [99]. The mapped reads from each RNA‐seq library were next assembled into transcriptome by Stringtie v.1.3.4 with the parameter “‐G Araport11.gtf” and removed for redundancy by CD‐HIT v.4.7. Then the genomic sequences of the assembled transcriptome were extracted into a fasta file, and indexed with salmon v.0.13.1 [100]. With the index, the expression of genes was calculated using the fastq files generated by Trimmomatic, and normalized to TPM (transcripts per million reads) with salmon. Subsequently, genes with false discovery rate (FDR) < 0.05 and |log2fold change| > 2 were defined as differentially expressed genes using DESeq. 2. GO analyses were then carried out with GO information from org.At.tair.db.

Conservation and TSS‐enrichment analyses of AtMYB2 CREs

To visualize the distribution of AtMYB2 CREs near the TSSs of the A. thaliana genome, MYB2 motifs were used to match the genomic sequences using motifmatchr [88] with the p‐value set to 1e‐4. To eliminate potential biases from the nucleotide composition of the motifs, the distributions of reversed AtMYB2 CREs (matches of the reversed motifs) were subtracted from the distribution of AtMYB2 CREs. The resulting distribution profiles were further smoothed with a 200‐bp window. For conservation analysis, the genome‐wide PhastCons scores of Arabidopsis calculated across 63 representative plants were downloaded from the PlantRegMap [67] database. AtMYB2 CREs (motif hits) in the promoters (0 to −1 kb from TSSs) were included in the conservation analysis, and the average conservation scores around the aligned AtMYB2 CREs were calculated for each position and visualized.

Dual‐luciferase reporter assay

The CDS sequences of AtMYBs were inserted into the pHBT‐GFP‐NOS vector for expression of AtMYB2/57/79. The reporter vector pGreenII 0800‐LUC was constructed [101] to include four repeats of the consensus of the AtMYB2 binding motifs from SELEX, followed by a mini35S promoter (DNA sequences in Table S2). The plasmids were extracted with the GoldHi EndoFree Plasmid Maxi Kit (CWBIO, CW2104M) and transfected into protoplasts as described above. The protoplasts were placed in the dark for 12 h at room temperature. Next, the luminescence strengths were measured using a reporter assay kit (Yeasen; 11402ES60) and a microplate reader (Tecan, Spark). The ratio of Fluc to Rluc was calculated to represent promoter activity.

MNase‐seq and data analyses

MNase‐seq follows the described procedure [102], 5 × 106 cells of Arabidopsis protoplasts were collected for MNase‐seq. 1 ml of chilled RIPA buffer (10 mM Tris–HCl, pH 8.0, 140 mM NaCl, 1 mM EDTA, 1% Triton X‐100, 0.1% SDS, 0.1% sodium deoxycholate, 0.25 M sucrose, 0.1% beta‐ME, 1 mM PMSF) was added. After a thorough vortex, the mixture is centrifuged at 800 g for 2 min to remove the supernatant. The washing process is repeated 3–5 times. The final pellet containing nuclei was resuspended by 200 µL MNase buffer (0.3 M sucrose, 20 mM Tris‐HCl, pH 7.5, 3 mM CaCl2). The Micrococcal Nuclease (New England Biolabs, M0247S) was diluted 128 times with the MNase buffer, and 3 µL of the diluted MNase was added into 200 µL resuspension of the nuclei, incubated at 37°C and 1500 rpm for 8 min on a mixer (Miulab; MTH‐100). The digestion was terminated by adjusting the EDTA concentration to 20 mM. The digested DNA fragments were recovered using DNA purification beads (Vazyme, N411), and subjected to Illumina library preparation. MNase‐seq raw reads were removed for low‐quality sequences and adapter sequences using Trim Galore with parameters “‐q 30 ‐‐paired ‐‐stringency 5 ‐‐fastqc ‐‐gzip”. The BWA aligner [95] was used to align the reads to the TAIR10 genome with the quality threshold set to 20. The alignments are then converted to bam format, sorted and removed for duplicates, indexed, and visualized.

Statistical analyses

The statistical analyses were performed in R. The function wilcox.test was used to apply the Wilcoxon signed‐rank test. Fisher's exact test was performed with the fisher.test function. T‐tests were performed with the t.test function.

AUTHOR CONTRIBUTIONS

Tian Li: Writing—review and editing; data curation; investigation; validation; formal analysis; visualization. Hao Chen: Writing—review and editing; data curation; investigation; validation; formal analysis; visualization. Nana Ma: Data curation. Dingkun Jiang: Data curation. Jiacheng Wu: Data curation. Xinfeng Zhang: Data curation. Hao Li: Data curation. Jiaqing Su: Data curation. Piaojuan Chen: Data curation. Qing Liu: Data curation. Yuefeng Guan: Project administration. Xiaoyue Zhu: Project administration. Juncheng Lin: Project administration. Jilin Zhang: Project administration. Qin Wang: Project administration. Honghong Guo: Writing—review and editing; formal analysis; visualization; funding acquisition. Fangjie Zhu: Writing—review and editing; investigation; validation; formal analysis; funding acquisition.

CONFLICT OF INTEREST STATEMENT

The authors declare no conflicts of interest.

ETHICS STATEMENT

No animals or humans were involved in this study.

Supporting information

Figure S1: Closely related R2R3‐AtMYBs bind highly similar sequences.

Figure S2: HT‐SELEX enriches AtMYBs binding signals.

Figure S3: Modified specificities of closely spaced AtMYBs homodimers.

Figure S4: Enrichment of AtMYB2 binding modes in MYBs of other plants.

Figure S5: AtMYB2 recognizes unique targets with modified specificity.

Figure S6: The homodimeric binding modes of AtMYB2 activate transcription.

Figure S7: Transcriptional regulation by AtMYB2.

Data S1: De novo motifs from published DAP‐seq libraries of VIII AtMYBs.

Data S2: Monomeric and homodimeric motifs of VIII R2R3‐AtMYBs.

IMT2-4-e70009-s002.docx (17.4MB, docx)

Table S1: Sequence information for DNA ligands.

Table S2: Sequence information for dual‐luciferase reporter assay and EMSA.

Table S3: Orthologs of AtMYB2.

Table S4: MYB2 motif models from SELEX library.

Table S5: MYB57 motif models from SELEX library.

Table S6: MYB79 motif models from SELEX library.

Table S7: MYB90 motif models from SELEX library.

Table S8: MYB19 motif models from SELEX library.

Table S9: MYB46 motif models from SELEX library.

Table S10: MYB50 motif models from SELEX library.

Table S11: MYB61 motif models from SELEX library.

Table S12: MYB67 motif models from SELEX library.

Table S13: MYB83 motif models from SELEX library.

Table S14: MYB80 motif models from SELEX library.

Table S15: MYB49 motif models from SELEX library.

Table S16: MYB107 motif models from SELEX library.

Table S17: MYB63 motif models from SELEX library.

Table S18: MYB74 motif models from SELEX library.

Table S19: MYB13 motif models from SELEX library.

Table S20: MYB58 motif models from SELEX library.

Table S21: MYB10 motif models from SELEX library.

Table S22: MYB30 motif models from SELEX library.

Table S23: MYB30 motif models from SELEX library.

Table S24: MYB121 motif models from SELEX library.

Table S25: MYB27 motif models from SELEX library.

Table S26: MYB62 motif models from SELEX library.

Table S27: MYB108 motif models from SELEX library.

Table S28: MYB21 motif models from SELEX library.

Table S29: MYB71 motif models from SELEX library.

Table S30: MYB66 motif models from SELEX library.

Table S31: MYB14 motif models from SELEX library.

Table S32: MYB72 motif models from SELEX library.

Table S33: MYB114 motif models from SELEX library.

Table S34: MYB41 motif models from SELEX library.

Table S35: MYB34 motif models from SELEX library.

Table S36: MYB85 motif models from SELEX library.

Table S37: MYB51 motif models from SELEX library.

Table S38: MYB99 motif models from SELEX library.

Table S39: MYB15 motif models from SELEX library.

Table S40: MYB55 motif models from SELEX library.

Table S41: MYB86 motif models from SELEX library.

IMT2-4-e70009-s001.xlsx (386.9KB, xlsx)

ACKNOWLEDGMENTS

Fangjie Zhu was funded by the National Key Research and Development Program of China (2024YFC3407200), the National Natural Science Foundation of China (32370582, 32170554), “Chu Ying” young talent project of Fujian, Major Special Project of Fujian (2021NZ029009), Funding for High‐level Oversea Personnel (Ministry of Human Resources and Social Security). Honghong Guo was funded by the Fujian Agriculture and Forestry University (KFXH23027). We thank Dr. Hui Li from the Beijing Forestry University (College of Biological Sciences and Biotechnology) for offering seeds of Arabidopsis myb2 mutant. We thank the Functional Chemical Genomics Core Facility of Fujian Agriculture and Forestry University Metabolomics Center for automation assistance. We also thank Huimin Xu and Shuang Wu from Fujian Agriculture and Forestry University (College of Horticulture) for their assistance in data collection. We thank the Functional Chemical Genomics Core Facility of Fujian Agriculture and Forestry University Metabolomics Center for automation assistance.

Li, Tian , Chen Hao, Ma Nana, Jiang Dingkun, Wu Jiacheng, Zhang Xinfeng, Li Hao, et al. 2025. “Specificity Landscapes of 40 R2R3‐MYBs Reveal How Paralogs Target Different cis‐Elements by Homodimeric Binding.” iMeta 4, e70009. 10.1002/imt2.70009

Tian Li, Hao Chen, and Nana Ma contributed equally to this study.

Contributor Information

Qin Wang, Email: qinwangCRY@163.com.

Honghong Guo, Email: guohh@fafu.edu.cn.

Fangjie Zhu, Email: fjzhu@fafu.edu.cn.

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are openly available in the China National Genomics Data Center at https://ngdc.cncb.ac.cn/gsa/browse/CRA014337. All sequencing data have been deposited to China National Genomics Data Center under accession PRJCA022556, with raw sequence data under GSA: CRA014337 (https://ngdc.cncb.ac.cn/gsa/browse/CRA014337). Data link for UCSC Genome Browser session: https://genome.ucsc.edu/s/CH_1997/MYB2_project. The scripts have been deposited on GitHub (https://github.com/Daven-1997/2025_R2R3_MYBs_code). Supplementary materials (figures, tables, graphical abstract, slides, videos, Chinese translated version, and updated materials) may be found in the online DOI or iMeta Science http://www.imeta.science/.

REFERENCES

  • 1. Wunderlich, Zeba , and Mirny Leonid A.. 2009. “Different Gene Regulation Strategies Revealed by Analysis of Binding Motifs.” Trends in Genetics 25: 434–440. 10.1016/j.tig.2009.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Eckweiler, Denitsa , Dudek Christian‐Alexander, Hartlich Juliane, Brötje David, and Jahn Dieter. 2018. “PRODORIC2: The Bacterial Gene Regulation Database in 2018.” Nucleic Acids Research 46: D320–D326. 10.1093/nar/gkx1091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Wang, Tingting , Sun Wenju, Fan Ligang, Hua Canfeng, Wu Nan, Fan Shaorong, Zhang Jilin, Deng Xin, and Yan Jian. 2021. “An Atlas of the Binding Specificities of Transcription Factors in Pseudomonas aeruginosa Directs Prediction of Novel Regulators in Virulence.” eLife 10: e61885. 10.7554/eLife.61885 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Fan, Ligang , Wang Tingting, Hua Canfeng, Sun Wenju, Li Xiaoyu, Grunwald Lucas, Liu Jingui, et al. 2020. “A Compendium of DNA‐Binding Specificities of Transcription Factors in Pseudomonas syringae .” Nature Communications 11: 4947. 10.1038/s41467-020-18744-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Bryne, J. C. , Valen E., Tang M.‐H. E., Marstrand T., Winther O., da Piedade I., Krogh A., Lenhard B., and Sandelin A.. 2007. “JASPAR, the Open Access Database of Transcription Factor‐Binding Profiles: New Content and Tools in the 2008 Update.” Nucleic Acids Research 36: D102–D106. 10.1093/nar/gkm955 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Chow, ChiNga , Lee TzongYi, Hung YuCheng, Li GuanZhen, Tseng KuanChieh, Liu YaHsin, Kuo PoLi, Zheng Han‐Qin, and Chang Wen‐Chi. 2019. “PlantPAN3.0: A New and Updated Resource for Reconstructing Transcriptional Regulatory Networks From ChIP‐seq Experiments in Plants.” Nucleic Acids Research 47: D1155–D1163. 10.1093/nar/gky1081 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Franco‐Zorrilla, José M. , López Vidriero Irene, Carrasco José L., Godoy Marta, Vera Pablo, and Solano Roberto. 2014. “DNA‐Binding Specificities of Plant Transcription Factors and Their Potential to Define Target Genes.” Proceedings of the National Academy of Sciences 111: 2367–2372. 10.1073/pnas.1316278111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Davidson, Eric H . 2010. “Emerging Properties of Animal Gene Regulatory Networks.” Nature 468: 911–920. 10.1038/nature09645 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Jones, D. Marc , and Vandepoele Klaas. 2020. “Identification and Evolution of Gene Regulatory Networks: Insights From Comparative Studies in Plants.” Current Opinion in Plant Biology 54: 42–48. 10.1016/j.pbi.2019.12.008 [DOI] [PubMed] [Google Scholar]
  • 10. Merabet, Samir , and Mann Richard S.. 2016. “To Be Specific or Not: The Critical Relationship Between Hox and TALE Proteins.” Trends in Genetics 32: 334–347. 10.1016/j.tig.2016.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kribelbauer, Judith F. , Rastogi Chaitanya, Bussemaker Harmen J., and Mann Richard S.. 2019. “Low‐Affinity Binding Sites and the Transcription Factor Specificity Paradox in Eukaryotes.” Annual Review of Cell and Developmental Biology 35: 9.1‐9.23. 10.1146/annurev-cellbio-100617-062719 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Yin, Yimeng , Morgunova Ekaterina, Jolma Arttu, Kaasinen Eevi, Sahu Biswajyoti, Khund‐Sayeed Syed, Das Pratyush K., et al. 2017. “Impact of Cytosine Methylation on DNA Binding Specificities of Human Transcription Factors.” Science 356: eaaj2239. 10.1126/science.aaj2239 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. O'Malley, Ronan C. , Huang Shao‐shan Carol, Song Liang, Lewsey Mathew G., Bartlett Anna, Nery Joseph R., Galli Mary, Gallavotti Andrea, and Ecker Joseph R.. 2016. “Cistrome and Epicistrome Features Shape the Regulatory DNA Landscape.” Cell 165: 1280–1292. 10.1016/j.cell.2016.04.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Neikes, Hannah K. , Kliza Katarzyna W., Gräwe Cathrin, Wester Roelof A., Jansen Pascal W. T. C., Lamers Lieke A., Baltissen Marijke P., et al. 2023. “Quantification of Absolute Transcription Factor Binding Affinities in the Native Chromatin Context Using BANC‐seq.” Nature Biotechnology 41: 1801–1809. 10.1038/s41587-023-01715-w [DOI] [PubMed] [Google Scholar]
  • 15. Abe, Namiko , Dror Iris, Yang Lin, Slattery Matthew, Zhou Tianyin, Bussemaker Harmen J., Rohs Remo, and Mann Richard S.. 2015. “Deconvolving the Recognition of DNA Shape from Sequence.” Cell 161: 307–318. 10.1016/j.cell.2015.02.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Rohs, Remo , West Sean M., Sosinsky Alona, Liu Peng, Mann Richard S., and Honig Barry. 2009. “The Role of DNA Shape in Protein‐DNA Recognition.” Nature 461: 1248–1253. 10.1038/nature08473 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Mathelier, Anthony , Xin Beibei, Chiu Tsu‐Pei, Yang Lin, Rohs Remo, and Wasserman Wyeth W.. 2016. “DNA Shape Features Improve Transcription Factor Binding Site Predictions In Vivo.” Cell Systems 3: 278–286.e4. 10.1016/j.cels.2016.07.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Sielemann, Janik , Wulf Donat, Schmidt Romy, and Bräutigam Andrea. 2021. “Local DNA Shape Is a General Principle of Transcription Factor Binding Specificity in Arabidopsis thaliana .” Nature Communications 12: 6549. 10.1038/s41467-021-26819-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Gordân, Raluca , Shen Ning, Dror Iris, Zhou Tianyin, Horton John, Rohs Remo, and Bulyk Martha L.. 2013. “Genomic Regions Flanking E‐Box Binding Sites Influence DNA Binding Specificity of bHLH Transcription Factors Through DNA Shape.” Cell Reports 3: 1093–1104. 10.1016/j.celrep.2013.03.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Li, Miaomiao , Yao Tao, Lin Wanru, Hinckley Will E., Galli Mary, Muchero Wellington, Gallavotti Andrea, Chen Jin‐Gui, and Huang Shao‐shan Carol. 2023. “Double DAP‐seq Uncovered Synergistic DNA Binding of Interacting bZIP Transcription Factors.” Nature Communications 14: 2600. 10.1038/s41467-023-38096-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Rodríguez‐Martínez, José A. , Reinke Aaron W., Bhimsaria Devesh, Keating Amy E., and Ansari Aseem Z.. 2017. “Combinatorial bZIP Dimers Display Complex DNA‐binding Specificity Landscapes.” eLife 6: e19272. 10.7554/eLife.19272 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Boer, D. Roeland , Freire‐Rios Alejandra, van den Berg Willy A. M., Saaki Terrens, Manfield Iain W., Kepinski Stefan, López Vidrieo Irene, et al. 2014. “Structural Basis for DNA Binding Specificity by the Auxin‐Dependent ARF Transcription Factors.” Cell 156: 577–589. 10.1016/j.cell.2013.12.027 [DOI] [PubMed] [Google Scholar]
  • 23. Jolma, Arttu , Yan Jian, Whitington Thomas, Toivonen Jarkko, Nitta Kazuhiro R., Rastas Pasi, Morgunova Ekaterina, et al. 2013. “DNA‐Binding Specificities of Human Transcription Factors.” Cell 152: 327–339. 10.1016/j.cell.2012.12.009 [DOI] [PubMed] [Google Scholar]
  • 24. Galli, Mary , Khakhar Arjun, Lu Zefu, Chen Zongliang, Sen Sidharth, Joshi Trupti, Nemhauser Jennifer L., Schmitz Robert J., and Gallavotti Andrea. 2018. “The DNA Binding Landscape of the Maize AUXIN RESPONSE FACTOR Family.” Nature Communications 9: 4526. 10.1038/s41467-018-06977-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Stigliani, Arnaud , Martin‐Arevalillo Raquel, Lucas Jérémy, Bessy Adrien, Vinos Poyo Thomas, Mironova Victoria, Vernoux Teva, Dumas Renaud, and Parcy François. 2019. “Capturing Auxin Response Factors Syntax Using DNA Binding Models.” Molecular Plant 12: 822–832. 10.1016/j.molp.2018.09.010 [DOI] [PubMed] [Google Scholar]
  • 26. Sayou, Camille , Monniaux Marie, Nanao Max H., Moyroud Edwige, Brockington Samuel F., Thévenon Emmanuel, Chahtane Hicham, et al. 2014. “A Promiscuous Intermediate Underlies the Evolution of LEAFY DNA Binding Specificity.” Science 343: 645–648. 10.1126/science.1248229 [DOI] [PubMed] [Google Scholar]
  • 27. Riechmann, J. L. , Heard J., Martin G., Reuber L., Jiang C.‐Z., Keddie J., Adam L., et al. 2000. “Arabidopsis Transcription Factors: Genome‐Wide Comparative Analysis Among Eukaryotes.” Science 290: 2105–2110. 10.1126/science.290.5499.2105 [DOI] [PubMed] [Google Scholar]
  • 28. Dubos, Christian , Stracke Ralf, Grotewold Erich, Weisshaar Bernd, Martin Cathie, and Lepiniec Loïc. 2010. “MYB Transcription Factors in Arabidopsis .” Trends in Plant Science 15: 573–581. 10.1016/j.tplants.2010.06.005 [DOI] [PubMed] [Google Scholar]
  • 29. Du, Hai , Liang Zhe, Zhao Sen, Nan Ming‐Ge, Tran Lam‐Son Phan, Lu Kun, et al. 2015. “The Evolutionary History of R2R3‐MYB Proteins Across 50 Eukaryotes: New Insights into Subfamily Classification and Expansion.” Scientific Reports 5: 11037. 10.1038/srep11037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Jiang, Chen‐Kun , and Rao Guang‐Yuan. 2020. “Insights Into the Diversification and Evolution of R2R3‐MYB Transcription Factors in Plants.” Plant Physiology 183: 637–655. 10.1104/pp.19.01082 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Mitreiter, Simon , and Gigolashvili Tamara. 2021. “Regulation of Glucosinolate Biosynthesis.” Journal of Experimental Botany 72: 70–91. 10.1093/jxb/eraa479 [DOI] [PubMed] [Google Scholar]
  • 32. Zhou, Jianli , Lee Chanhui, Zhong Ruiqin, and Ye Zheng‐Hua. 2009. “MYB58 and MYB63 Are Transcriptional Activators of the Lignin Biosynthetic Pathway During Secondary Cell Wall Formation in Arabidopsis .” The Plant Cell 21: 248–266. 10.1105/tpc.108.063321 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. McCarthy, Ryan L. , Zhong Ruiqin, and Ye Zheng‐Hua. 2009. “MYB83 Is a Direct Target of SND1 and Acts Redundantly With MYB46 in the Regulation of Secondary Cell Wall Biosynthesis in Arabidopsis .” Plant & Cell Physiology 50: 1950–1964. 10.1093/pcp/pcp139 [DOI] [PubMed] [Google Scholar]
  • 34. Abe, Hiroshi , Urao Takeshi, Ito Takuya, Seki Motoaki, Shinozaki Kazuo, and Yamaguchi‐Shinozaki Kazuko. 2003. “Arabidopsis AtMYC2 (bHLH) and AtMYB2 (MYB) Function as Transcriptional Activators in Abscisic Acid Signaling.” The Plant Cell 15: 63–78. 10.1105/tpc.006130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Liao, Chancan , Zheng Yuan, and Guo Yan. 2017. “MYB30 Transcription Factor Regulates Oxidative and Heat Stress Responses Through ANNEXIN‐mediated Cytosolic Calcium Signaling in Arabidopsis .” New Phytologist 216: 163–177. 10.1111/nph.14679 [DOI] [PubMed] [Google Scholar]
  • 36. Oshima, Yoshimi , Shikata Masahito, Koyama Tomotsugu, Ohtsubo Norihiro, Mitsuda Nobutaka, and Ohme‐Takagi Masaru. 2013. “MIXTA‐Like Transcription Factors and WAX INDUCER1/SHINE1 Coordinately Regulate Cuticle Development in Arabidopsis and Torenia fournieri .” The Plant Cell 25: 1609–1624. 10.1105/tpc.113.110783 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Cheng, Hui , Song Susheng, Xiao Langtao, Soo Hui Meng, Cheng Zhiwei, Xie Daoxin, and Peng Jinrong. 2009. “Gibberellin Acts Through Jasmonate to Control the Expression of MYB21, MYB24, and MYB57 to Promote Stamen Filament Growth in Arabidopsis .” PLoS Genetics 5: e1000440. 10.1371/journal.pgen.1000440 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Chang, Katherine Noelani , Zhong Shan, Weirauch Matthew T., Hon Gary, Pelizzola Mattia, Li Hai, Huang Shao‐shan Carol, et al. 2013. “Temporal Transcriptional Response to Ethylene Gas Drives Growth Hormone Cross‐Regulation in Arabidopsis .” eLife 2: e00675. 10.7554/eLife.00675 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Sullivan, Alessandra M. , Arsovski Andrej A., Thompson Agnieszka, Sandstrom Richard, Thurman Robert E., Neph Shane, Johnson Audra K., et al. 2019. “Mapping and Dynamics of Regulatory DNA in Maturing Arabidopsis thaliana Siliques.” Frontiers in Plant Science 10: 1434. 10.3389/fpls.2019.01434 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Weirauch, Matthew T. , Yang Ally, Albu Mihai, Cote Atina G., Montenegro‐Montero Alejandro, Drewe Philipp, Najafabadi Hamed S., et al. 2014. “Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity.” Cell 158: 1431–1443. 10.1016/j.cell.2014.08.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Wang, Baihui , Luo Qiang, Li Yingping, Du Kangxi, Wu Zhen, Li Tianyang, Shen Wen‐Hui, et al. 2022. “Structural Insights into Partner Selection for MYB and bHLH Transcription Factor Complexes.” Nature Plants 8: 1108–1117. 10.1038/s41477-022-01223-w [DOI] [PubMed] [Google Scholar]
  • 42. Song, Susheng , Qi Tiancong, Huang Huang, Ren Qingcuo, Wu Dewei, Chang Changqing, Peng Wen, et al. 2011. “The Jasmonate‐ZIM Domain Proteins Interact With the R2R3‐MYB Transcription Factors MYB21 and MYB24 to Affect Jasmonate‐Regulated Stamen Development in Arabidopsis .” The Plant Cell 23: 1000–1013. 10.1105/tpc.111.083089 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Kim, Daewon , Jeon Su Jeong, Yanders Samantha, Park Sung‑Chul, Kim Ho Soo, and Kim Sewon. 2022. “MYB3 Plays an Important Role in Lignin and Anthocyanin Biosynthesis Under Salt Stress Condition in Arabidopsis .” Plant Cell Reports 41: 1549–1560. 10.1007/s00299-022-02878-7 [DOI] [PubMed] [Google Scholar]
  • 44. Liang, Gang , He Hua, Li Yang, Ai Qin, and Yu Diqiu. 2014. “MYB82 Functions in Regulation of Trichome Development in Arabidopsis .” Journal of Experimental Botany 65: 3215–3223. 10.1093/jxb/eru179 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Zhu, Fangjie , Farnung Lucas, Kaasinen Eevi, Sahu Biswajyoti, Yin Yimeng, Wei Bei, Dodonova Svetlana O., et al. 2018. “The Interaction Landscape Between Transcription Factors and the Nucleosome.” Nature 562: 76–81. 10.1038/s41586-018-0549-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Wang, Baihui , Luo Qiang, Li Yingping, Yin Liufan, Zhou Nana, Li Xiangnan, and Gan Jianhua, et al. 2020. “Structural Insights Into Target DNA Recognition by R2R3‐MYB Transcription Factors.” Nucleic Acids Research 48: 460–471. 10.1093/nar/gkz1081 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Sagendorf, Jared M. , Markarian Nicholas, Berman Helen M., and Rohs Remo. 2020. “DNAproDB: An Expanded Database and Web‐Based Tool for Structural Analysis of DNA–protein Complexes.” Nucleic Acids Research 48: D277–D287. 10.1093/nar/gkz889 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Jin, Jinpu , Tian Feng, Yang DeChang, Meng YuQi, Kong Lei, Luo Jingchu, and Gao Ge. 2017. “PlantTFDB 4.0: Toward a Central Hub for Transcription Factors and Regulatory Interactions in Plants.” Nucleic Acids Research 45: D1040–D1045. 10.1093/nar/gkw982 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Heinz, Sven , Benner Christopher, Spann Nathanael, Bertolino Eric, Lin Yin C., Laslo Peter, Cheng Jason X., et al. 2010. “Simple Combinations of Lineage‐Determining Transcription Factors Prime Cis‐Regulatory Elements Required for Macrophage and B Cell Identities.” Molecular Cell 38: 576–589. 10.1016/j.molcel.2010.05.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Wen, Chenjin , Yuan Zhen, Zhang Xiaotian, Chen Hao, Luo Lin, Li Wanying, Li Tian, et al. 2023. “Sea‐ATI Unravels Novel Vocabularies of Plant Active Cistrome.” Nucleic Acids Research 51: 11568–11583. 10.1093/nar/gkad853 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Cooper, Brendon H. , Dantas Machado Ana Carolina, Gan Yan, Aparicio Oscar M., and Rohs Remo. 2023. “DNA Binding Specificity of All Four Saccharomyces cerevisiae Forkhead Transcription Factors.” Nucleic Acids Research 51: 5621–5633. 10.1093/nar/gkad372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Lai, Xuelei , Vega‐Léon Rosario, Hugouvieux Veronique, Blanc‐Mathieu Romain, van der Wal Froukje, Lucas Jérémy, Silva Catarina S., et al. 2021. “The Intervening Domain Is Required for DNA‐Binding and Functional Identity of Plant MADS Transcription Factors.” Nature Communications 12: 4760. 10.1038/s41467-021-24978-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Fontana, Mattia , Roosjen Mark, Crespo García Isidro, van den Berg Willy, Malfois Marc, Boer Roeland, Weijers Dolf, et al. 2023. “Cooperative Action of Separate Interaction Domains Promotes High‐Affinity DNA Binding of Arabidopsis thaliana ARF Transcription Factors.” Proc Natl Acad Sci USA 120: e2219916120. 10.1073/pnas.2219916120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Brown, Katrien , Takawira Lazarus T., O'Neill Marja M., Mizrachi Eshchar, Myburg Alexander A., and Hussey Steven G.. 2019. “Identification and Functional Evaluation of Accessible Chromatin Associated With Wood Formation in Eucalyptus grandis .” New Phytologist 223: 1937–1951. 10.1111/nph.15897 [DOI] [PubMed] [Google Scholar]
  • 55. Wu, Leiming , Zhang Mingliang, Zhang Ran, Yu Haizhong, Wang Hailang, Li Jingyang, Wang Youmei, et al. 2021. “Down‐Regulation of OsMYB103L Distinctively Alters Beta‐1,4‐glucan Polymerization and Cellulose Microfibers Assembly for Enhanced Biomass Enzymatic Saccharification in Rice.” Biotechnology for Biofuels 14: 245. 10.1186/s13068-021-02093-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Gomez‐Cano, Fabio , Chu Yi‐Hsuan, Cruz‐Gomez Mariel, Abdullah Hesham M., Lee Yun Sun, Schnell Danny J., and Grotewold Erich. 2022. “Exploring Camelina sativa Lipid Metabolism Regulation by Combining Gene Co‐Expression and DNA Affinity Purification Analyses.” The Plant Journal 110: 589–606. 10.1111/tpj.15682 [DOI] [PubMed] [Google Scholar]
  • 57. Orduña, Luis , Li Miaomiao, Navarro‐Payá David, Zhang Chen, Santiago Antonio, Romero Pablo, Ramšak Živa, et al. 2022. “Direct Regulation of Shikimate, Early Phenylpropanoid, and Stilbenoid Pathways by Subgroup 2 R2R3‐MYBs in Grapevine .” The Plant Journal 110: 529–547. 10.1111/tpj.15686 [DOI] [PubMed] [Google Scholar]
  • 58. Takawira, Lazarus T. , Hadj Bachir Ines, Ployet Raphael, Tulloch Jade, San Clemente Helene, Christie Nanette, Ladouce Nathalie, et al. 2023. “Functional Investigation of Five R2R3‐MYB Transcription Factors Associated With Wood Development in Eucalyptus Using DAP‐seq‐ML.” Plant Molecular Biology 113: 33–57. 10.1007/s11103-023-01376-y [DOI] [PubMed] [Google Scholar]
  • 59. Austin, Ryan S. , Hiu Shu, Waese Jamie, Ierullo Matthew, Pasha Asher, Wang Ting Ting, Fan Jim, et al. 2016. “New bar Tools for Mining Expression Data and Exploring Cis ‐Elements in Arabidopsis thaliana .” The Plant Journal 88: 490–504. 10.1111/tpj.13261 [DOI] [PubMed] [Google Scholar]
  • 60. Shi, Meicheng , Yu Lihua, Shi Jianan, and Liu Jin. 2022. “A Conserved MYB Transcription Factor Is Involved in Regulating Lipid Metabolic Pathways for Oil Biosynthesis in Green Algae.” New Phytologist 235: 576–594. 10.1111/nph.18119 [DOI] [PubMed] [Google Scholar]
  • 61. Lippold, Felix , Sanchez Diego H., Musialak Magdalena, Schlereth Armin, Scheible Wolf‐Ruediger, Hincha Dirk K., and Udvardi Michael K.. 2009. “AtMyb41 Regulates Transcriptional and Metabolic Responses to Osmotic Stress in Arabidopsis .” Plant Physiology 149: 1761–1772. 10.1104/pp.108.134874 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Lai, Xuelei , Blanc‐Mathieu Romain, GrandVuillemin Loïc, Huang Ying, Stigliani Arnaud, Lucas Jérémy, Thévenon Emmanuel, et al. 2021. “The LEAFY Floral Regulator Displays Pioneer Transcription Factor Properties.” Molecular Plant 14: 829–837. 10.1016/j.molp.2021.03.004 [DOI] [PubMed] [Google Scholar]
  • 63. Jin, Run , Klasfeld Samantha, Zhu Yang, Fernandez Garcia Meilin, Xiao Jun, Han Soon‐Ki, Konkol Adam, and Wagner Doris. 2021. “LEAFY Is a Pioneer Transcription Factor and Licenses Cell Reprogramming to Floral Fate.” Nature Communications 12: 626. 10.1038/s41467-020-20883-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Fernandez Garcia, Meilin , Moore Cedric D., Schulz Katharine N., Alberto Oscar, Donague Greg, Harrison Melissa M., Zhu Heng, and Zaret Kenneth S.. 2019. “Structural Features of Transcription Factors Associating with Nucleosome Binding.” Molecular Cell 75: 921–932.e6. 10.1016/j.molcel.2019.06.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Lieberman‐Lazarovich, Michal , Yahav Chen, Israeli Alon, and Efroni Idan. 2019. “Deep Conservation of Cis‐Element Variants Regulating Plant Hormonal Responses.” The Plant Cell 31: tpc.00129.2019. 10.1105/tpc.19.00129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Lu, Zefu , Marand Alexandre P., Ricci William A., Ethridge Christina L., Zhang Xiaoyu, and Schmitz Robert J.. 2019. “The Prevalence, Evolution and Chromatin Signatures of Plant Regulatory Elements.” Nature Plants 5: 1250–1259. 10.1038/s41477-019-0548-z [DOI] [PubMed] [Google Scholar]
  • 67. Tian, Feng , Yang De‐Chang, Meng Yu‐Qi, Jin Jinpu, and Gao Ge. 2020. “PlantRegMap: Charting Functional Regulatory Maps in Plants.” Nucleic Acids Research 48: D1104–D1113. 10.1093/nar/gkz1020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Cui, Liying , Wall P. Kerr, Leebens‐Mack James H., Lindsay Bruce G., Soltis Douglas E., Doyle Jeff J., Soltis Pamela S., et al. 2006. “Widespread Genome Duplications Throughout the History of Flowering Plants.” Genome Research 16: 738–749. 10.1101/gr.4825606 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Lambert, Samuel A. , Yang Ally W. H., Sasse Alexander, Cowley Gwendolyn, Albu Mihai, Caddick Mark X., Morris Quaid D., Weirauch Matthew T., and Hughes Timothy R.. 2019. “Similarity Regression Predicts Evolution of Transcription Factor Sequence Specificity.” Nature Genetics 51: 981–989. 10.1038/s41588-019-0411-1 [DOI] [PubMed] [Google Scholar]
  • 70. Jolma, Arttu , Yin Yimeng, Nitta Kazuhiro R., Dave Kashyap, Popov Alexander, Taipale Minna, Enge Martin, et al. 2015. “DNA‐dependent Formation of Transcription Factor Pairs Alters Their Binding Specificity.” Nature 527: 384–388. 10.1038/nature15518 [DOI] [PubMed] [Google Scholar]
  • 71. Van Mourik, Hilda , Chen Peilin, Smaczniak Cezary, Boeren Sjef, Kaufmann Kerstin, Bemer Marian, Angenent Gerco C., and Muino Jose M.. 2023. “Dual Specificity and Target Gene Selection by the MADS‐domain Protein FRUITFULL.” Nature Plants 9: 473–485. 10.1038/s41477-023-01351-x [DOI] [PubMed] [Google Scholar]
  • 72. Smaczniak, Cezary , Muiño Jose M., Chen Dijun, Angenent Gerco C., and Kaufmann Kerstin. 2017. “Differences in DNA Binding Specificity of Floral Homeotic Protein Complexes Predict Organ‐Specific Target Genes.” The Plant Cell 29: 1822–1835. 10.1105/tpc.17.00145 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Bemer, Marian , van Dijk Aalt D. J., Immink Richard G. H., and Angenent Gerco C.. 2017. “Cross‐Family Transcription Factor Interactions: An Additional Layer of Gene Regulation.” Trends in Plant Science 22: 66–80. 10.1016/j.tplants.2016.10.007 [DOI] [PubMed] [Google Scholar]
  • 74. Lai, Xuelei , Stigliani Arnaud, Lucas Jérémy, Hugouvieux Véronique, Parcy François, and Zubieta Chloe. 2020. “Genome‐Wide Binding of SEPALLATA3 and AGAMOUS Complexes Determined by Sequential DNA‐Affinity Purification Sequencing.” Nucleic Acids Research 48: 9637–9648. 10.1093/nar/gkaa729 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Morgunova, Ekaterina , Yin Yimeng, Das Pratyush K., Jolma Arttu, Zhu Fangjie, Popov Alexander, Xu You, Nilsson Lennart, and Taipale Jussi. 2018. “Two Distinct DNA Sequences Recognized by Transcription Factors Represent Enthalpy and Entropy Optima.” eLife 7: e32963. 10.7554/eLife.32963 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Inukai, Sachi , Kock Kian Hong, and Bulyk Martha L.. 2017. “Transcription factor‐DNA Binding: Beyond Binding Site Motifs.” Current Opinion in Genetics & Development 43: 110–119. 10.1016/j.gde.2017.02.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Slattery, Matthew , Riley Todd, Liu Peng, Abe Namiko, Gomez‐Alcala Pilar, Dror Iris, Zhou Tianyin, et al. 2011. “Cofactor Binding Evokes Latent Differences in DNA Binding Specificity Between Hox Proteins.” Cell 147: 1270–1282. 10.1016/j.cell.2011.10.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Rieu, Philippe , Turchi Laura, Thévenon Emmanuel, Zarkadas Eleftherios, Nanao Max, Chahtane Hicham, Tichtinsky Gabrielle, et al. 2023. “The F‐Box Protein UFO Controls Flower Development by Redirecting the Master Transcription Factor LEAFY to New Cis‐Elements.” Nature Plants 9: 315–329. 10.1038/s41477-022-01336-2 [DOI] [PubMed] [Google Scholar]
  • 79. Freire‐Rios, Alejandra , Tanaka Keita, Crespo Isidro, van der Wijk Elmar, Sizentsova Yana, Levitsky Victor, Lindhoud Simon, et al. 2020. “Architecture of DNA Elements Mediating ARF Transcription Factor Binding and Auxin‐Responsive Gene Expression in Arabidopsis .” Proceedings of the National Academy of Sciences 117: 24557–24566. 10.1073/pnas.2009554117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Guo, Xiaorui , Li Lihong, Liu Xiatong, Zhang Chong, Yao Xiaoyun, Xun Zhili, Zhao Zhijing, et al. 2022. “MYB2 Is Important for Tapetal PCD and Pollen Development by Directly Activating Protease Expression in Arabidopsis .” International Journal of Molecular Sciences 23: 3563. 10.3390/ijms23073563 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Cheng, Yuxin , Ma Yanxing, Zhang Na, Lin Rao, Yuan Yuan, Tian Hainan, Hussain Saddam, et al. 2022. “The R2R3 MYB Transcription Factor MYB71 Regulates Abscisic Acid Response in Arabidopsis .” Plants 11: 1369. 10.3390/plants11101369 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82. Jia, Tianqi , Zhang Kaidian, Li Fan, Huang Yifeng, Fan Manman, and Huang Tao. 2020. “The AtMYB2 Inhibits the Formation of Axillary Meristem in Arabidopsis by Repressing RAX1 Gene Under Environmental Stresses.” Plant Cell Reports 39: 1755–1765. 10.1007/s00299-020-02602-3 [DOI] [PubMed] [Google Scholar]
  • 83. Baek, Dongwon , Kim Min Chul, Chun Hyun Jin, Kang Songhwa, Park Hyeong Cheol, Shin Gilok, Park Jiyoung, et al. 2012. “Regulation of miR399f Transcription by AtMYB2 Affects Phosphate Starvation Responses in Arabidopsis .” Plant Physiology 161: 362–373. 10.1104/pp.112.205922 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Chen, Shifu . 2023. “Ultrafast One‐Pass FASTQ Data Preprocessing, Quality Control, and Deduplication Using Fastp.” iMeta 2: e107. 10.1002/imt2.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Nitta, Kazuhiro R. , Jolma Arttu, Yin Yimeng, Morgunova Ekaterina, Kivioja Teemu, Akhtar Junaid, Hens Korneel, et al. 2015. “Conservation of Transcription Factor Binding Specificities Across 600 Million Years of Bilateria Evolution.” eLife 4: e04837. 10.7554/eLife.04837 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Katoh, K. , and Standley D. M.. 2013. “MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability.” Molecular Biology and Evolution 30: 772–780. 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Nguyen, Lam‐Tung , Schmidt Heiko A., von Haeseler Arndt, and Minh Bui Quang. 2015. “IQ‐TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum‐Likelihood Phylogenies.” Molecular Biology and Evolution 32: 268–274. 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Korhonen, Janne , Martinmäki Petri, Pizzi Cinzia, Rastas Pasi, and Ukkonen Esko. 2009. “MOODS: Fast Search for Position Weight Matrix Matches in DNA Sequences.” Bioinformatics 25: 3181–3182. 10.1093/bioinformatics/btp554 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Chiu, Tsu‐Pei , Comoglio Federico, Zhou Tianyin, Yang Lin, Paro Renato, and Rohs Remo. 2016. “DNAshapeR: An R/Bioconductor Package for DNA Shape Prediction and Feature Encoding.” Bioinformatics 32: 1211–1213. 10.1093/bioinformatics/btv735 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Gu, Zuguang . 2022. “Complex Heatmap Visualization.” iMeta 1: e43. 10.1002/imt2.43 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. Abramson, Josh , Adler Jonas, Dunger Jack, Evans Richard, Green Tim, Pritzel Alexander, and Ronneberger Olaf, et al. 2024. “Accurate Structure Prediction of Biomolecular Interactions With AlphaFold 3.” Nature 630: 493–500. 10.1038/s41586-024-07487-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92. Jiang, Dingkun , Zhang Xinfeng, Luo Lin, Li Tian, Chen Hao, Ma Nana, Fu Lufeng, et al. 2025. “Cytosine Methylation Changes the Preferred Cis‐Regulatory Configuration of Arabidopsis WUSCHEL‐Related Homeobox 14.” International Journal of Molecular Sciences 26: 763. 10.3390/ijms26020763 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93. Bartlett, Anna , O'Malley Ronan C., Huang Shao‐shan Carol, Galli Mary, Nery Joseph R., Gallavotti Andrea, and Ecker Joseph R.. 2017. “Mapping Genome‐Wide Transcription Factor Binding Sites Using DAP‐seq.” Nature Protocols 12: 1659–1672. 10.1038/nprot.2017.055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94. Li, Hua , Chen Zhuo, Zhu Wenjun, Ni Xueting, Wang Junru, Fu Lufeng, Chen Jialin, et al. 2024. “The MaNAP1‐MaMADS1 Transcription Factor Module Mediates Ethylene‐Regulated Peel Softening and Ripening in Banana.” The Plant Cell 37: koae282. 10.1093/plcell/koae282 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95. Li, Heng , and Durbin Richard. 2009. “Fast and Accurate Short Read Alignment With Burrows‐Wheeler Transform.” Bioinformatics 25: 1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. Zhang, Yong , Liu Tao, Meyer Clifford A., Eeckhoute Jérôme, Johnson David S., Bernstein Bradley E., Nusbaum Chad, et al. 2008. “Model‐Based Analysis of ChIP‐Seq (MACS).” Genome Biology 9: R137. 10.1186/gb-2008-9-9-r137 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97. Yoo, SangDong , Cho YoungHee, and Sheen Jen. 2007. “Arabidopsis Mesophyll Protoplasts: A Versatile Cell System for Transient Gene Expression Analysis.” Nature Protocols 2: 1565–1572. 10.1038/nprot.2007.199 [DOI] [PubMed] [Google Scholar]
  • 98. Bolger, Anthony M. , Lohse Marc, and Usadel Bjoern. 2014. “Trimmomatic: A Flexible Trimmer for Illumina Sequence Data.” Bioinformatics 30: 2114–2120. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99. Kim, Daehwan , Langmead Ben, and Salzberg Steven L.. 2015. “HISAT: A Fast Spliced Aligner With Low Memory Requirements.” Nature Methods 12: 357–360. 10.1038/nmeth.3317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Patro, Rob , Duggal Geet, Love Michael I., Irizarry Rafael A., and Kingsford Carl. 2017. “Salmon Provides Fast and Bias‐Aware Quantification of Transcript Expression.” Nature Methods 14: 417–419. 10.1038/nmeth.4197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101. Zhu, Wenjun , Li Hua, Dong Pengfei, Ni Xueting, Fan Minlei, Yang Yingjie, Xu Shiyao, et al. 2023. “Low Temperature‐Induced Regulatory Network Rewiring via WRKY Regulators During Banana Peel Browning.” Plant Physiology 193: 855–873. 10.1093/plphys/kiad322 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102. Luo, Lin , Lin Dongmei, Li Jinhui, Chen Hao, Qu Qi, Zhang Lin, and Luo Yuan, et al. 2024. “EGDB: A Comprehensive Multi‐Omics Database for Energy Grasses and the Epigenomic Atlas of pearl millet .” iMeta 3: e263. 10.1002/imt2.263 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1: Closely related R2R3‐AtMYBs bind highly similar sequences.

Figure S2: HT‐SELEX enriches AtMYBs binding signals.

Figure S3: Modified specificities of closely spaced AtMYBs homodimers.

Figure S4: Enrichment of AtMYB2 binding modes in MYBs of other plants.

Figure S5: AtMYB2 recognizes unique targets with modified specificity.

Figure S6: The homodimeric binding modes of AtMYB2 activate transcription.

Figure S7: Transcriptional regulation by AtMYB2.

Data S1: De novo motifs from published DAP‐seq libraries of VIII AtMYBs.

Data S2: Monomeric and homodimeric motifs of VIII R2R3‐AtMYBs.

IMT2-4-e70009-s002.docx (17.4MB, docx)

Table S1: Sequence information for DNA ligands.

Table S2: Sequence information for dual‐luciferase reporter assay and EMSA.

Table S3: Orthologs of AtMYB2.

Table S4: MYB2 motif models from SELEX library.

Table S5: MYB57 motif models from SELEX library.

Table S6: MYB79 motif models from SELEX library.

Table S7: MYB90 motif models from SELEX library.

Table S8: MYB19 motif models from SELEX library.

Table S9: MYB46 motif models from SELEX library.

Table S10: MYB50 motif models from SELEX library.

Table S11: MYB61 motif models from SELEX library.

Table S12: MYB67 motif models from SELEX library.

Table S13: MYB83 motif models from SELEX library.

Table S14: MYB80 motif models from SELEX library.

Table S15: MYB49 motif models from SELEX library.

Table S16: MYB107 motif models from SELEX library.

Table S17: MYB63 motif models from SELEX library.

Table S18: MYB74 motif models from SELEX library.

Table S19: MYB13 motif models from SELEX library.

Table S20: MYB58 motif models from SELEX library.

Table S21: MYB10 motif models from SELEX library.

Table S22: MYB30 motif models from SELEX library.

Table S23: MYB30 motif models from SELEX library.

Table S24: MYB121 motif models from SELEX library.

Table S25: MYB27 motif models from SELEX library.

Table S26: MYB62 motif models from SELEX library.

Table S27: MYB108 motif models from SELEX library.

Table S28: MYB21 motif models from SELEX library.

Table S29: MYB71 motif models from SELEX library.

Table S30: MYB66 motif models from SELEX library.

Table S31: MYB14 motif models from SELEX library.

Table S32: MYB72 motif models from SELEX library.

Table S33: MYB114 motif models from SELEX library.

Table S34: MYB41 motif models from SELEX library.

Table S35: MYB34 motif models from SELEX library.

Table S36: MYB85 motif models from SELEX library.

Table S37: MYB51 motif models from SELEX library.

Table S38: MYB99 motif models from SELEX library.

Table S39: MYB15 motif models from SELEX library.

Table S40: MYB55 motif models from SELEX library.

Table S41: MYB86 motif models from SELEX library.

IMT2-4-e70009-s001.xlsx (386.9KB, xlsx)

Data Availability Statement

The data that support the findings of this study are openly available in the China National Genomics Data Center at https://ngdc.cncb.ac.cn/gsa/browse/CRA014337. All sequencing data have been deposited to China National Genomics Data Center under accession PRJCA022556, with raw sequence data under GSA: CRA014337 (https://ngdc.cncb.ac.cn/gsa/browse/CRA014337). Data link for UCSC Genome Browser session: https://genome.ucsc.edu/s/CH_1997/MYB2_project. The scripts have been deposited on GitHub (https://github.com/Daven-1997/2025_R2R3_MYBs_code). Supplementary materials (figures, tables, graphical abstract, slides, videos, Chinese translated version, and updated materials) may be found in the online DOI or iMeta Science http://www.imeta.science/.


Articles from iMeta are provided here courtesy of Wiley

RESOURCES