Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2019 Aug 10;47(17):9433–9447. doi: 10.1093/nar/gkz697

The MLL1 trimeric catalytic complex is a dynamic conformational ensemble stabilized by multiple weak interactions

Lilia Kaustov 1,2,2, Alexander Lemak 1,2, Hong Wu 3,2, Marco Faini 4, Lixin Fan 5, Xianyang Fang 5, Hong Zeng 3, Shili Duan 1, Abdellah Allali-Hassani 3, Fengling Li 3, Yong Wei 3, Masoud Vedadi 3,6, Ruedi Aebersold 4,7, Yunxing Wang 5, Scott Houliston 1, Cheryl H Arrowsmith 1,3,
PMCID: PMC6755125  PMID: 31400120

Abstract

Histone H3K4 methylation is an epigenetic mark associated with actively transcribed genes. This modification is catalyzed by the mixed lineage leukaemia (MLL) family of histone methyltransferases including MLL1, MLL2, MLL3, MLL4, SET1A and SET1B. The catalytic activity of this family is dependent on interactions with additional conserved proteins, but the structural basis for subunit assembly and the mechanism of regulation is not well understood. We used a hybrid methods approach to study the assembly and biochemical function of the minimally active MLL1 complex (MLL1, WDR5 and RbBP5). A combination of small angle X-ray scattering, cross-linking mass spectrometry, nuclear magnetic resonance spectroscopy and computational modeling were used to generate a dynamic ensemble model in which subunits are assembled via multiple weak interaction sites. We identified a new interaction site between the MLL1 SET domain and the WD40 β-propeller domain of RbBP5, and demonstrate the susceptibility of the catalytic function of the complex to disruption of individual interaction sites.

INTRODUCTION

Post-translational modifications on histone tails are key epigenetic signals for regulation of chromatin structure and gene expression. H3K4 methylation is a complex, dynamic process that is strongly correlated with actively transcribed genes or those that are in a poised or bivalent state (1). Mono-, di- and trimethlyated species of H3K4 exhibit a gradient distribution with respect to transcription start sites (TSSs); H3K4me3 is most abundant close to TSSs and in promoter regions, whereas H3K4me2/me1 marked histones are enriched further up- and downstream (2). H3K4 methylation is catalyzed by the MLL/SET1 family of histone methyltransferases (3,4), through their evolutionarily conserved SET domain (5,6). The founding member of this family of H3K4 methyltransferases is the yeast SET1 protein (7,8). In mammals, methylation of H3K4 is carried out by a family of six proteins: MLL (mixed lineage leukemia protein)1 to MLL4, SET1A and SET1B (9–15). The MLL proteins play crucial roles in embryonic development and hematopoiesis through transcriptional regulation of the clustered homeobox (Hox) genes and other genes important for developmental regulation (10,16–19). Deletion of MLL1 and MLL2 can lead to severe defects in embryonic development in mice (18,20). The MLL1 gene is frequently rearranged in human acute leukemia in both adults and children (21–23). Recently, studies have identified inactivating mutations in MLL3 and MLL4 in different types of human tumors (24–27), as well as in Kabuki syndrome (28).

The catalytic activity of MLL/SET family members are dependent to varying degrees on the presence of additional evolutionarily conserved protein subunits, RbBP5, WDR5, ASH2L and DYP30, which together form the core complexes of MLL enzymes (29–34). A minimal core enzyme can be reconstituted with the C-terminal SET domain fragment of MLLs and at least two of the other subunits (29,31,35). In studies of these reconstituted core enzymes, MLL1 appears to be unique among the family members in its requirements for, and interactions with other subunits. For example, in relation to other MLLs, the catalytic activity of MLL1 is most strongly stimulated by WDR5 (31,36,37), whereas it binds with the least affinity and is only weakly stimulated by the RbBP5–ASH2L heterodimer (35).

Crystallographic studies of MLL3 support a model in which the RbBP5–ASH2L heterodimer stabilizes the catalytically active conformations of MLL2/3/4 through interactions with conserved surfaces on their SET domain (35). However, it was suggested that two key variant residues on this surface of MLL1 dramatically weakened the interaction between MLL1 and RbBP5–ASH2L relative to that of other MLL members, thereby increasing the dependence of MLL1 on WDR5 (35). The unique dependence of MLL1 activity on WDR5 may be of therapeutic relevance, as we and others have shown that pharmacological targeting of the MLL interaction site on WDR5 can functionally antagonize MLL1 in cancers that are dependent on MLL1 activity (38–40).

While there are several structures of WDR5 bound to MLL and RbBP5 peptides (37,41–44), as well as a crystal structure of the apo-SET domain (45) of MLL1 and a 24 Å resolution cryo-EM model of the homologous yeast COMPASS (46), an atomic level picture of a functional MLL1 catalytic complex is still lacking. There is evidence of a hierarchical organization, wherein WDR5 and RbBP5 jointly interact with MLL1 to form a stable species (29,31,34), which we refer to as the ‘minimal catalytic complex’. This trimer can serve as a scaffold for the association of ASH2L and DPY30 (29).

Here, we report a hybrid methods study of MLL1 and its catalytic core components in solution. Using small angle X-ray scattering (SAXS), cross-linking mass spectrometry (XL-MS), nuclear magnetic resonance (NMR) spectroscopy and computational modeling we derived a dynamic ensemble model for the WDR5–RbBP5–MLL1 complex, and identify a new interaction site between the MLL1 SET domain, and the N-terminal WD40 repeat domain of RbBP5. Our data support the notion that the functional MLL1 enzyme comprises a collection of weak but specific interactions, and that the disruption of individual interactions can have significant destabilizing effects on the entire complex.

MATERIALS AND METHODS

Cloning of MLL1, WDR5 and RbBP5 constructs

The coding regions for human MLL1RBS-SET (residues 3785–3969) and MLL1WIN-SET(residues 3745–3969) were polymerase chain reaction (PCR)-amplified and subcloned into the pET28GST-LIC vector (GeneBank ID: EF456739). We generated two mutants of the MLLWIN-SET construct from the wild-type clone using QuikChange PCR mutagenesis kit (Agilent): (i) MLL1RBS-SET7D (where residues 3786–3792 were deleted) and (ii) MLL1RBS-SET3M (which has Q3787V, P3788L and Y3791G mutations). RbBP5 constructs of different lengths (comprising residues 10–340, 10–410, 320–410, 340–538 and 1–538) and WDR5WD40 (residues 24–334) were subcloned into the pET28-MHL vector (GeneBank ID: EF456738). For the characterization of the dimeric complexes, and reconstitution of the MLL1 trimeric complex, the following construct pairs were cloned into the pFastBac Dual expression vector (Thermo Fisher Scientific): (i) full-length WDR5 and His-tagged MLL1WIN-SET, and (ii) full-length WDR5 and His-tagged RbBP5.

Protein preparation

Individual components of the MLL1 complex were expressed in Escherichia coli and purified using an N-terminal GST-tag (for MLL1) or His-tag (for WDR5 and RbBP5). We found MLL1RBS-SET to be better behaved and more stable than MLL1WIN-SET. Therefore, we used the MLL1RBS-SET construct whenever possible. The characterization of complexes involving both MLL1 and WDR5 required the use of MLL1WIN-SET. The dimeric and trimeric complexes of MLL1 used for SAXS and cross-linking studies were expressed in Sf9 cells. The dimeric complex of WDR5–MLL1WIN-SET and WDR5–RbBP5 were purified using TALON affinity resin (Clontech), followed by gel filtration chromatography. Purified dimeric complexes were incubated together on ice for 2 h to reconstitute the trimeric complex, which was subsequently purified and recovered by gel filtration chromatography. Detailed procedures are described in the ‘Supplementary Data’ section.

SAXS data collection, analysis and modeling

SAXS measurements were carried out at the beamline 12-ID-C of the Advanced Photon Source, Argonne National Laboratory. The energy of the X-ray beam was 18 Kev (wavelength λ = 0.6888 Å), and two setups (small- and wide- angle X-ray scattering, SAXS and WAXS) were used in which the sample to charge-coupled device detector (MAR research, Hamburg) distance were adjusted to achieve scattering q values of 0.006 < q < 2.3Å−1, where q = (4π/λ)sinθ and 2θ is the scattering angle. Data were analyzed using the program PRIMUS (ATSAS package, EMBL (47)). Detailed descriptions of SAXS data collection, analysis and modeling protocols, are provided in the Supplementary Data.

Chemical cross-linking mass spectrometry

The reconstituted trimer of WDR5, RbBP5 and MLL1WIN-SET was cross-linked at a concentration between 12 μM and 16 μM, with 1 mM of isotopically coded disuccinimidyl suberate (DSS-d0,DSS-d12) as described previously (48). Protease digestion was carried out with LysC and trypsin. After acidification, cross-linked peptides were purified on C18 cartridges and enriched by size-exclusion chromatography (SEC). SEC fractions were analyzed in duplicate on an LC-MS (Easy-nLC 300; Orbitrap LTQ XL). For complete details, refer to Supplementary Data.

NMR spectroscopy

NMR spectra were collected at 25°C on a Bruker spectrometer operating at 800 MHz, and equipped with a cryoprobe. Samples contained 5% D2O with protein concentrations ranging from 100 to 350 μM. For the assignment of backbone resonances of WDR5WD40, a triply-labeled (15N/13C/2H) sample was prepared and conventional triple-resonance backbone spectra were acquired as described previously (49), using the ABACUS approach (50). (1H-15N)-TROSY titrations of 15N-labeled WDR5WD40 were performed by adding aliquots of peptides corresponding to the MLL1 WIN motif (GSARAEVHLRKS—i.e. MLL13762-3773) and RbBP5 WBM motif (EDEEVDVTSV—i.e. RbBP5371-380) at molar ratios ranging from 1:1 to 1:7. Weighted chemical shift displacements were calculated using the formula: Δppm = [(δNH)2+(δN/5)2]1/2. Spectra were processed with NMR Pipe (51) and analyzed with SPARKY (52).

GST Pull-down experiments

Recombinant purified MLL1-GST proteins were incubated with various RbBP5 constructs (in an assay buffer containing 20 mM TRIS pH 7.7, 150 mM NaCl, 10 μM ZnCl2, 5 mM β-mercaptoethanol, 5 mM dithiothreitol (DTT), and 1 mM phenylmethanesulfonyl fluoride (PMSF)) in a 1:2 molar ratio at 4°C for 1 h. Proteins were then incubated with 100 μL of glutathione-Sepharose beads (GE Healthcare) for an additional 1 h. The mixture was transferred to a micro-column and was extensively washed with assay buffer. Bound proteins were eluted with 30 mM reduced glutathione, and detected by sodium dodecyl sulphate-polyacrylamide gelelectrophoresis (SDS-PAGE) and Coomassie staining.

Biolayer Interferometry

The interaction between various RbBP5 constructs with GST-tagged MLLRBS-SET and WDR5 was measured using the Octet Red System (Forte Bio). All experiments were performed using phosphate-buffered saline containing 0.2 mg/ml bovine serum albumin and 0.1% (v/v) Tween-20, in a 96-well plate with 200 μL in each well and constant shaking (1000 rpm). GST-tagged constructs were loaded onto anti-GST antibody-coated biosensors (Forte Bio), and the sensors were washed for an extended period in the buffer. Loaded sensors were then incubated with RbBP5 constructs at different concentrations before discharge into separate buffer wells. The binding affinity was determined by steady-state analysis using the program Gnuplot.

Histone methyltransferase assay

Activity assays were performed in 50 mM Tris–HCl, pH 8.0, 5 mM DTT and 0.01% Triton X-100, using 5 μM 3H-SAM and 5 μM Biotin-H31-25. Increasing concentrations of RbBP5 were added to 200 nM of WDR5–MLL1WIN-SET (with either wild-type or mutant MLL1). All reactions were incubated for 90 min at room temperature and a scintillation proximity assay (SPA) was used to determine activities. Experiments were performed in triplicate. For assays with OICR-9429, increasing concentrations of the compound was incubated with 200 nM WDR5–MLL1WIN-SET for 20 min before adding 400 nM RbBP5.

RESULTS AND DISCUSSION

SAXS data reveal solution ensembles for WDR5, RbBP5 and MLL1RBS-SET

To model catalytically active MLL1 complexes, we first collected reference solution data for the individual subunits including the SET domain of MLL1, the WD40 repeat region of WDR5 (WDR5WD40), the N-terminal domain of RbBP5 (RbBP5NTD) and full-length RbBP5 (which we refer to from here forward simply as RbBP5), followed by the characterization of dimeric and trimeric complexes. Figure 1A shows the protein constructs used in this study. Normalized Kratky plots of WDR5WD40 and RbBP5NTD exhibit a typical bell-shape with a maximum at (1.73, 1.1) expected for globular proteins and are nearly superimposable in the q range 0<qRg<3 (Figure 1B). Also, the experimental values of Rg predicted for WDR5WD40 and RbBP5NTD are in agreement with the theoretical values expected for globular proteins (Table 1 and Supplementary Figure S1). The normalized Kratky plot of MLL1RBS-SET also exhibits a bell-shape, but its maximum is shifted with respect to the globular protein position, with poor convergence at high q-values, indicating that MLL1RBS-SET contains flexible regions. The observed flexibility of MLL1RBS-SET could be attributed to known inherent dynamics of the SET domain in the absence of cofactor (35), and to the disordered N-terminus of the MLL1RBS-SET construct. The calculated solution ensembles for each protein taking into account known or predicted disordered regions (see Supplementary Data for details) establish good correspondence between our SAXS measurements and the crystal structures of WDR5 (53), the SET domain of MLL1 (45) and the WD40 domain of RbBP5 (54) (Supplementary Figure S2).

Figure 1.

Figure 1.

Individual components of the WDR5–RbBP5–MLL1 complex. (A) Schematic representation of MLL1, RbBP5 and WDR5 domain organization and constructs used in this study. For clarity, only the C-terminal region is displayed for MLL1. WIN: WDR5 interacting motif as previously defined (41); RBS: RbBP5 binding site as defined in this study; SET: catalytic methyltransferase domain; NTD: N-terminal domain; CT: C-terminus; AS+ABM: activation segment and ASH2L binding motif as defined in (35); WBM: WDR5 binding motif (43). (B) SAXS-derived Rg-based Kratky plots indicate that WDR5WD40 (blue) and RbBP5NTD (magenta) are globular, while RbBP5 (red) and MLL1RBS-SET (green) exhibit some degree of flexibility. The error bars show propagated experimental errors. (C) Normalized pair distance distribution functions P(r) calculated from experimental SAXS data with GNOM.

Table 1.

SAXS parameters derived for the MLL1 trimeric complex, as well for its individual components and associated binary complexes

Rga (Å) Rgb (Å) Dmaxc (Å) Vcd Mwe NSDf
Individual Component
 MLL1WIN-SET 20.2 20.8 73 207 22.0 (21.6) 0.58
 WDR5WD40 19.8 19.9 70 251 25.9 (34.1) 0.60
 RbBP5NTD 21.5 21.7 76 276 28.7 (36.8) 0.72
 RbBP5 32.1 33.0 113 489 60.6 (59.1) 0.64
Binary complexes
 WDR5–MLL1WIN-SET 32.5 33.6 120 372 56.5 (64.7) 0.79
 WDR5–RbBP5 39.8 41.1 140 669 91.2 (96.5) 0.79
Trimeric complex
 WDR5–RbBP5–MLL1WIN-SET 49.1 51.8 183 929 135.5 (124.6) 0.69

aRadius of gyration calculated using Guinier fit.

bRadius of gyration calculated using GNOM (60).

cMaximum distance between atoms calculated using GNOM.

dVolume of correlation (61).

eMolecular weight (Mw) estimated from SAXS using Vc (61). The Mw expected from the sequence is shown in parentheses.

fNSD: Normalized spatial discrepancy; the values given are the average from fifteen runs of DAMMIF (62).

Initially, one of the main challenges in modeling the MLL1 complex was the lack of structural information on RbBP5. For our characterization and modeling of RbBP5-containing complexes we made use of a ROSETTA-derived homology model of its WD40 domain (i.e. RbBP5NTD). However, late in the course of manuscript preparation, Mittal et al. (54) reported the crystal structure of the mouse RbBP5 WD40 repeat region, which forms a canonical 7-unit β-propeller structure (PDB ID: 5OV3). The human and mouse WD40 domains of RbBP5 have 100% sequence identity and there is excellent agreement between our homology model and the reported structure (r.m.s.d. ∼2.1 Å; Supplementary Figure S2H), which we believe validates the model's use in our study. To help understand RbBP5 behavior in solution, we collected (1H-15N)-TROSY spectra of a full-length construct, as well as constructs corresponding to the C-terminus (CT) and NTD (Figure 2A). The spectrum of RbBP5NTD is consistent with our model, and the reported β-propeller fold; there is considerable peak dispersion due to the high β-strand content, and we are able to identify ∼250 out of 316 expected backbone amide signals. We see a similar level of peak dispersion in TROSY spectra of WDR5WD40 (vide infra). We can distinguish approximately seven out of nine expected tryptophan indole signals based on their position in the lower left corner of the spectrum; however without resonance assignments, this cannot be unambiguously verified. Amide residues in long unstructured regions of proteins generally have poorly differentiated chemical environments and long relaxation times due to fast internal dynamics on the ps-ns timescale, resulting in sharp signals clustered between 7.5 and 8.5 ppm (55). The spectrum of RbBP5CT indicates a lack of structure (Figure 2A). We are able to identify ∼120 peaks excluding putative side-chain signals that are expected to appear in the upper left region of the spectrum (i.e. 7.8-6.6 ppm for 1H and 115-110 ppm for 15N). The RbBP5CT construct contains 199 residues of which 19 are prolines and no tryptophans, and it is likely that several peaks comprise signals from two or more amides. The TROSY spectrum of RbBP5 (Figure 2A and Supplementary Figure S3A–D) reflects features of both its folded and unfolded regions and is of poor quality, likely due to its large size and dynamic properties. Comparison of 1D-1H overlays corresponding to the first 15N increment of TROSY spectra of RbBP5, RbBP5NTD and RbBP5CT indicates that the strongest amide signals for RbBP5 are clustered in the center of the spectrum and arise from residues in the C-terminus (Supplementary Figure S3A). Nevertheless, several resonances from the β-propeller region are visible, and do not uniformly overlap with those in the spectrum of RbBP5NTD. The NMR spectra indicate that RbBP5 exhibits a high degree of disorder and this is consistent with its gel filtration profile (Supplementary Figure S3E). This is also reflected in its Rg, calculated from SAXS measurements (Table 1 and Supplementary Figure S1), and the shape of the normalized Rg-based Kratky plot and the pair distance distribution function P(r) (Figure 1B and C). In particular, the P(r) function has an asymmetric shape with a long smooth tail at large r-values, and the position of its maximum is shifted only slightly (∼4 Å) with respect to that of RbBP5NTD. The latter features indicate that RbBP5 has no globular content beyond its β-propeller domain. Moreover, sequence-based theoretical calculations of both secondary structure and order parameters also predict a rigid globular N-terminus and a flexible coil-like C-terminus (Supplementary Figure S3F). Molecular weight estimates derived from SAXS data indicate that both RbBP5 and RbBP5NTD are monomeric in solution (Table 1).

Figure 2.

Figure 2.

SAXS and NMR analysis of RbBP5. (A) (1H-15N)-TROSY spectra of the RbBP5NTD β-propeller domain (left), RbBP5CT (middle) and full-length RbBP5 (right). For the C-terminus (CT), the amide resonances are clustered between 7.5 and 8.5 ppm in the 1H dimension indicating that it is unfolded, while the well-dispersed spectrum of the N-terminal domain (NTD) is consistent with our homology model and the reported crystal structure (54) (PDB ID: 5OV3). Overlay of spectra of RbBP5NTD with RbBP5 (right) shows that some β-propeller resonances exhibit peak shifts. (B) A representative member of the most populated model reflects a structured NTD and a flexible, but non-random CT. (C) The clear difference in Rg distribution profiles for the initial pool of 30 000 models (with random conformations of the CT (dashed)) versus the SAXS-derived ensemble (solid) indicate that the CT, in the context of full-length RbBP5, is not randomly disordered. Inset shows the pair distance distribution function P(r) calculated for the experimental data (black circles) and for the SAXS-derived ensemble (red line).

We used the sparse ensemble selection (SES) approach (56) to calculate a solution ensemble of RbBP5 that would satisfy the SAXS data. An initial ensemble consisting of 20 000 models with random conformations of its flexible regions (i.e. residues 1–23 and 326–538) did not fit the SAXS data well (goodness-of-fit χsaxs = 9.4). We next generated an ensemble that better fits the SAXS data, by calculating an optimal weight for each model in the initial ensemble using a multi-orthogonal matching pursuit algorithm (56) (see Supplementary Data for details). The resulting optimal ensemble fits the SAXS data very well with χsaxs = 0.38 (Figure 2C and Supplementary Figure S2B)—the most populated models are shown in Figure 2B and Supplementary Figure S2F and G. In these models both N- and C-terminal regions preferably ‘fold in’, rather than adopt extended conformations (Supplementary Figure S2F). The optimal ensemble displays a much more narrow Rg distribution than the initial random ensemble, with a major peak at 37 Å (Figure 2C). This indicates that RbBP5 is more compact than would be predicted if its C-terminus was fully random.

Binary subcomplexes have dynamic non-random solution conformations mediated by WD40 repeat domains

Our SAXS data for the binary complexes of WDR5–MLL1WIN-SET and WDR5–RbBP5 both suggest the presence of significant disorder, especially for WDR5–MLL1WIN-SET (Figure 3A). The P(r) functions of WDR5–MLL1WIN-SET and WDR5–RbBP5 are typical for proteins containing globular domains tethered by long disordered regions (Figure 3A). The position of the P(r) major peaks for the aforementioned complexes is close to their respective positions for the individual components (Figure 1C), indicating that in both complexes the globular domains are not in close contact and may not adopt a unique arrangement in solution. WDR5 is known to interact with RbBP5 and MLL1 through small peptide segments designated as the WDR5 binding motif (WBM) (43) and WDR5 interacting (WIN) motif (41), respectively (Figure 1A). Both interactions have reported dissociation constants on the order of 1–2 μM (36,41,43,44). To calculate solution ensembles of the binary complexes, we first used (1H-15N)-TROSY titrations to verify that WDR5′s interaction with the motifs, as observed in the crystal structures, is maintained in solution. To this end, we expressed a triply labeled (15N/13C/2H) WDR5WD40 construct (which contains 311 residues) and assigned 254 backbone spin systems representing 82% of the sequence (Supplementary Figure S4). The assignments have been deposited in the BMRB database (BMRB_ID: 27528). Amide resonance chemical shift perturbations (CSPs) were then quantified for WDR5 titrated with peptides corresponding to the two motifs. Residues with the highest CSPs (Supplementary Figure S5A) were mapped onto the WIN (PDB ID: 4ESG) and WBM (PDB ID: 2XL2) peptide-bound crystal structures (Figure 3B). For both titrations, all of the assigned WDR5 residues at the binding interface were among those with the highest CSPs (Supplementary Figure S5A). We are therefore confident in using the crystal structures to delineate restraints governing the interaction of these motifs with WDR5 in our modeling of the binary complexes.

Figure 3.

Figure 3.

WDR5–MLL1 and WDR5–RbBP5 binary complexes. (A) Rg-based Kratky plots (left) of SAXS data for binary complexes of WDR5–MLL1WIN-SET (orange) and WDR5–RbBP5 (maroon). Normalized pair distance distribution functions P(r) calculated from experimental SAXS data with GNOM (right). The data indicate that while both complexes are flexible, WDR5–MLL1WIN-SET possesses a significantly higher degree of flexibility than WDR5–RbBP5. (B) WDR5 residues (red, in stick representation) that exhibit the highest CSPs in (1H-15N)-TROSY titrations with peptides corresponding to the WIN (green) and WBM motifs (pink) (CSPs > 0.15 ppm for WIN peptide, CSPs > 0.23 for WBM peptide). CSPs are mapped onto the crystal structures of WDR5–WIN peptide (PDB ID: 4ESG) and WDR5–WBM peptide (PDB ID: 2XL2). Histograms presenting complete CSP data obtained from the two titrations are presented in Supplementary Figure S5A. (C) The centers of geometry of RbBP5NTD (pink) and WDR5WD40 (blue) domains are positioned at about the same distance (∼45 Å) in 95% of the models in the optimal ensemble. (D) Rg distribution for the initial pool of random structures (dashed) and for the SAXS-derived optimal ensemble (solid) of WDR5–RbBP5 (left) and WDR5–MLL1WIN-SET (right). Experimental SAXS profiles (black circles) with theoretical profiles (red line) averaged over the SES ensemble (inset).

The binary subcomplexes, which are both flexible, exhibit different structural organizations. The optimal ensemble for WDR5–MLL1WIN-SET has an Rg distribution as broad as the initial random ensemble (Figure 3D), and the arrangement of the globular domains in the most populated models does not support the existence of additional interactions outside of the WIN motif (Supplementary Figure S5). In contrast, the optimal ensemble for the WDR5–RbBP5 displays a relatively narrow Rg distribution, with a major peak at ∼41 Å (Figure 3D), indicating the predominantly populated conformations are more compact than those in the initial random ensemble. The relative position of the WDR5 and RbBP5 WD40 domains in the ensemble are well defined, with a distance between their centres of mass (dWR) equal to 45.1 ± 0.7 Å (Figure 3C and Supplementary S5G). There is no apparent direct contact between the domains and their relative orientation with respect to each other is variable. The r.m.s.d. between highly populated conformers in the optimal ensemble is ∼18 Å due to the interdomain dynamics. The preference for compact conformers may be explained by the formation of interactions between RbBP5CT and the two β-propeller domains. These contacts cannot be more precisely defined due to our use of rigid-body models in the calculations. We used biolayer interferometry (BLI) to estimate the binding affinity for WDR5–RbBP5 interaction, and in our hands found the KD to be ∼ 0.3 μM (Supplementary Figure S5H and Table S1). This compares to a value of ∼2.4 μM estimated using analytical ultracentrifugation by Cosgrove and colleagues (31) (Supplementary Table S1).

In summary, our structural analysis of the binary subcomplexes suggests that WDR5–RbBP5 is relatively compact, with a well-defined distance between the WD40 domains. In contrast, WDR5–MLL1WIN-SET has a significantly higher degree of flexibility, with a broad interdomain distance distribution profile. It should be noted that SAXS measurements were collected for a putative RbBP5–MLL1RBS-SET complex, however the data were not of high enough quality to proceed with structural analysis. We believe this is due to the weak affinity between MLL1RBS-SET and RbBP5, as compared to the intermolecular affinities observed with the WDR5–RbBP5 and WDR5–MLLWIN-SET pairs (see Supplementary Table S1).

SAXS and cross-linking data suggest a dynamic triangulated ensemble for WDR5–RbBP5–MLL1WIN-SET

Our SAXS data for the WDR5–RbBP5–MLL1WIN-SET complex showed significant flexibility in the sample. The shape of the experimental Kratky plots of the complex is typical of proteins with substantial interdomain flexibility (Figure 4A and Supplementary Figure S6A). In particular, the Rg-based Kratky plot is a bell-shaped curve with a maximum at (2.26, 1.27), coordinates which are shifted to higher values with respect to those expected for a globular protein. Also, the presence of a high degree of flexibility is evidenced by the poor convergence of the Kratky plots at high q-values. The low maximum value of 0.48 in the Vc-based Kratky plot (Supplementary Figure S6A), as well as the asymmetric shape of the P(r) function (Supplementary Figure S6B), suggests an elongated shape. This agrees with the averaged ab initio SAXS-derived molecular envelope, which showed an extended shape with approximate dimensions of 220 × 105 × 70 Å (Supplementary Figure S6C).

Figure 4.

Figure 4.

Dynamic model of the trimeric WDR5–RbBP5–MLL1WIN-SET complex derived from SAXS and cross-link data. (A) Rg-based Kratky plot of SAXS data for WDR5–RbBP5–MLL1WIN-SET indicates a high degree of flexibility. (B) Sequence mapping of intraprotein and interprotein cross-links. Intraprotein cross-links are indicated with purple arcs, while interprotein cross-links between globular domains or flexible regions are indicated with blue and dashed-black lines, respectively. (C) Rg distribution for the optimal ensemble of the trimer is shown by a solid black line. Experimental SAXS profile (black circles) plotted with theoretical profiles (red line) averaged over the ensemble (inset). (D) Cartoon diagram of the four most populated models of the optimal ensemble. Only structured domains are shown. The WD40 domains of RbBP5 and WDR5 are colored in pink and blue, respectively, while MLL1 SET domain is colored differently in each model (green, cyan, yellow and orange). For reference, the WD40 domains of each model are superimposed with the most populated model of the WDR5–RbBP5 dimer (Figure 3) displayed in a semi-transparent surface representation (gray).

We note that pair distance distribution functions of proteins containing several globular domains, connected by long disordered regions, are characterized by peaks at low r-values, corresponding to intradomain distances. Therefore, if the three globular domains of WDR5, MLL1WIN-SET and RbBP5 are not interacting directly with each other within the trimer, we would expect the P(r) function to have peaks at 26–32 Å, reflecting the interatomic distances prevailing within these domains (Figure 1C and Supplementary Figure S6B). However, the experimental P(r) function has its maximum at a much larger distance of ∼47 Å (Supplementary Figure S6B), suggesting the existence of interdomain contacts.

To aid our modeling of the trimeric complex, we performed XL-MS studies. We observed many intramolecular cross-links within each of the three proteins. These were highly consistent with the available WDR5 (53), MLL1WIN-SET (45) and RbBP5NTD (54) crystal structures indicating that the models are reliable representations of the domains within the complex in solution. We also observed a number of intermolecular cross-links, with the largest number being between MLL1 and RbBP5 suggesting they are in close proximity. Figure 4B shows sequence mapping of both intra- and intermolecular DSS cross-links. There are six intermolecular cross-links between lysine residues within the globular subunits that are shown on Figure 4B by solid blue lines. All 31 experimentally observed cross-links were used in the modeling (Supplementary Tables S2 and 3).

Using both SAXS and cross-linking data as conformational restraints, we utilized the SES approach to calculate solution ensembles of WDR5–RbBP5–MLL1WIN-SET that satisfy both sets of experimental data. An initial pool of representative structures was generated by combining rigid-body modeling and molecular dynamics simulations for both all-atomic and coarse-grained models along with cross-link derived distance restraints (see Supplementary Data for details). It was assumed that MLL1WIN-SET and RbBP5 were tethered to WDR5 via the WIN and WBM motifs, respectively, as seen in crystal structures (36,41–43). It should be noted here that individual members of the initial ensemble of conformers did not necessary satisfy all intermolecular cross-links: each satisfied on average three to four.

The optimal ensemble of WDR5–RbBP5–MLL1WIN-SET fits the SAXS data as a whole, with χsaxs = 0.23 in the q-range 0<q<0.23. While only SAXS data were used to select the optimal ensemble, each experimentally observed cross-link is consistent with at least one member, so that the ensemble as a whole is consistent with all 31 cross-links. The expected Rg exhibits a wide distribution with a maximum at ∼48 Å (Figure 4C) and the SES-derived ensemble suggests that the complex can assume a range of interdomain arrangements in solution (Figure 4 and Supplementary Figure S6C). One notable feature of the optimal ensemble is that the β-propeller domains of WDR5 and RbBP5 adopt the same relative positions within the trimer as they do in the WDR5–RbBP5 dimer (Figure 4D and Supplementary Figure S6E). The placement of the MLL1 SET domain is more variable. Most conformers (∼80%) adopt a compact arrangement in which the SET domain, and the two WD40 domains are in close proximity (Figure 4D). However, in a small population of conformers (∼20%), the SET domain is ‘detached’. When the conformers adopt a compact conformation, the relative position of the SET and RbBP5 β-propeller domains is well-defined (Supplementary Figure S6F) and this is supported by four interdomain cross-links (Figure 4B; Supplementary Figure S6D and Table S2). However, WDR5′s positioning varies because its contact with RbBP5 and MLL1 occurs within their flexible linker regions. There are only two interdomain cross-links that involve WDR5, and they can only be satisfied simultaneously in ∼10% of the conformers of the optimal ensemble.

The WD40 β-propeller domain of RbBP5 has a unique interaction with MLL1

A crystallographic study by Li et al. (35) highlighted the critical role of the AS+ABM region of RbBP5 in stimulating SET domain methyltransferase activity in the MLL family. The catalytic activities of MLL2/3/4 were found to be highly dependent on the presence of RbBP5AS+ABM-ASH2LSPRY. In contrast, methyltransferase activity of MLL1 was weakly stimulated by RbBP5AS+ABM-ASH2LSPRY, and more dependent on WDR5. The authors identified a surface of the SET domain (in the I-SET motif) that serves as a hub for MLL–RbBP5–ASH2L interaction. Two MLL1 residues at this surface (Asn3861 and Gln3867) have different side-chain properties compared to MLL2/3/4 (hydrophilic/bulky vs. hydrophobic) that prevent ‘optimal’ RbBP5AS+ABM-ASH2LSPRY interaction. Mutation of these two residues to their MLL2 (or MLL3) counterparts restored the binding interface, such that MLL1 could be crystallized with (PDB ID: 5F6L) and its methyltransferase activity stimulated by the RbBP5–ASH2L dimer.

Our study of the WDR5–RbBP5–MLL1WIN-SET complex provides a basis for understanding how MLL1 methyltransferase activity is stimulated by RbBP5 and WDR5. When the trimer adopts a compact configuration (found in ∼75% of the optimal ensemble conformers), we observe a direct interaction between the WD40 domain of RbBP5 and a short peptide sequence of MLL1 located between its WIN motif and SET domain. We refer to this 7-residue RbBP5 binding sequence as the RBS region of MLL1 (Figure 1A). To confirm this specific interaction, we performed GST pull-down and BLI binding studies with several RbBP5 and GST-tagged MLL constructs (Figure 5). Both MLL1RBS-SET and MLL1WIN-SET were found to interact exclusively with RbBP5 constructs containing the N-terminal WD40 domain (Figure 5A and B). No interaction is observed between MLL1 and RbBP5 AS-containing C-terminal constructs in GST pull-down (Figure 5C) or BLI assays (data not shown). BLI was used to estimate the KD for the interaction of MLL1RBS-SET and RbBP5NTD, and found to be ∼8 μM (Figure 5D and E).

Figure 5.

Figure 5.

The N-terminal β-propeller domain of RbBP5 binds to MLL1. GST pull-down experiments (where MLL constructs are GST-tagged) show direct interaction between (A) RbBP5 and (B) RbBP5NTD with MLL1WIN-SET and MLL1RBS-SET, (C) whereas no interaction is observed between RbBP5AS+ABM and MLL1. Control experiments were performed to show that the GST tag was not involved in the interaction (data not shown). (D) BLI sensorgrams for the binding of RbBP5NTD to sensor-immobilized MLL1RBS-SET, (E) and the corresponding steady-state binding curve. (F) Sequence alignment for MLL1, MLL2 and MLL3 linker residues between the WIN motif and SET domain, with the WIN motif in green and RBS in red. (G) In GST pull-down experiments (where MLL1 constructs are GST-tagged), no interaction is observed between RbBP5NTD and MLL1RBS-SET7D (i.e. the entire RBS is deleted) or MLL1RBS-SET3M (i.e. Q3787V, P3788L, Y3791G, MLL1 to MLL2 mutation—marked above with stars in panel F). (H) In histone methyltransferase activity assays, MLL1WIN-SET7D and MLL1WIN-SET3M constructs have significantly reduced catalytic activity compared to wild-type MLL1WIN-SET. (For pull-down experiments: L = Load, FT = Flow-through, W = Wash, E = Eluate)

The RBS is a unique feature of MLL1 and is not conserved in MLL2/3/4 which rely strongly on the ASH2L-RbBP5 dimer for activation (Figure 5F). We performed mutagenesis experiments to confirm the importance of the RBS in promoting MLL1 interaction with RbBP5, and in stimulating the methyltransferase activity of the SET domain. We constructed two MLL1 mutants: the first, MLLWIN-SET7D, where its 7-residue RBS was deleted; and the second, MLL1WIN-SET3M, where three mutations (Q3787V, P3788L and Y3791G) transform MLL1 RBS to the corresponding sequence of MLL2. Both mutants failed to bind to RbBP5 constructs containing the N-terminal WD40 domain (Figure 5G). Furthermore, we found that both deletion or mutation of MLL1 RBS decreased RbBP5′s ability to stimulate methyltransferase activity of the WDR5–MLL1WIN-SET complex (Figure 5H).

Our primary model for WDR5–RbBP5–MLL1WIN-SET, which represents the majority of compact conformers in our optimal ensemble is presented in Figure 6. In this model, only the β-propeller domain of RbBP5 makes contact with the MLL1 SET domain—the AS is not positioned correctly to enable contact with the SET domain. The RBS binding surface of RbBP5NTD consists of a number of hydrophobic residues (V249, W279, I283, L286, V287 and I289), as well as Q273, Y277 and P253 (Supplementary Figure S6G). The RBS may also participate in an intramolecular association with the SET domain which serves to bridge the RbBP5/SET interaction. Interestingly, in our model we see an interaction between RbBP5NTD and Asn3861 of the SET domain, which as noted above, was identified as one of the two critical residues that distinguishes MLL1 from MLL2/3/4 vis-à-vis its ability to interact with RbBP5–ASH2L. SET domain residues that form the putative RBS + RbBP5NTD binding interface (K3825, K3828, N3861, R3871, M3897, H3898, G3899, R3903 and F3904) are shown in Supplementary Figure S6H.

Figure 6.

Figure 6.

Organization of the WDR5–RbBP5–MLL1WIN-SET trimer. (A) Cartoon diagram of models representing the major population (∼80%) of the optimal ensemble. Only structured WDR5WD40 (purple/blue), RbBP5NTD (pink) and MLL1SET (green) domains are shown. For different members of the ensemble, the SET domain is superimposed and shown by both cartoon and transparent surface representation. (B) The most populated conformer representing the MLL1 trimer is shown by a surface representation of the globular regions and a backbone trace for the flexible ones. For clarity, the CT of RbBP5 (i.e. RbBP5382-538) is not shown and the residues from the AS+ABM region are in dark orange, WIN in green, WBM in violet and RBS in red. The cross-links between globular domains are shown by solid black lines and the cross-linked lysine residues are shown in cyan. Schematic representation of the domain arrangement and critical motif and segment interactions seen in the model are shown to the right.

Within our compact trimer optimal ensemble (Figure 4D) we see a small population of conformers that adopt a domain arrangement where the RbBP5 AS is favorably positioned to interact with the SET domain (Supplementary Figure S7). It is important to note that in these species, the RBS maintains its contact with RbBP5 NTD as seen in our primary model, however it no longer forms the intramolecular bridge with the SET domain (Figure 6 and Supplementary Figure S7). This minor population of conformers highlights the potential for dual NTD/AS RbBP5 contacts with MLL1. Our GST pull-down and BLI binding studies show that the AS does not on its own, measurably interact with the SET domain (Figure 5). However, it is possible there could be an avidity effect, where RBS binding to RbBP5NTD promotes SET domain/AS interaction. Our attempts to confirm this avidity under multiple buffer conditions using BLI were inconclusive—while we see potentially stronger SET domain binding using RbBP5 constructs having both the NTD and AS+ABM region (Supplementary Figure S8), the binding behavior gave rise to non-ideal BLI sensorgrams, without steady-state and complete dissociation phases needed for proper KD determination. We believe this is due to protein aggregation in the assays. At present, we can only speculate that within the context of the trimer, the presence of full-length RbBP5 and WDR5 may facilitate some level of AS/SET interaction. This is supported by MD simulations of the all-atom model of the trimer which were initiated with the three globular domains positioned according to the minor population in the ensemble (Supplementary Figure S7), and with the AS positioned in contact with MLL1 SET as per the crystal structure of MLL1N3861I/Q3867L bound to the RbBP5AS+ABM-ASH2LSPRY dimer (32). Throughout the course of the MD trajectory (100 ns), the AS maintained constant contact with the SET domain.

Taken together, our structural characterization of WDR5–RbBP5–MLL1WIN-SET suggests that its activation is mediated in part through the unique, but weak interaction of the MLL1 RBS with RbBP5, which in turn stabilizes the SET-I motif of the catalytic SET domain. WDR5 serves as a hub to promote this interaction, through its dual binding to the WIN (on MLL1) and WBM motifs (on RbBP5). Hence, we hypothesize that a triumvirate of weak, but specific intermolecular interactions are required to maintain the integrity of the MLL1 minimal complex, and that disruption of an individual interaction site may be sufficient to disrupt catalytic activity. To test this hypothesis, we measured the ability of OICR-9429, a small molecule antagonist of WDR5–MLL1WIN interaction, to disrupt the association of WDR5–RbBP5–MLL1WIN-SET using gel filtration (Figure 7A). The disruption of WDR5–MLL1 interaction by the compound compromised the assembly of the trimer (Figure 7A and B), and inhibited its catalytic activity (Figure 7C). This is consistent with our previous work showing that OICR-9429 can disrupt the assembly and function of endogenous MLL1 complexes in cells (38). Similar results have been reported for MM-401, a peptide-based antagonist of WDR5–MLLWIN interaction (39,57). This has important implications for the development of pharmacological antagonists of the MLL1 complex, and further strengthens this approach to target other multiprotein complexes that are dependent on weak, but druggable interactions.

Figure 7.

Figure 7.

OICR-9429 attenuates the assembly of a functional trimeric complex. (A) Size exclusion chromatography of WDR5–RbBP5–MLL1WIN-SET in the absence (navy) or presence of (cyan) of ∼5-fold molar excess OICR-9429. This compound binds to WDR5 (KD = 93 nM) (38) and competes with the MLL1 WIN motif. (B) SDS-PAGE of elution fractions (L = Load). Fractions containing the trimer and WDR5–MLL1WIN-SET (shoulder at 15.2 ml) are not recovered from the column when run in the presence of OICR-9429. (C) OICR-9429 inhibits the catalytic activity of the trimer.

During revision of our manuscript, the crystal (58) (PDB ID: 6CHG) and cryo-EM structures (59) (PDB ID: 6BX3) for the yeast COMPASS were reported, which comprises orthologues of SET1, WDR5, RbBP5, ASH2L and DPY30. It is interesting to note that the relative position of the WD40 domains of WDR5 and RbBP5 is conserved not only in our dimer and trimer models, but is also consistent with the orientation in the reported COMPASS structures (Supplementary Figure S9A). However, the relative positions of the SET and two WD40 domains adopted in COMPASS (Supplementary Figure S9B) is not consistent with our experimental SAXS and cross-links data obtained for the MLL1 trimer (Supplementary Figure S9C). Moreover, the domain arrangement in COMPASS is not compatible with any of the conformers that make up our optimal ensemble of the minimal MLL1 trimer. This extends as well to our preliminary characterization of SAXS and cross-links data for the MLL1 pentameric complex. These differences suggest additional evidence of the distinct properties of MLL1 among the SET1 family of enzymes.

Supplementary Material

gkz697_Supplemental_Files

ACKNOWLEDGEMENTS

For the SAXS experiments, we gratefully acknowledge use of the SAXS Core facility of the Center for Cancer Research, National Cancer Institute (NCI). The SAXS data were collected at beamline 12-ID-B. The shared scattering beamline 12-ID-B resource is allocated under the PUP-24152 agreement between the National Cancer Institute and Argonne National Laboratory (ANL). We thank Dr Xiaobing Zuo (ANL) for his expert support. Use of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility, was operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Natural Sciences and Engineering Research Council of Canada [RGPIN-2015-05939 to C.H.A.]; Canada Research Chair in Structural Genomics (to C.H.A.); Canadian Institutes of Health Research [FDN154328 to C.H.A.]; European Molecular Biology Organization (EMBO) Long Term Fellowship (to M.F.); ERC Grant [AdG-670821 Proteomics 4D to M.F., R.A.]. The Structural Genomics Consortium is a registered charity (no: 1097737) that receives funds from AbbVie, Bayer Pharma AG, Boehringer Ingelheim, Canada Foundation for Innovation, Eshelman Institute for Innovation, Genome Canada through Ontario Genomics Institute, Innovative Medicines Initiative (EU/EFPIA) ULTRA-DD [115766], Janssen, Merck & Co., Novartis Pharma AG, Ontario Ministry of Economic Development and Innovation, Pfizer, São Paulo Research Foundation-FAPESP, Takeda, and the Wellcome Trust. Funding for open access charge: Canadian Institutes of Health Research.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Hyun K., Jeon J., Park K., Kim J.. Writing, erasing and reading histone lysine methylations. Exp. Mol. Med. 2017; 49:e324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Shah R.N., Grzybowski A.T., Cornett E.M., Johnstone A.L., Dickson B.M., Boone B.A., Cheek M.A., Cowles M.W., Maryanski D., Meiners M.J. et al.. Examining the roles of H3K4 methylation states with systematically characterized antibodies. Mol. Cell. 2018; 72:162–177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Ruthenburg A.J., Allis C.D., Wysocka J.. Methylation of lysine 4 on histone H3: intricacy of writing and reading a single epigenetic mark. Mol. Cell. 2007; 25:15–30. [DOI] [PubMed] [Google Scholar]
  • 4. Shilatifard A. Molecular implementation and physiological roles for histone H3 lysine 4 (H3K4) methylation. Curr. Opin. Cell Biol. 2008; 20:341–348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Kouzarides T. Histone methylation in transcriptional control. Curr. Opin. Genet. Dev. 2002; 12:198–209. [DOI] [PubMed] [Google Scholar]
  • 6. Dillon S.C., Zhang X., Trievel R.C., Cheng X.. The SET-domain protein superfamily: protein lysine methyltransferases. Genome Biol. 2005; 6:227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Roguev A., Schaft D., Shevchenko A., Pijnappel W.W., Wilm M., Stewart A.F.. The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4. EMBO J. 2001; 20:7137–7148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Briggs S.D., Bryk M., Strahl B.D., Cheung W.L., Davie J.K., Dent S.Y., Winston F., Allis C.D.. Histone H3 lysine 4 methylation is mediated by Set1 and required for cell growth and rDNA silencing in Saccharomyces cerevisiae. Genes Dev. 2001; 15:3286–3295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Nakamura T., Mori T., Tada S., Krajewski W., Rozovskaia T., Dubois G., Mazo A., Croce C.M., Canaani E.. ALL-1 is a histone methyltransferase that assembles a supercomplex of proteins involved in transcriptional regulation. Mol. Cell. 2002; 10:1119–1128. [DOI] [PubMed] [Google Scholar]
  • 10. Dou Y., Milne T.A., Tackett A.J., Smith E.R., Fukuda A., Wysocka J., Allis C.D., Chait B.T., Hess J.L., Roeder R.G.. Physical association and coordinate function of the H3 K4 methyltransferase MLL1 and the H4 K16 acetyltransferase MOF. Cell. 2005; 121:873–885. [DOI] [PubMed] [Google Scholar]
  • 11. Hughes C.M., Rozenblatt-Rosen O., Milne T.A., Copeland T.D., Levine S.S., Lee J.C., Hayes D.N., Shanmugam K.S., Bhattacharjee A., Biondi C.A. et al.. Menin associates with a trithorax family histone methyltransferase complex and with the hoxc8 locus. Mol. Cell. 2004; 13:587–597. [DOI] [PubMed] [Google Scholar]
  • 12. Goo Y.-H., Sohn Y.C., Kim D.-H., Kim S.-W., Kang M.-J., Jung D.-J., Kwak E., Barlev N.A., Berger S.L., Chow V.T. et al.. Activating signal cointegrator 2 belongs to a novel steady-state complex that contains a subset of trithorax group proteins. Mol. Cell Biol. 2003; 23:140–149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Wysocka J., Myers M.P., Laherty C.D., Eisenman R.N., Herr W.. Human Sin3 deacetylase and trithorax-related Set1/Ash2 histone H3-K4 methyltransferase are tethered together selectively by the cell-proliferation factor HCF-1. Genes Dev. 2003; 17:896–911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Lee J.-H., Skalnik D.G.. CpG-binding protein (CXXC finger protein 1) is a component of the mammalian Set1 histone H3-Lys4 methyltransferase complex, the analogue of the yeast Set1/COMPASS complex. J. Biol. Chem. 2005; 280:41725–41731. [DOI] [PubMed] [Google Scholar]
  • 15. Lee J.-H., Tate C.M., You J.-S., Skalnik D.G.. Identification and characterization of the human Set1B histone H3-Lys4 methyltransferase complex. J. Biol. Chem. 2007; 282:13419–13428. [DOI] [PubMed] [Google Scholar]
  • 16. Milne T.A., Dou Y., Martin M.E., Brock H.W., Roeder R.G., Hess J.L.. MLL associates specifically with a subset of transcriptionally active target genes. Proc. Natl. Acad. Sci. U. S. A. 2005; 102:14765–14770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Milne T.A., Briggs S.D., Brock H.W., Martin M.E., Gibbs D., Allis C.D., Hess J.L.. MLL targets SET domain methyltransferase activity to Hox gene promoters. Mol. Cell. 2002; 10:1107–1117. [DOI] [PubMed] [Google Scholar]
  • 18. Yu B.D., Hess J.L., Horning S.E., Brown G.A., Korsmeyer S.J.. Altered Hox expression and segmental identity in Mll-mutant mice. Nature. 1995; 378:505–508. [DOI] [PubMed] [Google Scholar]
  • 19. Terranova R., Agherbi H., Boned A., Meresse S., Djabali M.. Histone and DNA methylation defects at Hox genes in mice expressing a SET domain-truncated form of Mll. Proc. Natl. Acad. Sci. U.S.A. 2006; 103:6629–6634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Glaser S., Schaft J., Lubitz S., Vintersten K., van der Hoeven F., Tufteland K.R., Aasland R., Anastassiadis K., Ang S.-L., Stewart A.F.. Multiple epigenetic maintenance factors implicated by the loss of Mll2 in mouse development. Dev. Camb. Engl. 2006; 133:1423–1432. [DOI] [PubMed] [Google Scholar]
  • 21. Ayton P.M., Cleary M.L.. Molecular mechanisms of leukemogenesis mediated by MLL fusion proteins. Oncogene. 2001; 20:5695–5707. [DOI] [PubMed] [Google Scholar]
  • 22. Hess J.L. Mechanisms of transformation by MLL. Crit. Rev. Eukaryot. Gene Expr. 2004; 14:235–254. [DOI] [PubMed] [Google Scholar]
  • 23. Krivtsov A.V., Armstrong S.A.. MLL translocations, histone modifications and leukaemia stem-cell development. Nat. Rev. Cancer. 2007; 7:823–833. [DOI] [PubMed] [Google Scholar]
  • 24. Morin R.D., Mendez-Lago M., Mungall A.J., Goya R., Mungall K.L., Corbett R.D., Johnson N.A., Severson T.M., Chiu R., Field M. et al.. Frequent mutation of histone-modifying genes in non-Hodgkin lymphoma. Nature. 2011; 476:298–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Dalgliesh G.L., Furge K., Greenman C., Chen L., Bignell G., Butler A., Davies H., Edkins S., Hardy C., Latimer C. et al.. Systematic sequencing of renal carcinoma reveals inactivation of histone modifying genes. Nature. 2010; 463:360–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Pleasance E.D., Stephens P.J., O’Meara S., McBride D.J., Meynert A., Jones D., Lin M.-L., Beare D., Lau K.W., Greenman C. et al.. A small-cell lung cancer genome with complex signatures of tobacco exposure. Nature. 2010; 463:184–190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Parsons D.W., Li M., Zhang X., Jones S., Leary R.J., Lin J.C.-H., Boca S.M., Carter H., Samayoa J., Bettegowda C. et al.. The genetic landscape of the childhood cancer medulloblastoma. Science. 2011; 331:435–439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Li Y., Bögershausen N., Alanay Y., Simsek Kiper P.O., Plume N., Keupp K., Pohl E., Pawlik B., Rachwalski M., Milz E. et al.. A mutation screen in patients with Kabuki syndrome. Hum. Genet. 2011; 130:715–724. [DOI] [PubMed] [Google Scholar]
  • 29. Dou Y., Milne T.A., Ruthenburg A.J., Lee S., Lee J.W., Verdine G.L., Allis C.D., Roeder R.G.. Regulation of MLL1 H3K4 methyltransferase activity by its core components. Nat. Struct. Mol. Biol. 2006; 13:713–719. [DOI] [PubMed] [Google Scholar]
  • 30. Dehé P.-M., Dichtl B., Schaft D., Roguev A., Pamblanco M., Lebrun R., Rodríguez-Gil A., Mkandawire M., Landsberg K., Shevchenko A. et al.. Protein interactions within the Set1 complex and their roles in the regulation of histone 3 lysine 4 methylation. J. Biol. Chem. 2006; 281:35404–35412. [DOI] [PubMed] [Google Scholar]
  • 31. Patel A., Dharmarajan V., Vought V.E., Cosgrove M.S.. On the mechanism of multiple lysine methylation by the human mixed lineage leukemia protein-1 (MLL1) core complex. J. Biol. Chem. 2009; 284:24242–24256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. van Nuland R., Smits A.H., Pallaki P., Jansen P.W.T.C., Vermeulen M., Timmers H.T.M.. Quantitative dissection and stoichiometry determination of the human SET1/MLL histone methyltransferase complexes. Mol. Cell Biol. 2013; 33:2067–2077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Couture J.-F., Skiniotis G.. Assembling a COMPASS. Epigenetics. 2013; 8:349–354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Ernst P., Vakoc C.R.. WRAD: enabler of the SET1-family of H3K4 methyltransferases. Brief. Funct. Genomics. 2012; 11:217–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Li Y., Han J., Zhang Y., Cao F., Liu Z., Li S., Wu J., Hu C., Wang Y., Shuai J. et al.. Structural basis for activity regulation of MLL family methyltransferases. Nature. 2016; 530:447–452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Avdic V., Zhang P., Lanouette S., Groulx A., Tremblay V., Brunzelle J., Couture J.-F.. Structural and biochemical insights into MLL1 core complex assembly. Structure. 2011; 19:101–108. [DOI] [PubMed] [Google Scholar]
  • 37. Zhang P., Lee H., Brunzelle J.S., Couture J.-F.. The plasticity of WDR5 peptide-binding cleft enables the binding of the SET1 family of histone methyltransferases. Nucleic Acids Res. 2012; 40:4237–4246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Grebien F., Vedadi M., Getlik M., Giambruno R., Grover A., Avellino R., Skucha A., Vittori S., Kuznetsova E., Smil D. et al.. Pharmacological targeting of the Wdr5-MLL interaction in C/EBPα N-terminal leukemia. Nat. Chem. Biol. 2015; 11:571–578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Cao F., Townsend E.C., Karatas H., Xu J., Li L., Lee S., Liu L., Chen Y., Ouillette P., Zhu J. et al.. Targeting MLL1 H3K4 methyltransferase activity in mixed-lineage leukemia. Mol. Cell. 2014; 53:247–261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Zhu J., Sammons M.A., Donahue G., Dou Z., Vedadi M., Getlik M., Barsyte-Lovejoy D., Al-awar R., Katona B.W., Shilatifard A. et al.. Gain-of-function p53 mutants co-opt chromatin pathways to drive cancer growth. Nature. 2015; 525:206–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Patel A., Dharmarajan V., Cosgrove M.S.. Structure of WDR5 bound to mixed lineage leukemia protein-1 peptide. J. Biol. Chem. 2008; 283:32158–32161. [DOI] [PubMed] [Google Scholar]
  • 42. Song J.-J., Kingston R.E.. WDR5 interacts with mixed lineage leukemia (MLL) protein via the histone H3-binding pocket. J. Biol. Chem. 2008; 283:35258–35264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Odho Z., Southall S.M., Wilson J.R.. Characterization of a novel WDR5-binding site that recruits RbBP5 through a conserved motif to enhance methylation of histone H3 lysine 4 by mixed lineage leukemia protein-1. J. Biol. Chem. 2010; 285:32967–32976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Dharmarajan V., Lee J.-H., Patel A., Skalnik D.G., Cosgrove M.S.. Structural basis for WDR5 interaction (Win) motif recognition in human SET1 family histone methyltransferases. J. Biol. Chem. 2012; 287:27275–27289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Southall S.M., Wong P.-S., Odho Z., Roe S.M., Wilson J.R.. Structural basis for the requirement of additional factors for MLL1 SET domain activity and recognition of epigenetic marks. Mol. Cell. 2009; 33:181–191. [DOI] [PubMed] [Google Scholar]
  • 46. Takahashi Y., Westfield G.H., Oleskie A.N., Trievel R.C., Shilatifard A., Skiniotis G.. Structural analysis of the core COMPASS family of histone H3K4 methylases from yeast to human. Proc. Natl. Acad. Sci. U.S.A. 2011; 108:20526–20531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Konarev P.V., Petoukhov M.V., Volkov V.V., Svergun D.I.. ATSAS 2.1, a program package for small-angle scattering data analysis. J. Appl. Crystallogr. 2006; 39:277–286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Leitner A., Walzthoeni T., Aebersold R.. Lysine-specific chemical cross-linking of protein complexes and identification of cross-linking sites using LC-MS/MS and the xQuest/xProphet software pipeline. Nat. Protoc. 2014; 9:120–137. [DOI] [PubMed] [Google Scholar]
  • 49. Lemak A., Gutmanas A., Chitayat S., Karra M., Farès C., Sunnerhagen M., Arrowsmith C.H.. A novel strategy for NMR resonance assignment and protein structure determination. J. Biomol. NMR. 2011; 49:27–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Lemak A., Steren C.A., Arrowsmith C.H., Llinás M.. Sequence specific resonance assignment via Multicanonical Monte Carlo search using an ABACUS approach. J. Biomol. NMR. 2008; 41:29–41. [DOI] [PubMed] [Google Scholar]
  • 51. Delaglio F., Grzesiek S., Vuister G.W., Zhu G., Pfeifer J., Bax A.. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995; 6:277–293. [DOI] [PubMed] [Google Scholar]
  • 52. Goddard T., Kneller D.G.. 2004; SPARKY 3. [Google Scholar]
  • 53. Schuetz A., Allali-Hassani A., Martín F., Loppnau P., Vedadi M., Bochkarev A., Plotnikov A.N., Arrowsmith C.H., Min J.. Structural basis for molecular recognition and presentation of histone H3 by WDR5. EMBO J. 2006; 25:4245–4252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Mittal A., Hobor F., Zhang Y., Martin S.R., Gamblin S.J., Ramos A., Wilson J.R.. The structure of the RbBP5 β-propeller domain reveals a surface with potential nucleic acid binding sites. Nucleic Acids Res. 2018; 46:3802–3812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Frueh D.P. Practical aspects of NMR signal assignment in larger and challenging proteins. Prog. Nucl. Magn. Reson. Spectrosc. 2014; 78:47–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Berlin K., Castañeda C.A., Schneidman-Duhovny D., Sali A., Nava-Tudela A., Fushman D.. Recovering a representative conformational ensemble from underdetermined macromolecular structural data. J. Am. Chem. Soc. 2013; 135:16595–16609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Alicea-Velázquez N.L., Shinsky S.A., Loh D.M., Lee J.-H., Skalnik D.G., Cosgrove M.S.. Targeted disruption of the interaction between WD-40 Repeat Protein 5 (WDR5) and Mixed Lineage Leukemia (MLL)/SET1 family proteins specifically inhibits MLL1 and SETd1A methyltransferase complexes. J. Biol. Chem. 2016; 291:22357–22372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Hsu P.L., Li H., Lau H.-T., Leonen C., Dhall A., Ong S.-E., Chatterjee C., Zheng N.. Crystal structure of the COMPASS H3K4 methyltransferase catalytic module. Cell. 2018; 174:1106–1116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Qu Q., Takahashi Y.-H., Yang Y., Hu H., Zhang Y., Brunzelle J.S., Couture J.-F., Shilatifard A., Skiniotis G.. Structure and conformational dynamics of a COMPASS histone H3K4 methyltransferase complex. Cell. 2018; 174:1117–1126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Svergun D., Barberato C., Koch M.H.J.. CRYSOL – a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates. J. Appl. Crystallogr. 1995; 28:768–773. [Google Scholar]
  • 61. Rambo R.P., Tainer J.A.. Accurate assessment of mass, models and resolution by small-angle scattering. Nature. 2013; 496:477–481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Franke D., Svergun D.I.. DAMMIF, a program for rapid ab-initio shape determination in small-angle scattering. J. Appl. Crystallogr. 2009; 42:342–346. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkz697_Supplemental_Files

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES