Abstract
Pioneer transcription factors are proteins that induce cellular identity transitions by binding to inaccessible regions of DNA in nuclear chromatin. They contribute to chromatin opening and recruit other factors to regulatory DNA elements. The structural features and dynamics modulating their interaction with nucleosomes are still unresolved. From a combination of experiments and molecular simulations, we reveal here how the pioneer factor and master regulator of pluripotency, Oct4, interprets and enhances nucleosome structural flexibility. The magnitude of Oct4’s impact on nucleosome dynamics depends on the binding site position and the mobility of the unstructured tails of nucleosomal histone proteins. Oct4 uses both its DNA binding domains to propagate and stabilize open nucleosome conformations, one for specific sequence recognition and the other for nonspecific interactions with nearby regions of DNA. Our findings provide a structural basis for the versatility of transcription factors in engaging with nucleosomes and have implications for understanding how pioneer factors induce chromatin dynamics.
INTRODUCTION
In eukaryotic cells, the DNA containing all the information about the cell’s fate and function is packed into chromatin inside the nucleus. The nucleosome, the basic unit of chromatin, is a nucleoprotein complex formed by 145–147 base pairs of DNA wrapped around an octamer of four histones proteins (H3, H4, H2A and H2B). Histones have a globular domain and disordered, charged regions at the N-terminus (and at the C-terminus of histone H2A), known as the histone tails (1).
The reading and regulation of the genetic information is carried out by site-specific DNA binding proteins called transcription factors (TFs) (2). The extent of genome packing leaves a sizeable fraction of the DNA inaccessible for the majority of TF binding due to occlusion of binding sites by the histone core. Interestingly, a subgroup of TFs known as pioneer TFs (pTFs) can recognize and bind their motifs in nucleosome occupied regions of the genome (3–5). A recent systematic study revealed that several families of transcription factors can have pioneer activity, binding with different affinities and mechanisms to nucleosomes (6).
One particularly interesting pTF is Oct4, a master regulator of pluripotency (7). Oct4 cooperates with pTFs Sox2 and Klf4 in the conversion of somatic cells to pluripotent stem cells during reprogramming (8). Structurally, Oct4 contains a DNA binding domain divided into two subdomains, the POU specific domain (POUS) and the POU homeodomain (POUHD), connected by a flexible linker. Together, the two subdomains recognize eight sequential base pairs in free DNA (9): ATGC by the POUS and [A/T]AAT by the POUHD. On nucleosomes Oct4 recognizes only half of its eight base pair binding site due to the twist of DNA and steric clashes with the histone core, which inhibit the canonical binding of the second subdomain (10–14).
Oct4 is known to bind cooperatively with Sox2 in the genome (15–22). Recent studies suggest they may also work together as pioneers on nucleosomal DNA (10,11,23), where one TF binding event alters DNA-histone contacts and DNA positioning, facilitating the binding of the second TF (13,14,22,24). However, the molecular mechanisms of pTF binding to the nucleosome and higher order chromatin structures, the effect pTF binding has on chromatin dynamics at different levels, and the role of pTFs binding in propagating the chromatin rearrangements needed for cellular transitions are not understood.
The first structure of a pTF-nucleosome complex resolved experimentally suggested a mechanism by which Sox2 binds and unravels nucleosomes (25). Following up on this, another structure revealed Oct4 together with Sox2 bound to a nucleosome (13). However, both structures are missing essential parts. In the first, the entire outer gyres of the nucleosomal DNA were not resolved, whereas in the second only the Oct4 POUS was resolved. The histone tails were not resolved in either. These and many other structural studies of nucleosome interactions use synthetic DNA sequences, which bind tightly to the histone core. These are well-suited for structural techniques, but have limited relevance for nucleosome behavior in vivo.
Echigoya et al. resolved the structure of a genomic nucleosome that can be bound by Oct4 (12,26). While they did not present the structure of the Oct4-nucleosome complex, they revealed a novel Oct4 binding site, different from those previously reported by Soufi et al. (11). Echigoya et al. speculated that the POUS subdomain of Oct4 recognizes this binding site (12). Moreover, they showed that Oct4 interacts with histone H3 and competes with the linker histone for binding. Both this study and the work of Michael et al. (13) highlighted the importance of the POUS but not the POUHD subdomain in Oct4’s nucleosome binding. Nevertheless, molecular simulations using a reduced (coarse-grained) representation of the molecular species revealed binding of Oct4 to this novel site (14), but with the POUHD domain recognizing the site. This is also supported by the DNA sequence that bares a typical homeodomain binding site.
We previously used molecular modeling and short molecular dynamics (MD) simulations to build models of Oct4 bound to nucleosomes on the binding sites proposed by Soufi et al. (10,11). From these, we confirmed that the canonical configuration of Oct4 with the subdomains bound on opposite faces of DNA is incompatible with nucleosome binding and we proposed alternative binding modes (27).
Here we report atomic resolution structural features and the dynamics involved in the Oct4 interaction with two genomic nucleosomes that differ in the number and positions of the Oct4 binding sites. We first validated the sequence specific binding of Oct4 to both nucleosomes and probed how this is affected by nucleosome flexibility using in vitro biochemical experiments. Then, we turned to atomistic MD simulations to understand how Oct4 binds to its motifs and how it modifies the structural flexibility of the nucleosomes. The two nucleosomes have sequences from regulatory elements of LIN28B and the ESRRB genes. We selected these from the earliest data set demonstrating Oct4’s pioneer function (10) for their importance in defining stem cell pluripotency (28–31), their activation after 48 h of pluripotency induction (RNA-seq), and their distinct arrangement of Oct4 binding sites. We found that Oct4 favors binding sites positioned near the DNA ends, recognizes its binding using either DNA binding subdomain, and requires some nucleosome structural flexibility for efficient binding. Oct4 also modifies nucleosome breathing and stabilizes open nucleosome conformations depending on the location of the binding site, the mobility of histone tails, and the motions of its DNA binding sudomains.
MATERIALS AND METHODS
Full-length Oct4 expression and purification
Experiments were all performed with purified full-length Oct4 from Mus musculus. Briefly, Oct4 was cloned into the pOPIN expression vector using the SLIC method and Phusion Flash High-Fidelity PCR Master Mix (Finnzymes/New England Biolabs). SLIC reactions were then transformed into One Shot™ OmniMAC™ 2 T1® Chemically Competent Escherichia coli (ThermoFisher Scientific; Waltham, MA). After sequencing, the pOPIN-cHis-Oct4 construct was co-transfected with flashBACULTRA™ bacmid DNA (Oxford Expression Technologies; Oxford, UK) into Sf9 cells (ThermoFisher Scientific) using Cellfectin II® (ThermoFisher Scientific) to generate recombinant baculovirus. Mid-log phase Sf9 cells were used to amplify the virus. Suspension High Five™ cells were infected with P3 virus for two days at 27°C and 120 rpm shaking. After expression, crude lysates were purified on a HiTrap TALON column (GE Healthcare; Chicago, IL), cleaved on the column with 3C protease and followed by size exclusion chromatography (HiLoad Superdex 200, GE Healthcare). The final product was collected in 25 mM HEPES pH 7.8, 150 mM NaCl, 1 mM TCEP, and 5% glycerol with around 95% purity confirmed by SDS-PAGE.
Nucleosome reconstitution
The LIN28B and ESRRB nucleosome sequences were taken from Soufi et al. (10) (Hg18 chr6:105, 638,004-105,638,165 and Hg18 chr14:75,995,474-75,995,636, respectively, see also Supplementary Methods). WT and mutant sequences were purchased from IDT (Coralville, IA, USA) with flanking AvaI restriction sites, sequenced, amplified in E. coli, digested, and finally purified by native PAGE using electroelution. After Cy5 labeling, DNAs were reconstituted at DNA:octamer ratios ranging from 1:1.2 to 1:1.6 with purified full-length Drosophila melanogaster histone octamer using the salt-gradient dialysis method previously described (32), final buffer composition: 10 mM HEPES pH 7.6, 50 mM NaCl, 1 mM EDTA, and 0.5 mM DTT. Following dialysis, nucleosomes were heat shifted at 37°C for 2 h and then checked for quality and concentration by native PAGE. Histone stoichiometry was checked by
SDS-PAGE followed by coomassie (R-250) (SERVA, Heidelberg, Germany) staining.
Electrophoretic mobility shifts and competition assays
For binding reactions, 20 nM nucleosomes were incubated with 0.05–0.4 μM of purified Oct4 in binding buffer (25 mM HEPES pH 7.6, 50 mM NaCl, 0.5 mM EDTA, 1 μg/μl BSA, 0.8 mM DTT, and 10% glycerol) for 1 h at 25°C. After incubation, reactions were run directly on a 6% native polyacrylamide gel (acrylamide/bis-, 37.5/1) containing 27 mM Tris-borate, 0.6 mM EDTA, and 5% glycerol and run in the same buffer. All Cy5-labeled DNA was detected using Fujifilm FLA-9000 (GE Healthcare). Competition assays using specific and nonspecific oligos were performed as previously described (11) using 0.2-4 μM of competitor, 20 nM nucleosome, and 105 nM Oct4. Oct4 dissociation from nucleosomes was determined by incubating specific competitor with the pre-formed Oct4/nucleosome complex that occurs after 1 h incubation at 25°C. Off-rate conditions were empirically determined for LIN28B and ESRRB nucleosomes due to the substantial differences in complex stability: LIN28B – 9 nM nucleosome, 45 nM Oct4 and 0.2-3.5 μM unlabeled competitor at 25°C for 30 min; and ESRRB – 9 nM nucleosome, 67.5 nM Oct4, and 9–90 nM unlabeled competitor at 25°C for 5 min. A short oligo containing the ESRRB Oct4 binding sequence was used as competitor for the ESRRB nucleosomes: AAGTGATAGTTATGCAGAGCGAATGGAGGG. For LIN28B, the specific competitor sequence published in Soufi et al. was used (11). Experiments were performed in triplicate and densitometry was carried out using a DNA standard curve and Quantity One® software (Bio-Rad, Hercules, CA). The values reported in Figure 2 were calculated by dividing the portion of Oct4-nucleosome complex by the value of starting free nucleosome (control). Two-way ANOVA using Sidak’s multiple comparison test was performed on relevant mean and standard deviation values. Statistics were done using Prism 7.0a for Mac. We also estimated the apparent dissociation constants of Oct4 from nucleosomes (see Supplementary Methods).
Figure 2.
Nucleosome dynamics influence Oct4 binding. (A, B) LIN28B. (C, D) ESRRB. EMSAs of purified Oct4 with untreated (left) or crosslinked (right) nucleosomes (A, C). Filled arrowheads signal nucleosomes or nucleosome-protein complexes and empty arrowheads point to free DNA. The bar graphs display the mean and standard deviation of densitometry relative to starting nucleosome amounts (B, D). n = 3 per condition, * P= 0.0344,** P= 0.0045, *** P= 0.0001. (E) Densitometry of Oct4 competition EMSAs using LIN28B (green) or ESRRB (blue) WT nucleosomes (mean values with the standard deviations as error bars). For ESRRB n = 4, for LIN28B n = 5. (F) Same experiment as in panel E but comparing LIN28B WT and 4m nucleosomes (for 4m n = 6).
Crosslinking experiments
Assemblies were generated as described in a previous section. Half of the assembly preparation was incubated with a final concentration of 1% formaldehyde on ice for 15 min. Crosslinking reactions were quenched by adding glycine to 250 mM final concentration. Crosslinking efficiency was checked by incubating an aliquot of crosslinked and uncrosslinked nucleosome in 15 mM MgCl2 and 300 mM NaCl at 60○C for 15 min and then running the samples on a 6% native PA gel. Samples were purified on a 10–30% sucrose gradient spun at 30 000 × g and 4○C in a Beckman Coulter Optima L-100 XP swing bucket rotor (SW-41; Brea, CA) for 18 h. Fractions were collected from the bottom of the gradient, screened on native gels, pooled, and quantitated by densitometry using a DNA standard curve and Quantity One® software (Bio-Rad, Hercules, CA). Off-rates with crosslinked nucleosomes were performed as described in the previous section.
Modelling Oct4-nucleosome complexes
To build the structural models of the LIN28B and ESRRB nucleosomes, we first selected 168 base pair sequences from the human genome from the data by Soufi et al. (10,11). The selection was based on a comparison of ChIP-Seq data (accession code GSE36570) for Oct4 binding during reprogramming of fibroblasts to pluripotency (48 h after induction of Oct4 and the other three transcription factors required) with MNase-seq data revealing nucleosome positioning in human fibroblasts (accession code GSM543311) (33). We considered the region around the MNase peak to correspond to the dyad and optimized the accessibility of the Oct4 binding sites proposed by Soufi et al. (S–1.5 and HD–4.5 on LIN28B and S+5.5 on ESRRB) (10,11) (see Supplementary Methods). Then we threaded these selected sequences on a 168 base pairs nucleosome with the original Widom 601 DNA sequence (34) and with Drosophila or human histones by swapping each base to the new sequence with the ‘swapna’ function in Chimera (35). The complete 168 base pair Widom nucleosomes were built using the DNA from the 3LZ0 structure with the histones (including the histone tails) modelled using the 2PYO and 1KX5 structures as templates (see Supplementary Methods). We previously performed extensive MD simulations with these nucleosomes (36).
Next, we modelled Oct4-nucleosome complexes using the LIN28B and ESRRB DNA sequences from the human genome with Drosophila melanogaster and human histones for the LIN28B and ESRRB nucleosomes respectively (36) (see Supplementary Methods for details). The initial configuration of Oct4 we took from the following structures: (i) Oct4 bound to DNA in the canonical configuration with the two subdomains bound on opposite sides of DNA (37) that were built based on the structure of Oct4 bound as a homodimer to the PORE motif (38). (ii) Oct4 bound with both subdomains on the same side of DNA (MORE configuration) obtained from the structure of Oct4 bound as the homodimer to the MORE motif (39) by stripping one monomer. (iii) Oct4 configurations obtained from a 100 ns MD simulation of apo Oct4 (27) (see MD protocol below).
To build models of Oct4-nucleosome complexes, we superposed the binding sites from the structures of Oct4 bound to free DNA to the binding sites on the nucleosomes and removed the free DNA. We superposed the MD-generated configurations of Oct4 on the initial models of canonical Oct4-nucleosome complexes (27) by fitting the subdomain that binds specifically to the DNA and removed the canonical configuration of Oct4. The resulting models of Oct4-nucleosome complexes had no steric clashes between the nonspecifically bound subdomain and the core histones. All models were validated in initial 100 ns classical MD simulations (protocol below). The short simulations validating the complexes of Oct4 bound to the LIN28B nucleosome on the binding sites proposed by Soufi et al. (HD–4.5 and S–1.5) were presented in our previous study (27).
The models of free and Oct4-bound tail-less nucleosomes were obtained by removing all histone tails from the corresponding nucleosome and Oct4-nucleosome models after equilibration. The histone tails were defined as follows: residues 1–45 for H3 (Human and Drosophila), 1–32 for H4 (human and Drosophila), 1–18 and 119–129 for human H2A, 1–17 and 116–124 for Drosophila H2A, 1–33 for Human H2B and 1–31 for Drosophila H2B. 146 DNA basepairs centered on the dyad were defined as the nucleosome core DNA, whereas the remaining 11 basepair on each side were defined as the linker DNAs (L-DNAs). The outer gyres were defined as the last 40 bp at each end. The central 88 basepairs were considered as the inner gyre.
Molecular dynamics simulations
Classical MD (cMD) simulations were performed as previously described (27,36). Every simulated species was first solvated in a truncated octahedron box of SPC/E water molecules, with a layer of at least 12-15 Å of water around the solute. All systems had ∼350 000 atoms. Na+ ions were added to counter the negative charges of the system. K+ and Cl– ions were added, up to a concentration of 150 mM. The systems were then optimized with an energy minimization, performed with the AMBER software (40). Then, the systems were equilibrated for 13.5 ns, using NAMD (41). The equilibration protocol was adapted from Jerabek et al. (39) and is described in detail in the Supplementary Methods. Harmonic distance restraints were applied to maintain DNA base pairing and Oct4–DNA base interactions. The force constant for these was gradually decreased. At the latest stages, the equilibration was unrestrained. Then, we performed production simulations in NAMD, in the isobaric-isothermic (NPT, p = 1 atm, T = 300 K) ensemble, with Langevin dynamics for temperature control and a Nosé-Hoover and Langevin piston for pressure control. The Li-Merz ion parameters (42), the ff14SB (43) and the parmbsc1 force fields (44) were used for ions, protein, and DNA, respectively. Each individual simulation was 1 or 2 μs long and multiple replicas were performed (Table 1).
Table 1.
Overview of the simulations performed
| Number of Oct4–DNA contactsc | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Simulation namea | DNA | Oct4 site | Starting structureb | Time | Rgc | POUS-Bases | POUS-Backbone | POUHD-Bases | POUHD-Backbone |
| LIN28B1 | LIN28B | – | Free nuc | 1 μs | 49.2 (48.3–51.0) | – | – | – | – |
| LIN28B2 | LIN28B | – | Free nuc | 1 μs | 48.7 (48.0–49,9) | – | – | – | – |
| ESRRB1 | ESRRB | – | Free nuc | 1 μs | 48.6 (47.8–51.3) | – | – | – | – |
| ESRRB1–b | ESRRB | – | 1 μs of ESRRB1 | 1 μs | 50.2 (48.8–52.5) | – | – | – | – |
| ESRRB2 | ESRRB | – | Free nuc | 1 μs | 47.6 (47.1–48.2) | – | – | – | – |
| HD–7rev1 | LIN28B | HD–7 | Cano Oct4 | 1 μs | 49.8 (49.3–50.4) | 14 (5–28) | 77 (26–99) | 107 (86–126) | 208 (172–250) |
| HD–7rev2 | LIN28B | HD–7 | MD Oct4 | 1 μs | 50.1 (49.6–50.6) | 3 (0–10) | 30 (0–64) | 64 (36–81) | 133 (96–176) |
| HD–7rev3 | LIN28B | HD–7 | Cano Oct4 | 1 μs | 50.3 (49.6–50.9) | 95 (33–109) | 131 (63–155) | 105 (83–131) | 214 (147–254) |
| HD–7rev4 | LIN28B | HD–7 | MD Oct4 | 1 μs | 49.2 (48.6–49.8) | 7 (2–13) | 54 (22–98) | 62 (40–87) | 175 (135–214) |
| S+5.51 | ESRRB | S+5.5 | MD1 Oct4 | 3 μs | 54.5 (52.5–56.3) | 118 (102–135) | 202 (139–243) | 23 (9–39) | 135 (67–185) |
| S+5.51 – Oct4 | ESRRB | – | 1 μs of S+5.51 | 2 μs | 58.0 (55.6–59.0) | – | – | – | – |
| S+5.52 | ESRRB | S+5.5 | MD2 Oct4 | 3 μs | 49.5 (48.6–50.0) | 94 (75–115) | 104 (78–146) | 48 (22–71) | 59 (27–91) |
| S+5.52–b | ESRRB | S+5.5 | 1 μs of S+5.52 | 1 μs | 54.5 (51.3–57.6) | 94 (80–108) | 163 (136–187) | 59 (36–91) | 49 (22–78) |
| S+5.52–b – Oct4 | ESRRB | S+5.5 | 1 μs of S+5.52 | 1 μs | 51.1 (50.1–53.0) | – | – | – | – |
| S+5.53 | ESRRB | S+5.5 | MD1 Oct4 | 2 μs | 48.2 (47.4–48.9) | 123 (99–148) | 154 (93–191) | 25 (7–35) | 95 (62–132 |
| S+5.54 | ESRRB | S+5.5 | MD2 Oct4 | 1 μs | 48.7 (48.2–49.3) | 74 (55–112) | 138 (105–170) | 53 (3–87) | 83 (40–112) |
| S+5.55 | ESRRB | S+5.5 | MORE Oct4 | 1 μs | 48.2 (47.6–48.9) | 97 (82–111) | 115 (81–146) | 51 (12–64) | 157 (57–190) |
| S+5.5T1 | ESRRB | S+5.5 | MD1 Oct4 | 2 μs | 52.3 (50.0–56.8) | 117 (77–142) | 147 (105–187) | 10 (0–24) | 54 (27–77) |
| S+5.5T1 – Oct4 | ESRRB | – | 1 μs of S+5.5T1 | 1 μs | 48.0 (47.5–48.7) | – | – | – | – |
| S+5.5T1–b1 | ESRRB | S+5.5 | 800 ns of S+5.5T1 | 250 ns | 56.1 (54.9–58.2) | 98 (73–123) | 146 (112–183) | 1 (0–15) | 30 (0–71) |
| S+5.5T2 | ESRRB | S+5.5 | 250 ns of S+5.5T1–b1 | 1 μs | 53.8 (52.6–55.1) | 90 (74–104) | 139 (104–161) | 60 (28–84) | 144 (108–177) |
| S+5.5T1–b2 | ESRRB | S+5.5 | 800 ns of S+5.5T1 | 250 ns | 55.8 (54.2–58.3) | 100 (81–127) | 137 (103–169) | 3 (0–18) | 34 (0–76) |
| S+5.5T3 | ESRRB | S+5.5 | 200 ns of S+5.5T1–b2 | 1 μs | 54.1 (53.2–55.6) | 111 (85–130) | 172 (125–208) | 13 (4–56) | 137 (84–180) |
| ESRRB1–TL | ESRRB | – | eq of ESRRB1 | 2 μs | 51.3 (48.9–57.9) | – | – | – | – |
| ESRRB2–TL | ESRRB | – | eq of ESRRB2 | 2 μs | 52.3 (49.3–55.5) | – | – | – | – |
| S+5.51–TL | ESRRB | S+5.5 | eq of S+5.51 | 2 μs | 60.5 (53.9–62.8) | 94 (90–112) | 155 (111–186) | 58 (21–109) | 118 (79–154) |
| S+5.52–TL | ESRRB | S+5.5 | eq of S+5.52 | 2 μs | 57.7 (52.4–62.5) | 99 (81–125) | 159 (122–185) | 57 (29–88) | 100 (50–129) |
| S+5.53–TL | ESRRB | S+5.5 | eq of S+5.53 | 2 μs | 57.0 (51.1–61.6) | 93 (75–111) | 122 (89–158) | 85 (43–117) | 113 (67–151) |
| S+5.54–TL | ESRRB | S+5.5 | eq of S+5.54 | 2 μs | 58.5 (51.9–61.9) | 92 (77–107) | 135 (97–177) | 101 (53–134) | 99 (58–133) |
a ‘T’ = simulations started with H3 and H2AC tail configurations taken from the open nucleosome conformation established in S+5.51; ‘b’ = biased simulations; ‘TL’ = simulations with tail-less nucleosome.
b ‘Cano’, ‘MD’ = Canonical and MD-generated Oct4 configurations; ‘nuc’ = nucleosome; ‘eq’ = equilibration.
c Rg and the number of contacts are shown as the median with the percentiles 5 to 95 in brackets; a contact was defined as a non-hydrogen atom closer than 4.5 Å to another non-hydrogen atom.
Biased molecular dynamics simulations
We used minimal inter-atomic distances and coordination numbers (δmin and C) as collective variables for bias MD (bMD) simulations. These are defined in the Colvar module (45) implemented in NAMD and VMD (46) (https://colvars.github.io/). δmin between two groups of atoms were measured using the weighted mean distance collective variable distanceInv defined as follows:
![]() |
where ||dij|| is the distance between atoms i and j in groups 1 and 2 respectively, and n is an even integer. This distance will asymptotically approach the minimal distance when increasing n. We used n=100 because in test calculations we found it approximates best the minimal distance without compromising the measurement of the collective variable. C between two groups of atoms were measured as:
![]() |
where xi, xj are the atomic coordinates of atoms i and j, d0 is the threshold distance (4.0 Å), n and m are exponents that control the properties of the function (we used n = 6, m = 12).
The atom groups were defined using the Cα and P atoms from the proteins and DNA, respectively. To sample nucleosome conformations in which the histone tails do not interact with the linker DNA, we applied harmonic wall biases to keep δmin larger than 12 Å and C equal to 0 between the H3 and H2AC tails and the outer DNA gyre.
To sample the motion of the POUHD between DNA gyres, we applied additional steered MD harmonic biases in which δmin between POUHD and the inner and outer gyres were changed with constant velocity from 5 to 15 Å and from 32 to 12 Å ,respectively. To avoid the collapse of the POUHD on the outer gyre but near to the inner gyre, we also applied steered MD to bias the δmin between POUHD and the 3’ L-DNA between 58 and 28 Å. The steered MD biases were applied over 250 ns of simulation. To prevent the rapid closing of the nucleosome before the POUHD moved between the gyres, near the L-DNA, we also applied the harmonic wall biases to keep δmin and C between the H3 and H2AC tails and the outer DNA gyre larger than 30 Å and equal to 0, respectively, and to keep the δmin between the two gyres larger than 20 Å.
For all harmonic biases we used a force constant of 10 kcal/mol*Å2. After each biased simulation we performed additional, 1 μs long classical simulations with or without Oct4 to ensure that the application of the biases did not distort dynamics.
Analysis of the MD simulations
We fitted all simulations to the core region of the histones (excluding histone tails) using the initial minimized structure as a reference.
We characterized the breathing motions using the coordinate system originally described by Öztürk et al. (47), also used in our previous work (27,36). First, a coordinate system XYZ was defined, with the origin on the dyad. X was described as the vector along the dyad axis, Y as the cross product between X and a vector perpendicular to X intersecting it approximately at the center of the nucleosome, and Z as the cross product between X and Y. Two vectors, v3 and v5 were defined along the 3’ and 5’ L-DNAs. Then, the angle γ1 was defined as the angle between the projection of these vectors in the XZ plane and the Z axis, and γ2, as the angle between the projection of the vectors on the XY plane and the Y axis.
The mass-weighted radius of gyration of the DNA (Rg) and the number of contacts between histone tails or Oct4 and the DNA were calculated with cpptraj (48). A contact was defined when two non-hydrogen atoms were closer than 4.5 Å. Contacts were considered stable if they were present for more than 75% of the simulation.
RESULTS
Oct4 binds to specific sites on nucleosomes using either DNA binding subdomain
We first asked how each subdomain of Oct4 contributes to Oct4 binding on the LIN28B and ESRRB nucleosomes. To address this, we characterized Oct4 binding to a series of native (WT) and mutated nucleosomal DNAs (Figure 1, Supplementary Figures S1, S2A–D). The native sequences we selected depict the diversity of Oct4 genomic binding: one sequence, LIN28B (11), contains multiple sites for Oct4 (Figure 1A) and other pTFs, whereas the other, ESRRB (Figure 1B) has a single Oct4 binding site. On LIN28B, the binding sites are located either towards the 5’ side (using the genomic 5’–3’ orientation as reference) or in the core of the nucleosome (Figure 1C), whereas on ESRRB the binding site lies in the core, closer to the 3’ end (Figure 1D).
Figure 1.
Oct4 binds to native nucleosomes at different positions with either DNA binding subdomain. (A, B) Schematics of the consensus OCT4 binding site, LIN28B (A) and ESRRB (B) genomic nucleosomes and mutants showing the positioning and sequences of the Oct4 binding sites. The mutants are named with a ‘m’ preceded by the number of the binding sites mutated and followed by the names of these sites. The mutated sites are shown in brackets if there are more than three. The POUS, POUHD, and Sox2 binding sites are in orange, teal, and red, respectively. Arrows indicate the binding orientation. The POUS binds in a 5’-ATGC-3’ orientation to free DNA, whereas the POUHD in a 5’-T(A)AAT-3’ orientation with the N-terminal tail recognizing the first half and the globular part the second half. (C, D) Structural views of LIN28B (C) and ESRRB (D) with the histone core in gray cartoons, DNA in white surface, binding sites in the corresponding colors, and histone tails in blue (H3), green (H4), orange (H2AN), red (H2AC), and purple (H2B). These representations and coloring are kept throughout the manuscript. (E, F) EMSAs of purified Oct4 with free DNA (left) and reconstituted nucleosomes (right). The third gel in 1E shows a competition assay using excess specific and nonspecific competitor. Filled horizontal arrowheads indicate nucleosomes or nucleosome-protein complexes and empty arrowheads, free DNA or DNA–protein complexes.
We first confirmed Oct4 binding to putative nucleosomal sites by mutating key DNA bases within these sites (Figure 1A, B), and then evaluated nucleosome binding in electrophoretic mobility shift assays (EMSAs). Cy-5 labeled DNAs were assembled into nucleosomes using salt gradient dialysis and the histone core positioning was checked to ensure it was comparable between sequences (Supplementary Figure S1). Mutation of the Oct4 sites and the Sox site proposed by Soufi et al. (‘3m’) (11) did not prevent Oct4 binding to LIN28B (Figure 1E). Given that Oct4 is also known to interact with histones (12,49) or nonspecifically with DNA, we added a molar excess of specific or nonspecific unlabeled double-stranded DNA and confirmed that the persistent binding of Oct4 to the 3m mutant nucleosome was sequence specific (Figure 1E). This opened the possibility that additional Oct4 binding sites exist on this nucleosome. A search of the LIN28B sequence revealed two potential homeodomain binding sites at superhelical locations (SHL) –7 and –1 (Figure 1A, C, HD–7 and HD–1). When we mutated all four Oct4 sites and the Sox site (5m), Oct4 nucleosome binding was reduced substantially (Supplementary Figure S2A,B).
In order to tease out the relative contributions of each site to overall Oct4–LIN28B binding, we mutated each site individually and in combinations leaving one site untouched. We observed that each site contributes to Oct4’s affinity, except for the Sox and HD-1 sites (Figure 1E, Supplementary Figure S2C,D). Interestingly, mutating just the HD–7 site, 1mHD-7, resulted in loss of Oct4 binding, suggesting that HD–7 is the primary Oct4 binding site (Figure 1E). While we performed the experiments and prepared the pre-print of this manuscript, two independent studies reported Oct4 binding to the HD-7 site, confirming our findings (12,14). Our results also indicate that the sites previously proposed by Soufi et al. (10,11) HD–4.5 and S–1.5 serve as secondary binding sites.
The HD–7 site contains a generic homeodomain binding site preceded by an adjacent POUS-like half-site (ATGA, not the canonical ATGC). Nevertheless, mutation of only the HD half-site reduced Oct4 binding substantially, demonstrating that the binding of Oct4 to this site is driven by the POUHD (Figure 1E, 1mHD-7), in agreement with the coarse grained simulations by Tan et al. (14) but in contrast with Echigoya et al. who suggested that the binding is driven by the POUS (12). Moreover, the sequence of the HD–-7 and HD–4.5 are identical (Figure 1A), indicating Oct4’s preference for the HD–7 position and not the DNA sequence alone. This is also reflected in the apparent Oct4 dissociation constants estimated from the EMSAs, where the value for the 1mHD–7 nucleosome is approximately four times larger than that for 1mHD–4.5 and 12 times WT (Supplementary Figure S2E). Therefore, Oct4 binds to LIN28B mainly through sequence specific binding of the POUHD to the HD–7 site.
Reconstitution of ESRRB WT and mutant sequences resulted in an assembly doublet (Figure 1F, Supplementary Figure S1F), which may be explained by the co-existance of two predominant conformations. Assemblies were routinely heat-shifted following salt-gradient dialysis suggesting both populations are thermodynamically stable (see Materials and Methods). Equal nucleosome-histone stoichiometry was also checked on coomassie stained SDS-polyacrylamide gels and histones appeared in comparable proportions, suggesting the presence of conformational isoforms (Supplementary Figure S1E). Furthermore, adding increasing amounts of free H2A/H2B dimer to histone octamer during assembly had no effect on the presence of either band, suggesting neither band contains a hexasome, which could be formed by the eviction of one H2A/H2B dimer from the octamer in the nucleosome core (50). Footprinting demonstrated not only were assemblies comparable between sequences but also suggested the presence of a middle positioned nucleosome (Supplementary Figure S1G, H).
In contrast to LIN28B, ChIP-Seq data show ESRRB contains one clear Oct4 binding peak at a canonical POUS half-site followed by a potential POUHD binding site (Figure 1B, D). To confirm this binding site, we first mutated the POUS site, S+5.5, and observed a complete loss of Oct4 binding to both the nucleosome and free DNA (Figure 1F, WT and 1mS+5.5). At high concentrations of Oct4, both free DNA and nucleosomes shifted to a smear rather than a defined band, which may be due to nonspecific interactions. When we integrated a homeodomain half-site directly 3′ to the mutated POUS site, binding was not restored to either free DNA or the nucleosome (Figure 1F, 1mS+5.5+HD+6). This may be due to either the position of the POUHD facing into the histone core or the POUHD’s inability to form a stable complex without the POUS binding to its binding site. When we generated atomistic models of the ESRRB nucleosome we observed the POUS half-site facing away from the histone core suggesting it would be accessible for Oct4 binding (Figure 1D). To check the potential binding of the POUHD on an exposed position at SHL +5.5, we introduced a canonical homeodomain half-site in the POUS half-site location (Figure 1B, F, +HD+5.5). This mutation abolished Oct4 binding to the nucleosome and allowed weak but distinct binding of Oct4 to free DNA, suggesting that both DNA sequence and structure play a role in nucleosome binding by the POUHD. Therefore, Oct4 binds to ESRRB at S+5.5 and the binding is driven by the POUS domain.
Oct4 uses nucleosome structural flexibility to bind
Recently, we reported differences in local nucleosome dynamics that extended beyond the linker DNA arms into the body of the nucleosome (27,36). Notably, dynamics were sequence dependent and greater in native nucleosomes, compared to the synthetic Widom 601 positioning sequence (34). We wanted to know how restricting these nucleosome dynamics would influence Oct4 binding.
To test this, we crosslinked nucleosomes using formaldehyde (Supplementary Figure S3A) before performing binding assays. We performed densitometry on EMSA gels and show the ratio of Oct4-nucleosome complex relative to the amount of control nucleosome for each condition (Figure 2A–D). For LIN28B, crosslinking moderately increased binding at low Oct4 concentrations, but reduced binding at high concentrations (Figure 2A, B). Crosslinking ESRRB resulted in overall reduced Oct4 binding (Figure 2C, D). This suggests that the intrinsic dynamics of ESRRB facilitate Oct4 binding, while binding to LIN28B is more nuanced due to the presence of multiple Oct4 binding sites.
We then wanted to know how the different WT and mutated sequences affect the stability of the Oct4–nucleosome complexes. After protein–nucleosome complex formation, we added increasing amounts of unlabeled specific competitor and monitored Oct4’s dissociation from the nucleosome. Notably, Oct4 separation from the LIN28B nucleosome required 20–40 fold higher amounts of competitor and six times longer incubation than from the ESRRB nucleosome (Figure 2E, Supplementary Figure S3B). These results corresponded with the number of Oct4 binding sites on each sequence, the higher affinity of the POUHD to free DNA comparing to POUS (37), and the higher structural flexibility of the LIN28B nucleosome (36).
When we mutated all proposed Oct4 binding sites except the primary HD-7 site (‘4m’), the complex dissociated faster than the complex with the WT LIN28B nucleosome (Figure 2F, Supplementary Figure S3C). This shows that the mutated sites contribute to the affinity of Oct4 for LIN28B together with the primary HD-7 site. In summary, Oct4 has a higher affinity for the LIN28B nucleosome in part due to the multiple binding sites and the dynamics between the DNA and the histone core. Oct4’s affinity for the lone ESRRB nucleosome binding site is partially dependent on nucleosome dynamics and substantially lower than that for LIN28B (also seen from the apparent Kd values in Supplementary Figure 2E).
Oct4 modulates nucleosome structural flexibility
To characterize the structural basis for the Oct4-nucleosome recognition and to evaluate how Oct4 binding impacts nucleosome dynamics, we generated structural models of Oct4 bound to the different binding sites (Figure 3, Supplementary Figure S4), performed multiple MD simulations with these models (Table 1), and compared them with the published simulations of free nucleosomes (27,36). We quantified nucleosome dynamics by measuring overall nucleosome compactness using the radius of gyration (Rg) and the breathing motions in two orthogonal planes, the plane of the core histones in a 2D top view of the nucleosome (angle γ1) and a plane perpendicular to it (angle γ2) (27,36,47) (see Materials and Methods).
Figure 3.
Oct4 binding modifies nucleosome breathing. (A–C) Oct4-LIN28B. (D–F) Oct4-ESRRB. Representative structures were selected from two independent simulations of each complex (A, B, D, E). In Oct4–LIN28B, Oct4 is bound to the HD–7 binding site, in reverse orientation (A, B). The configuration of Oct4 is either canonical (A) or generated from a simulation of apo Oct4 (B). In Oct4-ESRRB, Oct4 is bound to the S+5.5 site (D, E). Both starting configurations of Oct4 were taken from the simulation of apo Oct4. (see Materials and Methods). The structures were selected to correspond to the most sampled conformation in the following 2D histograms. The 2D histogram depict the γ1/γ2 conformational sampling in two simulations (2 μs aggregate time) of the free nucleosome (black) and two simulations (2 μs aggregate time) of Oct4-nucleosome complexes (green scatter plot with blue and yellow contours for the individual simulations) (C, F). The motions described by γ1 and γ2 are indicated with double headed arrows. The arrows in the square insets indicate the direction of the nucleosome opening. The ‘*’ labels the sequence specific bound subdomain (also in subsequent figures).
First, we extended the simulations of Oct4 bound to the sites proposed by Soufi et al. (11) on LIN28B (27) to 1 μs. Oct4 binding remained stable to these sites on this time-scale (Table 1) and did not influence the nucleosome dynamics (Supplementary Figure S4A,B).
To model Oct4 bound to the HD–7 site without clashes with the inner DNA gyre, we had to use partially open conformations of LIN28B from the simulations of the free nucleosome. We modelled two binding orientations as for the HD–4.5 site (27). The ‘reverse’ orientation has the POUHD bound to the typical homeodomain site TAAT(AC) on the 3’-5’ genomic DNA strand. The ‘forward’ orientation has the POUHD bound to the GTAT(TA) motif on the genomic 5’-3’ DNA strand (Table 1, Supplementary Table S1). The globular region of the POUHD recognizes specifically the central AT bases in both orientations. In contrast to the HD-4.5 site on which Oct4 binding was stable only on the forward orientation (27), on the HD-7 site, Oct4 binding was stable in both orientations (Videos S1, S2). Because the two orientations have a similar effect on the nucleosome structural flexibility, we only present the data for the reverse orientation. The data for the forward orientation is available in the Supplementary Materials (Supplementary Table S1, Supplementary Figures S4, S5).
The HD-7 site is located in the L-DNA but near the inner gyre (Figure 1C). This allows Oct4 binding in the canonical configuration (Table 1, Figure 3A). To sample alternative configurations with the POUS interacting nonspecifically with the nearby DNA segments, we also built models of the complex with MD-generated Oct4 configurations (Figure 3B). The binding of Oct4 was stable on the 1μs timescale and maintained the LIN28B nucleosome in partly open conformations, rarely sampled in simulations of the free nucleosomes. Practically, the position of the sequence-specific bound POUHD between the two DNA gyres blocked LIN28B closing (Figure 3C, Supplementary Figure S4).
In contrast, the binding of Oct4 in the canonical configuration was not possible on the ESRRB nucleosome because the binding site is located in the core nucleosomal DNA (Figure 1D). Instead, we modelled the Oct4-nucleosome interaction using alternative, MD-generated Oct4 configurations (Table 1, Figure 3D,E, see also Methods).The DNA readout of the POUS domain was maintained in all simulations analysed, indicating stable binding.
In one simulation of the Oct4–ESRRB complex (S+5.51) the nucleosome opened with an amplitude significantly larger than the breathing amplitude of free nucleosomes on the same time scale (Table 1, Figure 3F, Supplementary Figure S6A and Video S3). In the other simulations, the sampled nucleosome conformations were similar to those observed in the simulations of the free nucleosome (Supplementary Figure S6A and Video S4). The 2D γ1/γ2 histograms from all simulations of Oct4-ESRRB complex (Table 1, Supplementary Table S1) are shown in Supplementary Materials (Supplementary Figures S6, S7).
Oct4 cooperates with histone tails to unravel genomic nucleosomes
Next, we wanted to know whether the differences between the simulations of the Oct4-ESRRB complex are due to the histone tails, which are known to regulate inter- and intranucleosome dynamics (36,51). For this, we characterized how Oct4 binding influences the interaction of histone tails with the DNA by calculating the number of stable tail–DNA contacts (defined as contacts present in at least 75% of the simulation). We focused on those tails in the proximity of the HD–7 and S+5.5 binding sites on LIN28B and ESRRB, respectively (one monomer of H3, H2AC, H4 and H2B), and compared the simulations with and without Oct4 bound.
We previously reported that free LIN28B nucleosome opening was facilitated when the H3 and H2AC tails established fewer interactions with the L-DNA and more interactions with the core DNA (36) (Figure 4A, B). When Oct4 was bound to the main HD-7 site (Figure 4A, C, Supplementary Figure S5A, B), the nucleosome mostly sampled partially open conformations (Figure 4A) with very few stable contacts between H3 and the 5’ L-DNA (Figure 4C, Supplementary Figure S5A, B). The position of the POUHD bound at the edge between the L-DNA and the core nucleosomal DNA and the Oct4 linker in the canonical configuration (HD-7rev1 simulation), together blocked H3 tail binding to L-DNA and near the dyad region (base pairs 90–95) (Figure 4A, C). Moreover, the binding of Oct4 to the S–1 but not HD–4.5 site also resulted in fewer stable contacts between histone tails and the DNA (Supplementary Figure S5C, D).
Figure 4.
Oct4 modifies the histone tail-DNA interaction profiles.(A–C) LIN28B, free or with Oct4 bound to HD–7 in reverse orientation. (D–F) ESRRB, free or with Oct4 bound to S+5.5. Representative structures of the free nucleosome and Oct4-nucleosome complexes are shown together with the Rg histograms (A, D) to illustrate the nucleosome conformations sampled. The histone tail–DNA interaction profiles show the number of stable contacts between the tails in the proximity of the Oct4 binding site and the DNA in the simulations of free nucleosomes (B, E) and Oct4–nucleosome complexes (C, F). A contact was defined as stable if it was present in more than the 75% of the 1 μs simulation time. The dotted line marks the position of the dyad
Both in the ESRRB alone and the Oct4-ESRRB complex the nucleosome remained closed (Figure 4D) in all simulations in which stable interactions were formed between the H3 and H2AC tails and the DNA at the 3’ end of the nucleosome (Figure 4E,F, Supplementary Figure S8A).
Independent of the DNA sequence and the location of the binding site, the number of stable contacts between the H4 tail and the DNA increased upon Oct4 binding (Figure 4C, F, Supplementary Figures S5, S8). While this had no significant impact in nucleosome dynamics, it may affect the inter-nucleosome stacking in the presence of Oct4 since H4 tail is known to play a role in these interactions (52–54).
When ESRRB remained closed with Oct4 bound, the H3 and H2AC tails interacted with both the inner and outer gyres of the DNA (Figure 5A, B). When ESRRB opened, these interactions were absent and the open conformation was stable even after removing Oct4 (Figure 5C, D). Based on these findings, we hypothesized that the large amplitude opening occurred only in a single simulation of the Oct4–ESRRB complex because of the limited sampling of the mobility of histone tails on the 1μs timescale (55,56). In most simulations, the tails collapsed on the DNA and tail–DNA binding/unbinding events required for nucleosome opening were very rarely sampled. If our hypothesis was right, minimizing the interactions between H3 and H2AC tails and the outer gyre of DNA should facilitate nucleosome opening.
Figure 5.
Oct4 stabilizes and enhances histone tail mediated nucleosome opening (A, B) Oct4–ESRRB complex with the nucleosome closed. (C, D) Oct4–ESRRB complex with the nucleosome open. Representative structures from the S5.52 (A) and S5.51 (C) simulations are shown together with the corresponding time series of Rg (orange) and the number of contacts between the H3 tail and the inner (cyan) and outer (blue) gyres of the DNA (B, D). In gray/black are data from a simulation started after 1 μs of S5.51 after removing Oct4. (E) ESRRB nucleosome opening in simulations biased to prevent interactions between the H3 and H2AC tails near the Oct4 binding site and the outer DNA gyre. Time series of Rg are shown from simulations started after 1 μs of S5.52 with (orange) or without Oct4 (grey) and after 1 μs of ESRRB1 (green) (see Methods). (F) ESRRB nucleosome opening in simulations started with the H3 and H2AC tails remodelled using a representative configuration selected from the open nucleosome conformation found in S5.51. Time series of Rg from three simulations are shown: S5.5T1 (orange), (S5.5T1–Oct4) started after 1 μs of S5.5T1 after removing Oct4 (grey), and a 1 μs unbiased simulation (green) started after 800 ns of S5.5T1 and 250 ns in which a bias was added (S5.5T1-b1) to move the POUHD in between the two DNA gyres (see Materials and Methods). (F, G) ESRRB nucleosome opening in simulations started after removing the histone tails (tail-less ESRRB-TL nucleosome). Histograms of Rg (G) and γ1/γ2 angles (H) were calculated from 4 μs ensembles (two independent simulations, each 2 μs long). The ensemble of the free ESRRB-TL (black) is compared with two ensembles of Oct4-ESRRB-TL complexes (dark and light orange in (G), green in (H)) which differ in the configuration Oct4 used to start the simulations (Table 1). Two contours are shown in the 2D histograms corresponding to 1 and 100 counts.
To test this, we took three different approaches. First, we performed MD simulations in which the H3 and H2AC tails were biased not to interact with the outer gyre of the DNA. For this, we added harmonic biases to the minimal tail-DNA interatomic distances and to the corresponding coordination numbers, which reflect the number of contacts (see Methods). We started simulations from two of the simulations in which the nucleosome remained closed (S+5.52 and S+5.53), with and without Oct4 present. We also started biased simulations from the classical simulations of the ESRRB alone (Table 1, Supplementary Table S1). Remarkably, ESRRB opened in all biased simulations and the amplitude of the opening was larger with Oct4 bound (Figure 5E, Supplementary Figures S6B–D, S8C, S9A, Supplementary Video S5). Moreover, the open conformations were stable in 1μs of unbiased simulations, started after the biased simulations (Supplementary Figure S9B, C). In all biased simulations, the harmonic wall biases were effectively active only at the beginning of the simulations. In most simulations of the Oct4–ESRRB complex, once ESRRB opened it did not close back, suggesting that the bias only enhanced a natural motion of the Oct4-bound nucleosome.
Second, we performed a classical simulation of the Oct4–ESRRB complex (S+5.5T1) starting with a partially closed nucleosome but with configurations of the H3 and H2AC tails selected from the open ESRRB conformation established in the simulation S+5.51 (Table1). Again, the nucleosome opened (Figure 5F, Supplementary Figures S7A, S8D Video S6). However, after 1μs, the nucleosome closed into a compact conformation (Figure 5F, orange). This conformation was even more compact in a simulation started after 1μs of S+5.5T1 after removing Oct4 (Figure 5F, grey).
Third, we performed simulations with tail-less (‘-TL’) nucleosomes. As expected, as we’ve previously shown, the breathing motions of the free ESRRB-TL nucleosome had a larger amplitude than those of the complete nucleosome (36). However, in the presence of Oct4, the opening of the ESRRB-TL nucleosome was much larger (Figure 5G,H, Supplementary Figure S9D-F).
In summary, these data shows that Oct4 stabilizes and further enhances the opening of ESRRB. However, in native, complete nucleosomes, the impact of Oct4 on nucleosome dynamics depends on the mobility of the H3 and H2AC tails.
Oct4’s subdomains have distinct roles in stabilizing open nucleosomes
Comparing two classical simulations of the Oct4-ESRRB complex in which ESRRB opened extensively (S+5.51 and S+5.5T1), we observed in the first that the open conformation was not only stable for 3μs in the presence of Oct4 but it also remained stable when Oct4 was removed (Figure 5D, Supplementary Figure S6A), whereas in the second, the nucleosome closed after 1 μs (Figure 5F, Supplementary Figure S7A). In the first simulation, the open conformation was stabilized by nonspecific interactions between the POUHD and both DNA gyres, which prevented the nucleosome from closing (Figure 5C). We confirmed this by performing two 250 ns biased MD simulations started after 800 ns of S+5.5T1 (at maximum opening). In these, we biased the POUHD to move in between the gyres using harmonic steered MD biases while the nucleosome was maintained open with harmonic wall biases preventing interactions between the H3 and H2AC tails and the outer DNA gyre (see Methods). After 250 ns of each simulation, we switched off the bias for 1 μs of classical simulation (S+5.5T2, S+5.5T3) (Table 1). In both these simulations, the nucleosome remained open (Figure 5F, Supplementary Figures S7, S8D, S9G, Supplementary Video S7).
Visualizing the position of the Oct4 subdomains during 10 μs aggregate simulation time of the Oct4-ESRRB complex (Figure 6A), we observed a narrow distribution for the position of the POUS, which forms sequence specific contacts to the outer gyre (Figure 6B). The position of the POUHD showed a wider distribution with a large number of contacts both with DNA bases and the backbone of the inner gyre. When the ESRRB opened (simulation S+5.51), its contacts with the inner gyre were maintained while new contacts were formed with the outer gyre (Figure 6A). Similar distributions were observed in the 4 μs aggregate simulation time of the Oct4-ESRRB complex started with H3 and H2AC configurations selected from an open nucleosome conformation (Figure 6C). In the first simulation (S+5.5T1) in which the nucleosome closed after 1μs, the POUHD interacted only with the inner gyre, whereas in the other two, it formed nonspecific interactions with both gyres, preventing nucleosome closing (Figure 6D).
Figure 6.
Oct4 modifies nucleosome dynamics by sequence specific DNA recognition with one subdomain and nonspecific DNA interactions with the other subdomain. (A) Sampling of the nucleosome surface by the Oct4 subdomains in 10 μs aggregate simulation time of the Oct4-ESRRB complex. (B) Number of contacts between the POUS (red, orange) or the POUHD (blue, cyan) with DNA bases (sequence specific, upper plots) or DNA backbone (nonspecific, lower plots) of the inner gyre (red, blue) or the outer gyre (orange, cyan) in two of the simulations shown in (A) (S+5.51 and S+5.52). (C) Same as (A) in 4 μs aggregate simulation time of the Oct4-ESRRB complex started with the H3 and H2AC tails remodelled using a configuration selected from the open nucleosome conformation found in S+5.51. (D) Same data as in (B) but from the simulations S+5.5T1,T2.
Moreover, we performed two additional 3 μs classical simulations, replicates of S+5.5T1 which provided further evidence for the POUHD-mediated stabilization of the open nucleosome conformation. In one (S+5.5T4 in Supplementary Table S1), ESRRB remained open with the POUHD moving spontaneously between the two gyres, while in the other ESRRB closed like in S+5.5T1 (Supplementary Figures S7, S8D, S9H) because the POUHD did not occupy the position between the gyres.
In contrast, the partially open conformation of the LIN28B nucleosome was stabilized by the sequence specific binding of the POUHD to the outer gyre and by nonspecific interaction of the POUS with the outer gyre (Supplementary Figure S10).
In summary, these data show that Oct4 uses both domains to bind to and stabilize open nucleosomes. With one of its subdomain it recognizes specific sequences and with the second it establishes barriers to nucleosome closing by nonspecific interactions with nearby DNA segments
DISCUSSION
How pioneer factors (4,5) contribute to the transition of closed, silent chromatin to transcriptionally active DNA remains poorly understood. They bind to DNA wrapped in nucleosomes sometimes using only partial binding motifs and a range of translational and orientation binding preferences, suggesting a diverse range of potential pTF mechanisms (6). From the structural studies providing the first evidence of a direct TF-induced nucleosome opening, it has been proposed that for Oct4-Sox2 composite motifs, Sox2 binding to the minor groove deforms the DNA and mechanically loosens the grip of histones, freeing up buried binding sites and facilitating Oct4 binding (13,25). This mechanism involves Sox2-induced DNA opening and the use of only one of Oct4’s DNA binding subdomains, the POUS domain. However, several questions remain unanswered: (i) Does the multi-domain pTF Oct4 alone have a similar impact on the nucleosome as Sox2?, (ii) How do pTFs bind native nucleosome sequences?, and (iii) How do histone tails modulate or adapt to the binding?
Here we found that Oct4 can use either domain to recognize its binding sites on nucleosomes and we describe at atomistic resolution the interplay between intra-nucleosome dynamics and pTF binding (summarized in Figure 7).
Figure 7.
Summary of Oct4 mediated nucleosome dynamics The Oct4 POUS and POUHD subdomains are shown as orange and teal circles, respectively. Larger circles with straight lines represent nucleosomes and their linker DNA as viewed from the top in closed, partially open, or fully open configurations. The red points DNA indicate Oct4’s sequence specific binding half-sites. Coiled lines represent histone tails. Dotted arrows indicate hypothetical transitions.
Oct4 binds to one native nucleosome, LIN28B, primarily on a POUHD binding site at the 5’ end of the nucleosomal DNA (the HD-7 site), but also to the POUS and POUHD sites located in the core nucleosomal DNA (11). From our experiments, the affinity of Oct4 for LIN28B appears lower than that measured by Soufi et al. (11). This is most likely due to differences in experimental setup (e.g. preparation of nucleosomes and Oct4 protein). Mutating only the HD-7 site reduced Oct4 binding substantially, whereas mutating the other binding sites had only a moderate effect, in agreement with other studies of Oct4 binding to the LIN28B nucleosome (11,12,14). Overall, these data point to Oct4’s adaptability as a pTF, as it is able to recognize multiple binding sites on the LIN28B nucleosome using either the POUS or POUHD.
On another native nucleosome, ESRRB, Oct4 binds to the POUS binding site positioned at SHL +5.5. The dissociation of Oct4 from this nucleosome was significantly faster than dissociation from LIN28B. This may be explained both by the difference in the number of binding sites between the two nucleosomes and by the higher affinity of the POUHD domain to the DNA compared to the POUS (9,37).
We turned to molecular modelling and simulations to reveal the structural basis of Oct4-nucleosome interaction at atomic resolution. These methods are reliable alternatives to experiments but depend on some approximations. For example, to build structural models of Oct4-nucleosome complexes we first had to build models of the free nucleosomes. For this, we positioned the dyad at the center of the reported micrococcal nuclease (MNase) peak and optimized the accessibility of the proposed Oct4 binding sites (10,27,36). This is an approximation as MNase-seq does not reveal the nucleosome position at base pair resolution. The cryo-EM structure of the free LIN28B, which was not available when we started our study, revealed that our approach provided reliable models. The structure differs from our models by just one base pair (26). This minimal difference can be attributed to the intrinsic flexibility of genomic nucleosomes (27,36). Moreover, the structure was resolved with 147 and not 168 base pairs of DNA, a difference that may also lead to minimal variations in positioning.
To validate our models of the Oct4-nucleosome complex, we performed short MD simulations after we fixed Oct4-DNA contacts in the equilibration phase. The contacts were selected from the crystal structure of Oct4 bound to free DNA (38). This assumed the same contacts are formed with the nucleosomes, which has been confirmed for the POUS domain in the new structure of Oct4-Sox2-nucleosome complex (13). The question whether the homeodomain of Oct4 prefers a certain orientation to bind to nucleosome remains unanswered because for the HD-7 site we obtained stable Oct4 binding in both orientations.
We show that the intrinsic nucleosome flexibility is important for Oct4 binding. This may be particularly relevant for genomic nucleosomes as they are generally believed to be more flexible than those assembled on engineered sequences (57–59). Oct4 binding to crosslinked nucleosomes was reduced at high concentrations of Oct4. The moderate increase in the binding of Oct4 to crosslinked LIN28B at low Oct4 concentrations can be explained by the location of the predominant HD-7 on the linker DNA arm. We show that the binding of Oct4 to this higher affinity site requires more open conformations of LIN28B, which LIN28B often adopts (27,36). When crosslinked, LIN28B nucleosomes are fixed in a range of conformations. At lower concentrations, Oct4 could predominantly bind to more open conformations with higher affinity, which then become saturated at higher Oct4 concentrations. Without crosslinking, the more open conformations are short-lived and Oct4 binds predominantly to partially open conformations on the HD-7 site but also to closed nucleosomes on the other binding sites with lower affinity. Another possibility is an induced fit mechanism in which Oct4 facilitates the opening of closed LIN28B conformations upon initial nonspecific binding. The sites with lower Oct4 binding affinity located on the nucleosome core are only occupied at higher Oct4 concentrations.
Moreover, we show that Oct4 not only stabilizes open nucleosome conformations but also induces nucleosome opening. When bound to the HD-7 site on LIN28B, Oct4 stabilized a partially open conformations by the sequence specific binding of the POUHD in the space between the core and linker DNA (Figure 7). The POUS domain interacted nonspecifically with the LIN28B DNA, sampling a range of positions. This suggests that one mechanism by which Oct4 impacts chromatin dynamics is by restricting the breathing of the nucleosome towards partly open conformations. Although we did not observe an Oct4 induced opening of LIN28B, we can’t exclude this as a possibility due to the limited sampling achieved in our simulations. When bound to the binding sites located in the core nucleosomal DNA, Oct4 did not affect the dynamics of LIN28B. Given that LIN28B is bound by several pTFs, it is also possible that nucleosome opening requires the binding of multiple TFs if the binding sites are located in the nucleosome core, as was recently proposed by an allosteric mechanism in which the binding of Sox2 affects Oct4 binding (14). Taken together, the current data suggest that the impact of Oct4 binding on nucleosome dynamics depends on the position of the binding site.
Using multiple approaches, including classical and biased simulations with complete and tail-less nucleosomes we revealed an Oct4-mediated opening of the ESRRB nucleosome. The open conformation was stable after Oct4 was removed and was conditioned by the mobility of the histone H3 and H2AC tails. The resulting open conformation was stabilized both by the sequence specific binding of the POUS and nonspecific interactions between the POUHD and both gyres of DNA, which prevented the closing of the nucleosomes. Interestingly, the intrinsic flexibility of free ESRRB was also higher at the 3’ end where the Oct4 binding site is located (36), suggesting that Oct4 is capable of mediating a large opening of the nucleosome only in regions with increased intrinsic flexibility.
The position and conformation of the H3 and H2AC tails are critical for nucleosome breathing (36,60,61) and Oct4 mediated opening. In general, binding and unbinding of histone tails from the DNA modulate intra-nucleosome dynamics. However, simulating this motion of the tails is the most challenging part of MD simulations involving nucleosomes and appears to depend not only on the time scale sampled but also on other factors such as the water and ion models and the force field used in the simulations (55,56). We propose that substantial, short lived opening of free nucleosomes as well as longer-lived Oct4-mediated opening is only possible when these tails establish a minimum number of interactions with the linker DNAs. As long as the tails embraced the L-DNA arms and bridged them to the core nucleosomal DNA, the nucleosome remained closed. When Oct4 occupied the HD-7 site on LIN28B, both tails adopted configurations with fewer contacts with the L-DNA arms. On the free LIN28B, these tails occupy the region where Oct4 binds, suggesting a competition between Oct4 and histone tail binding. However, for LIN28B to open and allow Oct4 binding, the tails need to free that region and interact with other pieces of DNA around it. We propose that Oct4 binds to pre-established partly open conformations in which the tails do not occupy the Oct4 binding region. Once bound, Oct4 can stabilize the partly open conformations by preventing the tails to occupy that region and close the nucleosome. The large opening of ESRRB was also only possible in the absence of H3 and H2AC interactions with L-DNA. Interestingly, in all our simulations, the binding of Oct4 to the nucleosome was accompanied by a substantial increase of the number of stable contacts between H4 tail and the nucleosomal DNA. Because the H4 tail is essential to establish nucleosome-nucleosome interactions (52–54), the binding of Oct4 might also alter the higher order chromatin structures. The interplay between TF binding and histone tails was also observed in the Sox2 bound nucleosome structure in which the H4 tail was displaced compared to the free nucleosome (25). Therefore, the interplay between pTF and histone tail binding to nucleosomal DNA is essential for nucleating the opening of nucleosome and ultimately chromatin structure.
In this interplay, the DNA-binding subdomains of Oct4 have distinct roles. One is recognizing specific DNA sequences while the other interacts nonspecific with nearby DNA segments. For sequence specific binding, the accessibility of the binding site is essential. One side of the DNA is occluded by the histone octamer and cannot be accessed unless the nucleosome unwraps. However, even when located on the accessible face of the DNA, the binding sites may still have different accessibilities depending on the location on the nucleosome relative to the dyad. For example, on LIN28B, Oct4 binds stronger to the HD-7 on L-DNA than to HD-4.5 on the core nucleosomal DNA although the sequences are identical. Moreover, on ESRRB, when replacing the specific binding site of the POUS with a consensus POUHD site, Oct4 did not bind, suggesting that in addition to sequence composition and accessibility, the binding is also determined by the local structure of the free nucleosome. Both Oct4 domains bind in the major groove and do not substantially alter the mechanical properties of the DNA structure, as other minor groove binders, such as Sox2, do (25). Therefore, we propose that Oct4 does not mediate nucleosome opening by modifying the local DNA structure but rather by altering the optimal wrapping of the two gyres around each other and the histones.
The Oct4 mediated opening is further stabilized by non-specific DNA interactions of the second subdomain which prevent the re-wrapping of the outer gyre, possibly propagating local chromatin opening. How often this happens in the genomic context remains unclear. This suggests that pTFs with multiple DNA binding domains use their domains not only to recognize the specific DNA sequence of a binding site but also to establish barriers to nucleosome closing or inter-nucleosome stacking by interacting with sequentially distant pieces of DNA. The relative motion of the domains in one TF is restricted by the linker between them. This may further explain why a major effect on nucleosome dynamics was observed only for certain binding site positions. For example, on LIN28B, the binding sites located in the nucleosome core are too far from the linker DNA arm to allow bridging of the two gyres by the Oct4 domains.
Finally, our study is a demonstration of the strength of molecular simulations in revealing the structural features and the dynamic mechanisms involved in pTF-nucleosome binding. On one hand, MD simulations are powerful tools to validate or exclude TF binding modes to nucleosomes (27). On the other hand, supported by direct experimental validation of Oct4 binding to different locations on different nucleosomes, MD simulations enabled the discovery of the elegant mechanism we present in this article by which Oct4 engages with and unravels nucleosomes. Many details of this mechanism are not accesssible from experiments. Although atomistic MD simulations have been extensively used to study nucleosomes structure, dynamics, and interaction (62), they are limited by the amount of sampling achieved (63), and coarse grained representations are currently needed for larger scale systems (64). Despite reporting here extensive aggregate simulation times, we did not observe multiple opening events in a single long simulation. This would be essential for obtaining a quantitative description of nucleosome dyanmics. However, Armeev et al. showed that even 15 μs are still not enough for observing such transitions in complete nucleosomes (65). In addition, the accuracy of the force fields used for simulating the dynamics of unstructured regions such as the histone tails is still under scrutiny (66,67). These limitations need to be addressed in future studies.
DATA AVAILABILITY
All MD simulations presented (Table 1 and Supplementary Table S1) are available for download at the following link: https://datashare.mpcdf.mpg.de/s/OeXa432AsqwbGMC. See the ‘README_Huertas_MacCarthy_et_al_2022’ file therein for details and instructions.
Supplementary Material
ACKNOWLEDGEMENTS
We would like to thank: Peter Becker and Catherine Regnard for providing us with plasmids, reagents, technical assistance, and for their advice and active discussions; Ken Zaret and Abdenour Soufi for re-analyzing their data to provide us with native nucleosome sequences; Francesca Mattiroli for advice regarding the experiments and the manuscript. V.C. thanks the Hubrecht Institute and Alexandre Bonvin for support. J.H. was part of the International Max Planck Research School-Molecular Biomedicine, Münster, Germany.
Contributor Information
Caitlin M MacCarthy, Department of Cellular and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany.
Jan Huertas, Department of Cellular and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany; Yusuf Hamied Department of Chemistry, University of Cambridge, UK.
Claudia Ortmeier, Department of Cellular and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany.
Hermann vom Bruch, Department of Cellular and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany.
Daisylyn Senna Tan, School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, University of Hong Kong, Hong Kong SAR, China.
Deike Reinke, Max Planck Institute of Molecular Physiology, Dortmund, Germany.
Astrid Sander, Max Planck Institute of Molecular Physiology, Dortmund, Germany.
Tim Bergbrede, Lead Discovery Center GmbH, Dortmund, Germany.
Ralf Jauch, School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, University of Hong Kong, Hong Kong SAR, China.
Hans R Schöler, Department of Cellular and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany; Medical Faculty, University of Münster, Germany.
Vlad Cojocaru, Department of Cellular and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany; Computational Structural Biology Group, University of Utrecht, The Netherlands; STAR-UBB Institute, Babeş-Bolyai University, Cluj-Napoca, Romania.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Max Planck Society; the Royal Netherlands Academy of Arts and Sciences; University of Utrecht; Babes-Bolyai University; high performance computing resources were provided by the Max Planck Computing and Data Facility (www.mpcdf.mpg.de) and the Gauss Centre for Supercomputing e.V. (www.gauss-centre.eu), project ID 12622, STRUCNUCREC running at the Leibniz Supercomputing Centre (www.lrz.de). Funding for open access charge: Max Planck Society.
Conflict of interest statement. None declared.
REFERENCES
- 1. Luger K., Mäder A.W., Richmond R.K., Sargent D.F., Richmond T.J.. Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature. 1997; 389:251–260. [DOI] [PubMed] [Google Scholar]
- 2. Cramer P. Organization and regulation of gene transcription. Nature. 2019; 573:45–54. [DOI] [PubMed] [Google Scholar]
- 3. Cirillo L.A., Lin F.R., Cuesta I., Friedman D., Jarnik M., Zaret K.S.. Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. Mol. Cell. 2002; 9:279–289. [DOI] [PubMed] [Google Scholar]
- 4. Zaret K.S., Mango S.E.. Pioneer transcription factors, chromatin dynamics, and cell fate control. Curr. Opin. Genet. Dev. 2016; 37:76–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Zaret K.S. Pioneer transcription factors initiating gene network changes. Annu. Rev. Genet. 2020; 54:367–385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Zhu F., Farnung L., Kaasinen E., Sahu B., Yin Y., Wei B., Dodonova S.O., Nitta K.R., Morgunova E., Taipale M.et al.. The interaction landscape between transcription factors and the nucleosome. Nature. 2018; 562:76–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Jerabek S., Merino F., Schöler H.R., Cojocaru V.. OCT4: Dynamic DNA binding pioneers stem cell pluripotency. Biochim. Biophys. Acta Gene Regul. Mech. 2014; 1839:138–154. [DOI] [PubMed] [Google Scholar]
- 8. Takahashi K., Yamanaka S.. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell. 2006; 126:663–676. [DOI] [PubMed] [Google Scholar]
- 9. Verrijzer C.P., Alkemal M.J., Weperen W.W.V., Leeuwen H.C.V., Strating M.J.J., Viiet P.C.V.D.. Domain and Its Subdomains. EMBO J. 1992; 1:4993–5003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Soufi A., Donahue G., Zaret K.S.. Facilitators and impediments of the pluripotency reprogramming factors’ initial engagement with the genome. Cell. 2012; 151:994–1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Soufi A., Garcia M.F., Jaroszewicz A., Osman N., Pellegrini M., Zaret K.S.. Pioneer transcription factors target partial DNA motifs on nucleosomes to initiate reprogramming. Cell. 2015; 161:555–568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Echigoya K., Koyama M., Negishi L., Takizawa Y., Mizukami Y., Shimabayashi H., Kuroda A., Kurumizaka H.. Nucleosome binding by the pioneer transcription factor OCT4. Sci. Rep. 2020; 10:11832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Michael A.K., Grand R.S., Isbel L., Cavadini S., Kozicka Z., Kempf G., Bunker R.D., Schenk A.D., Graff-Meyer A., Pathare G.R.et al.. Mechanisms of OCT4-SOX2 motif readout on nucleosomes. Science. 2020; 368:1460–1465. [DOI] [PubMed] [Google Scholar]
- 14. Tan C., Takada S.. Nucleosome allostery in pioneer transcription factor binding. Proc. Natl. Acad. Sci. U.S.A. 2020; 117:20586–20596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Ambrosetti D.C., Basilico C., Dailey L.. Synergistic activation of the fibroblast growth factor 4 enhancer by Sox2 and Oct-3 depends on protein-protein interactions facilitated by a specific spatial arrangement of factor binding sites. Mol. Cell. Biol. 1997; 17:6321–6329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Nishimoto M., Fukushima A., Okuda A., Muramatsu M.. The gene for the embryonic stem cell coactivator UTF1 carries a regulatory element which selectively interacts with a complex composed of Oct-3/4 and Sox-2. Mol. Cell. Biol. 1999; 21:978–978. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Nordhoff V., Hübner K., Bauer A., Orlova I., Malapetsa A., Schöler H.R.. Comparative analysis of human, bovine, and murine Oct-4 upstream promoter sequences. Mamm. Genome. 2001; 12:309–317. [DOI] [PubMed] [Google Scholar]
- 18. Tomioka M., Nishimoto M., Miyagi S., Katayanagi T., Fukui N., Niwa H., Muramatsu M., Okuda A.. Identification of Sox-2 regulatory region which is under the control of Oct-3/4±Sox-2 complex. Nucleic Acids Res. 2002; 30:3202–3213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Rodda D.J., Chew J.L., Lim L.H., Loh Y.H., Wang B., Ng H.H., Robson P.. Transcriptional regulation of Nanog by OCT4 and SOX2. J. Biol. Chem. 2005; 280:24731–24737. [DOI] [PubMed] [Google Scholar]
- 20. Chen X., Xu H., Yuan P., Fang F., Huss M., Vega V.B., Wong E., Orlov Y.L., Zhang W., Jiang J.et al.. Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell. 2008; 133:1106–1117. [DOI] [PubMed] [Google Scholar]
- 21. Ng C.K., Li N.X., Chee S., Prabhakar S., Kolatkar P.R., Jauch R.. Deciphering the Sox-Oct partner code by quantitative cooperativity measurements. Nucleic Acids Res. 2012; 40:4933–4941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Malik V., Glaser L.V., Zimmer D., Velychko S., Weng M., Holzner M., Arend M., Chen Y., Srivastava Y., Veerapandian V.et al.. Pluripotency reprogramming by competent and incompetent POU factors uncovers temporal dependency for Oct4 and Sox2. Nat. Commun. 2019; 10:3477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Teif V.B., Vainshtein Y., Caudron-Herger M., Mallm J.P., Marth C., Höfer T., Rippe K.. Genome-wide nucleosome positioning during embryonic stem cell development. Nat. Struct. Mol. Biol. 2012; 19:1185–1192. [DOI] [PubMed] [Google Scholar]
- 24. Li S., Zheng E.B., Zhao L., Liu S.. Nonreciprocal and conditional cooperativity directs the pioneer activity of pluripotency transcription factors. Cell Rep. 2019; 28:2689–2703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Dodonova S.O., Zhu F., Dienemann C., Taipale J., Cramer P.. Nucleosome-bound SOX2 and SOX11 structures elucidate pioneer factor function. Nature. 2020; 580:669–672. [DOI] [PubMed] [Google Scholar]
- 26. Roberts G.A., Ozkan B., Gachulincová I., O’Dwyer M.R., Hall-Ponsele E., Saxena M., Robinson P.J., Soufi A.. Dissecting OCT4 defines the role of nucleosome binding in pluripotency. Nat. Cell Biol. 2021; 23:834–845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Huertas J., MacCarthy C.M., Schöler H.R., Cojocaru V.. Nucleosomal DNA dynamics mediate Oct4 pioneer factor binding. Biophys. J. 2020; 118:2280–2296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Adachi K., Kopp W., Wu G., Heising S., Greber B., Stehling M., Araúzo-Bravo M.J., Boerno S.T., Timmermann B., Vingron M.et al.. Esrrb unlocks silenced enhancers for reprogramming to naive pluripotency. Cell Stem Cell. 2018; 23:266–275. [DOI] [PubMed] [Google Scholar]
- 29. Zhang J., Ratanasirintrawoot S., Chandrasekaran S., Wu Z., Ficarro S.B., Yu C., Ross C.A., Cacchiarelli D., Xia Q., Seligson M.et al.. LIN28 regulates stem Cell metabolism and conversion to primed pluripotency. Cell Stem Cell. 2016; 19:66–80. [DOI] [PubMed] [Google Scholar]
- 30. Festuccia N., Owens N., Navarro P.. Esrrb, an estrogen-related receptor involved in early development, pluripotency, and reprogramming. FEBS Lett. 2018; 592:852–877. [DOI] [PubMed] [Google Scholar]
- 31. Kisa F., Shiozawa S., Oda K., Yoshimatsu S., Nakamura M., Koya I., Kawai K., Suzuki S., Okano H.. Naive-like ESRRB+ iPSCs with the Capacity for Rapid Neural Differentiation. Stem Cell Rep. 2017; 9:1825–1838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Luger K., Rechsteiner T.J., Richmond T.J.. Expression and purification of recombinant histones and nucleosome reconstitution. Meth. Mol. Biol. (Clifton, N.J.). 1999; 119:1–16. [DOI] [PubMed] [Google Scholar]
- 33. Kelly T.K., Liu Y., Lay F.D., Liang G., Berman B.P., Jones P.a.. Genome-wide mapping of nucleosome positioning and DNA methylation within individual DNA molecules Genome-wide mapping of nucleosome positioning and DNA methylation within individual DNA molecules. Genome Res. 2012; 22:2497–2506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Lowary P.T., Widom J.. New DNA sequence rules for high affinity binding to histone octamer and sequence-directed nucleosome positioning. J. Mol. Biol. 1998; 276:19–42. [DOI] [PubMed] [Google Scholar]
- 35. Pettersen E.F., Goddard T.D., Huang C.C., Couch G.S., Greenblatt D.M., Meng E.C., Ferrin T.E.. UCSF chimera: a visualization system for exploratory research and analysis. J. Comput. Chem. 2004; 25:1605–1612. [DOI] [PubMed] [Google Scholar]
- 36. Huertas J., Schöler H.R., Cojocaru V.. Histone tails cooperate to control the breathing of genomic nucleosomes. PLOS Comput. Biol. 2021; 17:e1009013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Merino F., Bouvier B., Cojocaru V.. Cooperative DNA recognition modulated by an interplay between protein-protein interactions and DNA-mediated allostery. PLoS Comput. Biol. 2015; 11:e1004287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Esch D., Vahokoski J., Groves M.R., Pogenberg V., Cojocaru V., Vom Bruch H., Han D., Drexler H.C., Arauzo-Bravo M.J., Ng C.K.et al.. A unique Oct4 interface is crucial for reprogramming to pluripotency. Nat. Cell Biol. 2013; 15:295–301. [DOI] [PubMed] [Google Scholar]
- 39. Jerabek S., Ng C.K.L., Wu G., Arauzo-Bravo M.J., Kim K., Esch D., Malik V., Chen Y., Velychko S., MacCarthy C.M.et al.. Changing POU dimerization preferences converts Oct6 into a pluripotency inducer. EMBO Rep. 2017; 18:e201642958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Case D.A., Cheatham T.E., Darden T., Gohlke H., Luo R., Merz K.M., Onufriev A., Simmerling C., Wang B., Woods R.J.. The AMBER biomolecular simulation programs. J. Comput. Chem. 2005; 26:1668–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Phillips J.C., Braun R., Wang W., Gumbart J., Tajkhorshid E., Villa E., Chipot C., Skeel R.D., Kalé L., Schulten K.. Scalable molecular dynamics with NAMD. J. Comput. Chem. 2005; 26:1781–802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Li P., Song L.F., Merz K.M.. Systematic parameterization of monovalent ions employing the nonbonded model. J. Chem. Theor. Comput. 2015; 11:1645–1657. [DOI] [PubMed] [Google Scholar]
- 43. Maier J.A., Martinez C., Kasavajhala K., Wickstrom L., Hauser K.E., Simmerling C.. ff14SB: Improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theor. Comput. 2015; 11:3696–3713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Ivani I., Dans P.D., Noy A., Pérez A., Faustino I., Hospital A., Walther J., Andrio P., Goñi R., Balaceanu A.et al.. Parmbsc1: a refined force field for DNA simulations. Nat. Methods. 2015; 13:55–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Fiorin G., Klein M.L., Hénin J.. Using collective variables to drive molecular dynamics simulations. Mol. Phys. 2013; 111:3345–3362. [Google Scholar]
- 46. Humphrey W., Dalke A., Schulten K.. Visual molecular dyanmics. J. Mol. Graph. 1996; 14:33–38. [DOI] [PubMed] [Google Scholar]
- 47. Öztürk M.A., Pachov G.V., Wade R.C., Cojocaru V.. Conformational selection and dynamic adaptation upon linker histone binding to the nucleosome. Nucleic Acids Res. 2016; 44:6599–6613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Roe D.R., Cheatham T.E.. PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theor. Comput. 2013; 9:3084–3095. [DOI] [PubMed] [Google Scholar]
- 49. Pardo M., Lang B., Yu L., Prosser H., Bradley A., Babu M.M., Choudhary J.. An expanded Oct4 interaction network: implications for stem cell biology, development, and disease. Cell Stem Cell. 2010; 6:382–395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Arimura Y., Tachiwana H., Oda T., Sato M., Kurrumizaka H.. Structural analysis of the hexasome, lacking one histone H2A/H2B dimer from the conventional nucleosome. Biochemistry. 2012; 51:3302–3309. [DOI] [PubMed] [Google Scholar]
- 51. Zhou K., Gaullier G., Luger K.. Nucleosome structure and dynamics are coming of age. Nat. Struct. Mol. Biol. 2019; 26:3–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Collepardo-Guevara R., Portella G., Vendruscolo M., Frenkel D., Schlick T., Orozco M.. Chromatin unfolding by epigenetic modifications explained by dramatic impairment of internucleosome interactions: a multiscale computational study. J. Am. Chem. Soc. 2015; 137:10205–10215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Saurabh S., Glaser M.A., Lansac Y., Maiti P.K.. Atomistic simulation of stacked nucleosome core particles: Tail bridging, the H4 Tail, and effect of hydrophobic forces. J. Phys. Chem. B. 2016; 120:3048–3060. [DOI] [PubMed] [Google Scholar]
- 54. Ishida H., Kono H.. H4 tails potentially produce the diversity in the orientation of two nucleosomes. Biophys. J. 2017; 113:978–990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Peng Y., Li S., Onufriev A., Landsman D., Panchenko A.R.. Binding of regulatory proteins to nucleosomes is modulated by dynamic histone tails. Nat. Commun. 2021; 12:5280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Morrison E.A., Baweja L., Poirier M.G., Wereszczynski J., Musselman C.A.. Nucleosome composition regulates the histone H3 tail conformational ensemble and accessibility. Nucleic Acids Res. 2021; 49:4750–4767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Eslami-Mossallam B., Schiessel H., van Noort J.. Nucleosome dynamics: sequence matters. Adv. Coll. Int. Sci. 2016; 232:101–113. [DOI] [PubMed] [Google Scholar]
- 58. Culkin J., de Bruin L., Tompitak M., Phillips R., Schiessel H.. The role of DNA sequence in nucleosome breathing. Eur. Phys. J. E. Soft Matter. 2017; 40:106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Takizawa Y., Tanaka H., Machida S., Koyama M., Maehara K., Ohkawa Y., Wade P.A., Wolf M., Kurumizaka H.. Cryo-EM structure of the nucleosome containing the ALB1 enhancer DNA sequence. Open Biol. 2018; 8:170255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Li Z., Kono H.. Distinct roles of histone H3 and H2A tails in nucleosome stability. Sci. Rep. 2016; 6:31437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Lehmann K., Felekyan S., Kühnemuth R., Dimura M., Tóth K., Seidel C. A.M., Langowski J.. Dynamics of the nucleosomal histone H3 N-terminal tail revealed by high precision single-molecule FRET. Nucleic Acids Res. 2020; 48:1551–1571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Huertas J., Cojocaru V.. Breaths, twists, and turns of atomistic nucleosomes. J. Mol. Biol. 2021; 433:166744. [DOI] [PubMed] [Google Scholar]
- 63. Grossfield A., Zuckerman D.M.. Quantifying uncertainty and sampling quality in biomolecular simulations. Ann. Rep. Comput. Chem. 2009; 1400:23–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Huertas J., Woods E.J., Collepardo-Guevara R.. Multiscale modelling of chromatin organisation: resolving nucleosomes at near-atomistic resolution inside genes. Curr. Opin. Cell Biol. 2022; 75:102067. [DOI] [PubMed] [Google Scholar]
- 65. Armeev G.A., Kniazeva A.S., Komarova G.A., Kirpichnikov M.P., Shaytan A.K.. Histone dynamics mediate DNA unwrapping and sliding in nucleosomes. Nat. Commun. 2021; 12:2387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Rauscher S., Gapsys V., Gajda M.J., Zweckstetter M., De Groot B.L., Grubmüller H.. Structural ensembles of intrinsically disordered proteins depend strongly on force field: A comparison to experiment. J. Chem. Theor. Comput. 2015; 11:5513–5524. [DOI] [PubMed] [Google Scholar]
- 67. Duong V.T., Chen Z., Thapa M.T., Luo R.. Computational studies of intrinsically disordered proteins. J. Phys. Chem. B. 2018; 122:10455–10469. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All MD simulations presented (Table 1 and Supplementary Table S1) are available for download at the following link: https://datashare.mpcdf.mpg.de/s/OeXa432AsqwbGMC. See the ‘README_Huertas_MacCarthy_et_al_2022’ file therein for details and instructions.









