Abstract
The POU family transcription factor Oct4 plays pivotal roles in regulating pluripotency and somatic cell reprogramming. Previous studies have indicated an important role for major groove contacts in Oct4–DNA recognition; however, the contributions of the RK motif in the POUh domain and the linker segment joining the two DNA-binding domains remain poorly understood. Here, by combining molecular modelling and functional assays, we find that the RK motif is essential for Oct4–DNA association by recognizing the narrowed DNA minor groove. Intriguingly, computational simulations reveal that the function of the RK motif may be finely tuned by H-bond interactions with the partially disordered linker segment and that breaking these interactions significantly enhances the DNA binding and reprogramming activities of Oct4. These findings uncover a self-regulatory mechanism for specific Oct4–DNA recognition and provide insights into the functional crosstalk at the molecular level that may illuminate mechanistic studies of the Oct protein family and possibly transcription factors in the POU family. Our gain-of-function Oct4 mutants might also be useful tools for use in reprogramming and regenerative medicine.
INTRODUCTION
Oct4 lies at the centre of the core regulatory network governing pluripotency and somatic cell reprogramming (1–3). Both the knockdown and overexpression of Oct4 in embryonic stem cells (ESCs) result in cellular differentiation, suggesting that tight regulation of Oct4 expression level is essential for maintaining pluripotency (4–6). Oct4 belongs to the POU (Pit-Oct-Unc) transcription factor family. Members of this family bind to the consensus octamer motif AGTCAAAT through bipartite DNA-binding domains (POU domain) and modulate target gene expression by both N- and C-terminal transactivation domains (7–9).
The POU domain consists of two structurally independent subdomains—the POU-specific domain (POUs) and the POU homeodomain (POUh)—connected by a linker of variable length and composition (7–8,10). Previous studies have emphasized the importance of the helix-turn-helix (HTH) elements of POUs and POUh subdomains in the recognition of specific DNA sites by Oct4 and other POU family members (11–13). The HTH motif employs a base readout mechanism that involves insertion into the major groove of DNA and forming extensive hydrogen bond (H-bond) or hydrophobic contacts with bases. Disruption of these conserved interactions greatly reduces the DNA binding affinity of POU family members (14,15).
In addition to making contact with the major groove, the N-terminal arm of the POUh subdomain—a cluster of positively charged residues designated as RK motif—has been suggested to mediate minor groove interactions by tracing along the minor groove at the octamer site (11–12,15). In most POU family proteins, the majority of RK-motif residues mediate H-bond and electrostatic interactions with bases and the DNA backbone. However, the sidechains of the Oct4 RK motif are largely disordered (with the exception of R227), and only two H-bond contacts are formed between the RK motif and DNA sequences in the Oct4–DNA complex (12). These observations suggest that the RK motif is dispensable for Oct4–DNA recognition. However, multiple sequence alignment of POU family members and Oct4 orthologues reveals that the RK motif is highly conserved, including the residues with disordered sidechains in Oct4 (Supplementary Figure S1A and B), suggesting an important function of this motif. Moreover, the Drosophila POU protein I-POU, which lacks two basic residues in the RK motif, is incapable of binding to any DNA sequence. The introduction of two basic amino acids into the RK motif restored its DNA binding properties, implicating the non-trivial role of the RK motif in the DNA binding (16,17). These controversial lines of evidence call into question the importance of the RK motif for specific DNA recognition.
In contrast to the conserved POUs and POUh subdomains, the linker segment connecting these subdomains exhibits very low sequence conservation (Supplementary Figure S1A). In most POU protein–DNA complex structures, unstructured or partially visible linker segments do not come into direct contact with the target DNA (18). Studies of Oct1–DNA binding suggest that the linker may contribute favourably to the energetics of DNA binding through chelation, thereby maintaining a high local concentration of subdomains around the DNA site and increasing their binding to DNA. The length of the linker may also regulate the configurational freedom of the two structured subdomains relative to the DNA sites, thereby influencing DNA site selection via a tethering mechanism (19). In Oct4–DNA complex structure, a short α-helix is observed in the partially disordered linker segment (residues from N206 to A222), in which six residues have a disordered sidechain and four residues are completely disordered. The functional study showed that the short α-helix in the linker segment of Oct4 could modulate its reprogramming potential by serving as a protein–protein interaction (PPI) site. Mutations at the unique interface completely abolish the reprogramming potential but have minimal effects on DNA binding (12). Despite our increased understanding of the role played by linker segments in regulating Oct4–DNA association and transactivation, little attention has been paid to the impact of the partially disordered linker segment's sequence composition on DNA binding properties and the underlying mechanisms.
Here, by employing molecular dynamic (MD) simulations and functional assays with various Oct4 mutants, we show that the RK motif and its crosstalk with the linker segment are of great importance for Oct4–DNA binding. The clustered positive charges and unique structural features observed in MD simulations may enable the RK motif to use a shape-based readout mechanism to detect the enhanced negative electrostatic potential in the minor grove of Oct4 recognition sites. Our mutagenesis data demonstrated that the DNA binding capability of Oct4 decreased progressively with the stepwise removal of positive charge from the RK motif. Furthermore, the MD simulations suggested that H-bond interactions between the disordered region in the linker segment and the RK motif may finely tune the DNA association of Oct4 and that the disruption of these interactions enhanced the DNA binding affinity, transactivation potential and somatic cell reprogramming efficiency of OCT4. Our results revealed a novel regulatory mechanism underlying Oct4–DNA recognition and underlined the relevance of the interplay between the RK motif and linker segment in somatic cell reprogramming.
MATERIALS AND METHODS
REMD and MD simulations
Sixty-four replicas spanning a temperature range of 300–390 K were used in the replica exchange molecular dynamics (REMD) simulations involving apo-Oct4 and each replica was simulated for 400 ns. Three independent 200-ns MD simulations were performed for four systems with the DNA conformation from the Oct4–H2B complex, Oct4–PORE complex and 3DNA-generated models for both H2B and PORE. All of the MD and REMD simulations were carried out using the GROMACS 4.5.3 package (20,21). The Curves+ (22,23) and Delphi programs (24–26) were used to calculate the minor groove width and electrostatic potential as previously described (27,28). The details of the computational simulations are provided in the Supporting Information (SI).
Electrophoretic mobility shift assay
The probe-protein mixtures were electrophoresed on a native polyacrylamide gel (5%). The intensity of the DNA bands was then visualized and quantified after staining.
Luciferase reporter assay
Constructs encoding HA-Oct4 (WT or mutant), firefly luciferase reporter under control of the TK promoter and 6W enhancer and the control Renilla luciferase reporter (pRL-TK) were transfected into HEK293T cells using Fugene transfection reagent (Life Technologies). Firefly and Renilla luciferase activities were measured after 24 h with the Dual-Glo Luciferase Assay System (Promega) on an EnVision multilabel plate reader (Perkin Elmer). Firefly luciferase activity was normalized to Renilla luciferase activity for each sample.
Mouse iPSC generation
Mouse iPSCs were generated using pMXs retroviral vectors encoding mouse Oct4 (WT or mutant), Sox2 and Klf4. Briefly, MEFs carrying an Oct4–GFP reporter were infected with viruses containing 3F and re-seeded at a density of 10 000 cells per well onto 24-well plates pre-seeded with feeders (SI Materials and Methods). GFP+ colonies were photographed and counted at day 16 with an Olympus microscope using Image Pro Plus software.
Statistical analysis
Values are reported as means ± SEM and were analysed using the two-tailed Student's t-test. P < 0.05 was considered statistically significant.
Additional details
The remaining experimental details are given in SI Materials and Methods.
RESULTS
The RK-motif is flexible and solvent-accessible in apo-state Oct4
Comparisons between the structures of DNA-binding proteins in the presence and absence of DNA may help to identify the key elements and mechanisms underlying dynamic protein–DNA recognition (29,30). However, structural data are currently unavailable for apo-state Oct4 and other POU family members. To gain insight into the conformational space and structural features of Oct4 in the absence of DNA, the REMD method is employed in our study, with the initial conformation from the Oct4–DNA complex (PDB entry: 3L1P) (12). As an enhanced sampling approach compared to traditional MD simulations, REMD allows systems to escape from local energy traps by exchanging replicas concurrently simulated at different temperatures based on the Metropolis criterion (31,32). The large and uniform acceptance ratios (0.25 ± 0.01; Supplementary Table S1), as well as a substantial overlap of canonical probability distributions between all neighbouring pairs of temperatures (Supplementary Figure S1C), indicated that a sufficient number of replica exchanges between pairs of replicas could be achieved (31). These data are consistent with the observed random walk of the replica within the simulated temperature space (Supplementary Figure S1D). Consequently, all evidence suggests that a sufficient number of replicas were properly distributed in the temperature space and that the REMD simulations were performed properly and effectively. To understand the structural characteristics of apo-Oct4 in solution, we categorized the conformations sampled at a physiologically relevant temperature (300 K) based on structural similarity. The relative arrangement of the two subdomains (Supplementary Figure S2A), rather than the deformation of subdomain architecture, accounted for most of the significant conformational differences between apo-Oct4 and Oct4 present in an Oct4–DNA complex (Supplementary Figure S2B). These results are in agreement with experimental observations, which suggest that a remarkable conformational change is required for Oct1–DNA association even though its two subdomains maintain rigid folding in solution (19,33). Nevertheless, the highly conserved RK-motif in the POUh subdomain exhibits recurring structural features in REMD simulation. The root-mean-square fluctuation (RMSF), which represents the average displacement of residues throughout the simulation process, reflects the fact that the RK-motif has a high flexibility similar to the inter-helical loops in each of the subdomains (Figure 1A and Supplementary Figure S2A). Moreover, solvent accessibility analysis demonstrates that the positively charged RK-motif is highly solvent-accessible in solution, a prerequisite for bio-molecular interactions (34) Figure 1B). Collectively, the REMD simulations reveal that the RK-motif, which consists solely of positively charged residues that are highly conserved across both paralogues (Supplementary Figure S1A) and orthologues (Supplementary Figure S1B) of Oct4, has a remarkable flexibility and solvent accessibility in apo-state Oct4.
Figure 1.

The structural properties of RK-motif and Oct4 recognition sites. (A) Residue fluctuations obtained by averaging residual fluctuations over the REMD simulations of apo-state Oct4. (B) The relative solvent accessibility of each residue in REMD simulations of apo-state Oct4 and the RK-motif is highlighted. (C) The minor groove width and electrostatic potential (ESP) of the H2B site in the modelled Oct4–DNA complex. (D) The minor groove width and ESP of the H2B site in an MD simulation with the initial DNA conformation from the modelled Oct4–DNA complex. The data (means ± SD) were obtained from three independent MD simulations.
Minor groove narrowing and enhanced negative electrostatic potential are intrinsic properties of Oct4 recognition sites
Recent studies emphasize the importance of the unique three-dimensional structure of DNA and local variations in its topography for protein–DNA recognition (27–28,35–37). An AT-rich motif is observed within the binding sites of most Oct4-regulated genes (38). Multiple lines of evidence have revealed that the narrow minor grooves are often associated with the presence of AT-rich motifs termed ‘A-tracts’, which consist of runs of at least three consecutive ApA, TpT or ApT base pair steps (39,40). Rohs et al. further showed that the narrow width of the minor groove substantially enhances the negative electrostatic potential around the DNA motif, which plays essential roles in the specific recognition of DNA-binding proteins (27,35–36). Hence, to explore this unique structural characteristic within the context of Oct4 recognition sites, we calculated both the minor groove widths and electrostatic potentials of two A-tract-containing motifs (H2B and PORE) occupied by Oct4 both in vitro and in vivo. (41) As illustrated in Figure 1C and Supplementary Figure S3A, minor grooves as well as enhanced negative electrostatic potentials were clearly observed around the A-tracts in both the H2B and pseudo-palindromic PORE motifs in comparison with those of the ideal B-form double-helix models generated by the 3DNA program (Supplementary Figure S3B and C) (42).
To understand the dynamic behaviour of the Oct4 recognition sites, we performed MD simulations for the H2B and PORE sequence motifs with two different initial conformations for each. One conformation was extracted from the Oct4–DNA complex, whereas the other was the corresponding 3DNA-generated model for the ideal B-form double helix. Monitoring the minor groove width and electrostatic potential along each trajectory reveals that the trends of minor groove narrowing and enhanced electrostatic potential are maintained at H2B and PORE sites, as reflected by the global minimums of these properties at the A-tracts (Figure 1D and Supplementary Figure S3D). Intriguingly, when the ideal B-form models were taken as initial conformations, we observed highly similar profiles (Supplementary Figure S3E and F) to those employing the initial configurations from the protein–DNA complex (Figure 1D and Supplementary Figure S3D). Thus, these data collectively implicate minor groove narrowing and enhanced negative electrostatic potential as intrinsic properties of Oct4 recognition sites containing A-tracts.
Mutations in the RK motif impair Oct4 DNA binding affinity, transactivation potential and reprogramming efficiency
The molecular modelling data suggested that the flexible and solvent-exposed nature of the RK-motif might facilitate interactions between its positively charged residues and the minor groove at Oct4 recognition sites that are characterized by enhanced negative electrostatic potentials, which may thus be important in Oct4–DNA association. To test this hypothesis, we compared the DNA binding ability of Oct4 variants with mutations in the RK motif with an electrophoretic mobility shift assay (43). As shown in Figure 2A, replacing any of the lysine or arginine residues (except for R223) in the RK motif with alanine significantly reduced Oct4–DNA binding. The further decreased positive charges in K224A/K226A and R225A/R227A double mutants may account for the greater reduction observed in their DNA binding capabilities. The substitution of all residues in the RK-motif with alanine (RKA) almost completely abolished Oct4–DNA binding (Figure 2A). These data highlight the importance of each residue in the RK-motif for Oct4–DNA binding, which may form contacts via the enhanced electrostatic potential of the minor groove.
Figure 2.

Mutation of the RKRKR motif disrupts the Oct4–DNA interaction. (A) WT and RKRKR motif mutant Oct4 were assessed for their ability to bind to DNA sequences containing a single octamer motif with an electrophoretic mobility shift assay. A representative gel is presented at top and a statistical analysis of the bound and free forms of DNA is presented at bottom. Data are presented as means ± SEM (n = 3). ***P < 0.001 versus control; #P < 0.05, ##P < 0.01 versus WT Oct4. (B) and (C) (Top) The transcriptional activities of WT and mutant Oct4 were measured using a luciferase reporter containing an Oct4-driven enhancer. Luciferase activity was measured 24 h after transfection and the control group was transfected with the reporter only. (Bottom) Corresponding exogenous Oct4 protein levels were measured by western blot. Data are means ± SEM (n = 3). ***P < 0.001 versus control; ###P < 0.001 versus WT Oct4. (D) (Top) The number of GFP-positive iPSC colonies generated with three factors (WT or mutant Oct4, Sox2 and Klf4) at day 16 following infection. (Bottom) Corresponding exogenous Oct4 protein levels were measured on day 3 following infection by western blot. (E) (Left) Representative images of iPSC colonies; scale bar: 5 mm. (Right) The numbers of GFP-positive iPSC colonies generated with three factors (WT or mutant Oct4, Sox2 and Klf4) at day 16 following infection. The corresponding exogenous Oct4 protein levels were measured on day 3 by western blot. Data are presented as means ± SEM (n = 3). ***P < 0.001 versus WT Oct4.
The transcriptional function of the Oct4 mutants was tested with a luciferase reporter assay using six copies of octamer-containing oligonucleotide (6W) as an enhancer (44). All point mutations in the RK-motif displayed a detrimental impact on the transactivation activity of Oct4, with R223A being the least affected and R227A the most affected (Figure 2B and C). Only basal levels of activity were observed in Oct4s carrying two or more mutations (K224A/K226A, R225A/R227A, RKA and RKE), corresponding with their reduced DNA binding capabilities. Next, we investigated these RK-motif mutants in 3F (Oct4, Sox2 and Klf4)-mediated somatic cell reprogramming. The R223A and K224A mutants were able to generate iPSCs with reduced efficiency, but the R225A, K226A and R227A single site mutants and the double or multiple site mutants (K224A/K226A, R225A/R227A, RKA and RKE) lost the ability to generate iPSCs almost completely (Figure 2D and E). The greatly diminished reprogramming efficiency of these RK-motif mutants may result from the attenuated occupancy of Oct4 at pluripotency-related target genes (Oct4, SOX2 and JARID2), as demonstrated in a chromatin immunoprecipitation (ChIP) assay (Figure 4E), consistent with their reduced DNA binding capabilities. Because comparable levels of the mutant proteins were detected in these assays, the loss-of-function phenotype should not be attributed to differential expression between the mutants. Taken together, our data indicate that all of the residues, including those invisible in the Oct4–DNA complex structure, play indispensable roles in the DNA binding, transactivation and reprogramming functions of Oct4.
Figure 4.

Interaction of the RKRKR motif and the linker determines Oct4–DNA recognition and binding. (A) The fraction of the H-bond interactions (Top) and the time series of H-bond number (Bottom) between the RK-motif and Linker segment during the REMD simulations. (B) A representative conformation showing the hydrogen-bond interactions between the RK motif and the linker region (top) and the H-bond occupancy statistics for each residue in the linker region (bottom) during REMD simulations. (C) (Top) A representative gel from the electrophoretic mobility shift assay to assess the binding of mutant Oct4 at the linker region with DNA. (Bottom) Statistical analysis of bound and free forms of DNA. Data are means ± SEM (n = 3). ***P < 0.001 versus control; #P < 0.05. (D) (Top) The transcriptional activities of Oct4 mutants at the linker region were measured using a luciferase reporter assay. Luciferase activities were measured 24 h after transfection and the control group was transfected with the reporter only. (Bottom) Corresponding exogenous Oct4 protein levels were measured using Western blot. Data are means ± SEM (n = 3). ***P < 0.001 versus control; ###P < 0.001 versus WT Oct4. (E) (Top) iPSCs were induced with three factors (various Oct4 linker mutants, Sox2 and Klf4) and the numbers of GFP-positive colonies were counted at day 16 following infection. (Bottom) Corresponding exogenous Oct4 protein levels were measured on day 3 after infection using western blot. Data are means ± SEM (n = 3). ***P < 0.001 versus WT Oct4. (F) E14 mESCs were transfected with WT, R225A or E217A mutant Oct4, and their binding at the Oct4, Sox2 and Jarid2 loci was detected by ChIP analysis with anti-Oct4 antibody. Data are shown as fold enrichment relative to the IgG ChIP results. Data are presented as means ± SEM (n = 3). *P < 0.05, **P < 0.01, ***P < 0.001 versus WT Oct4. NC, negative control (a random DNA sequence).
The impaired transcriptional activities of Oct4 RK-motif mutants are independent of nuclear localization
Previous studies have identified the RK-motif as a nuclear localization signal (NLS) for Oct4 and mutation of the RK motif results in a random distribution of Oct4 throughout the cell (45). Immunofluorescence staining revealed that single point mutations (R225A, R227A) had very limited impact on the nuclear localization of Oct4 (Figure 3A–C). However, these mutants were almost completely defective in transactivating the 6W-luciferase reporter and in generating iPSCs (Figure 3D and E). Similarly, the 30–60% reduction in nuclear localization of Oct4s carrying double or multi-site mutations (R225A/R227A, R225E/R227E, RKA and RKE) could not account for the total ablation of its transactivation and reprogramming capabilities. This finding suggests that the RK motif not only functions as an NLS but also strongly regulates the transcriptional activity of Oct4 by mediating Oct4–DNA binding. Nevertheless, to unambiguously define the role of the RK motif in modulating the transcription activity of Oct4, a known NLS was fused to the C-terminus of each Oct4 mutant to restore their nuclear localization (Figure 3A). The exogenous NLS indeed re-directed the RK-motif mutant proteins exclusively to the nuclei (Figure 3A–C), as previously observed (45). However, the restoration of nuclear localization did not rescue the abolished transactivation and reprogramming functions, and virtually no improvement in transcriptional activity was obtained for the Oct4 mutants with an exogenous NLS (Figure 3D and E). Notably, similar expression levels of the mutant Oct4 proteins were observed in these assays (Supplementary Figure S4). Therefore, our data establish that the transcriptional regulatory function of the RK-motif is independent of its nuclear localization activity.
Figure 3.

The RKRKR motif mutants of Oct4 lose transcriptional activity independent of localization. (A) Schematic diagram of the Oct4 mutants at the RKRKR motif with or without an additional nuclear localization sequence (NLS). (B) Cellular localization of Oct4 mutants with or without an additional NLS. Immunofluorescence staining with an anti-HA antibody was carried out 48 h after transfection in HEK293 cells. Hoechst stain was used to mark the nuclei. Scale bar: 10 μM. (C) Statistical analysis of the nuclear localization of various Oct4 proteins presented in (B). Data are presented as means ± SEM (n = 10). ***P < 0.001 versus WT Oct4; ###P < 0.001 versus corresponding mutants without an additional NLS at the C terminal. (D) The transcriptional activities of WT and mutant Oct4 with or without an additional NLS were measured by luciferase reporter assay. Luciferase activities were measured 24 h after transfection and the control group was transfected with the reporter only. Data are presented as means ± SEM (n = 3). ***P < 0.001 versus WT Oct4. (E) iPSCs were induced with three factors (various Oct4 mutant, Sox2 and Klf4) and the number of GFP-positive colonies was counted at day 16 following infection. Representative images of iPSC colonies are presented; scale bar: 5 mm. Data are presented as means ± SEM (n = 3). ***P < 0.001 versus WT Oct4.
Mutations in the linker segment increase DNA binding and other functions of Oct4
Recent studies have emphasized the importance of disordered regions within transcription factors in regulating DNA recognition through transient and dynamic contact with the DNA binding interface (29,46–48). Although the linker segment displays low homology among members of the POU family, it contains an unusual number of polar residues (Supplementary Figure S1A). More specifically, three negatively charged residues that are present in and conserved between orthologues of Oct4 (Supplementary Figure S1B) are partially (E208 and E212) or totally (E217) disordered within the Oct4–DNA complex (12). Because these negatively charged residues are in the vicinity of the positively charged RK-motif, we investigated their H-bond interactions in REMD simulations. At least one H-bond was identified between the linker segment and RK-motif in 80% of all sampled conformations according to H-bond occupancy analysis (Figure 4A). Most conformations were predicted to form 1, 2 or 3 H-bonds, which accounted for over 85% of snapshots with H-bonds formed (Figure 4A bottom panel). To further map the molecular determinants of H-bond contacts, we individually investigated interactions with the side chains of the RK motif for each residue in the linker segment. As shown in Figure 4B, the three negatively charged residues (E208, E212 and E217) are most frequently involved in contacts with the positively charged RK motif, mainly through inter-side chain H-bond interactions. Among them, E217, invisible in the Oct4–DNA complex structure but highly conserved within Oct4 orthologues (Supplementary Figure S1B), has the highest H-bond occupancy in the linker segment. In addition to these glutamic acid residues, T218 and Q221, which also have relatively high H-bond occupancies, participate in H-bond contacts largely via their main chains. These data suggest that H-bond interactions between linker and RK motif might shield the positive charges on the RK motif from recognizing the minor groove electrostatic potential of the Oct4 binding site. Disruption of these H-bonds may thus facilitate Oct4–DNA binding and transactivation.
To test this hypothesis, we performed DNA binding experiments with Oct4s carrying mutations in the linker segment (E208, E212 or E217). As expected, E208A and E217A enhanced protein–DNA binding, whereas E212A showed an affinity for the binding site comparable to that of WT Oct4 (Figure 4C). An additive effect was observed for E208A/E212A/E217A (3EA) triple-mutant Oct4, for which the DNA binding affinity was further increased. Correspondingly, the transactivation potential and reprogramming efficiency of Oct4 were significantly elevated in E217A and 3EA mutants (Figure 4D and E). Furthermore, the ChIP assay indicated the increased occupancy of E217A mutant Oct4 at the pluripotency genes, which may contribute to its augmented reprogramming efficiency (Figure 4F). The comparable DNA binding potencies may account for the indistinguishable transactivation and reprogramming efficiencies observed for the E212A mutant and WT Oct4. The E208A Oct4 mutant transactivated the luciferase reporter with a potency similar to that of WT protein, possibly due to its moderately increased DNA binding affinity (Figure 4C and D). Surprisingly, this mutant displayed a significantly enhanced reprogramming efficiency (Figure 4E). Although the exact mechanism of this phenomenon remains unknown, we speculate that E208 might regulate interactions with other transcription factors or transcription machinery, thereby affecting reprogramming efficiency (12). Sequencing of genomic DNA from the iPSC lines confirmed the integration of the Oct4 mutations (Supplementary Figure S5A). The ESC-like morphology and immunofluorescence staining patterns of typical pluripotency markers (Supplementary Figure S5B), as well as the formation of teratomas in vivo (Supplementary Figure S5C), validated the authenticity of the iPSCs generated via linker segment mutation of Oct4. Collectively, these mutagenesis results reveal that the linker segment may regulate Oct4–DNA recognition through functional interplay with the RK-motif and breaking the H-bond interactions between them leads to enhanced Oct4–DNA binding affinity, transactivation potential and reprogramming efficiency.
DISCUSSION
Specific protein–DNA recognition is essential to many fundamental biological processes (49–51). In addition to the widely accepted base readout mechanism involving contacts between a protein and the unique chemical signatures of DNA bases in the major groove, the shape readout mechanism by which the protein recognizes sequence-dependent deformations of the DNA helix also plays an important role in regulating target sequence selectivity (27–28,52). Recent studies have discovered that the functional specificity of DNA binding proteins mediates the dynamic and transient contacts between disordered regions of proteins and their DNA binding interfaces (29,53).
As an essential regulator of stem cell pluripotency, the functions of the POU transcriptional factor Oct4 rely on the binding of the POU domain to a variety of regulatory DNA sites (2). In contrast to the well-characterized contacts with bases in the major groove, little attention has been paid to the role of minor groove recognition by the RK motif of Oct4 and other POU family members, presumably due to the limited interactions and smaller buried surface area when compared to the major groove. In particular, the partially disordered structural features of the side chains in the RK motif of the Oct4–DNA complex raise further questions regarding its importance for specific DNA recognition (12). Furthermore, in addition to being a covalent linkage that increases the local concentration of subdomains around DNA, the regulatory functions of the linker segment and its possible crosstalk with other structural elements in Oct4–DNA association are poorly understood.
Our MD simulations of Oct4 in the apo state and its A-tract-containing recognition sites suggested that Oct4 might utilize the positively charged lysine and arginine residues in the dynamic and solvent-accessible RK motif to detect the enhanced negative potential of the minor groove. Like other proteins, especially the homodomain containing transcriptional factors, these interactions may be critical for Oct4–DNA binding.(27–28,54) Our mutagenesis and in vitro binding experiments established that each of the residues in the RK-motif is amenable to DNA recognition by Oct4. Consistent with these findings, the decreased DNA binding affinity caused by RK-motif mutation leads to the attenuated occupancy at pluripotency gene loci (Figure 4E) as well as a marked loss of both transactivation and somatic cell reprogramming activity. It should be noted that, apart from the electronegative potential, other important factors may also associated with the narrowed minor groove and thus contribute to Oct4–DNA binding. Previous studies showed that a more ordered, multi-layered hydration network was observed in the narrowed DNA minor grooves. (55,56) A larger entropy gain from the release of the ordered water would thus be achieved upon protein binding.(57) The impaired DNA binding affinity after mutation of the positively charged residues inserted into the minor grooves, such as R227 in Oct4, may also be attributed to the decreased entropy gain, in addition to the reduced electrostatic attractions. Moreover, the narrowed minor groove may create a unique hydrophobic environment due to the propeller twisting, which results in the formation of inter base pair hydrogen bonds in the major groove.(58) Although the functional importance of these factors in Oct4–DNA binding is largely unknown, further investigation would provide us an in-depth understanding of the other mechanism in shape-based readout. Nevertheless, our results corroborate the crucial role of minor groove shape-based readout by the RK-motif in Oct4–DNA recognition.
The functional importance of the RK motif prompted us to explore the intramolecular contacts that might finely tune DNA recognition by Oct4. Our H-bond occupancy analysis revealed that negatively charged residues in the adjacent linker segment form H-bond interactions with the RK motif in most of the sampled conformations. Because in most cases three H-bonds or fewer are formed, it is likely that these interactions partially shield the positive charges of the RK motif and fine tune its ability to sense the unique minor groove electrostatic potential. Consistent with this hypothesis, disruption of these H-bonds by mutagenesis increased the DNA binding affinity of the E217A mutant and 3EA triple-mutant Oct4, which may account for the observed increases in their transactivation potential and reprogramming efficiency. Previously, Esch et al. demonstrated that the N-terminal short α-helix of the linker segment containing the E208 and E212 residues regulates the reprogramming efficiency of Oct4 by acting as a PPI interface with the key epigenetic players (12). Consequently, the moderately increased DNA binding affinity and favourable interactions with other pluripotency regulators may collectively contribute to the enhanced reprogramming efficiency of the E208A mutant. In contrast, the H-bond interaction mediated by E212 is not directly involved in the regulation of DNA binding and PPI during reprogramming. Taken together, our results demonstrate that in addition to acting as a PPI interface, the linker segment delicately regulates Oct4–DNA recognition via interaction with the RK-motif. Based on this evidence, we identified gain-of-function Oct4 mutants with enhanced DNA binding affinity and a nearly three-fold improvement of reprogramming efficiency.
Based on the aforementioned results, we propose the following model for the functional interplay of the RK-motif and linker segment in Oct4–DNA binding (Figure 5). The dynamic and solvent-accessible RK-motif is essential to Oct4–DNA recognition through the perception of the enhanced electronegative potential and other important physicochemical properties, such as the specific hydration and hydrophobic environment in the narrowed minor groove. The H-bond interactions between the negatively charged glutamic acid residues in the linker segment and RK-motif may help to prevent the non-specific interactions of RK-motif with the random DNA sequences in genome. Upon associating with potential Oct4 binding sites containing A-tracts, the attractions from the enhanced minor groove electronegative potential may acts as a counterbalance to the H-bond interactions from the linker segment and help the release of the RK-motif for minor groove recognition. The similar situations were observed for other transcription factors, such as the ultrabithorax (Ubx) protein, in which the intramolecular interactions of disordered segments (I1, I2 and R region) and homeodomain constraint the non-specific interactions of homeodomain and thus increase its DNA binding specificity. (59,60) Moreover, the post-translational modifications (PTM) of the disordered region can modulate the DNA binding activities by intramolecular interactions with the binding determinants in some protein–DNA complexes. (61,62) It is thus speculated that the PTM in the linker segment of Oct4, such as the phosphorylation of serine or threonine, may affect its interplay with the RK-motif and thus the reprogramming efficiency. Correspondingly, albeit no data has been reported now, the potential PTMs or PPI with other regulatory factors of the solvent accessible RK-motif might achieve the similar effects.
Figure 5.

A model for the role of the interaction between the RK motif (yellow) and linker segment (purple) in specific DNA recognition by Oct4. The POUs subdomain is coloured green, and the POUh subdomain, except the RK-motif, is coloured cyan. The enhanced minor groove electrostatic potential at the narrowed minor groove is highlighted in red.
Given the highly conserved nature of the RK motif and the recurrent emergence of negatively charged or polar residues in the linker segment of POU proteins (Supplementary Figures S1 and S2), the proposed regulatory mechanism may be universally adopted by other POU family members. Furthermore, the distinct number of negatively charged or polar residues and their diverse distributions along the linker segment may lead to the differential regulatory outputs and DNA binding properties of each POU protein. A previous study has found that the replacement of a glutamic acid residue with lysine in the linker segment of Oct1 results in a 2.5-fold reduction in DNA binding affinity (63). Our proposed model may therefore provide a plausible explanation for the functional specificity of different POU proteins. Recent computational and NMR spectroscopy studies indicate that the sliding, global jumping and intersegment transfer mechanisms are employed by Oct1 to facilitate the search for specific targets (64–67). The cooperative actions of the two globular subdomains of Oct1 evidently improve the search efficiency. In light of our current findings, further study is warranted to systematically compare the contributions of the RK-motif and its crosstalk with the linker segment in each of the relatively macroscopic mechanisms of Oct4. The gain-of-function Oct4 mutants identified here will be invaluable tools to manipulate the Oct4-related biological events with molecular precision and help to elucidate its role in pluripotency control.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
Acknowledgments
The authors acknowledge support from Computer Network Information Center, Chinese Academy of Sciences, Tianjin and Shanghai Supercomputing Center.
FUNDING
Ministry of Science and Technology of China [2015CB910304 to H.J., 2015CB964503 to X.X.]; Hi-Tech Research and Development Program of China [2012AA020302 to C.L.]; Chinese Academy of Sciences [XDA01040301 to X.X., XDA01040305 to C.L.]; National Natural Science Foundation of China [31371511 and 81425024 to X.X., 21210003 to H.J., 21472208 to C.L.]; National Science and Technology Major Project ‘Key New Drug Creation and Manufacturing Program’ [2014ZX09507002 to C.L., 2013ZX09507-004 to H.J.] and the project of State Key Laboratory of Toxicology and Medical Countermeasures (Academy of Military Medical Science) to C.L. Funding for open access charge: Hi-Tech Research and Development Program of China [2012AA020302 to C.L.].
Conflict of interest statement. None declared.
REFERENCES
- 1.Plath K., Lowry W.E. Progress in understanding reprogramming to the induced pluripotent state. Nat. Rev. Genet. 2011;12:253–265. doi: 10.1038/nrg2955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Radzisheuskaya A., Silva J.C.R. Do all roads lead to Oct4? The emerging concepts of induced pluripotency. Trends Cell Biol. 2013;24:275–284. doi: 10.1016/j.tcb.2013.11.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Loh Y.H., Wu Q., Chew J.L., Vega V.B., Zhang W., Chen X., Bourque G., George J., Leong B., Liu J., et al. The Oct4 and Nanog transcription network regulates pluripotency in mouse embryonic stem cells. Nat. Genet. 2006;38:431–440. doi: 10.1038/ng1760. [DOI] [PubMed] [Google Scholar]
- 4.Niwa H., Miyazaki J., Smith A.G. Quantitative expression of Oct-3/4 defines differentiation, dedifferentiation or self-renewal of ES cells. Nat. Genet. 2000;24:372–376. doi: 10.1038/74199. [DOI] [PubMed] [Google Scholar]
- 5.Radzisheuskaya A., Le Bin Chia G., dos Santos R.L., Theunissen T.W., Castro L.F.C., Nichols J., Silva J.C.R. A defined Oct4 level governs cell state transitions of pluripotency entry and differentiation into all embryonic lineages. Nat. Cell Biol. 2013;15:579–590. doi: 10.1038/ncb2742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Li M., Liu G.H., Izpisua Belmonte J.C. Navigating the epigenetic landscape of pluripotent stem cells. Nat. Rev. Mol. Cell Biol. 2012;13:524–535. doi: 10.1038/nrm3393. [DOI] [PubMed] [Google Scholar]
- 7.Verrijzer C.P., Alkema M.J., van Weperen W.W., Van Leeuwen H.C., Strating M.J., van der Vliet P.C. The DNA binding specificity of the bipartite POU domain and its subdomains. EMBO J. 1992;11:4993–5003. doi: 10.1002/j.1460-2075.1992.tb05606.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Phillips K., Luisi B. The virtuoso of versatility: POU proteins that flex to fit. J. Mol. Biol. 2000;302:1023–1039. doi: 10.1006/jmbi.2000.4107. [DOI] [PubMed] [Google Scholar]
- 9.Wegner M., Drolet D.W., Rosenfeld M.G. POU-domain proteins: structure and function of developmental regulators. Curr. Opin. Cell Biol. 1993;5:488–498. doi: 10.1016/0955-0674(93)90015-i. [DOI] [PubMed] [Google Scholar]
- 10.HA I. The POU-specific domain of Pit-1 is essential for sequence-specific, high affinity DNA binding and DNA-dependent Pit-1—Pit-1 interaction. Cell. 1990;61:1021–1033. doi: 10.1016/0092-8674(90)90067-o. [DOI] [PubMed] [Google Scholar]
- 11.Klemm J.D., Rould M.A., Aurora R., Herr W., Pabo C.O. Crystal structure of the Oct-1 POU domain bound to an octamer site: DNA recognition with tethered DNA-binding modules. Cell. 1994;77:21–32. doi: 10.1016/0092-8674(94)90231-3. [DOI] [PubMed] [Google Scholar]
- 12.Esch D., Vahokoski J., Groves M.R., Pogenberg V., Cojocaru V., vom Bruch H., Han D., Drexler H.C.A., Arauzo-Bravo M.J., Ng C.K.L., et al. A unique Oct4 interface is crucial for reprogramming to pluripotency. Nat. Cell Biol. 2013;15:295–301. doi: 10.1038/ncb2680. [DOI] [PubMed] [Google Scholar]
- 13.Ruvkun G., Finney M. Regulation of transcription and cell identity by POU domain proteins. Cell. 1991;64:475–478. doi: 10.1016/0092-8674(91)90227-p. [DOI] [PubMed] [Google Scholar]
- 14.van Leeuwen H.C., Strating M.J., Cox M., Kaptein R., van der Vliet P.C. Mutation of the Oct-1 POU-specific recognition helix leads to altered DNA binding and influences enhancement of adenovirus DNA replication. Nucleic Acids Res. 1995;23:3189–3197. doi: 10.1093/nar/23.16.3189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Jacobson E.M., Li P., Leon-del-Rio A., Rosenfeld M.G., Aggarwal A.K. Structure of Pit-1 POU domain bound to DNA as a dimer: unexpected arrangement and flexibility. Genes Dev. 1997;11:198–212. doi: 10.1101/gad.11.2.198. [DOI] [PubMed] [Google Scholar]
- 16.Treacy M.N., Neilson L.I., Turner E.E., He X., Rosenfeld M.G. Twin of I-POU: a two amino acid difference in the I-POU homeodomain distinguishes an activator from an inhibitor of transcription. Cell. 1992;68:491–505. doi: 10.1016/0092-8674(92)90186-g. [DOI] [PubMed] [Google Scholar]
- 17.Treacy M.N., He X., Rosenfeld M.G. I-POU: a POU-domain protein that inhibits neuron-specific gene activation. Nature. 1991;350:577–584. doi: 10.1038/350577a0. [DOI] [PubMed] [Google Scholar]
- 18.Herr W., Cleary M.A. The POU domain: versatility in transcriptional regulation by a flexible two-in-one DNA-binding domain. Genes Dev. 1995;9:1679–1693. doi: 10.1101/gad.9.14.1679. [DOI] [PubMed] [Google Scholar]
- 19.Klemm J.D., Pabo C.O. Oct-1 POU domain–DNA interactions: cooperative binding of isolated subdomains and effects of covalent linkage. Genes Dev. 1996;10:27–36. doi: 10.1101/gad.10.1.27. [DOI] [PubMed] [Google Scholar]
- 20.Van Der Spoel D., Lindahl E., Hess B., Groenhof G., Mark A.E., Berendsen H.J. GROMACS: fast, flexible, and free. J. Comput. Chem. 2005;26:1701–1718. doi: 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]
- 21.Pronk S., Pall S., Schulz R., Larsson P., Bjelkmar P., Apostolov R., Shirts M.R., Smith J.C., Kasson P.M., van der Spoel D., et al. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics. 2013;29:845–854. doi: 10.1093/bioinformatics/btt055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Blanchet C., Pasi M., Zakrzewska K., Lavery R. CURVES+ web server for analyzing and visualizing the helical, backbone and groove parameters of nucleic acid structures. Nucleic Acids Res. 2011;39:W68–W73. doi: 10.1093/nar/gkr316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lavery R., Moakher M., Maddocks J.H., Petkeviciute D., Zakrzewska K. Conformational analysis of nucleic acids revisited: Curves+ Nucleic Acids Res. 2009;37:5917–5929. doi: 10.1093/nar/gkp608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Li L., Li C., Sarkar S., Zhang J., Witham S., Zhang Z., Wang L., Smith N., Petukh M., Alexov E. DelPhi: a comprehensive suite for DelPhi software and associated resources. BMC Biophys. 2012;5:9–19. doi: 10.1186/2046-1682-5-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Oron A., Wolfson H., Gunasekaran K., Nussinov R. Using DelPhi to compute electrostatic potentials and assess their contribution to interactions. Curr. Protoc. Bioinformatics. 2003;2:8.4.1–8.4.12. doi: 10.1002/0471250953.bi0804s02. [DOI] [PubMed] [Google Scholar]
- 26.Rocchia W., Sridharan S., Nicholls A., Alexov E., Chiabrera A., Honig B. Rapid grid-based construction of the molecular surface and the use of induced surface charge to calculate reaction field energies: applications to the molecular systems and geometric objects. J. Comput. Chem. 2002;23:128–137. doi: 10.1002/jcc.1161. [DOI] [PubMed] [Google Scholar]
- 27.Rohs R., West S.M., Sosinsky A., Liu P., Mann R.S., Honig B. The role of DNA shape in protein-DNA recognition. Nature. 2009;461:1248–1253. doi: 10.1038/nature08473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Joshi R., Passner J.M., Rohs R., Jain R., Sosinsky A., Crickmore M.A., Jacob V., Aggarwal A.K., Honig B., Mann R.S. Functional specificity of a Hox protein mediated by the recognition of minor groove structure. Cell. 2007;131:530–543. doi: 10.1016/j.cell.2007.09.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Fuxreiter M., Simon I., Bondos S. Dynamic protein-DNA recognition: beyond what can be seen. Trends Biochem. Sci. 2011;36:415–423. doi: 10.1016/j.tibs.2011.04.006. [DOI] [PubMed] [Google Scholar]
- 30.Pufall M.A., Lee G.M., Nelson M.L., Kang H.S., Velyvis A., Kay L.E., McIntosh L.P., Graves B.J. Variable control of Ets-1 DNA binding by multiple phosphates in an unstructured region. Science. 2005;309:142–145. doi: 10.1126/science.1111915. [DOI] [PubMed] [Google Scholar]
- 31.Sugita Y., Okamoto Y. Replica-exchange molecular dynamics method for protein folding. Chem. Phys. Lett. 1999;314:141–151. [Google Scholar]
- 32.Earl D.J., Deem M.W. Parallel tempering: theory, applications, and new perspectives. Phys. Chem. Chem. Phys. 2005;7:3910–3916. doi: 10.1039/b509983h. [DOI] [PubMed] [Google Scholar]
- 33.Lundback T., Chang J.F., Phillips K., Luisi B., Ladbury J.E. Characterization of sequence-specific DNA binding by the transcription factor Oct-1. Biochemistry. 2000;39:7570–7579. doi: 10.1021/bi000377h. [DOI] [PubMed] [Google Scholar]
- 34.Connolly M.L. Solvent-accessible surfaces of proteins and nucleic acids. Science. 1983;221:709–713. doi: 10.1126/science.6879170. [DOI] [PubMed] [Google Scholar]
- 35.Rohs R., West S.M., Liu P., Honig B. Nuance in the double-helix and its role in protein-DNA recognition. Curr. Opin. Struct. Biol. 2009;19:171–177. doi: 10.1016/j.sbi.2009.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bishop E.P., Rohs R., Parker S.C., West S.M., Liu P., Mann R.S., Honig B., Tullius T.D. A map of minor groove shape and electrostatic potential from hydroxyl radical cleavage patterns of DNA. ACS Chem. Biol. 2011;6:1314–1320. doi: 10.1021/cb200155t. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Chiu T.P., Yang L., Zhou T., Main B.J., Parker S.C., Nuzhdin S.V., Tullius T.D., Rohs R. GBshape: a genome browser database for DNA shape annotations. Nucleic Acids Res. 2014;43:D103–D109. doi: 10.1093/nar/gku977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Jung M., Peterson H., Chavez L., Kahlem P., Lehrach H., Vilo J., Adjaye J. A data integration approach to mapping OCT4 gene regulatory networks operative in embryonic stem cells and embryonal carcinoma cells. PLoS One. 2010;5:e10709. doi: 10.1371/journal.pone.0010709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hizver J., Rozenberg H., Frolow F., Rabinovich D., Shakked Z. DNA bending by an adenine–thymine tract and its role in gene regulation. Proc. Natl. Acad. Sci. U.S.A. 2001;98:8490–8495. doi: 10.1073/pnas.151247298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Burkhoff A.M., Tullius T.D. Structural details of an adenine tract that does not cause DNA to bend. Nature. 1988;331:455–457. doi: 10.1038/331455a0. [DOI] [PubMed] [Google Scholar]
- 41.Ferraris L., Stewart A.P., Kang J., DeSimone A.M., Gemberling M., Tantin D., Fairbrother W.G. Combinatorial binding of transcription factors in the pluripotency control regions of the genome. Genome Res. 2011;21:1055–1064. doi: 10.1101/gr.115824.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lu X.J., Olson W.K. 3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. Nucleic Acids Res. 2003;31:5108–5121. doi: 10.1093/nar/gkg680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hellman L.M., Fried M.G. Electrophoretic mobility shift assay (EMSA) for detecting protein-nucleic acid interactions. Nat. Protoc. 2007;2:1849–1861. doi: 10.1038/nprot.2007.249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Scholer H.R., Balling R., Hatzopoulos A.K., Suzuki N., Gruss P. Octamer binding proteins confer transcriptional activity in early mouse embryogenesis. EMBO J. 1989;8:2551–2557. doi: 10.1002/j.1460-2075.1989.tb08393.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Pan G., Qin B., Liu N., Scholer H.R., Pei D. Identification of a nuclear localization signal in OCT4 and generation of a dominant negative mutant by its ablation. J. Biol. Chem. 2004;279:37013–37020. doi: 10.1074/jbc.M405117200. [DOI] [PubMed] [Google Scholar]
- 46.Vuzman D., Azia A., Levy Y. Searching DNA via a ‘Monkey Bar’ mechanism: the significance of disordered tails. J. Mol. Biol. 2010;396:674–684. doi: 10.1016/j.jmb.2009.11.056. [DOI] [PubMed] [Google Scholar]
- 47.Dyson H.J. Roles of intrinsic disorder in protein-nucleic acid interactions. Mol. Biosyst. 2011;8:97–104. doi: 10.1039/c1mb05258f. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Vuzman D., Levy Y. Intrinsically disordered regions as affinity tuners in protein–DNA interactions. Mol. BioSyst. 2012;8:47–57. doi: 10.1039/c1mb05273j. [DOI] [PubMed] [Google Scholar]
- 49.Rohs R., Jin X., West S.M., Joshi R., Honig B., Mann R.S. Origins of Specificity in Protein-DNA Recognition. Annu. Rev. Biochem. 2010;79:233–269. doi: 10.1146/annurev-biochem-060408-091030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Paillard G., Lavery R. Analyzing protein-DNA recognition mechanisms. Structure. 2004;12:113–122. doi: 10.1016/j.str.2003.11.022. [DOI] [PubMed] [Google Scholar]
- 51.Garvie C.W., Wolberger C. Recognition of specific DNA sequences. Mol. Cell. 2001;8:937–946. doi: 10.1016/s1097-2765(01)00392-6. [DOI] [PubMed] [Google Scholar]
- 52.Bouvier B., Zakrzewska K., Lavery R. Protein-DNA recognition triggered by a DNA conformational switch. Angew. Chem. Int. Ed. Engl. 2011;50:6516–6518. doi: 10.1002/anie.201101417. [DOI] [PubMed] [Google Scholar]
- 53.Watson M., Stott K., Thomas J.O. Mapping intramolecular interactions between domains in HMGB1 using a tail-truncation approach. J. Mol. Biol. 2007;374:1286–1297. doi: 10.1016/j.jmb.2007.09.075. [DOI] [PubMed] [Google Scholar]
- 54.West S.M., Rohs R., Mann R.S., Honig B. Electrostatic interactions between arginines and the minor groove in the nucleosome. J. Biomol. Struct. Dyn. 2010;27:861–866. doi: 10.1080/07391102.2010.10508587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Drew H.R., Dickerson R.E. Structure of a B-DNA dodecamer. III. Geometry of hydration. J. Mol. Biol. 1981;151:535–556. doi: 10.1016/0022-2836(81)90009-7. [DOI] [PubMed] [Google Scholar]
- 56.Privalov P.L., Dragan A.I., Crane-Robinson C., Breslauer K.J., Remeta D.P., Minetti C.A.S.A. What drives proteins into the major or minor grooves of DNA? J. Mol. Biol. 2007;365:1–9. doi: 10.1016/j.jmb.2006.09.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Dunitz J.D. The entropic cost of bound water in crystals and biomolecules. Science. 1994;264:670–670. doi: 10.1126/science.264.5159.670. [DOI] [PubMed] [Google Scholar]
- 58.Yoon C., Prive G.G., Goodsell D.S., Dickerson R.E. Structure of an alternating-B DNA helix and its relationship to A-tract DNA. Proc. Natl. Acad. Sci. U.S.A. 1988;85:6332–6336. doi: 10.1073/pnas.85.17.6332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Liu Y., Matthews K.S., Bondos S.E. Multiple intrinsically disordered sequences alter DNA binding by the homeodomain of the Drosophila hox protein ultrabithorax. J. Biol. Chem. 2008;283:20874–20887. doi: 10.1074/jbc.M800375200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Liu Y., Matthews K.S., Bondos S.E. Internal regulatory interactions determine DNA binding specificity by a Hox transcription factor. J. Mol. Biol. 2009;390:760–774. doi: 10.1016/j.jmb.2009.05.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Stott K., Watson M., Howe F.S., Grossmann J.G., Thomas J.O. Tail-mediated collapse of HMGB1 is dynamic and occurs via differential binding of the acidic tail to the A and B domains. J. Mol. Biol. 2010;403:706–722. doi: 10.1016/j.jmb.2010.07.045. [DOI] [PubMed] [Google Scholar]
- 62.Tsunaka Y., Toga J., Yamaguchi H., Tate S., Hirose S., Morikawa K. Phosphorylated intrinsically disordered region of FACT masks its nucleosomal DNA binding elements. J. Biol. Chem. 2009;284:24610–24621. doi: 10.1074/jbc.M109.001958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.van Leeuwen H.C., Strating M.J., Rensen M., de Laat W., van der Vliet P.C. Linker length and composition influence the flexibility of Oct-1 DNA binding. EMBO J. 1997;16:2043–2053. doi: 10.1093/emboj/16.8.2043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Takayama Y. Intra- and intermolecular translocation of the bi-domain transcription factor Oct1 characterized by liquid crystal and paramagnetic NMR. Proc. Natl. Acad. Sci. U.S.A. 2011;108:169–176. doi: 10.1073/pnas.1100050108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Doucleff M., Clore G.M. Global jumping and domain-specific intersegment transfer between DNA cognate sites of the multidomain transcription factor Oct-1. Proc. Natl. Acad. Sci. U.S.A. 2008;105:13871–13876. doi: 10.1073/pnas.0805050105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Takayama Y., Clore G.M. Interplay between minor and major groove-binding transcription factors Sox2 and Oct1 in translocation on DNA studied by paramagnetic and diamagnetic NMR. J. Biol. Chem. 2012;287:14349–14363. doi: 10.1074/jbc.M112.352864. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Takayama Y., Clore G.M. Impact of protein/protein interactions on global intermolecular translocation rates of the transcription factors Sox2 and Oct1 between DNA cognate sites analyzed by z-exchange NMR spectroscopy. J. Biol. Chem. 2012;287:26962–26970. doi: 10.1074/jbc.M112.382960. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
