Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2022 Nov 2;298(12):102670. doi: 10.1016/j.jbc.2022.102670

Mechanistic insights into poly(C)-binding protein hnRNP K resolving i-motif DNA secondary structures

Wen-Qiang Wu 1,∗,, Xin Zhang 1,, Di Bai 1, Song-Wang Shan 1, Li-Jun Guo 1,
PMCID: PMC9709238  PMID: 36334628

Abstract

I-motifs are four-strand noncanonical secondary structures formed by cytosine (C)-rich sequences in living cells. The structural dynamics of i-motifs play essential roles in many cellular processes, such as telomerase inhibition, DNA replication, and transcriptional regulation. In cells, the structural dynamics of the i-motif can be modulated by the interaction of poly(C)-binding proteins (PCBPs), and the interaction is closely related to human health, through modulating the transcription of oncogenes and telomere stability. Therefore, the mechanisms of how PCBPs interact with i-motif structures are fundamentally important. However, the underlying mechanisms remain elusive. I-motif structures in the promoter of the c-MYC oncogene can be unfolded by heterogeneous nuclear ribonucleoprotein K (hnRNP K), a PCBP, to activate its transcription. Here, we selected this system as an example to comprehensively study the unfolding mechanisms. We found that the promoter sequence containing 5 C-runs preferred folding into type-1245 to type-1234 i-motif structures based on their folding stability, which was further confirmed by single-molecule FRET. In addition, we first revealed that the c-MYC i-motif structure was discretely resolved by hnRNP K through two intermediate states, which were assigned to the opposite hairpin and neighboring hairpin, as further confirmed by site mutations. Furthermore, we found all three KH (hnRNP K homology) domains of hnRNP K could unfold the c-MYC i-motif structure, and KH2 and KH3 were more active than KH1. In conclusion, this study may deepen our understanding of the interactions between i-motifs and PCBPs and may be helpful for drug development.

Keywords: hnRNP K, i-motif, smFRET, c-MYC, oncogene, cancer

Abbreviations: hnRNP K, heterogeneous nuclear ribonucleoprotein K; PCBP, poly(C)-binding protein; smFRET, single-molecule FRET; TDP, transition density plot


In addition to duplex type, DNA can adopt some noncanonical structures, such as four-stand structure G-quadruplexes (1) and i-motifs (2). Among these two structures, G-quadruplexes that form within guanine (G)-rich sequences have been well studied from many different perspectives (1, 3, 4). However, their complementary i-motif structures are comparatively less well studied. I-motif structures are formed from cytosine (C)-rich DNA sequences at acidic pH, held together by hemi-protonated C and neutral C base pairs (C:C+) (2). Moreover, at neutral pH, molecular crowding can facilitate the formation of i-motif structures, which is in keeping with the intracellular environment (5). In addition, the existence of i-motif DNA structures was revealed using in-cell NMR spectroscopy (6) and confirmed by an antibody fragment (7) in cells. The structural dynamics of these motifs play key roles in many essential processes, such as telomerase inhibition, DNA replication, and transcriptional regulation (2).

The nuclease hypersensitive element III1 is located in the upstream of the oncogene c-MYC P1 promoter and modulates most c-MYC transcription. It contains 27-bp G/C-rich DNA sequences and can form G-quadruplexes and i-motifs in vivo (8, 9). In recent years, G-rich strands that form G-quadruplexes (known as Pu27) have been well studied (9). Although the complementary C-rich strands (referred to as Py27) that form i-motifs can also modulate c-MYC expression (8), they are comparatively less well studied. The core of Py27 (termed Py25 in this study) contains 5 runs of cytosines: C-tracts 1, 2, 3, 4, and 5 (Fig. 1A). Whether type-1245 (10) or type-1234 (11) is the major structure is the subject of controversy (Fig. 1B).

Figure 1.

Figure 1

Py25 and its modifications can fold into i-motif structures.A, the promoter sequences of NHE III1 and its modifications. Py25 indicates the 25-nt core of the C-rich NHE III1 promoter containing 5 runs of cytosines, as marked in red. Py25(1245) and Py25(1234) contain four runs of cytosines, which are indicated in the parentheses. B, a schematic representation of the C:C+ base pair (top) and the topological structures of type-1245 and type-1234 (bottom). C, CD spectra of Py25, Py25(1245), and Py25(1234) at different pH levels from 5.2 to 8.0, including 200 mM K+, 40% PEG200 (w/v), and 4 mM Trolox. Under pH 5.8, all three substrates could fold into i-motif structures, showing a positive peak at ∼287 nm and a negative peak at ∼262 nm. NHE, nuclear hypertensive element.

The structures of i-motifs are dynamic in living cells (7), and their structural dynamics regulate biological processes (2). Although the structural dynamics of i-motifs induced by environmental conditions such as pH and PEG have been studied using both bulk and single-molecule methods (12, 13, 14, 15), their dynamics in cells are mainly modulated by i-motif specialized proteins (2). Considering that i-motifs can exist stably in cells (6, 7), the unwinding of these structures is particularly critical. However, at this stage, little is known about their unfolding pathways facilitated by biologically specialized binding proteins, such as hnRNP LL (16), hnRNP A1 (17), BmILF (18), and poly(C) binding proteins (PCBPs) (19). Among the various proteins, heterogeneous nuclear ribonucleoprotein K (hnRNP K), a PCBP, stands out for its extensive cellular activity. This protein can activate the transcription of oncogenes though unfolding their promoter i-motifs, such as KRAS (20) and c-MYC (21, 22), and may modulate telomere stability by binding telomeric i-motifs (23, 24). However, the molecular mechanism of how hnRNP K unfolds i-motif structures remains elusive due to the limitations of research methods.

hnRNP K contains three KH (hnRNP K homology) domains (Fig. S1A) that mediate the resolution of i-motif structures (25, 26, 27). Previous structural data have shown that KH3 binds specifically to TCCC or CCCC ssDNA (25, 26). Biochemical assays have indicated that the KH1–KH2 domain can also bind poly(C) sequences (27). However, it remains unclear whether all three of these motifs can disrupt i-motif structures and which motif exhibits the most activity. Studying the interaction of hnRNP K with i-motifs will not only provide helpful data for understanding this biological process and designing drugs but also be useful for predicting the molecular mechanisms of KH domain containing PCBPs (19, 28).

In this study, hnRNP K was selected as an example to dissect the molecular mechanism of PCBP-unfolding i-motif structures. First, CD spectra were used to detect the folding of Py25, Py25(1245), and Py25(1234) under different pH conditions. Melting temperature comparison suggested that the type-1245 i-motif should be the major component of Py25, which was further confirmed by single-molecule FRET (smFRET). Then, it was confirmed that hnRNP K could resolve the Py25 i-motif using an ensemble FRET method, and the folding intermediates and pathways were determined using smFRET. Finally, it was revealed that all three of the KH domains were able to unfold Py25 i-motif structure, and KH2 was the most active. This study sheds light on the interactions between PCBPs and i-motif structures.

Results

Type-1245 is the main i-motif structure of Py25

Structure determines function, so it is fundamentally important to determine the folding structure of Py25 to reveal its function. The G-rich strand of the c-MYC promoter that forms G-quadruplexes has been well studied, and type-2345 and type-1245 c-MYC G-quadruplexes are two major structures (29, 30). Therefore, complementary type-1234 and type-1245 i-motifs were proposed to be the major folding structures; however, whether type-1245 or type-1234 is the main structure remains controversial (10, 11). To determine whether type-1245 or type-1234 was the major folding structure (Fig. 1B), the unrelated C in Py25 was mutated to T to rule out distractions, and they were referred to as Py25(1245) and Py25(1234), respectively (Fig. 1A). Their folding was determined at different pH levels (Fig. 1C), as pH was the main factor affecting the stability of i-motifs (15). The CD spectra of Py25, Py25(1245), and Py25(1234) showed that at pH ≥ 6.0, there were no obvious i-motif structures, while when at pH ≤ 5.8, a significant i-motif signal was detected, showing a positive peak at ∼287 nm and a negative peak at ∼262 nm (5). These results suggested that both type-1245 and type-1234 might form i-motif structures at pH values below 5.8.

To determine whether type-1245 or type-1234 was the main structure, this study first measured their melting temperature (Tm) values using the CD spectra at pH 5.2 and pH 5.8 (Fig. 2). The comparison of Tm values demonstrated that type-1245 was more thermally stable than type-1234, because smaller Tm values reflected more unstable structures. Based on these results, the type-1245 i-motif should be the major component of Py25. This result was inconsistent with a previous report, in which the Tm value of type-1234 was greater than that of type-1245 (11). The difference between these two studies was that the present study replaced the fifth C-tract of Py25(1234) with T, while the previous study removed the fifth C-tract entirely. The proximal ssDNA of the Py25(1234) used in the present study may have reduced its stability, as in the case of G-quadruplexes (31, 32). The substrate Py25(1234) used in the present study did not change the overall length of Py25; thus, our findings are likely more biologically relevant.

Figure 2.

Figure 2

CD melting of Py25(1245) and Py25(1234) at pH 5.2 and pH 5.8.AC, the CD spectra of Py25(1245) at 20 to 98 °C at pH 5.2 (A) and pH 5.8 (B). Melting temperature (Tm) values are calculated from the signal at 288 nm, resulting in the Tm values of 58.7 ± 0.4 °C for pH 5.2 and 47.6 ± 0.3 °C for pH 5.8. DF, the CD spectra of Py25(1234) at 20 to 98 °C at pH 5.2 (D) and pH 5.8 (E). Tm values were calculated from the signal at 288 nm, resulting in the Tm values of 55.2 ± 0.3 °C for pH 5.2 and 44.8 ± 0.3 °C for pH 5.8.

To further confirm the above results, smFRET, which can be used to directly determine the folding states and intermediates of DNA secondary structures through FRET efficiency (14, 32, 33, 34), was introduced. An i-motif–containing ssDNA was modified by a Cy3 at the 5′ end, and its 3′ end was hybridized with a complementary stem strand labeled by biotin at the 5′ end and a Cy5 at the fifth nucleotide from the 3′ end. This donor (Cy3) and acceptor (Cy5) spacing can sensitively reveal the conformational change in i-motif structures via the FRET signal (14, 32, 33, 34). Py25, Py25(1245), and Py25(1234) containing smFRET substrates were referred to as Py25′, Py25(1245)′, and Py25(1234)′, respectively (Fig. 3A). The biotin was used to immobilize the substrates on PEG-modified surface via streptavidin. With the decrease in pH, i-motif structures gradually folded (Fig. 1C), leading to the incremental increase of FRET efficiency (Fig. 3B) (13). Therefore, a FRET value of about 0.3 to 0.4 should correspond to the ssDNA state, and values of ∼0.8 and ∼0.9 should correspond to well-folded i-motif structures, consistent with a recent report (13). Based on the principle of FRET, the further the distance is, the lower the FRET value will be. Comparing the structures of Py25(1234)′ with that of Py25(1245)′ (Fig. 3A), the i-motif structure in Py25(1234)′ was separated from dsDNA with ssDNA; thus, the distance between Cy3 and Cy5 in Py25(1234)′ is greater than that in Py25(1245)′, leading to a lower FRET value. Therefore, it was easy to conclude that Py25 was folded into the structure of type-1245 instead of type-1234 based on the FRET values of the folded states. In addition, the corresponding smFRET substrates Py25(1235)′, Py25(1345)′, and Py25(2345)′ were designed (Fig. S2B) to check the potential for type-1235, type-1345, and type-2345 folding (Fig. S2A), and their FRET distributions under different pH from 5.2 to 8.0 are shown in Fig. S2C. Based on the principle of FRET, it is expected that the FRET values of folded Py25(1235)′ and Py25(1345)′ will be similar to that of Py25′, and the FRET values of folded Py25(2345)′ will be lower than that of Py25′ because of the extra ssDNA tail. The experimental FRET values were in agreement with this expectation. From the folding fractions, it was clear that Py25(1245)′ folding was significantly better than that of other mutants (Fig. S2D). In addition, like Py25(1234)′, Py25(2345)′ was ruled out again based on its lower FRET values than that of Py25′ (Figs. 3B and S2C). Therefore, we concluded that type-1245 is the most favored structure.

Figure 3.

Figure 3

The folding of Py25′, Py25(1245), and Py25(1234)at different pH levels from 5.2 to 8.0 using single-molecule fluorescence resonance energy transfer.A, schematic diagram of Py25′, Py25(1245)′, and Py25(1234)′. B, FRET distributions of Py25′, Py25(1245)′, and Py25(1234)′ at different pH levels. The FRET values less than 0.4 should be ssDNA, and the FRET values more than 0.8 should be unfolded i-motif DNA, according to the CD spectra in Fig. 1C.

Dynamics of hnPNP K unfolding Py25

After determining the major structure of Py25, this study sought to determine how proteins in cells unfolded this structure. PCBPs have been shown to play essential roles in i-motif dynamics. Among them, hnRNP K is involved in c-MYC i-motif unfolding through binding to the C-rich strand (21). However, the unfolding dynamics have remained elusive. This study first expressed and purified hnRNP K to homogeneity (Fig. S1B). Then, the binding of hnRNP K to Py25′ was assessed using PAGE bandshift assays (Fig. S3). The apparent DNA-binding affinity value was about 220 nM. At 4 μM, almost all Py25′ was bound by hnRNP K. Therefore, concentrations below 4 μM were selected for subsequent analyses. To determine whether hnRNP K could unfold the c-MYC i-motif in this system at pH 5.8, Cy3 dyes were excited using a laser at 532 nm, and the bulk fluorescence spectra of Py25′ were monitored after adding different concentrations of hnRNP K from 4 nM to 4 μM (Fig. 4A). The emission wavelengths of ∼670 nm and ∼570 nm represented Cy5 and Cy3, respectively. It was clear that at each concentration of hnRNP K, the fluorescence spectra shifted gradually over time from ∼670 nm to ∼570 nm, resulting in the decrease of the FRET efficiency and indicating that the Py25 i-motif structure was unfolded by hnRNP K. The increase in Cy3 intensity and decrease in Cy5 intensity confirmed that the FRET efficiency decrease resulted from i-motif structure unfolding and ruled out quenching or some other mechanisms. At a certain time, such as 10 min (Fig. 4B), the unwinding ratio gradually increased as the hnRNP K concentration increased. Furthermore, the emissions of Cy3 at different concentrations of hnRNP K were fitted using single-exponential decays to quantify the unwinding parameters (Fig. 4C). It was found that the higher the concentration, the faster the unwinding (Fig. 4C and Table S2).

Figure 4.

Figure 4

Dynamics of hnPNP K unfolding Py25′ i-motif DNA using the bulk fluorescence resonance energy transfer method at pH 5.8.A, serial fluorescence emission spectra of Py25′ with increasing hnRNP K concentrations from 4 nM to 4 μM at different times from 0 to 10 min. A laser at 532 nm was used to excite Cy3. B, fluorescence emission spectra of Py25′ with increasing hnRNP K concentrations from 4 nM to 4 μM at 10 min. The arrows at 570 nm and 670 nm represent the evolution of Cy3 and Cy5, respectively. C, single-exponential decays were used to fit the emission of Cy3 in (B), resulting in the unwinding parameters shown in Table S2. hnRNP K, heterogeneous nuclear ribonucleoprotein K.

hnRNP K unfolds Py25 i-motif structure with four discrete states

To further dissect the intermediates and dynamics of hnRNP K unfolding Py25 i-motif structure, the smFRET method was applied to monitor the structural changes of Py25′ in real time at varied concentrations of hnRNP K under pH 5.8 (Fig. 5A). With the addition of 1 nM–1 μM hnRNP K, the FRET efficiencies gradually shifted towards low FRET values over time (Fig. 5B), indicating that the i-motif structure was resolved. The FRET histograms at 10 min were well fitted using a multipeak Gaussian distribution (Fig. 5C), which is widely used to recognize dynamic states at the single molecule level (14, 32, 33, 34), resulting in 2 to 4 peaks. At 1 nM hnRNP K, two FRET value peaks of ∼0.9 and ∼0.7 were detected. At 5 to 50 nM hnRNP K, as many as four peaks of FRET values at ∼0.9, ∼0.7, ∼0.6, and ∼0.4 were captured. At 200 nM–1 μM hnRNP K, there were only three states, and the peak at ∼0.7 disappeared. The FRET peaks of ∼0.9 and ∼0.4 at pH 5.8 were assigned to complete unfolded (UF) and folded (F) states (Figs. 1C and 3B). Therefore, the two middle states should be the folding intermediates (referred to as I1 and I2 in Fig. 5C). Furthermore, transition density plots (TDPs) were constructed to directly show the state transitions between these states (Fig. 5D). Their representative traces are shown in Fig. S5A, and their proportions are shown in Figure 5E. As the protein concentration increased, the fully folded proportion gradually decreased and the unfolded proportion gradually increased.

Figure 5.

Figure 5

hnRNP K unfolding Py25′ i-motif DNA discretely revealed by single-molecule fluorescence resonance energy transfer.A, schematic diagram of hnRNP K and Py25′ i-motif DNA. B, smFRET histograms obtained by adding 1 nM–1 μM hnRNP K at a series time from 0 to 10 min at pH 5.8. C, multipeak Gaussian distributions were used to fit the smFRET histograms at 10 min. The peak values are shown in the corresponding figures. D, transition density plots (TDPs) were used to show the state transitions of hnRNP K unfolding Py25′ at 10 min and certain concentrations. E, the fractions of the different folding structures at increasing concentrations of hnRNP K. hnRNP K, heterogeneous nuclear ribonucleoprotein K; smFRET, single-molecule fluorescence resonance energy transfer.

To further confirm whether Py25′ and Py25(1245)′ folded into the same i-motif structure, this study investigated the interaction between hnRNP K and Py25(1245)′ under the same experimental conditions (Fig. S4A). As expected, the phenomena, states, and distributions were similar to those of hnRNP K unfolding Py25′ (Figs. S4 and S5B). One small difference was that hnRNP K resolved Py25′ more efficiently than it unwound Py25(1245)′ (Figs. 5E and S4E). This difference may have been caused by the third C-tract; hnRNP K may bind to this CCCC tract (25, 26) to accelerate the disruption of the i-motif structure.

hnRNP K resolves i-motif DNA with opposite and neighboring hairpins as intermediate states

Next, this study sought to determine which structures the intermediate states corresponded to. Based on the aggregation of i-motifs (Fig. 1B), it could be concluded that the intermediate states should represent the disruption of partial hydrogen bonds. The KH domain can stably bind one C-column (25, 26, 28). Therefore, it was hypothesized that the intermediates should be different kinds of C-hairpins. The i-motif structure consisted of two opposite hairpin structures (Fig. 1B). At a low hnRNP K concentration (1 nM), only one more state ∼0.7 appeared (Fig. 5, B and C, top panel). It was speculated that the ∼0.7 state should be an opposite hairpin, resulting from the binding of hnRNP K to one CCC column of the other opposite hairpin and disrupting the hydrogen bonds. Because one column was stripped, the distance between the two ends of this unfolded i-motif structure was predicted to be greater than in the well-folded state, leading to a lower FRET efficiency of Cy3 and Cy5. To confirm this hypothesis, one CCC column was replaced by TTT to mimic this intermediate structure, termed Py25(245)′, which can fold into one opposite hairpin at most (Fig. 6A). When the pH value was less than 5.8 (Fig. 6B), the FRET distributions consisted of three Gaussian peaks at ∼0.4, ∼0.6, and ∼0.7 (Fig. 6, C and D, left panel), in agreement with the state transitions shown in the corresponding TDPs (Fig. 6, C and D, right panel) and representative traces (Figs S6, A and B). This finding supported the possibility that the ∼0.7 state was an opposite hairpin, and the missing FRET ∼0.9 state further supported that FRET ∼0.9 represented well-folded i-motif structures (Figs. 3B and 5C).

Figure 6.

Figure 6

Formation of opposite C-hairpins at pH 5.2 and pH 5.8.A, schematic view of Py25(245)′. B, FRET distributions of Py25(245)′ at different pH values. C and D, single-molecule FRET (smFRET) histograms of Py25(245)′ at pH 5.2 (C) and pH 5.8 (D) were fitted by multipeak Gaussian distributions (right panel), and three peaks were detected for each histogram. TDPs were used to show their state transitions (left panel). TDP, transition density plot.

Further, this study aimed to determine what the ∼0.6 state was assigned to. This state was found in both hnRNP K unfolding Py25′ (Fig. 5) and the dynamics of Py25(245)′ (Fig. 6). In addition to opposite hairpins, poly(C) runs may also fold into neighboring hairpins, which may form when one column of the remaining opposite hairpin is further trapped by a KH domain. It was speculated that the ∼0.6 state should consist of neighboring hairpins. To verify this speculation, this study further mutated one column of the second opposite hairpin (Fig. 6A) and obtained two substrates that were termed Py25(24)′ and Py25(45)′ (Fig. 7A). Because one more column was stripped, the two ends of the neighboring-hairpin state should be further apart than observed in the opposite-hairpin state (Fig. 7A versus Fig. 6A), resulting in a lower FRET value. As expected, at pH 5.8, both of these substrates could fold into two states at ∼0.4 and ∼0.6 (Fig. 7B), consistent with the state transitions shown in the corresponding TDPs (Fig. 7C) and representative traces (Fig. S6, C and D). Because Py25(24)′ and Py25(45)′ can only fold into neighboring hairpins, this result confirmed the state of FRET ∼0.6 was a neighboring hairpin, and the missing ∼0.7 state was further demonstrated to be an opposite hairpin. As the hnRNP K concentration increased, the two opposite hairpins of the i-motif were gradually resolved, and the neighboring hairpins had an opportunity to form, resulting in a gradual decrease in the proportion of the ∼0.7 state (I2) and a gradual increase in the proportion of the 0.6 state (I1) (Fig. 5, CE). Therefore, this state assignment was reasonable. In addition, the state transitions between opposite hairpins and neighboring hairpins were captured directly from TDPs (Figs. 5D, 6C and D). The intermediate states were also directly captured in Py25′ at pH 6.1 (Fig. S7). Therefore, i-motif DNA-unfolding pathways involving opposite and neighboring hairpins were proposed (Fig. 8).

Figure 7.

Figure 7

Formation of neighboring C-hairpins at pH 5.8.A, schematic diagram of Py25(24)′ (top panel) and Py25(45)′ (bottom panel). B, FRET distributions of Py25(24)′ (top panel) and Py25(45)′ (bottom panel). Multiple-peak Gaussian distributions were used to fit their histograms. C, transition density plots (TDPs) of Py25(24)′ (top panel) and Py25(45)′ (bottom panel).

Figure 8.

Figure 8

Proposed pathways of hnRNP K unfolding i-motif DNA. The resolving processes include well-folded i-motif structures, opposite hairpins, neighboring hairpins, and unfolded ssDNA, corresponding to the FRET values of ∼0.9, ∼0.7, ∼0.6, and ∼0.4, respectively. Their state transitions are indicated by two-way arrows. hnRNP K, heterogeneous nuclear ribonucleoprotein K.

In previous studies, one (12, 13, 15) or four (14) folding-unfolding intermediate states were detected during i-motif thermodynamic fluctuations. However, the findings of the present study clearly demonstrated that there were two intermediates during the hnRNP K unfolding of the c-MYC i-motif structure. In addition, the intermediates were assigned to the specific opposite and neighboring hairpins. Therefore, there should be a difference between thermodynamic fluctuations and protein-assisted unfolding.

All three KH domains are involved in the i-motif unfolding

hnRNP K contains three conserved KH domains (Fig. S1A) that play a key role in i-motif unfolding. However, whether all three KH domains have unfolding abilities and which KH domain is the most active remain unclear. To answer these questions, each KH domain was expressed and purified (Fig. S1C), and 1 μM of each KH domain was separately incubated with Py25′ (Fig. 9A). It was clear that the proportion of the ∼0.9 peaks gradually decreased over time, indicating that all three KH domains were able to unfold the Py25′ i-motif structure (Fig. 9B). In addition, their FRET histograms were also well fitted to four states using a multipeak Gaussian distribution (Fig. 9C). The FRET values were close to the above fitting, further indicating that the proposed model was reasonable (Fig. 8). To compare the activity levels of the three KH domains, the unwinding fractions of the Py25′ i-motifs calculated based on the peak areas of FRET ∼0.9 at certain times were fitted using single-exponential decays (Fig. 9D) (33, 35). Their unwinding parameters are shown in Table S3. Based on the comparison of their unwinding fractions and rates, it was concluded the activity of KH2 was slightly greater than that of KH3, with an ∼1.1-fold increase in unfolding amplitude and rate, while both were significantly more active than KH1, based on the ∼4-fold and ∼2-fold increases in amplitude and unwinding rate, respectively. In addition, the activity of any single KH motif was less than that of the full-length hnRNP K under the same experimental conditions (Fig. 5B, bottom panel); for example, KH1 and KH2 activity levels were approximately half that of the full length hnRNP K in terms of unwinding amplitude and rate. From another perspective, at 10 min, the interaction between each KH motif and the Py25′ i-motif could be fitted to four peaks (Fig. 9C), while there were only three peaks for hnRNP K (Fig. 5C, bottom panel), where the ∼0.7 peak corresponding to the opposite hairpin disappeared, further indicating that hnRNP K was more active.

Figure 9.

Figure 9

All three KH domains can unfold i-motif DNA.A, schematic diagram of KH1-3 and Py25′ i-motif DNA. B, single-molecule FRET (smFRET) histograms obtained from adding 1 μM KH1 (top panel), KH2 (middle panel), and KH3 (bottom panel) at series time at pH 5.8. C, multipeak Gaussian distributions were used to fit the smFRET histograms at 10 min, showing four peaks. The peak values are inserted in the corresponding figures. D, resolving fractions of Py25′ i-motif DNA at different times after the addition of 1 μM proteins. Single-exponential decays were used to fit the resolving fractions. The parameters are shown in Table S3.

Discussion

The structural dynamics of i-motifs play a regulatory role in numerous life processes. Therefore, studying the molecular mechanisms of how proteins unfold i-motif structures is fundamentally essential. However, this process is not fully understood. This work selected the interaction between hnRNP K and c-MYC i-motif as an example to reveal the molecular dynamic details of how PCBPs unfold i-motif structures at the single-molecule level and proposed the unfolding pathway. In addition, the i-motif–unwinding abilities of all three KH domains of hnRNP K were explored. The results of this study should be useful in understanding the cellular mechanisms of i-motif dynamics and aid in drug design.

After verifying the formation of i-motif structures by CD (Fig. 1), the type-1245 structure folded from Py25(1245) was proposed to be the major contributor to Py25 sequences based on Tm comparison (Fig. 2). This proposal was further confirmed by the consistency of FRET values at the single-molecule level (Fig. 3). In addition, hnRNP K resolving Py25′ and Py25(1245)′ exhibited similar phenomena, further supporting the possibility that they have similar structures (Figs. 5 and S4). Therefore, it was concluded that Type-1245 was the main i-motif structure of Py25. This series of methods may be used to determine the major folded structures of other substrates.

The ability of hnRNP K to unfold the c-MYC i-motif structure was confirmed by a bulk FRET assay (Fig. 4). Using the smFRET technique, this work first reported that hnRNP K resolved i-motif DNA discretely with opposite hairpins and neighboring hairpins as intermediate states (Fig. 5), which was further verified by substrate mutations (Figs. 6 and 7), and the unfolding pathway was proposed (Fig. 8). In addition, individual KH domain unwinding experiments further supported this pathway (Fig. 9C). These details not only deepen the understanding of how PCBPs unfold i-motif structures but may also provide a reference for how other proteins resolve i-motif DNA. The step-by-step process of hnRNP K resolving the c-MYC i-motif may provide more regulatory dimensions in cellular processes and may be useful in drug design. Interestingly, G-quadruplexes are also reported to contain two kinds of intermediates during unfolding, which are G-triplex and G-hairpin (36, 37). Recently, research based on structural biology revealed the existence of the C-triplex (38), which could not be ruled out from the opposite hairpin in the present experiments because of their similar FRET values. Whether C-triplex is an intermediate state for i-motif unwinding deserves further study.

Finally, the present study analyzed the i-motif–unfolding ability of all three KH domains (Fig. 9). It was found that all three KH domains could resolve Py25′ i-motifs with activity levels that ranked as follows: KH2 ≥ KH3 > KH1 (Fig. 9B). However, the KH domains were less active than full-length hnRNP K (Fig. 9D). It has been reported that only KH3 of hnRNP K can bind poly(C) RNA, while KH1 and KH2 cannot (39). Therefore, our future work will focus on revealing the differences between hnRNP K unfolding i-motif DNA and RNA at the single-molecule level.

Experimental procedures

Buffers

All of the reaction buffers contained 50 mM PBS at the corresponding pH, 200 mM K+ (final concentration, supplemented with KCl), 40% PEG200 (w/v), and 4 mM Trolox. For single-molecule experiments, the oxygen scavenging system (0.8% D-glucose, 1 mg/ml glucose oxidase (266600 units/g, Sigma), and 0.4 mg/ml catalase (2000–5000 units/mg, Sigma) were added.

Substrate preparation

All of the DNA substrates used in this study are listed in Table S1. For the DNA preparation of CD experiments, the final concentration of annealed ssDNA was 10 μM. For the substrate preparation of ensemble FRET experiments, the ratio of Cy3-labeled ssDNA and Cy5-labeled ssDNA was 1:1 to ensure that the fluorescence intensity directly reflected the structural states, and the concentration was 2 μM. For single-molecule experiments, the ratio of biotin-modified ssDNA to biotin-free ssDNA was 5:7 annealed to prevent the possibility that nonannealed ssDNA might anchor to the coverslip surface. The final concentration of biotin-modified ssDNA was 2 μM. All of the annealing was performed by incubation at 95 °C for 5 min and then slowly cooled to 20 °C in the corresponding reaction buffers.

CD spectropolarimetry

CD experiments were performed on a Chirascan V100 (Applied Photophysics Ltd) that contained a temperature probe and a temperature-controlled cell holder (32). The CD signals were recorded in the 220 to 340 nm region using a quartz cell (0.5-mm path length). The melting curves were determined between 20 to 98 °C, and the temperature rose one degree per minute. The Tm values were calculated using the analysis software supplied with the instrument.

Protein expression and purification

Full-length hnRNP K and fragments of KH1 (hnRNP K1-127), KH2 (hnRNP K127-237), and KH3 (hnRNPK382-463) were expressed and purified (Fig. S1). The corresponding genes were cloned into pET21a using the NdeI and XhoI sites. For KH1 and KH3, GST tags were fused to their N-terminal to improve solubility. Each expression vector was transformed into Escherichia coli Rosetta2 (DE3). Expression strains were incubated in LB at 37 °C before the A600 reached about 0.7. Protein expression was induced with 0.3 mM IPTG at 16 °C overnight. After cell lysis, proteins were captured by Ni-NTA beads. The fused GST tag was removed by PreScission Protease, if applicable. Gel-filtration chromatography was subsequently carried out to polish proteins. All of the purified proteins were frozen in small aliquots and stored at -80 °C after being fast-frozen in liquid nitrogen. The purity of the proteins was more than 95%, as shown in Fig. S1 by SDS-PAGE.

DNA-binding assay

Different concentrations of hnRNP K were mixed with 80 nM fluorescently labeled Py25′ in reaction buffer (pH 5.8). The mixture was electrophoresed on an 8% native PAGE after being incubated for 20 min at 22 °C. The gels were run in TAE buffer containing 200 mM KCl and imaged on a ChemiDoc MP Visualization System (BioRad); the images were analyzed in ImageJ (National Institutes of Health).

Ensemble FRET

The bulk FRET experiments were carried out on a homebuilt optical system. The final concentration of 4 nM i-motif–contained substrate and different concentrations of hnRNP K were mixed in a quartz cell with a path length of 5 mm at a constant temperature of 20 °C, and a laser set at 532 nm was used to excite Cy3. The emission of optical signals at 550 to 800 nm was recorded on an Avantes Fluorescence Spectrometer LS55 at a certain time.

smFRET

The equipment, procedure, and data analysis were the same as reported in this study group’s previous work (32, 33). Briefly, all smFRET experiments were conducted using a homebuilt objective-type total-internal-reflection microscope at 22 °C. The reaction chambers consisted of slides and coverslips. The surfaces of the coverslips were coated with a mixture of 99% mPEG (m-PEG-5000, Laysan Bio, Inc) and 1% of biotin-PEG (biotinPEG-5000, Laysan Bio, Inc). Streptavidin was used to capture biotin-modified substrates and immobilize them to the coverslips through biotin-streptavidin conjunction. The oxygen scavenging system was used to prevent bleaching and blinking. The FRET values were calculated using IA/(ID + IA), where ID and IA represent the fluorescence intensity of the donor (Cy3) and acceptor (Cy5), respectively. The exposure time for imaging was 100 ms, and the reaction temperature was 22 °C. FRET histograms were obtained by collecting 50 frames of each trace from more than 300 molecules. The FRET distributions of different states were recognized and fitted using a multipeak Gaussian distribution by OriginPro 2017 (14, 32, 34). More than 200 traces containing state transitions were selected to construct TDPs following the previously reported method (33).

Data availability

All data are contained within the article.

Supporting information

This article contains supporting information.

Conflict of interest

The authors declare that they have no conflicts of interest with the contents of this article.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (32171272 and 31800644), Thousand Talents Plan of Henan Province (ZYQR201912187), and Open Project of the State Key Laboratory of Crop Stress Adaptation and Improvement.

Author contributions

W.-Q. W. conceptualization; W.-Q. W., X. Z., and D. B. data curation; W.-Q. W. funding acquisition; W.-Q. W., X. Z., and D. B. investigation; W.-Q. W. writing–original draft; W.-Q. W. and L.-J. G. project administration; W.-Q. W., X. Z., D. B., S.-W. S., L.-J. G. writing–review and editing.

Edited by Patrick Sung

Contributor Information

Wen-Qiang Wu, Email: wuwenqiang@henu.edu.cn.

Li-Jun Guo, Email: juneguo@henu.edu.cn.

Supporting information

Supplementary Data
mmc1.pdf (896.4KB, pdf)

References

  • 1.Hänsel-Hertsch R., Antonio M.D., Balasubramanian S. DNA G-quadruplexes in the human genome: Detection, functions and therapeutic potential. Nat. Rev. Mol. Cell Biol. 2017;18:279–284. doi: 10.1038/nrm.2017.3. [DOI] [PubMed] [Google Scholar]
  • 2.Abou Assi H., Garavís M., González C., Damha M.J. i-Motif DNA: structural features and significance to cell biology. Nucl. Acids Res. 2018;46:8038–8056. doi: 10.1093/nar/gky735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Robinson J., Raguseo F., Nuccio S.P., Liano D., Di Antonio M. DNA G-quadruplex structures: more than simple roadblocks to transcription? Nucl. Acids Res. 2021;49:8419–8431. doi: 10.1093/nar/gkab609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lane A.N., Chaires J.B., Gray R.D., Trent J.O. Stability and kinetics of G-quadruplex structures. Nucl. Acids Res. 2008;36:5482–5515. doi: 10.1093/nar/gkn517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Rajendran A., Nakano S., Sugimoto N. Molecular crowding of the cosolutes induces an intramolecular i-motif structure of triplet repeat DNA oligomers at neutral pH. Chem. Commun. 2010;46:1299–1301. doi: 10.1039/b922050j. [DOI] [PubMed] [Google Scholar]
  • 6.Dzatko S., Krafcikova M., Hänsel-Hertsch R., Fessl T., Fiala R., Loja T., et al. Evaluation of the stability of DNA i-motifs in the nuclei of living mammalian cells. Angew. Chem. Int. Ed. 2018;57:2165–2169. doi: 10.1002/anie.201712284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zeraati M., Langley D.B., Schofield P., Moye A.L., Rouet R., Hughes W.E., et al. I-motif DNA structures are formed in the nuclei of human cells. Nat. Chem. 2018;10:631–637. doi: 10.1038/s41557-018-0046-3. [DOI] [PubMed] [Google Scholar]
  • 8.González V., Hurley L.H. The c-MYC NHE III1: function and regulation. Annu. Rev. Pharmacol. Toxicol. 2010;50:111–129. doi: 10.1146/annurev.pharmtox.48.113006.094649. [DOI] [PubMed] [Google Scholar]
  • 9.Chaudhuri R., Bhattacharya S., Dash J., Bhattacharya S. Recent update on targeting c-MYC G-quadruplexes by small molecules for anticancer therapeutics. J. Med. Chem. 2021;64:42–70. doi: 10.1021/acs.jmedchem.0c01145. [DOI] [PubMed] [Google Scholar]
  • 10.Simonsson T., Pribylova M., Vorlickova M. A nuclease hypersensitive element in the human c-myc promoter adopts several distinct i-tetraplex structures. Biochem. Biophys. Res. Commun. 2000;278:158–166. doi: 10.1006/bbrc.2000.3783. [DOI] [PubMed] [Google Scholar]
  • 11.Dai J., Hatzakis E., Hurley L.H., Yang D. I-motif structures formed in the human c-MYC promoter are highly dynamic-insights into sequence redundancy and I-motif stability. PLoS One. 2010;5 doi: 10.1371/journal.pone.0011647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Megalathan A., Cox B.D., Wilkerson P.D., Kaur A., Sapkota K., Reiner J.E., et al. Single-molecule analysis of i-motif within self-assembled DNA duplexes and nanocircles. Nucl. Acids Res. 2019;47:7199–7212. doi: 10.1093/nar/gkz565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Paul S., Hossain S.S., Samanta A. Insights into the folding pathway of a c-MYC-promoter-based i-motif DNA in crowded environments at the single-molecule level. J. Phys. Chem. B. 2020;124:763–770. doi: 10.1021/acs.jpcb.9b10633. [DOI] [PubMed] [Google Scholar]
  • 14.Gao B., Zheng Y.T., Su A.M., Sun B., Xi X.G., Hou X.M. Remodeling the conformational dynamics of I-motif DNA by helicases in ATP-independent mode at acidic environment. iScience. 2022;25 doi: 10.1016/j.isci.2021.103575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Choi J., Kim S., Tachikawa T., Fujitsuka M., Majima T. pH-induced intramolecular folding dynamics of i-motif DNA. J. Am. Chem. Soc. 2011;133:16146–16153. doi: 10.1021/ja2061984. [DOI] [PubMed] [Google Scholar]
  • 16.Kang H.J., Kendrick S., Hecht S.M., Hurley L.H. The transcriptional complex between the BCL2 i-motif and hnRNP LL is a molecular switch for control of gene expression that can be modulated by small molecules. J. Am. Chem. Soc. 2014;136:4172–4185. doi: 10.1021/ja4109352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Miglietta G., Cogoi S., Pedersen E.B., Xodo L.E. GC-elements controlling HRAS transcription form i-motif structures unfolded by heterogeneous ribonucleoprotein particle A1. Sci. Rep. 2015;5 doi: 10.1038/srep18097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Niu K., Zhang X., Deng H., Wu F., Ren Y., Xiang H., et al. BmILF and i-motif structure are involved in transcriptional regulation of BmPOUM2 in Bombyx mori. Nucl. Acids Res. 2017;46:1710–1723. doi: 10.1093/nar/gkx1207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Nazarov I.B., Bakhmet E.I., Tomilin A.N. KH-domain poly(C)-Binding proteins as versatile regulators of multiple biological processes. Biochemistry (Mosc.) 2019;84:205–219. doi: 10.1134/S0006297919030039. [DOI] [PubMed] [Google Scholar]
  • 20.Kaiser C.E., Van Ert N.A., Agrawal P., Chawla R., Yang D., Hurley L.H. Insight into the complexity of the i-motif and G-quadruplex DNA structures formed in the KRAS promoter and subsequent drug-induced gene repression. J. Am. Chem. Soc. 2017;139:8522–8536. doi: 10.1021/jacs.7b02046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Tomonaga T., Levens D. Activating transcription from single stranded DNA. Proc. Natl. Acad. Sci. U. S. A. 1996;93:5830–5835. doi: 10.1073/pnas.93.12.5830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Takimoto M., Tomonaga T., Matunis M., Avigan M., Krutzsch H., Dreyfuss G., et al. Specific binding of heterogeneous ribonucleoprotein particle protein K to the human c-myc promoter, in vitro. J. Biol. Chem. 1993;268:18249–18258. [PubMed] [Google Scholar]
  • 23.Lacroix L., Liénard H.l. n., Labourier E., Djavaheri-Mergny M., Lacoste J.r. m., Leffers H., et al. Identification of two human nuclear proteins that recognise the cytosine-rich strand of human telomeres in vitro. Nucleic Acids Res. 2000;28:1564–1575. doi: 10.1093/nar/28.7.1564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Dvořáková Z., Renčiuk D., Kejnovská I., Školáková P., Bednářová K., Sagi J., et al. i-Motif of cytosine-rich human telomere DNA fragments containing natural base lesions. Nucl. Acids Res. 2018;46:1624–1634. doi: 10.1093/nar/gky035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Braddock D.T., Baber J.L., Levens D., Clore G.M. Molecular basis of sequence-specific single-stranded DNA recognition by KH domains: Solution structure of a complex between hnRNP K KH3 and single-stranded DNA. EMBO J. 2002;21:3476–3485. doi: 10.1093/emboj/cdf352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Backe P.H., Messias A.C., Ravelli R.B.G., Sattler M., Cusack S. X-ray crystallographic and NMR studies of the third KH domain of hnRNP K in complex with single-stranded nucleic acids. Structure. 2005;13:1055–1067. doi: 10.1016/j.str.2005.04.008. [DOI] [PubMed] [Google Scholar]
  • 27.Tomonaga T., Levens D. Heterogeneous nuclear ribonucleoprotein K is a DNA-binding transactivator. J. Biol. Chem. 1995;270:4875–4881. doi: 10.1074/jbc.270.9.4875. [DOI] [PubMed] [Google Scholar]
  • 28.Du Z., Lee J.K., Tjhen R., Li S., Pan H., Stroud R.M., et al. Crystal structure of the first KH domain of human poly(C)-binding protein-2 in complex with a C-rich strand of human telomeric DNA at 1.7 Å. J. Biol. Chem. 2005;280:38823–38830. doi: 10.1074/jbc.M508183200. [DOI] [PubMed] [Google Scholar]
  • 29.Siddiqui-Jain A., Grand C.L., Bearss D.J., Hurley L.H. Direct evidence for a G-quadruplex in a promoter region and its targeting with a small molecule to repress c-MYC transcription. Proc. Natl. Acad. Sci. U. S. A. 2002;99:11593–11598. doi: 10.1073/pnas.182256799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.You H., Wu J., Shao F., Yan J. Stability and kinetics of c-MYC promoter G-quadruplexes studied by single-molecule manipulation. J. Am. Chem. Soc. 2015;137:2424–2427. doi: 10.1021/ja511680u. [DOI] [PubMed] [Google Scholar]
  • 31.Ye S., Chen Z., Zhang X., Li F., Guo L., Hou X.-M., et al. Proximal single-stranded RNA destabilizes human telomerase RNA G-quadruplex and induces its distinct conformers. J. Phys. Chem. Lett. 2021;12:3361–3366. doi: 10.1021/acs.jpclett.1c00250. [DOI] [PubMed] [Google Scholar]
  • 32.Wang L., Xu Y.-P., Bai D., Shan S.-W., Xie J., Li Y., et al. Insights into the structural dynamics and helicase-catalyzed unfolding of plant RNA G-quadruplexes. J. Biol. Chem. 2022;298:102165. doi: 10.1016/j.jbc.2022.102165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wu W.Q., Zhang M.L., Song C.P. A comprehensive evaluation of a typical plant telomeric G-quadruplex (G4) DNA reveals the dynamics of G4 formation, rearrangement, and unfolding. J. Biol. Chem. 2020;295:5461–5469. doi: 10.1074/jbc.RA119.012383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Lee J., Okumus B., Kim D., Ha T. Extreme conformational diversity in human telomeric DNA. Proc. Natl. Acad. Sci. U. S. A. 2005;102:18938–18943. doi: 10.1073/pnas.0506144102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Xue Z.Y., Wu W.Q., Zhao X.C., Kumar A., Ran X., Zhang X.H., et al. Single-molecule probing the duplex and G4 unwinding patterns of a RecD family helicase. Int. J. Biol. Macromol. 2020;164:902–910. doi: 10.1016/j.ijbiomac.2020.07.158. [DOI] [PubMed] [Google Scholar]
  • 36.Wu W.Q., Hou X.M., Li M., Dou S.X., Xi X.G. BLM unfolds G-quadruplexes in different structural environments through different mechanisms. Nucl. Acids Res. 2015;43:4614–4626. doi: 10.1093/nar/gkv361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Zhou R., Zhang J., Bochman M.L., Zakian V.A., Ha T. Periodic DNA patrolling underlies diverse functions of Pif1 on R-loops and G-rich DNA. eLife. 2014;3 doi: 10.7554/eLife.02190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Geng Y., Liu C., Cai Q., Luo Z., Miao H., Shi X., et al. Crystal structure of parallel G-quadruplex formed by the two-repeat ALS- and FTD-related GGGGCC sequence. Nucl. Acids Res. 2021;49:5881–5890. doi: 10.1093/nar/gkab302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Dejgaard K., Leffers H. Characterisation of the nucleic-acid-binding activity of KH domains different properties of different domains. Eur. J. Biochem. 1996;241:425–431. doi: 10.1111/j.1432-1033.1996.00425.x. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data
mmc1.pdf (896.4KB, pdf)

Data Availability Statement

All data are contained within the article.


Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES