The human CCHC-type Zinc Finger Nucleic Acid Binding Protein binds G-rich elements in target mRNA coding sequences and promotes translation

Daniel Benhalevy; Sanjay K Gupta; Charles H Danan; Suman Ghosal; Hong-Wei Sun; Hinke G Kazemier; Katrin Paeschke; Markus Hafner; Stefan A Juranek

doi:10.1016/j.celrep.2017.02.080

. Author manuscript; available in PMC: 2017 Apr 17.

Published in final edited form as: Cell Rep. 2017 Mar 21;18(12):2979–2990. doi: 10.1016/j.celrep.2017.02.080

The human CCHC-type Zinc Finger Nucleic Acid Binding Protein binds G-rich elements in target mRNA coding sequences and promotes translation

Daniel Benhalevy ^1,⁵, Sanjay K Gupta ^2,⁵, Charles H Danan ¹, Suman Ghosal ¹, Hong-Wei Sun ³, Hinke G Kazemier ⁴, Katrin Paeschke ⁴, Markus Hafner ^1,^6,⁷, Stefan A Juranek ^2,^4,⁶

PMCID: PMC5393907 NIHMSID: NIHMS857597 PMID: 28329689

Summary

The CCHC-type Zinc Finger Nucleic Acid Binding Protein (CNBP/ZNF9) is conserved in eukaryotes and essential for embryonic development in mammals. It has been implicated in transcriptional as well as post-transcriptional gene regulation; however, its nucleic acid ligands and molecular function remain elusive. Here, we use multiple systems-wide approaches to identify CNBP targets and function. We used Photoactivatable Ribonucleoside Enhanced Crosslinking and Immunoprecipitation (PAR-CLIP) to identify 8420 CNBP binding sites on 4178 mRNAs. CNBP preferentially bound G-rich elements in the target mRNA coding sequences, most of which were previously found to form G-quadruplex and other stable structures in vitro. Functional analyses, including RNA sequencing, ribosome profiling, and quantitative mass spectrometry, revealed that CNBP binding did not influence target mRNA abundance but rather increased their translational efficiency. Considering that CNBP binding prevented G quadruplex structure formation in vitro, we hypothesize that CNBP is supporting translation by resolving stable structures on mRNAs.

eTOC Blurb

Benhalevy et al. characterize the RNA binding protein CNBP/ZNF9 using systems-wide approaches. They find that CNBP preferentially binds at mRNA regions previously found to form G quadruplex and other structures in vitro. Ribosome profiling revealed that CNBP enhances translation across these sites, which potentially form roadblocks for the ribosome.

graphic file with name nihms857597u1.jpg

INTRODUCTION

Gene regulation involves recognition of cis-acting sequence elements on both DNA and RNA by transcription factors and RNA-binding proteins (RBPs), respectively (Vaquerizas et al. 2009; Gerstberger et al. 2014). Nucleic acid recognition is mediated by a limited set of protein domains that are highly conserved throughout evolution (Wilson et al. 2007; Gerstberger et al. 2014; Lunde et al. 2007; Finn et al. 2010). One of the canonical nucleic acid-recognizing domains found in both DNA and RNA-binding proteins is the zinc finger (ZnF) domain. ZnF domains can be broadly subdivided into six distinct folds, of which CCCH- and CCHC-ZnFs are most common in eukaryotes (Klug and Rhodes 1987). A common feature of nucleic acid binding domains in general, and ZnF domains in particular, is their frequent occurrence in homo- or heterotypic arrays of multiple nucleic acid binding domains (Gerstberger et al. 2014). As an example, human ZFP100 contains up to 18 ZnF-domains of four different types (Gerstberger et al. 2014). Given that most nucleic acid binding domains recognize short and degenerate 4–6 nucleotide (nt) long segments, it is thought that such a modular design increases affinity and sequence-specificity of the protein-nucleic acid interaction (Ascano et al. 2012a; Lunde et al. 2007).

The cellular nucleic acid binding protein (CNBP/ZNF9) is among the proteins with the highest number of repeats of the same nucleic acid binding domain. It is evolutionary conserved in eukaryotes and harbors seven CCHC-type ZnF domains in addition to one arginine-glycine (RG/RGG)-rich motif (Figure 1A). CNBP is ubiquitously expressed at high levels in adult tissues (Figure S1A) (Gerstberger et al. 2014) and gene knockout in mice is embryonically lethal, presumably due to impaired embryonic forebrain development (Chen et al. 2003; Shimizu et al. 2003). In humans, a CCTG expansion in intron 1 of CNBP causes myotonic dystrophy type 2 (Liquori et al. 2001). Nevertheless, little is known about the function of CNBP, its targets, and its molecular mechanism.

(A) ClustalW multiple sequence alignment of CNBP isoform 1 from various eukaryotes. CCHC domains are indicated by light grey boxes, the RGG domain by a dark grey box. Signature C and H amino acid positions are highlighted (red).

(B) Clone count distribution across the six CNBP isoforms from 22 sequenced full length clones from HEK293 cDNA.

(C) Comparison and protein length (amino acids, Aa) of the six different CNBP isoforms.

(D) Analysis of the nucleocytoplasmic distribution of wild-type CNBP and FH-CNBP isoforms 1–6 HEK293 cell lines, as indicated. Cytoplasmic and nuclear fractions were probed with anti-CNBP, anti-FLAG (FH-CNBP), anti-Histone H3 (nuclear marker), anti-GAPDH (cytoplasmic marker), and anti-Calnexin (endoplasmic reticulum marker) antibodies.

CNBP was first described as a DNA-binding protein and potential regulator for the sterol regulatory element (SRE) (Rajavashisth et al. 1989). It scored high in an in vitro screen for binders of single-stranded DNA (ssDNA) with the defined sequence GTGCGGTG. A role for CNBP in transcriptional regulation was further suggested by its ability to bind the CT element in the promotor of the c-myc gene in vitro and activate it after overexpression in cultured murine teratocarcinoma cells (P19) (Michelotti et al. 1995; Shimizu et al. 2003). In contrast, pulldown of polyadenylated RNA after in vivo UV crosslinking identified CNBP as an RBP in amphibian oocytes and human cell lines (Pellizzoni et al. 1997; Calcaterra et al. 1999; Baltz et al. 2012; Castello et al. 2012). Other studies suggested that human CNBP interacts with G-rich single-stranded RNA (ssRNA) in vitro (Armas et al. 2008) and associates with the terminal 5′ oligopyrimidine (TOP) sequence of specific mRNAs (Iadevaia et al. 2008) to promote their translation (Huichalaf et al. 2009). CNBP was also found to enhance the cap-independent translation of reporter constructs containing ornithine decarboxylase (ODC) internal ribosome entry site (IRES), suggesting a function as a IRES trans-acting factor (ITAF) (Gerbasi and Link 2007). The yeast homolog of CNBP, Gis2, associates with PABPC1 and the translational machinery and localizes to stress-induced cytoplasmic granules, which further supports a potential function for CNBP in translational regulation (Rojas et al. 2012).

Here, we aimed to consolidate these seemingly conflicting hypotheses about CNBP’s regulatory function using biochemical and systems-wide approaches. We identified CNBP as a cytoplasmic RBP and profiled its RNA targets on a transcriptome-wide scale at nucleotide level resolution using Photoactivatable Ribonucleoside Enhanced Crosslinking and Immunoprecipitation (PAR-CLIP) (Hafner et al. 2010b). CNBP bound 4178 mRNAs at 7545 distinct sites, distributing mainly to G-rich regions in the first 50 nt downstream of the AUG start codon. CNBP binding sites were preferentially found in regions that were recently reported to form G-quadruplex (G4) and other stable structures (Guo and Bartel 2016). Interestingly, Guo and Bartel noted that in eukaryotes these structures are not detectable in vivo and likely require a dedicated machinery to resolve them and relieve their inhibitory effect on translation. Using a host of genome-wide approaches, including RNA sequencing (RNA-seq), ribosome profiling (Ribo-seq), and quantitative mass spectrometry we found that CNBP is an example of a sequence-specific RBP promoting the translation of its mRNA targets on a global scale. Considering that CNBP binding prevented G4 structure formation in vitro, we hypothesize that CNBP is supporting translation by resolving stable structures on mRNAs.

RESULTS

CNBP is a cytoplasmic protein consisting of multiple simultaneously expressed isoforms

Human CNBP is predicted to encode multiple splice isoforms that change the composition of its nucleic acid binding domains and potentially modify target specificity (Chen et al. 2013). In HEK293 cells, we detected the expression of six distinct isoforms by sequencing of complementary DNA (cDNA, Figure 1B). CNBP isoform 3, which harbors a deletion in the third CCHC domain, and isoform 6, which contains deletions in the third CCHC domain and at the C-terminus of the RG-rich domain, were the most abundant, representing more than 60% of the sequenced CNBP clones (Figure 1B,C).

RG-rich domains are often targets of arginine methylation (McBride et al. 2009), which could influence CNBP function in two ways: methylated arginine residues can be recognized by adapter proteins shuttling between cytoplasm and nucleus, resulting in changes of subcellular localization (Lee et al. 2012). Alternatively, arginine methylation could influence the nucleic acid binding properties of the RG-rich domain and/or the neighboring ZnFs (Wei et al. 2014). Because previous reports suggested nuclear as well as cytoplasmic functions for CNBP, we investigated whether differences in the RG-rich domain could influence subcellular localization. We generated stable HEK293 cell lines expressing FLAG/HA-tagged CNBP (FH-CNBP) isoforms 1–6 under control of a tetracycline-inducible promoter (Spitzer et al. 2013). Subcellular fractionation from these six cell lines, as well as from the parental HEK293 cells, revealed that all CNBP isoforms mainly localize to the cytoplasm (Figure 1D). This result favors a predominantly posttranscriptional - rather than transcriptional - gene regulatory function for CNBP.

CNBP interacts with mature mRNAs in G-rich regions

Next, we investigated the RNA binding properties of the six CNBP isoforms in living cells. We used 4-thiouridine (4SU) PAR-CLIP to irreversibly crosslink RNA and interacting proteins (Hafner et al. 2010b). Autoradiography of the crosslinked, ribonuclease-treated, immunoprecipitated, and radiolabeled FH-CNBP isoforms 1–6 ribonucleoprotein (RNP) complexes revealed one major band at the expected size of ~20 kDa molecular mass (Figure 2A). This indicated that all CNBP isoforms maintain the ability to interact with RNA ligands in vivo.

(A) Autoradiograph showing *in vivo* crosslinked CNBP-RNA RNPs from stably expressing FLAG-HA tagged CNBP isoform1–6 cell lines (arrow indicates crosslinked CNBP-RNPs). Cells not irradiated with UV served as control (lane 1). Western blot analysis of immunoprecipitated CNBP isoform1–6 probed with anti-FLAG antibody is shown in the lower panel.

(B) Analysis of target transcript preferences for CNBP. The number of exonic binding sites annotated as derived from the 5′ UTR, CDS, or 3′ UTR of a target transcript is shown (orange bars). Grey bars show the expected location distribution of clusters if CNBP bound without regional preference to the set of target transcripts.

(C) Density of CNBP binding sites (red line) downstream of the mRNA start codon, compared to 1,000 mismatched randomized controls (grey lines).

(D) Weblogo of representative RREs from CNBP PAR-CLIP binding sites generated by HOMER (Heinz et al. 2010). Percentage of CNBP binding sites containing the respective RRE is indicated.

(E) Alignment of top 19 PAR-CLIP clusters with G-rich RREs indicated in red. The number of total (Total) and crosslinked (XL) reads for each cluster is shown. Crosslinking sites determined by the diagnostic T-to-C mutation are underlined.

We chose to characterize the targeting profile of one of the two most abundant CNBP isoform in HEK293, isoform 3 (Figure 1B). We recovered the interacting RNA from the crosslinked FH-CNBP isoform 3 RNP and generated cDNA libraries for next-generation sequencing. Using PARalyzer software (Corcoran et al. 2011) we identified 8420 high confidence clusters (binding sites), of which 7545 distributed over 4178 mRNAs (Figure S1B, Table S1). Consistent with its cytoplasmic localization, CNBP binding sites were predominantly mapping to mature mRNAs, with 54% of the clusters in coding sequences (CDS) and 27% in the 3′ untranslated regions (UTR) (Figure 2B). A metagene analysis across different mRNA features revealed an enrichment of CNBP binding sites within the first 50 nt of the start codon (Figures 2C, S1C–G).

We used HOMER (Heinz et al. 2010) to computationally define the preferred RNA recognition element (RRE) on the complete set of PAR-CLIP-defined CNBP binding sites/clusters. This approach identified UGGAGNW as the most common RRE (>40% of all binding sites). Other G-rich motifs containing a GGA or GGG core also showed strong enrichment in the computational analysis and often occurred multiple times in a single CNBP binding site (Figure 2D,E).

CNBP recognizes G-rich sequences in vitro

We first used electrophoretic mobility shift assays (EMSA) with recombinant, purified CNBP isoform 3 to evaluate the impact of the putative GGAG containing RRE on the binding affinity. We designed synthetic 20 nt long ssRNAs with adenosines flanking a centered UUUU, GGAG, or 2×GGAG element (sequences are shown in Figure 3A). The UUUU-containing RNA did not show any appreciable interaction with CNBP. The addition of one GGAG resulted in a weak shift at high protein concentrations and the addition of a second GGAG resulted in a complete shift of oligoribonucleotide at CNBP concentrations >0.5 μM (Figure S2A). Accordingly, based on filter binding assays CNBP binding constants (K_d) of UUUU, GGAG, and 2×GGAG oligoribonucleotides were >10 μM, 330 nM, or 120 nM, respectively, confirming that a GGAG motif confers binding to CNBP (Figures 3A). Hill coefficients for these RNA substrates varied around 1, indicating that CNBP binding was non-cooperative (Figure S2C).

(A) Binding curves and dissociation constants (Kd) obtained from filter binding assay using recombinant CNBP with synthetic RNA targets containing no GGAG (black), one GGAG (red), and two GGAG (blue) sequence elements. Sequences are given below.

(B) Sequences of CNBP-binding sites used for filter binding assays. Nucleotides highlighted in red correspond to the predicted CNBP RREs. Mutations of the putative RRE are underlined.

(C) Binding curves and dissociation constants (K_d) obtained from filter binding assays using recombinant CNBP and sequences from (B).

Next, we validated the binding of CNBP to PAR-CLIP binding sites in vitro. We generated ssRNAs representing four different binding sites (Figure 3B) and quantified their affinity to CNBP by filter binding. The obtained K_d varied between 30 and 160 nM and mutations in the core G-rich binding motif resulted in loss of CNBP binding (Figure 3C). Finally, we attempted to dissect the contribution of the RG-rich domain and individual CCHC domains using three recombinantly expressed CNBP constructs either lacking the RG-rich domain, the N-terminal CCHC domain, or the two C-terminal CCHC domains. These constructs all had severely reduced binding affinities compared to full-length CNBP and we concluded that in addition to the ZnF-domains CNBP also requires the RG-rich domain for high-affinity RNA binding (Figure S2C–G). In summary, our in vitro studies validated the PAR-CLIP-derived CNBP binding sites and emphasized the requirement of G-rich sequences for high-affinity binding.

CNBP presence slightly reduces target mRNA abundance

To determine the effect of CNBP on its mRNA targets and to perform loss-of-function studies, we generated CNBP knockout (KO) HEK293 cell lines using Cas9-mediated gene editing. Genomic sequencing of the CNBP locus, as well as RNA sequencing of three different CNBP KO clones revealed that all contained frameshift mutations in exon 4, which resulted in loss of detectable protein by western blot (Figure S3A–C). CNBP KO did not significantly reduce the growth rate of HEK293 cells. However, when grown in continuous (unpassaged) culture CNBP KO cells started collapsing after 8–10 days of growth and were completely dead after 15 days, while parental HEK293 readily survived (Figure S3D–F). This suggests that CNBP supports cell survival under limiting conditions.

We used these CNBP KO cells to determine whether CNBP is involved in cytoplasmic posttranscriptional gene regulatory processes. We first investigated CNBP target mRNA abundance by RNA-seq (Figure S4A and Table S2). Loss of CNBP led to a marginal, albeit statistically significant increase of target mRNA levels compared to parental HEK293 cells. CNBP PAR-CLIP top targets, binned according to the number of crosslinked reads, increased in abundance by ~2% (Figure 4A). These results indicate that CNBP is most likely not influencing cytoplasmic mRNA turnover.

(A) Cumulative distribution analysis of change in average mRNA expression comparing CNBP KO cell lines (n=3) with parental HEK293 cells (n=3). Target mRNAs are binned based on the number of crosslinked reads. Significance was determined using a two-sided Kolmogorov-Smirnov (KS) test. Bin size is indicated.

(B) Sucrose gradient separation profile of HEK293 cells extract. The western blot below shows co-sedimentation of CNBP with free ribosomal subunits (fractions 5–8), monoribosomes (fractions 9–12), and polysomes (fractions 13–19). RPS6 from the 40S ribosomal subunit served as a control protein.

(C) Cumulative distribution analysis of change in ribosome protected fragments (RPF) upon CNBP knockout as determined by ribosome profiling. Target mRNAs are binned based on the number of crosslinked reads. Significance was determined using a two-sided KS test. Bin size is indicated.

(D) Same as in (C), except the cumulative distribution of the translation efficiency (TE, calculated as log₂[RPF/RNA abundance]) ratio is plotted.

(E) Distribution of RPF around CNBP binding sites in control (blue) and CNBP KO (red) cells. Curves were fitted using LOESS regression and the envelope indicates a 95% confidence interval.

CNBP promotes translation of its targets

Given the almost negligible effect on mRNA abundance, we next investigated whether CNBP controls other cytoplasmic posttranscriptional gene regulatory processes beyond mRNA stability. We showed above that CNBP binding was enriched in mRNA CDSs, particularly within 50 nt downstream of the start codon (Figure 2B,C). We also demonstrated that CNBP co-sediments quantitatively with the 40S, 80S, and polysomal fractions in polysome profiling experiments (Figure 4B). Taken together, these data pointed to a role for CNBP in translation and we hypothesized that CNBP might influence translational elongation across G-rich regions.

To directly measure the impact of CNBP on the ribosome occupancy of its mRNA targets we performed ribosome footprinting (Ribo-seq) and quantified ribosome protected fragments (RPF) (Ingolia et al. 2009) in CNBP KO cells and parental HEK293 cells (Table S3). CNBP loss resulted in a robust decrease in RPF numbers on target mRNAs and this effect depended on the number of crosslinked reads or the number of CNBP binding sites (Figures 4C, S4B), both of which we previously found to correlate well with the occupancy of the RBP on the RNA (Hafner et al. 2010b; Ascano et al. 2012b). We found a ~30% decrease (p<2.2×10⁻¹⁶) in RPFs on the 243 target mRNAs with more than 500 crosslinked reads compared to mRNAs showing no interaction with CNBP (Figure 4C). Even for the 1761 targets with only a single CNBP site, RPFs decreased by ~9% (p=13×10⁻¹⁰).

We further investigated the effect of the CNBP binding site location on ribosome density and first focused on mRNAs that were targeted by CNBP in the 3′UTR. CNBP KO did neither affect the RPFs on these mRNAs nor their abundance, suggesting that 3′UTR binding did not contribute to CNBP mediated gene regulation (Figure S4C–E). In contrast, CNBP binding anywhere in the CDS was sufficient to increase RPF density, with no preference detectable for sites close to the translational start, despite the enrichment of CNBP binding sites in that region (Figure S4F).

We calculated the density of ribosomes on each mRNA in CNBP KO and control cells by normalizing the number of RPFs with the mRNA abundance. This score, known as the translational efficiency (TE), removes effects of mRNA abundance and approximates the translational output for each mRNA molecule of a given gene (Ingolia et al. 2009; Guo et al. 2010; Bazzini et al. 2012). CNBP KO strongly correlated with decreased TE on CNBP targets (~33% decrease for the 243 top CNBP targets binned by intensity of crosslinking, Figure 4D, p<2.2×10⁻¹⁶).

G-rich sequences are capable of forming G-quadruplex and other secondary structures, which act as roadblocks for efficient translation (Rhodes and Lipps 2015; Guo and Bartel 2016). We thus reasoned that CNBP possibly facilitated translation by relieving restrictive mRNA structures; elongation rates of ribosomes around CNBP binding sites would in this case decrease upon CNBP loss. To test this hypothesis, we plotted RPF density around CNBP binding sites (Figure 4E). Ribosome density around the G-rich CNBP sites was increased in control cells suggesting reduced elongation rates even in the presence of CNBP. Upon CNBP loss, we observed a further 60% increase in ribosome density followed by a sharp drop in occupancy >200 nt downstream of the binding sites. Thus, our data suggest that CNBP loss resulted in increased stalling of elongating ribosomes around its G-rich binding sites. The local increase of ribosome density upstream of the CNBP binding sites was compatible with the overall decrease of ribosome footprints on CNBP target mRNAs, considering that CNBP binding sites were predominantly localized 50 nt of the start codon (Figure 2C).

CNBP increases protein levels of target transcript

CNBP KO cells showed a ~20% decrease in protein content per cell, supporting a role in promoting target mRNA translation (Figure 5A). Nevertheless, such a decrease may also be the result of indirect regulatory effects. To directly test the effect of CNBP binding on the target mRNA translational output we designed reporter assays using representative PAR-CLIP binding sites from the TMPO, LUC7L3, and DDX42 mRNAs, which showed a reduction in TE of 40%, 40%, and 50%, respectively. Each of these binding sites was documented to bind CNBP in vitro and was located within less than 50 nts from the start codon in the CDS (Figure 3B,C). The sequences were cloned in-frame with the firefly luciferase mRNA directly downstream of the AUG codon in the psiCHECK-2 dual luciferase reporter assay system (Figure S5A). Transfection of these reporter plasmids into CNBP KO cells resulted in an approximately 10-fold decrease in both, renilla control and firefly luciferase compared to wild-type cells, while luciferase RNA levels stayed constant (Figure S5B). This reduction in luciferase protein levels was likely due to a concentration of G-rich sequence elements in the CDS of the luciferase genes making them sensitive to complete loss of CNBP (Figure S5C). We argued that reduction of CNBP levels, rather than knockout would preferentially affect the strongest CNBP targets and used a siRNA-mediated gene knockdown for our luciferase experiments. Upon knockdown of CNBP we observed a highly significant (20–40%) decrease in firefly luciferase levels for all reporters (Figures 5B, S5D). This decrease was rescued by mutations within the G-rich CNBP binding sites that we had previously shown to abrogate CNBP binding in vitro (Figure 3C).

(A) Absolute protein abundance per cell in control (WT) and CNBP KO (KO) cells.

(B) Effect of CNBP knockdown on luciferase reporter gene expression. Firefly luciferase expression is normalized to Renilla luciferase expression and set to 1 for the control knockdown. Results of paired t-test are indicated (****, p <0.0001; ***, p <0.0006).

(C) Cumulative distribution analysis of protein level changes upon CNBP knock down as determined by pSILAC. Significance was determined using a two-sided KS test. Bin size is indicated.

(D) Same as in (C), except the cumulative distribution of a protein (derived by pSILAC) per mRNA (derived by RNA-seq) metric (log₂[protein abundance/RNA abundance]) ratio is plotted.

(E) CD measurement of a G4 structure without CNBP and with increasing concentrations of CNBP.

(F) Number of CNBP binding sites overlapping RT-stop reads (from (Guo and Bartel 2016) compared to a set of background sequence clusters. Background sequences for each CNBP binding site were randomly selected from the same mRNA and the same annotation category (CDS, UTR).

(G) Distribution of RT stop reads 100 nt up- and down-stream of CNBP PAR-CLIP sites (red line, anchored at cluster start coordinate). Distribution of RT-stop reads relative to the background sequence set from (F) (black line).

To systematically measure the impact of CNBP on target protein levels we performed Pulsed Stable Isotope Labeling by Amino Acids in Cell Culture (pSILAC) coupled with quantitative proteomics (Ong et al. 2003). We compared the protein expression profile of FH-CNBP expressing HEK293 cells with that of HEK293 cells after CNBP knockdown and could quantify 4887 proteins (Table S5). Knockdown of CNBP resulted in a small but significant decrease in abundance of proteins encoded by the top 509 CNBP target mRNAs (Figure 5C). We further normalized the observed changes in protein levels according to changes in mRNA levels (obtained by RNA-seq) for each gene upon CNBP knockdown and overexpression. Consistent with our reporter assays and ribosome profiling data we found that CNBP expression directly correlated with increased translational efficiency on CNBP targets (14% increase for the top 509 CNBP targets, p=2×10⁻⁵, Figure 5D).

CNBP binds sites close to G4 and other stable RNA structures

Finally, we asked whether a plausible mechanism for CNBP function would be to promote translation across G-rich elements in the target mRNA CDS that have the potential to form stable secondary RNA structures. We used SHAPE chemistry to probe for structure in the CNBP PAR-CLIP sites from TMPO and HNRNPH3 and found that the G-rich elements in that oligoribonucleotide were protected from modification in vitro, indicating that they were paired (Figure S5E,F). We next examined whether CNBP binding could disrupt formation of RNA secondary structures and performed circular dichroism measurements using a well-characterized G4 forming oligonucleotide (Paeschke et al. 2013) in the presence and absence of recombinant CNBP. The presence of CNBP prevented the formation of the G4 in a concentration dependent manner (Figure 5E), suggesting that indeed was capable of resolving stable secondary structure. We extended these analyses on a transcriptome-wide scale using data from a recent report, which identified regions in HEK293 cells forming stable structures in vitro (Guo and Bartel 2016). Guo and Bartel exploited the ability of G4-forming and other structured regions in RNA to stall reverse transcription during cDNA library preparation. Interestingly, 75% of CNBP binding sites overlapped with sequences that stalled reverse transcription (Figure 5F). The localization of CNBP binding sites at and near regions capable of stalling reverse transcription was highly significant in comparison to a set of length-matched sites randomly selected from CDS of mRNAs expressed in HEK293 (Figure 5G). In summary, our data are consistent with a role for CNBP in preventing the formation of stable secondary structures in G-rich regions within the mRNA CDS and thus promoting translation.

DISCUSSION

Here, we present a comprehensive characterization of the CNBP protein. We identified target RNA binding sites transcriptome-wide, delineated a consensus binding motif, and defined the effect of CNBP binding on mRNA abundance and translation.

CNBP was described as a multifunctional protein similar to other ZnF-containing proteins (Pace and Weerapana 2014; Hall 2005). It was suggested to interact with both, DNA and RNA, leading to conflicting reports claiming a function as a transcriptional as well as post-transcriptional gene regulator (Borgognone et al. 2010; Calcaterra et al. 1999; Sammons et al. 2011; Shimizu et al. 2003; Michelotti et al. 1995). In human cell lines CNBP encodes at least six protein isoforms, which contain differences in the third CCHC domain as well as in the RG-rich domain conceivably involved in nucleocytoplasmic shuttling. However, our biochemical analyses showed that CNBP could only be detected in the cytoplasm, similar to its homolog in Xenopus laevis (Calcaterra et al. 1999). Combined with our finding that PAR-CLIP binding sites mainly localized to exonic mRNA regions and that all six isoforms interacted with RNA in vivo these data suggest that CNBP is involved in cytoplasmic post transcriptional gene regulatory processes rather than acting as a transcription factor.

ZnF proteins show diverse binding sequence specificity. Recent structural work for the CCCH-type ZnF of the yeast Nab2 and the Drosophila melanogaster Unkempt proteins showed their strong preference for adenine and uridine bases, respectively (Kuhlmann et al. 2014; Murn et al. 2015). In contrast, the two CCHC domains of the LIN28 protein family, which are homologous to CNBP’s seven CCHC ZnFs, interact in vitro and in vivo with a GGAG motif (Nam et al. 2011; Wilbert et al. 2012; Piskounova et al. 2008; Graf et al. 2013). This motif is mirrored by the RREs of CNBP found by in vitro selection (Ray et al. 2016), as well as by PAR-CLIP in this study. RNA-protein crosslinking in 4SU-PAR-CLIP, HiTS-CLIP, and other CLIP-seq procedures occurs predominantly at uridines (Kramer et al. 2014; Sharma et al. 2015) and therefore requires the presence of uridine bases within a few nucleotides of the binding site. Our unbiased motif enrichment analysis revealed CNBP’s G-rich RRE mitigating concerns that results from UV-crosslinking based protocols are disproportionately skewed towards U-rich RREs.

While CNBP loss leads to a marginal accumulation of target mRNAs, their translation and protein output - as measured by Ribo-seq and mass spectrometry - is strongly reduced. The TE of the 243 top target mRNAs dropped on average by 30% and that of all ~4000 targeted mRNAs by 10% on average upon CNBP loss. Interestingly, cells from DM2 patients with CCTG expansion in the first intron of CNBP have reduced CNBP expression and also show an overall reduced rate of translation, similar to our CNBP KO cells (Huichalaf et al. 2009). Taken together, this represents a massive shift in gene regulation of the large group of target mRNAs, which can explain the embryonic lethality of CNBP knockouts (Chen et al. 2003; Shimizu et al. 2003). Previously, only a handful of RBPs with otherwise well-characterized roles in mRNA processing were implicated as ITAFs, possibly aiding the proper folding of individual IRES elements and thereby promoting translation (Holcik and Sonenberg 2005). In contrast, CNBP represents an RBP enhancing the translation of its target mRNAs on a global scale. We hypothesize that CNBP acts by relieving secondary structures on target mRNAs that exhibit G-rich sequence stretches. Secondary structures are often required for RNA function (Mortimer et al. 2014; Wan et al. 2011) and among them the G4 structure exhibits particular stability in vitro (Bochman et al. 2012; Millevoi et al. 2012; Kwok et al. 2016). A few proteins, including the ATP-dependent helicases eIF4A and DHX36 in the Aven complex were found to be necessary for the efficient translation of a subset of RNAs with potential G4 structures in the oncogenic gene expression program (Wolfe et al. 2014; Thandapani et al. 2015). A recent survey that mapped G4 and other stable structures on a transcriptome-wide scale found that they are widespread and stable in vitro (Guo and Bartel 2016). However, they remain unfolded in vivo in eukaryotic cells, suggesting the existence of a specialized machinery supporting resolution of such RNA structures in eukaryotes (Guo and Bartel 2016). Guo and Bartel excluded the involvement of DHX36, and using ATP depletion experiments demonstrated that this machinery likely operates in an ATP-independent manner, precluding helicases altogether. They hypothesized that tight binding by RNA binding proteins may help to prevent formation of structures. Considering the deep conservation of CNBP in eukaryotes, its absence in prokaryotes, and the inability for G4 structures to form in the presence of CNBP in vitro, we hypothesize that CNBP may represent one of these factors, helping to overcome the deleterious effects of G-rich structures on translation. Our work provides a starting point for the study of CNBP in translational regulation and represents a comprehensive resource for the study of individual target RNAs regulated by CNBP.

EXPERIMENTAL PROCEDURES

A detailed description of the experimental procedures can be found in the Supplementary Material.

Stable cell lines and their culture

Stable Flp-In T-REx HEK293 cell lines inducibly expressing FLAG-HA (FH-) tagged CNBP isoforms 1–6 were generated using the Gateway Recombination Cloning technology as described previously (Spitzer et al. 2013; Walhout et al. 2000) using the pFRT/TO/FLAG/HA-Dest plasmid (Landthaler et al. 2008).

siRNAs were transfected at a concentration of 50 nM using Lipofectamine RNAiMAX reagent (Life Technologies).

CNBP siRNA-1: GCAAGGAGCCCAAGAGAGAUU

CNBP siRNA-2: CAAGAGAGAGCGAGAGCAAUU

Cell fractionation

Subcellular fractionation of FH-CNBP isoforms 1-6-HEK293 cells was performed as described by Gagnon et al. unless otherwise stated (Gagnon et al. 2014).

PAR-CLIP

PAR-CLIP was performed as previously described (Hafner et al. 2012a; 2010a; 2010b) with minor modifications (see Supplemental Experimental procedures for the complete procedure). PAR-CLIP cDNA libraries were sequenced on an Illumina HiSeq 2500 platform. Clusters of overlapping sequence reads mapped against the human genome version hg19 were generated using the PARalyzer software (Corcoran et al. 2011) incorporated into a pipeline (PARpipe; https://ohlerlab.mdc-berlin.de/software/PARpipe_119/) with default settings. Binding sites were categorized using the Gencode GRCh37.p13 GTF annotation (gencode.v19.chr_patch_hapl_scaff.annotation.gtf, http://www.gencodegenes.org/releases/19.html).

DMS-Seq data for RT-stop profiling in HEK293 cells were downloaded from the dataset by Guo and Bartel (Guo and Bartel 2016) (GEO GSE83617). Reads from K+ sample were taken for the downstream analysis as they represent the strongest RT-stops. RT-stop site distribution around CNBP binding clusters was created using SAMtools mpileup. As background clusters for each CNBP cluster a 40 nt long sequence was randomly selected within the same transcript and at the same functional region (5′ UTR, CDS, or 3′ UTR).

Recombinant protein expression, EMSA, and filter binding assay

For the expression of recombinant protein CNBP isoform 3 was cloned into pET-M11 and was transformed into E. coli BL21 DE3 (pLysS). CNBP was purified using standard His-tag affinity purification using Ni-NTA agarose beads (Qiagen) and CNBP was eluted from the beads with 250 mM imidazole.

For electrophoretic mobility shift assays 5 nM radioactively labeled oligoribonucleotides (for sequences, see Figure 3B) were incubated with recombinant CNBP in binding buffer (see Supplemental Experimental procedures) and fractionated on a native 0.4% agarose gel.

For filter binding assays radiolabeled oligoribonucleotides were incubated with recombinant CNBP as described above and applied to a dotblot apparatus (BioRad). Unbound RNA was retained by nylon membrane, whereas the RNA-protein complex was retained on a nitrocellulose membrane. The bound and unbound spots were quantified using ImageQuant (GE Healthcare) and their relative K_ds were calculated using Prism software (GraphPad).

CRISPR/Cas9 gene editing

Flp-In T-REx HEK293 cells were generated using CNBP CRISPR/Cas9 KO Plasmids (sc-404090). KO clones were screened by immunoblotting with anti-CNBP antibody (Sigma SAB2100453). Knockout was confirmed by RNA-sequencing analysis.

RNA-seq

RNA from three different clones of CNBP KO cells and three biological replicates of the parental Flp-In T-REx HEK293 was isolated using TRIzol reagent according to the manufacturer’s instructions. rRNA was removed using the Ribo-Zero rRNA removal kit (Illumina) followed by directional cDNA library preparation (NEB). Sequenced reads were aligned to the human genome version hg19 using TopHat (Trapnell et al. 2012). Cufflinks and Cuffdiff were used to quantify transcripts and determine differential expression (Trapnell et al. 2012).

Ribo-seq

Ribo-seq was performed as described in (Ingolia et al. 2012) from single 10 cm plates of Flp-In T-REx HEK293 (either wt or CNBP KO). Small RNA cDNA libraries from ribosome footprints were constructed as in (Hafner et al. 2012b) and sequenced on an Illumina HiSeq 2500 platform. Sequenced reads were aligned to the human genome version hg19 using TopHat (Trapnell et al. 2012). RNAcounter was used to quantify transcript-mapped footprints. Overlaps of CNBP cluster and different genomic regions were calculated with BEDTools (Quinlan and Hall 2010). Data extraction, formatting, subgrouping, and preparation for analysis were all facilitated with customized scripts for Bash, Python, Java, and R.

Statistical methods

Equality of distributions in the empirical cumulative distribution functions for changes in transcript abundance (Fig. 4A) and changes in translational efficiency (Figs. 4C,D and 5C,D) was tested using the two-sample Kolmogorov-Smirnov test (ks.test command in the R statistical software package). Statistical significance of differences in luciferase reporter gene expression upon CNBP knockdowns (Fig. 5B) was assessed using a two-tailed t-test using the Graphpad Prism software.

Luciferase reporter assay

HEK293 cells were cultured at a density of 60–80% in 12 well plates and transfected with CNBP or control siRNA using Lipofectamine RNAiMax (Life Technologies). 72 hours post transfection cell were transfected with a dual luciferase plasmid (psiCHECK-2) containing wild type or mutant binding sequences (Figure S7A,B) using Lipofectamine 2000 (Life Technologies). 6 hours later cell lysates were generated using Promega passive lysis buffer and luciferase activity was measured using the Dual-luciferase reporter assay system (Promega) on a TriStar LB941 luminometer (Berthold Technologies).

Selective 2′-hydroxyl acetylation analyzed by primer extension (SHAPE)

SHAPE was performed as previously described (Wilkinson et al. 2006) with minor modifications described in the Supplemental Experimental procedures.

CD measurement

Oligonucleotides forming G4 structures were incubated with increasing concentrations of recombinant CNBP isoform 3 and CD measurement was performed using a Jasco J-815 spectropolarimeter with readings were recorded over a wavelength range of 200–350 nm. As control a mutant sequence incapable of forming G4 structure was used. (G4: AAAAAAAAAAGGGGGAGCTGGGGTAGATGGGAATGTGAGGG, control AAAAAAAAAAGCGCGAGCTGCGCTAGATGCGAATGTGAGCG).

Supplementary Material

NIHMS857597-supplement-1.docx^{(15.3MB, docx)}

NIHMS857597-supplement-2.xlsx^{(947.1KB, xlsx)}

NIHMS857597-supplement-3.xlsx^{(4.4MB, xlsx)}

NIHMS857597-supplement-4.xlsx^{(4.6MB, xlsx)}

NIHMS857597-supplement-5.xlsx^{(4MB, xlsx)}

NIHMS857597-supplement-6.xlsx^{(496.8KB, xlsx)}

Highlights.

-
CNBP is a cytoplasmic RNA binding protein binding mature mRNAs at G-rich elements.
-
CNBP binds at sites forming G quadruplex and other stable structures in vitro.
-
CNBP prevents formation of G quadruplex structures in vitro.
-
CNBP increases translational efficiency on its target mRNAs on a global scale.

Acknowledgments

We thank the NIAMS Genomics Core Facility and Gustavo Gutierrez-Cruz and Dr. Stefania Dell’Orso (NIAMS/NIH) for sequencing support. We also thank Jyotishman Veepaschit (Biocenter, Würzburg) for help with purification of CNBP and Dr. Noam Stern-Ginossar (Weizmann Institute of Science, Rehovot) for help with the ribosome profiling protocol. We also want to acknowledge Wesley Browne (Feringa lab, Zernike, Groningen) for help with the CD measurement. This work was supported by the Intramural Research Program of the National Institute for Arthritis and Musculoskeletal and Skin Disease.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

AUTHOR CONTRIBUTIONS

Conceptualization, M.H. and S.A.J.; Methodology, S.K.G., D.B., C.H.D., K.P., M.H., S.A.J.; Software, S.G. and H.-W.S.; Investigation, S.K.G., D.B., C.H.D., H.G.K.; Formal Analysis, D.B., S.G., H.-W.S., and M.H.; Writing - Original Draft, M.H. and S.A.J.; Writing - Review & Editing, D.B., M.H., S.A.J.; Funding Acquisition, M.H. and S.A.J.; Resources, M.H. and S.A.J.; Supervision, M.H. and S.A.J.

ACCESSION CODES

The PAR-CLIP and RNA-seq sequence data have been deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive under accession numbers GSE76193 and GSE76627, respectively.

COMPETING FINANCIAL INTERESTS

The authors declare no competing financial interests.

References

Armas P, Nasif S, Calcaterra NB. Cellular nucleic acid binding protein binds G-rich single-stranded nucleic acids and may function as a nucleic acid chaperone. J Cell Biochem. 2008;103:1013–1036. doi: 10.1002/jcb.21474. [DOI] [PubMed] [Google Scholar]
Ascano M, Jr, Hafner M, Cekan P, Gerstberger S, Tuschl T. Identification of RNA-protein interaction networks using PAR-CLIP. WIREs RNA. 2012a;3:159–177. doi: 10.1002/wrna.1103. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ascano M, Jr, Mukherjee N, Bandaru P, Miller JB, Nusbaum JD, Corcoran DL, Langlois C, Munschauer M, Dewell S, Hafner M, et al. FMRP targets distinct mRNA sequence elements to regulate protein expression. Nature. 2012b;492:382–386. doi: 10.1038/nature11737. [DOI] [PMC free article] [PubMed] [Google Scholar]
Baltz A, Munschauer M, Schwanhausser B, Drew K, Wyler E, Bonneau R, Selbach M, Dieterich C, Landthaler M. The mRNA-Bound Proteome and Its Global Occupancy Profile on Protein-Coding Transcripts. Mol Cell. 2012;46:674–690. doi: 10.1016/j.molcel.2012.05.021. [DOI] [PubMed] [Google Scholar]
Bazzini AA, Lee MT, Giraldez AJ. Ribosome profiling shows that miR-430 reduces translation before causing mRNA decay in zebrafish. Science. 2012;336:233–237. doi: 10.1126/science.1215704. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bochman ML, Paeschke K, Zakian VA. DNA secondary structures: stability and function of G-quadruplex structures. Nature Reviews Genetics. 2012;13:770–780. doi: 10.1038/nrg3296. [DOI] [PMC free article] [PubMed] [Google Scholar]
Borgognone M, Armas P, Calcaterra NB. Cellular nucleic-acid-binding protein, a transcriptional enhancer of c-Myc, promotes the formation of parallel G-quadruplexes. Biochem J. 2010;428:491–498. doi: 10.1042/BJ20100038. [DOI] [PubMed] [Google Scholar]
Calcaterra NB, Palatnik JF, Bustos DM, Arranz SE, Cabada MO. Identification of mRNA-binding proteins during development: characterization of Bufo arenarum cellular nucleic acid binding protein. Dev Growth Differ. 1999;41:183–191. doi: 10.1046/j.1440-169x.1999.00414.x. [DOI] [PubMed] [Google Scholar]
Castello A, Fischer B, Eichelbaum K, Horos R, Beckmann BM, Strein C, Davey NE, Humphreys DT, Preiss T, Steinmetz LM, et al. Insights into RNA biology from an atlas of mammalian mRNA-binding proteins. Cell. 2012;149:1393–1406. doi: 10.1016/j.cell.2012.04.031. [DOI] [PubMed] [Google Scholar]
Chen S, Su L, Qiu J, Xiao N, Lin J, Tan J-H, Ou T-M, Gu L-Q, Huang Z-S, Li D. Mechanistic studies for the role of cellular nucleic-acid-binding protein (CNBP) in regulation of c-myc transcription. Biochim Biophys Acta. 2013;1830:4769–4777. doi: 10.1016/j.bbagen.2013.06.007. [DOI] [PubMed] [Google Scholar]
Chen W, Liang Y, Deng W, Shimizu K, Ashique AM, Li E, Li Y-P. The zinc-finger protein CNBP is required for forebrain formation in the mouse. Development. 2003;130:1367–1379. doi: 10.1242/dev.00349. [DOI] [PubMed] [Google Scholar]
Corcoran DL, Georgiev S, Mukherjee N, Gottwein E, Skalsky RL, Keene JD, Ohler U. PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data. Genome Biology. 2011;12:R79. doi: 10.1186/gb-2011-12-8-r79. [DOI] [PMC free article] [PubMed] [Google Scholar]
Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, et al. The Pfam protein families database. Nucleic Acids Research. 2010;38:D211–22. doi: 10.1093/nar/gkp985. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gagnon KT, Li L, Janowski BA, Corey DR. Analysis of nuclear RNA interference in human cells by subcellular fractionation and Argonaute loading. Nat Prot. 2014;9:2045–2060. doi: 10.1038/nprot.2014.135. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gerbasi VR, Link AJ. The myotonic dystrophy type 2 protein ZNF9 is part of an ITAF complex that promotes cap-independent translation. Mol Cell Proteomics. 2007;6:1049–1058. doi: 10.1074/mcp.M600384-MCP200. [DOI] [PubMed] [Google Scholar]
Gerstberger S, Hafner M, Tuschl T. A census of human RNA-binding proteins. Nature Reviews Genetics. 2014;15:829–845. doi: 10.1038/nrg3813. [DOI] [PMC free article] [PubMed] [Google Scholar]
Graf R, Munschauer M, Mastrobuoni G, Mayr F, Heinemann U, Kempa S, Rajewsky N, Landthaler M. Identification of LIN28B-bound mRNAs reveals features of target recognition and regulation. RNA Biol. 2013;10:1146–1159. doi: 10.4161/rna.25194. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guo H, Ingolia NT, Weissman JS, Bartel DP. Mammalian microRNAs predominantly act to decrease target mRNA levels. Nature. 2010;466:835–840. doi: 10.1038/nature09267. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guo JU, Bartel DP. RNA G-quadruplexes are globally unfolded in eukaryotic cells and depleted in bacteria. Science. 2016;353:aaf5371. doi: 10.1126/science.aaf5371. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, Rothballer A, Ascano M, Jr, Jungkamp A-C, Munschauer M, et al. PAR-CliP–a method to identify transcriptome-wide the binding sites of RNA binding proteins. J Vis Exp. 2010a doi: 10.3791/2034. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, Rothballer A, Ascano M, Jr, Jungkamp A-C, Munschauer M, et al. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell. 2010b;141:129–141. doi: 10.1016/j.cell.2010.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hafner M, Lianoglou S, Tuschl T, Betel D. Genome-wide identification of miRNA targets by PAR-CLIP. Methods. 2012a;58:94–105. doi: 10.1016/j.ymeth.2012.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hafner M, Renwick N, Farazi TA, Mihailovic A, Pena JTG, Tuschl T. Barcoded cDNA library preparation for small RNA profiling by next-generation sequencing. Methods. 2012b;58:164–170. doi: 10.1016/j.ymeth.2012.07.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hall TMT. Multiple modes of RNA recognition by zinc finger proteins. Curr Opin Struct Biol. 2005;15:367–373. doi: 10.1016/j.sbi.2005.04.004. [DOI] [PubMed] [Google Scholar]
Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Holcik M, Sonenberg N. Translational control in stress and apoptosis. Nat Rev Mol Cell Biol. 2005;6:318–327. doi: 10.1038/nrm1618. [DOI] [PubMed] [Google Scholar]
Huichalaf C, Schoser B, Schneider-Gold C, Jin B, Sarkar P, Timchenko L. Reduction of the rate of protein translation in patients with myotonic dystrophy 2. J Neurosci. 2009;29:9042–9049. doi: 10.1523/JNEUROSCI.1983-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
Iadevaia V, Caldarola S, Tino E, Amaldi F, Loreni F. All translation elongation factors and the e, f, and h subunits of translation initiation factor 3 are encoded by 5′-terminal oligopyrimidine (TOP) mRNAs. RNA. 2008;14:1730–1736. doi: 10.1261/rna.1037108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ingolia NT, Brar GA, Rouskin S, McGeachy AM, Weissman JS. The ribosome profiling strategy for monitoring translation in vivo by deep sequencing of ribosome-protected mRNA fragments. Nat Prot. 2012;7:1534–1550. doi: 10.1038/nprot.2012.086. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ingolia NT, Ghaemmaghami S, Newman JRS, Weissman JS. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–223. doi: 10.1126/science.1168978. [DOI] [PMC free article] [PubMed] [Google Scholar]
Klug A, Rhodes D. Zinc fingers: a novel protein fold for nucleic acid recognition. Cold Spring Harb Symp Quant Biol. 1987;52:473–482. doi: 10.1101/sqb.1987.052.01.054. [DOI] [PubMed] [Google Scholar]
Kramer K, Sachsenberg T, Beckmann BM, Qamar S, Boon K-L, Hentze MW, Kohlbacher O, Urlaub H. Photo-cross-linking and high-resolution mass spectrometry for assignment of RNA-binding sites in RNA-binding proteins. Nat Meth. 2014;11:1064–1070. doi: 10.1038/nmeth.3092. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kuhlmann SI, Valkov E, Stewart M. Structural basis for the molecular recognition of polyadenosine RNA by Nab2 Zn fingers. Nucleic Acids Research. 2014;42:672–680. doi: 10.1093/nar/gkt876. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kwok CK, Marsico G, Sahakyan AB, Chambers VS, Balasubramanian S. rG4-seq reveals widespread formation of G-quadruplex structures in the human transcriptome. Nat Meth. 2016;13:841–844. doi: 10.1038/nmeth.3965. [DOI] [PubMed] [Google Scholar]
Landthaler M, Gaidatzis D, Rothballer A, Chen PY, Soll SJ, Dinic L, Ojo T, Hafner M, Zavolan M, Tuschl T. Molecular characterization of human Argonaute-containing ribonucleoprotein complexes and their bound target mRNAs. RNA. 2008;14:2580–2596. doi: 10.1261/rna.1351608. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lee Y-J, Hsieh W-Y, Chen L-Y, Li C. Protein arginine methylation of SERBP1 by protein arginine methyltransferase 1 affects cytoplasmic/nuclear distribution. J Cell Biochem. 2012;113:2721–2728. doi: 10.1002/jcb.24151. [DOI] [PubMed] [Google Scholar]
Liquori CL, Ricker K, Moseley ML, Jacobsen JF, Kress W, Naylor SL, Day JW, Ranum LP. Myotonic dystrophy type 2 caused by a CCTG expansion in intron 1 of ZNF9. Science. 2001;293:864–867. doi: 10.1126/science.1062125. [DOI] [PubMed] [Google Scholar]
Lunde BM, Lunde BM, Moore C, Moore C, Varani G, Varani G. RNA-binding proteins: modular design for efficient function. Nat Rev Mol Cell Biol. 2007;8:479–490. doi: 10.1038/nrm2178. [DOI] [PMC free article] [PubMed] [Google Scholar]
McBride AE, Conboy AK, Brown SP, Ariyachet C, Rutledge KL. Specific sequences within arginine-glycine-rich domains affect mRNA-binding protein function. Nucleic Acids Research. 2009;37:4322–4330. doi: 10.1093/nar/gkp349. [DOI] [PMC free article] [PubMed] [Google Scholar]
Michelotti EF, Tomonaga T, Krutzsch H, Levens D. Cellular nucleic acid binding protein regulates the CT element of the human c-myc protooncogene. J Biol Chem. 1995;270:9494–9499. doi: 10.1074/jbc.270.16.9494. [DOI] [PubMed] [Google Scholar]
Millevoi S, Moine H, Vagner S. G-quadruplexes in RNA biology. WIREs RNA. 2012;3:495–507. doi: 10.1002/wrna.1113. [DOI] [PubMed] [Google Scholar]
Mortimer SA, Kidwell MA, Doudna JA. Insights into RNA structure and function from genome-wide studies. Nature Reviews Genetics. 2014;15:469–479. doi: 10.1038/nrg3681. [DOI] [PubMed] [Google Scholar]
Murn J, Teplova M, Zarnack K, Shi Y, Patel DJ. Recognition of distinct RNA motifs by the clustered CCCH zinc fingers of neuronal protein Unkempt. Nat Struct Mol Biol. 2015;23:16–23. doi: 10.1038/nsmb.3140. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nam Y, Chen C, Gregory RI, Chou JJ, Sliz P. Molecular basis for interaction of let-7 microRNAs with Lin28. Cell. 2011;147:1080–1091. doi: 10.1016/j.cell.2011.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ong S-E, Foster LJ, Mann M. Mass spectrometric-based approaches in quantitative proteomics. Methods. 2003;29:124–130. doi: 10.1016/s1046-2023(02)00303-1. [DOI] [PubMed] [Google Scholar]
Pace NJ, Weerapana E. Zinc-binding cysteines: diverse functions and structural motifs. Biomolecules. 2014;4:419–434. doi: 10.3390/biom4020419. [DOI] [PMC free article] [PubMed] [Google Scholar]
Paeschke K, Bochman ML, Garcia PD, Cejka P, Friedman KL, Kowalczykowski SC, Zakian VA. Pif1 family helicases suppress genome instability at G-quadruplex motifs. Nature. 2013;497:458–462. doi: 10.1038/nature12149. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pellizzoni L, Lotti F, Maras B, Pierandrei-Amaldi P. Cellular nucleic acid binding protein binds a conserved region of the 5′ UTR of Xenopus laevis ribosomal protein mRNAs. J Mol Biol. 1997;267:264–275. doi: 10.1006/jmbi.1996.0888. [DOI] [PubMed] [Google Scholar]
Piskounova E, Viswanathan SR, Janas M, LaPierre RJ, Daley GQ, Sliz P, Gregory RI. Determinants of MicroRNA Processing Inhibition by the Developmentally Regulated RNA-binding Protein Lin28. J Biol Chem. 2008;283:21310–21314. doi: 10.1074/jbc.C800108200. [DOI] [PubMed] [Google Scholar]
Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rajavashisth T, Taylor A, Andalibi A, Svenson K, Lusis A. Identification of a zinc finger protein that binds to the sterol regulatory element. Science. 1989;245:640–643. doi: 10.1126/science.2562787. [DOI] [PubMed] [Google Scholar]
Ray D, Ha KC, Nie K, Zheng H, Hughes TR, Morris QD. RNAcompete methodology and application to determine sequence preferences of unconventional RNA-binding proteins. Methods. 2016;16:30472–30478. doi: 10.1016/j.ymeth.2016.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rhodes D, Lipps HJ. G-quadruplexes and their regulatory roles in biology. Nucleic Acids Research. 2015;43:8627–8637. doi: 10.1093/nar/gkv862. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rojas M, Farr GW, Fernandez CF, Lauden L, McCormack JC, Wolin SL. Yeast Gis2 and its human ortholog CNBP are novel components of stress-induced RNP granules. In: Stoecklin G, editor. PLoS ONE. Vol. 7. 2012. p. e52824. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sammons MA, Samir P, Link AJ. Saccharomyces cerevisiae Gis2 interacts with the translation machinery and is orthogonal to myotonic dystrophy type 2 protein ZNF9. Biochem Biophys Res Commun. 2011;406:13–19. doi: 10.1016/j.bbrc.2011.01.086. [DOI] [PubMed] [Google Scholar]
Sharma K, Hrle A, Kramer K, Sachsenberg T, Staals RHJ, Randau L, Marchfelder A, van der Oost J, Kohlbacher O, Conti E, et al. Analysis of protein-RNA interactions by UV-induced cross-linking and mass spectrometry. Methods. 2015;89:138–148. doi: 10.1016/j.ymeth.2015.06.005. [DOI] [PubMed] [Google Scholar]
Shimizu K, Chen W, Ashique AM, Moroi R, Li Y-P. Molecular cloning, developmental expression, promoter analysis and functional characterization of the mouse CNBP gene. Gene. 2003;307:51–62. doi: 10.1016/s0378-1119(03)00406-2. [DOI] [PubMed] [Google Scholar]
Spitzer J, Landthaler M, Tuschl T. Rapid creation of stable mammalian cell lines for regulated expression of proteins using the Gateway® recombination cloning technology and Flp-In T-REx® lines. Meth Enzymol. 2013;529:99–124. doi: 10.1016/B978-0-12-418687-3.00008-2. [DOI] [PubMed] [Google Scholar]
Thandapani P, Song J, Gandin V, Cai Y, Rouleau SG, Garant J-M, Boisvert F-M, Yu Z, Perreault J-P, Topisirovic I, et al. Aven recognition of RNA G-quadruplexes regulates translation of the mixed lineage leukemia protooncogenes. In: Green R, editor. eLife. Vol. 4. 2015. p. e06234. [DOI] [PMC free article] [PubMed] [Google Scholar]
Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Prot. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vaquerizas JM, Kummerfeld SK, Teichmann SA, Luscombe NM. A census of human transcription factors: function, expression and evolution. Nature Reviews Genetics. 2009;10:252–263. doi: 10.1038/nrg2538. [DOI] [PubMed] [Google Scholar]
Walhout AJ, Temple GF, Brasch MA, Hartley JL, Lorson MA, van den Heuvel S, Vidal M. GATEWAY recombinational cloning: application to the cloning of large numbers of open reading frames or ORFeomes. Meth Enzymol. 2000;328:575–592. doi: 10.1016/s0076-6879(00)28419-x. [DOI] [PubMed] [Google Scholar]
Wan Y, Kertesz M, Spitale RC, Segal E, Chang HY. Understanding the transcriptome through RNA structure. Nature Reviews Genetics. 2011;12:641–655. doi: 10.1038/nrg3049. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wei H-M, Hu H-H, Chang G-Y, Lee Y-J, Li Y-C, Chang H-H, Li C. Arginine methylation of the cellular nucleic acid binding protein does not affect its subcellular localization but impedes RNA binding. FEBS Let. 2014;588:1542–1548. doi: 10.1016/j.febslet.2014.03.052. [DOI] [PubMed] [Google Scholar]
Wilbert ML, Huelga SC, Kapeli K, Stark TJ, Liang TY, Chen SX, Yan BY, Nathanson JL, Hutt KR, Lovci MT, et al. LIN28 binds messenger RNAs at GGAGA motifs and regulates splicing factor abundance. Mol Cell. 2012;48:195–206. doi: 10.1016/j.molcel.2012.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wilkinson KA, Merino EJ, Weeks KM. Selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE): quantitative RNA structure analysis at single nucleotide resolution. Nat Prot. 2006;1:1610–1616. doi: 10.1038/nprot.2006.249. [DOI] [PubMed] [Google Scholar]
Wilson D, Charoensawan V, Kummerfeld SK, Teichmann SA. DBD–taxonomically broad transcription factor predictions: new content and functionality. Nucleic Acids Research. 2007;36:D88–D92. doi: 10.1093/nar/gkm964. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wolfe AL, Singh K, Zhong Y, Drewe P, Rajasekhar VK, Sanghvi VR, Mavrakis KJ, Jiang M, Roderick JE, Van der Meulen J, et al. RNA G-quadruplexes cause eIF4A-dependent oncogene translation in cancer. Nature. 2014;513:65–70. doi: 10.1038/nature13485. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHMS857597-supplement-1.docx^{(15.3MB, docx)}

NIHMS857597-supplement-2.xlsx^{(947.1KB, xlsx)}

NIHMS857597-supplement-3.xlsx^{(4.4MB, xlsx)}

NIHMS857597-supplement-4.xlsx^{(4.6MB, xlsx)}

NIHMS857597-supplement-5.xlsx^{(4MB, xlsx)}

NIHMS857597-supplement-6.xlsx^{(496.8KB, xlsx)}

[R1] Armas P, Nasif S, Calcaterra NB. Cellular nucleic acid binding protein binds G-rich single-stranded nucleic acids and may function as a nucleic acid chaperone. J Cell Biochem. 2008;103:1013–1036. doi: 10.1002/jcb.21474. [DOI] [PubMed] [Google Scholar]

[R2] Ascano M, Jr, Hafner M, Cekan P, Gerstberger S, Tuschl T. Identification of RNA-protein interaction networks using PAR-CLIP. WIREs RNA. 2012a;3:159–177. doi: 10.1002/wrna.1103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] Ascano M, Jr, Mukherjee N, Bandaru P, Miller JB, Nusbaum JD, Corcoran DL, Langlois C, Munschauer M, Dewell S, Hafner M, et al. FMRP targets distinct mRNA sequence elements to regulate protein expression. Nature. 2012b;492:382–386. doi: 10.1038/nature11737. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] Baltz A, Munschauer M, Schwanhausser B, Drew K, Wyler E, Bonneau R, Selbach M, Dieterich C, Landthaler M. The mRNA-Bound Proteome and Its Global Occupancy Profile on Protein-Coding Transcripts. Mol Cell. 2012;46:674–690. doi: 10.1016/j.molcel.2012.05.021. [DOI] [PubMed] [Google Scholar]

[R5] Bazzini AA, Lee MT, Giraldez AJ. Ribosome profiling shows that miR-430 reduces translation before causing mRNA decay in zebrafish. Science. 2012;336:233–237. doi: 10.1126/science.1215704. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Bochman ML, Paeschke K, Zakian VA. DNA secondary structures: stability and function of G-quadruplex structures. Nature Reviews Genetics. 2012;13:770–780. doi: 10.1038/nrg3296. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] Borgognone M, Armas P, Calcaterra NB. Cellular nucleic-acid-binding protein, a transcriptional enhancer of c-Myc, promotes the formation of parallel G-quadruplexes. Biochem J. 2010;428:491–498. doi: 10.1042/BJ20100038. [DOI] [PubMed] [Google Scholar]

[R8] Calcaterra NB, Palatnik JF, Bustos DM, Arranz SE, Cabada MO. Identification of mRNA-binding proteins during development: characterization of Bufo arenarum cellular nucleic acid binding protein. Dev Growth Differ. 1999;41:183–191. doi: 10.1046/j.1440-169x.1999.00414.x. [DOI] [PubMed] [Google Scholar]

[R9] Castello A, Fischer B, Eichelbaum K, Horos R, Beckmann BM, Strein C, Davey NE, Humphreys DT, Preiss T, Steinmetz LM, et al. Insights into RNA biology from an atlas of mammalian mRNA-binding proteins. Cell. 2012;149:1393–1406. doi: 10.1016/j.cell.2012.04.031. [DOI] [PubMed] [Google Scholar]

[R10] Chen S, Su L, Qiu J, Xiao N, Lin J, Tan J-H, Ou T-M, Gu L-Q, Huang Z-S, Li D. Mechanistic studies for the role of cellular nucleic-acid-binding protein (CNBP) in regulation of c-myc transcription. Biochim Biophys Acta. 2013;1830:4769–4777. doi: 10.1016/j.bbagen.2013.06.007. [DOI] [PubMed] [Google Scholar]

[R11] Chen W, Liang Y, Deng W, Shimizu K, Ashique AM, Li E, Li Y-P. The zinc-finger protein CNBP is required for forebrain formation in the mouse. Development. 2003;130:1367–1379. doi: 10.1242/dev.00349. [DOI] [PubMed] [Google Scholar]

[R12] Corcoran DL, Georgiev S, Mukherjee N, Gottwein E, Skalsky RL, Keene JD, Ohler U. PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data. Genome Biology. 2011;12:R79. doi: 10.1186/gb-2011-12-8-r79. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, et al. The Pfam protein families database. Nucleic Acids Research. 2010;38:D211–22. doi: 10.1093/nar/gkp985. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] Gagnon KT, Li L, Janowski BA, Corey DR. Analysis of nuclear RNA interference in human cells by subcellular fractionation and Argonaute loading. Nat Prot. 2014;9:2045–2060. doi: 10.1038/nprot.2014.135. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Gerbasi VR, Link AJ. The myotonic dystrophy type 2 protein ZNF9 is part of an ITAF complex that promotes cap-independent translation. Mol Cell Proteomics. 2007;6:1049–1058. doi: 10.1074/mcp.M600384-MCP200. [DOI] [PubMed] [Google Scholar]

[R16] Gerstberger S, Hafner M, Tuschl T. A census of human RNA-binding proteins. Nature Reviews Genetics. 2014;15:829–845. doi: 10.1038/nrg3813. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] Graf R, Munschauer M, Mastrobuoni G, Mayr F, Heinemann U, Kempa S, Rajewsky N, Landthaler M. Identification of LIN28B-bound mRNAs reveals features of target recognition and regulation. RNA Biol. 2013;10:1146–1159. doi: 10.4161/rna.25194. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] Guo H, Ingolia NT, Weissman JS, Bartel DP. Mammalian microRNAs predominantly act to decrease target mRNA levels. Nature. 2010;466:835–840. doi: 10.1038/nature09267. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] Guo JU, Bartel DP. RNA G-quadruplexes are globally unfolded in eukaryotic cells and depleted in bacteria. Science. 2016;353:aaf5371. doi: 10.1126/science.aaf5371. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, Rothballer A, Ascano M, Jr, Jungkamp A-C, Munschauer M, et al. PAR-CliP–a method to identify transcriptome-wide the binding sites of RNA binding proteins. J Vis Exp. 2010a doi: 10.3791/2034. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, Rothballer A, Ascano M, Jr, Jungkamp A-C, Munschauer M, et al. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell. 2010b;141:129–141. doi: 10.1016/j.cell.2010.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] Hafner M, Lianoglou S, Tuschl T, Betel D. Genome-wide identification of miRNA targets by PAR-CLIP. Methods. 2012a;58:94–105. doi: 10.1016/j.ymeth.2012.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] Hafner M, Renwick N, Farazi TA, Mihailovic A, Pena JTG, Tuschl T. Barcoded cDNA library preparation for small RNA profiling by next-generation sequencing. Methods. 2012b;58:164–170. doi: 10.1016/j.ymeth.2012.07.030. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] Hall TMT. Multiple modes of RNA recognition by zinc finger proteins. Curr Opin Struct Biol. 2005;15:367–373. doi: 10.1016/j.sbi.2005.04.004. [DOI] [PubMed] [Google Scholar]

[R25] Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] Holcik M, Sonenberg N. Translational control in stress and apoptosis. Nat Rev Mol Cell Biol. 2005;6:318–327. doi: 10.1038/nrm1618. [DOI] [PubMed] [Google Scholar]

[R27] Huichalaf C, Schoser B, Schneider-Gold C, Jin B, Sarkar P, Timchenko L. Reduction of the rate of protein translation in patients with myotonic dystrophy 2. J Neurosci. 2009;29:9042–9049. doi: 10.1523/JNEUROSCI.1983-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] Iadevaia V, Caldarola S, Tino E, Amaldi F, Loreni F. All translation elongation factors and the e, f, and h subunits of translation initiation factor 3 are encoded by 5′-terminal oligopyrimidine (TOP) mRNAs. RNA. 2008;14:1730–1736. doi: 10.1261/rna.1037108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] Ingolia NT, Brar GA, Rouskin S, McGeachy AM, Weissman JS. The ribosome profiling strategy for monitoring translation in vivo by deep sequencing of ribosome-protected mRNA fragments. Nat Prot. 2012;7:1534–1550. doi: 10.1038/nprot.2012.086. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] Ingolia NT, Ghaemmaghami S, Newman JRS, Weissman JS. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–223. doi: 10.1126/science.1168978. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] Klug A, Rhodes D. Zinc fingers: a novel protein fold for nucleic acid recognition. Cold Spring Harb Symp Quant Biol. 1987;52:473–482. doi: 10.1101/sqb.1987.052.01.054. [DOI] [PubMed] [Google Scholar]

[R32] Kramer K, Sachsenberg T, Beckmann BM, Qamar S, Boon K-L, Hentze MW, Kohlbacher O, Urlaub H. Photo-cross-linking and high-resolution mass spectrometry for assignment of RNA-binding sites in RNA-binding proteins. Nat Meth. 2014;11:1064–1070. doi: 10.1038/nmeth.3092. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] Kuhlmann SI, Valkov E, Stewart M. Structural basis for the molecular recognition of polyadenosine RNA by Nab2 Zn fingers. Nucleic Acids Research. 2014;42:672–680. doi: 10.1093/nar/gkt876. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] Kwok CK, Marsico G, Sahakyan AB, Chambers VS, Balasubramanian S. rG4-seq reveals widespread formation of G-quadruplex structures in the human transcriptome. Nat Meth. 2016;13:841–844. doi: 10.1038/nmeth.3965. [DOI] [PubMed] [Google Scholar]

[R35] Landthaler M, Gaidatzis D, Rothballer A, Chen PY, Soll SJ, Dinic L, Ojo T, Hafner M, Zavolan M, Tuschl T. Molecular characterization of human Argonaute-containing ribonucleoprotein complexes and their bound target mRNAs. RNA. 2008;14:2580–2596. doi: 10.1261/rna.1351608. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] Lee Y-J, Hsieh W-Y, Chen L-Y, Li C. Protein arginine methylation of SERBP1 by protein arginine methyltransferase 1 affects cytoplasmic/nuclear distribution. J Cell Biochem. 2012;113:2721–2728. doi: 10.1002/jcb.24151. [DOI] [PubMed] [Google Scholar]

[R37] Liquori CL, Ricker K, Moseley ML, Jacobsen JF, Kress W, Naylor SL, Day JW, Ranum LP. Myotonic dystrophy type 2 caused by a CCTG expansion in intron 1 of ZNF9. Science. 2001;293:864–867. doi: 10.1126/science.1062125. [DOI] [PubMed] [Google Scholar]

[R38] Lunde BM, Lunde BM, Moore C, Moore C, Varani G, Varani G. RNA-binding proteins: modular design for efficient function. Nat Rev Mol Cell Biol. 2007;8:479–490. doi: 10.1038/nrm2178. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] McBride AE, Conboy AK, Brown SP, Ariyachet C, Rutledge KL. Specific sequences within arginine-glycine-rich domains affect mRNA-binding protein function. Nucleic Acids Research. 2009;37:4322–4330. doi: 10.1093/nar/gkp349. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] Michelotti EF, Tomonaga T, Krutzsch H, Levens D. Cellular nucleic acid binding protein regulates the CT element of the human c-myc protooncogene. J Biol Chem. 1995;270:9494–9499. doi: 10.1074/jbc.270.16.9494. [DOI] [PubMed] [Google Scholar]

[R41] Millevoi S, Moine H, Vagner S. G-quadruplexes in RNA biology. WIREs RNA. 2012;3:495–507. doi: 10.1002/wrna.1113. [DOI] [PubMed] [Google Scholar]

[R42] Mortimer SA, Kidwell MA, Doudna JA. Insights into RNA structure and function from genome-wide studies. Nature Reviews Genetics. 2014;15:469–479. doi: 10.1038/nrg3681. [DOI] [PubMed] [Google Scholar]

[R43] Murn J, Teplova M, Zarnack K, Shi Y, Patel DJ. Recognition of distinct RNA motifs by the clustered CCCH zinc fingers of neuronal protein Unkempt. Nat Struct Mol Biol. 2015;23:16–23. doi: 10.1038/nsmb.3140. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R44] Nam Y, Chen C, Gregory RI, Chou JJ, Sliz P. Molecular basis for interaction of let-7 microRNAs with Lin28. Cell. 2011;147:1080–1091. doi: 10.1016/j.cell.2011.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] Ong S-E, Foster LJ, Mann M. Mass spectrometric-based approaches in quantitative proteomics. Methods. 2003;29:124–130. doi: 10.1016/s1046-2023(02)00303-1. [DOI] [PubMed] [Google Scholar]

[R46] Pace NJ, Weerapana E. Zinc-binding cysteines: diverse functions and structural motifs. Biomolecules. 2014;4:419–434. doi: 10.3390/biom4020419. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R47] Paeschke K, Bochman ML, Garcia PD, Cejka P, Friedman KL, Kowalczykowski SC, Zakian VA. Pif1 family helicases suppress genome instability at G-quadruplex motifs. Nature. 2013;497:458–462. doi: 10.1038/nature12149. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R48] Pellizzoni L, Lotti F, Maras B, Pierandrei-Amaldi P. Cellular nucleic acid binding protein binds a conserved region of the 5′ UTR of Xenopus laevis ribosomal protein mRNAs. J Mol Biol. 1997;267:264–275. doi: 10.1006/jmbi.1996.0888. [DOI] [PubMed] [Google Scholar]

[R49] Piskounova E, Viswanathan SR, Janas M, LaPierre RJ, Daley GQ, Sliz P, Gregory RI. Determinants of MicroRNA Processing Inhibition by the Developmentally Regulated RNA-binding Protein Lin28. J Biol Chem. 2008;283:21310–21314. doi: 10.1074/jbc.C800108200. [DOI] [PubMed] [Google Scholar]

[R50] Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] Rajavashisth T, Taylor A, Andalibi A, Svenson K, Lusis A. Identification of a zinc finger protein that binds to the sterol regulatory element. Science. 1989;245:640–643. doi: 10.1126/science.2562787. [DOI] [PubMed] [Google Scholar]

[R52] Ray D, Ha KC, Nie K, Zheng H, Hughes TR, Morris QD. RNAcompete methodology and application to determine sequence preferences of unconventional RNA-binding proteins. Methods. 2016;16:30472–30478. doi: 10.1016/j.ymeth.2016.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R53] Rhodes D, Lipps HJ. G-quadruplexes and their regulatory roles in biology. Nucleic Acids Research. 2015;43:8627–8637. doi: 10.1093/nar/gkv862. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R54] Rojas M, Farr GW, Fernandez CF, Lauden L, McCormack JC, Wolin SL. Yeast Gis2 and its human ortholog CNBP are novel components of stress-induced RNP granules. In: Stoecklin G, editor. PLoS ONE. Vol. 7. 2012. p. e52824. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R55] Sammons MA, Samir P, Link AJ. Saccharomyces cerevisiae Gis2 interacts with the translation machinery and is orthogonal to myotonic dystrophy type 2 protein ZNF9. Biochem Biophys Res Commun. 2011;406:13–19. doi: 10.1016/j.bbrc.2011.01.086. [DOI] [PubMed] [Google Scholar]

[R56] Sharma K, Hrle A, Kramer K, Sachsenberg T, Staals RHJ, Randau L, Marchfelder A, van der Oost J, Kohlbacher O, Conti E, et al. Analysis of protein-RNA interactions by UV-induced cross-linking and mass spectrometry. Methods. 2015;89:138–148. doi: 10.1016/j.ymeth.2015.06.005. [DOI] [PubMed] [Google Scholar]

[R57] Shimizu K, Chen W, Ashique AM, Moroi R, Li Y-P. Molecular cloning, developmental expression, promoter analysis and functional characterization of the mouse CNBP gene. Gene. 2003;307:51–62. doi: 10.1016/s0378-1119(03)00406-2. [DOI] [PubMed] [Google Scholar]

[R58] Spitzer J, Landthaler M, Tuschl T. Rapid creation of stable mammalian cell lines for regulated expression of proteins using the Gateway® recombination cloning technology and Flp-In T-REx® lines. Meth Enzymol. 2013;529:99–124. doi: 10.1016/B978-0-12-418687-3.00008-2. [DOI] [PubMed] [Google Scholar]

[R59] Thandapani P, Song J, Gandin V, Cai Y, Rouleau SG, Garant J-M, Boisvert F-M, Yu Z, Perreault J-P, Topisirovic I, et al. Aven recognition of RNA G-quadruplexes regulates translation of the mixed lineage leukemia protooncogenes. In: Green R, editor. eLife. Vol. 4. 2015. p. e06234. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R60] Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Prot. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R61] Vaquerizas JM, Kummerfeld SK, Teichmann SA, Luscombe NM. A census of human transcription factors: function, expression and evolution. Nature Reviews Genetics. 2009;10:252–263. doi: 10.1038/nrg2538. [DOI] [PubMed] [Google Scholar]

[R62] Walhout AJ, Temple GF, Brasch MA, Hartley JL, Lorson MA, van den Heuvel S, Vidal M. GATEWAY recombinational cloning: application to the cloning of large numbers of open reading frames or ORFeomes. Meth Enzymol. 2000;328:575–592. doi: 10.1016/s0076-6879(00)28419-x. [DOI] [PubMed] [Google Scholar]

[R63] Wan Y, Kertesz M, Spitale RC, Segal E, Chang HY. Understanding the transcriptome through RNA structure. Nature Reviews Genetics. 2011;12:641–655. doi: 10.1038/nrg3049. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R64] Wei H-M, Hu H-H, Chang G-Y, Lee Y-J, Li Y-C, Chang H-H, Li C. Arginine methylation of the cellular nucleic acid binding protein does not affect its subcellular localization but impedes RNA binding. FEBS Let. 2014;588:1542–1548. doi: 10.1016/j.febslet.2014.03.052. [DOI] [PubMed] [Google Scholar]

[R65] Wilbert ML, Huelga SC, Kapeli K, Stark TJ, Liang TY, Chen SX, Yan BY, Nathanson JL, Hutt KR, Lovci MT, et al. LIN28 binds messenger RNAs at GGAGA motifs and regulates splicing factor abundance. Mol Cell. 2012;48:195–206. doi: 10.1016/j.molcel.2012.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R66] Wilkinson KA, Merino EJ, Weeks KM. Selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE): quantitative RNA structure analysis at single nucleotide resolution. Nat Prot. 2006;1:1610–1616. doi: 10.1038/nprot.2006.249. [DOI] [PubMed] [Google Scholar]

[R67] Wilson D, Charoensawan V, Kummerfeld SK, Teichmann SA. DBD–taxonomically broad transcription factor predictions: new content and functionality. Nucleic Acids Research. 2007;36:D88–D92. doi: 10.1093/nar/gkm964. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R68] Wolfe AL, Singh K, Zhong Y, Drewe P, Rajasekhar VK, Sanghvi VR, Mavrakis KJ, Jiang M, Roderick JE, Van der Meulen J, et al. RNA G-quadruplexes cause eIF4A-dependent oncogene translation in cancer. Nature. 2014;513:65–70. doi: 10.1038/nature13485. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

The human CCHC-type Zinc Finger Nucleic Acid Binding Protein binds G-rich elements in target mRNA coding sequences and promotes translation

Daniel Benhalevy

Sanjay K Gupta

Charles H Danan

Suman Ghosal

Hong-Wei Sun

Hinke G Kazemier

Katrin Paeschke

Markus Hafner

Stefan A Juranek

Summary

eTOC Blurb

INTRODUCTION

Figure 1. Overview of CNBP protein.

RESULTS

CNBP is a cytoplasmic protein consisting of multiple simultaneously expressed isoforms

CNBP interacts with mature mRNAs in G-rich regions

Figure 2. CNBP PAR-CLIP.

CNBP recognizes G-rich sequences in vitro

Figure 3. CNBP interacts with G-rich sequence elements in vitro.

CNBP presence slightly reduces target mRNA abundance

Figure 4. CNBP increases the ribosome density on its targets.

CNBP promotes translation of its targets

CNBP increases protein levels of target transcript

Figure 5. CNBP promotes translation of mRNA targets.

CNBP binds sites close to G4 and other stable RNA structures

DISCUSSION

EXPERIMENTAL PROCEDURES

Stable cell lines and their culture

Cell fractionation

PAR-CLIP

Recombinant protein expression, EMSA, and filter binding assay

CRISPR/Cas9 gene editing

RNA-seq

Ribo-seq

Statistical methods

Luciferase reporter assay

Selective 2′-hydroxyl acetylation analyzed by primer extension (SHAPE)

CD measurement

Supplementary Material

Highlights.

Acknowledgments

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases