Abstract
RNA–protein interactions are the structural and functional basis of significant numbers of RNA molecules. RNA–protein interaction assays though, still mainly depend on biochemical tests in vitro. Here, we establish a convenient and reliable RNA fluorescent three-hybrid (rF3H) method to detect/interrogate the interactions between RNAs and proteins in cells. A GFP tagged highly specific RNA trap is constructed to anchor the RNA of interest to an artificial or natural subcellular structure, and RNA–protein interactions can be detected and visualized by the enrichment of RNA binding proteins (RBPs) at these structures. Different RNA trapping systems are developed and detection of RNA–protein complexes at multiple subcellular structures are assayed. With this new toolset, interactions between proteins and mRNA or noncoding RNAs are characterized, including the interaction between a long noncoding RNA and an epigenetic modulator. Our approach provides a flexible and reliable method for the characterization of RNA–protein interactions in living cells.
Graphical Abstract
INTRODUCTION
The RNA in cells is commonly associated with RNA binding proteins (RBPs), which is required for the proper function of both RNAs and proteins. On the one hand, the processing, transport, function and stability of RNAs are modulated by the RBPs, e.g. mRNA processing and ribosome assembly require different groups of RBPs to accomplish these biological processes (1–3). On the other hand, proteins can also be functionally modulated by the binding of RNAs. A typical example is the widely applied CRISPR/Cas9 genome editing technique, in which binding of a guide RNA (gRNA) to the Cas9 protein modulates the conformation of Cas9 protein and activates its endonuclease activity on the targeted DNA sequences (4–7). Therefore, identification and characterization of the physical interactions between RNA and protein is the basis for revealing the function of RNAs and RBPs.
Being physically flexible and biochemically unstable, this intrinsic property of RNA molecules makes it difficult to identify the interaction between RNAs and proteins. Traditional approaches like electrophoretic mobility shift assay (EMSA) require purification of RNAs as well as proteins to identify their physical interaction by electrophoresis in vitro, which is in praxis complicated and with limited throughput. Other biochemical methods like immunoprecipitation, cross-linking and proximity-labeling, in combination with high throughput sequencing or mass spectrometry (8–11), have been developed and massively applied for screening of RNAs binding to protein or proteins binding to RNA, but it is still challenging to visualize and characterize the interaction inside living cells. Recently developed RNA visualization techniques, using fluorescent RNA aptamers (like Spinach, Broccoli etc.) (12,13) or RNA tags bound by specific proteins (like ms2, pp7 and λN22 etc.) (14–17), allow for imaging RNAs or RNA translation in cells. One of the most frequently used RNA tags, the bacteriophage ms2 RNA hairpin structure, which is specifically bound by the MS2 coat protein (MCP), is fused to RNAs of interest (ROIs), and the tagged ROIs can thus be visualized by the fluorescently labeled MCP. It is, however, still challenging to image RNA–protein interaction directly in cells as the detection is limited by the abundance of RNAs or proteins, as well as the generally low binding affinities between them (18).
To overcome these limitations, here we introduced a RNA fluorescence three-hybrid (rF3H) method for RNA–protein interaction analyses in cells. In this method, RNA molecules are recruited and anchored at specific subcellular structures by a designed RNA trap, and the interaction between the trapped RNAs and fluorescently labeled RBPs is visualized and identified via fluorescence co-localization at these subcellular structures. With this new method, we measured the interactions between proteins and different types of RNAs, and in particular studied the interaction between an epigenetic factor EZH2 protein and the HOTAIR non-coding RNA (ncRNA). Different RNA trapping systems and multiple cellular anchoring structures were also explored for broad applications of this tool. Our study established a fluorescence hybrid assay in mammalian cells, providing a flexible and reliable approach for the characterization of RNA–protein interactions.
MATERIALS AND METHODS
Plasmids
The information of plasmids constructed in this study was shown in Supplementary Figure S1, and the source of the fragments used were listed in Supplementary Table S1. In Generally, the MCP RNA trap plasmid pMCP-EGFP-LacI was constructed by replacing the GFP binder of pGBP-LacI plasmid (19) with ms2 coat protein (MCP) and enhanced green fluorescent protein (EGFP) open reading frames (ORF). All the other RNA trap plasmids were constructed on pMCP-EGFP-LacI. To construct RNA traps anchoring to the nuclear envelope, Cajal bodies, and genome loci, the LacI coding part of pMCP-EGFP-LacI was replaced by the ORFs for Lamin B1, Coilin and dCas9, respectively. The MCP was replaced by the artificially designed PUF domain and Lbu-dCas13a to get PUF and dCas13a mediated RNA traps.
The CMV cassette test RNA plasmids were developed on pEGFP-N1 (Clontech). The whole EGFP ORF was firstly replaced with 6 times of ms2 stem–loop sequences to construct the ms2 RNA plasmid. After that, 4 of 6 times ms2 stem–loops were replaced by 4 times of wildtype or mutant (GGAGCAGACGATGGCGTCGCTCC, synthesized by Eurofins) pp7 stem–loops, whole-length NORAD, HOTAIR 1–300 and its shortened fragments, as well as NORAD/HOTAIR 201–300 hybrid fragment, to get the respective ms2 tagged RNA plasmids. The PUF recognition sequence-tagged RNAs were constructed by inserting a 9-nt sequence (TGTTGTATA) to the 3′ end of ms2, pp7 and HOTAIR 1–300 sequences of their plasmids. And the U6 cassette ms2-pp7 RNA plasmid was constructed by cloning the RNA sequence from its CMV cassette plasmid into the U6 cassette of the pEX-A-u6 plasmid (20).
The test protein plasmids were also derived from pEGFP-N1. The EGFP fragment was replaced by mCherry ORF in the beginning, followed by cloning the ORFs coding for MCP, PCP, PABPC1, PUM2, EZH2, EZH2N (1–370 amino acids of whole-length EZH2 protein), EZH2C (371–751 amino acids of whole-length EZH2 protein) and the mutant proteins (MCP S47R, EZH2N T350A and EZH2N T350D) with a N-terminal nuclear localization sequence (NLS) to the upstream of mCherry. The mScarlet tagged PCP protein was constructed by replacing the mCherry ORF of the PCP-mCherry protein with a mScarlet-I ORF.
Cell culture, transfection and manipulation
BHK cells containing a genomic integration of multiple lacO sites (21) and HeLa cells were cultured in modified Eagle's medium (DMEM, Sigma) supplemented with 10% fetal bovine serum (FBS, Sigma) and 10 μg/ml gentamicin (Thermo Scientific); mouse J1 embryo stem cells were cultured in DMEM supplemented with 16% fetal bovine serum, 10 U/ml penicillin/streptomycin, 2 mM l-glutamine, 0.1 mM β-mercaptoethanol, 1 μM PD0325901, 3 μM CHIR99021 and 1000 U/ml LIF, 1× non-essential amino acid (NEAA, Thermo Scientific). All the cells were incubated at 37 °C in a humidified environment with 5% CO2.
Transient transfection was performed with Lipofectamine 3000 (Thermo Scientific) following the manufacturer's instruction. For one well of a six-well plate, in total 2.4 μg plasmid DNA (with the mass ratio of RNA trap: ROI: POI = 0.8 μg: 0.8 μg: 0.8 μg) was used for transfection. For transfection, 4 μl of Lipofectamine 3000 was diluted in 120 μl Opti-MEM (Thermo Scientific) in one tube and incubated for 5 min at room temperature. The three plasmids for rF3H (2.4 μg) as well as 4 μl of P3000 were diluted and mixed in another tube with 120 μl Opti-MEM. The contents of both tubes were then mixed gently and incubated for 10 min at room temperature. The mixtures were then added to cells drop by drop and the cells were put back into the incubator overnight.
For fixed cell imaging, cells were seeded on 18 mm × 18 mm coverslips. About 24 h after transfection, cells were firstly fixed with 3.7% formaldehyde in PBS for 10 min. The fixed cells were then stained with 1 μg/ml DAPI in PBS directly and then mounted onto slides with Vectashield mounting medium (Vector Laboratories).
For immunofluorescence, cells were fixed as described before and permeabilized with 0.25% Triton X-100 in PBS for 5 min, blocked with 1% BSA for 30 min, incubated with BSA diluted mouse anti-HA primary antibody (Abcam, ab18181) and Alexa Fluor 594 labeled donkey anti-mouse IgG secondary antibody (Abcam, ab150108), then counterstained with DAPI, and the samples were mounted as described above.
For RNA fluorescence in situ hybridization (FISH), a set of Cy5 labeled single stranded oligonucleotide probes (CTCTGCTGGTTTGTACAATC, AATGAACCCGGGAATACTGC, AGGAATTAGG TCCTTAGG, ATATCGTCTGCTCCTTTCTG, synthesized by Eurofins) that target pp7 sequence was used. After fixation and permeabilization of the cells, FISH was processed according to Vidisha's protocol (22).
For live-cell imaging, cells were pre-plated on an 8-wells μ-Slide (ibidi) 1 day before transfection. Nuclear staining was performed by adding 1μM SiR-DNA (Cytoskeleton) 30 min before imaging.
Fixed and live-cell imaging and quantification
Both fixed and live-cell imaging were carried out with an SP8 confocal microscope (Leica). A 405 nm diode laser was used for DAPI excitation, while the 488, 561, 594 and 647 nm beams from a 470–670 nm white laser were used for the excitations of EGFP, mCherry, Alexa Fluor 594, Cy5 and SiR-DNA. The emission of GFP was detected by a PMT sensor while the other three emissions were all received by HyD sensors. A 63x oil objective was chosen for imaging, and a sequential imaging method, which detects the fluorescence of DAPI, EGFP, mCherry, Alexa Fluor 594, Cy5 or SiR-DNA individually, was set as the default scanning method. Image analyses were performed with LAS X and ImageJ software.
As shown in Supplementary Figure S2, to quantify the relative fluorescence at lacO spots, the signal from DAPI channel was applied to partition the area of nucleus at first, the mean fluorescence intensities (average gray values) of the lacO spot and the whole nucleus from EGFP channel were calculated as GreenlacO and Greennucleus, respectively, and then the same process was utilized to get RedlacO and Rednucleus from mCherry channel. Both green and red signals of the whole nucleus were applied to normalize variation in expression levels in cells. The relative fluorescence at the lacO spots was calculated as follows.
For each experiment, the relative fluorescence was normalized by the control group without RNA.
RNA extraction and quantitative PCR
Nuclear extraction was performed as follows. Cells were detached by trypsin and harvested by centrifugation Cells were washed twice with cold PBS and the pellet was gently resuspended in 500 μl 1x hypotonic buffer (20 mM Tris–HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl2) by pipetting up and down several times and incubated on ice for 15 min. 20 μl detergent (10% NP40) was added and vortexed for 10 s. The homogenate was centrifuged for 10 min at 3000 rpm and at 4°C. The pellet corresponded to the nuclear fraction. The total and nuclear RNA were extracted with the NucleoSpin RNA kit (MACHEREY-NAGEL) following the manufacturer's instructions, and the synthesis of cDNA was performed with High-Capacity cDNA Reverse Transcription Kit (Applied Biosystems) as instructed by the manufacturer. The expression level of the ms2-pp7 RNA under the control of CMV and U6 cassettes were quantified by real-time PCR with the primers (Forward: 5′ ATATCTGCAGGTCGACTC 3′, Reverse: 5′ CTGCTCCTTTCTGAATTCC 3′).
Statistics
Student's t-test was applied for the experimental data. P < 0.05 was chosen as the limit of significance, and marked as * P < 0.05, ** P < 0.01, *** P < 0.001. All the relative fluorescence data were presented as scatter plots with arithmetic means ± standard deviations.
RESULTS AND DISCUSSION
Development of the RNA fluorescence three-hybrid (rF3H) assay
To visualize RNA–protein interactions, we designed an RNA fluorescence three-hybrid (rF3H) assay (Figure 1A). In this assay, a RNA trap, which consists of a MS2 coat protein (MCP), a Lac repressor (LacI), and an EGFP, is used to capture RNAs onto a bacterial lac operon (lacO) array that is integrated into the genome of a mammalian cell line. The MCP binds the ms2 stem–loop tagged RNA of interest (ROI) and anchors the RNAs at the genomic lacO loci via the fused LacI protein. This RNA trap, together with the trapped RNA of interest, is visualized as a fluorescent spot in cell nuclei via EGFP under a fluorescence microscope. The potential RNA binding protein (referred as protein of interest, POI) tagged with a red fluorescent protein (e.g. RFP or mCherry) is recruited to the lacO array by interacting with the trapped ROIs, and the RNA–protein interaction thereby can be identified by co-localization of the green and red fluorescence at the nuclear lacO spot.
As a proof-of-principle, we tested this rF3H strategy with a well-characterized RNA–protein interaction pair. We tagged the pp7 RNA with ms2 stem–loops and attempted to visualize the interaction between pp7 RNA and the pp7 coating protein (PCP). We tested the interaction between pp7 and PCP protein. When triply expressed in cells containing lacO array, the RNA trap together with the trapped pp7 RNA was clearly imaged as a nuclear spot marked by EGFP fluorescence. The enrichment of the mCherry-PCP at the lacO loci was also exclusively detected in the presence of pp7 RNAs, but not with the RNA trap itself, irrelevant RNAs and the mutant pp7 RNA (Figure 1B), which indicates an interaction between pp7 RNA and PCP protein. Quantification of the relative fluorescence intensity showed a significant (about two times higher) enrichment of PCP-mCherry at the lacO spot resulted by pp7-PCP interaction (Figure 1C), demonstrating the feasibility of this rF3H strategy for RNA–protein interaction detection. We applied RNA FISH with anti-pp7 probes to confirm the capture of ROI, as well as IF with anti-HA antibody to verify the aggregation of POI (Supplementary Figure S3A, B), which further demonstrates the feasibility of this method.
Optimization of the rF3H assay
To detect RNA–protein interactions sensitively and precisely, we optimized the stoichiometry for RNA trap and POI used in the rF3H assay. With a constant amount of plasmid for ROI transcription, we adjusted the amounts of plasmids for either RNA trap or RBP (POI). Under the conditions tested, although a higher RNA trap amount increased the relative signal at the lacO foci both in the presence and absence of the ROI, the enrichment ratio between the experimental and background binding control groups was basically constant (Supplementary Figure S4A, B), suggesting the amounts of RNA trap used here are all redundant. We then optimized the amount of the POI (PCP-mCherry) for the assay and found that the 0.2 ng PCP-mCherry group performed as good as the 0.4 ng group, both showed higher enrichment of the RBP than the 0.8 ng group (Supplementary Figure S4C, D). This optimized POI amount was applied for the subsequent studies.
Moreover, we tried to enhance the trapping efficiency of the MCP RNA trap by doubling the ms2 RNA binding unit, and the RNA trap with two tandem ms2 binding units (2MCP RNA trap) showed a slight improvement of the POI enrichment over the original MCP RNA trap (Supplementary Figure S5). We also tested mScarlet-I, a brighter fluorescent protein than mCherry, to label the test protein, which worked as good as the mCherry for the assay, but not showing enhancement of the measurement or sensitivity as both the signal and background readout were raised (Supplementary Figure S6).
Different types of RNAs are transcribed, processed, modified and located differently in cells, which may affect the measurement of rF3H assay. Therefore, we compared two RNA transcription cassettes with different properties. In the CMV cassette, a CMV promoter (transcribed with RNA polymerase II) together with the SV40 polyadenylation (poly(A)) signal was applied for target RNA transcription; and for the U6 cassette, the RNA was transcribed via a U6 promoter (transcribed with RNA polymerase III), which does not cause additional poly(A) modification on the transcribed RNAs (Supplementary Figure S7A). We observed that the RNA generated from the U6 cassette resulted in a better recruitment of the RBP to lacO foci than RNAs produced by the CMV cassette (Supplementary Figure S7B, C). After obtaining the total cellular RNA and nuclear RNA from CMV or U6 cassette transfected cells, we measured the amount of the test RNA in these two components by qPCR respectively. We found that while the total RNA produced by CMV and U6 cassettes were almost the same (Supplementary Figure S7D), the U6 products showed higher nuclear retention (Supplementary Figure S7E), which may explain the different performances of the two cassettes for the assay.
Visualization of mRNA–protein interactions with rF3H
The mRNA is a large category of RNAs coding for proteins, and the synthesis, processing and ribosomal translation of mRNAs require interacting and formation of complexes with multiple proteins (23,24). For example, the poly(A) binding family proteins (PABPs) recognize poly(A) sequences at the 3′ end of the mRNA, and binding of PABPs to mRNAs facilitates mRNA translation and regulates mRNA stability (25,26). To test the feasibility of the rF3H assay on mRNA–protein interactions, we checked the interaction between a polyadenylated mRNA mimic and PABP1. The mRNA mimic, containing 2 times ms2 stem–loop structures, was transcribed from a CMV cassette and, therefore, modified with 5′ cap and 3′ poly(A) tail (Figure 2A). The mRNA mimic was trapped at the lac operator array by the MCP RNA trap, and recruitment of the mCherry tagged PABPC1 to the lacO array was observed in the presence of ms2 mRNA mimic (Figure 2B, Supplementary Figure S8A), indicating a physical interaction between the mRNA mimic and PABP1. Image quantification clearly showed that the enrichment of PABP1 at the lacO array doubled in the presence of the mRNA mimic, which is significantly different in comparison to the control groups (Figure 2C). In addition, we tested the interaction between PABPC1-mCherry mRNA and its products PABPC1-mCherry protein (Figure 2D). Anchored to the lacO array via the fused ms2 stem–loops, the mRNA successfully recruited PABPC1-mCherry protein to the lacO spot (Figure 2E, F, Supplementary Figure S8B), confirming the interaction between protein coding mRNA and PABPC1 protein.
Interaction determination between non-coding RNA and protein
Long non-coding RNAs (lncRNAs) are non-coding RNAs with the length exceeding 200 nucleotides. Although thousands of lncRNAs are transcribed in the human genome, for most of them the functions are not known. The non-coding RNA activated by DNA damage (NORAD) is one conserved lncRNA transcribed in multiple species and is critical for the maintenance of genome stability (27,28). While NORAD depletion leads to premature aging and genome instability, the molecular mechanisms behind these phenotypes remain elusive. Sequence analysis indicated multiple PUMILIO family protein binding sites (with UGUANAAUA consensus sequence) in NORAD (Figure 2G). Therefore, we tested the interaction between the NORAD and PUMILIO 2 (PUM2) protein. We constructed a ms2 tagged NORAD and mCherry tagged PUM2 and measured their interaction with the rF3H assay. The co-localization of PUM2 and RNA trap was observed specifically when the NORAD RNA was transcribed (Figure 2H, Supplementary Figure S8C), and the quantification of mCherry-PUM2 showed a significant enrichment of the PUM2 protein at the lacO array recruited by NORAD RNA (Figure 2I), confirming the interaction between NORAD and PUM2 protein (29,30).
Characterization of the interaction between EZH2 and HOTAIR
Non-coding RNAs could also act as epigenetic regulators for modulation of gene expression. The HOX transcript antisense intergenic RNA (HOTAIR) is an ncRNA transcribed from the Homeobox C (HoxC) cluster (31). This ncRNA forms RNP complexes with certain epigenetic modulators, like the PRC2 complex, to regulate histone methylation and gene expression (Figure 3A). One of the major components of PRC2, the enhancer of zeste homolog 2 (EZH2) protein, has been shown to interact with the first 300 nucleotides of HOTAIR (32,33), and its N-terminal part was considered to play a critical role in RNA binding (34–37) (Figure 3B). To validate the interaction between HOTAIR and EZH2 (full-length, N- and C-terminal parts), we constructed mCherry labeled full-length as well as N-terminus (EZH2N) and C-terminus (EZH2C) of EZH2 and determined their interactions with ms2 tagged HOTAIR 1–300 nt (H300). Our results showed that both the EZH2N and EZH2C are recruited to the lacO spot by the H300 RNA fragment (Figure 3C, Supplementary Figure S9A) albeit EZH2C at a lower extent. EZH2N showed a comparable binding to H300 RNA as the full-length EZH2 (Figure 3D), which indicates that EZH2N preserves most RNA binding ability of the full-length EZH2 and likely plays a major role in HOTAIR binding, while EZH2C also kept some RNA binding ability, both results were similar to the discoveries in a previous study (36).
Post-transcriptional modifications, especially phosphorylation play crucial roles in regulation of protein activities. Prior studies showed that several residues of EZH2 could be modified by phosphorylation (38,39), and the phosphorylation of the threonine at the 350th position was proposed to affect its ncRNA binding activity (34). To find out whether this phosphorylation influences the interaction between EZH2N and HOTAIR H300 RNA, we constructed a phosphorylation mimic of EZH2N by substitution of threonine at 350th position with aspartic acid (T350D) as well as an alanine mutant (T350A) for non-phosphorylation mimic (Figure 3E). Both the phosphorylation and non-phosphorylation mimics bound HOTAIR H300 at the lacO site (Figure 3F, Supplementary Figure S9B), and the phosphorylation mimic T350D showed a higher binding ability to HOTAIR H300 (Figure 3G), providing the possibility that T350 phosphorylation may regulate RNA binding ability of EZH2N. As the RNA binding sites in EZH2N were considered distantly from this position (36), we believed the regulation was performed by some indirect effects such as conformational change caused by phosphorylation. EED, another major component of the PRC2 complex, was considered to facilitate EZH2-HOTAIR formation (32). Our approach demonstrated that EED is able to contribute to the binding between EZH2N and HOTAIR H300 RNA, and the phosphorylation of EZH2N T350 can further enhance this interaction (Figure 3H, I, Supplementary Figure S9C).
To narrow down the HOTAIR sequences responsible for EZH2 binding, we divided the HOTAIR H300 into smaller fragments and identify the interactions between them and EZH2 protein. As shown in Figure 4A–C, three basic RNA fragments (HOTAIR 1–100 nt, HOTAIR 101–200 nt and HOTAIR 201–300 nt) and two combinations (HOTAIR 1–200 nt and HOTAIR 101–300 nt) were tagged with ms2 loops and checked for their interactions with EZH2. All the truncations except the HOTAIR 1–100 nt one showed recruitment of EZH2-mCherry, revealing that the first 100 nucleotides of the HOTAIR are dispensable for EZH2 binding. Quantitative analyses further showed that the fragments containing 200–300 nt part had a higher enrichment of EZH2 than others, which indicates that the 200–300 nt part of HOTAIR is the major EZH2 binding region, consistent with a previous study (32). Furthermore, a chimeric NORAD/HOTAIR 201–300 RNA acquired the ability to bind EZH2, which further confirmed the result (Figure 4D, E).
The research between three different RNA–protein pairs showed the reliable application of this rF3H assay in RNA–protein interaction study. In comparison to traditional biochemical assays, this imaging-based method is more flexible, less labor intensive, quantitative and with high throughput. And more importantly, interaction information is obtained in individual cells, instead of the average of a whole-cell population as in traditional biochemical assays. Besides, RNA protein interaction can be analyzed in living cells under different culture conditions, therefore, characterization of the RNA–protein dynamics can be studied and issues like heterogeneity in a cell population or RNP formation/disassembly can possibly be addressed with this assay, providing deeper insights on RNA–protein interaction within the cellular context.
Detection of RNA–protein interactions at multiple cellular structures
The LacI fused RNA trap requires the special lacO array containing cell lines for anchoring of RNAs. To overcome this limitation, we developed a RNA trap that targets RNAs to the nuclear lamina by fusing the RNA binding unit to Lamin B1 protein, which is a major component of the nuclear lamina (Figure 5A) (40–42). The RNA molecules, in association with the RNA trap, were successfully anchored to the inner nuclear membrane (INM) in HeLa cells and the interactions between ms2 RNA and MCP protein could be detected on the nuclear envelope (Figure 5B, C), supporting an rF3H assay on natural subcellular structures.
Nuclear bodies are membrane-less organelles in the cell nucleus with multiple functions, including RNA processing (43–45). Next, we developed an RNA trap fused to Coilin protein, which is the major component of the Cajal nuclear bodies (46–49), to anchor the RNA of interest specifically there (Figure 5D). As expected, RNA traps were detected as fluorescent nuclear spots, and the interaction between ms2 RNA and MCP protein can also be visualized clearly on the Cajal nuclear bodies in HeLa cells (Figure 5E, F).
Catalytically deactivated Cas9 (dCas9) protein has been used for targeting and visualization of genomic loci (20,50,51). Taking the advantage of this versatile technique, we designed a RNA trap that applies dCas9 to anchor RNAs to genomic structures (Figure 5G). Guided by gRNAs targeting genomic major satellite repeats, this dCas9 fused RNA trap anchored ms2 RNA molecules to chromocenters in mouse embryonic stem cells (mESCs), being visualized as multiple nuclear spots (Figure 5H, I). Accordingly, the mCherry tagged MCP protein was recruited to chromocenters by the trapped ms2 RNAs, showing a successful detection of RNA–protein interactions with this dCas9 based RNA trap.
All together, RNA traps anchoring to these multiple subcellular structures expanded the applicability of the rF3H assay, suitable for characterization of the interaction between RNAs and proteins with different properties.
Development of PUF and dCas13a mediated RNA trapping systems
Besides ms2-MCP RNA binding pair, RBPs which bind RNA in a sequence-specific manner with high binding ability could also be engineered for trapping of RNAs. As we have shown previously, the PUMILIO family proteins (PUMs) bind to RNAs containing the PUM binding sequences via the Pumilio and FBF homology (PUF) domain. The PUF domain typically consists of multiple tri-helix motifs, and the crystal structure showed that each of the tri-helix motifs recognizes and binds one nucleotide, which makes the PUF domain a suitable candidate for RNA binding engineering (Figure 6A) (52–54). Using a designed synthetic PUF domain that recognizes a nine-nucleotide sequence (UGUUGUAUA) (55), we constructed an RNA trap anchoring the corresponding target RNAs at the lacO spot. The ms2 RNAs tagged with the 9-nt PUF binding sequence were recruited and visualized by the GFP fused PUF RNA trap, and enrichment of the mCherry tagged MCP at the lacO array was detected by confocal imaging (Figure 6B, C, Supplementary Figure S10A). Although a nuclear localization signal was added to the PUF RNA trap fusion, part of the protein remains cytoplasmic. Nonetheless, enough protein is nuclear to score binding or lack thereof. Moreover, with this PUF RNA trap the reduced RNA binding ability of the S47R mutant MCP could also be detected (Figure 6D, E, Supplementary Figure S10B), which demonstrates the sensitive detection of RNA–protein interactions with this PUF based RNA trap.
The CRISPR/Cas system is a prokaryotic defense mechanism against foreign viral nucleic acids. Besides the Cas9 protein, which is a DNA nuclease, Cas proteins that cut RNAs have also been identified recently (56–58). Cas13a is one of such RNA-activated RNases which target and cut RNA molecules specifically under the control of small gRNAs (59,60). Similar to Cas9, catalytically deactivated Cas13a (dCas13a) has also been applied for RNA tracking and visualization in cells (61,62) (Figure 6G). Featured by flexible RNA targeting, we generated a Leptotrichia buccalis-sourced dCas13a (Lbu-dCas13a) derived RNA trap, together with the corresponding gRNA that recognizes a specific RNA sequence (GAU UCU AGA ACU AGU GGA UCC UAA GGU A) in the 5′ of ms2 RNAs. The dCas13a RNA trap showed successful trapping of target RNAs and detected RNA–protein interactions as shown by ms2-MCP interaction pair (Figure 6H, I, Supplementary Figure S10C), providing another potential tool for endogenous RNA capture.
In the beginning, our MCP RNA trap was designed for RNAs tagged with ms2 stem loops, now, the engineered PUF RNA trap, and also the dCas13a based RNA trap, have the potentials that broadly used for programmable targeting and binding of RNAs with ideally any sequences. These two types of RNA trapping systems offer valuable endogenous RNA targeting and trapping tools, allowing for flexible study of not only interactions but also the dynamics of endogenous RNAs and RNPs. Besides, multiple subcellular structures in both nuclei and cytosol were developed for RNA anchoring to fit different types of RNAs and protein. Furthermore, combining various subcellular localization components and programmable RNA catching tools, RNA trap could show the interaction between different RNA and proteins in suitable subcellular positions with immunofluorescence, which further expands the spectrum of its applications.
Detection of RNA–protein binding in living cells
As RNA–protein interactions are usually dynamic under physiological conditions, detection of the interaction in living cells may offer insights on the dynamic regulation of RNPs. To extend the rF3H assay from fixed cells to living cells, we tested the well characterized pp7-PCP interaction under live conditions. As shown in Supplementary Figure S11, enrichment of MCP RNA traps, as well as mCherry fused PCP at the lacO site in the nucleus could be observed in living cells. As homeostasis of RNP complexes is tightly controlled to ensure their proper functions in cells, the dynamics of RNA–protein interactions can provide more insights into their functions. Our RNA trap system and the rF3H assay allows us to study these dynamical RNA–protein binding processes under physiological conditions. In combination with other technologies such as Fluorescence Recovery After Photobleaching (FRAP), precise measurements of RNA–protein binding kinetics may offer quantitative data on these dynamic processes.
DATA AVAILABILITY
All data could be found in paper or supplementary data. Additional data could be requested from the corresponding authors.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Weihua Qin, Jack Bates and Joel Ryan for suggestions; Hartmann Harz and David Hörl for microscopy. We are indebted to David L. Spector for the kind gift of the lacO containing BHK cell line.
Contributor Information
Ningjun Duan, Department of Biology II, Ludwig Maximilians University Munich, Munich 81377, Germany; Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China.
Maria Arroyo, Department of Biology, Technical University of Darmstadt, Darmstadt 64287, Germany.
Wen Deng, Department of Biology II, Ludwig Maximilians University Munich, Munich 81377, Germany; College of Veterinary Medicine, Northwest A&F University, Yangling 712100, China.
M Cristina Cardoso, Department of Biology, Technical University of Darmstadt, Darmstadt 64287, Germany.
Heinrich Leonhardt, Department of Biology II, Ludwig Maximilians University Munich, Munich 81377, Germany.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
German Research Foundation [DFG LE 721/18-1 to H.L. and DFG CA 198/16-1 to M.C.C.]; Bayerische Forschungsstiftung [AZ-1286-17 to H.L.]; N.D. was funded by a fellowship of the China Scholarship Council. Funding for open access charge: Deutsche Forschungsgemeinschaft.
Conflict of interest statement. None declared.
REFERENCES
- 1. Busch H., Reddy R., Rothblum L., Choi Y.C.. SnRNAs, SnRNPs, and RNA processing. Annu. Rev. Biochem. 1982; 51:617–654. [DOI] [PubMed] [Google Scholar]
- 2. Varani G., Nagai K.. Rna recognition by rnp proteins during rna processing. Annu. Rev. Biophys. Biomol. Struct. 1998; 27:407–445. [DOI] [PubMed] [Google Scholar]
- 3. Kressler D., Hurt E., Baβler J.. Driving ribosome assembly. Biochim. Biophys. Acta. 2010; 1803:673–683. [DOI] [PubMed] [Google Scholar]
- 4. Hsu P.D., Scott D.A., Weinstein J.A., Ran F.A., Konermann S., Agarwala V., Li Y., Fine E.J., Wu X., Shalem O.et al.. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat. Biotechnol. 2013; 31:827–832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Mali P., Yang L., Esvelt K.M., Aach J., Guell M., DiCarlo J.E., Norville J.E., Church G.M.. RNA-Guided human genome engineering via Cas9. Science. 2013; 339:823–826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Nishimasu H., Ran F.A., Hsu P.D., Konermann S., Shehata S.I., Dohmae N., Ishitani R., Zhang F., Nureki O.. Crystal structure of Cas9 in complex with guide RNA and target DNA. Cell. 2014; 156:935–949. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Jiang F., Doudna J.A.. CRISPR-Cas9 structures and mechanisms. Annu. Rev. Biophys. 2017; 46:505–529. [DOI] [PubMed] [Google Scholar]
- 8. Scheibe M., Butter F., Hafner M., Tuschl T., Mann M.. Quantitative mass spectrometry and PAR-CLIP to identify RNA–protein interactions. Nucleic Acids Res. 2012; 40:9897–9902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Ascano M., Gerstberger S., Tuschl T.. Multi-disciplinary methods to define RNA–protein interactions and regulatory networks. Curr. Opin. Genet. Dev. 2013; 23:20–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Buenrostro J.D., Araya C.L., Chircus L.M., Layton C.J., Chang H.Y., Snyder M.P., Greenleaf W.J.. Quantitative analysis of RNA–protein interactions on a massively parallel array reveals biophysical and evolutionary landscapes. Nat. Biotechnol. 2014; 32:562–568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Ramanathan M., Majzoub K., Rao D.S., Neela P.H., Zarnegar B.J., Mondal S., Roth J.G., Gai H., Kovalski J.R., Siprashvili Z.. RNA–protein interaction detection in living cells. Nat. Methods. 2018; 15:207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Paige J.S., Wu K.Y., Jaffrey S.R.. RNA mimics of green fluorescent protein. Science. 2011; 333:642–646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Filonov G.S., Moon J.D., Svensen N., Jaffrey S.R.. Broccoli: rapid selection of an RNA mimic of green fluorescent protein by fluorescence-based selection and directed evolution. J. Am. Chem. Soc. 2014; 136:16299–16308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Koning R., van den Worm S., Plaisier J.R., van Duin J., Abrahams J.P., Koerten H.. Visualization by cryo-electron microscopy of genomic RNA that binds to the protein capsid inside bacteriophage MS2. J. Mol. Biol. 2003; 332:415–422. [DOI] [PubMed] [Google Scholar]
- 15. Daigle N., Ellenberg J.. λ N-GFP: an RNA reporter system for live-cell imaging. Nat. Methods. 2007; 4:633–636. [DOI] [PubMed] [Google Scholar]
- 16. Tyagi S. Imaging intracellular RNA distribution and dynamics in living cells. Nat. Methods. 2009; 6:331–338. [DOI] [PubMed] [Google Scholar]
- 17. Hocine S., Raymond P., Zenklusen D., Chao J.A., Singer R.H.. Single-molecule analysis of gene expression using two-color RNA labeling in live yeast. Nat. Methods. 2013; 10:119–121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Yang K., Yang Y., Zhang C.. Single-molecule FRET for ultrasensitive detection of biomolecules. NanoBioImaging. 2013; 2013:13–24. [Google Scholar]
- 19. Herce H.D., Deng W., Helma J., Leonhardt H., Cardoso M.C.. Visualization and targeted disruption of protein interactions in living cells. Nat. Commun. 2013; 4:2660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Anton T., Bultmann S., Leonhardt H., Markaki Y.. Visualization of specific DNA sequences in living mouse embryonic stem cells with a programmable fluorescent CRISPR/Cas system. Nucleus. 2014; 5:163–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Tsukamoto T., Hashiguchi N., Janicki S.M., Tumbar T., Belmont A.S., Spector D.L.. Visualization of gene activity in living cells. Nat. Cell Biol. 2000; 2:871–878. [DOI] [PubMed] [Google Scholar]
- 22. Tripathi V., Fei J., Ha T., Prasanth K.V.. RNA fluorescence in situ hybridization in cultured mammalian cells. Regulatory Non-Coding RNAs. 2015; Springer; 123–136. [DOI] [PubMed] [Google Scholar]
- 23. Green M.R. Pre-mRNA splicing. Annu. Rev. Genet. 1986; 20:671–708. [DOI] [PubMed] [Google Scholar]
- 24. Wilkie G.S., Dickson K.S., Gray N.K.. Regulation of mRNA translation by 5′-and 3′-UTR-binding factors. Trends Biochem. Sci. 2003; 28:182–188. [DOI] [PubMed] [Google Scholar]
- 25. Bernstein P., Ross J.. Poly (A), poly (A) binding protein and the regulation of mRNA stability. Trends Biochem. Sci. 1989; 14:373–377. [DOI] [PubMed] [Google Scholar]
- 26. Kahvejian A., Roy G., Sonenberg N.. The mRNA closed-loop model: the function of PABP and PABP-interacting proteins in mRNA translation. Cold Spring Harb. Symp. Quant. Biol. 2001; 66:293–300. [DOI] [PubMed] [Google Scholar]
- 27. Lee S., Kopp F., Chang T.-C., Sataluri A., Chen B., Sivakumar S., Yu H., Xie Y., Mendell J.T.. Noncoding RNA NORAD regulates genomic stability by sequestering PUMILIO proteins. Cell. 2016; 164:69–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Yang Z., Zhao Y., Lin G., Zhou X., Jiang X., Zhao H.. Noncoding RNA activated by DNA damage (NORAD): Biologic function and mechanisms in human cancers. Clin. Chim. Acta. 2019; 489:5–9. [DOI] [PubMed] [Google Scholar]
- 29. Tichon A., Perry R.B.-T., Stojic L., Ulitsky I.. SAM68 is required for regulation of Pumilio by the NORAD long noncoding RNA. Genes Dev. 2018; 32:70–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Kopp F., Elguindy M.M., Yalvac M.E., Zhang H., Chen B., Gillett F.A., Lee S., Sivakumar S., Yu H., Xie Y.. PUMILIO hyperactivity drives premature aging of Norad-deficient mice. Elife. 2019; 8:e42650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Kogo R., Shimamura T., Mimori K., Kawahara K., Imoto S., Sudo T., Tanaka F., Shibata K., Suzuki A., Komune S.. Long noncoding RNA HOTAIR regulates polycomb-dependent chromatin modification and is associated with poor prognosis in colorectal cancers. Cancer Res. 2011; 71:6320–6326. [DOI] [PubMed] [Google Scholar]
- 32. Wu L., Murat P., Matak-Vinkovic D., Murrell A., Balasubramanian S.. Binding interactions between long noncoding RNA HOTAIR and PRC2 proteins. Biochemistry. 2013; 52:9519–9527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Somarowthu S., Legiewicz M., Chillón I., Marcia M., Liu F., Pyle A.M.. HOTAIR forms an intricate and modular secondary structure. Mol. Cell. 2015; 58:353–361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Kaneko S., Li G., Son J., Xu C.-F., Margueron R., Neubert T.A., Reinberg D.. Phosphorylation of the PRC2 component Ezh2 is cell cycle-regulated and up-regulates its binding to ncRNA. Genes Dev. 2010; 24:2615–2620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Wang D., Ding L., Wang L., Zhao Y., Sun Z., Karnes R.J., Zhang J., Huang H.. LncRNA MALAT1 enhances oncogenic activities of EZH2 in castration-resistant prostate cancer. Oncotarget. 2015; 6:41045–41055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Long Y., Bolanos B., Gong L., Liu W., Goodrich K.J., Yang X., Chen S., Gooding A.R., Maegley K.A., Gajiwala K.S.et al.. Conserved RNA-binding specificity of polycomb repressive complex 2 is achieved by dispersed amino acid patches in EZH2. Elife. 2017; 6:e31558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Zhang Q., McKenzie N.J., Warneford-Thomson R., Gail E.H., Flanigan S.F., Owen B.M., Lauman R., Levina V., Garcia B.A., Schittenhelm R.B.et al.. RNA exploits an exposed regulatory site to inhibit the enzymatic activity of PRC2. Nat. Struct. Mol. Biol. 2019; 26:237–247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Cha T.-L., Zhou B.P., Xia W., Wu Y., Yang C.-C., Chen C.-T., Ping B., Otte A.P., Hung M.-C.. Akt-mediated phosphorylation of EZH2 suppresses methylation of lysine 27 in histone H3. Science. 2005; 310:306–310. [DOI] [PubMed] [Google Scholar]
- 39. Chen S., Bohrer L.R., Rai A.N., Pan Y., Gan L., Zhou X., Bagchi A., Simon J.A., Huang H.. Cyclin-dependent kinases regulate epigenetic gene silencing through phosphorylation of EZH2. Nat. Cell Biol. 2010; 12:1108–1114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Aebi U., Cohn J., Buhle L., Gerace L.. The nuclear lamina is a meshwork of intermediate-type filaments. Nature. 1986; 323:560–564. [DOI] [PubMed] [Google Scholar]
- 41. Gerace L. Nuclear lamina and organization of nuclear architecture. Trends Biochem. Sci. 1986; 11:443–446. [Google Scholar]
- 42. Gruenbaum Y., Margalit A., Goldman R.D., Shumaker D.K., Wilson K.L.. The nuclear lamina comes of age. Nat. Rev. Mol. Cell Biol. 2005; 6:21–31. [DOI] [PubMed] [Google Scholar]
- 43. Zhong S., Salomoni P., Pandolfi P.P.. The transcriptional role of PML and the nuclear body. Nat. Cell Biol. 2000; 2:E85–E90. [DOI] [PubMed] [Google Scholar]
- 44. Gall J.G. The centennial of the Cajal body. Nat. Rev. Mol. Cell Biol. 2003; 4:975–980. [DOI] [PubMed] [Google Scholar]
- 45. Sirri V., Urcuqui-Inchima S., Roussel P., Hernandez-Verdun D.. Nucleolus: the fascinating nuclear body. Histochem. Cell Biol. 2008; 129:13–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Andrade L.E., Chan E.K., Raska I., Peebles C.L., Roos G., Tan E.M.. Human autoantibody to a novel protein of the nuclear coiled body: immunological characterization and cDNA cloning of p80-coilin. J. Exp. Med. 1991; 173:1407–1419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Bellini M. Coilin, more than a molecular marker of the cajal (coiled) body. Bioessays. 2000; 22:861–867. [DOI] [PubMed] [Google Scholar]
- 48. Ogg S.C., Lamond A.I.. Cajal bodies and coilin–moving towards function. J. Cell Biol. 2002; 159:17–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Neugebauer K.M. Special focus on the Cajal body. RNA Biol. 2017; 14:669–670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Chen B., Gilbert L.A., Cimini B.A., Schnitzbauer J., Zhang W., Li G.-W., Park J., Blackburn E.H., Weissman J.S., Qi L.S.et al.. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell. 2013; 155:1479–1491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Hsu P.D., Lander E.S., Zhang F.. Development and applications of CRISPR-Cas9 for genome engineering. Cell. 2014; 157:1262–1278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Wang X., Zamore P.D., Hall T.M.. Crystal structure of a pumilio homology domain. Mol. Cell. 2001; 7:855–865. [DOI] [PubMed] [Google Scholar]
- 53. Wang X., McLachlan J., Zamore P.D., Hall T.M.T.. Modular recognition of RNA by a human pumilio-homology domain. Cell. 2002; 110:501–512. [DOI] [PubMed] [Google Scholar]
- 54. Filipovska A., Razif M.F.M., Nygård K.K.A., Rackham O.. A universal code for RNA recognition by PUF proteins. Nat. Chem. Biol. 2011; 7:425–427. [DOI] [PubMed] [Google Scholar]
- 55. Zhao Y.-Y., Mao M.-W., Zhang W.-J., Wang J., Li H.-T., Yang Y., Wang Z., Wu J.-W.. Expanding RNA binding specificity and affinity of engineered PUF domains. Nucleic. Acids. Res. 2018; 46:4771–4782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Carte J., Wang R., Li H., Terns R.M., Terns M.P.. Cas6 is an endoribonuclease that generates guide RNAs for invader defense in prokaryotes. Genes Dev. 2008; 22:3489–3496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Gootenberg J.S., Abudayyeh O.O., Lee J.W., Essletzbichler P., Dy A.J., Joung J., Verdine V., Donghia N., Daringer N.M., Freije C.A.et al.. Nucleic acid detection with CRISPR-Cas13a/C2c2. Science. 2017; 356:438–442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Smargon A.A., Cox D.B.T., Pyzocha N.K., Zheng K., Slaymaker I.M., Gootenberg J.S., Abudayyeh O.A., Essletzbichler P., Shmakov S., Makarova K.S.et al.. Cas13b is a type VI-B CRISPR-Associated RNA-Guided RNase differentially regulated by accessory proteins Csx27 and Csx28. Mol. Cell. 2017; 65:618–630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Liu L., Li X., Ma J., Li Z., You L., Wang J., Wang M., Zhang X., Wang Y.. The molecular architecture for RNA-Guided RNA cleavage by Cas13a. Cell. 2017; 170:714–726. [DOI] [PubMed] [Google Scholar]
- 60. O’Connell M.R. Molecular mechanisms of RNA targeting by Cas13-containing type VI CRISPR-Cas systems. J. Mol. Biol. 2019; 431:66–87. [DOI] [PubMed] [Google Scholar]
- 61. Abudayyeh O.O., Gootenberg J.S., Essletzbichler P., Han S., Joung J., Belanto J.J., Verdine V., Cox D.B.T., Kellner M.J., Regev A.et al.. RNA targeting with CRISPR-Cas13. Nature. 2017; 550:280–284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Kim V.N. RNA-targeting CRISPR comes of age. Nat. Biotechnol. 2018; 36:44–45. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data could be found in paper or supplementary data. Additional data could be requested from the corresponding authors.