Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2023 Jan 18;51(3):1297–1316. doi: 10.1093/nar/gkac1237

Depletion of the RNA-binding protein PURA triggers changes in posttranscriptional gene regulation and loss of P-bodies

Lena Molitor 1,3, Melina Klostermann 2,3, Sabrina Bacher 3, Juliane Merl-Pham 4, Nadine Spranger 5, Sandra Burczyk 6, Carolin Ketteler 7, Ejona Rusha 8, Daniel Tews 9, Anna Pertek 10, Marcel Proske 11,12, Anke Busch 13, Sarah Reschke 14, Regina Feederle 15, Stefanie M Hauck 16, Helmut Blum 17, Micha Drukker 18,19, Pamela Fischer-Posovszky 20, Julian König 21, Kathi Zarnack 22,, Dierk Niessing 23,24,
PMCID: PMC9943675  PMID: 36651277

Abstract

The RNA-binding protein PURA has been implicated in the rare, monogenetic, neurodevelopmental disorder PURA Syndrome. PURA binds both DNA and RNA and has been associated with various cellular functions. Only little is known about its main cellular roles and the molecular pathways affected upon PURA depletion. Here, we show that PURA is predominantly located in the cytoplasm, where it binds to thousands of mRNAs. Many of these transcripts change abundance in response to PURA depletion. The encoded proteins suggest a role for PURA in immune responses, mitochondrial function, autophagy and processing (P)-body activity. Intriguingly, reduced PURA levels decrease the expression of the integral P-body components LSM14A and DDX6 and strongly affect P-body formation in human cells. Furthermore, PURA knockdown results in stabilization of P-body-enriched transcripts, whereas other mRNAs are not affected. Hence, reduced PURA levels, as reported in patients with PURA Syndrome, influence the formation and composition of this phase-separated RNA processing machinery. Our study proposes PURA Syndrome as a new model to study the tight connection between P-body-associated RNA regulation and neurodevelopmental disorders.

Graphical Abstract

Graphical Abstract.

Graphical Abstract

Multi-omics analyses reveal a role of the RNA-binding protein PURA in immune responses, mitochondrial function, autophagy and P-body activity, helping to understand the etiology of the neurodevelopmental disorder PURA Syndrome.

INTRODUCTION

In recent years, an increasing number of RNA-binding proteins have been implicated in the pathology of human disorders, in particular in neuronal diseases (1,2). One of them is the purine rich element binding protein A (PURA, formerly known as Pur-alpha) (3). In 2014, mutations in the PURA gene have been linked to the neurodevelopmental disorder PURA Syndrome (4,5). Patients with heterozygous de novo mutations in PURA show a range of symptoms including neurodevelopmental delay, intellectual disability, hypotonia, and epilepsy (6,7). For unknown reasons, the phenotypes of patients with PURA Syndrome vary considerably. In addition, mice with homozygous deletions of PURA from two independent studies show phenotypes partially resembling patients with PURA Syndrome, including tremor and movement abnormalities (8,9). While it is generally assumed that symptoms are caused by haploinsufficiency of the PURA protein (10,11), the affected molecular pathways remain largely elusive. Furthermore, mice with a heterozygous mutation in PURA show very little overlap with human symptoms (12), underlining the importance of PURA-related studies in human cells.

PURA was initially reported as a transcription factor binding to a specific DNA sequence motif upstream of the c-myc gene (13,14). While this and other DNA targets of PURA have been described in the past (15), the majority of recent reports indicated an involvement in RNA-related processes. In particular, PURA was repeatedly found in cytoplasmic, kinesin- and myosin-containing mRNA-transport complexes in neurons of mice (16–20). Furthermore, PURA was reported to localize to cytoplasmic stress granules in human cells. In a cell culture model for amyotrophic lateral sclerosis (ALS), changes in PURA expression modulated the pathological appearances of stress granules and mitigated ALS-related neurotoxicity, indicating a functional importance of PURA for these membrane-less organelles (21). While these reports indicate that PURA associates with both DNA and RNA, it remains unclear which interaction is functionally more relevant.

PURA belongs to the PC4 family of proteins (22) that bind single-stranded (ss) nucleic acids. This family of proteins has been found in all kingdoms of life and they often show similarities in their nucleic acid binding properties. Nuclear magnetic resonance (NMR) measurements and quantitative binding studies revealed that PURA interacts with ssDNA and ssRNA with comparable affinities and properties (23). Crystal structures of PURA proteins from different species revealed two so-called PUR domains, which mediate PURA’s dimerization, and indicated that there are no structural constraints favoring DNA or RNA binding (23,24). Thus, although we have a relatively clear understanding of PURA’s structural and molecular properties (3), a link is missing that explains how its nucleic acid binding impacts cellular functions. Hence, the aim of this study was to identify target pathways of PURA that could be dysregulated in patients with PURA Syndrome.

MATERIALS AND METHODS

Generation of a PURA-specific antibody

The monoclonal IgG2-2a anti-PURA12D11 antibody was raised against the unstructured linker region (21 amino acids) between PUR I-II of human PURA (peptide sequence: Cys-AQLGPSQPPDLAQAQDEPRRA; Peps4LS GmbH, Heidelberg) (Supplementary Figure S1). Several antibodies were generated after immunization of Sprague Dawley rats with ovalbumin-coupled peptide and hybridoma fusion as described before (25). Antibody supernatants were validated for binding to biotinylated PURA peptide in a solid-phase immunoassay. Subsequently, the panel of antibody supernatants was tested by Western blot on total cell lysate from HeLa cells. Selected clones were subcloned twice by limiting dilution to obtain stable monoclonal cell lines. Finally, the best antibody was tested in Western blot experiments on total cell lysate from PURA knockdown (KD) and control (CTRL) HeLa cells.

Cross-reactivity to PURB was excluded using recombinantly expressed GST-PURA and GST-PURB in Western blot assays (Supplementary Figure S1G). For this, human PURA and PURB were cloned into the pOPIN-J expression vector and expressed in Escherichia coli Rosetta 2 (DE3) cells, followed by Western blot with cell lysates. Experiments in this work were performed with hybridoma supernatant clone 12D11 (IgG2a/k). For additional details, see ‘Western blot experiments’.

Generation of inducible PURA overexpression HeLa cell line

In the overexpression construct GFP-P2A-FLAG-PURA, GFP and FLAG-PURA are separated by a viral P2A cleavage site which mediates translation into two disconnected protein products (26). PiggyBac plasmids (27) harboring GFP-P2A-FLAG-PURA under control of a doxycycline-inducible promoter were generated using Gibson assembly (GeneArt Thermo Fisher Scientific) of three fragments containing overhangs according to manufacturer's instructions. Using these plasmids, overexpression cell lines were generated. The plasmid containing GFP-P2A-FLAG-PURA as well as a PiggyBac Helper plasmid (27) were transfected into cells using Lipofectamine 3000 (Thermo Fisher Scientific) according to manufacturer's instructions. Transfected cells were incubated at 37°C, 5% CO2 for 2–4 days. Transfected cells were subsequently selected by addition of hygromycin (700 μg/ml) for at least ten days. GFP-P2A-FLAG-PURA overexpression was induced by addition of doxycycline (1 μg/ml) for 24 h. Successful translation of the overexpression construct was validating by showing GFP+ cells using GFP fluorescence, while the overexpressed FLAG-PURA protein was detected using the anti-PURA12D11 antibody.

siRNA-mediated PURA knockdown

siRNA-mediated knockdown of PURA in HeLa cells was performed using a predesigned PURA siRNA pool (Dharmacon, M-012136-01-000). One day prior to transfection, HeLa cells were plated at 20% confluency in six-well plates. The next day, lipofection was performed using RNAiMAX (Thermo Fisher Scientific) according to manufacturer's instructions. Briefly, 20 μM siRNA pool was mixed with OptiMEM (Thermo Fisher Scientific) in one reaction and lipofectamine RNAiMAX was mixed with OptiMEM in a second reaction. Both reactions were incubated at room temperature (RT) for 5 min and mixed by flicking. Meanwhile, culture media on HeLa cells was changed to 1 ml DMEM + 10% FBS lacking antibiotics. Contents of both reactions were mixed and further incubated for 20 min at RT. Finally, 400 μl of lipofection mix were added drop-wise onto HeLa cells. As a control, siGENOME Non-Targeting Pool #1 (Dharmacon, D-001206-13-05) and a second independent predesigned PURA siRNA (Thermo Fisher Scientific, 289567) were used as described for the PURA siRNA pool. The transfected cells were incubated for 48 h at 37°C and 5% CO2 and subsequently analyzed in downstream assays.

RNA sequencing

Four independent biological replicates of PURA KD (siGENOME Non-Targeting Pool #1; Dharmacon, D-001206-13-05) and control (CTRL) HeLa cells were prepared as described above. Control cells were only treated with lipofectamine RNAiMAX (Thermo Fisher Scientific) and OptiMEM (Thermo Fisher Scientific) without adding the non-targeting pool siRNA. After 48 h incubation with siRNA, cells were washed using PBS and subsequently collected in TRIzol Reagent (Thermo Fisher Scientific). Total RNA was extracted using Maxwell® RSC miRNA Tissue Kit (AS1460) on the Maxwell® RSC instrument according to the manufacturer's instructions. Briefly, shock-frozen cells were resuspended in 500 μl TRIzol. 6 μl 1-thioglycerol were added to 300 μl TRIzol suspension. The suspension was mixed vigorously and total RNA extracted by phase separation using RSC SimplyRNA Tissue (AS1340) setup. The concentration was measured on a NanoDrop spectrometer (Thermo Fisher Scientific) and the quality of the RNA samples checked on an Agilent Bioanalyzer (Agilent; RNA 6,000 Nano kit).

Libraries were generated with the Lexogen SENSE mRNA-seq Library Prep Kit V2 (Lexogen) with half of the volume described in the manual. Briefly, 100 ng of total RNA were bound to oligo-dT beads, reverse-transcribed on the beads and the second strand of cDNA was synthesized after ligation of an adapter. The eluted cDNA was amplified with barcoded primers during 11 cycles of PCR with the following program: 98°C for 10 s, 65°C for 20 s, 72°C for 30 s and a final extension at 72°C for 1 min (hold at 10°C). The PCR products were purified with Agencourt AMPure XP beads (Beckman Coulter). The quality of the libraries was validated with an Agilent Bioanalyzer (DNA 1000 Kit). The finished, barcoded libraries were pooled and sequenced on an Illumina HiSeq1500 as 50-nt single-end reads. Each library yielded on average 35 million reads.

Reads were aligned to the human genome (GRCh38.p12 from GENCODE) with STAR (version 2.7.6a) (28) with up to 4% mismatches and no multimapping allowed. Reads per gene were counted using htseq-count (version 0.11.3) (29) with default settings and GENCODE gene annotation (release 31) (30). Differential expression analysis between PURA KD and CTRL samples was performed using DESeq2 (version 1.33.4) (31) with a significance cutoff at an adjusted P value < 0.01 (Benjamini–Hochberg correction). This yielded a total of 3415 significantly differentially expressed transcripts, including 1663 with increased and 1752 with decreased expression upon PURA KD. Note that we use the term transcripts equivalent to genes here. For the heatmap in Figure 4C, library size-corrected RNA-seq read counts per sample were rlog-transformed (DESeq2, version 1.33.4) (31). For visualization, the counts were converted into row-wise z-scores.

Figure 4.

Figure 4.

Loss of PURA results in widespread changes in gene and protein expression. (A) Differentially expressed transcripts in PURA knockdown (KD) vs. control (CTRL) conditions (n = 4) are shown in a volcano plot giving the RNA expression change [log2] against the significance as adjusted P value [log10] (black: 3415 transcripts with FDR < 0.01). (B) Changes in protein abundance upon PURA KD (n = 4) are shown in a volcano plot giving the protein expression change [log2] against the significance as adjusted P value [log10] (black: 995 proteins with FDR < 0.05). (C) 334 targets are regulated by PURA on the RNA and protein levels. Heatmap shows z-score-normalized RNA counts and protein abundances of all replicates with the associated PURA-bound transcript region (iCLIP). (D) Changes in protein abundances are similar to changes in RNA levels. Scatter plot compares changes in RNA (y-axis) and protein abundance (x-axis) for 4391 transcripts that were measured in both assays. Targets with significant changes in both data sets (n = 334) are marked in red. Infinite log2 fold change values from protein measurements are shown as asterisks. (E) Overlap of PURA-bound RNAs with RNAs or proteins significantly changing upon PURA KD. The 234 RNAs overlapping between the three experiments were defined as highly validated PURA targets. (F–I) SQSTM1 and LSM14A levels are significantly reduced upon PURA KD. Representative Western blots (F, H) and quantifications of replicates (G, I) are shown in CTRL and PURA KD conditions. PURA KD was confirmed by detection with anti-PURA12D11. H3 served as loading controls (n = 3 [SQSTM1] or 4 [LSM14A, IL6ST] biological replicates, paired one-sided Student's t-test). (J) Immunofluorescence staining of PURA (red) together with the IL6ST (green) and DAPI (blue) as a nuclear stain in CTRL (top) and PURA KD (bottom) HeLa cells. Scale bars, 10 μm. (K) Quantification (ImageJ) of IL6ST intensity of approximately 80 cells per replicate and condition (three biological replicates with four technical replicates each, unpaired two-sided Student's t-test). (L, M) IL6ST levels significantly increase upon PURA KD. Representative Western blot (L) and quantification of replicates (M) are shown in CTRL and PURA KD conditions. PURA KD was confirmed by detection with anti-PURA12D11. H3 served as loading controls (n = 4 biological replicates, paired one-sided Student's t-test).

Shotgun proteomics

Generation of protein lysates

Four independent biological replicates of PURA KD and CTRL HeLa cells were prepared as described above. Cell samples were collected by scraping them in PBS and subsequent centrifugation at 500 × g for 3 min. Cell pellets were lysed in 100 μl RIPA buffer and subsequently centrifuged at 16,000 × g at 4°C for 10 min. The supernatant was transferred to a new tube and the amount of protein in the lysate was determined by Bradford assay (ROTI®Quant, Carl Roth).

Sample preparation for mass spectrometric analysis

10 μg per sample were digested with Lys-C and trypsin using a modified FASP procedure (32,33). Briefly, after reduction and alkylation using DTT and IAA, the proteins were centrifuged on a 30 kDa cutoff filter device (Sartorius) and washed each thrice with UA buffer (8 M urea in 0.1 M Tris/HCl pH 8.5) and with 50 mM ammonium bicarbonate. The proteins were digested for 2 h at RT using 0.5 μg Lys-C (Wako Chemicals) and for 16 h at 37°C using 1 μg trypsin (Promega). After centrifugation (10 min at 14,000 × g), the eluted peptides were acidified with 0.5% TFA and stored at –20°C.

LC–MS/MS measurements

Liquid chromatography with tandem mass spectrometry (LC–MS/MS) analysis was performed on a QExactive HFX mass spectrometer (Thermo Fisher Scientific) online coupled to a UItimate 3000 RSLC nano-HPLC (Dionex). Samples were automatically injected and loaded onto the C18 trap cartridge and after 5 min eluted and separated on the C18 analytical column (Acquity UPLC M-Class HSS T3 Column, 1.8 μm, 75 μm × 250 mm; Waters) by a 90 min non-linear acetonitrile gradient at a flow rate of 250 nl/min. MS spectra were recorded at a resolution of 60,000 with an AGC target of 3 × 10e6 and a maximum injection time of 30 ms from 300 to 1500 m/z. From the MS scan, the 15 most abundant peptide ions were selected for fragmentation via HCD with a normalized collision energy of 28, an isolation window of 1.6 m/z, and a dynamic exclusion of 30 s. MS/MS spectra were recorded at a resolution of 15,000 with a AGC target of 10e5 and a maximum injection time of 50 ms. Unassigned charges, and charges of + 1 and > 8 were excluded from precursor selection.

Quantitative MS analysis

Acquired raw data was analyzed in the MaxQuant software (MPI Biochemistry, Martinsried; version 1.6.7.0), (34) for peptide and protein identification via a database search (Andromeda search engine), (35) against the SwissProt Human database (Release 2020_02, 20,435 sequences; 11 490 581 residues), considering full tryptic specificity, allowing for up to one missed tryptic cleavage sites, precursor mass tolerance 10 ppm, fragment mass tolerance 0.02 Da. Carbamidomethylation of cysteine was set as a static modification. Dynamic modifications included deamidation of asparagine and glutamine and oxidation of methionine. Identifications were filtered for a PSM false discovery rate < 1% and protein false discovery rate of 5%. Label-free quantifications were based on unique peptides applying the LFQ algorithm (36) with LFQ min, count of 1 in combination with the default match-between runs settings, allowing for matching of identifications (37) between the individual runs resulting in quantifications of MS features throughout the dataset.

Differential protein expression was analyzed based on MaxQuant LFQ values using DEqMS (1.12.1) (38). We treated missing values as follows: if protein abundance was not measured in any of the samples in one condition it was set to zero, which results in infinite values for the calculated ratios. Peptides with two or more missing values in one condition were treated as non-quantifiable and no ratios were calculated. When only one value was missing per condition, the ratios were calculated on the other three samples.

P values were adjusted for multiple testing by Benjamini-Hochberg correction. An advantage of DEqMS is that it implements scaling of (adjusted) P values to the number of detected unique peptides. We deemed proteins with a scaled adjusted P value < 0.05 significant. In the heatmap in Figure 4C, protein abundance is shown after z-score normalization.

Quantitative real-time PCR (qPCR)

RNA was extracted using the High Pure RNA Isolation kit (Roche Molecular Systems) according to manufacturer's instructions and following the general precautions required for RNA work (39). RNA was eluted in 50 μl DEPC water. Two rounds of DNase digestion using TURBO DNA-free kit (Thermo Fisher Scientific) according to manufacturer's instructions were performed to safely remove all remaining DNA. The amount of pure RNA was analyzed by measuring the OD260 using a NanoDrop microliter photometer. The extracted RNA was reverse transcribed using PrimeScript RT Master Mix (Takara) according to manufacturer's instructions. On the generated cDNA libraries, qPCR experiments were performed using SYBR Green Master Mix (Thermo Fisher Scientific) in a LightCycler 480 Instrument II (Roche Molecular Systems) in 96-well format. Ct values were used for the analysis of differential mRNA abundance between PURA KD and CTRL conditions. This analysis was done using the ΔΔct method (40) with the housekeeping genes GAPDH and RPL32 for normalization. Changes were tested for significance using an unpaired two-sided Student's t-test on the ΔΔct values. All oligonucleotides used for qPCR experiments are listed in Supplementary Table S8.

Western blot experiments

Western blot experiments were performed using Mini Blot module (Thermo Fisher Scientific) according to manufacturer's instructions. NuPage loading dye (Thermo Fisher Scientific) was added onto cell lysates and the mix was heated to 70°C for 10 min prior to gel loading. The blotting module was filled with NuPAGE 1× SDS running buffer (Thermo Fisher Scientific) and 500 μl NuPAGE antioxidant (Thermo Fisher Scientific) was added to the front part of the chamber. Samples were cooled to RT and loaded onto NuPAGE 4–12% Bis–Tris pre-cast gradient gels (Thermo Fisher Scientific) in the buffer-filled blotting module. For protein molecular weight estimation, BlueStar Plus Prestained Protein Marker (Nippon Genetics) was used. The gel was run for 45 min at 200 V. The PVDF blotting membrane was activated in methanol for 30 s prior to usage. All other components were soaked in 1x transfer buffer (Thermo Fisher Scientific) with 20% methanol prior to blotting module assembly. The assembled module was inserted into the Mini Blot module (Thermo Fisher Scientific), the surrounding chamber was filled with MQ water and the blotting reaction was run for 90 min at 30 V. Subsequently, the membrane was blocked using 1% casein in PBST. This was incubated for 30 min at RT while rotating on tube revolver machine (neoLab). Then, the blocking solution was discarded and primary antibody (dilutions in Supplementary Table S11) in 0.5% casein in PBST was added. Primary antibody was incubated overnight at 4°C rotating on a tube revolver device. On the next day, the primary antibody solution was discarded, and the blot was washed. Next, the secondary antibody (dilutions in Supplementary Table S12) in PBST was incubated on the membrane for 1 h at RT while rotating. Finally, the secondary antibody solution was discarded, and the membrane was washed. Then, the membrane was incubated with Amersham ECL Prime Western Blotting Detection Reagent (GE Healthcare). 1:1 mix of solution A and solution B of the above-mentioned ECL kit was added onto the membrane. The membrane was analyzed using Fusion SL 4 device (Vilber Lourmat, software FusionCapt Advance SL 4 16.04). Changes in signal intensity between PURA KD and CTRL samples were tested for significance in R using a paired one-sided Student's t-test. For visualization, all samples were normalized to the mean of CTRL samples.

Immunofluorescence staining

Cells were cultivated in culture medium on coverslips (Thermo Fisher Scientific) in 12-well plates and subsequently washed using PBS and fixed using 3.7% formaldehyde in PBS. Fixing media was incubated for 10 min at RT in the cell culture dish. Permeabilization was induced by incubating the cells in 0.5% Triton X-100 in PBS for 5 min at RT. Afterwards, cells were washed twice using PBS and the coverslips were transferred to a pre-labelled parafilm that was placed onto wet Whatman paper to prevent drying. Cells were then blocked for 10 min in blocking buffer (1% donkey serum in PBST) and incubated with primary antibodies (Supplementary Table S11) in blocking buffer for 1 h at RT. Secondary antibodies (Supplementary Table S12) were diluted in blocking buffer and incubated for 1 h at RT. Washing steps after antibody incubation were performed with PBST. DNA was stained with DAPI at 0.5 μg/ml in PBS by incubation for 5 min at RT, followed by washing twice using PBS. Finally, cells were mounted in ProLong Diamond Antifade (Thermo Fisher Scientific) and imaging was performed by fluorescence microscopy using an Axio.Observer.Z1 (Carl Zeiss AG) microscope. Immunofluorescence staining for P-bodies using the marker proteins DCP1A, DDX6 and LSM14A was performed on a LSM 880 Airyscan confocal microscope (Carl Zeiss AG) using a Zeiss Plan-Apochromat 20×/0.8 M27 objective and ZEN Black software (version 14.0.18.201).

Quantitative analysis of immunofluorescence images

Quantification of DCP1A, LSM14A and DDX6 granules in different cell lines (HeLa and NHDF) was done using Fiji software (version 2.3.0/1.53q). For each image, the cell number was determined using the auto threshold function on the DAPI channel, followed by particle analysis selecting a particle size of 1000-infinity pixel units (μm2). Then, all the granules on the image were quantified using the same particle analysis tool, after processing the corresponding channel with auto threshold. Granules were counted by selecting a particle size of 40–200 pixel units (μm2) and a circularity of 0.1–1.00 for HeLa cells (Figure 5BD) (41,42). In NHDF cells, a particle size of 30–100 pixel units (μm2) was selected (Supplementary Figure S10G and H). Quantification was done in three biological replicates and for each replicate, four confocal pictures (20x magnification) of each condition (CTRL, PURA KD) were analyzed.

Figure 5.

Figure 5.

PURA localizes to P-bodies and P-body RNAs are upregulated upon PURA KD. (A) Venn diagram depicts overlap of PURA-bound RNAs (red, TPM > 0) with the transcriptomes of P-bodies (blue, n = 4767, odds ratio 5.7 [5.3, 6.2] (95% CI), P value < 2.2 × 10−16, Fisher's exact test) (42) and stress granules (grey, n = 2723, odds ratio 12.0 [10.8, 13.4], P value < 2.2 × 10−16, Fisher's exact test) (51). (B) Confocal micrographs of PURA (red), P-body marker DCP1A (green) and colocalized signal (yellow). Nuclei were stained with DAPI (blue). Arrowheads indicate examples of P-bodies with overlapping staining. Scale bars, 10 μm. (C) Staining as in (A) using LSM14A as P-body marker (measured in 555 nm channel, depicted in green). Colocalization (yellow) of PURA (red) and LSM14A (green). Scale bars, 10 μm. (D) Colocalization of PURA (anti-PURA12D11) with DCP1A- and LSM14A-positive granules (6 samples each, mean over all samples is given by bars). Granules were defined as described in Methods. (E) P-body RNAs are upregulated in PURA KD. Violin plot of RNA expression changes (log2 fold change, PURA KD/CTRL) for RNAs enriched in P-bodies (blue) versus all other RNAs (grey) (P value < 0.0001, unpaired two-sided Wilcoxon rank-sum test).

For the analysis of signal intensity of IL6ST and PURA immunofluorescence images, the channel for quantification was split from the DAPI channel. For each image, the background noise was removed using the subtract background tool with a rolling ball radius of 100 pixels. The rectangle tool was used to select parts of the picture without any signal and then that area was measured for integrated intensity. Afterwards the whole picture was measured with the same tool. For analysis, the background integrated intensity was subtracted from the integrated intensity of the channel of interest. The intensity of each image was also normalized to the cell number using the same particle analysis tool as described above (Figure 4J and K, Supplementary Figure S9G).

Colocalization was determined using a plugin from the Fiji software named BioIP JaCoP (43). The two channels to be analyzed were merged and then the plugin was used with a set threshold for Channel A and B ‘Otsu’ (Figure 5D).

Nuclear-cytoplasmic fractionation

Five million HeLa cells were utilized for cell fractioning. Cytoplasmic and nuclear fractions were generated using the NE-PER Kit (Thermo Scientific, Cat.-No: 78833) following the manufacturer's instructions. In short, HeLa cells were resuspended, vortexed and centrifuged in cytoplasmic extraction buffer to obtain the cytoplasmic fraction. Afterwards, the pellet consisting of nuclei was resuspended in nuclear extraction buffer and frequently vortexed every 10 minutes for a total of 40 minutes to degrade the nuclear membrane. After centrifugation, the total protein concentration of the cytoplasmic and nuclear fraction were determined using BCA assay (Pierce™ BCA Protein Assay Kit, Cat.-No. 23227). For subcellular localization analysis of PURA, 20 μg of total protein for each cell fraction were used for Western blot analysis. To validate successful separation of cell fractions protein levels of PARP1 (nuclear control: Sigma-Aldrich, Cat.-No. HPA045168) and GAPDH (cytoplasmic control: Sigma-Aldrich, Cat.-No. HPA040067) were additionally analysed (Figure 1B and C).

Figure 1.

Figure 1

PURA is predominantly located in the cytoplasm. (A) Immortalized cell lines from different tissue origins were analyzed for PURA’s subcellular localization by immunofluorescence (IF) staining HeLa, cervix, U251MG, brain, HEK293T, kidney, H1299, lung, and HepG2, liver. Cells were stained with DAPI (blue) and anti-PURA12D11 (white). In the merged image, PURA staining is shown in orange for better visualization. Scale bars, 20 μm. (B) Nuclear-cytoplasmic fractionation of HeLa cell lysates. Representative Western blot using anti-PARP1 antibody (116 kDa) as marker for nuclear fraction, anti-GAPDH antibody (36 kDa) as marker for cytoplasmic fraction and anti-PURA12D11 (35 kDa). (C) Densitometric quantification of PURA levels in cell fractions in 3 replicates (of which one is shown in (B)). Lines represent the mean ± standard deviation (SD) of three independent experiments. P value is calculated by unpaired two-sided Student's t-test.

Electrophoretic mobility shift assay (EMSA)

For EMSAs, GST-PURA (pOPIN-J) was recombinantly expressed in E. coli Rosetta 2 (DE3). Proteins were purified via GSTrap FF column (Cytiva). The GST-Tag was removed via 3C protease cleavage and the sample passed through HiTrap Q FF and HiTrap Heparin FF column. Protein bound to the HiTrap Heparin column was eluted and the sample further purified using HiLoad 16/600 Superdex 75 size-exclusion chromatography column (Cytiva) (Supplementary Figure S3A–C).

RNAs for EMSAs were in vitro transcribed using the MEGAshortscript T7 Transcription kit (Ambion). As a template, purchased HPLC-purified primers were used (Supplementary Table S10). T7 primer and the DNA template were annealed by incubation at 60°C for 5 min and cooled down to RT. The in vitro transcription was performed as described by the manufacturer (Ambion). Afterwards, the remaining DNA was digested using DNase I (Ambion, same kit) and the RNA was purified by phenol/chloroform extraction. Ethanol precipitation was performed at −20°C with the addition of GlycoBlueTM coprecipitant (Thermo Fisher Scientific) and centrifugation for 1 min at 4°C. DEPC water was used to dissolve the RNA and its quality was confirmed by denaturing and native polyacrylamide gel electrophoresis (PAGE).

For labeling of in vitro transcribed RNA, RNase-free buffers, materials, and reagents were used. In vitro transcribed RNA (15 pmol) was 5′ dephosphorylated using FastAP thermosensitive alkaline phosphatase (Thermo Fisher Scientific) in 20 μl reaction containing 1× FastAP buffer (Thermo Fisher Scientific) and 20 U of the RNase inhibitor SUPERaseIn (Thermo Fisher Scientific). The reaction was incubated for 30 min at 37°C and the dephosphorylated RNA was extracted using phenol/chloroform and precipitated with 1 V 3 M NaOAc, 3 V absolute ethanol and cooled down at −20°C for at least 15 min. For radioactive labeling, 15 pmol dephosphorylated RNA was 5′ phosphorylated with 32P from γ-32P ATP (Hartmann Analytic) with T4 PNK (New England Biolabs) in 1x PNK buffer in a 20 μl reaction. The reaction was incubated for 30 min at 37°C and afterwards stopped for 10 min at 72°C. NucAwayTM Spin column kit (Ambion) was used to remove unlabeled free nucleotides based on the manufacturer's instructions. Eluted radiolabeled RNA was diluted to a final concentration of 100 nM in DEPC water and stored at −20°C.

EMSAs for PURA-RNA interactions were performed as described before (23). The protein-nucleic acid complexes were formed in RNase-free binding buffer containing 250 mM NaCl, 20 mM HEPES pH 8.0, 3 mM MgCl2, 4% glycerol and 2 mM DTT. Serial protein dilutions and a constant amount of radiolabeled RNA (2.5 nM) were incubated in a total reaction volume of 20 μl for 20 min at RT. All experiments contained 100 μg/ml yeast tRNA (Ambion) as competitor. 10 μl of the reactions were loaded onto 6% TBE polyacrylamide gels that were prepared beforehand. After electrophoresis (45 min, 100 V), gels were incubated for 15 min in fixing solution ([v/v] 10% acetic acid, [v/v] 30% methanol), dried in a gel dryer (BioRad) and analyzed with radiograph films using a Phosphoimager (Fujifilm VLA- 3000) (Figure 2D, Supplementary Figure S3D–M).

Figure 2.

Figure 2.

PURA globally binds RNAs via specific binding sites and regulates a subset of its target transcripts. (A) Autoradiogram with radiolabeled RNA after immunoprecipitation of endogenous PURA (anti-PURA12D11) from HeLa cells shows crosslinked PURA-RNA complexes. Co-purified RNA fragments of variable lengths result in a smear upstream of the expected molecular weight of PURA (35 kDa). Four biological replicates are shown. (B) Schematic depiction of iCLIP experiment that identified 55,031 binding sites for endogenous PURA in HeLa cells. (C) Genome browser view shows the pileup of crosslink events of endogenous PURA from HeLa cells (top) and neural progenitor cells (bottom) at binding sites (red) in the 3′UTR of the STARD7 transcript. Inlay (blue box) shows complete STARD7 3′UTR. Red line denotes RNA fragment used for EMSA validation in (D). (D) EMSA with recombinantly expressed human full-length PURA and increasing concentrations (187 nM to 12 μM) of a radiolabeled 54-nt RNA fragment harboring two PURA binding sites from the STARD7 3′UTR (marked in (C)) confirms direct binding in vitro. For more EMSAs, see Supplementary Figure S3. (E) Cellular compartments (Gene Ontology) enriched in PURA-bound transcripts that relate to neurons (false discovery rate [FDR] < 0.05). Gene ratio (x-axis) depicts ratio of PURA-bound transcripts over all expressed transcripts (transcripts per million [TPM] > 0) attributed to this term. FDR is displayed as log10 by color scale. For a complete list of significantly enriched terms, see Supplementary Table S3. (F) PURA crosslink events from neural progenitor cells (NPCs) enrich in PURA binding sites identified in HeLa cells. Metaprofile (top) and heatmap (bottom) display crosslink events per nucleotide of endogenous PURA in NPCs for 1000 binding sites with highest signal in the NPC iCLIP data. BS, binding site.

Neural progenitor cell derivation from human induced pluripotent stem cells

The protocol for differentiation of neural progenitor cells was based on the generation of neurospheres (44). Briefly, the human induced pluripotent stem cell (hiPSC) line HMGU12 was harvested using StemMACS Passaging Solution (Miltenyi Biotec) and resuspended in DMEM/F-12 medium supplemented with 20% KSR, 1% NEAA, 1% GlutaMAX (Thermo Fisher Scientific), 10 μM SB431542, 5 μM dorsomorphin, 3 μM CHIR99021, 10 μM purmorphamine (Miltenyi Biotec), and 10 μM Y-27632 (Bio-Techne). The suspension was plated on an ultra-low attachment 6-well plate (Corning). Fresh medium was applied 24 h later without Y-27632. 48 h later, the basal medium was exchanged with N2B27-based medium containing a 1:1 mixture of DMEM-F-12 and Neurobasal A supplemented with 0.5% N-2, 1% B-27 minus vitamin A, 1% NEAA, and 1% GlutaMAX (Thermo Fisher Scientific), and the above-described small molecules. On day 5, N2B27-based medium was supplemented only with 50 μg/ml l-ascorbic acid, SB431542, and dorsomorphin. Approximately 24 h prior to replating, the medium was supplemented additionally with 5 ng/ml bFGF (Peprotech). On day 8, the neurospheres were mechanically dissociated and plated 1:6 on Matrigel-coated plates. Plated neurospheres were maintained for 7 days on N2B27 basal medium supplemented with Dual Smad inhibitors (Miltenyi Biotec (SB), Tocris (DM)), ascorbic acid and bFGF. On day 14, confluent neuroepithelial outgrowths were passaged in a 1:10 dilution using collagenase IV. The resulting neural progenitor cell (NPC) cultures were passaged every 7 days and maintained in N2B27 basal medium as described above with medium change every other day.

iCLIP experiments

iCLIP experiments for PURA were performed according to the published iCLIP2 protocol (45) with minor adaptions. Cells were grown to confluence, washed using PBS, crosslinked using Stratalinker 2400 (Vilber Lourmat) at 254 nm and 300 mJ/cm2 and lysed. Immunoprecipitation of PURA from cell lysates was either performed using anti-PURA12D11-coated Dynabeads protein G (Thermo Fisher Scientific) or FLAG M2 beads (Sigma Aldrich). Subsequently, co-purified RNAs were dephosphorylated at the 3′ end, the 3′ adapter was ligated and the 5′ end was labeled with radioactive isotopes (γ-32P ATP; Hartmann Analytic). Samples were loaded onto a NuPAGE 4–12% Bis-Tris pre-cast SDS gradient gel (Thermo Fisher Scientific), transferred to a nitrocellulose membrane (GE Healthcare) and visualized using a Phosphoimager (Fujifilm). After cutting the correct region from the nitrocellulose membrane using a cutting mask, RNA was isolated by Phenol/Chloroform/Isoamylalcohol (pH 8.0, Sigma Aldrich) and the mix was transferred to a 2 ml Phase Lock Gel Heavy tube (Quantabio). After separation of the phases the RNA was precipitated by the addition of 0.75 μl GlycoBlueTM Coprecipitant (Thermo Fisher Scientific), 40 μl 3 M sodium acetate pH 5.5 and 1 ml 100% ethanol. It was mixed and stored at −20°C for at least 2 h. Reverse transcription was performed on the resulting purified RNA using Superscript III (Thermo Fisher Scientific) according to manufacturer's instructions. Then, cDNA was immobilized on MyONE SILANE beads (Thermo Fisher Scientific) as described by the manufacturer and the second adapter was ligated. The library was amplified for six PCR cycles with short primers followed by a ProNex size selection step (Promega) in a 1:2.95 (v/v) sample/bead ratio according to manufacturer's instructions. The size-selected library was amplified for 13/16 cycles (endogenous PURA, anti-PURA12D11, HeLa cells; samples 1 & 4 and samples 2 & 3, respectively), 22 cycles (endogenous PURA, anti-PURA12D11, NPCs), 12 cycles (overexpressed FLAG-PURA, anti-PURA12D11, HeLa cells), 13/14 cycles (overexpressed FLAG-PURA, anti-FLAG, HeLa cells; sample 1/2, respectively) using Illumina primers (Supplementary Table S9). Afterwards, a second size selection was performed using ProNex chemistry (Promega) at 1/2.4 (v/v) sample/bead ratio according to manufacturer's instructions and cDNA was eluted in 20 μl MQ water. The final library was analyzed using a D1000 Chip (Agilent) in a Bioanalyzer (Agilent). Samples were equimolarly mixed before sequencing on an Illumina HiSeq1500 platform with 75–110-nt single-end reads and 6.7–29.5 million reads per sample (Supplementary Table S1).

Processing of iCLIP reads

Basic quality controls were done with FastQC (version 0.11.8) (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). The first 15 nt of iCLIP reads hold a 6 nt sample barcode and 5 + 4 nt unique molecular identifiers (UMIs) flanking the sample barcode. Based on the sequence qualities (Phred score) in this 15 nt barcode region, reads were filtered using the FASTX-Toolkit (version 0.0.14) (http://hannonlab.cshl.edu/fastx_toolkit/) and seqtk (version 1.3) (https://github.com/lh3/seqtk/). The sample barcodes, found on positions 6–11 of the reads, were used to de-multiplex the set of all quality filtered reads using Flexbar (46) (version 3.4.0). Afterwards, barcode regions and adapter sequences were trimmed from read ends using Flexbar, requiring a minimal overlap of 1 nt of read and adapter. UMIs were added to the read names and reads shorter than 15 nt were removed from further analysis. Downstream analysis was done as described in Chapters 3.4 and 4.1 of Busch et al. (47). GENCODE (30) release 31 genome assembly and annotation were used during mapping.

Binding site definition

Using the procedure described in (47), we defined 5-nt wide binding sites on the iCLIP data for the endogenous PURA (anti-PURA12D11) from HeLa cells. In brief, the processed crosslink events of the four biological replicates were merged into two pseudo-replicates and subjected to peak calling by PureCLIP (version 1.3.1) (48) with default parameters. To define binding sites, PureCLIP sites closer than 5 nt were merged into regions, and isolated PureCLIP sites without an adjacent site within 4 nt were discarded. Binding site centers were defined iteratively by the position with the highest number of crosslink events and enlarged by 2 nt on both sides to obtain 5-nt binding sites. Binding site centers were required to harbor the maximum PureCLIP score within the binding site. Furthermore, binding sites were filtered for reproducibility by requiring a binding site to be supported by a sufficient number of crosslink events in at least two out of four replicates. The threshold for sufficient crosslink event coverage, using the 0.05 percentile, was determined as described in (47).

Genomic location of binding sites and crosslink events

We mapped binding sites or crosslink events to the gene and transcript annotation from GENCODE (release 31, genome version GRCh38.p12) (Figure 3A and B) (30). Genes of gene level 3 were included only when no genes of higher level were overlapping. Similarly, transcripts of transcript level NA were included only when no transcripts of levels 1–3 were annotated for the gene. When a binding site overlapped with two genes, one was chosen at random. For the assignment of PURA binding sites to transcript regions (Figure 3B), when binding sites overlapped with different regions from different transcripts, the region was chosen by a hierarchical rule with 3′UTR > 5′UTR > CDS > intron, which we established from visual evaluation of the crosslink event and binding site distribution. We divided the PURA-bound mRNAs into groups according to the location of the strongest PURA binding site on the transcript (3′UTR, n = 2462; 5′UTR n = 98; CDS, n = 1646; intron, n = 1; noncoding RNA = 185). To normalize the number of binding sites per region to the lengths of the respective regions (Supplementary Figure S6H), we selected all genes with at least one PURA crosslink in any region and calculated a mean region length over all transcripts annotated for a gene. We then summed up the mean region lengths of all selected genes and divided the number of binding sites per region by this sum.

Figure 3.

Figure 3.

PURA binds to CDS and 3′UTRs of mRNAs. (A) Transcript types bound by endogenous PURA in HeLa cells show its preference for binding to protein-coding RNAs. Bars indicate the number of bound RNAs of each type on a log10 scale. (B) PURA binding sites per transcript region in protein-coding transcripts. Most binding sites are located in 3′UTRs or CDS. Bars show total number of binding sites per region. (C) PURA displays one or more distinct peaks in the 3′UTR of bound transcripts. Heatmap shows normalized, smoothed distribution of PURA crosslink events in the first 300 nt of 1,550 exemplary 3′UTRs with intermediate crosslink signal (102–106 crosslink events per window). (D) Metaprofile of PURA crosslink events along transcripts with strongest PURA binding site in the 3′UTR (red) or CDS (beige). Smoothed density of crosslink events is shown against scaled positions within the respective region. (E) Sequence logo of the nucleotide composition in a 21-nt window around PURA binding sites reveals purine enrichment. Y-axis shows sequences constraint in bits. (F) Comparison of the relative frequency of 5-mers in PURA-bound RNAs (y-axis) and a random background (x-axis) is shown for three locations around the binding site: Left – [–27 nt; –8 nt] upstream of the binding site center, middle – [–7 nt; 7 nt] around the binding site center, right – [8 nt; 27 nt] downstream of the binding site center (see annotation above). 5-mers with three or more purines are colored in blue.

The metaprofiles of the distribution of crosslink events along the transcript regions for mRNAs with strongest binding sites in the 3′UTR or CDS (Figure 3D) was calculated and visualized in R using the Bioconductor package cliProfiler (http://bioconductor.org/packages/release/bioc/html/cliProfiler.html).

Comparison of crosslink patterns

We compared the distribution of PURA crosslink events from the iCLIP data for endogenous PURA (anti-PURA12D11) from HeLa cells to the two iCLIP datasets in which FLAG-tagged PURA was overexpressed in HeLa cells and immunoprecipitated with either anti-PURA12D11 or an anti-FLAG antibody using the following methods: First, to visualize the crosslink patterns in 3′UTRs, we generated heatmaps of the first 300 nt of 1550 3′UTRs or 573 CDS with intermediated PURA crosslink coverage (102–106 crosslink events per window). In order to account for expression level and other differences, the crosslink events in each window were scaled to the minimum (set to 0) and maximum (set to 1) therein (‘min-max normalization’) followed by spline-smoothing using the smooth.spline function (R stats package version 4.1.1, spar = 0.5) and inflated dimensions (dim = 500). The heatmaps show the min-max normalized, spline-smoothed crosslink patterns for the three iCLIP experiments (Figure 3C, Supplementary Figure S4B). Second, we assessed differences in the iCLIP signal using metaprofiles of the min-max normalized signal in a 65-nt window around all PURA binding sites identified for endogenous PURA in HeLa cells (Supplementary Figure S4C).

Motif analysis

As PURA predominantly acts on mature RNAs, all motif analyses were performed on the sequences of spliced transcripts. Binding sites were transferred from genomic to transcriptomic coordinates with the Bioconductor package GenomicFeatures (version 1.45.2) (49) using GENCODE transcript annotation (release 31, genome version GRCh38.p12) (30). 49,602 binding sites could be unambiguously assigned and were used for further analyses. Sequences around binding sites were extracted with Biostrings::getSeq (R package, version 2.61.2) (https://bioconductor.org/packages/release/bioc/html/Biostrings.html). For sequence logos, a 51-nt window centered at the binding site was used and logos are generated with ggseqlogo (R package, version 0.1). 23,734 out of 49,602 (47.8%) were centered on a G and 30,454 out of 49,602 (61.4%) enclosed three or more purines.

To search for enriched motifs of 5-nt length (5-mers) within and around the PURA binding sites, we calculated the frequencies of all overlapping 5-mers with Biostrings::oligonucleotideFrequency (version 2.61.2) (https://bioconductor.org/packages/release/bioc/html/Biostrings.html) in three windows: (i) [−7 nt; 7 nt], i.e. including the binding sites plus 5 nt flanking on either side, and (ii) [−27 nt; −8 nt] and [8 nt; 27 nt], i.e. representing the 20 nt on either side of the binding site-containing windows. For comparison, we randomly selected 49,602 5-nt windows from expressed transcripts (i.e. harboring at least one PURA crosslink event).

Accessibility prediction

We used RNAplfold (from the ViennaRNA package, version 2.4.17) (50) to predict the accessibility in and around PURA binding sites on transcripts of at least 501 nt length (n = 48,525) in comparison to randomly selected positions (n = 10,017). Binding sites were assigned to transcripts as described above. In brief, we used 501-nt windows either around PURA binding sites or randomly selected and predicted the probability for each single nucleotide to be single-stranded. RNAplfold was set to a sliding window w = 100 nt and a maximum span l = 30 nt. As the obtained unpaired probabilities follow a bimodal distribution, we transferred them to log-odds to obtain a bell-shaped distribution: log(odds ratio) = log(prob/1-prob). These were further converted into z-scores using the mean log-odds ratios of the binding sites versus the mean and standard deviation of the background regions (drawing 1000 regions for 1000 times). In Supplementary Figure S6I, results are displayed only for the 201 nt in the center of the 501-nt window as RNA folding is strongly biased toward single-strandedness at the edges of a given RNA sequence. P values were calculated from the z-scores using 2 × pnorm(-abs(z_score)) and then adjusted for multiple testing with Benjamini–Hochberg correction.

Comparison to published transcriptomes

We used published processed RNA-seq data comparing RNAs in stress granules (human osteosarcoma U-2 OS cells) or P-bodies (human embryonic kidney HEK293 cells) to total RNA (42,51), taking RNAs with FDR < 0.01 and log2-transformed fold change (l2fc) > 0 (as calculated by the cited authors) to belong to the respective P-body or stress granule transcriptome.

To determine the overlap of PURA-bound RNAs with dendritically localized RNAs, we used data from (52) that summarizes eight studies describing the dendritic transcriptome in mouse neurons from enrichment in RNA-seq data of dendrites compared to complete neurons. We used only RNAs that were matched to their human orthologs using the Ensembl database (version GRCh38.p13) (53) keeping only one-to-one orthologs (n = 5070). From these, only RNAs expressed in HeLa cells (baseMean > 0 in our RNA-seq data) were considered for the overlap (n = 4522). From the dendritically localized RNAs, we only considered RNAs that were found enriched in at least three of the eight studies from (52) (n = 337).

Significance of overlaps was tested using Fisher's exact test, and differences in fold change distributions between groups were tested using two-sided Wilcoxon Rank-sum test.

Functional enrichment analyses

We performed functional enrichment analysis in R using the ‘hypergeometric’ mode of the hypeR package (version 1.9.1) on the gene sets from REACTOME (version 7.5.1) and Gene Ontology (version 7.5.1) databases (54,55). HypeR adjusts the gene sets connected to every term specifically to genes present in a given background set. We used the following background sets for the enrichment analyses: All genes with at least one PURA crosslinked nucleotide as background for enrichment of genes bound by PURA (Figure 2E, Supplementary Figure S6E and F), all genes with a TPM > 0 in RNA-seq experiment as background for enrichment of RNA expression changes in PURA KD (Supplementary Figure S8B and C) and all proteins detected by MaxQuant from shotgun proteomics as background for enrichment of protein expression changes in PURA KD (Supplementary Figure S8D and E). We calculated the gene ratios as the number of detected genes in the respective experiment that belong to a given term divided by the number of genes from the background set of the term. P values obtained from HypeR were adjusted using Benjamini Hochberg adjustment and are referred to as FDR. We display the 25 terms with the highest FDR in Supplementary Figure S6E, F and Supplementary Figure S8B–E. A full list with the statistics of all terms can be found in Supplementary Table S3, S6, S7.

RESULTS

PURA localizes to the cytoplasm in multiple cell lines

In previous reports, PURA has been associated with both nuclear and cytoplasmic functions. To investigate the subcellular localization of PURA in different cell lines, we raised a monoclonal antibody against the human PURA protein. In the light of earlier studies in which antibodies cross-reacted between PURA and its close paralog PURB (18), we selected as epitope a 21 amino acid (aa) peptide in the unstructured linker region between PUR repeats I and II. This sequence harbors multiple substitutions between PURA and PURB (Supplementary Figure S1A and B) but is 100% identical to the mouse Pura ortholog. We confirmed in immunofluorescence (IF) stainings of HeLa cells and Western blots of cell lysates that the obtained antibody (anti-PURA12D11) specifically recognized the endogenous PURA protein, detected reduced levels upon PURA KD and showed no cross-reactivity against recombinant PURB in E. coli lysates (Supplementary Figure S1C–G). Hence, we used the in-house antibody for the analyses in this study.

Using anti-PURA12D11 in IF stainings, we assessed the localization of the endogenous PURA protein in five immortalized cell lines from a diverse set of tissue origins including epithelial, neuronal, kidney, lung and liver tissue. We found that in all cell lines, PURA located predominantly in the cytoplasm (Figure 1A). This could be further validated in nuclear-cytoplasmic fractionation experiments in HeLa cells (Figure 1B and C). These observations suggest that PURA may be primarily involved in cytoplasmic regulatory processes which we decided to elucidate further.

PURA acts as a global cytoplasmic RNA-binding protein

Since PURA was previously found to bind RNA in vitro (24), we next investigated whether PURA directly associates with RNAs in cells. We therefore performed individual-nucleotide resolution UV crosslinking and immunoprecipitation (iCLIP) experiments, which map the RNA binding of an RBP in living cells (45,56). Endogenous PURA was crosslinked to its bound RNAs in HeLa cells using UV radiation (254 nm) and subsequently immunoprecipitated with anti-PURA12D11 (Figure 2A). In total, we obtained more than 20 million unique PURA crosslink events from four biological replicates (Supplementary Table S1).

Assessing the global distribution of crosslink events before binding site definition, we observed that only a small fraction of PURA crosslink events occurred within introns (Supplementary Figure S2A). The relative depletion of introns in the iCLIP experiments suggests that PURA predominantly binds mature RNAs, consistent with its subcellular localization (Figure 1) and cytoplasmic mRNA binding.

We next defined PURA binding sites from the iCLIP crosslink events by peak calling and merging adjacent peaks into binding sites of 5-nt width. In total, we identified 55,031 PURA binding sites in the transcripts of 4391 genes (Figure 2B and C, Supplementary Table S2). The strength of PURA binding at these sites, measured as the relative number of crosslink events inside the binding site, was highly reproducible between replicates (Supplementary Figure S2B).

To assess PURA binding to RNAs by orthogonal methods, we performed electrophoretic mobility shift assays (EMSAs) on 8 selected PURA-bound RNAs. We thereby confirmed direct PURA binding to the iCLIP-identified RNA binding sites in STARD7, COX7C, NEAT1, LSM14A, NDUFS5, CLTA, CLTC and EEF2 (Figure 2D, Supplementary Figure S3). As controls, we used the previously described PURA target sequence MF0677 as well as the unspecific yeast RNA ASH1 E3 (Supplementary Figure S3K–M) (23). We conclude that the iCLIP-identified RNA targets can be bound by PURA in vitro, albeit with moderate specificity.

In a control experiment, we overexpressed FLAG-tagged PURA in HeLa cells and confirmed that comparable RNA binding profiles were obtained from immunoprecipitation with either anti-PURA12D11 or an anti-FLAG antibody (Supplementary Figure S4, Supplementary Table S1, Supplementary Material S1), underlining the specificity of our in-house antibody. However, in addition to the defined binding sites, the PURA overexpression led to increased crosslink events along transcripts in both experiments, indicating unspecific RNA binding by the unphysiologically abundant ectopic PURA. We therefore decided to focus on the RNA binding profiles detected for the endogenous PURA protein.

PURA was repeatedly found as a component of neural transport granules (16–20). Of note, even though HeLa cells are of non-neural origin, the proteins encoded by the PURA-bound RNAs were enriched in cellular compartments of the neuron, axon and dendrite, among others (Figure 2E). Also, more than half of previously reported dendritically localized RNAs (52) were bound by PURA in our iCLIP data (170 out of 337, 50.4%; Supplementary Figure S5A–C). To directly test RNA binding of PURA in neural cells, we performed an iCLIP experiment for endogenous PURA (anti-PURA12D11) in neural progenitor cells (NPCs) derived from human induced pluripotent stem cells. The NPC state was monitored by the rosette morphology of the cells and the expression of several known marker genes for NPC differentiation (Supplementary Figure S5D and E). Although we obtained an iCLIP dataset with only a limited signal depth due to the scarce material, the crosslink events for endogenous PURA in NPCs enriched within the PURA binding sites identified from HeLa cells, indicating a similar RNA binding behavior in both cell types (Figure 2C and F, Supplementary Figure S5F).

Altogether, we identified >50,000 binding sites of endogenous PURA in HeLa cells. The observed RNA binding pattern was independent of the antibody used and comparable between HeLa cells and NPCs. Hence, these results indicate that the binding sites identified in HeLa cells reflect the bona fide binding of PURA in living cells.

PURA binds in coding sequences and 3′UTRs

Examining the distribution of PURA binding, we found that 16% of all expressed transcripts (transcripts per million [TPM] > 0) harbored at least one PURA binding site (median 7 binding sites per transcript; Supplementary Figure S6A). The number of detected PURA binding sites moderately increased with the transcripts’ expression levels, most likely reflecting the better representation of more abundant RNAs in an iCLIP experiment (57) (Supplementary Figure S6B). Among the highly expressed transcripts (TPM > 10), 46% harbored at least one PURA binding site, underlining the widespread binding of PURA throughout the transcriptome (Supplementary Figure S6C and D).

The 4391 PURA-bound transcripts comprised predominantly protein-coding mRNAs (n = 4206, 95.5%, Figure 3A). These encoded for functions in nervous system development, mitochondrial processes, the innate immune system and RNA metabolism, among others (Supplementary Figure S6E and F, Supplementary Table S3). PURA also showed binding to 154 long non-coding RNAs (lncRNAs; Figure 3A), including the previously reported PURA target lncRNAs RN7SL1 (20,58), which has been implicated in transcriptional control, as well as NEAT1 and MALAT1, two architectural RNAs forming distinct nuclear bodies (59) (Supplementary Figure S6G). This supports that a minor fraction of PURA localizes to the nucleus and interacts with nuclear RNAs.

Within the protein-coding transcripts, the PURA binding sites were almost equally distributed between the coding sequences (CDS, n = 25,113) and the 3′ untranslated regions (3′UTRs, n = 24,553; Figure 3BD, Supplementary Figure S6H), which together accounted for 93.6% of all binding sites. In contrast, 5′UTRs and intronic regions were almost completely devoid of PURA binding sites. Within the 3′UTRs or CDS, PURA usually displayed one or more defined peaks of binding, as seen in STARD7 mRNA (Figures 2C and 3C, Supplementary Figure S4B).

The transcriptome-wide profiles enabled us to revisit the RNA sequence and structure preferences of PURA. We therefore predicted the local folding probability around PURA binding sites. This suggested a propensity of PURA’s RNA binding sites to be single-stranded (Supplementary Figure S6I), contrasting past reports about its double-stranded nucleic acid binding preference and unwinding ability (23,60). The increased single-strandedness coincided with an apparent sequence preference. Beyond the increased uridine content, likely reflecting the known UV-crosslinking bias (61), we observed a strong enrichment for purines (A and G) within and around the PURA binding sites: 47.8% of binding sites were centered on a guanine and 61.4% enclosed three or more purines (Figure 3E, Supplementary Figure S6J). Globally, the binding sites and their surrounding sequences were enriched for k-mers comprising purine combinations like GAAGA and AAGAA (Figure 3F). Together, these observations underline the specificity of PURA to bind single-stranded, purine-rich regions, primarily in CDS and 3′UTRs.

Loss of PURA causes widespread changes in gene and protein expression

While iCLIP data revealed the RNAs directly bound by PURA, they do not inform about the regulatory effects of binding events on such transcripts. In order to globally determine the impact of PURA on gene expression, we depleted PURA in HeLa cells using siRNAs (Supplementary Figures S1C–G, S7). This partial loss of PURA mimics the haploinsufficiency in PURA Syndrome patients that harbor heterozygous mutations in the PURA gene. By performing transcriptomic analyses, we observed widespread changes in transcript abundance upon PURA knockdown (KD), relative to control HeLa cells: In total, 3415 transcripts were significantly down- or upregulated upon PURA KD (false discovery rate [FDR] < 0.01, Figure 4A, Supplementary Table S4).

To test for changes at the protein level, we performed mass-spectrometric analysis of protein extracts from PURA KD and control HeLa cells. We detected a total of 4351 proteins with good confidence (peptide spectrum match FDR < 1% and protein FDR < 5%) and observed a significant regulation of 995 proteins upon PURA KD (FDR < 0.05, Figure 4B, Supplementary Table S5). Of these, 334 proteins were significantly regulated also at the transcript level, with the majority changing in the same direction in both RNA expression and protein abundance (249, 74.6%, Figure 4C and D, Supplementary Figure S8A).

Gene ontology (GO) analysis revealed that the regulated transcripts encoded for diverse functional categories, including mitochondrial functions, vesicle-mediated transport, membrane trafficking and RNA metabolism (Supplementary Figure S8B and C, Supplementary Table S6). Similarly, the regulated proteins were enriched in mitochondrial and signaling functions and several membrane and granule components (Supplementary Figure S8D, E, Supplementary Table S7).

Out of 334 proteins whose levels changed at both transcript and protein level in response to PURA KD, 234 were independently identified as PURA-bound RNAs by iCLIP (Figure 4E). These showed similar changes in RNA and protein expression upon PURA KD, irrespective of the location of PURA binding sites in the CDS or 3′UTR (Figure 4C, Supplementary Figure S9A–F).

The most significantly downregulated transcript was the mRNA encoding the transcription factor CUX1 (Figure 4AD), which is involved in the control of neuronal differentiation by specifically regulating dendrite development, branching, and spine formation in cortical neurons (62). Another strongly regulated PURA target was the cytoskeleton associated Catenin alpha-1 (CTNNA1) that links actin and adherence junction components (63). Furthermore, a coherent downregulation upon PURA KD was also observed for the PURA-bound RNAs encoding for the autophagy receptor Sequestome 1 (SQSTM1; also known as p62), consistent with a previous report (64), and the essential P-body component LSM14A. The downregulation of the SQSTM1 and LSM14A proteins was orthogonally confirmed by Western blot analysis (Figure 4FI).

Due to the very different detection limits of RNA-seq and shotgun proteomics, it is likely that we did not detect all direct PURA targets in both data sets. To assess a representative target that did not pass the threshold for a reliable quantification in our proteomics data, we analyzed the levels of interleukin-6 cytokine family signal transducer (IL6ST) using fluorescence microscopy (Figure 4J and K, Supplementary Figure S9G) and Western blot (Figure 4L and M). As suggested in our RNA-seq data, we observed an increased signal intensity and accumulation of IL6ST in the perinuclear region upon PURA KD in both HeLa and normal human dermal fibroblast (NHDF) cells, supporting a role of PURA in IL6ST protein expression.

In essence, we find that loss of PURA results in widespread changes in gene and protein expression, affecting central cellular features such as mitochondria, autophagy and granule assembly. The quantitative comparison of transcriptome and proteome suggests that the majority of changes might originate at the RNA level. Ultimately, we identify a set of 234 targets that change consistently on RNA and protein level upon PURA KD and are bound by PURA on the RNA level. As the example of IL6ST shows, the list of PURA targets could potentially be extended beyond this stringent set of targets that are coherently identified by iCLIP, RNA-seq and proteomics.

PURA binds to P-body transcripts and localizes to P-bodies

Under unstressed conditions, we observed the PURA immunofluorescence signal in cytoplasmic foci (Figure 1). To obtain a first indication what kind of foci these might be, we overlapped the PURA-bound RNAs with transcripts that were enriched in P-bodies (42) or stress granules (51). In line with the previously reported PURA localization to stress granules (21), more than half of the stress granule-enriched transcripts were bound by PURA (44%, P value < 2.2 × 10−16, Fisher's exact test; Figure 5A, Supplementary Figure S10A–C). Surprisingly, however, we detected an almost equally strong overlap with transcripts enriched in P-bodies (42) (43%, P value < 2.2 × 10−16, Fisher's exact test; Figure 5A, Supplementary Figure S10D–F), indicating that PURA might be linked to these granules as well.

To test for a direct association, we used immunofluorescence staining to overlay PURA with the P-body marker proteins DCP1A and LSM14A. Indeed, we identified a strong colocalization of PURA with both marker proteins in HeLa and NHDF cells (Figure 5B and C, Supplementary Figure S10G–J). Of note, we detected PURA in 88.5% of all microscopically visible DCP1A-positive granules and in 77.8% of all LSM14A-positive granules (Figure 5D). This indicates that under basal growth conditions, PURA efficiently localizes to P-bodies in HeLa and NHDF cells.

PURA depletion reduces the number of P-bodies per cell

Based on the strong association of PURA with P-bodies, we wondered whether PURA binding directly affects the abundance of the P-body-associated RNAs. Intriguingly, the P-body-enriched transcripts were selectively upregulated upon PURA KD, while the remaining RNAs were not affected (Figure 5E). Based on these observations, we hypothesized that P-bodies might be altered in the absence of PURA.

In order to focus on PURA’s role in the physiological context of unstressed cells, we inspected the PURA iCLIP data from HeLa cells and were intrigued to find that PURA directly bound to LSM14A and DDX6 mRNAs, encoding two P-body core components (Figure 6A and B). Moreover, LSM14A was strongly downregulated upon PURA KD at the transcript as well as protein level, and DDX6 was downregulated at the protein level (Figure 4AD). This was orthogonally validated by a significant reduction in LSM14A and DDX6 protein in Western blot analysis after PURA KD, relative to control HeLa cells (Figure 4H and I, Supplementary Figure S11A and B).

Figure 6.

Figure 6.

PURA KD leads to a loss of P-bodies. (A, B) Genome browser view shows the pileup of crosslink events of endogenous PURA from HeLa cells at binding sites (red) in the LSM14A (A) and DDX6 (B) transcripts. (CH) Left: Immunofluorescence staining of PURA (red) together with the P-body proteins LSM14A (C), DCP1A (E) and DDX6 (G) (measured in 555 nm channel, depicted in green) and DAPI (blue) as a nuclear stain in CTRL (top) and PURA KD (bottom) HeLa cells. Scale bars, 10 μm. Right: Quantification (ImageJ) of LSM14A-positive granules on approximately 400 cells per replicate (n = 3) comparing CTRL and PURA KD conditions. Shown is the number of granules positive for LSM14A (D), DCP1A (F) and DDX6 (H) per cell for all replicates. Three biological replicates (independent cultures) are indicated by different icons, with each containing 4 technical replicates (multiple images). P value from unpaired two-sided Student's t-test on all replicates.

Strikingly, when analyzing either LSM14A-, DCP1A- or DDX6-positive granules in PURA KD cells using IF staining, we observed a drastic decrease in granule numbers per cell using all markers (Figure 6CH). Although several proteins modulate the appearance of P-bodies, LSM14A and DDX6 are two of a handful of currently known factors that have been shown to be essential for P-body formation (41). To test whether the PURA-dependent reduction of LSM14A and DDX6 globally impairs P-bodies, we visualized P-body occurrence using IF staining for the P-body marker DCP1A. Remarkably, PURA KD led to a dramatic reduction in P-body number in HeLa and NHDF cells (Figure 6CH, Supplementary Figure S11E–H). Importantly, in contrast to LSM14A and DDX6 (Figure 4H and I, Supplementary Figure S11A and B), the protein levels of the P-body marker DCP1A remained unchanged upon PURA KD (Supplementary Figure S11C and D), suggesting that PURA acts selectively by modulating the LSM14A and DDX6 levels in tested cell lines. These observations were validated using a second siRNA against PURA (Supplementary Figure S11I and J).

Together, our data suggest that PURA binds the LSM14A and DDX6 mRNAs and regulates the concomitant protein levels. It is conceivable that the combined reduction of both factors causes the almost complete loss of P-bodies upon PURA KD. Given the important role of P-bodies in cellular RNA surveillance, our findings reveal a putative link between the neuronal PURA Syndrome and the cytoplasmic control of mRNA levels and P-body condensation.

DISCUSSION

PURA is a cytoplasmic RNA-binding protein

The PURA protein was initially described as a transcription factor with binding preferences for purine-rich single-stranded DNA regions (13,14). Consistent with a nuclear role of PURA, we do observe binding of PURA to nuclear RNAs such as NEAT1 and MALAT1. The minor fraction of PURA in the nucleus might be involved in transcriptional regulation as reported previously (3,15). While this might be one of PURA's functions, it was subsequently shown to associate with cellular RNAs and to bind single-stranded DNA and RNA sequences in vitro in the same way and with similar affinities (23). In this study, we developed a paralog-specific antibody to demonstrate that the vast majority of PURA protein resides in the cytoplasm in a range of different cell lines. This is in line with more recent studies describing PURA in cytoplasmic RBP granules (17–19,21). It also underlines the need to study PURA’s function in processes other than DNA regulation.

To date, a physiological RNA binding function of PURA had only been shown for a few specific target RNAs like the lncRNA RN7SL1 (20,58) or the circular RNAs circSamD4 (65) and circCwc27 (66). Most other studies were based on either in vitro binding assays, colocalization studies or enrichment in membrane-less organelles (reviewed in (3)). Here, we provide the first transcriptome-wide analysis of PURA’s RNA binding in living cells. Our iCLIP data show that PURA binds to almost half of all expressed mRNAs, thereby bearing the potential to regulate a multitude of different cellular processes (see below).

In line with the first description of PURA as purine-rich element binding protein (13,14,67,68), we identified a binding preference for purine-rich RNA regions. Similar to previous studies (23,24), we did not observe a distinct sequence motif. A degenerate sequence preference is characteristic for many RBPs (69) and might suggest that PURA’s RNA recognition in cells is driven by additional factors, such as RNA secondary structure or co-factor binding.

PURA regulates mRNAs associated with mitochondria, neural transport granules and autophagy

Omics allow for the global characterization and quantification of pools of biological molecules that translate into a holistic view of cellular functions and dynamics. Our integration of multiscale omics data revealed widespread changes in expression upon loss of PURA in cells. We identify >3000 dysregulated transcripts upon PURA KD and almost 1000 dysregulated proteins. Despite the high sequence similarity to its close paralog PURB, we do not observe a compensatory upregulation of PURB on RNA or protein level upon PURA depletion (Figure 4A, B and D), speaking against compensatory effects in the data. The observed changes could result from complementary RNA-regulatory mechanisms, which may include translation, RNA stability or degradation, RNA transport as well as subcellular localization and phase condensation, among others.

Integration with transcriptome-wide RNA binding sites revealed 234 putative direct targets of PURA. The molecular roles of these targets link PURA to various processes including interleukin signaling, autophagy, mitochondrial homeostasis and neurodevelopment. One interesting target that was directly bound by PURA and upregulated upon PURA KD is the IL6ST mRNA encoding interleukin-6 cytokine family signal transducer (IL6ST; also known as gp130). IL6ST is a signal transducer shared by many cytokines, including interleukin 6, ciliary neurotrophic factor, leukemia inhibitory factor, and oncostatin M (70). Patients with PURA Syndrome appear to have a compromised immune response that may be responsible for the severe infections reported by affected families (personal communication). It is possible that an overexpression of IL6ST upon PURA haploinsufficiency may contribute to such symptoms.

Another direct PURA target with potential links to the phenotype of PURA Syndrome patients is the SQSTM1 mRNA encoding the autophagy receptor Sequestome 1 (SQSTM1; also known as p62). In a zebrafish model for C9 ALS, RNA-triggered toxicity could be rescued through ectopic overexpression of PURA, and this rescue could be related to the upregulation of SQSTM1 by PURA (64). Interestingly, we observed that PURA directly binds to SQSTM1 mRNA and multiple other targets from the autophagy pathway. This suggests that PURA may act as a modulator of autophagy in human cells, which has long been proposed as a neuroprotective mechanism in cellular surveillance (71).

Furthermore, several of the newly identified PURA targets that are downregulated upon PURA KD are involved in mitochondrial functions, including the STARD7, COX7C and FIS1 mRNAs. Indeed, mitochondrial respiration was already described to be impaired upon PURA depletion (72). Since mitochondrial dysfunctions are associated with various neuronal disorders (73), it is tempting to speculate that mitochondrial defects caused by PURA haploinsufficiency may contribute to the neurodevelopmental phenotype of PURA Syndrome patients. This may converge with direct effects of PURA on neurodevelopmentally relevant targets. In line with this, PURA has been associated with neuronal RNA transport in different studies in mouse and human cells. For instance, a misdistribution of the dendritic marker protein MAP2 was reported in one PURA KO mouse model (8). The direct PURA targets identified in our study will be relevant in establishing the neuronal and neurodevelopmental functions of PURA in the future.

PURA depletion leads to an upregulation of P-body-associated transcripts and a loss of P-bodies

Cytoplasmic RBPs are known to play important roles in the formation of cytoplasmic membrane-less organelles such as P-bodies and stress granules (74). While omics studies provided a broad view on their transcriptomes and proteomes (42,51), only a small number of proteins are considered essential for P-body formation, including DDX6, 4E-T and LSM14A (41).

Here, we show that PURA is enriched in P-bodies and binds to more than 40% of the P-body transcriptome (42). Interestingly, upon PURA KD, P-body-enriched transcripts are globally upregulated. This indicates that RNAs stored in P-bodies might be destabilized in these condensates, while they are potentially selectively stabilized upon dissolving of P-bodies. It will require further studies to understand if this observation has functional consequences, for instance for the etiology of patients with PURA Syndrome. In addition, we find that depletion of PURA in different human cell lines leads to a loss of P-bodies. Interestingly, this observation is linked to direct binding of PURA to the LSM14A and DDX6 mRNAs, a downregulation of LSM14A mRNA and a concomitant downregulation of the LSM14A and DDX6 proteins. Since both proteins are essential P-body factors, their combined PURA-dependent depletion most likely triggers the observed loss of P-bodies. Interestingly, full-length mutant DDX6 protein is associated with another neurodevelopmental disorder in which a mutation in the DDX6 gene leads to several phenotypes including intellectual disability, as observed in PURA Syndrome (75). It would be interesting to understand the similarities in phenotypes between patients with PURA and DDX6 mutations.

P-bodies were initially proposed as a location of RNA depletion (76). Seemingly in line with this, we find that RNAs of the P-body transcriptome are globally upregulated upon PURA KD and P-body loss. However, more recently P-bodies were proposed to act in RNA storage rather than depletion (42). Moreover, stress granules and P-bodies were shown to partially share their proteomes and transcriptomes (42) and suggested to dynamically interact with each other (77,78). Since in a previous study (21), PURA was found in stress granules under stressed conditions and we now show its P-body localization, it becomes clear that PURA is one of the proteins shared between both granule types.

Implications for understanding the etiology of the PURA Syndrome

As PURA Syndrome is described to result from a haploinsufficiency and thereby lowered cellular PURA levels, our findings on PURA-dependent, LSM14A/DDX6-mediated P-body formation could potentially relate to a molecular dysfunction in patients. Altered granule dynamics might play a role in the patho-mechanism of PURA Syndrome. Similarly, the observed changes in the sequestome degradation machinery and the mitochondrial proteome may be in functional correlation to symptoms of PURA Syndrome patients. Among the most downregulated targets was also the transcription factor CUX1 that is involved in the control of neuronal differentiation (62). This and other factors suggest a connection of our molecular observations of misregulated factors to the neurodevelopmental delay of patients.

While the above-mentioned potential links are highly suggestive, it should be considered that the spectrum of symptoms of PURA Syndrome patients is broad and variable (6,7). Furthermore, the understanding of the medical condition of patients has only begun to be investigated, rendering it very difficult to directly and safely correlate molecular PURA-dependent events to symptoms of patients with haploinsufficiency in the PURA gene. This consideration is even more important as not all molecular changes are likely to have an impact on the symptoms observed in humans. While this study opened the door for an unbiased and systematic understanding of molecular PURA-dependent pathways, it will require more research on both ends, clinical and non-clinical, to convincingly connect our findings with the symptoms in PURA Syndrome patients.

DATA AVAILABILITY

Uncropped images are available in Supplementary Figure S12. All sequencing data is available in the Gene Expression Omnibus (GEO) under the SuperSeries accession number GSE193905. The collection includes the RNA-seq data from PURA KD and CTRL HeLa cells (GSE193900) as well as the iCLIP data for endogenous PURA (anti-PURA12D11 antibody) in HeLa cells (GSE193901) and NPC (GSE193902) as well as for overexpressed FLAG-PURA in HeLa cells precipitated with anti-PURA12D11 (GSE193904) and anti-FLAG antibody (GSE193903). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (79) partner repository with the dataset identifier PXD030266. Scripts used to process the files are accessible under the Zenodo repository located at: https://doi.org/10.5281/zenodo.7273088.

Supplementary Material

gkac1237_Supplemental_Files

ACKNOWLEDGEMENTS

The authors would like to thank the members of the Niessing and Zarnack labs for support and discussion. We thank Saskia Hutten for help with immunofluorescence experiments, Annika Niedner-Boblenz and Nadine Körtel for help in setting up iCLIP experiments, and Stefan Krebs for help in planning and performing the sequencing runs. We also thank Jana Tretter, Verena Kirchner, Elke Pertler, Deniz Yavru, Hania Ahmed, Robert Janowski and Vera Roman for advice and technical assistance. We particularly thank You Zhou, Mirko Brüggemann and Mario Keller for internal code review. We would like to thank Ruth Brack-Werner for providing us with additional cell lines, Robert Schneider for using their BioAnalyzer and the confocal microscopy facility at HMGU for using their microscopes. We also thank Eran Hornstein for helpful comments and discussions, the PURA-Syndrome Foundation and the Global PURA-Research Network (GRN).

Author contributions: L.M. performed iCLIP, PURA knockdown, immunofluorescence (IF), Western blot and EMSA, validated the anti-PURA12D11 antibody and analyzed data. M.K. performed most bioinformatics analyses of iCLIP, RNA-seq and shotgun proteomics experiments. S.Ba. performed and quantified IF experiments of PURA and P-body marker proteins and IL6ST as well as Western blots and EMSAs. J.M.-P. prepared, measured and analyzed shotgun proteomics samples. N.S. performed IF experiments for the PURA cell atlas. S.Bu. performed nuclear-cytoplasmic extraction and sequence alignments. C.K. performed Western blot experiments. E.R. differentiated hiPSCs to NPCs and validated NPCs for subsequent iCLIP experiments. D.T. performed Western blot experiments for SQSTM1. A.P. reprogrammed and validated the HMGU12 hiPS cell line. M.P. assisted with IF experiments for P-body marker DCP1A. S.R. performed RNA extraction and library preparation for RNA-seq samples. H.B. planed and supervised sequencing of iCLIP and RNA-seq libraries. A.B. performed initial analysis of iCLIP data sets. The antibody facility headed by R.F. generated the in-house anti-PURA12D11 antibody. S.H. planed and supervised shotgun proteomics experiments. M.D. planed and supervised reprogramming and differentiation experiments of hiPSC. P.F.-P. planed and supervised the validation of autophagy marker SQSTM1. J.K. planed and supervised iCLIP experiments. Study was designed by L.M., M.K., K.Z. and D.N. K.Z. supervised most of the bioinformatics analyses. D.N. supervised most of the experimental work. L.M., M.K., K.Z. and D.N. wrote the manuscript with comments from all co-authors.

Contributor Information

Lena Molitor, Institute of Structural Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Melina Klostermann, Buchmann Institute for Molecular Life Sciences (BMLS) and Institute of Molecular Biosciences, Goethe University Frankfurt, 60438 Frankfurt a.M., Germany.

Sabrina Bacher, Institute of Structural Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Juliane Merl-Pham, Metabolomics and Proteomics Core, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Nadine Spranger, Institute of Structural Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Sandra Burczyk, Institute of Pharmaceutical Biotechnology, Ulm University, 89081 Ulm, Germany.

Carolin Ketteler, Institute of Structural Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Ejona Rusha, Induced Pluripotent Stem Cell Core Facility, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Daniel Tews, Department of Pediatrics and Adolescent Medicine, Ulm University Medical Center, 89070 Ulm, Germany.

Anna Pertek, Induced Pluripotent Stem Cell Core Facility, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Marcel Proske, Institute of Structural Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany; Institute of Pharmaceutical Biotechnology, Ulm University, 89081 Ulm, Germany.

Anke Busch, Institute of Molecular Biology (IMB), 55128 Mainz, Germany.

Sarah Reschke, Laboratory for Functional Genome Analysis, Gene Center, Ludwig-Maximilians University Munich, 81377 Munich, Germany.

Regina Feederle, Monoclonal Antibody Core Facility, Institute for Diabetes and Obesity, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Stefanie M Hauck, Metabolomics and Proteomics Core, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany.

Helmut Blum, Laboratory for Functional Genome Analysis, Gene Center, Ludwig-Maximilians University Munich, 81377 Munich, Germany.

Micha Drukker, Institute of Stem Cell Research, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany; Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research (LACDR), Leiden University, 2333 CC Leiden, The Netherlands.

Pamela Fischer-Posovszky, Department of Pediatrics and Adolescent Medicine, Ulm University Medical Center, 89070 Ulm, Germany.

Julian König, Institute of Molecular Biology (IMB), 55128 Mainz, Germany.

Kathi Zarnack, Buchmann Institute for Molecular Life Sciences (BMLS) and Institute of Molecular Biosciences, Goethe University Frankfurt, 60438 Frankfurt a.M., Germany.

Dierk Niessing, Institute of Structural Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany; Institute of Pharmaceutical Biotechnology, Ulm University, 89081 Ulm, Germany.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Deutsche Forschungsgemeinschaft (DFG) via the Research Unit FOR 2333 [ZA 881/3-1 to K.Z., KO 4566/5-1 to J.K., Ni1110/5-1, 6-2, 7-1/2 to D.N.]; Heisenberg professorship [398707781 to P.F.-P.]; research grant [TE012-2/2 to D.T.]; SFB 902 [B13 to K.Z.]; SPP 1935 [Ni1110/8-1 to D.N., ZA 881/5-2 to K.Z.]; Science Award of the Care-for-Rare Foundation [to D.N.]. Funding for open access charge: Intramural funding from Ulm University.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Gebauer F., Schwarzl T., Valcárcel J., Hentze M.W.. RNA-binding proteins in human genetic disease. Nat. Rev. Genet. 2021; 22:185–198. [DOI] [PubMed] [Google Scholar]
  • 2. Gerstberger S., Hafner M., Tuschl T.. A census of human RNA-binding proteins. Nat. Rev. Genet. 2014; 15:829–845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Molitor L., Bacher S., Burczyk S., Niessing D. The molecular function of PURA and its implications in neurological diseases. Front Genet. 2021; 12:638217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Hunt D., Leventer R.J., Simons C., Taft R., Swoboda K.J., Gawne-Cain M., study D.D.D., Magee A.C., Turnpenny P.D., Baralle D. Whole exome sequencing in family trios reveals de novo mutations in PURA as a cause of severe neurodevelopmental delay and learning disability. J. Med. Genet. 2014; 51:806–813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Lalani S.R., Zhang J., Schaaf C.P., Brown C.W., Magoulas P., Tsai A.C., El-Gharbawy A., Wierenga K.J., Bartholomew D., Fong C.T.et al.. Mutations in PURA cause profound neonatal hypotonia, seizures, and encephalopathy in 5q31.3 microdeletion syndrome. Am. J. Hum. Genet. 2014; 95:579–583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Reijnders M.R.F., Janowski R., Alvi M., Self J.E., van Essen T.J., Vreeburg M., Rouhl R.P.W., Stevens S.J.C., Stegmann A.P.A., Schieving J.et al.. PURA syndrome: clinical delineation and genotype-phenotype study in 32 individuals with review of published literature. J. Med. Genet. 2018; 55:104–113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Johannesen K.M., Gardella E., Gjerulfsen C.E., Bayat A., Rouhl R.P.W., Reijnders M., Whalen S., Keren B., Buratti J., Courtin T.et al.. PURA-Related developmental and epileptic encephalopathy: phenotypic and genotypic spectrum. Neurol Genet. 2021; 7:e613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Hokkanen S., Feldmann H.M., Ding H., Jung C.K., Bojarski L., Renner-Müller I., Schüller U., Kretzschmar H., Wolf E., Herms J.. Lack of Pur-alpha alters postnatal brain development and causes megalencephaly. Hum. Mol. Genet. 2012; 21:473–484. [DOI] [PubMed] [Google Scholar]
  • 9. Khalili K., Del Valle L., Muralidharan V., Gault W.J., Darbinian N., Otte J., Meier E., Johnson E.M., Daniel D.C., Kinoshita Y.et al.. Puralpha is essential for postnatal brain development and developmentally coupled cellular proliferation as revealed by genetic inactivation in the mouse. Mol. Cell. Biol. 2003; 23:6857–6875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Deciphering Developmental Disorders Study Large-scale discovery of novel genetic causes of developmental disorders. Nature. 2015; 519:223–228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Huang N., Lee I., Marcotte E.M., Hurles M.E.. Characterising and predicting haploinsufficiency in the human genome. PLoS Genet. 2010; 6:e1001154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Barbe M.F., Krueger J.J., Loomis R., Otte J., Gordon J.. Memory deficits, gait ataxia and neuronal loss in the hippocampus and cerebellum in mice that are heterozygous for Pur-alpha. Neuroscience. 2016; 337:177–190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Bergemann A.D., Johnson E.M.. The HeLa Pur factor binds single-stranded DNA at a specific element conserved in gene flanking regions and origins of DNA replication. Mol. Cell. Biol. 1992; 12:1257–1265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Bergemann A.D., Ma Z.W., Johnson E.M.. Sequence of cDNA comprising the human pur gene and sequence-specific single-stranded-DNA-binding properties of the encoded protein. Mol. Cell. Biol. 1992; 12:5673–5682. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Daniel D.C., Johnson E.M.. PURA, the gene encoding pur-alpha, member of an ancient nucleic acid-binding protein family with mammalian neurological functions. Gene. 2018; 643:133–143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. El Fatimy R., Davidovic L., Tremblay S., Jaglin X., Dury A., Robert C., De Koninck P., Khandjian E.W.. Tracking the fragile X mental retardation protein in a highly ordered neuronal RiboNucleoParticles population: a link between stalled polyribosomes and RNA granules. PLoS Genet. 2016; 12:e1006192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Johnson E.M., Kinoshita Y., Weinreb D.B., Wortman M.J., Simon R., Khalili K., Winckler B., Gordon J.. Role of Pur alpha in targeting mRNA to sites of translation in hippocampal neuronal dendrites. J. Neurosci. Res. 2006; 83:929–943. [DOI] [PubMed] [Google Scholar]
  • 18. Kanai Y., Dohmae N., Hirokawa N.. Kinesin transports RNA: isolation and characterization of an RNA-transporting granule. Neuron. 2004; 43:513–525. [DOI] [PubMed] [Google Scholar]
  • 19. Mitsumori K., Takei Y., Hirokawa N.. Components of RNA granules affect their localization and dynamics in neuronal dendrites. Mol. Biol. Cell. 2017; 28:1412–1417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Ohashi S., Koike K., Omori A., Ichinose S., Ohara S., Kobayashi S., Sato T.A., Anzai K.. Identification of mRNA/protein (mRNP) complexes containing Pur-alpha, mStaufen, fragile X protein, and myosin va and their association with rough endoplasmic reticulum equipped with a kinesin motor. J. Biol. Chem. 2002; 277:37804–37810. [DOI] [PubMed] [Google Scholar]
  • 21. Daigle J.G., Krishnamurthy K., Ramesh N., Casci I., Monaghan J., McAvoy K., Godfrey E.W., Daniel D.C., Johnson E.M., Monahan Z.et al.. Pur-alpha regulates cytoplasmic stress granule dynamics and ameliorates FUS toxicity. Acta Neuropathol. 2016; 131:605–620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Janowski R., Niessing D. The large family of PC4-like domains - similar folds and functions throughout all kingdoms of life. RNA Biol. 2020; 17:1228–1238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Weber J., Bao H., Hartlmuller C., Wang Z., Windhager A., Janowski R., Madl T., Jin P., Niessing D. Structural basis of nucleic-acid recognition and double-strand unwinding by the essential neuronal protein pur-alpha. Elife. 2016; 5:1197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Graebsch A., Roche S., Niessing D. X-ray structure of Pur-alpha reveals a whirly-like fold and an unusual nucleic-acid binding surface. Proc. Natl. Acad. Sci. U.S.A. 2009; 106:18521–18526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Feederle R., Gerber J.K., Middleton A., Northrup E., Kist R., Kremmer E., Peters H.. Generation of Pax1/PAX1-Specific monoclonal antibodies. Monoclon Antib Immunodiagn Immunother. 2016; 35:259–262. [DOI] [PubMed] [Google Scholar]
  • 26. Lin Y.J., Huang L.H., Huang C.T.. Enhancement of heterologous gene expression in Flammulina velutipes using polycistronic vectors containing a viral 2A cleavage sequence. PLoS One. 2013; 8:e59099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Krendl C., Shaposhnikov D., Rishko V., Ori C., Ziegenhain C., Sass S., Simon L., Muller N.S., Straub T., Brooks K.E.et al.. GATA2/3-TFAP2A/C transcription factor network couples human pluripotent stem cell differentiation to trophectoderm with repression of pluripotency. Proc. Natl. Acad. Sci. U.S.A. 2017; 114:E9579–E9588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Dobin A., Davis C.A., Schlesinger F., Drenkow J., Zaleski C., Jha S., Batut P., Chaisson M., Gingeras T.R.. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013; 29:15–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Anders S., Pyl P.T., Huber W.. HTSeq–a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015; 31:166–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Frankish A., Diekhans M., Ferreira A.M., Johnson R., Jungreis I., Loveland J., Mudge J.M., Sisu C., Wright J., Armstrong J.et al.. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019; 47:D766–D773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Love M.I., Huber W., Anders S.. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15:550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Wisniewski J.R., Zougman A., Nagaraj N., Mann M.. Universal sample preparation method for proteome analysis. Nat. Methods. 2009; 6:359–362. [DOI] [PubMed] [Google Scholar]
  • 33. Grosche A., Hauser A., Lepper M.F., Mayo R., von Toerne C., Merl-Pham J., Hauck S.M.. The proteome of native adult muller glial cells from murine retina. Mol. Cell. Proteomics. 2016; 15:462–480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Tyanova S., Temu T., Cox J.. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protoc. 2016; 11:2301–2319. [DOI] [PubMed] [Google Scholar]
  • 35. Cox J., Neuhauser N., Michalski A., Scheltema R.A., Olsen J.V., Mann M.. Andromeda: a peptide search engine integrated into the MaxQuant environment. J. Proteome Res. 2011; 10:1794–1805. [DOI] [PubMed] [Google Scholar]
  • 36. Cox J., Hein M.Y., Luber C.A., Paron I., Nagaraj N., Mann M.. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol. Cell. Proteomics. 2014; 13:2513–2526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Yu S.H., Kyriakidou P., Cox J.. Isobaric matching between runs and novel PSM-Level normalization in MaxQuant strongly improve reporter ion-Based quantification. J. Proteome Res. 2020; 19:3945–3954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Zhu Y., Orre L.M., Zhou Tran Y., Mermelekas G., Johansson H.J., Malyutina A., Anders S., Lehtio J.. DEqMS: a method for accurate variance estimation in differential protein expression analysis. Mol. Cell. Proteomics. 2020; 19:1047–1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Edelmann F.T., Niedner A., Niessing D. Production of pure and functional RNA for in vitro reconstitution experiments. Methods. 2014; 65:333–341. [DOI] [PubMed] [Google Scholar]
  • 40. Pfaffl M.W. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001; 29:e45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Ayache J., Benard M., Ernoult-Lange M., Minshall N., Standart N., Kress M., Weil D. P-body assembly requires DDX6 repression complexes rather than decay or Ataxin2/2L complexes. Mol. Biol. Cell. 2015; 26:2579–2595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Hubstenberger A., Courel M., Benard M., Souquere S., Ernoult-Lange M., Chouaib R., Yi Z., Morlot J.B., Munier A., Fradet M.et al.. P-Body purification reveals the condensation of repressed mRNA regulons. Mol. Cell. 2017; 68:144–157. [DOI] [PubMed] [Google Scholar]
  • 43. Bolte S., Cordelieres F.P.. A guided tour into subcellular colocalization analysis in light microscopy. J. Microsc. 2006; 224:213–232. [DOI] [PubMed] [Google Scholar]
  • 44. Reinhardt P., Glatza M., Hemmer K., Tsytsyura Y., Thiel C.S., Hoing S., Moritz S., Parga J.A., Wagner L., Bruder J.M.et al.. Derivation and expansion using only small molecules of human neural progenitors for neurodegenerative disease modeling. PLoS One. 2013; 8:e59252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Buchbender A., Mutter H., Sutandy F.X.R., Körtel N., Hänel H., Busch A., Ebersberger S., König J.. Improved library preparation with the new iCLIP2 protocol. Methods. 2020; 178:33–48. [DOI] [PubMed] [Google Scholar]
  • 46. Roehr J.T., Dieterich C., Reinert K.. Flexbar 3.0 - SIMD and multicore parallelization. Bioinformatics. 2017; 33:2941–2942. [DOI] [PubMed] [Google Scholar]
  • 47. Busch A., Brüggemann M., Ebersberger S., Zarnack K.. iCLIP data analysis: a complete pipeline from sequencing reads to RBP binding sites. Methods. 2020; 178:49–62. [DOI] [PubMed] [Google Scholar]
  • 48. Krakau S., Richard H., Marsico A.. PureCLIP: capturing target-specific protein-RNA interaction footprints from single-nucleotide CLIP-seq data. Genome Biol. 2017; 18:240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Lawrence M., Huber W., Pages H., Aboyoun P., Carlson M., Gentleman R., Morgan M.T., Carey V.J.. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 2013; 9:e1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Bernhart S.H., Hofacker I.L., Stadler P.F.. Local RNA base pairing probabilities in large sequences. Bioinformatics. 2006; 22:614–615. [DOI] [PubMed] [Google Scholar]
  • 51. Khong A., Matheny T., Jain S., Mitchell S.F., Wheeler J.R., Parker R.. The stress granule transcriptome reveals principles of mRNA accumulation in stress granules. Mol. Cell. 2017; 68:808–820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Middleton S.A., Eberwine J., Kim J.. Comprehensive catalog of dendritically localized mRNA isoforms from sub-cellular sequencing of single mouse neurons. BMC Biol. 2019; 17:5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Howe K.L., Achuthan P., Allen J., Allen J., Alvarez-Jarreta J., Amode M.R., Armean I.M., Azov A.G., Bennett R., Bhai J.et al.. Ensembl 2021. Nucleic Acids Res. 2021; 49:D884–D891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Fabregat A., Sidiropoulos K., Viteri G., Forner O., Marin-Garcia P., Arnau V., D’Eustachio P., Stein L., Hermjakob H.. Reactome pathway analysis: a high-performance in-memory approach. BMC Bioinf. 2017; 18:142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. The Gene Ontology Consortium The gene ontology Resource: 20 years and still going strong. Nucleic Acids Res. 2019; 47:D330–D338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. König J., Zarnack K., Rot G., Curk T., Kayikci M., Zupan B., Turner D.J., Luscombe N.M., Ule J.. iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution. Nat. Struct. Mol. Biol. 2010; 17:909–915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Chakrabarti A.M., Haberman N., Praznik A., Luscombe N.M., Ule J.. Data science issues in studying protein–RNA interactions with CLIP technologies. Annu Rev Biomed. Data Sci. 2018; 1:235–261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Tretiakova A., Gallia G.L., Shcherbik N., Jameson B., Johnson E.M., Amini S., Khalili K.. Association of Puralpha with rnas homologous to 7 SL determines its binding ability to the myelin basic protein promoter DNA sequence. J. Biol. Chem. 1998; 273:22241–22247. [DOI] [PubMed] [Google Scholar]
  • 59. West J.A., Davis C.P., Sunwoo H., Simon M.D., Sadreyev R.I., Wang P.I., Tolstorukov M.Y., Kingston R.E.. The long noncoding rnas NEAT1 and MALAT1 bind active chromatin sites. Mol. Cell. 2014; 55:791–802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Darbinian N., Gallia G.L., Khalili K.. Helix-destabilizing properties of the human single-stranded DNA- and RNA-binding protein puralpha. J. Cell. Biochem. 2001; 80:589–595. [PubMed] [Google Scholar]
  • 61. Sugimoto Y., König J., Hussain S., Zupan B., Curk T., Frye M., Ule J.. Analysis of CLIP and iCLIP methods for nucleotide-resolution studies of protein-RNA interactions. Genome Biol. 2012; 13:R67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Cubelos B., Sebastian-Serrano A., Beccari L., Calcagnotto M.E., Cisneros E., Kim S., Dopazo A., Alvarez-Dolado M., Redondo J.M., Bovolenta P.et al.. Cux1 and Cux2 regulate dendritic branching, spine morphology, and synapses of the upper layer neurons of the cortex. Neuron. 2010; 66:523–535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Escobar D.J., Desai R., Ishiyama N., Folmsbee S.S., Novak M.N., Flozak A.S., Daugherty R.L., Mo R., Nanavati D., Sarpal R.et al.. alpha-Catenin phosphorylation promotes intercellular adhesion through a dual-kinase mechanism. J. Cell Sci. 2015; 128:1150–1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Swinnen B., Bento-Abreu A., Gendron T.F., Boeynaems S., Bogaert E., Nuyts R., Timmers M., Scheveneels W., Hersmus N., Wang J.et al.. A zebrafish model for C9orf72 ALS reveals RNA toxicity as a pathogenic mechanism. Acta Neuropathol. 2018; 135:427–443. [DOI] [PubMed] [Google Scholar]
  • 65. Pandey P.R., Yang J.H., Tsitsipatis D., Panda A.C., Noh J.H., Kim K.M., Munk R., Nicholson T., Hanniford D., Argibay D.et al.. circSamd4 represses myogenic transcriptional activity of PUR proteins. Nucleic Acids Res. 2020; 48:3789–3805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Song C., Zhang Y., Huang W., Shi J., Huang Q., Jiang M., Qiu Y., Wang T., Chen H., Wang H.. Circular RNA Cwc27 contributes to Alzheimer's disease pathogenesis by repressing pur-alpha activity. Cell Death Differ. 2021; 29:393–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Haas S., Gordon J., Khalili K.. A developmentally regulated DNA-binding protein from mouse brain stimulates myelin basic protein gene expression. Mol. Cell. Biol. 1993; 13:3103–3112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Haas S., Thatikunta P., Steplewski A., Johnson E.M., Khalili K., Amini S.. A 39-kD DNA-binding protein from mouse brain stimulates transcription of myelin basic protein gene in oligodendrocytic cells. J. Cell Biol. 1995; 130:1171–1179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Singh R., Valcárcel J.. Building specificity with nonspecific RNA-binding proteins. Nat. Struct. Mol. Biol. 2005; 12:645–653. [DOI] [PubMed] [Google Scholar]
  • 70. Jones S.A., Jenkins B.J.. Recent insights into targeting the IL-6 cytokine family in inflammatory diseases and cancer. Nat. Rev. Immunol. 2018; 18:773–789. [DOI] [PubMed] [Google Scholar]
  • 71. Bar-Yosef T., Damri O., Agam G.. Dual role of autophagy in diseases of the central nervous system. Front. Cell Neurosci. 2019; 13:196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Sun Y., Gao J., Jing Z., Zhao Y., Sun Y., Zhao X.. PURalpha promotes the transcriptional activation of PCK2 in oesophageal squamous cell carcinoma cells. Genes (Basel). 2020; 11:1301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Ortiz-Gonzalez X.R. Mitochondrial dysfunction: a common denominator in neurodevelopmental disorders?. Dev. Neurosci. 2021; 43:222–229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Youn J.Y., Dyakov B.J.A., Zhang J., Knight J.D.R., Vernon R.M., Forman-Kay J.D., Gingras A.C.. Properties of stress granule and P-Body proteomes. Mol. Cell. 2019; 76:286–294. [DOI] [PubMed] [Google Scholar]
  • 75. Balak C., Benard M., Schaefer E., Iqbal S., Ramsey K., Ernoult-Lange M., Mattioli F., Llaci L., Geoffroy V., Courel M.et al.. Rare de novo missense variants in RNA helicase DDX6 cause intellectual disability and dysmorphic features and lead to P-Body defects and RNA dysregulation. Am. J. Hum. Genet. 2019; 105:509–525. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Sheth U., Parker R.. Decapping and decay of messenger RNA occur in cytoplasmic processing bodies. Science. 2003; 300:805–808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Riggs C.L., Kedersha N., Ivanov P., Anderson P.. Mammalian stress granules and P bodies at a glance. J. Cell Sci. 2020; 133:jcs242487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Kedersha N., Stoecklin G., Ayodele M., Yacono P., Lykke-Andersen J., Fritzler M.J., Scheuner D., Kaufman R.J., Golan D.E., Anderson P.. Stress granules and processing bodies are dynamically linked sites of mRNP remodeling. J. Cell Biol. 2005; 169:871–884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Perez-Riverol Y., Csordas A., Bai J., Bernal-Llinares M., Hewapathirana S., Kundu D.J., Inuganti A., Griss J., Mayer G., Eisenacher M.et al.. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res. 2019; 47:D442–D450. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkac1237_Supplemental_Files

Data Availability Statement

Uncropped images are available in Supplementary Figure S12. All sequencing data is available in the Gene Expression Omnibus (GEO) under the SuperSeries accession number GSE193905. The collection includes the RNA-seq data from PURA KD and CTRL HeLa cells (GSE193900) as well as the iCLIP data for endogenous PURA (anti-PURA12D11 antibody) in HeLa cells (GSE193901) and NPC (GSE193902) as well as for overexpressed FLAG-PURA in HeLa cells precipitated with anti-PURA12D11 (GSE193904) and anti-FLAG antibody (GSE193903). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (79) partner repository with the dataset identifier PXD030266. Scripts used to process the files are accessible under the Zenodo repository located at: https://doi.org/10.5281/zenodo.7273088.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES