Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2026 Apr 7;45(4):117090. doi: 10.1016/j.celrep.2026.117090

Incorporation of genome-bound cellular proteins into HIV-1 particles regulates viral infection

Manuel Garcia-Moreno 1, Azman Embarc-Buh 2,8, Robin Truman 1,8, Marko Noerenberg 2, Louisa Iselin 2,3, Honglin Chen 1, Caroline E Lenz 1, Jeffrey Y Lee 1,5, Kate Dicker 1,2,4, Snehith Dyavari Shetty 2, Natasha Palmalux 2, Quan Gu 2, Thibault JM Sohier 6, Aino I Järvelin 1, Wael Kamel 2, Vincenzo Ruscica 2, Emiliano P Ricci 6, Ilan Davis 1,5, Shabaz Mohammed 1,2,7, Alfredo Castello 1,2,9,
PMCID: PMC13124994  PMID: 41955099

Summary

The initial steps of the human immunodeficiency virus type 1 (HIV-1) life cycle are regulated by cellular RNA-binding proteins, but only a few have been identified. Here, we developed in virion RNA interactome capture (ivRIC) to comprehensively profile the direct protein interactors of the HIV-1 genomic (g)RNA inside the viral particles. We identified 104 cellular RNA-binding proteins in virions (ivRBPs), many of which are nuclear. We determined the interactome of the viral RBP Rev and discovered that nuclear ivRBPs may associate gRNA in the nucleus and continue bound after the genesis of the viral particles. We also observed that ivRBPs are not incorporated into viral particles based on their abundance, but likely through selective mechanisms. Moreover, we show that the ivRBPs PURA and its homolog PURB control HIV-1 particle infectivity and engage with several viral proteins and key elements within HIV-1 gRNA, showcasing the importance of ivRBPs for HIV-1 infection.

Keywords: HIV, RNA, RNA-binding protein, virus, virion, viral particle, Rev, RNA-interactome capture, PURA

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • ivRIC elucidates the composition of viral ribonucleoproteins inside virions

  • 104 cellular ivRBPs bind to HIV-1 genomic (g)RNA within virions

  • ivRBPs are selectively incorporated into virions

  • PURA interacts with HIV-1 gRNA and promotes reverse transcription


Does HIV-1 package host proteins that promote infection? Using in virion RNA interactome capture, Garcia-Moreno et al. mapped human proteins bound to the viral genome inside virions. They identified over 100 host proteins that are likely packaged selectively. One of them, PURA, supports reverse transcription and enables efficient early infection.

Introduction

HIV-1 genomic (g)RNA is reverse transcribed into DNA and integrated into the host chromosome. Cellular RNA-binding proteins (RBPs) have been extensively connected to the following steps of the viral life cycle, which starts with the transcription of viral RNAs by the host RNA polymerase II and its co-factors and continues with their capping, splicing, and polyadenylation.1 HIV-1 produces several viral RNA species that can be divided into single spliced, fully spliced, and unspliced (i.e., genome) RNAs.2 Fully spliced HIV-1 RNAs are transported to the cytoplasm using the canonical mRNA export pathways.3 However, unspliced and partially spliced RNAs must interact with the viral RBP Rev to hijack CRM1 and avoid nuclear retention.2,4 Viral RNA stability and decay, translation, and RNA transport also involve cellular RBPs and complexes, including the ribosome.1,2,5,6,7,8,9,10 Recent reports have suggested roles of cellular RBPs in viral particle formation and posterior steps such as reverse transcription and integration. However, the scope and relevance of these host-virus interactions remain poorly characterized.9

It was thought that HIV-1 particles disassemble upon cell entry, releasing their (g)RNA molecules into the cytoplasm. However, recent paradigm-shifting advances challenged this view, showing that the capsid core can remain intact during its transit throughout the cell and can be visualized in the nuclear pore and nucleoplasm.11,12,13,14 Moreover, it is now accepted that reverse transcription can occur inside the capsid core.11,15,16 Several cellular RBPs have been proposed to regulate HIV-1 particle assembly and reverse transcription.9 However, the capsid-confined reverse transcription model implies that host proteins involved in this process must be present inside the virion. Proteomic analyses of purified HIV-1 particles revealed the presence of hundreds of RBPs within virions.9,17,18,19,20,21 While exciting, these proteomic analyses were affected by biological and technical limitations, including (1) the uptake of a portion of the cytosol by budding particles, leading to the passive acquisition of bystander proteins; (2) the presence of extracellular vesicles with similar sizes to virions; and (3) the lack of appropriate negative controls and/or quantitative information.17,18,19,20,21 Therefore, progress in elucidating the scope of host RBPs incorporated into virions requires the development of new strategies to differentiate between passive bystanders and proteins actively engaging with the gRNA.

Here, we applied a new approach to determine the complement of proteins that interact with the HIV-1 gRNA inside the viral particles called in virion RNA interactome capture (ivRIC). We discover that the in virion packaged genomic ribonucleoprotein (ivRNP) contains over 100 cellular proteins, many of which are nuclear. We show that many of these nuclear RBPs are also interactors of the viral RBP Rev, suggesting that they associate with the viral RNP during its nuclear life. Moreover, we found that the components of the ivRNPs play important regulatory roles in HIV-1 infection. Particularly, PURA and PURB bind to critical regulatory elements on the HIV-1 gRNA and the viral reverse transcriptase (RT), aiding viral particle infectivity probably through the regulation of reverse transcription. Our study thus provides a new landscape of host-HIV interactions with regulatory potential.

Results

ivRIC, a new approach to analyze the composition of the viral RNPs assembled into virions

Proteomic analysis of purified viral particles revealed that hundreds of cellular proteins are incorporated into virions, including cellular RBPs.9 However, we noticed sparse overlapping between datasets (Figure S1A).17,18,19,20,21 This limited consistency is probably due to the impossibility to discriminate between bystander proteins captured passively during virion assembly and RBPs actively interacting with the HIV-1 gRNA. To identify RBPs directly bound to gRNA in HIV-1 particles, we developed ivRIC. In brief, viral particles are purified in a sucrose cushion, followed by protein-RNA “zero distance” ultraviolet (UV) cross-linking, lysis under denaturing conditions, and isolation of the polyadenylated HIV-1 gRNA with oligo(dT) magnetic beads (Figure 1A). RBPs are released by RNase treatment and identified by quantitative proteomics. ivRIC was applied to infectious HIV-1mCherry-Nef particles purified from the supernatant of transfected HEK293T (Figures S1B, S1D, and S1E) or infected CD4+ T lymphocytic cells (SupT1; Figures 1B and S1C). The purification of viral particles from the supernatant of CD4+ T lymphocytic cells yielded strong enrichment of the viral capsid (CA-p24) relative to the whole-cell proteome (Figure 1B). In contrast, abundant cellular proteins such as albumin and β-actin were strongly depleted (Figures 1B and S1E). Moreover, extracellular vesicle markers (CD45, CD43 [also known as SPN], and SERINC3) were absent or markedly reduced compared with whole-cell lysates (Figure S1B). Trace SERINC3 signal was detected in purified samples but at comparable levels in supernatants from mock- and HIV-1-infected cells; we therefore included mock purifications to control for any potential carryover (Figures 1B and S1F).

Figure 1.

Figure 1

In virion RNA interactome capture (ivRIC) reveals the composition of the genomic ribonucleoprotein packaged within HIV-1 virions (ivRNP)

(A) Schematic representation of the ivRIC protocol. Samples and controls are depicted in Figure S1.

(B) Western blotting of the viral particles before and after sucrose cushion purification using antibodies against markers of extracellular vesicles (CD45, CD43, and SERINC3) and viral proteins; representative gel of n = 3.

(C) Western blotting of the whole-cell lysates (WCLs), inputs, and eluates of a representative ivRIC experiment performed in CD4+ T-lymphocytic (SupT1) cells. Asterisk marks unspecific bands; representative gel of n = 3.

(D) Schematic representation of the proteolysis of the Gag polyprotein. MA, matrix; CA, capsid; NC, nucleocapsid.

(E) Relative proportion of the HIV-1 gRNA in ivRIC eluates estimated by absolute RT-qPCR; n = 3; the line within the box represents the mean and error bars ± standard deviation (SD).

(F) Analysis of oligo(dT) isolated RNA from sucrose purified HIV-1mCherry-Nef particles by RNA-sequencing; n = 3.

(G and H) Volcano plots of the quantitative proteomic analysis of the ivRIC eluates from virions produced by SupT1 cells; n = 4. Red and dark blue dots are proteins enriched with 1% FDR, while orange and cyan dots are proteins enriched with 10% FDR. Gray dots are non-enriched proteins. E, eluate.

(I) Venn diagram showing the overlapping between the UV irradiated vs. non-irradiated samples and HIV-1-infected versus mock-infected comparisons.

(J) Bar plots showing the proportion of ivRBPs in the ivRNP annotated with RNA- and DNA-binding (GO terms); virus-related (GO terms), and HIV-1-related (text-mining) functions.

Related to Figure S1 and S2.

Despite its high abundance in virions (∼2k copies/virion) and proximity to the viral RNA, the processed HIV-1 capsid (CA-p24) is depleted in eluates of the oligo(dT) capture, while the polyprotein Gag-p55 is enriched in a UV-dependent manner (Figures 1C and S1D). These striking results agree with our knowledge of the HIV-1 particle, where the CA-p24 forms a shell around the viral RNP, while the nucleocapsid (NC-p11) interacts with the viral RNA. CA-p24 can associate with viral RNA in immature particles as part of the polyprotein Gag-p55 because of the RNA-binding activity of NC-p11 (Figure 1D). In addition, the integrase (IN-p31) was also enriched in ivRIC eluates (Figure 1C), aligning with its reported RNA-binding activity.22 Conversely, the glycoprotein gp120 was absent in eluates, consistently with its location in the viral envelope (Figure 1C).

The success of ivRIC depends on the specificity and efficiency by which HIV-1 gRNA is isolated. Notably, HIV-1 gRNA represented ∼98% of the RNA detected in eluates (Figures 1E and 1F), implying that it is the major (if not the sole) contributor to the following proteomic results. These results were obtained by two orthogonal methods (absolute quantitative reverse transcription PCR [RT-qPCR] quantification and RNA sequencing) and are in agreement with studies corroborating that the population of cellular mRNA in viral particles is very small.23,24,25 Our RNA analysis also supports the efficiency of our purification protocol at removing potential sources of contamination such as extracellular vesicles and cellular debris.

To identify the complement of cellular RBPs bound to the gRNA in viral particles, we analyzed the particles collected from CD4+ T cells by ivRIC in conjunction with quantitative proteomics (Figures S1F, S1G, and S2A). A total of 147 proteins were significantly enriched in eluates from UV-irradiated HIV-1-infected samples compared with either non-irradiated or mock-infected controls (Figures 1G–1I). Of these, 104 cellular and 9 viral proteins were consistently enriched relative to both controls (Figures 1G–1I; Table S1), which we designate as in-virion RNA-binding proteins (ivRBPs). We next validated our results by western blot, showing that the previously established ivRBP MOV1026,27 was strongly enriched in ivRIC eluates in a UV and infection-dependent manner (Figure 1C). Additionally, the newly discovered ivRBPs PURA and SUB1 were also detected in ivRIC eluates by western blotting (Figures 1C and S2B). As anticipated from a bona fide set of RBPs, most proteins enriched in ivRIC eluates were annotated with the “RNA binding” GO term and functions associated with RNA metabolism (Figures 1J, 2A, and S2). Interestingly, ∼25% of the ivRBPs also contain DNA-binding activity (Figures 1J and 2A), which is potentially significant given the RNA/DNA duality of HIV-1. Over half of the discovered ivRBPs are annotated with virus-related GO terms, and ∼25% of them are also linked to HIV-1 (Figure 1J), aligning with roles in infection. Notably, eight ivRBPs are involved in HIV-1 particle formation (PACSIN2, LCK, and PDCD6IP [also ALIX])28,29,30 or infectivity (MSN, UPF1, MOV10, DHX9, and EEF1A).26,27,31,32,33,34,35 Our data thus revealed that the viral genome engages with over 100 cellular RBPs within virions.

Figure 2.

Figure 2

Analysis of the properties of ivRBPs

(A) Analysis of the GO terms enriched in the ivRNP over the cellular proteome.

(B and C) Volcano plots of the comparison between ivRIC inputs (I, purified viral particles) from HIV-1 infected and mock-infected cells (B) and ivRIC inputs versus eluates (E, ivRNP) (C); n = 4. Red and dark blue dots are proteins enriched with 1% FDR, while orange and cyan dots are proteins enriched with 10% FDR. Gray dots are non-enriched proteins.

(D and E) Scatterplots showing the intensity of proteins in ivRIC inputs (D) or eluates (E) against the whole-cell proteome (WCP).

(F) Scatterplot showing the protein intensity in the WCP (y axis) and the Log2 fold change in the ivRIC eluates from mock- and HIV-1-infected samples. Color code as in (B) and (C).

(G) Density plots showing the distribution of fold changes in UV irradiated infected and non-infected samples over non-irradiated HIV-1-infected samples. Cellular proteins and viral proteins are shown in gray and purple, respectively.

Related to Figure S3.

ivRBPs are incorporated selectively into viral particles

Our next goal was to determine whether ivRBPs are incorporated into HIV-1 particles passively, based on their abundance, or selectively. For this, it was critical to characterize the proteome of the producer cells and purified particles (i.e., inputs of the ivRIC experiment). We identified 187 proteins enriched in the HIV-1-infected input samples over the mock controls, representing the viral particle proteome (Figures 2B, S3A, and S3B; Table S2). This dataset was additionally enriched in GO terms and pathways related to the plasma membrane and immunological receptors when compared to the ivRNP, reflecting the presence of plasma membrane-derived envelope in full viral particles (Figures S3C–S3E). We next calculated the protein intensity ratios between the inputs (total viral particle) and eluates (ivRNP), revealing two groups of proteins (Figure 2C). ivRBPs with high eluate/input ratios reflecting high cross-link-ability, typically supported by geometrically optimal protein-RNA interfaces and the presence of UV-favored amino acids and nucleotides. Additionally, ivRBPs with low ratios likely reflect transient, low-occupancy interactions (low bound/unbound) and/or suboptimal amino acid/nucleotide composition and spatial arrangement.

To discriminate between abundance-driven passive incorporation and selective packaging of proteins into virions, we normalized the protein intensities in ivRIC input (viral particle proteome) and eluates (ivRNP) against those in the whole CD4+ T cell proteome (WCP). Interestingly, the intensity of proteins in viral particles correlated well with protein abundance (R = 0.34; Figure 2D), while the ivRNP did so to a lower extent (R = 0.15; Figure 2E). When considering fold changes instead of raw intensities, the result was even more stark, with ivRBPs displaying anticorrelation with the WCP (Figure 2F and S2D).

A remaining question is whether ivRBPs are present in biologically meaningful quantities within virions. To test this, we compared the protein intensity and fold change distribution in eluates of ivRIC from HIV-1-infected and mock samples against the non-irradiated samples (i.e., no RNA-dependent purification).Viral proteins are present in both comparisons because they are detected, even if with low intensity, in the non-irradiated HIV-1-infected samples (Figure 2G). Strikingly, many cellular ivRBPs exhibited similar intensity levels and fold changes to HIV-1 proteins, which have high stoichiometry in viral particles (Figure 2G, gray vs. purple). These proteins however, had very small fold changes in the comparison to mock samples, suggesting that HIV-1 infection is an essential determinant of their high abundance in the RNPs isolated from the culture media. Several known regulators of HIV-1 particle infectivity were among the most prevalent ivRBPs in the gRNP, including UPF134 and MOV10.36 All together, our data suggest that many ivRBPs are incorporated into the virion through selective mechanisms.

The HIV-1 ivRNP heavily overlaps with the Rev protein-protein interactome

It was surprising to find that many ivRBPs were nuclear, despite virion assembly occurring at the plasma membrane (Figure 2A). To test if nuclear ivRBPs associate with the gRNA during its nuclear life, we analyzed the interactome of the viral RBP Rev as a proxy for the nuclear gRNP. There have been several attempts at establishing the Rev interactome.37,38,39 However, resulting datasets had a very poor overlap due to technical limitations (Figure S4A).2,37,38,39 To study the Rev interactome in infected CD4+ T lymphocytic cells and in the context of HIV-1 infection, we generated a chimeric HIV-1 replicon that expressed Rev fused to Flag-Myc (HIV-1-R-E-Rev-Flag-Myc) or HaLo tag (HIV-1-R-E-Rev-HaLo) (Figure 3A and S4B). Infection of SupT1 cells with pseudotyped HIV-1 particles led to normal levels of Gag-p55, processed CA-p24, and mCherry-Nef, when compared to the parental HIV-1-R-E-mCherry-Nef (Figures 3B and 3C). By contrast, nullification of Rev by adding a stop codon (HIV-1-R-E-ΔRev) led to undetectable levels of Gag-p55 (Figure 3D). Single molecule in situ RNA hybridization (smFISH) of cells infected with HIV-1-R-E-Rev-HaLo revealed a normal cytoplasmic accumulation of HIV-1 gRNA when compared to the strictly nuclear distribution observed in HIV-1-R-E-ΔRev (Figure S4C, upper panels). Rev exhibits a nucleolar localization when overexpressed in non-infected cells.40 Conversely, expression of Rev from the viral genome led to additional localization in nuclear pores and nucleoplasm (Figure S4C, bottom panels), which is compatible with its known role in nuclear export of HIV-1 RNAs.2 These additional localizations support the establishment of native protein-protein interactions involved in viral RNA localization and metabolism.

Figure 3.

Figure 3

The Rev protein-protein interactome and its similarities with the ivRNP

(A) Schematic representation of the HIV-1R-E-Rev-Flag-Myc replicon.

(B–D) Western blotting showing viral protein expression in CD4+ T lymphocytic cells (SupT1) infected with HIV-1R-E-Rev-Flag-Myc (B), HIV-1R-E-Rev-Halo (C), or HIV-1R-E-ΔRev (D) relative to the parental HIV-1R-E-mCherry-Nef. hpi, hours post-infection; representative gels of n = 3.

(E) Schematic representation of the Rev protein-protein interaction experiment, using chimeric pseudotyped viruses to infect CD4+ T lymphocytic cells (SupT1).

(F) Western blot of inputs and eluates of Rev IP using Flag and p24 antibodies.

(G)Volcano plot of the Rev-Flag-Myc immunoprecipitation (IP) versus the untagged control (Rev WT); n = 3. Red and dark blue dots are proteins enriched with 1% FDR, while orange and cyan dots are proteins enriched with 10% FDR. Gray dots are non-enriched proteins.

(H) Bar plots showing the proportion of Rev interactors annotated with RNA- and DNA-binding (GO terms); virus-related (GO terms), and HIV-1-related (text-mining) functions.

(I) Proportion of novel and previously reported Rev interactors in the Rev interactome.

(J) Pie chart showing the proportion of proteins classified as novel Rev interactors and previous annotations in HIV-1 NCBI database.

(K) Venn diagram showing the overlapping between the ivRIC (ivRNP) and the Rev IP (Rev interactome) experiments.

(L) Scatterplots comparing fold changes in the Rev interactome and the ivRIC experiment.

Related to Figures S4, S5 and S6.

To elucidate the Rev interactome, we infected SupT1 cells with the pseudotyped HIV-1-R-E-Rev-Flag-Myc, followed by Flag immunoprecipitation (IP) at 48 h post-infection (hpi) (Figure 3E). The IP was performed in presence of RNases to minimize the incidence of RNA-bridged interactions. While RNases will reduce the incidence of RNA-dependent interactions, the IP was performed in native conditions, which is expected to co-purify Rev direct binders as well as proteins interacting with it indirectly (e.g., through a complex protein-protein interaction network). Proteomic analysis revealed high sample correlation for Rev-Flag IP eluates, with Rev being the most enriched protein (Figures 3F, 3G, and S4D–S4F; Table S3). We identified 13 peptides mapping unambiguously to Rev across its protein sequence, confirming its expression and sequence identity. Two hundred eighty-four cellular proteins were significantly enriched in Rev IP eluates. Eighty-one percent of the interactors were RBPs themselves (Figure 3H), reflecting Rev’s prominent role in viral RNA metabolism. We noticed the presence of many proteins from the transcriptional apparatus, spliceosome, and ribosome in the IP eluates (Figure S5A and S5B), which is consistent with the observed nucleoplasmatic and nucleolar location of Rev. The previously established Rev interactomes showed very poor overlap2; however, 43% of the proteins identified here were also reported in these earlier studies (Figure 3I). Our results thus reconcile the contradictions in these datasets, while still adding many additional Rev interactors. We also observed a strong overlap between our Rev interactome and the annotation in the NCBI HIV database, with 25% being annotated as Rev interactors and a further 44% previously linked to HIV-1 in some capacity (Figure 3J).

To determine to what extent the ivRNP and nuclear gRNP are similar, we compared the ivRIC and Rev IP results. Surprisingly, 68% of the components of the ivRNP are also interactors of Rev, displaying similar fold changes in both datasets that suggest analogous stoichiometry (R > 0.61; Figures 3K–3L). Proteins shared between these two datasets are mostly involved in RNA transcription, splicing, nuclear export, and translation (Figure S5C). By contrast, proteins only present in the ivRNP are associated mainly with membrane biology, including membrane trafficking, endocytosis, and vesicle transport. Our results are thus compatible with two populations of ivRBPs: one that associates with the gRNA in the nucleus and remains associated with it, and another that engages with the gRNA later at the plasma membrane. Interestingly, we also observed that the previously established Gag41 and Staufen42 interactomes (proxies for the cytoplasmic gRNPs) overlap to some extent with the ivRNP and the Rev interactome (Figure S6A). Moreover, we found a moderate overlap with intracellular HIV-1 RNA interactomes (Figure S6B).10,43

PURA and PURB are regulators of HIV-1 particle infectivity

To assess if ivRBPs are functionally relevant, we generated knockouts (KOs) in CD4+ T lymphocytic cells (SupT1) for the transcriptional activator protein Pur-alpha (PURA), its homolog Pur-beta (PURB) and FAM120A, which were identified by both ivRIC and Rev protein-protein interaction analyses. We also selected moesin (MSN) that is a membrane-associated protein and was only present in the ivRNP. No major effects in cell viability, proliferation, or cytotoxicity were detected in KO cells (Figures S7A–S7D).

Infection of CD4+ T lymphocytic KO cells with HIV-1mCherry-Nef infection led in all cases, except for FAM120A, to a reduction in the number of mCherry-positive cells (Figures 4A, 4B, and S7E), suggesting that PURA, PURB, and MSN are important for viral gene expression and/or spread in CD4+ T lymphocytic cells. The effects of PURA and PURB on HIV-1 gene expression were also observed by western blot (Figures 4C, S7F, and S7G). Previous work linked PURA to HIV-1 transcription, but this work was mostly done by protein overexpression and using reporter genes.44,45 We applied smFISH to PURA KO HEK293 cells infected with HIV-1R-E-Gag-mCherry and observed an overall reduction in HIV-1 RNA molecules, in both the nucleus and cytoplasm, that correlates with a reduction in Gag-derived mCherry signal (Figures S7H and S7I). Strikingly, HIV-1 nuclear transcription foci had similar fluorescence intensity in WT and KO cells, but their number exhibited a non-significant but substantial decrease in PURA KO cells (Figure S7I). This suggests that PURA’s roles in HIV-1 infected cells involves transcription activation and not elongation, in agreement with previous results.44 The concurrence of our results with the known function of PURA in HIV-1 cells reinforces the validity of our cell lines.

Figure 4.

Figure 4

Effect of ivRBPs in viral gene expression and virion production and infectivity

(A) Representation of the experimental design.

(B) Flow cytometry analysis of WT and KO SupT1 cells infected with HIV-1mCherry-Nef for 48 h. y axis show the percentage of KO cells expressing mCherry as compared to WT cells (dashed line).

(C) Western blotting analysis of WT, PURA, and PURB KO SupT1 cells infected with HIV-1mCherry-Nef for 24 h; a representative gel of n = 3.

(D) RT-qPCR analysis of the HIV-1 gRNA in the supernatant of WT- and KO-infected SupT1 cells from (B).

(E) Viral particles purified in (D) from KO and WT cells were used to infect WT SupT1 cells upon normalization by gRNA levels. mCherry positive cells were quantified by flow cytometry at 48 hpi.

For (B), (D), and (E), n = 4; p < 0.05; ∗∗p < 0.01; ∗∗∗p < 0.001. (B), (D), and (E) show box-and-whisker plots, in which the boxes represent the interquartile range (Q1–Q3) with the median indicated, and the whiskers denote the minimum and maximum values.

(F) Flow cytometry analysis of HEK293 WT cells infected at MOI 1 with single round HIV-1R-E-mCherry-Nef produced from HEK293 Flp-In T-Rex WT or PURA KO cells; n = 3.

(G) Analysis of early, intermediate, and late reverse transcription (RT) products in HEK293 Flp-In T-Rex WT cells infected with normalized single round HIV-1R-EmCherry-Nef particles produced in HEK293 Flp-In T-Rex WT or PURA KO cells; n ≥ 3.

(H) Analysis of early, intermediate, and late HIV-1 RT products in HEK293 Flp-In T-Rex WT or PURA KO cells infected with normalized single round HIV-1R-E-mCherry-Nef particles produced in WT cells; n = 4.

(I) Flow cytometry analysis of HEK293 Flp-In T-Rex WT cells infected with normalized single round HIV-1R-E-mCherry-Nef particles produced in Huh-7 WT or PURA KO cells; n = 3.

(J) Analysis of early, intermediate, and late RT products at 8 hpi in HEK293 Flp-In T-Rex WT cells infected with HIV-1R-E-mCherry-Nef produced in Huh-7 WT or PURA KO cells; n ≥ 4.

(F–J) Data is mean ± standard error of the mean (SEM), including individual data points.

For all panels, ∗p < 0.05, ∗∗p < 0.01.

Related to Figures S7 and S8.

To determine if viral particle production is affected by the absence of PURA, PURB, MSN, or FAM120A, we collected the supernatant of the infected SupT1 KO cells and quantified it by RT-qPCR (Figures 4A and 4D). We observed a remarkable decrease of HIV-1 gRNA in the supernatant of KO cells, which correlated well with the reduced infection observed by flow cytometry (Figures 4B vs. 4D) as well as the transcription activation defects observed by smFISH (Figure S7I). These results confirmed the importance of PURA, PURB, and MSN in the HIV-1 life cycle by an orthogonal approach.

We next tested if ivRBPs are important for HIV-1 infectivity by infecting WT cells with the same number of viral particles produced in WT or KO cells (Figure 4A). Viruses lacking MSN or FAM120A exhibited nearly no differences in infectivity compared to a virus produced in WT cells (Figure 4E). By contrast, viruses lacking PURA and PURB produced a substantially lower number of HIV-1-positive WT cells (Figure 4E). All together, our results suggest that ivRBPs can act at several stages of the HIV-1 life cycle, with PURA and PURB having roles in HIV-1 viral transcription and viral particle infectivity.

To validate these results, we generated HIV-1R-E-mcherry-Nef viruses in HEK293 WT or PURA KO cells and, after normalization by CA-p24 abundance, we infected HEK293 WT cells. In agreement with the experiments in T CD4+ lymphocytic cells, the number of mCherry-positive cells was significantly lower with viruses produced in PURA KO cells than in WT cells (Figure 4F). We collected samples at various times post-infection and analyzed by PCR the presence of early, intermediate, and late HIV-1 reverse transcription products. Strikingly, we observed a significant reduction of the three types of reverse transcription products when infecting with viruses produced in PURA KO cells compared to those produced in WT cells (Figure 4G). To evaluate the requirement of PURA inside HIV-1 particles, we infected PURA KO cells with viruses produced in WT cells (i.e., containing PURA inside virions). Notably, we observed no differences in reverse transcription, highlighting that the defects depend upon the presence of PURA inside the virions (Figure 4H). Infectivity and reverse transcription defects also occurred with HIV-1R-E-mcherry-Nef generated in Huh-7 PURA KO cells (Figures 4I and 4J), demonstrating reproducibility across three cell lines (SupT1, HEK293 and Huh7).

PURA and PURB interact with HIV-1 proteins and gRNA

Our next step was to determine the scope of cellular and viral proteins that associate with PURA/B. To do so, we generated lymphocytic Jurkat Flp-In T-REx cells able to express PURA or PURB fused to EGFP in a doxycycline-dependent manner (Tet-on) (Figures S7J and S7K). IP of PURA-EGFP and PURB-EGFP with GFP-Trap revealed a specific enrichment of both baits (Figures 5A and S8A–S8C). Proteomic analysis revealed hundreds of proteins, including many ivRBPs, co-precipitating with PURA-EGFP and PURB-EGFP (Figures 5A and S8C; Table S4). The interactome of the two proteins largely overlapped and exhibited high protein intensity correlation, implying similar protein composition and stoichiometries (Figures 5B and S8D). PURB was the protein with highest fold change in PURA-EGFP IP and vice versa (Figures 5A and S8C), indicating that both proteins associate in heteromeric complexes. Both PUR proteins interact with numerous components of the splicing, nuclear RNA export, and translation machinery (Figure S8E), suggesting their participation at multiple steps of the RNA life cycle. Many proteins exhibited differential association with PURA and PURB upon HIV-1 infection (Figures 5C, S8C, and S8F), including viral proteins MA, IN, and the RNaseH-p15 that were present in both interactomes. These interactions support a potential involvement of PURA/B in processes that dictate the ability of HIV-1 virions at establishing infection, including reverse transcription (Figure 4G) and/or integration.

Figure 5.

Figure 5

Analysis of the interactions that PURA and PURB establish in CD4+ lymphocytic cells

(A) Volcano plot showing the proteins enriched in PURA-EGFP IP over the EGFP control IP in Jurkat Flp-In T-REx cells expressing these proteins. Dots represent proteins enriched with 1% FDR (red and dark blue) or with 10% FDR (orange and cyan). Gray dots are non-enriched proteins; n = 3.

(B) Venn diagram comparing the proteins that are enriched in both PURA-EGFP and PURB-EGFP IPs.

(C) Volcano plot showing the proteins enriched in PURA-EGFP IP in mock versus HIV-1 infected cells. Color code as in (A).

(D) iCLIP2 (n = 3) was applied to Jurkat Flp-In T-REx expressing PURA-EGFP or PURB-EGFP and infected with HIV-1R-E-mCherry-Nef or mock-infected. Density plot showing the distribution of binding sites across 5′ UTRs, CDSs and 3′ UTRs of target RNAs.

(E) As in (D) but for introns.

(F) Motifs enriched in RNA binding sites of PURA including or excluding intronic sequences.

(G) Binding profile of PURA-EGFP and PURB-EGFP on HIV-1 RNA. Significant binding sites (p < 0.01) are indicated with salmon (for PURA) or green (for PURB) boxes underneath the density plot.

(H and I) Binding site density at the Gag-Pol frameshift (H) and TAR (I). The significant (p < 0.01) binding sites are indicated with a gray box, while the AG- and U-rich motifs are indicated with yellow and red boxes, respectively. The secondary structure of these RNA elements was generated using SHAPE data,46 indicating the fold change in the IP over the SMI control at each nucleotide position.

Related to Figures S9 and S10.

To validate the interaction of PURA with HIV-1 gRNA inside HIV-1 virions, we transfected HIV-1R-E-mCherry-Nef in 293 Flp-In T-REx cells expressing unfused EGFP or PURA-EGFP. Resulting particles were purified and subjected to IP with an anti-EGFP nanobody. Presence of HIV-1 gRNA in eluates was determined by RT-qPCR. Importantly, we observed ∼10-fold enrichment in viral gRNA in PURA-EGFP IP eluates over EGFP (Figure S8G), further confirming PURA as ivRBP by an orthogonal method.

To define the position(s) within the gRNA where binding occurs, we applied iCLIP247 to infected and uninfected PURA-EGFP and PURB-EGFP Jurkat cells. Both proteins were strongly enriched in IP eluates, forming detectable protein-RNA complexes with decreased mobility in SDS-PAGE (Figures S9A–S9C). We used size-matched inputs (SMIs) and unfused EGFP as control for the iCLIP2 analysis, and we performed standard quality assessments (Figures S9D–S9E). PURA/B interact predominantly with mRNAs, particularly near the 5′ end and the start and stop codons (Figures 5D and S9F–S9H; Table S5). We also observed sharp binding peaks at the 5′ and 3′ splice sites of introns (Figure 5E). The RNA targets and binding sites of PURA and PURB largely overlap, resulting in clustering in PCA analysis (Figures 5D, 5E, S9F–S9I, and S10A). The extensive overlapping of PURA and PURB binding sites further supports the existence of heteromeric complexes.

We next tested if PURA/B interact with specific sequences using motif enrichment tools, revealing two prominent binding motifs: a purine-rich sequence that is consistent with the well-known binding motif of PURA,48 preceded by a previously undetected U-rich motif (Figures 5F, S10B, and S10C). These motifs strongly resemble the sequences found at splice sites.49 To rule out that these motifs are artifact derived from the proximity of splicing junctions to PURA/B binding sites, we searched again for motifs after removal of all intronic sequences. A single motif combining the U-rich sequence followed by the AG-rich sequence was observed (Figures 5F and S10B), validating PURA/B binding to the observed bi-partite motif.

We next tested if PURA/B interact with viral RNA. Our results revealed that a substantial proportion of the reads (5%–20%) map to the HIV-1 gRNA in infected cells (Figures S10D and S10E). Binding sites distributed across the gRNA overlapping with important regulatory elements, including most canonical splice sites, which contained in most cases the AG-rich or/and U-rich motifs (Figures 5G and S10F–S10K). Similar read density distribution across the splice sites suggests that PURA/B binds primarily to the gRNA. Moreover, a prominent PURA/B binding site preceded the slippery sequence that causes the Gag/Pol frameshift, which contained two repeats of the AG- and U-rich motifs (Figure 5H). However, the Gag/Pol ratio was not affected in PURA and PURB KO cells, suggesting that these proteins do not affect frameshift efficiency (Figure S7G). Early biochemical assays proposed that PURA could interact with the TAR structure in the HIV-1 genome to regulate Tat-dependent transcription.44,45 iCLIP2 did not reveal binding sites proximal to the TAR structure in cellulo (Figure 5I). However, we detected binding to other regions of the 5′ UTR containing important regulatory structures (Figure 5G). Both PURA and PURB displayed nearly identical binding distribution across the gRNA, except for a binding site of PURB nearby the Rev response element (RRE) that was not detected for PURA (Figures 5G and S10E). An additional binding site immediately downstream the RRE that overlaps with several 3′ splice sites was detected for both proteins. Binding to RRE and its proximities is compatible with the co-purification of PURA/B with Rev. All together, these results reveal a widespread binding of PURA/B across the gRNA that is compatible with their high levels in the ivRNP (Figures 1B and 2G).

Discussion

Many approaches have been applied to profile the composition of viral particles.9 However, the fact that virions assemble in the crowded cellular environment is compatible with passive incorporation of cellular proteins with no roles in the viral life cycle. Viral particle proteomes are usually large and exhibit little consistency between studies, which complicates the distinction between incidental bystander proteins and those that are functionally relevant for the viral particles. Our study shows that the total HIV-1 particle proteome (ivRIC input) correlates well with protein abundance in the producer cell. By contrast, ivRIC underscores the proteins that engage with the HIV-1 genome within virions. The reduced correlation of the identified proteins with the proteome of the producer cell supports a model in which the incorporation of genome-bound proteins occur through active mechanisms. Incorporation of these RBPs into virions likely involves specific binding to HIV-1 gRNA or/and proteins. Importantly, several ivRBPs have been implicated in HIV-1 particle formation or infectivity by independent studies,26,27,28,29,30,31,32,33,34,50 supporting the high quality of our results. Strikingly, we observed a pronounced overlap between the ivRNP and the Rev interactome, suggesting a fascinating connection between the nuclear life of the HIV-1 gRNA and the formation of the viral particles. HIV-1 gRNA is produced by the cellular RNA polymerase II, and it is processed in the cell’s nucleus as other cellular mRNAs, including the addition of the cap and poly(A) tail. However, the gRNA is “perceived” by the host cell as immature, as it contains intronic sequences, leading to nuclear retention.2 The viral protein Rev is produced by a multiple spliced RNA that is exported to the cytoplasm by a conventional mRNA pathway. Rev accumulates over time and travels to the nucleus, where it is crucial for the engagement with the CRM1 RNA export pathway for nuclear HIV-1 gRNA export.2 Once in the cytoplasm, the gRNA can be translated or packaged into viral particles. Evidence suggests the existence of two structurally distinct populations of gRNAs, generated by differential transcription initiation, that are distinguished by the number of 5′ terminal guanosines.51,52,53,54,55 gRNAs with a single guanosine are prone to cap sequestration, forming a secondary structure conducive to gRNA dimerization and packaging. Conversely, gRNAs with two or three terminal guanosines exhibit higher translational efficiency due to greater accessibility of the cap structure.51,52,53,54,55 Consequently, virion-packaged gRNAs predominantly contain a single 5′ terminal guanosine and are less likely to be translated. Given that translation factors and ribosomes play a crucial role in remodeling RNPs during translation initiation and elongation,56,57 it is anticipated that lack of translation of virion-packaged gRNAs will lead to retention of nuclear RBPs. This phenomenon could explain the similarity between virion-packaged RNPs and the Rev interactome, reflecting the nuclear history of the gRNA.

In the early 2000s, PURA was linked to Tat-dependent transcription using LTR-controlled plasmids.44 However, these early publications were contested by subsequent studies, as reviewed in detail elsewhere.58 It was suggested that the use of PURA constructs that violated domain boundaries (defined by later resolved crystal structures) could lead to misfolded dysfunctional proteins. Additionally, the experiment conducted made it difficult to separate between transcriptional and post-transcriptional effects of PURA.44,45 Our single-molecule RNA analyses now show that provirus activation is partially inhibited in cells lacking PURA and that a lower number of transcriptionally active proviruses cause a reduction in gRNA. Additionally, we also show that both PURA and PURB interact with transcriptional and post-transcriptional machineries, as well as with key post-transcriptional regulatory elements on the HIV-1 gRNA, suggesting additional roles for PURA beyond HIV-1 transcription, for example in HIV-1 splicing given its binding to splice sites. We also show a new function of PURA and PURB in the viral particles, affecting their infectivity. The ivRBPs DHX9 and EEF1A were proposed to modulate reverse transcription through its interaction with the RT.32,33 PURA and PURB co-precipitate with several components of the Gag-Pol polyprotein, including MA, RT, the RNase H domain (p15) and IN. Our results show that loss of PURA causes an inhibition of reverse transcription perhaps through its interaction with RNaseH-p15 or indirectly through its binding to the gRNA. Our results suggest that the regulatory roles of cellular proteins such as PURA and PURB may span different stages of HIV-1 infection, affecting HIV-1 gene expression and viral particle infectivity.

ivRIC was applied here to HIV-1 viral particles generated in the T CD4+ lymphocytic SupT1 cell line. However, variations in the cellular proteome configuration can lead to different ivRBPs being incorporated into virions. This intriguing possibility warrants further investigation using various producer cells, such as primary T CD4+ lymphocytes and macrophages. Additionally, the impact of cell-dependent changes in the composition of virion-packaged gRNPs on the properties of viral particles should be thoroughly examined. These analyses will offer fresh perspectives on the cellular proteins that govern the functionality of viral particles.

Limitations of the study

ivRIC employs UV light to establish “zero distance” RNA-to-protein cross-linking. While highly specific, UV cross-linking suffers from low efficiency, particularly with proteins binding to double-stranded RNA tracts. To achieve a full picture of the virion-packaged gRNP, other cross-linkers such as formaldehyde could be used, acknowledging that chemical cross-linkers such as formaldehyde have lower specificity as they can also mediate longer distance covalent bonds between proteins.59,60 Here, we used oligo(dT) for viral RNA enrichment because HIV-1 gRNA has a poly(A) tail. To target other viruses with no poly(A) tail, it is possible to use specific antisense probes or total RNA purification methods.59,60 Additional quality controls, such as treatment of purified particles with RNases, will be required to rule out interactions with extra-virion RNA, particularly in experiments where the proportion of isolated viral RNA over cellular RNA is lower than with HIV-1. Such experiments would provide unambiguous evidence that the binding of ivRBPs to the isolated RNA occurs within the viral capsid. As with any proteomics-based method, protein identification can be affected by protein abundance, length, and amino acid composition, potentially resulting in false negatives. We optimized the amount of input viral particles used in ivRIC to achieve a comprehensive analysis of the gRNP packaged in HIV-1 virions. However, other viruses may require larger starting material to achieve similar depth, which will depend upon the number of viral RNA molecules per particle, the efficiency of the viral RNA purification and the sensitivity of the downstream mass spectrometry. Therefore, optimization and pilot experiments are likely required if applying ivRIC to other viruses. Several key control samples and quality control checks were used here to account for all potential sources of contamination, as described in Figures 1 and S1. If applying ivRIC to another viral model, these control samples and analysis (and others if relevant) should be put in place to confirm data quality.

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Alfredo Castello (alfredo.castello@glasgow.ac.uk).

Materials availability

All unique reagents and materials generated for this study, including plasmids and engineered cell lines, are available from the lead contact upon reasonable request and in accordance with the standard material transfer procedures of the University of Glasgow. This study generated plasmids and engineered cell lines that are derived from commercially available vectors and cell lines subjected to third-party licensing restrictions. For this reason, these materials have not been deposited in public repositories. Requests for materials should be directed to the lead contact.

Data and code availability

  • All proteomics data generated in this study have been deposited in the PRIDE database under accession number PRIDE: PXD042829. iCLIP2 RNA sequencing data have been deposited in the Gene Expression Omnibus (GEO) under accession number GEO: GSE262435. RNA sequencing data from purified HIV-1 particles have been deposited in the European Nucleotide Archive (ENA) under accession number ENA: PRJEB105389.

  • This study did not generate original code.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon reasonable request.

Acknowledgments

A.C. is funded by the European Research Council (ERC) Consolidator Grant “vRNP-capture” no. 101001634, the Career Development Award #MR/L019434/1, the John Fell Funds from the University of Oxford, and the Medical Research Council (MRC) grants MR/R021562/1 and MC_UU_00034/2. M.G.-M. is funded by the European Union’s Horizon 2020 research and innovation program under the Marie-Sklodowska-Curie grant agreement no. 700184. R.T. is funded by a BBSRC DTP scholarship DD01.20. L.I. is funded by the Biotechnology and Biological Sciences Research Council (BBSRC) DTP scholarship no. BB/M011224/1. C.E.L. is funded by the Department of Biochemistry Graduate Scholarship and the Hertford College Graduate Studies Scholarship of the University of Oxford. A.E.-B. is funded by Fundación Ramón Areces Post-doctoral fellowship program. T.J.M.S. is funded by Fondation pour la Recherche Médicale (FRM - FDT202001010798). W.K. is funded by the European Union’s Horizon 2020 research and innovation program under Marie-Sklodowska-Curie no. 842067. Q.G. was funded by the UK MRC MC_UU_00034/5. V.R. is funded by the European Union’s Horizon 2020 research and innovation program under Marie-Sklodowska-Curie no. 892756. E.P.R. is funded by Agence Nationale des Recherches sur le SIDA et les Hépatites Virales (ANRS – ECTZ3306), Fondation Finovi, and the ERC (ERC-StG-LS6-805500) under the European Union’s Horizon 2020 research and innovation program as well as the ATIP-Avenir program. I.D. was funded by the Wellcome Trust Investigator Award 209412/Z/17/Z and the University of Glasgow.

While this work is funded by the European Union, views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Research Council Executive Agency. Neither the European Union nor the granting authority can be held responsible for them.

Author contributions

Conception/design, M.G.-M., A.E.-B., R.T., and A.C.; data acquisition, M.G.-M., A.E.-B., R.T., C.E.L., H.C., M.N., T.J.M.S., K.D., S.D.S., W.K., N.P., V.R., and S.H.; data analysis, M.G.-M., R.T., L.I., C.E.L., H.C., S.D.S., Q.G., A.I.J., W.K., J.Y.L., I.D., S.M., and A.C.; data interpretation, M.G.-M., A.E.-B., R.T., L.I., J.Y.L., I.D., M.N., S.M., and A.C.; writing – original draft, A.C.; writing – editing, all authors; funding acquisition, M.G.-M., A.E.-B., I.D., S.M., and A.C.; resources, T.J.M.S., J.Y.L., M.N., E.P.R., I.D., S.M., and A.C.; supervision, M.G.-M., M.N., E.P.R., I.D., S.M., and A.C.

Declaration of interests

The authors declare no competing interests.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
OMICS Data

Dataset Repository ID
Proteomics data PRIDE PXD042829
iCLIP2 data GEO GSE262435
RNAseq of purified HIV particles ENA PRJEB105389

Antibodies

Human monoclonal HIV-1 p24 NIBSC – Center for AIDS Reagents Cat#ARP3279
Mouse monoclonal HIV-1 Integrase (IN-2) Santa Cruz Cat#sc-69721
Rabbit HIV-1 gp120 NIBSC – Center for AIDS Reagents Cat#ARP421
Mouse monoclonal HIV-1 Nef (clone 3D12) NIBSC – Center for AIDS Reagents Cat#EVA3067.5
Rabbit polyclonal MOV10 Cusabio Cat#CSB-PA862068LA01HU
Rabbit polyclonal PURA Abcam Cat#ab79936
Rabbit polyclonal PURA Proteintech Cat#17733-1-AP
Rabbit polyclonal SUB1 Abcepta Cat#AP19807c-ev
Mouse monoclonal β-ACTIN (clone AC-15) Merck Cat#A1978
Rabbit monoclonal Myc-Tag (71D10) Cell Signaling Cat#2278S
Rabbit polyclonal PURB Proteintech Cat#18128-1-AP
Rabbit polyclonal FAM120A ProSci Cat#5307
Rabbit polyclonal MSN (clone FIC-13) Abcepta Cat#AP13752b
Rat monoclonal GFP (3H9) ChromoTek GmbH Cat#3h9-100
Mouse monoclonal Tubulin (clone DM1A) Sigma Aldrich Cat#T9026
Rabbit monoclonal PTPRC/CD45 (clone 6O19) Thermo Fisher Scientific Cat#80297-1-RR100UL
Mouse monoclonal SPN/CD43 (clone 2A11D6) Proteintech Cat#66224-1-Ig
Rabbit polyclonal SERINC3 Proteintech Cat#20267-1-AP
Mouse monoclonal TetR (clone 9G9) Takara Bio Cat#631131
IRDye 680RD Goat anti-Human IgG Secondary Antibody Li-Cor Cat#926-68078
IRDye 800CW Goat anti-Human IgG Secondary Antibody Li-Cor Cat#926-32232
IRDye 680RD Donkey anti-Mouse IgG Secondary Antibody Li-Cor Cat#926-68072
IRDye 800CW Donkey anti-Mouse IgG Secondary Antibody Li-Cor Cat#926-32212
IRDye 680RD Donkey anti-Rabbit IgG Secondary Antibody Li-Cor Cat#926-68073
IRDye 800CW Donkey anti-Rabbit IgG Secondary Antibody Li-Cor Cat#926-32213
IRDye 680RD Goat anti-Rat IgG Secondary Antibody Li-Cor Cat#926-68076
IRDye 800CW Goat anti-Rat IgG Secondary Antibody Li-Cor Cat#926-32219

Chemicals, peptides, and recombinant proteins

Janelia Fluore Halo-646 ligand Promega Cat#GA1120
Cell Mask Invitrogen Cat#C37608
3X FLAG peptide Merck Cat#F4799-4MG

Critical commercial assays

Oligo(dT)25 beads New England Biolabs Cat#S1419S
Luna Universal One-Step RT-qPCR kit New England Biolabs Cat#E3005S
HiScribe T7 ARCA mRNA kit with tailing New England Biolabs Cat#E2060S
HIV-1 Gag p24 DuoSet ELISA kit Bio-Techne Cat#DY7360-05
NextSeq500/NextSeq 550 Illumina Cat#20024907/20024906
anti-FLAG M2 magnetic beads Merck Cat#M8823-1ML
GFP_Trap agarose bead slurry ChromoTek Cat#gta

Deposited data

Proteomic datasets PRIDE PXD042829
RNA sequencing data (iCLIP2) GEO GSE262435
Viral particle RNAseq ENA PRJEB105389
Human proteome reference (downloaded Nov 2016) UniProt UP000005640
Human genome (Release 104) ENSEMBL GRCh38
RBPbase v0.23 alpha EMBL https://rbpbase.shiny.embl.de/

Experimental models: Cell lines

SupT1 ECACC Cat#95013123; RRID:CVCL_1714
SupT1 PURA KO; PURB KO; MSN KO; FAM120A KO This paper N/A
Flp-In Jurkat Thermo Fisher Scientific Cat#R76207; RRID:CVCL_U426
Flp-In T-Rex Jurkat This paper N/A
Flp-In T-Rex Jurkat EGFP; PURA-EGFP; PURB-EGFP This paper N/A
Flp-In T-Rex 293 Thermo Fisher Scientific Cat#R78007
Flp-In T-Rex 293 PURA-EGFP This paper N/A
Flp-In T-Rex 293 PURA KO This paper N/A
HeLa ATCC Cat#CCL-2
HEK293T ATCC Cat#CRL-3216
Huh-7 Prof. Matthias Hentze (gift)
Huh-7 PURA KO Yeh et al.61 Dr. Mariano A Garcia-Blanco (gift)

Oligonucleotides

(RT-qPCR) HIV genomic qPCR F: ctgaagcgcgcacggcaa N/A
(RT-qPCR) HIV genomic qPCR R: gacgctctcgcacccatctc N/A
(Early RT) hRU5-F: GCCTCAATAAAGCTTGCCTTGA N/A
(Early RT) hRU5-R: TGACTAAAAGGGTCTGAGGGATCT N/A
(Intermediate RT) Gag-F1: CTAGAACGATTCGCAGTTAATCCT N/A
(Intermediate RT) Gag-R1: CTATCCTTTGATGCACACAATAGAG N/A
(Late RT) MH531: TGTGTGCCCGTCTGT
TGTGT
N/A
(Late RT) MH532: GAGTCCTGCGTCGAG
AGATC
N/A
GAPDH –F: ATGGGGAAGGTGAAGGTCG N/A
GAPDH –R: GGGGTCATTGATGGC
AACAATA
N/A
Primers for cloning, sgRNAs and knock out validation This paper See Table S6

Recombinant DNA

pNL4-3 NIBSC – Center for AIDS Reagents Cat#2006
pNL4-3.Luc.R-E− NIBSC – Center for AIDS Reagents Cat#2128
pNL4.3-mCherry-T2A-Nef This paper N/A
pNL4-3.R-E-mCherry-T2A-Nef Garcia-Moreno et al.62 N/A
pNL4-3.R-E-mCherry-T2A-Nef-Rev-Flag-3xMyc This paper N/A
pNL4-3.R-E-mCherry-T2A-Nef -Rev-Halo This paper N/A
pNL4-3.R-E-mCherry-T2A-Nef-ΔRev This paper N/A
pNL4.3-mCherry-T2A-gag Garcia-Moreno et al.62 N/A
psPAX-2 Addgene Cat#12260
pHEF-VSVG NIH AIDS Reagent Program Cat#4693
pcR-Blunt vector Thermo Fisher Scientific Cat#K270020
pcR-Blunt-NL4.3-mCherry-T2A-Nef This paper N/A
pcDNA6/TR plasmid Thermo Fisher Scientific Cat#V102520
pOG44 Thermo Fisher Scientific Cat#V600520
pcDNA5/FRT/TO-EGFP Castello et al.63 N/A
pcDNA5/FRT/TO-EGFP-linker Garcia-Moreno et al.62 N/A
MGC Human PURA Sequence-Verified cDNA (CloneId:5284976) Horizon Cat#MHS6278-202808724
BLADE plasmid Mageot et al.64 N/A

Software and algorithms

CFX Manager Software v3.1 Bio-Rad https://www.bio-rad.com/en-uk/sku/1845000-cfx-manager-software?ID=1845000
ImageJ NIH https://imagej.net/ij/
MaxQuant v1.6.3.4 Max-Planck-Institute of Biochemistry https://www.maxquant.org/
Image Studio Software LICORbio https://www.licorbio.com/image-studio
FlowJo BD Life Sciences https://www.flowjo.com/flowjo/download
R The R Foundation https://www.r-project.org/
BiomaRt Durinck et al.65,66 https://bioconductor.org/packages/release/bioc/html/biomaRt.html
UpsetR Conway et al.67 https://cran.r-project.org/web/packages/UpSetR/index.html
venneuler https://cran.r-project.org/web/packages/venneuler/index.html
imputeLCMD https://CRAN.R-project.org/package=imputeLCMD
FISH-quant v2 Imbert et al.68
limma Ritchie et al.69
STAR aligner Dobin et al.70
Cutadapt Cutadapt removes adapter sequences from high-throughput sequencing reads | Martin | EMBnet.journal
Je Suite Girardot et al.71
HTSeq-clip Sahadevan et al.72
BEDTools Quinlan and Hall73
DEW-Seq Sahadevan et al.72
GenomicRanges R package Lawrence et al.74
BBMap https://sourceforge.net/projects/bbmap/
AnnotationDbi https://bioconductor.org/packages/release/bioc/html/AnnotationDbi.html
HOMER Heinz et al.75 http://homer.ucsd.edu/homer/
universalmotif Tremblay et al.76 https://www.bioconductor.org/packages/release/bioc/html/universalmotif.html
ggseqlogo Wickam et al.77 https://cran.r-project.org/web/packages/ggseqlogo/index.html
ggplot2 Wickam et al.77 https://ggplot2.tidyverse.org/
Cytoscape v3.9.1 Shannon et al.78 https://cytoscape.org/
stringApp Docheva et al.79
clusterMaker2 Morris et al.78
MCODE Bader and Hogue81
DyNet Goenawan et al.82
EnrichR Xie et al.83 https://maayanlab.cloud/Enrichr/
REVIGO Supek et al.84
TIDE Brinkman et al.85
Prism v10 GraphPad https://www.graphpad.com/features

Experimental model and study participant details

Viruses and cells

The following human cell lines are available in cell culture collections or commercially: T-lymphoblast SupT1 (male; ECAAC, #95013123), T-lymphocyte Jurkat Flp-In (male; Thermo Fisher Scientific, #R76207), embryonic kidney HEK293 Flp-In T-REx (female; Thermo Fisher Scientific, #R78007), HeLa (female; ATCC, #CCL-2), Huh-7 (male; a gift from Matthias W Hentze) and HEK293T (female; ATCC, #CRL-3216). Jurkat Flp-In and tetracycline inducible Jurkat Flp-In T-REx expressing tagged RBPs were obtained as described below. Most cell lines utilised in this study were sourced from reputable commercial suppliers, with authentication being managed by the respective providers. In certain instances, cell lines were generously supplied by established collaborators; while we did not independently verify their authentication, these cells have been used in several peer-reviewed publications and were used only for confirmatory cross-validation experiments. SupT1, Jurkat Flp-In and Jurkat Flp-In T-REx were cultured in RPMI-1640 with 1 mM sodium pyruvate and 10mM HEPES, while HEK293 Flp-In T-REx, HeLa, HEK293T and Huh-7 were cultured in DMEM. Cell culture media was supplemented with 10% fetal bovine serum (FBS), 1x penicillin/streptomycin (Sigma Aldrich, #P4458), 1.25 μg/mL amphotericin B (Sigma Aldrich, #A2942) and the following specific antibiotics: 100 μg/mL Zeocin (Thermo Fisher Scientific, #R25001) for Jurkat Flp-In; 100 μg/mL Zeocin and 7.5 μg/mL Blasticidin S Hydrochloride (Cambridge Bioscience, #B001-100mg) for Jurkat Flp-In T-REx; 350 μg/mL Hygromycin B (Cambridge Bioscience, #H011-20mL) and 7.5 μg/mL Blasticidin for inducible Jurkat Flp-In T-REx EGFP, PURA-EGFP and PURB-EGFP; and 100 μg/mL Zeocin and 5 μg/mL Blasticidin for HEK293 Flp-In TRex. All cells were cultured in a humidified incubator at 37°C with 5% CO2. Only mycoplasma-negative cells, confirmed by regular testing, were used in this study.

Full length, infectious HIV-1mCherry-Nef was obtained by transfection of pNL4.3-mCherry-T2A-Nef plasmid into HEK293T. Single round viruses HIV-1R-E-mCherry-Nef (mCherry tagged to Nef) and HIV-1R-E-Gag-mCherry (mCherry tagged to matrix) were produced by co-transfecting HEK293T cells with pNL4-R-E-mCherry-Nef or pNL4.3-R-E-mCherry-gag plasmids,62 a plasmid encoding the vesicular stomatitis virus glycoprotein (pHEF-VSVG, NIH AIDS Reagent Program, #4693) and, in the case of HIV-1Gag-mCherry, also pPAX-2 (Addgene plasmid #12260).

Method details

Plasmids and recombinant DNA procedures

To generate HIV-1mCherry-Nef we followed a strategy analogous to86 but using mCherry (pNL4.3-mCherry-T2A-Nef) instead Renilla luciferase. The pcR-Blunt-NL4.3-mCherry-T2A-Nef plasmid, used for in vitro transcription of HIV-1 genomic RNA, was generated by cloning the fragment between FspAI and PdiI from pNL4-3.mCherry-T2A-Nef into pcR-Blunt vector (Thermo Fisher Scientific, #K270020).

Rev-tagged plasmids pNL4.3.R-E-mCherry-T2A-Nef-Rev-Flag-3xMyc and pNL4.3.R-E-mCherry-T2A-Nef-Rev-Halo were made as follows. A synthetic DNA sequence containing a BamHI site, 173 bp of the NL4-3 Rev C-terminal domain, a flexible glycine-serine linker (TCGGCCGGAGGA), the relevant tag, a stop codon and a HpaI site was generated and inserted into pNL4.3.R-E-mCherry-T2A-Nef using BamHI and HpaI restriction sites. pNL4.3.R-E-mCherry-T2A-Nef-ΔRev plasmid was made by introducing 2 point mutations (T5974C and T6041A) in the Rev ORF of the pNL4.3.R-E-mCherry-T2A-Nef plasmid. Firstly, pNL4.3.R-E-mCherry-T2A-Nef was treated with BamHI-HF and EcoRI enzymes to generate a 2.7kb template. To generate the T5974C point mutation, fusion PCR was performed with T5974C_Primer_A, T5974C_Primer_D, T5974C_Primer_B and T5974C_Primer_C. This process was then repeated with primers targeting the T6041A point mutation.

Plasmids encoding RBP-tagged proteins for the generation of inducible cell lines were cloned by 1) amplification of the protein with specific primers from SupT1 cDNA or available template plasmids, and 2) insertion into a pcDNA5/FRT/TO-EGFP-linker vector containing the EGFP tag before (N terminus tagging) the multicloning site.63 Single-guide (sg)RNA expression plasmids for CRISPR/Cas9-mediated knock out were generated by inserting annealed oligos into the BLADE plasmid as described before.87 All plasmids were validated by sequencing.

Gene knock-out in SupT1 and HEK293 cells

We produced nanoblades loaded with Cas9 and specific sgRNAs targeting two regions for each gene as described before.87 Nanoblades particles loaded with the Cas9 protein and specific sgRNAs targeting the gene of interests were used in SupT1 and HEK293 Flp-In T-Rex cell lines. In both cases, we transduced 1 × 105 cells with 5–20 μL of nanoblades and 4 μg/mL polybrene in 200 μL of growth medium. 4 h later, 300 μL of fresh medium with 10% FBS was added to reach 500 μL. Cells were incubated for 48 h and then assessed for editing efficiency by genomic DNA extraction (Monarch Genomic DNA Purification Kit, New England Biolabs, #T3010S) followed by PCR, PCR cleanup (QIAquick PCR Purification Kit, Qiagen, #28104), sequencing and sequence trace decomposition using TIDE.85 Single knock out (KO) clones were obtained by serial dilution and validated by western blot and cytoplasmic mRNA quantification by RT-qPCR. Huh-7 PURA KO cells were a kind gift of Dr. Mariano A Garcia-Blanco.61

Virus production

To generate infectious HIV-1mCherry-Nef, we transfected HEK293T with pNL4.3-mCherry-T2A-Nef plasmid.88 The supernatant was collected 48 h post-transfection (hpt), cleared by centrifugation (3000g for 10 min, 4°C), filtrated with a 0.45μm PVDF Stericup-HV system (Merck, #S2HVU01RE), and precipitated with PEG 6000.88 This primary virus stock was titrated by infecting SupT1 and counting mCherry-expressing cells in a flow cytometer. To produce the virus in SupT1, we infected cells at MOI 0.1 by spinoculation and expanded the infected culture by replacing the growth medium every 48 h, keeping cell concentration at 1 × 106 cells/ml. Viruses were purified as above.

To obtain samples for ‘in virion RNA-interactome capture’ (ivRIC), infected SupT1 cells were mixed with uninfected cells at a 1:4 ratio and a concentration of 1 × 106 cells/ml. Medium was replaced by centrifugation at 400g for 5 min, and cells were incubated at 37°C and 5% CO2 in a T175 flask with 120 mL of medium. At 48 hpi, cells were collected by centrifugation (400g for 5 min) and supernatant was kept. Cell pellet was washed in PBS 1X and lysed in 5 mL of RIC lysis buffer 1X (20 mM Tris-HCl pH 7.5, 0.5 M LiCl, 1 mM EDTA, 0.1% IGEPAL, fresh 0.5% LiDS wt/vol and fresh 5 mM DTT) during 1h at 4°C. Cell lysates were homogenized by pipetting and passing them through a 27G needle several times, and frozen at −80°C for whole cell proteome analysis. Supernatants were further cleared by centrifugation (3000g for 10 min at 4°C), followed by filtration with 0.45μm PVDF Stericup-HV filters. Viral particles were purified on a 10% sucrose cushion (50 mM TrisHCl pH 7.4; 100 mM NaCl; 0.5 mM EDTA; 10% sucrose) at a 4:1 vol:vol ratio in 50 mL conical tubes (Sarstedt, #62.546.254) by centrifugation at 10000g for 4h and 4°C.89 The pellet was resuspended in PBS 1X containing 50 mM HEPES, incubated overnight at 4°C, and finally aliquoted and stored at −80°C until use.

Single-round, pseudotyped HIV-1 particles were produced by co-transfecting HEK293T cells with pNL4-3.R-E− derived plasmids together with pHEF-VSV-G using calcium phosphate. Virus was collected and concentrated 48 hpt88 and titrated by flow cytometry.

For analysis of HIV-1 reverse transcription in viruses produced in WT or PURA KO cells, and second round of infection studies, HEK293 Flp-In T-Rex WT and PURA KO were co-transfected with pNL4-3.R-E-mCherry-T2A-Nef and pHEF-VSVG plasmids as described above. Virus-containing supernatants were collected 48 hpt, cleared by centrifugation (2000g for 10 min, 4°C), filtrated with a 0.45μm PVDF Stericup-HV, and viral particles were purified on a 10% sucrose cushion (50mM TrisHCl pH7.4; 100mM NaCl; 0.5mM EDTA, 10% Sucrose) at 1:3 (sucrose:supernatant) in 50mL flacon tube by centrifugation at 10000g overnight at 4°C and resuspended in PBS 1X, 50mM HEPES, pH 7.4.

Virus titration by RT-qPCR

The number of viral particles employed in ivRIC was calculated as follows. 100 μL of purified HIV-1mCherry-Nef (or uninfected samples as control) were mixed with 100 μL of RIC lysis buffer 2X (40 mM Tris-HCl pH 7.5, 1 M LiCl, 2 mM EDTA, 0.2% IGEPAL, fresh 1% LiDS wt/vol and fresh 10 mM DTT) and incubated for 1h at 4°C to inactivate the virus. Poly(A) RNA was purified using 100 μL of oligo(dT)25 beads (New England Biolabs, #S1419S) following the procedure described in the ‘ivRIC’ section, but eluting in 100 μL. Eluted RNA concentration was measured in a Qubit 2.0 Fluorometer using Qubit RNA High Sensitivity Assay Kit (Thermo Fisher Scientific, #Q32855), and HIV-1 RNA at different sample dilution was quantified using the Luna Universal One-step RT-qPCR kit (New England Biolabs, #E3005S) with specific primers against the 5′ end of HIV-1 gRNA (Key Resources Table). RT-qPCRs were performed on a CFX96 Touch Real-Time PCR Detection System (Bio-Rad).

HIV-1 RNA abundance was calculated in reference to a standard curve (Ct value vs. RNA copy number) generated with purified, in vitro synthesized HIV-1mCherry-Nef RNA. This RNA was generated from 1 μg of pcR-Blunt-NL4.3-mCherry-T2A-Nef plasmid that was the template to synthesize capped HIV-1 RNA with the HiScribe T7 ARCA mRNA Kit with tailing (New England Biolabs, #E2060S). The reaction was incubated for 2 h at 37°C in 20 μL. The template DNA was removed by treatment with 4 units of DNase I for 15 min at 37°C. Poly(A) tail was added with E. coli Poly(A) Polymerase for 30 min at 37°C. The in vitro synthesized HIV-1 RNA was purified using oligo(dT)25 beads and quantified in a Qubit 2.0 Fluorometer as above. Then, 5 × 10-fold serial dilutions were prepared for the One-step RT-qPCR reaction described above to obtain the corresponding Ct values and generate a standard curve. HIV-1 RNA copy number in ivRIC eluates was estimated by interpolating in the standard curve, using the formula:

[1 × 10−9 g/ng 6.022 1023 molecules/mol amount of ssRNA in ng]/[(An x 329.2) + (Un x 306.2) + (Cn x 305.2) + (Gn x 345.2) + 159 g/mol],90 similar to the commercial kit Lenti-X qRT-PCR Titration Kit(Takara, #631235).

Data was analyzed with CFX Manager Software v3.1 (Bio-Rad). The number of viral particles was estimated considering that each virion contains 2 copies of viral RNA.

To estimate the percentage of HIV-1 RNA relative to total poly(A) RNA in purified virions (Figure 1E), we measured total RNA concentration using Qubit after oligo(dT)25 RNA isolation from viral particles. The expected RNA copy number if 100% of poly(A) RNA molecules were HIV-1 RNA was estimated using the previous formula and the nucleotide content of HIV-1mCherry-Nef genome. Finally, the real proportion of HIV-1 RNA in purified virions was obtained from RNA copy number measured by RT-qPCR:

%HIV1RNA=MeasuredHIV1RNAcopynumber100/ExpectedHIV1RNAcopynumber.

HIV-1 reverse transcription analysis

The analysis of HIV-1 reverse transcription (RT) was carried out using a protocol previously described with minor modifications.91 Prior to infection, to eliminate free DNA, 50μL of HIV-1 particles generated in WT or PURA-KO cells (HEK293 Flp-In T-Rex or Huh-7) were treated with 4U of TurboDNase I (Invitrogen, #AM2238) plus 4mM of MgCl2 for 1h at 37°C. CA-p24 levels were quantified with the DuoSet ELISA kit (Bio-Techne, #DY7360-05). 0.8 × 106 WT or PURA KO cells (HEK293 Flp-In T-Rex or Huh-7) were seeded in 6-well plates, infected with single round HIV-1R-E-mCherry-Nef virus at MOI 1 upon normalisation by CA-p24 levels, and incubated at 37°C for 2 h in 5% FBS DMEM. Cells were then washed twice with PBS 1X and incubated at 37°C in 5% FBS DMEM. Cells were collected at 0, 2, 4 and 8 hpi in PBS 1X and sedimented at 1500g for 10min. Total DNA was extracted using Monarch Genomic DNA Purification Kit (New England Biolabs). 50 ng of DNA was used for each reaction, along with 100μM of primers (see Key Resources Table) and qPCR Brilliant III SYBR Master Mix (Agilent Technologies Inc, #600883) to a total volume of 10μL per well. The amplification was carried out in a MicroAmp Optical 96-Well Reaction Plate (Applied Biosystems, #N8010560). qPCR data was analyzed using the ΔΔCt method normalised to GAPDH levels. Difference in distribution between two samples were analyzed by unpaired t test.

RNA sequencing of purified viral particles

The compositional analysis of oligo(dT) isolated RNA from sucrose purified viral particles was done by RNA sequencing as follows. Uninfected cells were mixed with highly infected cells at a 10:1 ratio at a concentration of 1×106 cells/ml and co-cultivated for 48h. Cell pellet was sedimented at 400 rcf and supernatant precleared at 3000 rcf for 10 min before filtering through 0.45μm PES filter. Viral particles were purified over 10% sucrose cushion by centrifugation at 10000 rcf for 4h at 4°C. Total RNA was purified by Trizol extraction and processed with Illumina Truseq stranded kit with Invitrogen ERCC RNA Spike-In Mix and sequenced on an Illumina NextSeq 500 sequencer using a high-output cartridge (Cat# 20024907).

Quality control of HIV-1 viral preparations

Viral particles were purified as above. Producer cells were lysed in 1x RIPA buffer (50mM TRIS pH 7.5, 150mM NaCl, 0.1% SDS, 0.5% sodium deoxycholate, 1% Triton X-100) supplemented with 50 μg/ml AEBSF protease inhibitor, while the supernatant was mixed 1:1 with 1x RIPA buffer, and the viral pellet (after sucrose cushion purification) was resuspended in 1x RIPA buffer. Samples were heat inactivated for 1h at 60°C before analysis by SDS-PAGE probed with antibodies specific to PTPRC/CD45 (ThermoFisher, #80297-1-RR100UL), SPN/CD43 (Proteintech, #66224-1-Ig), SERINC3 (Proteintech, #20267-1-AP), p24 (NIBSC – Center for AIDS Reagents, Cat#ARP3279), ACTB (Merck, Cat#A1978) following the protocol outlined below.

In virion RNA interactome capture (ivRIC)

7 × 1011 purified HIV-1mCherry-Nef particles per condition (or the equivalent volume from uninfected cells) were spread on a 6-well plate in 1 mL of total volume (topped up with 1x PBS) on ice and were irradiated with 150 mJ/cm2 of UV light at 254 nm. 333 μL of RIC lysis buffer 4X (80 mM Tris-HCl pH 7.5, 2 M LiCl, 4 mM EDTA, 0.4% IGEPAL, fresh 2% LiDS wt/vol and fresh 20 mM DTT) was added to achieve 1X. Samples were recovered using a rubber cell scraper, transferred to a tube and homogenised by pipetting. Lysates were incubated for 1 h at 4°C to ensure the full inactivation of the virus, and then kept at −80°C.

Samples were thawed at room temperature, and 10% of the sample was taken as input (i.e., viral particle proteome). The rest of the sample was incubated under gentle rotation for 2 h at 4°C with 450 μL of oligo(dT)25 beads, pre-equilibrated with RIC lysis buffer. Beads were collected in the magnet and the supernatant was stored at 4°C in a new tube. Beads were washed once with 1.5 mL of 1X RIC lysis buffer for 5 min at 4°C, inverting the tubes gently 10 times every minute. Beads were subsequently washed twice with 1.5 mL of cold RIC buffer 1 (20 mM Tris-HCl pH 7.5, 500 mM LiCl, 0.1% LiDS wt/vol, 1 mM EDTA, 0.1% IGEPAL and 5 mM DTT), twice with 1.5 mL of cold RIC buffer 2 (20 mM Tris-HCl pH 7.5, 500 mM LiCl, 1 mM EDTA, 0.01% IGEPAL and 5 mM DTT) and finally twice with 1.5 mL of room temperature RIC buffer 3 (20 mM Tris-HCl pH 7.5, 200 mM LiCl, 1 mM EDTA and 5 mM DTT). Beads were resuspended in 200 μL of elution buffer (20 mM Tris-HCl pH 7.5 and 1 mM EDTA) and incubated for 3 min at 55°C with agitation (200 rpm). Beads were recycled in 0.1M NaOH for 5 min at 55°C with agitation (200rpm), equilibrated in 1X RIC lysis buffer, and re-used for one additional capture following the same protocol. Eluates of the first and second round of oligo(dT) capture were combined and stored at −80°C. Total eluted RNA was quantified using the Qubit RNA High Sensitivity Assay.

For proteomics, samples were treated with RNase A/T1 mix for 1.5 h at 37°C and then 15 min at 50°C as previously described.92 Whole cell lysates were treated with 250U/ml of benzonase (Millipore, #70746-4) for 30 min. Protein concentration was measured using Pierce 660m Protein Assay Reagent and Ionic Detergent Compatibility Reagent (Thermo Fisher Scientific, #22660 and #22663 respectively) on a microplate reader (CLARIOstar Plus, BMG Labtech). Samples were further processed by single-pot, solid-phase-enhanced sample preparation (SP3) as described in.93 For conventional protein analyses, ivRIC eluates were concentrated on an Amicon Ultra-0.5 centrifugal filter unit 3KDa cut-off (Millipore, #UFC500324) following manufacturer’s recommendations. RNA in eluates was digested with RNases T1 and A as above.

Conventional protein analyses

Samples were mixed with NuPAGE LDS Sample Buffer 4X (Thermo Fisher Scientific, #NP0008), incubated for 10 min at 70°C, resolved by SDS-PAGE and analyzed by 1) Western Blot using specific antibodies (Key Resources Table and Table S6), LI-COR Odyssey Fc imaging system for visualization and the Image Studio Software for quantification, or 2) silver staining using the SilverQuest kit (Invitrogen, #LC6070). Statistical significance of Western blot quantifications was analyzed by two-way ANOVA with Dunnett’s correction.

HaLo labeling and single molecule RNA FISH

High Precision Coverslips (Marienfeld, #0107052) were washed once in 1 M HCl for 30 min on a rocking machine, twice in double distilled water for 10 min and once in ethanol 70% for 10 min 150,000 HeLa cells were seeded on the prewashed coverslips in wells of a 6-well plate and incubated in DMEM with 10% FBS. Cells were infected 24 h later using VSV-G pseudotyped single round virus. At 48 hpi, cells were washed with PBS and fixed with 4% paraformaldehyde for 10 min at room temperature. Cells were washed with PBS three times with gentle rocking, permeabilised with PBS +0.1% Triton X-(PBSTX), and washed three times with PBSTX for 5 min with gentle rocking. Cells were incubated with 50 nM of Janelia Fluor Halo-646 ligand (Promega, #GA1120) in PBSTX. Coverslips were gently washed 3 times with PBSTX before incubation for 5 min at 37°C with 2 μg/mL of DAPI in PBSTX for 5 min. Cells were then washed twice with PBSTX, once with PBS for 5 min, and once with milliQ H2O, followed by mounting on glass slides using Vectashield Antifade mounting medium (Vector Laboratories, #H-1000).

RNA FISH probes were designed using the LGC Biosearch Technologies’ Stellaris RNA FISH Probe Designer focusing on the first 2000 bp of the gag-pol ORF that is only present in gRNA. For single molecule (sm)FISH, cells were seeded and treated as above. After permeabilization, coverslips were washed at 37°C for 10 min with PBSTX, PBSTX with 1X saline sodium citrate solution (SSC), PBSTX with 2X SSC and finally in pre-hybridisation buffer (2x SSC and 10% deionized formamide in DEPC water). Next, cells were incubated in a wet chamber for 16 h at 37°C with 125 nM HIV-1 gRNA-specific Stellaris probes (LGC Biosearch Technologies) in hybridization buffer (2x SSC, 10% deionized formamide and 10% dextran sulfate in DEPC water). Cells were subsequently washed twice with pre-hybridization buffer for 10 min at 37°C and incubated with DAPI and mounted as above.

For the HEK293 PURA KO analyses, 2 × 105 cells were seeded on coverslips (Fisherbrand Glass Circle Coverslips, #12333138) prewashed in 100% ethanol for 30 min and precoated with poly-L-Lysine solution (Sigma-Aldrich, #P4832) for 5 min at 37°C in a 24-well plate, and incubated in DMEM supplemented with 10% FBS. The following day, the cells were infected using MOI 0.5 HIV-1R-E-Gag-mCherry in DMEM with 5% FBS. 24 hpi, cells were washed twice with PBS 1X and fixed as above, with the difference that after the wash with pre-hybridization buffer and PBS, cells were incubated with DAPI (Invitrogen, #D1306) and Cell Mask (Invitrogen, #C37608) with 1:1000 and 1:2000 dilution ratio, respectively, for 30 min at 37°C. After 2 washes with PBS and one with nuclease free water, the coverslips were mounted on glass slides using SlowFade Diamond Antifade Mountant. Images were acquired using a 63x/1.40 oil DIC M27 Plano Apochromat objective lens with a total magnification of 63× on a Zeiss LSM 880 Axio-Observer confocal microscope. Cell and nuclear segmentation was performed in ImageJ using 2D maximum intensity projected images of CellMask Green and DAPI stains. Total number of cells, cytoplasm and nucleus per image was counted in ImageJ. Single-molecule-level quantification of smFISH images was performed using a custom Python pipeline.68 Tif files were converted to a numpy array, and individual cells were segmented as described above. Images where cells were labeled with the CellMask stain were pre-processed with a median filter, radius = 50. Background signal in the smFISH channel was subtracted using ImageJ. Threshold setting for smFISH spot detection was set specifically based on mock samples for each set of images collected in each session. ImageJ was also used to manually quantify the intensity of the transcription foci on HIV-1 infected cells. Nucleoplasmic foci with fluorescence intensity evidently superior that single gRNA molecules were considered transcription sites as previously described.94 Statistical significance was analyzed by unpaired t test with Welch correction where n.s. denotes not significant, ∗∗∗∗ denotes p < 0.001.

Analysis of cell viability and proliferation

To evaluate cell growth and viability, WT and KO cells were seeded at a concentration of 4 × 105 cells/ml. 24, 48 and 72 h later, the number of cells and the percentage of living cells after trypan blue staining was estimated in a Countess II FL Automated Cell Counter (Thermo Fisher Scientific). Cell viability was also assessed by adding CellTiter96 Aqueous One Solution Cell Proliferation Assay (Promega, #G3580) and measuring absorbance at 490 nm on a microplate reader.

Flow cytometry analysis of HIV fitness

We infected 2 × 106 SupT1 cells (WT and KO) with infectious HIV-1mCherry-Nef at 0.1 MOI by spinoculation, replaced growth medium at 2 hpi and incubated the cells for 48 h. Cells were collected by centrifugation at 400g for 5 min 1) For viral gene expression assessment, cell pellet was resuspended in 2 mL of fresh medium (approximately 1 × 106 cells/ml). 500 μL of infected cells were fixed in formaldehyde (4% final concentration), incubated 1 h at 4°C and analyzed by flow cytometry. 2) The supernatant of these cells was collected and further cleared by centrifugation at 18000g for 10 min. Viral particles were precipitated with PEG 600088 and titrated by RT-qPCR. 3) The remaining viral particle sample was normalised by RNA levels and used to infect 1.5 × 105 SupT1 WT cells (second round) by spinoculation. After 48 h, cells were fixed and analyzed by flow cytometer as before. Flow cytrometry data was analyzed using FlowJo Software (BD Life Sciences). Statistical significance was analyzed by ordinary one-way ANOVA with Dunnett’s correction.

Flow cytometry analysis of HIV-1 infection in HEK293 and Huh-7 cells was done as follows. Cells were seeded at a density of 6 × 104 cells/well with 100 μL DMEM 5% FBS without phenol red in a 96-well plate. After 24 h, cells were infected with 1 and 0.5 MOI using CA-p24 normalised single round HIV-1R-E-mCherry-Nef produced in WT or PURA KO cells (see above). After 24h, cells were harvested using TrypLE Express Enzyme(1X) (Gibco, #12604013) and fixed in 2% PFA for 10 min. Cells were analyzed for red fluorescence using a Guava EasyCyte flow cytometer (Cytek Biosciences) counting 15,000 events. Unpaired t test was carried out to analyze statistical significance.

Generation of Tet-on inducible Jurkat cells

To obtain the Jurkat Flp-In T-REx cell line, we first linearized pcDNA6/TR plasmid (Thermo Fisher Scientific, #V102520) with SapI restriction enzyme and transfected it into Jurkat Flp-In cells (Thermo Fisher, #R76207) using a Lonza Amaxa Nucleofector II and the Cell Line Nucleofector Kit V (Lonza, #VCA-1003), according to the manufacturer’s recommendations for Jurkat E6-1. Stably transfected single clones were selected by serial dilution in Zeocin plus Blasticidin containing media. Expression of Tet-R repressor was verified by Western blot and tested for inducible expression upon nucleofection of a control EGFP plasmid. To generate PURA/B-tagged inducible expression cell lines, we co-transfected Jurkat Flp-In T-REx cells with pOG44 (Thermo Fisher Scientific, #V600520) and the corresponding pcDNA5/FRT/TO plasmid (Table S6) at a 2:8 ratio. At 48 hpt, zeocin was replaced by Hygromycin B for selection.

Protein-protein interaction (PPI) analysis

1.5 × 107 SupT1 cells were seeded in 15 cm dishes and infected with HIV-1R-E-mCherry-Nef and HIV-1R-E-Rev-FLAG-Myc at MOI 1. Cells were pelleted at 48h by centrifugation at 400g for 5 min and resuspended in 1 mL ice-cold lysis buffer (50 mM Tris pH 7.5, 150 mM NaCl, 1% Triton X-, 0.5 mM EDTA, 25 U/ml of benzonase, 0.1 mM AEBSF) and lysed for 30 min on ice. Lysates were then vortexed and centrifuged at 300g for 3 min. The supernatant was stored in 1.5 mL Eppendorf tubes. Samples were precleared with 100 μL of Pierce Control Agarose beads (Thermo Fisher Scientific, #26150) for 30 min at 4°C. Beads were collected at 2500g for 2 min and supernatant was then mixed with 40 μL of pre-equilibrated anti-FLAG M2 magnetic beads (Merck, #M8823-1ML) in a new tube and incubated for 1h. Beads were washed 6 times with 1 mL of wash buffer (50 mM Tris pH 7.5, 150 mM NaCl, 0.2% IGEPAL, 0.5 mM EDTA) using a magnet. For elution, beads were resuspended in a mixture of 20 μL of wash buffer and 10 μL 3X FLAG peptide (Merck, #F4799-4MG) and kept for 1 h on ice. The supernatant containing Rev-Flag-Myc was collected and stored. Elution was repeated twice.

Jurkat Flp-In T-REx PURA-EGFP and PURB-EGFP cells were induced with 1 μg/mL of doxycycline overnight. Cells were then infected with 1 MOI of VSV-G-pseudotyped HIV-1R-E-mCherry-Nef for 48 h and then lysed (10 mM Tris HCl pH 7.5, 150 mM NaCl, 0.5mM EDTA, 1% Triton X-100, 2mM MgCl2, 1mM DTT, 0.2 mM AEBSF and 25 U benzonase). Cell lysates were cleared by centrifugation (17000 g, 10 min, 4°C). Lysates were pre-cleared with 40 μl of control agarose beads as above. Supernatants were transferred to a new tube and then incubated with 40 μL of pre-equilibrated GFP_Trap agarose bead slurry (ChromoTek, #gta) for 2 h at 4°C with gentle rotation. Beads were sedimented by centrifugation at 2500g for 1 min at 4°C and washed six times with 500 μL of wash buffer (10 mM Tris HCl pH 7.5, 150 mM NaCl, 0.5 mM EDTA, 0.2% IGEPAL, 0.1 mM AEBSF serine protease inhibitor, 1mM DTT). Two additional wash steps were performed without IGEPAL. Proteins were eluted with 50 μL of 200 mM glycine pH 2.5 for 60 s followed by neutralisation with 5 μL of 1 M Tris base pH 10.4. Elution was repeated twice, and eluates were combined.

PURA/B binding sites on target RNAs by iCLIP2

PURA and PURB binding sites on target RNAs was determined by iCLIP2. The original iCLIP2 protocol47 was used with the following modifications. Jurkat Flp-In T-REx PURA-EGFP and PURB-EGFP cells were induced with 1 μg/mL of doxycycline overnight. Cells were then infected with 1 MOI of VSV-G-pseudotyped HIV-1R-E-mCherry-Nef for 48 h. Next, cells were washed twice in PBS 1X, resuspended in 3 mL of PBS 1X, cross-linked with two rounds of 0.15 J/cm2 UV light irradiation at 254 nm, washed with PBS 1X and lysed in 1 mL RIPA buffer (50 mM Tris pH 7.5, 150 mM NaCl, 1% Triton X-100, 0.1% SDS, 0.5% wt/vol Na deoxycholate and 0.2 mM AEBSF). Lysates were incubated for 30 min on ice and then stored at −80°C until use. Lysates were thawed on ice, sonicated with 3 cycles of 10 s at 4°C (with 15 s pause between pulses) using a Digenonde bioruptor at level M, homogenized by passing through a 27G needle several times, and finally cleared by centrifugation (17000 g for 10 min at 4°C). EGFP signal was measured on a microplate reader to normalize the amount of lysate for each sample.

4 U TurboDNase and 5 U RNase I (Thermo Fisher Scientific, #AM2294) were added, mixed with a vortex and incubated for 3 min at 37°C at 1100 rpm. 200 U RiboLock RNase Inhibitor (Thermo Fisher Scientific, #EO0381) was then added, followed by incubation for 3 min on ice. Lysates were pre-cleared with 25 μl of pre-equilibrated control agarose beads for 30 min at 4°C, followed by centrifugation at 2500g for 2 min. Supernatants were transferred to a new tube and 2 × 10 μL of sample was taken for size-matched input (SMI) processing. Lysates were incubated with 25 μL of pre-equilibrated GFP_Trap agarose bead slurry for 2 h at 4°C with rotation. Beads were washed twice with 900 μL of cold high-salt wash buffer (50 mM Tris-HCl pH 7.4, 1 M NaCl, 1mM EDTA, 1% IGEPAL, 0.1% SDS, 0.5% sodium deoxycholate and 0.2 mM AEBSF), twice with 900 μL of cold medium-salt wash (20 mM Tris HCl pH 7.4, 250 mM NaCl, 0.05% IGEPAL, 1 mM MgCl2 and 0.2 mM AEBSF), and twice with 900 μL of cold PNK wash buffer (20 mM Tris HCl pH 7.4, 10 mM MgCl2 and 0.2% Tween 20). RNA 3′-end was dephosphorylated at 37°C for 40 min with agitation in PNK buffer (70 mM Tris HCl pH 6.5, 10 mM MgCl2 and 1 mM DTT) with 5 U PNK (New England Biolabs, #M0201L), 0.25 U FastAP alkaline phosphatase (Thermo Fisher Scientific, #EF0654), 0.5 U TurboDNase and 20 U RNasin (Promega, #N2111). Beads were washed once with 500 μL cold PNK wash buffer, twice with the same volume of cold high-salt wash buffer and twice with cold PNK wash buffer. 125 nM of L3-IR-App adapter95 was ligated using 30 U T4 RNA ligase I High Concentration (New England Biolabs, #M0437M), 20 U RNasin, 4 U PNK, 22.5% PEG8000 and 5% DMSO in ligation buffer (50 mM Tris HCl pH 7.8, 10 mM MgCl2 and 1 mM DTT) at 16°C overnight with shaking at 1100 rpm in the dark. Beads were washed once with 500 μL PNK wash buffer, twice with the same volume of high-salt wash buffer and twice with PNK wash buffer. IP samples and inputs were denatured in 1X NuPage LDS Sample Buffer (Thermo Fisher Scientific, #NP0007) with 100 mM DTT for 5 min at 70°C, spun down for 2 min at 2500 g and separated on a NuPAGE 4–12% Bis-Tris gel (Thermo Fisher Scientific, #NP0321BOX). Protein-RNA complexes were transferred onto an iBLOT2 nitrocellulose membrane (Thermo Fisher Scientific, #IB23001) for 2 h at 30 V and visualized. The region corresponding to the RBP-EGFP band and above was cut (for both IP and SMI samples) and digested using ∼350 μg Proteinase K (Roche, #3115828001) in 180 μL PK-SDS solution (10 mM Tris HCl pH 7.4, 100 mM NaCl, 1 mM EDTA and 0.2% SDS) for 60 min at 50°C with shaking (1100 rpm). RNA was purified by adding 1X volume of Phenol:Chloroform:Isoamyl Alcohol pH 6.6–6.9 (Sigma-Aldrich, #P3803), incubating for 10 min at 37°C with shaking (1100 rpm) and phase separation in MaxTract tubes by centrifugation at 16000g for 5 min. RNA was cleaned using Zymo RNA Clean & Concentrator-5 (ZYMO Research, #R1013). For SMI library preparation, samples were treated first with 5 U PNK, 0.5 U FastAP and 20 U RNAsin in PNK buffer pH 6.5 for 40 min at 37°C and 1100 rpm. RNA was cleaned up with Dynabeads MyOne Silane (Thermo Fisher Scientific, #37002D). L3-IR-App adapter ligation was performed with 45 U T4 RNA ligase I High Concentration in 1X T4 RNA Ligase Reaction Buffer with 2% DMSO, 27% PEG8000, and 133 nM L3-IR-adapter for 75 min at room temperature, followed by MyOne bead purification. SMI was treated with 25 U 5′ Deadenylase (New England Biolabs, #M0331S) and 15 U RecJf endonuclease (New England Biolabs, #M0264S) in 1X New England Biolabs buffer 2 with 20 U RNAsin, 20% PEG8000 for 1 h at 30°C and then 30 min at 37°C at 1100 rpm, followed by a MyONE clean-up. RNA from IP and SMI samples were reverse transcribed using Superscript IV reverse transcriptase (Thermo Fisher Scientific, #18090010) and hydrolyzed by adding 1.25 μL of 1 M NaOH for 15 min at 85°C before neutralization with 1.25 μL of 1 M HCl. cDNA was purified using MyOne silane beads. L#clip2.0 adapters with barcodes for multiplexing95 were ligated to cDNA by mixing first 2 μL of 10 μM adapter with 5 μL of cDNA and 1 μL of DMSO and incubating at 75°C for 2 min before placing on ice. Then, ligation mix (45 U T4 RNA ligase I High Concentration in 1X RNA ligase buffer with 1 mM ATP and 22.5% PEG8000) was added to the cDNA-bead solution and incubated overnight at 20°C with shaking 1100 rpm. cDNA was cleaned up with MyONE beads before PCR amplification. Pre-amplification was performed using 2X Phusion HF PCR Master mix (New England Biolabs, #M0531L) with P5Solexa_s and P3Solexa_s primers for six cycles, followed by ProNex (Promega, #NG2001) size-selective purification. Optimal qPCR cycles were determined on a CFX96 Touch Real-Time PCR Detection System (Bio-Rad) using EvaGreen (Biotium, #31000), 2X Phusion HF PCR Master mix and P5/P3 Solexa primers. Final PCR products were purified using two consecutive rounds of ProNex Size selection. Libraries were quantified by qPCR using the KAPA Library Quantification DNA standards (Roche, #07960387001) and High Sensitivity DNA kit (Agilent, #5067-4626) in an Agilent 2100 Bioanalyzer instrument. Each group of samples was pooled equimolarly and then mixed at the following proportions: 50% IP library pool, 37.5% SMI library pool, and 12.5% negative control EGFP. Sequencing was performed on a NextSeq 550 sequencer with a 75 cycle High-output kit v2.5 (Illumina, #20024906).

PURA - HIV-1 gRNA interaction analysis

1 × 107 HEK293 Flp-In T-Rex EGFP or PURA-EGFP cells were induced with 1μg/ml Doxycycline for 24 h before transfection with pNL4.3-R-E-mCherry-T2A-Nef and pHEF-VSV-G using X-tremeGENE 9 (Merck, #6365779001). 48 hpt supernatant was precleared at 5000rpm for 5 min before filtering through 0.4μm PEF sterile filter. Virus was precipitated by PEG purification and resuspended in 25mM HEPES in D-PBS. Purified virus was lysed in ice-cold RIPA (50mM Tris pH 7.5, 150 mM NaCl, 0.1% SDS, 0.5% sodium deoxycholate, 1% Triton X-100) supplemented with 50μg/ml AEBSF, 2U/ml TurboDNase, 40U/ml RiboLock and 5mM DTT. Immunoprecipitation was performed with 20μL GFP-TRAP agarose/sample at 4°C for 1h. Samples were washed 3x with ice-cold RIPA buffer supplemented with 5mM DTT and 40U/ml Ribolock. RNA from input and beads were isolated using Trizol following manufacturers’ recommendations. RT-qPCR was performed following manufacturers’ recommendations with NEB Luna Universal One-Step RT-qPCR Kit using HIV-1 genomic primers on a QuantStudio3 Real-Time PCR system. Data was normalised to input and visualised as fold change in R using ggplot2.

Sample preparation for LC-MS/MS

ivRIC and PURA-EGFP and PURB-EGFP IP eluates were processed by single-pot, solid-phase-enhanced sample preparation (SP3).93 Eluates of Rev IP were processed using Filter-Aided Sample Preparation (FASP) protocol.96 Peptides were analyzed in an Ultimate 3000 ultra-HPLC system (Thermo Fisher Scientific) as described before.92 Protein identification and quantification were performed using Andromeda search engine implemented in MaxQuant (1.6.3.4). Mass spectra were searched against human proteome reference (Uniprot_id: UP000005640, downloaded Nov 2016) and a custom HIV-1 (NL4-3) proteome including all known viral proteins. Flag-Myc-tagged Rev and EGFP were also included in the searches of the Rev and PURA/B interactomes, respectively. Search parameters included: full tryptic specificity with maximal two missed cleavage sites, carbamidomethyl (C) set as fixed modification, acetylation (protein N-term) and oxidation (M) set as variable modifications. False discovery rate (FDR) cut-off for peptide identification was set to 1%. For ivRIC and SupT1 whole cell proteome iBAQ and LFQ options were toggled ON. All other settings were set to default.

Mapping and comparing gene IDs

For all datasets analyses, including mass spectrometry data, genes were mapped to Hugo gene nomenclature committee IDs (HGNC) using R package BiomaRt. Any genes that could not be mapped were manually curated by manual addition or were removed if it was a pseudogene or its HGNC entry was missing. For HIV-1 analysis, the HIV-1 NCBI database was downloaded from the NCBI web server and parsed to gene IDs following the same strategy. Upset plots and Euler diagrams were generated using the R packages upsetR and venneuler respectively.

GO and STRING network analyses

General GO terms were extracted using the tool EnrichR83 (https://maayanlab.cloud/Enrichr/) and summarized using REVIGO.84

To generate virus GO term plots, the R package AnnotationDbi was queried with HGNC IDs and resulting GO terms filtered by the following GO categories: ‘viral’, ‘immune’, ‘infection, ’pathogen, ‘immune cell’ and ‘immune molecule’.

Network analyses were performed using the Cytoscape 3.9.1 platform78 with the following add-ons: stringApp79 for STRING protein network analysis and GO enrichment; clusterMaker280 for clustering data using MCODE algorithm81 and DyNet82 for comparison of two networks.

We classified proteins as RBPs if they were identified in at least 3 independent RNA interactome studies based on the EMBL RBPbase (https://rbpbase.shiny.embl.de/) resource.

Quantification and statistical analyses

Statistical analyses and data presentation were performed with Prism v.10 (GraphPad Software) and specific R (The R Foundation) packages as described in the different sections of STAR Methods. Statistical details can be found in figure legends and figures, including the value of n (biological replicates; at least three), means and error bars (standard deviation of all the replicates). Functional data (Figures 4 and S7) was subjected to unpaired t test with Welch correction, or one-way or two-way ANOVA followed by Dunnett’s correction, as appropriate; differences were considered significant if p < 0.05. Proteomics and iCLIP2 data was analyzed as indicated below.

Quantitative analysis of proteomics data

The proteinGroup files of MaxQuant search results were imported in RStudio (R Project) for further processing. Protein intensities were log 2 transformed. In each dataset proteins with less than 2 valid intensity measurements across experimental conditions were removed prior to downstream analysis. Batch effects in each comparison were assessed using principal component analysis (PCA). Normalisation was performed to ivRIC inputs ad PPI analyses using variance stabilisation normalisation (vsn) method.97 Missing values were imputed with deterministic minimum method (R package version 2.0. https://CRAN.R-project.org/package=imputeLCMD) using 1% quantile of global intensities. All missing values were imputed in ivRIC and Rev interactome data. For PURA/PURB interactome data, only proteins with all values missing in one condition were imputed as described before.98 Linear modeling and Bayesian-model-based moderated t test was performed using the R-package limma.69 Batch effects in ivRIC eluates and Rev and PURA/B interactomes were accounted for by incorporation in modeling using “block” argument provided in limma. p values obtained in the moderated t test were adjusted to account for multiple testing using Benjamini-Hochberg method.

iCLIP2 data processing

The raw FASTQ files were demultiplexed according to the sample barcode using Je Suite (Girardot et al., 2016) and adapter trimmed with Cutadapt (Martin, 2011). Trimmed reads were aligned to a concatenated human (GRCh38, ENSEMBL Release 104) and HIV-1 NL4.3 genome using STAR with end-to-end alignment mode.70 Only uniquely aligned reads were considered for the downstream analysis. PCR duplicated reads were collapsed using unique molecular identifiers (UMIs) attached to the read header with the Je Suite. The GRCh38 and HIV-1 genomic annotations were pre-processed to generate sliding windows (50nt window, 20nt step size) using HTSeq-clip.72 Cross-link truncation sites (position −1 relative to the 5′ end of the read start) were extracted using BEDTools73 and quantified against the sliding windows using HTSeq-clip. For peak calling, a R/Bioconductor package DEW-Seq was used to identify significantly enriched sliding windows in PURA/PURB immunoprecipitated samples over the corresponding size-matched input control samples (log2FoldChange >2 and p.adj <0.01).72 The Independent Hypothesis Weighting (IHW) method was used for the multiple hypothesis correction.99 To remove background signal resulting from non-specific binding of RNA to GFP, significantly enriched sliding windows (log2FoldChange >2 and p.adj <0.01) from GFP-immunoprecipitated control samples were removed. Overlapping significant sliding windows were merged to binding regions, and these sites were curated to 8nt long binding sites based on peak width and maxima. Binding sites were queried against the genome annotation (ENSEMBL release 104) using the GenomicRanges R package to extract overlaps with genes and transcript features.74 The overlap information was used to construct the meta-transcript coverage of binding sites and the heatmap distribution of PURA/PURB binding in the proximity of start and stop codons.

Given the repetitive motifs present in 5′ and 3′ of HIV-1 genome, we used a custom transcriptome to identify binding motifs at the 5′ and 3′ untranslated regions (UTRs). In brief, we superimposed the LTR-1 and LTR-2 sequences flanked by 100bp of protein-coding sequences from HIV-1 open reading frames at both 5′ and 3′ ends. HIV-1 reads were initially filtered using BBMap with kmer = 25 mode (https://sourceforge.net/projects/bbmap/), then mapped to the custom transcriptome using STAR. Peak-calling was performed as described above using DEW-Seq except for using smaller sliding windows (10nt window, 2nt step size) and a more lenient threshold (log2FoldChange >1.5 and p.adj <0.01).

Sequences for motif enrichment analysis were defined for each binding site as a 70-nucleotide region, centered on the peak in BigWig signal. For each binding site, a gene and gene region matched background sequence were defined to allow for differential enrichment. Enrichment analysis was performed using HOMER. Motifs were processed and plotted using the R packages universalmotif and ggseqlogo.

Published: April 7, 2026

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.celrep.2026.117090.

Supplemental information

Document S1. Figures S1–S10
mmc1.pdf (5.2MB, pdf)
Table S1. Defining the HIV-1 ivRNP

This table reports the proteomic results of the ivRIC analysis focusing on the comparison between the eluates from experimental samples and controls.

mmc2.xlsx (48.4KB, xlsx)
Table S2. The viral particle and whole CD4+ T cell proteome

This table reports the proteomic results of the ivRIC inputs and the whole cells and their comparison between them and with the ivRIC eluates.

mmc3.xlsx (497.3KB, xlsx)
Table S3. The Rev protein-protein interactome

This table reports the proteomic results of the Rev-Flag-Myc immunoprecipitation and controls using anti-Flag antibody and Flag.

mmc4.xlsx (147.8KB, xlsx)
Table S4. The PURA and PURB protein-protein interactome

This table reports the proteomic results of the immunoprecipitation in HIV-1 infected and uninfected cells of PURA-EGFP, PURB-EGFP and unfused EGFP using the GFP_Trap.

mmc5.xlsx (357.9KB, xlsx)
Table S5. The PURA and PURB iCLIP

This table reports the RNA sequencing results of the iCLIP2 analysis of PURA-EGFP and PURB-EGFP in mock and HIV-1 infected cells.

mmc6.xlsx (21.1MB, xlsx)
Table S6. Key resources

List and details of antibodies, primers and synthesized cDNAs used in this study.

mmc7.xlsx (30.1KB, xlsx)
Document S2. Article plus supplemental information
mmc8.pdf (20.3MB, pdf)

References

  • 1.Nchioua R., Bosso M., Kmiec D., Kirchhoff F. Cellular Factors Targeting HIV-1 Transcription and Viral RNA Transcripts. Viruses. 2020;12 doi: 10.3390/v12050495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Truman C.T.S., Järvelin A., Davis I., Castello A. HIV Rev-isited. Open Biol. 2020;10 doi: 10.1098/rsob.200320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kohler A., Hurt E. Exporting RNA from the nucleus to the cytoplasm. Nat. Rev. Mol. Cell Biol. 2007;8:761–773. doi: 10.1038/nrm2255. [DOI] [PubMed] [Google Scholar]
  • 4.Rausch J.W., Le Grice S.F.J. HIV Rev Assembly on the Rev Response Element (RRE): A Structural Perspective. Viruses. 2015;7:3053–3075. doi: 10.3390/v7062760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ajamian L., Abel K., Rao S., Vyboh K., García-de-Gracia F., Soto-Rifo R., Kulozik A.E., Gehring N.H., Mouland A.J. HIV-1 Recruits UPF1 but Excludes UPF2 to Promote Nucleocytoplasmic Export of the Genomic RNA. Biomolecules. 2015;5:2808–2839. doi: 10.3390/biom5042808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Frohlich A., Rojas-Araya B., Pereira-Montecinos C., Dellarossa A., Toro-Ascuy D., Prades-Pérez Y., García-de-Gracia F., Garcés-Alday A., Rubilar P.S., Valiente-Echeverría F., et al. DEAD-box RNA helicase DDX3 connects CRM1-dependent nuclear export and translation of the HIV-1 unspliced mRNA through its N-terminal domain. Biochim. Biophys. Acta. 2016;1859:719–730. doi: 10.1016/j.bbagrm.2016.03.009. [DOI] [PubMed] [Google Scholar]
  • 7.Toro-Ascuy D., Rojas-Araya B., Valiente-Echeverria F., Soto-Rifo R. Interactions between the HIV-1 Unspliced mRNA and Host mRNA Decay Machineries. Viruses. 2016;8:320. doi: 10.3390/v8110320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.de Breyne S., Ohlmann T. Focus on Translation Initiation of the HIV-1 mRNAs. Int. J. Mol. Sci. 2018;20 doi: 10.3390/ijms20010101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Dicker K., Järvelin A.I., Garcia-Moreno M., Castello A. The importance of virion-incorporated cellular RNA-Binding Proteins in viral particle assembly and infectivity. Semin. Cell Dev. Biol. 2021;111:108–118. doi: 10.1016/j.semcdb.2020.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Knoener R., Evans E., 3rd, Becker J.T., Scalf M., Benner B., Sherer N.M., Smith L.M. Identification of host proteins differentially associated with HIV-1 RNA splice variants. eLife. 2021;10 doi: 10.7554/eLife.62470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Muller T.G., Zila V., Muller B., Krausslich H.G. Nuclear Capsid Uncoating and Reverse Transcription of HIV-1. Annu. Rev. Virol. 2022;9:261–284. doi: 10.1146/annurev-virology-020922-110929. [DOI] [PubMed] [Google Scholar]
  • 12.Zila V., Margiotta E., Turoňová B., Müller T.G., Zimmerli C.E., Mattei S., Allegretti M., Börner K., Rada J., Müller B., et al. Cone-shaped HIV-1 capsids are transported through intact nuclear pores. Cell. 2021;184:1032–1046.e18. doi: 10.1016/j.cell.2021.01.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kreysing J.P., Heidari M., Zila V., Cruz-León S., Obarska-Kosinska A., Laketa V., Rohleder L., Welsch S., Köfinger J., Turoňová B., et al. Passage of the HIV capsid cracks the nuclear pore. Cell. 2025;188:930–943.e21. doi: 10.1016/j.cell.2024.12.008. [DOI] [PubMed] [Google Scholar]
  • 14.Hou Z., Shen Y., Fronik S., Shen J., Shi J., Xu J., Chen L., Hardenbrook N., Engelman A.N., Aiken C., Zhang P. HIV-1 nuclear import is selective and depends on both capsid elasticity and nuclear pore adaptability. Nat. Microbiol. 2025;10:1868–1885. doi: 10.1038/s41564-025-02054-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Jacques D.A., McEwan W.A., Hilditch L., Price A.J., Towers G.J., James L.C. HIV-1 uses dynamic capsid pores to import nucleotides and fuel encapsidated DNA synthesis. Nature. 2016;536:349–353. doi: 10.1038/nature19098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.James L.C. The HIV-1 Capsid: More than Just a Delivery Package. Adv. Exp. Med. Biol. 2019;1215:69–83. doi: 10.1007/978-3-030-14741-9_4. [DOI] [PubMed] [Google Scholar]
  • 17.Chertova E., Chertov O., Coren L.V., Roser J.D., Trubey C.M., Bess J.W., Jr., Sowder R.C., 2nd, Barsov E., Hood B.L., Fisher R.J., et al. Proteomic and biochemical analysis of purified human immunodeficiency virus type 1 produced from infected monocyte-derived macrophages. J. Virol. 2006;80:9039–9052. doi: 10.1128/JVI.01013-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Bregnard C., Zamborlini A., Leduc M., Chafey P., Camoin L., Saïb A., Benichou S., Danos O., Basmaciogullari S. Comparative proteomic analysis of HIV-1 particles reveals a role for Ezrin and EHD4 in the Nef-dependent increase of virus infectivity. J. Virol. 2013;87:3729–3740. doi: 10.1128/JVI.02477-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Saphire A.C.S., Gallay P.A., Bark S.J. Proteomic analysis of human immunodeficiency virus using liquid chromatography/tandem mass spectrometry effectively distinguishes specific incorporated host proteins. J. Proteome Res. 2006;5:530–538. doi: 10.1021/pr050276b. [DOI] [PubMed] [Google Scholar]
  • 20.Denard J., Rundwasser S., Laroudie N., Gonnet F., Naldini L., Radrizzani M., Galy A., Merten O.W., Danos O., Svinartchouk F. Quantitative proteomic analysis of lentiviral vectors using 2-DE. Proteomics. 2009;9:3666–3676. doi: 10.1002/pmic.200800747. [DOI] [PubMed] [Google Scholar]
  • 21.Santos S., Obukhov Y., Nekhai S., Bukrinsky M., Iordanskiy S. Virus-producing cells determine the host protein profiles of HIV-1 virion cores. Retrovirology. 2012;9:65. doi: 10.1186/1742-4690-9-65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kessl J.J., Kutluay S.B., Townsend D., Rebensburg S., Slaughter A., Larue R.C., Shkriabai N., Bakouche N., Fuchs J.R., Bieniasz P.D., Kvaratskhelia M. HIV-1 Integrase Binds the Viral RNA Genome and Is Essential during Virion Morphogenesis. Cell. 2016;166:1257–1268.e12. doi: 10.1016/j.cell.2016.07.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Telesnitsky A., Wolin S.L. The Host RNAs in Retroviral Particles. Viruses. 2016;8 doi: 10.3390/v8080235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kutluay S.B., Zang T., Blanco-Melo D., Powell C., Jannain D., Errando M., Bieniasz P.D. Global changes in the RNA binding specificity of HIV-1 gag regulate virion genesis. Cell. 2014;159:1096–1109. doi: 10.1016/j.cell.2014.09.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Simonova A., Svojanovská B., Trylčová J., Hubálek M., Moravčík O., Zavřel M., Pávová M., Hodek J., Weber J., Cvačka J., et al. LC/MS analysis and deep sequencing reveal the accurate RNA composition in the HIV-1 virion. Sci. Rep. 2019;9:8697. doi: 10.1038/s41598-019-45079-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Abudu A., Wang X., Dang Y., Zhou T., Xiang S.H., Zheng Y.H. Identification of molecular determinants from Moloney leukemia virus 10 homolog (MOV10) protein for virion packaging and anti-HIV-1 activity. J. Biol. Chem. 2012;287:1220–1228. doi: 10.1074/jbc.M111.309831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Furtak V., Mulky A., Rawlings S.A., Kozhaya L., Lee K., Kewalramani V.N., Unutmaz D. Perturbation of the P-body component Mov10 inhibits HIV-1 infectivity. PLoS One. 2010;5 doi: 10.1371/journal.pone.0009081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sette P., Dussupt V., Bouamr F. Identification of the HIV-1 NC binding interface in Alix Bro1 reveals a role for RNA. J. Virol. 2012;86:11608–11615. doi: 10.1128/JVI.01260-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Popov S., Popova E., Inoue M., Wu Y., Göttlinger H. HIV-1 gag recruits PACSIN2 to promote virus spreading. Proc. Natl. Acad. Sci. USA. 2018;115:7093–7098. doi: 10.1073/pnas.1801849115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Strasner A.B., Natarajan M., Doman T., Key D., August A., Henderson A.J. The Src kinase Lck facilitates assembly of HIV-1 at the plasma membrane. J. Immunol. 2008;181:3706–3713. doi: 10.4049/jimmunol.181.5.3706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Boeras I., Song Z., Moran A., Franklin J., Brown W.C., Johnson M., Boris-Lawrie K., Heng X. DHX9/RHA Binding to the PBS-Segment of the Genomic RNA during HIV-1 Assembly Bolsters Virion Infectivity. J. Mol. Biol. 2016;428:2418–2429. doi: 10.1016/j.jmb.2016.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Brady S., Singh G., Bolinger C., Song Z., Boeras I., Weng K., Trent B., Brown W.C., Singh K., Boris-Lawrie K., Heng X. Virion-associated, host-derived DHX9/RNA helicase A enhances the processivity of HIV-1 reverse transcriptase on genomic RNA. J. Biol. Chem. 2019;294:11473–11485. doi: 10.1074/jbc.RA119.007679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Li D., Wei T., Rawle D.J., Qin F., Wang R., Soares D.C., Jin H., Sivakumaran H., Lin M.H., Spann K., et al. Specific Interaction between eEF1A and HIV RT Is Critical for HIV-1 Reverse Transcription and a Potential Anti-HIV Target. PLoS Pathog. 2015;11 doi: 10.1371/journal.ppat.1005289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Serquina A.K., Das S.R., Popova E., Ojelabi O.A., Roy C.K., Göttlinger H.G. UPF1 is crucial for the infectivity of human immunodeficiency virus type 1 progeny virions. J. Virol. 2013;87:8853–8861. doi: 10.1128/JVI.00925-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Barrero-Villar M., Cabrero J.R., Gordón-Alonso M., Barroso-González J., Alvarez-Losada S., Muñoz-Fernández M.A., Sánchez-Madrid F., Valenzuela-Fernández A. Moesin is required for HIV-1-induced CD4-CXCR4 interaction, F-actin redistribution, membrane fusion and viral infection in lymphocytes. J. Cell Sci. 2009;122:103–113. doi: 10.1242/jcs.035873. [DOI] [PubMed] [Google Scholar]
  • 36.Burdick R., Smith J.L., Chaipan C., Friew Y., Chen J., Venkatachari N.J., Delviks-Frankenberry K.A., Hu W.S., Pathak V.K. P body-associated protein Mov10 inhibits HIV-1 replication at multiple stages. J. Virol. 2010;84:10241–10253. doi: 10.1128/JVI.00585-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Naji S., Ambrus G., Cimermančič P., Reyes J.R., Johnson J.R., Filbrandt R., Huber M.D., Vesely P., Krogan N.J., Yates J.R., 3rd, et al. Host cell interactome of HIV-1 Rev includes RNA helicases involved in multiple facets of virus production. Mol. Cell. Proteomics. 2012;11 doi: 10.1074/mcp.M111.015313. M111 015313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Arizala J.A.C., Takahashi M., Burnett J.C., Ouellet D.L., Li H., Rossi J.J. Nucleolar Localization of HIV-1 Rev Is Required, Yet Insufficient for Production of Infectious Viral Particles. AIDS Res. Hum. Retroviruses. 2018;34:961–981. doi: 10.1089/AID.2017.0306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jager S., Kim D.Y., Hultquist J.F., Shindo K., LaRue R.S., Kwon E., Li M., Anderson B.D., Yen L., Stanley D., et al. Vif hijacks CBF-beta to degrade APOBEC3G and promote HIV-1 infection. Nature. 2011;481:371–375. doi: 10.1038/nature10693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Cochrane A.W., Perkins A., Rosen C.A. Identification of sequences important in the nucleolar localization of human immunodeficiency virus Rev: relevance of nucleolar localization to function. J. Virol. 1990;64:881–885. doi: 10.1128/JVI.64.2.881-885.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Engeland C.E., Brown N.P., Börner K., Schümann M., Krause E., Kaderali L., Müller G.A., Kräusslich H.G. Proteome analysis of the HIV-1 Gag interactome. Virology. 2014;460–461:194–206. doi: 10.1016/j.virol.2014.04.038. [DOI] [PubMed] [Google Scholar]
  • 42.Milev M.P., Ravichandran M., Khan M.F., Schriemer D.C., Mouland A.J. Characterization of staufen1 ribonucleoproteins by mass spectrometry and biochemical analyses reveal the presence of diverse host proteins associated with human immunodeficiency virus type 1. Front. Microbiol. 2012;3:367. doi: 10.3389/fmicb.2012.00367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Knoener R.A., Becker J.T., Scalf M., Sherer N.M., Smith L.M. Elucidating the in vivo interactome of HIV-1 RNA by hybridization capture and mass spectrometry. Sci. Rep. 2017;7 doi: 10.1038/s41598-017-16793-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chepenik L.G., Tretiakova A.P., Krachmarov C.P., Johnson E.M., Khalili K. The single-stranded DNA binding protein, Pur-alpha, binds HIV-1 TAR RNA and activates HIV-1 transcription. Gene. 1998;210:37–44. doi: 10.1016/s0378-1119(98)00033-x. [DOI] [PubMed] [Google Scholar]
  • 45.Wortman M.J., Krachmarov C.P., Kim J.H., Gordon R.G., Chepenik L.G., Brady J.N., Gallia G.L., Khalili K., Johnson E.M. Interaction of HIV-1 Tat with Puralpha in nuclei of human glial cells: characterization of RNA-mediated protein-protein binding. J. Cell. Biochem. 2000;77:65–74. doi: 10.1002/(sici)1097-4644(20000401)77:1<65::aid-jcb7>3.0.co;2-u. [DOI] [PubMed] [Google Scholar]
  • 46.Watts J.M., Dang K.K., Gorelick R.J., Leonard C.W., Bess J.W., Jr., Swanstrom R., Burch C.L., Weeks K.M. Architecture and secondary structure of an entire HIV-1 RNA genome. Nature. 2009;460:711–716. doi: 10.1038/nature08237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Buchbender A., Mutter H., Sutandy F.X.R., Körtel N., Hänel H., Busch A., Ebersberger S., König J. Improved library preparation with the new iCLIP2 protocol. Methods. 2020;178:33–48. doi: 10.1016/j.ymeth.2019.10.003. [DOI] [PubMed] [Google Scholar]
  • 48.Molitor L., Klostermann M., Bacher S., Merl-Pham J., Spranger N., Burczyk S., Ketteler C., Rusha E., Tews D., Pertek A., et al. Depletion of the RNA-binding protein PURA triggers changes in posttranscriptional gene regulation and loss of P-bodies. Nucleic Acids Res. 2023;51:1297–1316. doi: 10.1093/nar/gkac1237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Sibley C.R., Blazquez L., Ule J. Lessons from non-canonical splicing. Nat. Rev. Genet. 2016;17:407–421. doi: 10.1038/nrg.2016.46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Bishop K.N., Verma M., Kim E.Y., Wolinsky S.M., Malim M.H. APOBEC3G inhibits elongation of HIV-1 reverse transcripts. PLoS Pathog. 2008;4 doi: 10.1371/journal.ppat.1000231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Ding P., Kharytonchyk S., Kuo N., Cannistraci E., Flores H., Chaudhary R., Sarkar M., Dong X., Telesnitsky A., Summers M.F. 5'-Cap sequestration is an essential determinant of HIV-1 genome packaging. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2112475118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Brown J.D., Kharytonchyk S., Chaudry I., Iyer A.S., Carter H., Becker G., Desai Y., Glang L., Choi S.H., Singh K., et al. Structural basis for transcriptional start site control of HIV-1 RNA fate. Science. 2020;368:413–417. doi: 10.1126/science.aaz7959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Rawson J.M.O., Nikolaitchik O.A., Shakya S., Keele B.F., Pathak V.K., Hu W.S. Transcription Start Site Heterogeneity and Preferential Packaging of Specific Full-Length RNA Species Are Conserved Features of Primate Lentiviruses. Microbiol. Spectr. 2022;10 doi: 10.1128/spectrum.01053-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Yasin S., Lesko S.L., Kharytonchyk S., Brown J.D., Chaudry I., Geleta S.A., Tadzong N.F., Zheng M.Y., Patel H.B., Kengni G., et al. Role of RNA structural plasticity in modulating HIV-1 genome packaging and translation. Proc. Natl. Acad. Sci. USA. 2024;121 doi: 10.1073/pnas.2407400121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Yoshida T., Kasuya Y., Yamamoto H., Kawai G., Hanaki K.I., Matano T., Masuda T. HIV-1 RNAs whose transcription initiates from the third deoxyguanosine of GGG tract in the 5' long terminal repeat serve as a dominant genome for efficient provirus DNA formation. J. Virol. 2024;98 doi: 10.1128/jvi.01825-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Gehring N.H., Lamprinaki S., Kulozik A.E., Hentze M.W. Disassembly of exon junction complexes by PYM. Cell. 2009;137:536–548. doi: 10.1016/j.cell.2009.02.042. [DOI] [PubMed] [Google Scholar]
  • 57.Gebauer F., Hentze M.W. Molecular mechanisms of translational control. Nat. Rev. Mol. Cell Biol. 2004;5:827–835. doi: 10.1038/nrm1488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Molitor L., Bacher S., Burczyk S., Niessing D. The Molecular Function of PURA and Its Implications in Neurological Diseases. Front. Genet. 2021;12 doi: 10.3389/fgene.2021.638217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Iselin L., Palmalux N., Kamel W., Simmonds P., Mohammed S., Castello A. Uncovering viral RNA-host cell interactions on a proteome-wide scale. Trends Biochem. Sci. 2022;47:23–38. doi: 10.1016/j.tibs.2021.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Castello A., Álvarez L., Kamel W., Iselin L., Hennig J. Exploring the expanding universe of host-virus interactions mediated by viral RNA. Mol. Cell. 2024;84:3706–3721. doi: 10.1016/j.molcel.2024.08.027. [DOI] [PubMed] [Google Scholar]
  • 61.Yeh S.C., Diosa-Toro M., Tan W.L., Rachenne F., Hain A., Yeo C.P.X., Bribes I., Xiang B.W.W., Sathiamoorthy Kannan G., Manuel M.C., et al. Characterization of dengue virus 3’UTR RNA binding proteins in mosquitoes reveals that AeStaufen reduces subgenomic flaviviral RNA in saliva. PLoS Pathog. 2022;18 doi: 10.1371/journal.ppat.1010427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Garcia-Moreno M., Noerenberg M., Ni S., Järvelin A.I., González-Almela E., Lenz C.E., Bach-Pages M., Cox V., Avolio R., Davis T., et al. System-wide Profiling of RNA-Binding Proteins Uncovers Key Regulators of Virus Infection. Mol. Cell. 2019;74:196–211.e11. doi: 10.1016/j.molcel.2019.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Castello A., Fischer B., Eichelbaum K., Horos R., Beckmann B.M., Strein C., Davey N.E., Humphreys D.T., Preiss T., Steinmetz L.M., et al. Insights into RNA biology from an atlas of mammalian mRNA-binding proteins. Cell. 2012;149:1393–1406. doi: 10.1016/j.cell.2012.04.031. [DOI] [PubMed] [Google Scholar]
  • 64.Mangeot P.E., Guiguettaz L., Sohier T.J.M., Ricci E.P. Delivery of the Cas9/sgRNA Ribonucleoprotein Complex in Immortalized and Primary Cells via Virus-like Particles (“Nanoblades”) J. Vis. Exp. 2021 doi: 10.3791/62245. [DOI] [PubMed] [Google Scholar]
  • 65.Durinck S., Spellman P., Birney E., Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nature Protocols. 2009;4:1184–1191. doi: 10.1038/nprot.2009.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Durinck S., Moreau Y., Kasprzyk A., Davis S., De Moor B., Brazma A., Huber W. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21:3439–3440. doi: 10.1093/bioinformatics/bti525. [DOI] [PubMed] [Google Scholar]
  • 67.Conway J.R., Lex A., Gehlenborg N. UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics. 2017;33:2938–2940. doi: 10.1093/bioinformatics/btx364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Imbert A., Ouyang W., Safieddine A., Coleno E., Zimmer C., Bertrand E., Walter T., Mueller F. FISH-quant v2: a scalable and modular tool for smFISH image analysis. Rna. 2022;28:786–795. doi: 10.1261/rna.079073.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Ritchie M.E., Phipson B., Wu D., Hu Y., Law C.W., Shi W., Smyth G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Dobin A., Davis C.A., Schlesinger F., Drenkow J., Zaleski C., Jha S., Batut P., Chaisson M., Gingeras T.R. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Girardot C., Scholtalbers J., Sauer S., Su S.Y., Furlong E.E. Je, a versatile suite to handle multiplexed NGS libraries with unique molecular identifiers. BMC Bioinformatics. 2016;17:419. doi: 10.1186/s12859-016-1284-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Sahadevan S., Sekaran T., Schwarzl T. A Pipeline for Analyzing eCLIP and iCLIP Data with Htseq-clip and DEWSeq. Methods Mol. Bio.l. 2022;2404:189–205. doi: 10.1007/978-1-0716-1851-6_10. [DOI] [PubMed] [Google Scholar]
  • 73.Quinlan A.R., Hall I.M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Lawrence M., Huber W., Pages H., Aboyoun P., Carlson M., Gentleman R., Morgan M.T., Carey V.J. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 2013;9 doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Heinz S., Benner C., Spann N., Bertolino E., Lin Y.C., Laslo P., Cheng J.X., Murre C., Singh H., Glass C.K. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Tremblay B.J.M. An R package for biological motif analysis. J. Open Source Softw. 2024;9:7012. doi: 10.21105/joss.07012. [DOI] [Google Scholar]
  • 77.Wickham H., Navarro D., Pedersen T.L. Springer-Verlag; 2016. ggplot2: Elegant Graphics for Data Analysis. [Google Scholar]
  • 78.Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D., Amin N., Schwikowski B., Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Doncheva N.T., Morris J.H., Gorodkin J., Jensen L.J. Cytoscape StringApp: Network Analysis and Visualization of Proteomics Data. J. Proteome Res. 2019;18:623–632. doi: 10.1021/acs.jproteome.8b00702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Morris J.H., Apeltsin L., Newman A.M., Baumbach J., Wittkop T., Su G., Bader G.D., Ferrin T.E. clusterMaker: a multi-algorithm clustering plugin for Cytoscape. BMC Bioinformatics. 2011;12:436. doi: 10.1186/1471-2105-12-436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Bader G.D., Hogue C.W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics. 2003;4:2. doi: 10.1186/1471-2105-4-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Goenawan I.H., Bryan K., Lynn D.J. DyNet: visualization and analysis of dynamic molecular interaction networks. Bioinformatics. 2016;32:2713–2715. doi: 10.1093/bioinformatics/btw187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Xie Z., Bailey A., Kuleshov M.V., Clarke D.J.B., Evangelista J.E., Jenkins S.L., Lachmann A., Wojciechowicz M.L., Kropiwnicki E., Jagodnik K.M., et al. Gene Set Knowledge Discovery with Enrichr. Curr. Protoc. 2021;1:e90. doi: 10.1002/cpz1.90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Supek F., Bosnjak M., Skunca N., Smuc T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One. 2011;6 doi: 10.1371/journal.pone.0021800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Brinkman E.K., Chen T., Amendola M., van Steensel B. Easy quantitative assessment of genome editing by sequence trace decomposition. Nucleic Acids Res. 2014;42:e168. doi: 10.1093/nar/gku936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Edmonds T.G., Ding H., Yuan X., Wei Q., Smith K.S., Conway J.A., Wieczorek L., Brown B., Polonis V., West J.T., et al. Replication competent molecular clones of HIV-1 expressing Renilla luciferase facilitate the analysis of antibody inhibition in PBMC. Virology. 2010;408:1–13. doi: 10.1016/j.virol.2010.08.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Mangeot P.E., Risson V., Fusil F., Marnef A., Laurent E., Blin J., Mournetas V., Massourides E., Sohier T.J.M., Corbin A., et al. Genome editing in primary cells and in vivo using viral-derived Nanoblades loaded with Cas9-sgRNA ribonucleoproteins. Nat. Commun. 2019;10:45. doi: 10.1038/s41467-018-07845-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Kutner R.H., Zhang X.Y., Reiser J. Production, concentration and titration of pseudotyped HIV-1-based lentiviral vectors. Nat. Protoc. 2009;4:495–505. doi: 10.1038/nprot.2009.22. [DOI] [PubMed] [Google Scholar]
  • 89.Jiang W., Hua R., Wei M., Li C., Qiu Z., Yang X., Zhang C. An optimized method for high-titer lentivirus preparations without ultracentrifugation. Sci. Rep. 2015;5:13875. doi: 10.1038/srep13875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Farrell R.E.J. RNA Methodologies: Laboratory Guide for Isolation and Characterization. 5th edition. Academic Press - Elsevier; 2017. [Google Scholar]
  • 91.Mbisa J.L., Delviks-Frankenberry K.A., Thomas J.A., Gorelick R.J., Pathak V.K. Real-time PCR analysis of HIV-1 replication post-entry events. Methods Mol. Biol. 2009;485:55–72. doi: 10.1007/978-1-59745-170-3_5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Castello A., Horos R., Strein C., Fischer B., Eichelbaum K., Steinmetz L.M., Krijgsveld J., Hentze M.W. System-wide identification of RNA-binding proteins by interactome capture. Nat. Protoc. 2013;8:491–500. doi: 10.1038/nprot.2013.020. [DOI] [PubMed] [Google Scholar]
  • 93.Perez-Perri J.I., Noerenberg M., Kamel W., Lenz C.E., Mohammed S., Hentze M.W., Castello A. Global analysis of RNA-binding protein dynamics by comparative and enhanced RNA interactome capture. Nat. Protoc. 2021;16:27–60. doi: 10.1038/s41596-020-00404-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Yang C.P., Samuels T.J., Huang Y., Yang L., Ish-Horowicz D., Davis I., Lee T. Imp and Syp RNA-binding proteins govern decommissioning of Drosophila neural stem cells. Development. 2017;144:3454–3464. doi: 10.1242/dev.149500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Zarnegar B.J., Flynn R.A., Shen Y., Do B.T., Chang H.Y., Khavari P.A. irCLIP platform for efficient characterization of protein-RNA interactions. Nat. Methods. 2016;13:489–492. doi: 10.1038/nmeth.3840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Wisniewski J.R., Zougman A., Nagaraj N., Mann M. Universal sample preparation method for proteome analysis. Nat. Methods. 2009;6:359–362. doi: 10.1038/nmeth.1322. [DOI] [PubMed] [Google Scholar]
  • 97.Huber W., von Heydebreck A., Sultmann H., Poustka A., Vingron M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics. 2002;18(Suppl 1):S96–S104. doi: 10.1093/bioinformatics/18.suppl_1.s96. [DOI] [PubMed] [Google Scholar]
  • 98.Kamel W., Ruscica V., Embarc-Buh A., de Laurent Z.R., Garcia-Moreno M., Demyanenko Y., Orton R.J., Noerenberg M., Madhusudhan M., Iselin L., et al. Alphavirus infection triggers selective cytoplasmic translocation of nuclear RBPs with moonlighting antiviral roles. Mol. Cell. 2024;84 doi: 10.1016/j.molcel.2024.11.015. [DOI] [PubMed] [Google Scholar]
  • 99.Ignatiadis N., Klaus B., Zaugg J.B., Huber W. Data-driven hypothesis weighting increases detection power in genome-scale multiple testing. Nat. Methods. 2016;13:577–580. doi: 10.1038/nmeth.3885. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S10
mmc1.pdf (5.2MB, pdf)
Table S1. Defining the HIV-1 ivRNP

This table reports the proteomic results of the ivRIC analysis focusing on the comparison between the eluates from experimental samples and controls.

mmc2.xlsx (48.4KB, xlsx)
Table S2. The viral particle and whole CD4+ T cell proteome

This table reports the proteomic results of the ivRIC inputs and the whole cells and their comparison between them and with the ivRIC eluates.

mmc3.xlsx (497.3KB, xlsx)
Table S3. The Rev protein-protein interactome

This table reports the proteomic results of the Rev-Flag-Myc immunoprecipitation and controls using anti-Flag antibody and Flag.

mmc4.xlsx (147.8KB, xlsx)
Table S4. The PURA and PURB protein-protein interactome

This table reports the proteomic results of the immunoprecipitation in HIV-1 infected and uninfected cells of PURA-EGFP, PURB-EGFP and unfused EGFP using the GFP_Trap.

mmc5.xlsx (357.9KB, xlsx)
Table S5. The PURA and PURB iCLIP

This table reports the RNA sequencing results of the iCLIP2 analysis of PURA-EGFP and PURB-EGFP in mock and HIV-1 infected cells.

mmc6.xlsx (21.1MB, xlsx)
Table S6. Key resources

List and details of antibodies, primers and synthesized cDNAs used in this study.

mmc7.xlsx (30.1KB, xlsx)
Document S2. Article plus supplemental information
mmc8.pdf (20.3MB, pdf)

Data Availability Statement

  • All proteomics data generated in this study have been deposited in the PRIDE database under accession number PRIDE: PXD042829. iCLIP2 RNA sequencing data have been deposited in the Gene Expression Omnibus (GEO) under accession number GEO: GSE262435. RNA sequencing data from purified HIV-1 particles have been deposited in the European Nucleotide Archive (ENA) under accession number ENA: PRJEB105389.

  • This study did not generate original code.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon reasonable request.

RESOURCES