Skip to main content
Communications Biology logoLink to Communications Biology
. 2022 Nov 27;5:1302. doi: 10.1038/s42003-022-04253-4

Single-cell RNA-sequencing data analysis reveals a highly correlated triphasic transcriptional response to SARS-CoV-2 infection

Pablo A Gutiérrez 1,2,, Santiago F Elena 2,3
PMCID: PMC9701238  PMID: 36435849

Abstract

Single-cell RNA sequencing (scRNA-seq) is currently one of the most powerful techniques available to study the transcriptional response of thousands of cells to an external perturbation. Here, we perform a pseudotime analysis of SARS-CoV-2 infection using publicly available scRNA-seq data from human bronchial epithelial cells and colon and ileum organoids. Our results reveal that, for most genes, the transcriptional response to SARS-CoV-2 infection follows a non-linear pattern characterized by an initial and a final down-regulatory phase separated by an intermediate up-regulatory stage. A correlation analysis of transcriptional profiles suggests a common mechanism regulating the mRNA levels of most genes. Interestingly, genes encoded in the mitochondria or involved in translation exhibited distinct pseudotime profiles. To explain our results, we propose a simple model where nuclear export inhibition of nsp1-sensitive transcripts will be sufficient to explain the transcriptional shutdown of SARS-CoV-2 infected cells.

Subject terms: Systems virology, SARS-CoV-2


A pseudotime analysis of the host cell response to SARS-CoV-2 infection using publicly available scRNA-seq data derived from in vitro studies reveals a non-lineal, triphasic transcriptional response and suggests a global regulatory mechanism.

Introduction

Since the beginning of the SARS-CoV-2 pandemic in 2019, an unprecedented global research effort has taken place to elucidate the detailed molecular biology of this virus and its effects within infected cells13. Among the vast amount of information generated, single-cell RNA sequencing (scRNA-seq) datasets are particularly interesting as they can provide a detailed picture of the transcriptional state of individual cells at different stages during the infection cycle4,5. This is possible because the synthesis of cDNA libraries is performed on individualized cells and each cDNA product is labeled with an identifier that provides unique markers for each transcribed molecule68. Therefore, instead of describing the average transcriptional state of thousands to millions of cells, as in bulk RNA-seq, scRNA-seq captures a continuum of transcriptional states that can be used as a proxy to reconstruct the chronological response of a cell population to stimuli911.

In virology, scRNA-seq was first used to study the heterogeneity of hepatitis C virus quasispecies in liver cells12. Since then, most virology scRNA-seq studies have focused on characterizing the heterogeneity of host cells infected with a wide range of viruses such as human papillomavirus, West Nile virus, yellow fever virus, Zika virus, or human immunodeficiency virus type 1, among others5,13,14. In addition, more recent studies have started to address questions related to the transcriptional dynamics of viruses and infected cells upon infection5,15,16. SARS-CoV-2 is not an exception, and since the emergence of the COVID-19 pandemic, dozens of papers have been published addressing different aspects of the molecular biology of this virus1720. Unfortunately, despite the large volume of available data, most scRNA-seq investigations on SARS-CoV-2 have focused mostly on characterizing cell tropisms or molecular mechanisms at macroscopic timepoints along the infection cycle.

In this study, we present an analysis of the host cell response during the progression of SARS-CoV-2 infection using virus accumulation as a proxy of time. To obtain common features of the SARS-CoV-2 response, we performed a meta-analysis of three publicly available scRNA-seq datasets derived from in vitro inoculation studies of human bronchial epithelial cells21, and colon and ileum organoids22. Our results revealed that the average transcriptional gene response to SARS-CoV-2 infection is non-linear and cannot be described adequately using conventional bulk RNA-seq analysis methods. For this reason, instead of conventional two-group comparisons (i.e., infected vs noninfected), our analyses involved pairwise comparisons of individual transcriptional profiles and the average transcriptional response to within-cell virus accumulation. Our results revealed that about 90% of genes exhibited the same qualitative response to infection, suggesting that the transcription of most genes is modulated by a global mechanism. Additionally, a different but highly correlated response was found among mitochondrial-encoded genes and genes involved in protein translation. A mechanism explaining the genome-wide transcriptional response to SARS-CoV-2 infection is proposed.

Results

The global transcriptional response to SARS-CoV-2 exhibits a wave-like pattern

First, we tried to understand the global transcriptional response of infected cells by plotting the distribution of log2 fold changes (log2FC) of transcripts as a function of the log2 of SARS-CoV-2 levels (Fig. 1a and Supplementary Data 1). This analysis indicated that the average transcriptional response displays an oscillatory behavior that can be divided qualitatively into three phases. The early phase occurs at low viral RNA levels (≲1 TPT) and is characterized by a global shutdown of transcription where 99.5% of genes in hBECs, 98.7% in colon, and 98.0% in ileum are downregulated. The intermediate phase occurs at medium viral loads and is characterized by an increase in expression levels with maximum fold changes between 1.4 and 4.1. Finally, the late phase corresponds to the highest SARS-CoV-2 levels and was characterized by a second downregulation of cellular transcripts.

Fig. 1. Global transcriptional response to SARS-CoV-2 in human bronchial epithelial cells, colon- and ileum-organoids.

Fig. 1

a Distribution of log2FC values for all genes plotted against SARS-CoV-2 reads using a log2 scale. Mean log2FC values at each interval are shown to the left of each histogram. b Volcano plot analysis of infected cells with respect to noninfected cells using bulked cells, and subgroups comprising cells at early, middle, and intermediate phases of infection. For clarity, corrected P values were removed from the ordinate axis. c Mean transcriptional response of each cell type. Each plot was calculated from individual gene profiles scaled using z-scores with respect to the transcript levels observed in uninfected cells. These plots confirm the global oscillatory transcriptional profile of genes at increasing SARS-CoV-2 levels.

As the oscillatory transcriptional response to SARS-CoV-2 might bias conventional bulk differential gene expression (DGE) analyses, we tested the effect of selecting cells from different phases as contrasting groups. In Fig. 1b, we show the volcano plots that result from comparison with all the infected cells, in addition to comparison involving early, intermediate, and late phase cells (Fig. 1b). These results unveil that the classification of genes as down- or up-regulated is highly dependent on how the contrasting cell populations are selected. For example, in hBECs, the bulk DGEs analysis revealed 9429 down- and 481 up-regulated genes; and similar results were obtained using cells from the early phase (9710 down- vs 200 up-regulated genes). However, this trend is reversed when cells from the middle phase are used: 263 down- vs 9647 up-regulated genes are now observed. Finally, as expected from the distribution of log2FC values, DGE trends are reversed again when cells from the late phase are compared (9,810 down- vs 99 up-regulated genes). Similar results were obtained in the analysis of infected colon and ileum cells (Fig. 1b and Supplementary Data 3).

To better compare the global transcriptional response at different stages of the viral infection cycle, we plotted the average scaled gene expression values at different viral loads (Fig. 1c). To do this, expression values at each log2 interval were transformed into z-scores with respect to the mean and standard deviations of uninfected cells. This correction removes biases in the mean response patterns that might result from highly-expressed genes. After this transformation, intervals of up- or down-regulation correspond to positive and negative values, respectively. Data ordered in this way can be interpreted as a pseudotime, which, here, represents doublings in viral concentration9,11,23. In hBECs, z-scores fluctuated between −1.2 and 0.9, and transitions between phases occurred at pseudotimes of 3.5 (~10.3 TPT) and 6.2 (~72.5 TPT) (Fig. 1c). In colon- and ileum-organoids downregulation during the early phase was less pronounced but exhibited a stronger upregulatory response in the intermediate phase. In these cases, z-scores fluctuated between −0.5 and 2.3 in the colon, and −0.3 and 4.7 in the ileum. Transition points occurred at pseudotimes of 4.9 (~28.9 TPT) and 11.4 (~2,701.3 TPT) in the colon, and 4.0 (~15 TPT) and 12.7 (~6653 TPT) in the ileum (Fig. 1c). To verify the specificity of the SARS-CoV-2 response, we performed a similar analysis on ileum cells infected HAstV1 (Supplementary Fig. 3) which revealed a different profile characterized by a rapid drop in transcription levels followed by a steady increase at high virus accumulations. A strong wave-like behavior was not observed in this case.

Classification of gene responses

Next, we proceeded to identify genes with abnormal transcriptional profiles using two metrics: the root-mean-square deviation (RMSD) to the average response vector and the magnitude of the initial response (Δ0) (Fig. 2a). The latter was used to give a stronger weight to datapoints at earlier infection times and to select for genes where the initial transcriptional response was either weaker or stronger than expected. A transcriptional profile was considered an outlier when either of these parameters was outside the intervals defined by the 1.5×IQR rule. Our results indicate that most cellular transcripts have a response pattern qualitatively similar to the mean response vector, with only 7.2 to 10.1% of genes exhibiting atypical transcriptional profiles (Fig. 2b and Supplementary Data 3); 183 outliers were shared between the three cell types (Fig. 2c). A gene ontology (GO) analysis of all the outliers from the three cell types (Fig. 2d) revealed that this subset was enriched in genes involved in immune response (e.g., GO:0030593, GO:0031640, and GO:0061844), translation (e.g., GO:0042274, GO:0000028, GO:000641, and GO:0002181), respiration (e.g., GO:0006120 and GO:0042776), and control of cellular proliferation and death (e.g., GO:0001895). The subset was also significantly depleted in genes involved in intracellular protein transport (e.g., GO:0006886), protein phosphorylation (e.g., GO:0006468), and regulation of transcription (e.g., GO:0006355 and GO:0006357).

Fig. 2. Classification of genes based on their similarity to the mean response vector.

Fig. 2

a Examples of selected individual transcriptional profiles. Individual transcriptional responses (blue) were compared to the mean response (gray) using the root-mean-square deviation (RMSD) and the magnitude of the initial response (Δ0). The Pearson correlation (r) for each comparison is also provided. b A plot of RMSD and initial response values for all genes, allows the identification of genes exhibiting abnormal transcriptional responses (outliers, red). c Venn diagrams illustrating the distribution of outliers among the three datasets used in the analysis. d GO-enrichment analysis of outliers from the three datasets indicating that most outliers correspond to genes involved in immune response, translation, and cellular respiration.

As the differential transcriptional response might be the result of general regulatory mechanisms induced or inhibited by SARS-CoV-2, we decided to interrogate the data to identify common responses involving interferon signaling, transcription factors, and RNA binding proteins.

Transcriptional response of IFN-stimulated genes

At early infection times, SARS-CoV-2 downregulates the expression of IFN-stimulated genes (ISGs) involved in antiviral defense2427. We then decided to investigate whether genes involved in the IFN response follow an abnormal transcription pattern upon SARS-CoV-2 infection. Firstly, we analyzed the transcriptional profiles of genes encoding for MDA-5 and RIG-I, which are the main intracellular sensors that recognize double-stranded RNAs and trigger the innate immune response upon infection2729. Analysis of the transcriptional profiles of MDA-5 and RIG-I, encoded by genes IFIH1 and DDX58, respectively, revealed that both have a transcriptional response like the average response vector (Fig. 3a upper panels and Supplementary Data 3). This result suggests that the transcriptional shutdown of the IFN response induced by MDA-5 and RIG-I is probably not specific, and results from a more general mechanism affecting most genes in the infected cell.

Fig. 3. Global transcriptional response of ISGs in human bronchial epithelial cells, colon-, and ileum-organoids.

Fig. 3

a The transcriptional response of the cytoplasmic viral receptors MDA-4 and RIG-I correlated with the mean transcriptional response. Enrichment of GO terms associated with both signaling pathways are indicated. b Distribution of RMSD and Δ0 profiles of ISGs. Genes considered to be outliers are shown in red. c Upset plots illustrating the number of outlier ISGs shared among the three datasets. The first row corresponds to ISGs exhibiting an average response in the three datasets. The six genes exhibiting a differential response to SARS-CoV-2 infections in the three cell types are indicated. d Representative examples of ISGs exhibiting average (IRF9) and abnormal (HSP90AA1) transcriptional responses to SARS-CoV-2 infection.

We then proceeded to analyze the response of individual ISGs in the three datasets. To do this, we selected genes using a recently curated and validated dataset of ISGs30. A total of 238, 179, and 219 ISGs were identified in hBECs, colon, and ileum, respectively (Supplementary Data 3). As expected, a GO analysis revealed that most of the identified ISGs are involved in the defense response to viruses (Supplementary Fig. 3). Like the MDA-5 and RIG-I receptors, 77–90% of ISGs exhibited an average transcriptional response to SARS-CoV-2 indicating that the mRNA levels of these genes are negatively regulated at the onset of infection and their response is probably a consequence of the global transcriptional shutdown (Fig. 3b). Examples of ISGs exhibiting different response are shown in Fig. 3d. An analysis of abnormal transcriptional responses identified 55 genes from hBECs, 18 from colon and 25 from ileum; six of them common to the three datasets: heat shock protein HSP90α (HSP90AA1), interferon-induced protein with tetratricopeptide repeats 2 (IFIT2), interferon-induced protein with tetratricopeptide repeats 3 (IFIT3), heat shock cognate 71 kDa protein (HSPA8), normal mucosa of esophagus-specific gene 1 protein (C15orf48), and interferon alpha-inducible protein 27 (IFI27) (Fig. 3c). HSP90AA1 and HSPA8 are chaperone proteins which are typically required for viral replication31. In addition, HSPA8 has been shown to be an attachment factor for avian infectious bronchitis coronavirus32. C15orf48 expression, on the other hand, has been shown to be involved in the mitochondrial stress response (MISTR) and is expressed by pathogenic macrophages in severe COVID-1933,34. Interestingly, IFI27 and IFIT3 are also involved with the mitochondrial processes; the former locates in the nuclear inner membrane and is indispensable for mitochondrial function35, and the latter is a mediator of the mitochondrial antiviral signaling (MAVS) complex triggered by the MDA-5 and RIG-I signaling pathways36. However, a connection between these genes and the global transcriptional profiles observed upon SARS-CoV-2 infection does not seem obvious.

Transcriptional response of transcription factors

To further understand the effect of SARS-CoV-2 on transcription factors (TF), which could explain the initial transcriptional shutdown, we investigated the effect of SARS-CoV-2 on individual transcription factors from a curated collection of known and likely human transcription factors37. Our analysis identified a total of 785, 448, and 531 TFs in hBEC, colon, and ileum, respectively, with 409 TFs in common to the three datasets (Supplementary Data 3). Again, most transcriptions factors behaved like the mean response vector, and only 36 TFs in hBECs, 16 in colon, and 22 in ileum were classified as outliers (Fig. 4a and Supplementary Data 3). Three TFs were found to be differentially regulated in the three datasets: the Y-box binding protein 1 (YBX1), and the proto-oncogene TFs JUN and JUND (Fig. 4b and Supplementary Data 3). YBX1 encodes a highly conserved cold shock domain protein with DNA and RNA binding properties implicated in the regulation of transcription and translation, pre-mRNA splicing, DNA reparation, mRNA packaging, cell proliferation, stress response, and apoptosis. It has been shown to be a component of messenger ribonucleoprotein (mRNP) complexes with a role in microRNA processing38,39. The product of gene JUN is a component of the AP1 TF complex involved in oncogenic transformation40 and is targeted to the nucleolus, where it seems to modulate nucleolar architecture and ribosomal RNA accumulation41. The expression profile of JUN stands out as its transcriptional profile suggests transcriptional activation at later infection times and its similar response in the three datasets suggests that this gene might be a marker that induces apoptosis in infected cells (Fig. 4c). JUND encodes a JUND proto-oncogene that is a functional component of the AP1 TF complex and has been proposed to protect cells from p53-dependent senescence and apoptosis42. These results suggest that the transcriptional response of most TFs is non-specific; however, exceptions probably include TFs expressed after the initial transcriptional shutdown induced after the activation of cell-death pathways at late SARS-CoV-2 infection stages. Again, a connection between JUN, JUND, or YBX1 and the global transcriptional response is unclear.

Fig. 4. Global transcriptional response transcription factors.

Fig. 4

a Distribution of transcriptional responses of identified transcription factors from hBECs, colon, and ileum datasets. Outliers are colored red. b Upset plots illustrating the number of outlier IFs shared among the three datasets. Most TFs (359) were common to the three datasets and exhibited an average response. The three TFs exhibiting a differential response to SARS-CoV-2 infections in the three cell types are indicated. c Representative examples of TFs exhibiting average (NFKB1) and abnormal (JUN) transcriptional responses.

RNA binding proteins

As levels of mRNA might be modulated by their interaction with RNA binding proteins (RBPs), we explored the transcriptional profiles of genes from a dataset of canonical RBPs from the eukaryotic RBP database, EuRBPDB43. More than a thousand RBPs were identified in each dataset (Fig. 5a and Supplementary Data 3), of which ~10% exhibited abnormal transcriptional profiles (Fig. 5b). Interestingly, 83 abnormal responding genes were shared between the three datasets (Fig. 5b) which contrast with the analysis of ISGs and TFs where only a handful of abnormal responding genes shared among datasets. Figure 5c shows illustrative examples of representative RBPs with average (SRPNP35) and abnormal (RPS5) transcriptional responses. GO analysis of genes abnormal responding genes revealed that these were enriched in terms involved in the synthesis of proteins involved in translation (Fig. 5d, lower panel). A detailed analysis of the outliers revealed that the common response involved 41 large ribosomal subunit proteins, 30 small ribosomal subunit proteins, four elongation factors (EEF1A1, EEF1B2, EEF1G, and EEF2), the FAU ubiquitin-like and ribosomal protein S30 fusion (FAU), the histidine triad nucleotide-binding protein 1 (HINT1), the heterogeneous nuclear ribonucleoprotein A1 (HNRNPA1) involved in pre-mRNA processing in the nucleus, nucleolin (NCL) involved in ribosome processing, the ras-related nuclear protein (RAN) essential for the translocation of RNA and proteins through the nuclear pore complex, the signal recognition particle 14 (SRP14) involved in protein targeting to ER, and the ubiquitin A-52 residue ribosomal protein fusion product 1 (UBA52) involved in protein degradation by the 26S proteosome, and the YBX1 gene identified in the previous TF analysis. Finally, a search in the STRING database, revealed that most abnormal responding RBPs are part of a highly connected physical network, which suggests a distinct response to SARS-CoV-2 infection of genes implicated in translation (Fig. 5e).

Fig. 5. Transcriptional response of RNA binding proteins.

Fig. 5

a Distribution of transcriptional responses of RBPs identified in hBECs, colon, and ileum datasets. Outliers are colored in red. b Upset plots illustrating the number of outlier RBPs shared among the three datasets. In contrast to ISGs and TFs, a large number of RBP outliers were common to the three datasets (83). c Representative examples of RBPs with average (SRPNP35) and abnormal (RPS5) transcriptional responses. d GO annotation of the 25 most common terms of RBPs with normal and abnormal transcriptional responses. e A STRING database search of differentially regulated RBPs revealed a highly connected functional network suggesting that transcripts involved in translation are regulated differently with respect to most genes.

Gene regulatory networks

As gene expression is often modular and usually involves the coordinated expression of functionally related genes9, we investigated the correlation of transcriptional responses between pairs of genes in the three datasets. First, we constructed individual correlation networks using the normalized transcriptional profiles (Supplementary Data 2, 4). Then, we used the individual networks to construct a general response network comprising all the edges shared between the three individual networks (Fig. 6a). The resulting network comprised 25 connected components, the largest of which consisted of 348 genes that exhibited transcriptional profiles similar to the mean response vector (Fig. 6b). The second largest connected component comprised 54 genes, and included mostly ribosomal protein genes, and proteins involved in translation. In this case, all genes within the network exhibited a qualitative behavior that was characterized by weaker downregulation and upregulation of transcription during the early and middle phases of infection as compared to the mean response vector (Fig. 6b). Interestingly, the third largest connected component was comprised exclusively of mitochondrially encoded genes, suggesting that response of mitochondrial genes upon SARS-CoV-2 infection follows a different mechanism from the global response and genes involved in translation (Fig. 6b). The remaining connected components comprised between two to three genes and some of them exhibited transcriptional responses like those observed in the largest components (Supplementary Data 4); they were no included as part of the largest subnetworks as the some of the edges failed to meet the filtering criteria. Previous studies have suggested a role of the 5′ UTR of mRNAs in the ability to escape the global suppression of translation induced by SARS-CoV-2 and related coronaviruses4446. To investigate the presence of potential sequence motifs in transcripts from the largest connected components, we generated a sequence logo using the first 20 nucleotides of reference sequences from these clusters. This analysis suggests that transcripts from the first connected component tend to have A, C, or G as the first nucleotide with equal probability and higher GC content between positions 7–20 (Fig. 6c). In contrast, sequences from the second connected component contain TOP-motifs in their 5′ UTRs, which is to be expected for transcripts encoding ribosomal proteins and components of the translation apparatus46,47. Interestingly, analysis of protein translation in cells expressing SARS-CoV-2 protein Nsp1 revealed that the translation efficiency was slightly higher for genes with lower GC content in their 5′ UTR46. Finally, an analysis of the transcriptional profiles in cells infected with HAstV1 did not reveal a distinct correlated response in genes from modules 1, 2, and 3 (Supplementary Fig. 3), suggesting that transcriptional clusters are probably specific to SARS-CoV-2.

Fig. 6. Correlated responses to SARS-CoV-2 infection common to the three cell types.

Fig. 6

a The network was constructed by connected pairs of genes exhibiting a significant correlation in their transcriptional profiles in the three datasets. This analysis resulted in a network with 25 connected components comprising between 348 to 2 genes. Node size is scaled to represent the clustering coefficient. b The transcriptional profiles of the three largest connected components revealed distinct response profiles associated with the mean response (component 1), genes involved in translation (component 2), and mitochondria-encoded genes (component 3). In each example, profiles of individual genes are colored gray; the average transcriptional response of genes within each module is shown in red. For comparison, the mean response profile is shown in blue. c Sequence logos of the 5′ UTRs reveal the presence of characteristic sequence signatures for genes belonging to modules 1 and 2.

Discussion

Our analysis indicates that the global transcriptional response to SARS-CoV-2 infection exhibits a behavior characterized by three phases: an initial downregulatory phase, a middle upregulatory phase, and a late downregulatory phase. Clearly, this type of response cannot be analyzed using the standard two-group comparisons typically used in bulk RNA-seq methods (e.g., volcano plots)4850. This is of special concern in the case of viral infection studies, as macroscopic timepoints (e.g., days or hours) comprise a complex mixture of cellular states that must be disentangled prior to running any biologically meaningful analysis. Failure to do this can introduce biases in the selection of DGEs as opposing regulatory trends in cells at different transcriptional stages might cancel each other. Analysis of the viral infection process should probably include other metrics, and as suggested here, transcriptional profiles might be a better representation of the response of individual genes along the infection timeline. Standard filtering procedures of cells in scRNA-seq are also another important source of bias, as is customary to discard cells with a low number of mapped genes and/or a high proportion of mitochondrial transcripts5154. This filtering strategy might not be adequate in studies of viral dynamics as, at late infection times, cells are expected to be dominated by viral transcripts and the diversity of transcribed genes is expected to be low, thus removing cells that might provide valuable information on the molecular events happening in the late stages of viral infection. In our opinion, in single-cell virology studies, cells should be selected with respect to the number of individual transcripts and not the total number of mapped genes. A similar logic applies to the removal of cells with high levels of mitochondrial-encoded transcripts, as overexpression of mitochondrial genes might represent a legitimate response to viral infection, as has been shown for SARS-CoV-25557.

Considering our observations, we wish to propose an updated model of SARS-CoV-2 infection. A good model must be able to explain: (i) the transcriptional downregulatory phase at early infection times exhibited by about ~90% of genes, (ii) the selective transcriptional modulation of mitochondrial-encoded genes and genes involved translation, (iii) the global increase in host transcript levels observed at the middle of the infection cycle, and (iv) the second downregulatory phase at late infection times. This behavior seems to be a feature of SARS-CoV-2 infection as a similar analysis performed on ileum cells infected with HAstV1 did not reveal a correlated response for most genes, and did not exhibit an intermediate upregulatory phase (Supplementary Fig. 3 and Supplementary Data 6).

We believe that the multiprong strategy proposed by Finkel et al.26, in combination with the known biology of the viral nonstructural protein 1 (nsp1), can explain the results presented here. The multiprong strategy explains the shutdown of host protein synthesis induced by SARS-CoV-2 as a combination of three effects: a global inhibition of protein translation, degradation of cytosolic cellular transcripts, and blockage of nuclear mRNA export26. Additionally, it is also well-established that nsp1 is responsible for orchestrating the transcriptional and translational shutdown induced by SARS-CoV-2. First, nsp1 is a strong inhibitor of translation, affecting translation of both host and viral mRNA58 by docking its C-terminal domain to the mRNA entry channel of the 40S ribosomal subunit25,59,60. Second, the expression of nsp1 is sufficient to induce the global degradation of host mRNAs26,44,6163 by an unknown mechanism independent of a viral encoded RNase and/or ribonuclease L62,64. Third, immunoprecipitation and mass spectrometry studies have shown that the N-terminal domain of nsp1 interacts with the mRNA export receptor protein NFX1 preventing its binding to mRNA export adapters that results in the accumulation of mRNA in the nucleus65. Clearly, nsp1 is the key factor involved in shutting down host protein expression.

The observed downregulatory phase is well-supported by many studies that have shown that soon after infection, SARS-CoV-2 mRNA transport is stalled and there is a rapid degradation of cytoplasmic mRNA26,44,6163. Some believe that the inhibition of nuclear mRNA export is a direct consequence of the widespread mRNA degradation in the cytosol62. However, we believe the opposite to be true and propose a mechanism where the downregulatory phase results from altering the natural steady-state dynamics between mRNA degradation and export to the cytoplasm (Fig. 7). The levels of mRNAs in the cytoplasm are determined by a steady-state equilibrium between the rates of transcription, nuclear export, and natural RNA turnover in the cytoplasm66, therefore, a global blockage of mRNA export will compromise the input of newly synthesized mRNA into the cytoplasm resulting in global decrease of cytoplasmic mRNA levels and an increase in concentration in the nucleus (Fig. 7b). In other words, expression of SARS-CoV-2 nsp1 leads to a cellular state where mRNA is stalled at the nucleus and cannot replace the mRNA being degraded in the cytoplasm by its natural turnover rate (Fig. 7b). This is supported by RNA-seq data on infected cells that revealed nuclear accumulation of mRNA and a reduction of mRNAs in the cytoplasm and increased levels of intronic reads26,67. Moreover, the mRNA export block precedes the reduction in mRNA levels and only requires expression of nsp1;65 and a study of IFNB1 induction upon SARS-CoV-2 infection found that soon after infection, transcribed mRNAs fail to disseminate from transcriptional foci and are preferentially retained in the nucleus62.

Fig. 7. Proposed model to explain the transcriptional response of infected cells to SARS-CoV-2 infection.

Fig. 7

a Selective nuclear export of nsp1-sensitive transcripts will result in the degradation of cytoplasmic mRNAs and their accumulation in the nucleus. Once the blockage is removed, retained transcripts are released, resulting in a rapid increase in cytoplasmic mRNA levels. A shutdown of transcription during the late infection stage will result in a global decrease of host mRNAs. b A simplified dynamical model simulating nuclear mRNA export and transcription as on/off switches is enough to model the mean transcriptional profile. c Reinterpretation of the mean transcriptional profile of hBECs. In contrast to the on-off switch model presented in panel b, it is most likely that the shutdown of transcription and nuclear export occurs gradually, as shown in the upper panel. The dotted lines represent early infection states that could not be detected in the present study.

Inhibition of nuclear transport by nsp1 can also explain the differential transcriptional response of transcripts involved in translation and mitochondrial-encoded transcripts. Most nuclear-encoded transcripts are exported into the cytoplasm by a mechanism involving the heterodimeric export receptor NXF1·NXT168 and it has been shown that nsp1 displaces NXF1 from the nuclear pore complex, impairing the docking of mRNA65. However, some transcripts can be exported by alternative pathways, indeed, regulation of mRNA export is a mechanism used to modulate several critical biological processes such as DNA repair, stress response, and maintenance of pluripotency68. We postulate that host transcripts can be classified into nsp1-sensitive and nps1-insensitive. In our view, nsp1-sensitive transcripts correspond to transcriptionally active loci at the time of infection that are blocked by the interaction of nsp1 with NXF1. These transcripts correspond to about 90% of the transcriptome, including genes involved in the interferon response. This could explain the differential response of mRNAs involved in translation. Interestingly, a selective nuclear export mechanism of mRNAs related to translational function, including ribosomal proteins, have been shown to exist involving protein TDP-4369, an RNA binding protein involved in the regulation of transport and translation of mature mRNA in the cytoplasm70. The differential regulation of genes involved in translation has been independently demonstrated by several studies, which revealed that genes with 5′ terminal oligopyrimidine tracts tend to escape the suppression of translation induced by SARS-CoV-246 and replacement of these sequences with purines results in reduced translational efficiency46. However, we believe that possessing the TOP motif is not a sufficient condition for the selective transport of translational mRNAs, as several TOP containing sequences are part of the mean response subset (Supplementary Data 3). Selective mRNA export inhibition also explains the differential response of mitochondrial-encoded genes as these genes do not require a nuclear export mechanism and should not be affected by the nsp1 blockage. The differential regulation of mitochondrial-encoded genes is also well-supported by experimental evidence that showed that they are less affected by SARS-CoV-2 infection than cellular transcripts26,46,56.

The upregulatory response observed during the middle phase of SARS-CoV-2 infection is, to the best of our knowledge, an unexpected observation that deserves additional discussion. If we assume that nsp1 is degraded or inactivated after it has accomplished its function, then the stalled mRNA transcripts can be finally exported to the nucleus in a rapid burst which would look like an upregulatory event during the intermediate infection stage (Fig. 7). This rapid burst of newly synthesized transcripts can also explain the late but strong response of the interferon system typical of COVID-19 disease63,71,72. It is also reasonable to assume that, when the nuclear blockage stops, it is already too late for the host cell to outcompete the replicating virus, which results in the final downregulatory phase observed at high SARS-CoV-2 levels. Qualitatively, this behavior can be simulated with a simple dynamical model where global mRNA export and transcription act like switches (Fig. 7a). Expression of nsp1 acts like an off-switch for nsp1-sensitive transcripts that results in their accumulation in the nucleus while nsp1-insensitive transcripts can transit normally to the cytoplasm. The middle phase corresponds to the inactivation of the nsp1-induced blockage, which releases the stalled transcript to the cytoplasm that results in the increased transcription levels observed during the middle phase. Finally, the late phase corresponds to a shutdown of transcription that also results in a global decrease in transcript levels.

In summary, our analysis reveals a complex transcriptional response to SARS-CoV-2 infection that can be easily explained using the blockage of mRNA transport as the key molecular event. We believe that this model provides a parsimonious explanation of the host transcriptional and translational shutdown induced by SARS-CoV-2 and suggests that targeting the nsp1 ability to disrupt nuclear export might be the key to counteract the effect of this virus within infected cells.

Methods

Data

Raw sequencing files were downloaded from the Sequence Read Archive (SRA) at NCBI using the SRA toolkit 2.11.1 (https://www.ncbi.nlm.nih.gov/sra/) and included previously published scRNA-seq data on SARS-CoV-2 infection of human bronchial epithelial cells (hBECs)21 and human intestinal epithelial cells (hIECs)22. The hBECs data comprised transcripts from a mock infection, and cells at 1-, 2-, and 3-days post-infection (dpi) with SARS-CoV-2 isolate USA/USA-WA1/202021. The hIECs data was produced from colon- and ileum-derived organoids infected with SARS-CoV-2 isolate strain Germany/BavPat1/202022, and comprised transcripts from a mock infection, and infected cells at 12- and 24-hours post-infection (hpi). In both studies, gel bead-in emulsions (GEMs) were prepared from single-cell suspensions using the 10× Genomics Single Cell 3′ Library Kit NextGem V3.1 (10× Genomics, USA). hBECs libraries were sequenced on a NovaSeq600 system21 and hIECs using a HiSeq4000 system22. In addition, a dataset involving infection of ileum organoids with human astrovirus 1 (HAstV1), and studied under similar conditions, was used as a control73.

Data processing

Datasets were processed using custom Python scripts. Only sequences with a perfect match to the 10× genomics whitelist (3M-February-2018.txt.gz) and no ambiguous nucleotide calls in the UMI portion were used in the analysis. Selected sequences were mapped to a custom database comprising the complete set of reference human messenger RNAs available at NCBI, mitochondrial-encoded transcripts from the human reference mitochondrial genome (NC_012920), and SARS-CoV-2 genomes corresponding to the reference sequence Wuhan-Hu-1 (NC_045512) and isolate USA/USA-WA1/2020 (MW811435) and Germany/BavPat1/2020 (MZ558051) used for infection in both datasets (Supplementary Data 1). Read mapping was performed with MagicBLAST74. Only sequences that mapped unambiguously to a single gene with more than 95% coverage and a single UMI were included in this analysis. In contrast to the originally published analyses21,22, we selected GEMs with respect to the total number of transcripts (UMIs) instead of the total number of transcribed genes. Empty GEMs and multiplets were removed from the datasets using a custom Python script that ordered GEMs with respect to the number of transcripts transformed using a log10 scale; then the local standard deviation of the ordered data was calculated using a window size of five datapoints. Upper and lower thresholds delimiting multiplets and void beads were determined using the 1.5×IQR rule (Supplementary Table 1 and Supplementary Fig. 1). Mapping information from the selected cells was then transformed into a matrix of counts which included genes detected in at least one hundred cells (Supplementary Data 1 and Supplementary Fig. 2). Furthermore, in contrast to the original analysis of these datasets, we did not discard cells with a high percentage of transcribed mitochondrial genes as overexpression of mitochondrial genes could be a legitimate response to SARS-CoV-2 infection52,56.

Estimation of gene frequencies

The probability of detecting n transcripts from a gene in a cell depends on its relative frequency and the total number of sampled transcripts. Therefore, as cDNA amplification of cellular transcripts represents only 10 to 20% of the total mRNA content6,75,76, gene count data from scRNA-seq contains many missing values, or dropouts, that result from the Poisson sampling of low abundance transcripts in cells with few sequenced transcripts9,76. The probability of observing ngi read counts for a given gene g ∈ {1, 2,…, G} across cells i ∈ {1, 2,…, I} can be modeled using a Poisson distribution where the likelihood of observing ngi counts given a particular transcript abundance θ gi is given by:

L(θgi)=pθgi(ngi,Ni)=(θgiNi)ngieθNingi!,0θgi1 1
withNi=g=1Gngi

In this work, gene abundances were calculated as the weighted average of gene frequencies (θ) with respect to the normalized likelihood function (L^) and multiplied by a scaling factor of 104 (transcripts per ten thousand or TPT):

TPTgi=104θ=01θgiL^(θgi) 2
withL^(θgi)=L(θgi)i=1IL(θg).

An uncorrected normalized matrix was also calculated. Corrected and uncorrected frequency matrices are available in Supplementary Data 2.

Volcano plot analysis

Basal levels of expression for each gene were determined using the average expected TPT values from mock cells with at least one gene count per gene of interest. Infected cells, on the other hand, were binned with respect to viral loads using a log2(TPTSARS-CoV-2 + 1) scale. Only cells with at least one count for SARS-CoV-2 and the target gene were used. P values were calculated using a two-tailed Mann–Whitney U-test and corrected using the Benjamini–Hochberg method77 using a significance threshold of 0.01. Differential expression analysis results are provided in Supplementary Data 3.

Z-score profiles and analysis

Normalized transcriptional responses were calculated using a z-score of the transcription levels observed at each log2(TPTSARS-CoV-2 + 1) corrected by the levels and standard deviation observed in the uninfected cells. For each cell type, a mean response vector was calculated by averaging the normalized response of all genes. Pairwise comparisons between individual transcriptional profiles and the mean response vector were performed using a root-mean-square deviation (RMSD) metric and the magnitude of the initial response (Δ0) was measured as the average z-score of the first five datapoints. A transcriptional response was considered an outlier when either the RMSD or Δ0 was outside the intervals defined by the 1.5×IQR rule.

Gene ontology (GO) analysis

Analysis of biological functions was performed using a custom database of high-quality GO terms of human proteins downloaded from the Uniprot database (UniProt Consortium, 2015; Supplementary Data 3). Quantification of the most abundant GO terms was performed using custom scripts that mapped selected genes to a dictionary of GO annotations. GO-enrichment analysis was performed by comparison of gene subsets to the list of mapped genes as background. P values were calculated using Fisher’s exact test and corrected Benjamini–Hochberg method (with a threshold significance of 0.01).

Construction of correlation networks

Undirected correlation networks were built using the normalized transcription profiles. A link between two genes was established when the two-sample Kolmogorov–Smirnov test indicated that both z-score distributions followed the same distribution (P ≤ 0.01) and the corrected Pearson correlation coefficient between expression profiles was r ≥ 0.9. In both instances, P values were corrected using the Benjamini–Hochberg method77. Only those genes which had at least 50 measurements in common were included in the analysis (Supplementary Data 5). Network properties were characterized with the Python networkX package78 and visualized with Gephi79.

Statistics and reproducibility

Details about the statistical analyses used in this study are described in the corresponding methods section. For volcano plots, the statistical significance was calculated using a two-tailed Mann–Whitney U-test. Transcriptional responses were classified as outliers when their RMSD or initial response (Δ0) was outside the intervals defined by the 1.5×IQR rule. For GO-enrichment analyses, P values were calculated using Fisher’s exact test. In the construction of the correlation network, correlations were measured using the two-sample Kolmogorov–Smirnov test on the Z-score distributions (P ≤ 0.01) and the corrected Pearson correlation coefficient between expression profiles (r ≥ 0.9). In all these cases, P values were corrected using the Benjamini–Hochberg method77.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Supplementary information

Peer Review File (5.3MB, pdf)
42003_2022_4253_MOESM3_ESM.pdf (489.9KB, pdf)

Description of Additional Supplementary Files

Supplementary Data 1 (276.8MB, gz)
Supplementary Data 2 (425.5MB, zip)
Supplementary Data 3 (8.4MB, gz)
Supplementary Data 4 (1.1GB, gz)
Supplementary Data 5 (27MB, zip)
Supplementary Data 6 (1.2GB, zip)
Reporting Summary (1.1MB, pdf)

Acknowledgements

We thank Sergi Valverde, Salvador Durán-Nebreda, Irene Otero-Muras, the members of the I2SysBio-CRM UA DisCoVir, and our labmates for useful suggestions and productive discussions. This work was supported by European Commission – NextGenerationEU (Regulation EU 2020/2094) through CSIC’s Global Health Platform (PTI + Salud Global) grants SGL2021-03-009 and SGL2021-03-052 to S.F.E.

Author contributions

Conceptualization, Supervision, and Writing—reviewing and editing: P.A.G. and S.F.E. Data curation, investigation, Illustrations, Writing—original draft: P.A.G. Funding acquisition: S.F.E.

Peer review

Peer review information

Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Zhijuan Qiu. Peer reviewer reports are available.

Data availability

The original datasets used in the analysis can be accessed at NCBI archived under BioProject accession codes PRJNA701930, PRJNA658984, and PRJNA720321. All data generated or analyzed during this study are included in this article and its supplementary information files. The Tables and processed datasets that support the findings of this study are available at the Zenodo repository (https://zenodo.org/) under the identifier (10.5281/zenodo.7198900).

Code availability

Codes that support the findings of this research are available at GitHub (https://github.com/paguties/SARS-CoV-2_scRNAseq.git).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s42003-022-04253-4.

References

  • 1.Hu B, Guo H, Zhou P, Shi ZL. Characteristics of SARS-CoV-2 and COVID-19. Nat. Rev. Microbiol. 2021;19:141–154. doi: 10.1038/s41579-020-00459-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Barrantes FJ. The contribution of biophysics and structural biology to current advances in COVID-19. Annu. Rev. Biophys. 2021;50:493–523. doi: 10.1146/annurev-biophys-102620-080956. [DOI] [PubMed] [Google Scholar]
  • 3.Brant AC, Tian W, Majerciak V, Yang W, Zheng Z. MSARS-CoV-2: from its discovery to genome structure, transcription, and replication. Cell Biosci. 2021;11:136. doi: 10.1186/s13578-021-00643-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Ciuffi A, Rato S, Telenti A. Single-cell genomics for virology. Viruses. 2016;8:123. doi: 10.3390/v8050123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Cristinelli S, Ciuffi A. The use of single-cell RNA-Seq to understand virus-host interactions. Curr. Opin. Virol. 2018;29:39–50. doi: 10.1016/j.coviro.2018.03.001. [DOI] [PubMed] [Google Scholar]
  • 6.Hwang B, Lee JH, Bang D. Single-cell RNA sequencing technologies and bioinformatics pipelines. Exp. Mol. Med. 2018;50:1–14. doi: 10.1038/s12276-018-0071-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ziegenhain C, et al. Comparative analysis of single-cell RNA sequencing methods. Mol. Cell. 2017;65:631–643.e4. doi: 10.1016/j.molcel.2017.01.023. [DOI] [PubMed] [Google Scholar]
  • 8.Ding J, et al. Systematic comparison of single-cell and single-nucleus RNA-sequencing methods. Nat. Biotechnol. 2020;38:737–746. doi: 10.1038/s41587-020-0465-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Chen G, Ning B, Shi T. Single-cell RNA-seq technologies and related computational data analysis. Front. Genet. 2019;10:317. doi: 10.3389/fgene.2019.00317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kharchenko PV. The triumphs and limitations of computational methods for scRNA-seq. Nat. Methods. 2021;18:723–732. doi: 10.1038/s41592-021-01171-x. [DOI] [PubMed] [Google Scholar]
  • 11.Ding J, Sharon N, Bar-Joseph Z. Temporal modelling using single-cell transcriptomics. Nat. Rev. Genet. 2022;23:355–368. doi: 10.1038/s41576-021-00444-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.McWilliam Leitch EC, McLauchlan J. Determining the cellular diversity of hepatitis C virus quasispecies by single-cell viral sequencing. J. Virol. 2013;87:12648–12655. doi: 10.1128/JVI.01602-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Russell AB, Trapnell C, Bloom JD. Extreme heterogeneity of influenza virus infection in single cells. eLife. 2018;7:e32303. doi: 10.7554/eLife.32303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Steuerman Y, et al. Dissection of influenza infection in vivo by single-cell RNA sequencing. Cell Syst. 2018;6:679–691. doi: 10.1016/j.cels.2018.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Shnayder M, et al. Defining the transcriptional landscape during cytomegalovirus latency with single-cell RNA sequencing. mBio. 2018;9:e00013–e00018. doi: 10.1128/mBio.00013-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Zanini F, et al. Virus-inclusive single-cell RNA sequencing reveals the molecular signature of progression to severe dengue. Proc. Natl Acad. Sci. USA. 2018;115:E12363–E12369. doi: 10.1073/pnas.1813819115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Ziegler C, et al. SARS-CoV-2 receptor ACE2 is an interferon-stimulated gene in human airway epithelial cells and is detected in specific cell subsets across tissues. Cell. 2020;181:1016–1035.e19. doi: 10.1016/j.cell.2020.04.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ramirez Alvarez C, et al. The endogenous cellular protease inhibitor SPINT2 controls SARS-CoV-2 viral infection and is associated to disease severity. PLoS Pathog. 2021;17:e1009687. doi: 10.1371/journal.ppat.1009687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Schuler BA, et al. Age-determined expression of priming protease TMPRSS2 and localization of SARS-CoV-2 in lung epithelium. J. Clin. Investig. 2021;131:e140766. doi: 10.1172/JCI140766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sen K, et al. Single-cell immunogenomic approach identified SARS-CoV-2 protective immune signatures in asymptomatic direct contacts of COVID-19 cases. Front. Immunol. 2021;12:733539. doi: 10.3389/fimmu.2021.733539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ravindra NG, et al. Single-cell longitudinal analysis of SARS-CoV-2 infection in human airway epithelium identifies target cells, alterations in gene expression, and cell state changes. PLoS Biol. 2021;19:e3001143. doi: 10.1371/journal.pbio.3001143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Triana S, et al. Single-cell analyses reveal SARS-CoV-2 interference with intrinsic immune response in the human gut. Mol. Syst. Biol. 2021;17:e10232. doi: 10.15252/msb.202110232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Griffiths JA, Scialdone A, Marioni JC. Using single-cell genomics to understand developmental processes and cell fate decisions. Mol. Syst. Biol. 2018;14:e8046. doi: 10.15252/msb.20178046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Schoggins JW, et al. A diverse range of gene products are effectors of the type I interferon antiviral response. Nature. 2011;472:481–485. doi: 10.1038/nature09907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Thoms M, et al. Structural basis for translational shutdown and immune evasion by the Nsp1 protein of SARS-CoV-2. Science. 2020;369:1249–1255. doi: 10.1126/science.abc8665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Finkel Y, et al. SARS-CoV-2 uses a multipronged strategy to impede host protein synthesis. Nature. 2021;594:240–245. doi: 10.1038/s41586-021-03610-3. [DOI] [PubMed] [Google Scholar]
  • 27.Rebendenne A, et al. SARS-CoV-2 triggers an MDA-5-dependent interferon response which is unable to control replication in lung epithelial cells. J. Virol. 2021;95:e02415–e02420. doi: 10.1128/JVI.02415-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sampaio NG, et al. The RNA sensor MDA5 detects SARS-CoV-2 infection. Sci. Rep. 2021;11:13638. doi: 10.1038/s41598-021-92940-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Thorne LG, et al. SARS-CoV-2 sensing by RIG-I and MDA5 links epithelial infection to macrophage inflammation. EMBO J. 2021;40:e107826. doi: 10.15252/embj.2021107826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Martin-Sancho L, et al. Functional landscape of SARS-CoV-2 cellular restriction. Mol. Cell. 2021;81:2656–2668.e8. doi: 10.1016/j.molcel.2021.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Geller R, Taguwa S, Frydman J. Broad action of Hsp90 as a host chaperone required for viral replication. Biochim. Biophys. Acta. 2012;1823:698–706. doi: 10.1016/j.bbamcr.2011.11.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Zhu P, et al. Heat shock protein member 8 is an attachment factor for infectious bronchitis virus. Front. Microbiol. 2020;11:1630. doi: 10.3389/fmicb.2020.01630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Liao M, et al. Single-cell landscape of bronchoalveolar immune cells in patients with COVID-19. Nat. Med. 2020;26:842–844. doi: 10.1038/s41591-020-0901-9. [DOI] [PubMed] [Google Scholar]
  • 34.Clayton SA, et al. Inflammation causes remodeling of mitochondrial cytochrome c oxidase mediated by the bifunctional gene C15orf48. Sci. Adv. 2021;7:eabl5182. doi: 10.1126/sciadv.abl5182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Jin W, Jin W, Pan D. Ifi27 is indispensable for mitochondrial function and browning in adipocytes. Biochem. Biophys. Res. Commun. 2018;501:273–9 s. doi: 10.1016/j.bbrc.2018.04.234. [DOI] [PubMed] [Google Scholar]
  • 36.Liu XY, Chen W, Wei B, Shan YF, Wang C. IFN-induced TPR protein IFIT3 potentiates antiviral signaling by bridging MAVS and TBK1. J. Immunol. 2011;187:2559–2568. doi: 10.4049/jimmunol.1100963. [DOI] [PubMed] [Google Scholar]
  • 37.Lambert SA, et al. The human transcription factors. Cell. 2018;172:650–665. doi: 10.1016/j.cell.2018.01.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lyabin DN, Eliseeva IA, Ovchinnikov LP. YB-1 protein: functions and regulation. RNA. 2014;5:95–110. doi: 10.1002/wrna.1200. [DOI] [PubMed] [Google Scholar]
  • 39.Nagasu S, et al. Y‑box‑binding protein 1 inhibits apoptosis and upregulates EGFR in colon cancer. Oncol. Rep. 2019;41:2889–2896. doi: 10.3892/or.2019.7038. [DOI] [PubMed] [Google Scholar]
  • 40.Meng Q, Xia Y. c-Jun, at the crossroad of the signaling network. Protein Cell. 2011;2:889–898. doi: 10.1007/s13238-011-1113-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Miyake T, McDermott JC. Nucleolar localization of c-Jun. FEBS J. 2022;289:748–765. doi: 10.1111/febs.16187. [DOI] [PubMed] [Google Scholar]
  • 42.Ruiz EJ, et al. JunD, not c-Jun, is the AP-1 transcription factor required for ras-induced lung cancer. JCI Insight. 2021;6:e124985. doi: 10.1172/jci.insight.124985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Liao JY, et al. EuRBPDB: a comprehensive resource for annotation, functional and oncological investigation of eukaryotic RNA binding proteins (RBPs) Nucleic Acids Res. 2020;48:D307–D313. doi: 10.1093/nar/gkz823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Huang C, et al. SARS coronavirus nsp1 protein induces template-dependent endonucleolytic cleavage of mRNAs: viral mRNAs are resistant to nsp1-induced RNA cleavage. PLoS Pathog. 2011;7:e1002433. doi: 10.1371/journal.ppat.1002433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Nakagawa K, et al. The endonucleolytic RNA cleavage function of nsp1 of Middle East respiratory syndrome coronavirus promotes the production of infectious virus particles in specific human cell lines. J. Virol. 2018;92:e01157–18. doi: 10.1128/JVI.01157-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Rao S, et al. Genes with 5’ terminal oligopyrimidine tracts preferentially escape global suppression of translation by the SARS-CoV-2 Nsp1 protein. RNA. 2021;27:1025–1045. doi: 10.1261/rna.078661.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Meyuhas O, Kahan T. The race to decipher the top secrets of TOP mRNAs. Biochim. Biophys. Acta. 2015;1849:801–811. doi: 10.1016/j.bbagrm.2014.08.015. [DOI] [PubMed] [Google Scholar]
  • 48.Brennecke P, et al. Accounting for technical noise in single-cell RNA-seq experiments. Nat. Methods. 2013;10:1093–1095. doi: 10.1038/nmeth.2645. [DOI] [PubMed] [Google Scholar]
  • 49.Vallejos CA, Richardson S, Marioni JC. Beyond comparisons of means: understanding changes in gene expression at the single-cell level. Genome Biol. 2016;17:70. doi: 10.1186/s13059-016-0930-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Slovin S, et al. Single-cell RNA sequencing analysis: a step-by-step overview. Methods Mol. Biol. 2021;2284:343–365. doi: 10.1007/978-1-0716-1307-8_19. [DOI] [PubMed] [Google Scholar]
  • 51.Grün D, van Oudenaarden A. Design and analysis of single-cell sequencing experiments. Cell. 2015;163:799–810. doi: 10.1016/j.cell.2015.10.039. [DOI] [PubMed] [Google Scholar]
  • 52.Ilicic T, et al. Classification of low quality cells from single-cell RNA-seq data. Genome Biol. 2016;17:29. doi: 10.1186/s13059-016-0888-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Zheng GX, et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 2017;8:14049. doi: 10.1038/ncomms14049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Qi R, Ma A, Ma Q, Zou Q. Clustering and classification methods for single-cell RNA-sequencing data. Brief. Bioinform. 2020;21:1196–1208. doi: 10.1093/bib/bbz062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Singh KK, Chaubey G, Chen JY, Suravajhala P. Decoding SARS-CoV-2 hijacking of host mitochondria in COVID-19 pathogenesis. Am. J. Physiol. 2020;319:C258–C267. doi: 10.1152/ajpcell.00224.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Miller B, et al. Host mitochondrial transcriptome response to SARS-CoV-2 in multiple cell models and clinical samples. Sci. Rep. 2021;11:3. doi: 10.1038/s41598-020-79552-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Yang SL, et al. Comprehensive mapping of SARS-CoV-2 interactions in vivo reveals functional virus-host interactions. Nat. Commun. 2021;12:5113. doi: 10.1038/s41467-021-25357-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Lapointe CP, et al. Dynamic competition between SARS-CoV-2 NSP1 and mRNA on the human ribosome inhibits translation initiation. Proc. Natl Acad. Sci. USA. 2021;118:e2017715118. doi: 10.1073/pnas.2017715118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Schubert K, et al. SARS-CoV-2 Nsp1 binds the ribosomal mRNA channel to inhibit translation. Nat. Struct. Mol. Biol. 2020;27:959–966. doi: 10.1038/s41594-020-0511-8. [DOI] [PubMed] [Google Scholar]
  • 60.Yuan S, et al. Nonstructural protein 1 of SARS-CoV-2 is a potent pathogenicity factor redirecting host protein synthesis machinery toward viral RNA. Mol. Cell. 2020;80:1055–1066.e6. doi: 10.1016/j.molcel.2020.10.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Kamitani W, et al. Severe acute respiratory syndrome coronavirus nsp1 protein suppresses host gene expression by promoting host mRNA degradation. Proc. Natl Acad. Sci. USA. 2006;103:12885–12890. doi: 10.1073/pnas.0603144103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Burke JM, St Clair LA, Perera R, Parker R. SARS-CoV-2 infection triggers widespread host mRNA decay leading to an mRNA export block. RNA. 2021;27:1318–1329. doi: 10.1261/rna.078923.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Lee WS, Yousefi M, Yan B, Yong CL, Ooi YS. Know your enemy and know yourself - the case of SARS-CoV-2 host factors. Curr. Opin. Virol. 2021;50:159–170. doi: 10.1016/j.coviro.2021.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Liang SL, Quirk D, Zhou A. RNase L: its biological roles and regulation. IUBMB Life. 2006;58:508–514. doi: 10.1080/15216540600838232. [DOI] [PubMed] [Google Scholar]
  • 65.Zhang K, et al. Nsp1 protein of SARS-CoV-2 disrupts the mRNA export machinery to inhibit host gene expression. Sci. Adv. 2021;7:eabe7386. doi: 10.1126/sciadv.abe7386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Garneau NL, Wilusz J, Wilusz CJ. The highways and byways of mRNA decay. Nat. Rev. Mol. Cell Biol. 2007;8:113–126. doi: 10.1038/nrm2104. [DOI] [PubMed] [Google Scholar]
  • 67.Banerjee AK, et al. SARS-CoV-2 disrupts splicing, translation, and protein trafficking to suppress host defenses. Cell. 2020;183:1325–1339.e21. doi: 10.1016/j.cell.2020.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Xie Y, Ren Y. Mechanisms of nuclear mRNA export: A structural perspective. Traffic. 2019;20:829–840. doi: 10.1111/tra.12691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Nagano S, et al. TDP-43 transports ribosomal protein mRNA to regulate axonal local translation in neuronal axons. Acta Neuropathol. 2020;140:695–713. doi: 10.1007/s00401-020-02205-y. [DOI] [PubMed] [Google Scholar]
  • 70.Ratti A, Buratti E. Physiological functions and pathobiology of TDP-43 and FUS/TLS proteins. J. Neurochem. 2016;138:95–111. doi: 10.1111/jnc.13625. [DOI] [PubMed] [Google Scholar]
  • 71.Wilk AJ, et al. A single-cell atlas of the peripheral immune response in patients with severe COVID-19. Nat. Med. 2020;26:1070–1076. doi: 10.1038/s41591-020-0944-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Zhou P, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Triana S, et al. Single-cell transcriptomics reveals immune response of intestinal cell types to viral infection. Mol. Syst. Biol. 2021;17:e9833. doi: 10.15252/msb.20209833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Boratyn GM, Thierry-Mieg J, Thierry-Mieg D, Busby B, Madden TL. Magic-BLAST, an accurate RNA-seq aligner for long and short reads. BMC Bioinform. 2019;20:405. doi: 10.1186/s12859-019-2996-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Islam S, et al. Quantitative single-cell RNA-seq with unique molecular identifiers. Nat. Methods. 2014;11:163–166. doi: 10.1038/nmeth.2772. [DOI] [PubMed] [Google Scholar]
  • 76.Rostom R, Svensson V, Teichmann SA, Kar G. Computational approaches for interpreting scRNA-seq data. FEBS Lett. 2017;591:2213–2225. doi: 10.1002/1873-3468.12684. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. 1995;57:289–300. [Google Scholar]
  • 78.Hagberg, A., Schult, D. & Swart, P. Exploring network structure, dynamics, and function using NetworkX. https://conference.scipy.org/proceedings/SciPy2008/paper_2/ (2008).
  • 79.Bastian, M., Heymann, S. & Jacomy, M. Gephi: an open source software for exploring and manipulating networks. http://www.aaai.org/ocs/index.php/ICWSM/09/paper/view/154 (2009).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Peer Review File (5.3MB, pdf)
42003_2022_4253_MOESM3_ESM.pdf (489.9KB, pdf)

Description of Additional Supplementary Files

Supplementary Data 1 (276.8MB, gz)
Supplementary Data 2 (425.5MB, zip)
Supplementary Data 3 (8.4MB, gz)
Supplementary Data 4 (1.1GB, gz)
Supplementary Data 5 (27MB, zip)
Supplementary Data 6 (1.2GB, zip)
Reporting Summary (1.1MB, pdf)

Data Availability Statement

The original datasets used in the analysis can be accessed at NCBI archived under BioProject accession codes PRJNA701930, PRJNA658984, and PRJNA720321. All data generated or analyzed during this study are included in this article and its supplementary information files. The Tables and processed datasets that support the findings of this study are available at the Zenodo repository (https://zenodo.org/) under the identifier (10.5281/zenodo.7198900).

Codes that support the findings of this research are available at GitHub (https://github.com/paguties/SARS-CoV-2_scRNAseq.git).


Articles from Communications Biology are provided here courtesy of Nature Publishing Group

RESOURCES