Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection often causes severe complications and even death. However, asymptomatic infection has also been reported, highlighting the difference in immune responses among individuals. Here we performed single-cell chromatin accessibility and T cell-receptor analyses of peripheral blood mononuclear cells collected from individuals convalescing from COVID-19 and healthy donors. Chromatin remodelling was observed in both innate and adaptive immune cells in the individuals convalescing from COVID-19. Compared with healthy donors, recovered individuals contained abundant TBET-enriched CD16+ and IRFl-enriched CD14+ monocytes with sequential trained and activated epig- enomic states. The B-cell lineage in recovered individuals exhibited an accelerated developmental programme from immature B cells to antibody- producing plasma cells. Finally, an integrated analysis of single-cell T cell-receptor clonality with the chroma- tin accessibility landscape revealed the expansion of putative SARS-CoV-2- specific CD8+ T cells with epigenomic profiles that promote the differentiation of effector or memory cells. Overall, our data suggest that immune cells of individuals convalescing from COVID-19 exhibit global remodelling of the chromatin accessibility landscape, indicative of the establishment of immuno- logical memory.
Main
The global coronavirus disease 2019 (COVID-19) pandemic caused by the SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) has resulted in over 70 million recorded cases with a mortality rate of ~2.3% 1. To date, the main strategy for ending the pandemic involves the development of effective vaccines, many of which are being evaluated2, 3. Although most leading vaccine candidates can elicit virus-neutralizing antibodies, there is considerable uncertainty on how protective or durable these responses might be. Thus, a greater understanding of the innate and adaptive immune responses of individuals convalescing from COVID-19 is needed, particularly knowledge on whether immunological memory is established.
To date, studies on the immune responses to SARS-CoV-2 have mainly focused on the aspects of pathological inflammation 3–5. According to single-cell transcriptome analyses, monocytes from the peripheral blood or the bronchoalveolar lavage fluid of COVID-19 patients are characterized by high levels of pro-inflammatory cytokines 5–8 such as IL-6, IL-1β, and TNFα. In addition, most of the patients with COVID-19 develop lymphocytopenia —especially reduced numbers of CD4+ and CD8+ T cells—within 2 weeks of disease onset, contributing to increased serum IL-6 levels. Lymphocytopenia is also predictive of disease severity9–11, indicating that CD4+ and CD8+ T cells are critical for viral control and disease recovery. Moreoever, single-cell transcriptomic analyses showed an increased ratio of plasma cells in patients infected with COVID-19 and early convalescent individuals7, 12.
The epigenomic regulation of immune responses to primary SARS-CoV-2 infection remains unknown at present. Here, we applied single-cell transposase-accessible chromatin with sequencing (scATAC-seq) to analyze chromatin remodeling in the peripheral immune cells of six matched and uninfected healthy donors (five donors collected in this study and an additional dataset from a public database13) and ten individuals convalescing from moderate or severe COVID-19 infection at 4–12 weeks following recovery. The approach of 10x-based scATAC-seq enables the production of high-quality single-cell accessible chromatin profiles at massive scale across all major immune-cell types in peripheral blood mononuclear cells (PBMCs). For the T cell analysis, we developed a single-cell method that combines single-cell T cell-receptor (TCR) sequencing (scTCR-seq), fluorescence-activated cell sorting (FACS) with index sorting, and ATAC-seq—termed TCR-FACS-index-ATAC sequencing (Ti-ATAC- seq)—thereby allowing us to obtain expression data for cell-surface markers, paired TCR sequences, and the chromatin accessibility landscape for each cell analysed. This single-cell chromatin landscape approach facilitates a comprehensive understanding of how effector or memory cells are established in both the innate and adaptive immune responses of individuals convalescing from COVID-19.
Results
Overview of the peripheral immune-cell profiling in individuals convalescing from COVID-19.
We mapped the transposase-accessible chromatin at the single-cell level by using 10x chromium scATAC-seq (10x Genomics; Fig. 1a and Supplementary Table 1 ). We collected data from 72,318 PBMCs from eight individuals convalescing from COVID-19 as well as 24,997 PBMCs from five healthy donors and one additional dataset from a public database13 as controls (Fig. 1b, Extended Data Fig. 1). Using uniform manifold approximation and projection (UMAP), we analysed the distribution of the immune-cell populations based on the gene scores of canonical lineage markers, which demonstrate the aggregate accessibility of several enhancers linked to the indicated genes—including CD14, CD16 (FCGR3A), and CEBPB for monocytes; PAX5, MS4A1 for B cells; CD8A and CD3G for CD8+ T cells; CD4 and CD3G for CD4+ T cells; and KLRB1 for natural killer (NK) cells (Fig. 1c–e). The frequencies of monocyte-lineage and effector and memory CD8+ T cells increased in individuals convalescing from COVID-19, whereas those of B-cell lineages decreased (Fig. 1f,g). We further analysed and clustered the cells of each lineage separately.
Fig. 1. Overview of the immune cell epigenomic landscape of individuals convalescing from COVID-19.
a, Outline of the two approaches used for scATAC-seq: 10x-based scATAC-seq (top) and Ti- ATAC-seq (bottom). b, Cell numbers (log10-transformed) in each sample for 10x-based scATAC- seq. c, UMAP plot showing 97,315 10x-based scATAC-seq profiles of immune cells in peripheral blood—including B, nK and T cells, monocytes and other clusters. The dots indicate individual cells and the cell-type identity is indicated by colour. d, UMAP plot showing the sample origin. e, Canonical markers overlaid on UMAP embedding, including CD14 and CEBPB for monocytes; PAX5 and MS4A1 for B cells; CD3G, CD8A and CD4 for T cells; and KLRB1 for NK cells. The UMAP plot is coloured based on the log-transformed normalized gene scores, which exhibited the accessibility of peaks linked to the indicated genes. The gene scores were calculated as log2(normalized count + 1). f, Cell-type frequencies in each sample, determined using 10x-based scATAC-seq. The colours indicate the cell type. g, Differences in the proportion of monocyte and B-cell lineages as well as effector and memory CD8+ T cells in the samples from individuals convalescing from COVID-19 (COV; n = 8 samples collected from eight individuals, one sample per individual) and healthy donors (HD; n = 4 samples, three samples collected from five HD in this study (including a pooled sample from three HD) and an additional sample from published data (GSE139369) as indicated in b and Extended Data Fig. 1a). A two-sided unpaired Student’s i-test was performed to determine the P values. The boxplots denote the median with the quartile range (25–75%), and the length of whiskers represents 1.5x the interquartile range (IQR). h, number of T cells in each sample that had both scTCR-seq and ATAC-seq data.
In addition, we explored Ti-ATAC-seq to further investigate special clonally expanded T- cell clones (Fig. 1a, Extended Data Fig. 2). We collected 2,880 αβ T cells from ten individuals convalescing from COVID-19, among which 1,574 quality-control-positive T cells had both paired TCRαβ sequences and ATAC-seq (Fig. 1h). Ti-ATAC-seq had similar performance to previously published microfluidic platform-based T-ATAC-seq14 and other joint ATAC-RNA approaches15–16 (Extended Data Fig. 2c).
Sequential trained and activated monocytes in COVID-19.
We identified and annotated 13 subclusters of monocytes (Fig. 2a). Clusters of CD14+ monocytes (clusters 11, 12 and 13) showed a high CD14 gene score. Clusters of CD16+ monocytes (clusters 3 and 4) were characterized by a high CD16 gene score. CD1c+ conventional dendritic cells (clusters 1 and 8) were defined by their high CD1c gene score. CLEC4C+ plasmacytoid dendritic cells showed high gene accessibility in the CLEC4C locus. Clusters 9 and 10, with their predominantly high KLF4 and CD68 gene scores, probably represented monocyte-derived dendritic cells (Fig. 2b and Extended Data Fig. 3a,b).
Fig. 2. Epigenomic signatures of trained and activated monocytes in individuals convalescing from COVID-19.
a, Subclustering UMAP of all monocytes (see Fig. 1c). The five clusters of CD14+ and CD16+ monocytes indicated in the legend were annotated according to sequential differentiation states; cDC, conventional dendritic cells; pDC, plasmacytoid dendritic cells and moDC, monocyte-derived dendritic cells. b, Surface markers overlaid on UMAP embedding coloured according to the log- transformed normalized gene scores. c, Differences in the proportions of clusters 3, 4, 11 and 12 in the samples of individuals convalescing from COVID-19 (COV; n = 8 samples collected from eight individuals, one sample per individual) and healthy donors (HD; n = 4 samples, three samples collected from five HD in this study (including a pooled sample from three HD) and an additional sample from published data (GSE139369)). A two-sided unpaired Student’s t-test was performed to determine the P values. The boxplots denote the median with the quartile range (25–75%), and the length of whiskers represents 1.5x the IQR. d-g, Volcano plots showing the differential TF motif accessibility using the mean TF motif accessibility in the chromVAR TF bias-corrected deviation in clusters 3 and 4 (d), 12 and 11 (e), 13 and 11 (f), and 13 and 12 (g). The P values were calculated using two-sided pairwise Wilcoxon test and the false discovery rate (FDR) was corrected using the Benjamini-Hochberg procedure.
We focused on CD14+ and CD16+ monocytes because the individuals convalescing from COVID-19 had a greater abundance of two clusters of CD14+ monocytes (clusters 11 and 12) and one cluster of CD16+ monocytes (cluster 4) than healthy individuals (Fig. 2c). We found and annotated sequential differentiation states from homeostasis to the mature immune inflammatory effector state in the COVID-19 group through differential cis- and trans-element analysis within the CD14+ and CD16+ clusters (Fig. 2a). For example, cluster 4 (CD16- activation) was associated with leukocyte activation and the regulation of cytokine production and showed high levels of activity of transcription factors (TFs) involved in IFNγ-induced rapid signatures and myeloid differentiation, including TBET, JUN, and FOSB17–20, whereas cluster 3 (CD16-maturation) was associated with hemopoiesis and the regulation of immune response (Fig. 2d, Extended Data Fig. 3c). Interestingly, cluster 4 had increased chromatin accessibility in the CCL5, PRF1, and GZMB loci (Extended Data Fig. 3a), demonstrating that IFNγ drives the differentiation and activation of CD16+ monocytes in the hyperinflammatory microenvironment of patients with COVID-19. For the CD14+ monocytes, the clusters 12 and 13 (CD14-activation and CD14-maturation, respectively) showed increased chromatin accessibility at inflammatory cytokine genes—including IL1fi, IL6, IL8, CCL2, CCL3, and CCL7 (Extended Data Fig. 3a)—and enriched TFs that represent a mature state, including FOS, JUN, and MAF (Fig. 2e–g)19, 21. Meanwhile, cluster 11 (CD14-trained) was characterized by the accessibility of genes involved in leukocyte activation (Extended Data Fig. 3c) and showed high activity of TFs involved in hematopoietic commitment and survival of monocytes, including HOXA9 and NR4A1 (Fig. 2e,f)22, 23. In addition, NR4A1 TF activity, which was enriched in cluster 11 (Fig. 2e,f), was also enriched in the activation stage of the CD16+ monocyte (Fig. 2d).
Innate immune cells have recently been shown to display immunological memory after certain infections or vaccines, also termed trained immunity24–26. To understand the developmental dynamics of these monocytes, we constructed a lineage trajectory of CD14+ based on sequential differentiation states, which progressed from trained to activation and finally to maturation, and generated ordered single cells (termed as ‘pseudotime’) based on their epigenetic similarity (Fig. 3a). The dynamic c/s-element and TF motif accessibility across the trajectory were consistent with the sequential differentiation states (Fig. 3b). For instance, the trained, activated, and mature monocytes showed sequential chromatin accessibility in the IL1β and CCL5 loci (Fig. 3b,c). The trained stage of the CD14+ lineage trajectory was accompanied by increased accessibilities of IRF1, IRF3, and IRF8 TF motifs (Fig. 3b,d,e), thus enhancing susceptibility to regulate rapid IFN-β induction and host inflammatory defences in human blood monocytes20, 27. The activation and maturation stages of these trajectories shared accessibility at TFs involved in monocyte activation and maturation for the activator protein 1 (AP-1) factors FOS and JUN (Fig. 3b,d,e)19 28 29.
Fig. 3. Epigenomic differentiation trajectory of CD14+ and CD16+ monocytes.
a, UMAP showing the lineage trajectory of CD14+ monocytes ordered based on trained, activation and maturation states. Pseudotime values were overlaid on the UMAP embedding; the smoothed line and arrow represent the visualization of the trajectory path from the spline fit. b, Heatmaps of the ordered cis-element accessibility (left) and TF motif accessibility (right) across pseudotime in the CD14+ monocytes (see Fig. 3a). The cis-element and TF motif accessibilities are indicated by the gene score and chromVAR TF-motif bias-corrected deviation, respectively. c, Aggregated single-cell genome tracks for the indicated clusters at the IL1β (left) and CCL5 (right) loci with peak co-accessibility (Co-access). The Co-access is indicated by the inferred peak-to-gene links for distal regulatory elements. Green shading indicates differential peaks within clusters. d, Violin plots showing the ChromVAR TF-motif bias-corrected deviation scores of the indicated TF regulators across clusters 3 (n = 1,003 cells), 4 (n = 1,472 cells), 11 (n = 988 cells), 12 (n = 5,719 cells) and 13 (n = 4,537 cells). The boxplots denote the medians and the quartile range (25–75%), and the length of whiskers represents 1.5x the IQR. e, TF footprints of the TBET motif in clusters 3 and 4 (left) as well as the FOS (middle) and IRF1 (right) motifs in clusters 11, 12 and 13. The Tn5 insertion bias track is shown. f, Levels of IL-β and IL-6 secreted by virus-exposed PBMCs (COV-S, n = 3 samples from three individuals that recovered from severe COVID-19; COV-M, n = 7 samples from seven individuals that recovered from moderate COVID-19; HD, n = 10 samples from ten healthy donors). The error bars indicate the mean ± s.e.m. A two-sided unpaired Student’s i-test was performed to determine the P values.
To validate this, we re-challenged PBMCs with spike-nCoV pseudovirus or hepatitis B virus for 24 h in vitro, as monocytes predominantly showed high gene scores for IL1β and IL- 6 (Fig. 1e). The PBMCs from individuals convalescing from COVID-19 secreted higher levels of IL-1β and IL-6 than those from the healthy donors (Fig. 3f), which is consistent with trained monocytes having enhanced sensitivity in response to different pathogens after an initial challenge25. There was no significant difference in the levels of cytokine secretion between the patients that were tested early (4 weeks) or late (10–12 weeks) following discharge (Fig. 3f).
Collectively, although systemic inflammation had returned to a normal and healthy level (Supplementary Table 1 ), the CD14+ and CD16+ monocytes maintained chromatin reprogramming that promotes trained immunity and thus enables a rapid inflammatory response against subsequent infection.
A facilitated B-cell developmental programme in individuals convalescing from COVID-19.
We identified ten B-cell clusters: immature B cells (clusters 6 and 7) with a high SDC1 score; naïve B cells (clusters 4 and 5) with high TCL1A, CD19, and CD20 scores; memory B cells (clusters 1, 2, and 3) with high CD27 and CD38 scores; and plasma cells (clusters 8, 9, and 10) with a high XBP1 score (Fig. 4a,b, and Extended Data Fig. 4a).
Fig. 4. Accelerators facilitated the B cell developmental programme in individuals convalescing from COVID-19.
a, Subclustering UMAP plot of all B cells (see Fig. 1c). b, Gene markers overlaid on UMAP coloured according to the log-transformed normalized gene scores for SDC1, TCL1A, CD27 and XBP1. c, Relative frequencies of the B-cell subclusters in the different samples. d, Integrative lineage trajectory of B-cell states in healthy donors (HD) and individuals convalescing from COVID-19 (COV). The smoothed line and arrow represent the visualization of the trajectory path across different states (immature, naive, memory and plasma), and the sample origins are denoted by colour. e, Heatmaps showing the ordered gene-score trajectory across pseudotime for B-cell differentiation. f, Heatmaps showing the positive TF regulators obtained from the integration of ordered TF gene scores (right) with ordered TF motif accessibility (left) across pseudotime for B- cell differentiation. Positive TF regulators are TF motifs that show high bias-corrected chromVAR TF-motif deviations that also exhibit similarly dynamic gene scores across differentiation states. g, Volcano plots demonstrating the differential TF motif accessibility using the mean TF motif accessibility in the chromVAR TF bias-corrected deviation between COV and HD individuals in the indicated B-cell states. The P values were calculated using a two-sided pairwise Wilcoxon test and the false discovery rate (FDR) was corrected using the Benjamini-Hochberg procedure. h, Schematic of differential positive regulatory TFs driving B-cell differentiation of immature B cells to plasma cells in HD and COV individuals.
Although the individuals convalescing from COVID-19 and healthy donors had comparable frequencies of B-cell subsets (Fig. 4c), the UMAP projection revealed preferential cell distribution in each B cell subset between healthy donors and individuals convalescing from COVID-19 (Fig. 4d and Extended Data Fig. 4b). The transcriptional regulation of the B-cell developmental programme in healthy donors has been extensively studied30, 31. However, the B-cell lineage trajectories based on accessible chromatin differed between the healthy donors and individuals convalescing from COVID-19 (Extended Data Fig. 4b–d), suggesting that SARS-CoV-2 infection induced different B-cell developmental programmes.
To investigate this, we reconstructed the lineage trajectory of B cells from healthy donors and individuals convalescing from COVID-19 (Fig. 4d). We observed c/s-elements near known regulators of every stage of B-cell development—such as TCL1A, CXCR4,MEF2C, BHLHE41, BAFF, PLCG2, PAX5, EBF1, and RORA (Fig. 4e, Extended Data Fig. 5a)30–34—suggesting that the integrated B-cell lineage trajectory was a well-defined developmental program that could be used to compare regulatory mechanisms between the healthy donors and individuals convalescing from COVID-19. Next, through integrated chromVAR TF deviations with similarly dynamic gene scores across differentiation states, we identified positive TF regulators with sequential activities of BCL11A, IRF8, PAX5, REL, BATF, IRF4, EBF1, POU2F2, TBET, and LEF1 that promote B cell commitment, differentiation, maintenance, and class switch recombination30, 35–39, thus resolving the integrated timing of TF activity for comparison (Fig.4f and Extended Data Fig. 5b–d).
We then measured the TF deviation scores and variations to identify the differential positive TF regulators for four B-cell subsets of the healthy donors and individuals convalescing from COVID-19 (Fig. 4g). Interestingly, NF-kB subunits, including REL, RELA, and RELB, which are involved in germinal center B-cell maintenance and homeostasis40, 41, were enriched in four subsets of B cells of the healthy donors (Fig. 4g, and Extended Data Fig. 5d). Meanwhile, AP-1 factors, including FOS and JUN, which are involved in B-cell receptor signaling pathway and indicate B-cell differentiation and activation42, were enriched in naïve, memory, and plasma B cells in individuals convalescing from COVID-19. Transcription factors—including SPI1, EBF1, IRF4, and POU2F2—that are vital for B-cell survival, differentiation, and receptor signaling responses were enriched in naïve B cells of individuals convalescing from COVID-1936, 38, 43–46. In comparison to the memory and plasma B cells of healthy donors that showed a high REL deviation score, the plasma cells and a subset of memory B cells of the COVID-19 group showed high activity of TFs that are involved in class switch recombination and promote specialized immune function in class-specific IgG+ memory B cells, including TBET and BATF39, 47(Fig. 4g, Extended Data Fig. 5d). We also detected specific IgG antibodies of the spike protein of SARS-CoV-2 in the blood samples of the individuals convalescing from COVID-19 (Extended Data Fig. 5e). Collectively, according to the results of integrated timing analysis of TF activity, differential positive TF regulators promoted B-cell maintenance and homeostasis in healthy donors, whereas they facilitated B- cell activation, differentiation, and IgG class-switch recombination in the individuals convalescing from COVID-19 (Fig. 4h).
CD8+ T cell-fate decisions in individuals convalescing from COVID-19.
We first sub-grouped NK and T cells into 14 subclusters (Fig. 5a, and Extended Data Fig. 6a–c). The CD8+ T cell states consisted of naïve, intermediate, memory and effector T cells as well as mucosal- associated invariant T cells (MAIT), whereas the CD4+ T-cell states were composed of naïve, central memory (TCM) and effector memory (TEM) T cells. In contrast to healthy donors enriched in naïve CD8+ T cells, the individuals convalescing from COVID-19 showed increased number of effector and memory CD8+ T cells (Fig. 1h and 5b). Thus, clusters 6 and 8 may represent SARS-CoV-2-induced effector and memory CD8+ T cells, respectively.
Fig. 5. Single-cell epigenomic profiles of T cells in individuals convalescing from COVID-19.
a, Subclustering UMAP of nK and T cells (see Fig. 1c). For the CD8+ T cells, naive cells had high TCF7 and CCR7 gene scores; effector cells had high CD8A, TBX21 (TBET) and IFNG gene scores; memory cells had high CD8A, TBX21 and KLF2I13 gene scores but low effector gene scores; and MAIT cells had high SLC4A10 gene scores. For the CD4+ T cells, naive cells had high TCF7 and CCR7 gene scores; TCM cells had high CD4, CCR7, AQP3 and SELL gene scores; and TEM cells had high CD4 and effector gene scores but low CCR7 gene scores. The nK cells had high NCAM, KLRC1, KLRD1 and FCGR3A gene scores; and nKT cells had high NCAM and CD3E gene scores.b Cell-type frequencies in the different samples with the cluster identities indicated. c, Lineage trajectory of CD8+ T-cell states, which included naive, intermediate, effector and memory CD8+ T cells. Pseudotime values overlapped on the UMAP. d, e, Pseudotime heatmap showing the ordered cz’s-element accessibility (left) and TF motif accessibility (right) in effector (d) and memory (e) CD8+ T-cell lineage trajectories (see c). f, Aggregated single-cell genome tracks for the indicated clusters and states of CD8+ T cells at the GZMB (top) and IFNG (bottom) loci with peak co-accessibility (Co-access). The regions shaded in blue indicate differential peaks within clusters. g, Ridge plots showing TF deviation scores across different clusters and states of CD8+ T cells. h,i, Gene scores of IFNG (left) and GZMB (right; h), as well as the gene score (left) and TF deviation score of TBET (right; i) in the indicated cell types. COV, individuals convalescing from COVID- 19; HD, healthy donors; COV-CD8-effector, n = 12,353 cells; COV-CD8-memory, n = 1,313 cells; COV-CD8-intermediate, n = 2,737 cells; COV-CD8-naive, n = 1,858 cells; HD-CD8-effector, n = 2,052 cells; HD-CD8-memory, n = 279 cells; HD-CD8-intermediate, n = 1,408 cells; HD-CD8- naive, n = 1,272 cells. The boxplots denote the medians and the quartile range (25–75%), and the length of whiskers represents 1.5x the IQR.
We next orchestrated the effector and memory CD8+ T-cell trajectories to explore CD8+ T cell-fate decisions. The effector and memory trajectories almost overlapped at the early state and diverged into two distinct branches, that were predominantly comprised of cells from the individuals convalescing from COVID-19 (Fig. 5c). The analysis of cis elements near effector genes revealed distinct regulatory patterns of accessibility across the pseudotime of effector and memory CD8+ T cells (Fig. 5d,e). For example, the cis-element accessibility in the promoter and distal enhancers in the GZMB and IFNG loci gradually increased from the naïve to the intermediate and then to the memory and effector states (Fig. 5f), demonstrating that the expression of effector genes was accurately regulated by a distinct cis-element network in the different states.
The results of TF activity analysis demonstrated the shared and unique TF programmes across the pseudotime of effector and memory CD8+ trajectories in the individuals convalescing from COVID-19 (Fig. 5d,e). For instance, in the first stage of trajectories from the naïve to the intermediate state, effector and memory CD8+ T cells shared accessibility at the AP-1 factors FOS and JUNB, consistent with their roles in T-cell differentiation and activation14, 31; however, effector CD8+ T cells also showed accessibility at NFKB1/2 and RUNX3, whereas memory CD8+ T cells were characterized by accessibility of BATF. IRF4 activity gradually increased across the effector CD8+ T cell trajectory but was depleted in that of memory CD8+ T cells (Fig. 5g, and Extended Data Fig. 6d), which is consistent with their role in limiting the development of memory T cells48. In contrast, BATF motifs showed accessibility earlier in the trajectory of memory CD8+ T cells than that of effector cells (Fig. 5d,e). Similarly, the second stage of trajectories from the intermediate state to either the effector or memory state identified the pivotal roles of EOMES and TBET motifs in each pathway (Fig. 5d, e, g). Effector CD8+ T cells were also characterized by accessibility of SREBF1/2 (Fig. 5d,g), which is involved in the metabolic reprogramming of effector T cells during extensive clonal expansion49. Memory CD8+ T cell commitment was accompanied by accessibility of motifs involved in promoting memory T-cell trafficking and maintenance of Krüppel-Like Factor 2/13 (KLF2/3; Fig. 5e,g)50– 53. Notably, compared with healthy donors, we also observed gradual increased gene accessibility of GZMB, IFNG, and TBET in the intermediate, memory and effector CD8 T cells of individuals convalescing from COVID-19 (Fig. 5h,i).
Furthermore, we performed TCR clonality analysis using the paired TCR-αβ sequences generated from Ti-ATAC-seq. The clonal expansion rates of CD8+ T cells were significantly higher than those of CD4+ T cells of individuals convalescing from COVID-19 (Fig. 6a,b). Interestingly, CD8+ T cells were strongly dominated by one or two T cell clones in each patient, and the clonal expansion rate of the largest clone ranged from 2.6% to 41.1% of the total CD8+ T cells (Fig. 6a). These results suggest that these putative SARS-CoV-2-specific CD8+ T-cell clones, rather than CD4+ T cells, play a critical role in viral control and long-term immune protection. We next integrated TCR clonality with single-cell epigenomic profiles generated from Ti-ATAC-seq. Notably, on comparison with unexpanded CD8+ T cell clones, the two largest clonally expanded CD8+ T-cell clones were particularly enriched in TBET and EOMES TFs (Fig. 6c). This is consistent with our 10x-scATAC-seq data showing that these two TFs are important for the development of effector and memory CD8+ T cells in individuals convalescing from COVID-19 (Fig. 6d).
Fig.6. Single-cell epigenomic and TCR profiling of CD8+ T cells in individuals convalescing from COVID-19.
a, Distribution of CD4+ (top) and CD8+ (bottom) T-cell TCR clonotypes, according to size, from ten individuals convalescing from COVID-19 (COV). The full-length paired TCR sequences were obtained from the Ti-ATAC-seq platform-based scTCR-seq. For the expanded TCRs with clonotype size > 1, each slide indicates a unique clonotype. Single viable DAPI-CD3+TCRαβ+ T cells were gated and sorted, and the T-cell types were identified by FACS-indexed sorting. The CDR3α- CDR3β sequences and frequencies of the largest CD8+ T-cell clonotypes are shown. b, Clonal expansion rate of CD4+ and CD8+ T cells in ten individuals convalescing from COVID-19. Each data point indicates a single individual and the matched data of the CD4+ and CD8+ T cells are linked by distinct lines. A two-sided unpaired Student’s i-test was performed to determine the P value. c, Ranked TF deviation enrichment values in the aggregated largest (left; TCR clone: CLVGGGDnTDKLIF-CASSQDRFYEQYF, n = 50 cells) and second-largest clonally expanded CD8+ T cells (right; TCR clone: CAVGDTDKLIF-CASSLGSLGGGELFF, n = 24 cells) versus aggregated unexpanded CD8+ T cells (n = 298 cells). The T-cell clonality and single-cell epigenomic profiles were obtained from the Ti-ATAC-seq platform. The TF enrichment values were calculated as the difference in the mean TF deviation between the two single-cell populations. d, Anti-SARS-CoV-2 CD8+ T-cell response. Schematic of the differential regulatory TFs driving CD8+ T-cell differentiation from the naive state to the effector-cell or memory state in individuals convalescing from COVID-19.
In summary, our in-depth analysis of the epigenomic landscape and single-cell TCR clonality in individuals convalescing from COVID-19 revealed pivotal roles for effector CD8+ T cells in initial viral control and the formation of memory CD8+ T cells via global chromatin accessibility remodeling and accurate regulatory programmes.
Discussion
SARS-CoV-2 causes severe pulmonary disease and complications with significant morbidity and mortality1. Our current understanding of the epigenomic regulatory mechanisms in the host immune response to SARS-CoV-2 infection and long-term immune protection is limited; thus, it is challeging to develop urgent therapeutics or assess the effect of different vaccine candidates.
Here, we utilized high-throughput 10x-based scATAC-seq technology to measure the chromatin accessibility of all PBMCs, including monocytes, B cells, NK cells, and T cells. We also used Ti-ATAC-seq to further investigate special clonally expanded T-cell clones with paired TCRaP sequencing and responded chromatin accessibility from each T cell. Chromatin remodeling was significantly altered in almost all immune-cell compartments in the individuals convalescing from COVID-19. Our epigenomic profiles revealed a sequential differentiation state from homeostasis to the mature immune inflammatory effector response in CD14+ and CD16+ monocytes of individuals convalescing from COVID-19, which featured trained, activation, and maturation states. The trained and activation states of CD14+ and CD16+ monocytes were dominantly enriched in individuals convalescing from COVID-19. Although immune memory is a well-known feature of the acquired immune system, the activation of the innate immune system can also heighten responsiveness to subsequent triggers, which may provide protection during the early stage of reinfection. In B-lineage cells, we found substantial differences in the TF regulators of each state between the individuals convalescing from COVID-19 and healthy donors. Furthermore, B-cell lineage trajectories revealed an accelerated developmental programme from immune B cells to antibody-producing plasma cells in COVID-19. Integrated analysis of single-cell TCR clonality with the single-cell chromatin accessibility landscape showed the dramatic clonal expansion of CD8+ T cells in individuals convalescing from COVID-19 as well as the bifurcation of cell-fate decisions for the putative SARS-CoV-2-specific clonally proliferating CD8+ T cells. This is consistent with effector CD8+ T cells in initial viral control and memory CD8+ T cells in long-term immune protection.
Consistent with our data, the reported scRNA-seq profiles of PBMCs identified the abundant IL-1+ CD14++ monocytes and activated monocytes in individuals convalescing from COVID-197. However, our scATAC-seq profiles revealed further epigenetic changes in individuals convalescing from COVID-19, which can be attributable to the fact that epigenetic changes are more long-lived compared with transcriptional differences.Trained immunity caused by epigenetic and metabolic reprogramming is a “double-edged sword”. Although it offers broad benefits for the host immune defence against pathogens, it identifies potentially detrimental outcomes in immune-mediated and chronic inflammatory diseases24‘ 25. In the context of COVID-19, persistent infection with SARS-CoV-2 results in long-term chromatin accessibility reprogramming, which permits monocytes to remain in a ‘trained’ functional state. Well-controlled trained immunity is likely to protect against subsequent infection25. The effect of cytokine storms on trained immunity should be studied in the future.
Previous studies of immune responses to SARS-CoV-1 and Middle East respiratory syndrome coronavirus (MERS-CoV) can partially offer insights into SARS-CoV-2 memory immunity. Similar to SARS-CoV-2, SARS-CoV-1 and MERS-CoV induced strong inflammatory responses and associated lymphopenia9. Furthermore, long-term immune protection has been observed in patients that have recoveed from SARS-CoV-1 and MERS-CoV and is attributed to virus-specific antibody responses and long-lived virus-specific memory T-cell responses2, 54, 55, which is consistent with our observations on the chromatin reprogramming of B cells and CD8+ T cells. Recent studies on memory immune protection against SARS-CoV-2 reinfection, including a rhesus macaque reinfection model and COVID-19 vaccines under phase I, II and III clinical trials, suggest that primary infection with SARS- CoV-2 and vaccines against SARS-CoV-2 can provide partial immune protection56–59.
The mechanisms of memory immunity—including that in trained and activated monocytes, accelerated B cells and putative SARS-CoV-2-specific clonally expanded effector and memory CD8+ T cells—to SARS-CoV-2 may be induced by cytokine storms, such as the high production of IFNγ in the serum. As we observed, IFNγ-induced TBET was significantly enriched in the activation state of trained monocytes, B cell acceleration, and CD8+ T cell fate decisions, which is consistent with its role in monocyte activation and maturation, IgG+ memory B cell development and effector and memory T cell differentiation17, 20, 24, 31, 39, 60–62. Therefore, the epigenomic regulation of innate and adaptive immune memory responses we demosntrated may not be specific for SARS-CoV-2 and could be elicited following other infections, such as SARS-CoV-1 and MERS. Future studies comparing epigenetic changes among convalescing individuals infected with SARS-CoV-2 and other viruses would be valuable.
Overall, our broad analysis of the epigenomic landscape and TCR profiling demonstrated that individuals convalescing from COVID-19 established immune memory formation and trained immunity via the global remodeling of the chromatin accessibility landscape. The stability of these changes over a longer period requires further study.
METHODS
Subjects and specimen collection.
We collected blood from five healthy donors, seven convalescent individuals who have recovered from moderate COVID-19, and three convalescent individuals who have recovered from severe COVID-19. They were enrolled at the Fifth Medical Center of PLA General Hospital in May 2020. PBMCs were isolated using Ficoll solution according to the manufacturer’s instructions. We defined recovered patients as COVID-19 convalescent individuals who were diagnosed by real-time RT-PCR at admission, have recovered, and had been discharged from the hospital for least one month. Convalescent individuals were classified into mild and severe groups based on the Fifth Revised Trial Version of the Novel Coronavirus Pneumonia Diagnosis and Treatment Guidance. Healthy donors, included three women and two men, ranging from 18–55 years old, were defined by negative test results in real-time RT-PCR analysis of SARS-CoV-2 gene and IgG antibody detection. This study and study protocol was approved by the Ethics Committees of the Fifth Medical Center of Chinese PLA General Hospital, Beijing, China (2020005D) on healthy volunteers or COVID-19 convalescent individuals in Medical Research, and written informed consent was obtained from all participants. All the participants recieved the participants compensation. The clinical features of these convalescent individuals and healthy donors are listed in Supplementary Table 1.
Single T cell index sorting.
Human PBMCs were thawed after storage in liquid nitrogen and allowed to recover overnight in R10 culture medium (RPMI1640 medium supplemented with 10% fetal calf serum (FCS), 25 mM HEPES, 1* nonessential amino acids, 50 μM β-mercaptoethanol, and 1 × penicillin and streptomycin). Cell surface staining was performed with the following antibodies: DAPI, Alexa Fluor® 700 anti-human CD3 (Clone SK7, 1:100, BioLegend, Cat #344822), FITC anti-human TCR α/β (Clone IP26, 1:50, BioLegend, Cat #306706), PerCP/Cyanine5.5 anti-human CD8 (Clone SK1, 1:200, BioLegend, Cat #344710), PE/Dazzle™ 594 anti-human CD4 (Clone A161A1, 1:200, BioLegend, Cat #357411), PE anti-human CD25 (Clone BC96, 1:200, BioLegend, Cat #302606), Brilliant Violet 650™ anti-human CD127 (Clone A019D5, 1:200, BioLegend, Cat #351326), APC anti-human CD137 (4– 1BB) (Clone 4B4–1, 1:100, BioLegend, Cat #309810), Brilliant Violet 605 anti-human CD279 (PD-1) (Clone EH12.2H7, 1:100, BioLegend, Cat #329923). Single viable DAPI-CD3+TCRaß+ T cells were immediately indexed sorted into 96-well plates for Ti-ATAC-seq analysis using BD FACS Aria III.
Preparation of 10x-based scATAC-seq libraries.
PBMCs were thawed after storage in liquid nitrogen, allowed to recovered in R10 culture medium overnight, and washed for three times with PBS buffer to remove debris and aggregated cells. The cell viability of each sample exceeded 90%. For the HD controls, we performed 10x-based scATAC-seq on PBMCs from 5 healthy donors, including a sample of mixed PBMCs from three healthy donors. Cells were lysed; nuclei were isolated; nuclei suspension were washed and counted; and the transposition reaction and nuclei barcoding were performed according to the manufacturer’s instructions. Approximately 5,000–10,000 single nuclei were collected in each sample. The protocol for library preparation and the settings for the instrument and sequencing were provided by the manufacturer (Annoroad and Berry Genomics) and are available at https://support.10xgenomics.com/single-cell-atac. We also collected scATAC-seq data from an HD control from published datasets (GSE139369) 13.
Preparation of single-cell Ti-ATAC-seq libraries.
Single antibody-indexed cells were sorted into 96-well plates with 6 pl of ATAC-RSB buffer (10 mM Tris-HCl at pH 7.5, 10 mM NaCl, 3 mM MgCl2, 0.8 U/μl RNase inhibitor) per well. Then, 1.5 μl transposition reaction mix with 0.1 pl of TTE Mix V5 (Vazyme Biotech), 50 mM TAPS-NaOH (pH 8.5), 25 mM MgCL, 50% dimethylformamide, 0.8 U/μl RNase inhibitor, and 0.2% NP40 was carefully added to each well. The transposition reaction was performed by incubating the mixture at 37 °C for 30 min, was stopped by adding 1.5 μl of the release buffer (125 mM EDTA and 10 mM Tris- HCl at pH 8.0), following which the reaction mixture was incubated at 50 °C for 30 min. Then, 1 μl of quenching buffer (187.5 mM MgCL and 10 mM Tris-HCl at pH 8.0) was added for MgCL quenching. Next, TRA and TRB chains and transposed DNA fragments and TRA and TRB chains were reverse-transcribed with 11 pl of the RT-PCR reaction mix (8.8 pl of 2× 1 Step buffer, 1.5 pl of TRA and TRB RT primer mix, 0.2 pl of nonindexed custom Nextera ATAC-seq PCR primer mix (forward: 5’-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-3’, reverse: 5’-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG-3’), and 0.5 pl of PrimeScript 1 Step Enzyme mix (Takara Bio USA) under the following RT-PCR conditions: 50 °C for 36 min; 95 °C for 15 min; and 8 cycles at 94 °C for 30 s, 62 °C for 1 min, and 72°C for 1 min. Immediately, the RT-PCR product was equally divided into two 96-well plates for amplification and the barcoding of transposed DNA fragments and TRA and TRB chains, which were performed as previously described.63 Briefly, 1.25 pM of dual-index Nextera ATAC-seq PCR primers with 1x TruePrep Amplify reaction mix (Vazyme Biotech) was prepared, and ATAC- seq amplification and barcoding were performed under the following PCR conditions: 72 °C for 5 min, 98 °C for 30 s, and 15 cycles of 98 °C for 15 s and 63 °C for 30 s. Finally, scTCR- seq and scATAC-seq libraries were respectively pooled and purified on a single Zymo DNA Clean and Concentrator 5 column. Fragment length selection was performed (0.5x/1.5x for scATAC-seq and 300–400 bp for scTCR-seq). The scTCR-seq and scATAC-seq libraries were 150-bp paired-end sequenced on a HiSeq X Ten platform (Illumina) with the sequencing read length and dual indexing according to the manufacturer’s instructions (Annoroad).
Data processing of single-cell TCR-seq libraries.
scTCR-seq sequencing data were analyzed as previously described63, 64. Briefly, raw reads were unpacked and joined by shared regions and demultiplexed using a custom pipeline. TCR V, D, and J segments were analyzed and assigned by MiXCR65.
Primary data preprocessing of scATAC-seq libraries.
We used a previously described workflow with minor modifications14. Briefly, a sample name was added to the header of each read; adapter sequences were trimmed using Cutadapt; reads were mapped to the hg38 human genome with Bowtie2; and those aligned to the mitochondria were removed. The resulting cleaned single-cell files (./bam files) of each COVID-19 patient were merged and sorted by read name by using SAMtools. Finally, the resulting merged file of each COVID-19 patient was converted to a fragment file; the Tn5 insertion site was adjusted; and unique nuclear fragments were retained using BEDtools. Then, we calculated the TF deviation of cells by using chromVAR66.
For the preprocessing of the 10x scATAC sequencing data, we used cellranger-atac count (cellranger-atac, v1.2.0) to generate single-cell accessibility counts for each single library (https://support.10xgenomics.com/single-cell-atac/software/pipelines/latest/using/count). Reads were aligned to the hg19 human genome. Similarly, we created an Arrow file for each resulting fragment file and included those single-cell libraries with at least 1,000 unique fragments and a TSS enrichment of 6 by using the ArchR package67 in R statistical environment (v3.5.1). Finally, we created an ArchRProject for downstream analysis by combining all the Arrow files of 10x-scATAC-seq.
Dimensionality reduction and clustering analysis.
We used the addlterativeLSI function of ArchR to perform iterative Latent Semantic Indexing (LSI)31, 67. Then, we used the harmony algorithm to correct the batch effect differences68 and added clusters by using the addClusters function. We ran UMAP by using the addUMAP function and plotted the results by using the plotEmbedding function in ArchR67.
Identification of marker features and differential analysis for clusters.
We created pseudo-bulk replicates by using the addGroupCoverages function for each cluster and called peaks by using findMacs2 function. We used the addDeviationsMatrix function to compute per-cell deviations across all motif annotations to create the deviation matrix ‘MotifMatrix.’ Then, we used the getMarkerFeatures function with bias parameter to account for TSS enrichment and the number of unique fragments per cell to identify the markers of clusters, including peaks, genes (based on gene scores), and transcription factor motifs (based on chromVAR deviations). We applied the getMarkers function to obtain the marker list of each cluster, the addlmputeWeights function to impute the weights of marker features, and the plotEmbedding function to visualize the marker features. We performed pairwise comparison of peaks, genes, and TFs for the indicated two clusters by using the getMarkerFeatures function and plotted an MA or volcano plot by using the markerPlot function in ArchR67.
Plotting browser tracks of cis-element coaccessibility.
Coaccessibility represents the correlation between accessibility peaks across many single cells and is useful for identifying cell type-specific peaks. We used the addCoAccessibility function to examine coaccessibility in ArchR67, which returned a loop track that represented the coaccessibility information by using the getCoAccessibility function. Finally, we plotted the genome browser tracks of peaks and coaccessibility by using the plotBrowserTrack function.
TF footprinting.
We performed TF footprinting to precisely predict the binding location of a TF at a particular locus as previously described31, 67 using ArchR. The Tn5 bias signal was normalized using the Divide strategy.
Trajectory construction.
We performed cellular trajectory analysis as previously described using ArchR67. First, we defined the trajectory backbone of cell groups or clusters based on the cell differentiation states, such as the order of naive, intermediate, and effector CD8+ cells in the effector CD8+ T cell trajectory. Then, we created a trajectory by using the addTrajectory function and plotted the pseudotime values on UMAP embedding using the plotTrajectory function. We next plotted pseudotime heatmaps of TFs, gene scores, and peak accessibility by using the plotTrajectoryHeatmap function. To identify positive TF regulators, we performed an integrative analysis of gene scores and motif accessibility across pseudotime by using the correlateTrajectories and plotTrajectoryHeatmap functions.
Statistics and reproducibility.
Statistical analysis was performed in GraphPad Prism (v6) or R (version 3.5.1). For the frequencies of the identified clusters and virus rechallenged assay, a two-sided unpaired Student’s t test were performed, and P value are reported. For the differential TF motif accessibility, differential cis-element accessibility and GO analysis, multiple testing corrections were performed, and P values were calculated using a two-sided pairwise Wilcoxon test, and the false discovery rate (FDR) was corrected using the Benjamini- Hochberg procedure. No data were excluded from the analysis.
Reporting summary.
Reporting Summary Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability.
Data are deposited in the Sequence Read Archive (SRA) under the accession PRJNA718009. Data are also deposited in the Genome Sequence Archive for Human (GSA- Human) under the accession HRA000562. Source data are provided with this study. The minimum dataset is posted on GitHub (https://github.com/YouMaojun/COVID19-scATAC-seq). There are no restrictions on data availability or use. Previously published data were reanalyzed here are available under accession GSE139369, GSE117089, GSE126074, GSE107817.
Code availability.
All of the R packages that were used are available online, as described in the Methods. Customized Code used in this study is posted on GitHub (https://github.com/Yang-Chen-Lab-co/COVID19-scATAC-seq).
Extended Data
Extended Data Fig. 1. Quality control for the 10x-based single-cell ATAC-seq datasets.
a, Table showing the information of 10x scATAC-seq based samples. b, the transcriptional start site (TSS) enrichment score compared with the number of unique ATAC-seq nuclear fragments in each single cell that passed. c, nucleosomal periodicity fragment lengths of aggregate single-cell profiles of 10x scATAC-seq based samples. d, the enrichments of normalized Tn5 insertions around the TSSs of 10x scATAC-seq based samples. e, ChromVAR TF motif bias-corrected deviations overlaid on UMAP projection of 10x-scATAC-seq-based single PBMCs as showed in Fig. 1c. f, Heatmap showing the 3,967 differentially accessible genes (FDR = < 0.05 and fold change > =2) in various clusters of different sample origin (HD or COV) as indicated in Fig. 1c.
Extended Data Fig. 2. Workflow and quality control for the Ti-ATAC-seq datasets.
a, Workflow of plated-based single T cell Ti-ATAC-seq. Briefly, single DAPI-CD3+TCRαβ+ T cells were FACS index sorted into 96-well plates, transposed as single cells with the Tn5 transposase, reverse transcribed to TRA and TRB mRnA, and then the ATAC-seq product and TCR-seq product were split for further scATAC-seq and scTCR-seq analyses. b, Shown are the gating strategy to gate and sort single viable DAPI-CD3 + TCRαβ + T cells into 96-well plates. c, number of unique ATAC-seq fragments for Ti-ATAC-seq (this study, n = 1,574 cells), T-ATAC-seq (n = 300 cells, GSE107817), SnARE-seq (n = 1048 cells, GSE126074), and sci-CAR-seq (n = 1000 cells, GSE117089). Boxplots show the medians and the quartile range (25% and 75%), and 1.5x interquartile ranges (IQRs) of the lengths of whiskers. d, TSS enrichment score compared with the number of unique ATAC-seq nuclear fragments in each single cell that passed. e, nucleosomal periodicity fragment lengths of aggregate single-cell profiles of Ti-ATAC-seq based samples. f, the enrichments of normalized Tn5 insertions around the TSSs of Ti-ATAC-seq based samples. g, the read counts for TRA and TRB sequences of each single T cell TCR clone in Ti-ATAC-seq-based scTCR-seq data. h, TRA and TRB dominance of the top clone for each single T cell TCR clone in Ti-ATAC-seq-based scTCR-seq data. Each dot represents a dominant TCR clone of a single T cell.
Extended Data Fig. 3. Differentially accessible genes in various monocyte clusters.
a, UMAP plot of monocytes as indicated in Fig. 2a coloured by log-normalized gene scores. b, Heatmap showing the 5,868 differentially accessible genes (FDR = < 0.05 and fold change > =2) in various monocyte clusters as indicated in Fig. 2a. c, Left: MA plots showing the differential ATAC-seq peaks (red or blue dot: log2 fold change > = 1 and false discovery rate (FDR) = < 0.05) for the indicated clusters including cluster C11 versus C13, cluster C4 versus C3. Right: Bar charts showing the top GO biological process pathways of differential peaks enriched in each cluster using GREAT analysis.
Extended Data Fig. 4. Lineage trajectory of B cells in healthy donors and individuals convalescing from COVID19 respectively.
a, Aggregated single-cell genome tracks for clusters as indicated in Fig. 4a at the TCL1A, CD19, CD20 (MS4A1), CD27, CD38 and XBP1 gene loci with peak co-accessibility (Co-Access). b, Lineage trajectory of B-cell states in healthy donors and COVID-19 convalescent individuals as indicated in Fig. 4a, respectively. c, Heatmaps showing the positive TF regulators by the ordered TF gene scores (right) with ordered TF motif accessibility (left) across pseudotime for B-cell differentiation in healthy donors. Positive TF regulators are TF motifs that show high bias-corrected chromVAR TF motif deviations that also exhibit similarly dynamic gene scores across differentiation state. d, Heatmaps showing the positive TF regulators by the ordered TF gene scores (right) with ordered TF motif accessibility (left) across pseudotime for B-cell differentiation in COVID-19 convalescent individuals.
Extended Data Fig. 5. Integrative lineage trajectory of B cell states in healthy donors and individuals convalescing from COVID-19.
a, Aggregated single-cell genome tracks for clusters as indicated in Fig. 4d at the TCL1A, BCL11A, BHLHE41, and LEF1 gene loci with peak co-accessibility (Co-Access). b, Single-cell gene scores of the positive TF regulators as defined in Fig. 4f across pseudotime in the B-cell trajectory. Dots represent gene scores of an individual pseudotime-ordered scATAC-seq profiles. The smoothed line and arrow represent the visualization of the trajectory path from the spline fit. c, Single-cell bias- corrected chromVAR TF motif deviation scores of the positive TF regulators as defined in Fig. 4f across pseudotime in the B-cell trajectory. d, Ridge plots showing TF gene scores (top) and bias- corrected chromVAR TF motif deviation scores (bottom) of the indicated positive TF regulators across different states and samples origins as indicated in Fig. 4d. e, Shown are the SARS-CoV-2 virus specific IgG detection assays in the 5 indicated convalescent individuals.
Extended Data Fig. 6. Differentially accessible genes in various T- and NK-cell clusters.
a, Aggregated single-cell genome tracks for clusters as indicated in Fig. 5a at the CD4, CD8A, CD8B, CD3E, CD3D, CD3G, NCAM and IFNG gene loci with peak co-accessibility (Co-Access). b, Canonical surface markers overlaid on UMAP embedding as indicated in Fig. 5a coloured by log-normalized gene scores for indicated genes. c, Heatmap showing the 3,055 differentially accessible genes (FDR = < 0.05 and fold change > =2) in various clusters as indicated in Fig. 5a. d, TF footprint of the LEF1, IRF4, and KLF4 motifs in the CD8+ T cell clusters as indicated in Fig. 5a. The Tn5 insertion bias track is shown below.
Supplementary Material
Acknowledgments
We thank all the healthy donors and COVID-19 convalescent individuals involved in this project. We also thank J.Jia. and S.Meng. for their technical assistance on FACS analysis. We would like to thank T. Niu from the HPC-Service Station of the Center for Biological Imaging, Institution of Biophysics, Chinese Academy of Science. This work was supported by grant from Chongqing International Institute for Immunology (No. 2020YJC02 to P.Y.), Strategic Priority Research Program of CAS (XDB29040102, XDB37030206 to P.Y.), National Natural Science Foundation of China (81672464 to P. Y.). Liang Chen was a funded by Cancer Research Institute (CRI) Irvington Postdoctoral Fellowship.
Footnotes
Competing Interests Statement
The authors declare that they have no conflict of interest.
References
- 1.(https://covid19.who.int/; World Health Organization, 2020).
- 2.Sariol A. & Perlman S. Lessons for COVID-19 Immunity from Other Coronavirus Infections. Immunity (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Vabret N. et al. Immunology of COVID-19: Current State of the Science. Immunity 52, 910–941 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhou Z. et al. Heightened Innate Immune Responses in the Respiratory Tract of COVID-19 Patients. Cell Host Microbe 27, 883–890 e882 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Liao M. et al. Single-cell landscape of bronchoalveolar immune cells in patients with COVID-19. Nat Med 26, 842–844 (2020). [DOI] [PubMed] [Google Scholar]
- 6.Wilk AJ et al. A single-cell atlas of the peripheral immune response in patients with severe COVID-19. Nat Med 26, 1070–1076 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wen W. et al. Immune cell profiling of COVID-19 patients in the recovery stage by single-cell sequencing. Cell Discov 6, 31 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zheng Y. et al. A human circulating immune cell landscape in aging and COVID-19. Protein Cell (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lucas C. et al. Longitudinal analyses reveal immunological misfiring in severe COVID-19. Nature (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Diao B. et al. Reduction and Functional Exhaustion of T Cells in Patients With Coronavirus Disease 2019 (COVID-19). Front Immunol 11, 827 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Del Valle DM et al. An inflammatory cytokine signature predicts COVID-19 severity and survival. Nat Med (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhu L. et al. Single-Cell Sequencing of Peripheral Mononuclear Cells Reveals Distinct Immune Response Landscapes of COVID-19 and Influenza Patients. Immunity (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Granja JM et al. Single-cell multiomic analysis identifies regulatory programs in mixed-phenotype acute leukemia. Nat Biotechnol 37, 1458–1465 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Satpathy AT et al. Transcript-indexed ATAC-seq for precision immune profiling. Nat Med 24, 580–590 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Chen S, Lake BB & Zhang K. High-throughput sequencing of the transcriptome and chromatin accessibility in the same cell. Nat Biotechnol 37, 1452–1457 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cao J. et al. Joint profiling of chromatin accessibility and gene expression in thousands of single cells. Science 361, 1380–1385 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lighvani AA et al. T-bet is rapidly induced by interferon-gamma in lymphoid and myeloid cells. Proc Natl Acad Sci U S A 98, 15137–15142 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Yi W. et al. T-bet-mediated Tim-3 expression dampens monocyte function during chronic hepatitis C virus infection. Immunology 150, 301–311 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Friedman AD Transcriptional control of granulocyte and monocyte development. Oncogene 26, 6816–6828 (2007). [DOI] [PubMed] [Google Scholar]
- 20.Langlais D, Barreiro LB & Gros P. The macrophage IRF8/IRF1 regulome is required for protection against infections and is associated with chronic inflammation. J Exp Med 213, 585–603 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kikuchi K. et al. Macrophages Switch Their Phenotype by Regulating Maf Expression during Different Phases of Inflammation. J Immunol 201, 635–651 (2018). [DOI] [PubMed] [Google Scholar]
- 22.Hanna RN et al. The transcription factor NR4A1 (Nur77) controls bone marrow differentiation and the survival of Ly6C- monocytes. Nat Immunol 12, 778–785 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ramos-Mejia V. et al. HOXA9 promotes hematopoietic commitment of human embryonic stem cells. Blood 124, 3065–3075 (2014). [DOI] [PubMed] [Google Scholar]
- 24.Netea MG et al. Defining trained immunity and its role in health and disease. Nat Rev Immunol 20, 375–388 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Netea MG et al. Trained Immunity: a Tool for Reducing Susceptibility to and the Severity of SARS-CoV-2 Infection. Cell 181, 969–977 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Saeed S. et al. Epigenetic programming of monocyte-to-macrophage differentiation and trained innate immunity. Science 345, 1251086 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li P. et al. IRF8 and IRF3 cooperatively regulate rapid interferon-beta induction in human blood monocytes. Blood 117, 2847–2854 (2011). [DOI] [PubMed] [Google Scholar]
- 28.Fontana MF et al. JUNB is a key transcriptional modulator of macrophage activation. J Immunol 194, 177–186 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Behmoaras J. et al. Jund is a determinant of macrophage activation and is associated with glomerulonephritis susceptibility. Nat Genet 40, 553–559 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Nutt SL & Kee BL The transcriptional regulation of B cell lineage commitment. Immunity 26, 715–725 (2007). [DOI] [PubMed] [Google Scholar]
- 31.Satpathy AT et al. Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion. Nat Biotechnol 37, 925–936 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Stehling-Sun S, Dade J, Nutt SL, DeKoter RP & Camargo FD Regulation of lymphoid versus myeloid fate ‘choice’ by the transcription factor Mef2c. Nat Immunol 10, 289–296 (2009). [DOI] [PubMed] [Google Scholar]
- 33.Mackay F. & Browning JL BAFF: a fundamental survival factor for B cells. Nat Rev Immunol 2, 465–475 (2002). [DOI] [PubMed] [Google Scholar]
- 34.Gutierrez A Jr. et al. LEF-1 is a prosurvival factor in chronic lymphocytic leukemia and is expressed in the preleukemic state of monoclonal B-cell lymphocytosis. Blood 116, 2975–2983 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ng AP et al. An Erg-driven transcriptional program controls B cell lymphopoiesis. Nat Commun 11, 3013 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Milanovic M. et al. Differential requirements for the canonical NF-kappaB transcription factors c-REL and RELA during the generation and activation of mature B cells. Immunol Cell Biol 95, 261–271 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Almaden JV et al. B-cell survival and development controlled by the coordination of NF-kappaB family members RelB and cRel. Blood 127, 1276–1286 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Vilagos B. et al. Essential role of EBF1 in the generation and function of distinct mature B cell types. J Exp Med 209, 775–792 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang NS et al. Divergent transcriptional programming of class-specific B cell memory by T-bet and RORalpha. Nat Immunol 13, 604–611 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.De Silva NS, Silva K, Anderson MM, Bhagat G. & Klein U. Impairment of Mature B Cell Maintenance upon Combined Deletion of the Alternative NF-kappaB Transcription Factors RELB and NF-kappaB2 in B Cells. J Immunol 196, 2591–2601 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.De Silva NS et al. Transcription factors of the alternative NF-kappaB pathway are required for germinal center B-cell development. Proc Natl Acad Sci U S A 113, 9063–9068 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ohkubo Y. et al. A role for c-fos/activator protein 1 in B lymphocyte terminal differentiation. J Immunol 174, 7703–7710 (2005). [DOI] [PubMed] [Google Scholar]
- 43.Heise N. et al. Germinal center B cell maintenance and differentiation are controlled by distinct NF-kappaB transcription factor subunits. J Exp Med 211, 2103–2118 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Willis SN et al. Environmental sensing by mature B cells is controlled by the transcription factors PU.1 and SpiB. Nat Commun 8, 1426 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Xu H. et al. Regulation of bifurcating B cell trajectories by mutual antagonism between transcription factors IRF4 and IRF8. Nat Immunol 16, 1274–1281 (2015). [DOI] [PubMed] [Google Scholar]
- 46.Ubieta K. et al. Fra-2 regulates B cell development by enhancing IRF4 and Foxo1 transcription. J Exp Med 214, 2059–2071 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ise W. et al. The transcription factor BATF controls the global regulators of class- switch recombination in both B cells and T cells. Nat Immunol 12, 536–543 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Man K. et al. Transcription Factor IRF4 Promotes CD8(+) T Cell Exhaustion and Limits the Development of Memory-like T Cells during Chronic Infection. Immunity 47, 1129–1141 e1125 (2017). [DOI] [PubMed] [Google Scholar]
- 49.Kidani Y. et al. Sterol regulatory element-binding proteins are essential for the metabolic programming of effector T cells and adaptive immunity. Nat Immunol 14, 489–499 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Weinreich MA et al. KLF2 transcription-factor deficiency in T cells results in unrestrained cytokine production and upregulation of bystander chemokine receptors. Immunity 31, 122–130 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lai D. et al. KLF13 sustains thymic memory-like CD8+ T cells in BALB/c mice by regulating IL-4-generating invariant natural killer T cells. The Journal of Experimental Medicine 208, 1093–1103 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Das H. et al. Kruppel-like factor 2 (KLF2) regulates proinflammatory activation of monocytes. Proc Natl Acad Sci U S A 103, 6653–6658 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Bai A, Hu H, Yeung M. & Chen J. Kruppel-like factor 2 controls T cell trafficking by activating L-selectin (CD62L) and sphingosine-1-phosphate receptor 1 transcription. J Immunol 178, 7632–7639 (2007). [DOI] [PubMed] [Google Scholar]
- 54.Ng OW et al. Memory T cell responses targeting the SARS coronavirus persist up to 11 years post-infection. Vaccine 34, 2008–2014 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Tang F. et al. Lack of peripheral memory B cell responses in recovered patients with severe acute respiratory syndrome: a six-year follow-up study. J Immunol 186, 7264–7268 (2011). [DOI] [PubMed] [Google Scholar]
- 56.Deng W. et al. Primary exposure to SARS-CoV-2 protects against reinfection in rhesus macaques. Science 369, 818–823 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Mulligan MJ et al. Phase 1/2 study of COVID-19 RNA vaccine BNT162b1 in adults. Nature (2020). [DOI] [PubMed] [Google Scholar]
- 58.Edwards KM Vaccines targeting SARS-CoV-2 tested in humans. Nature Medicine (2020). [DOI] [PubMed] [Google Scholar]
- 59.Jackson LA et al. An mRNA Vaccine against SARS-CoV-2 - Preliminary Report. N Engl J Med (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Intlekofer AM et al. Effector and memory CD8+ T cell fate coupled by T-bet and eomesodermin. Nat Immunol 6, 1236–1244 (2005). [DOI] [PubMed] [Google Scholar]
- 61.Stone SL et al. T-bet Transcription Factor Promotes Antibody-Secreting Cell Differentiation by Limiting the Inflammatory Effects of IFN-gamma on B Cells. Immunity 50, 1172-1187 e1177 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Li H, Wojciechowski W, Dell’Agnola C, Lopez NE & Espinoza-Delgado I. IFN- gamma and T-bet expression in human dendritic cells from normal donors and cancer patients is controlled through mechanisms involving ERK-1/2-dependent and IL-12- independent pathways. J Immunol 177, 3554–3563 (2006). [DOI] [PubMed] [Google Scholar]
- 63.Han A, Glanville J, Hansmann L. & Davis MM Linking T-cell receptor sequence to functional phenotype at the single-cell level. Nat Biotechnol 32, 684–692 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Glanville J. et al. Identifying specificity groups in the T cell receptor repertoire. Nature 547, 94–98 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Bolotin DA et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat Methods 12, 380–381 (2015). [DOI] [PubMed] [Google Scholar]
- 66.Schep AN, Wu B, Buenrostro JD & Greenleaf WJ chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat Methods 14, 975–978 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Jeffrey M Granja MRC., Sarah E Pierce S Tansu Bagdatli, Hani Choudhry, Howard Y Chang William J Greenleaf ArchR: An integrative and scalable software package for single-cell chromatin accessibility analysis. biorxiv (2020). [Google Scholar]
- 68.Korsunsky I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods 16, 1289–1296 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data are deposited in the Sequence Read Archive (SRA) under the accession PRJNA718009. Data are also deposited in the Genome Sequence Archive for Human (GSA- Human) under the accession HRA000562. Source data are provided with this study. The minimum dataset is posted on GitHub (https://github.com/YouMaojun/COVID19-scATAC-seq). There are no restrictions on data availability or use. Previously published data were reanalyzed here are available under accession GSE139369, GSE117089, GSE126074, GSE107817.












