Chromatin remodeling in peripheral blood cells reflects COVID-19 symptom severity

Nicholas S Giroux; Shengli Ding; Micah T McClain; Thomas W Burke; Elizabeth Petzold; Hong A Chung; Grecia R Palomino; Ergang Wang; Rui Xi; Shree Bose; Tomer Rotstein; Bradly P Nicholson; Tianyi Chen; Ricardo Henao; Gregory D Sempowski; Thomas N Denny; Emily R Ko; Geoffrey S Ginsburg; Bryan D Kraft; Ephraim L Tsalik; Christopher W Woods; Xiling Shen

doi:10.1101/2020.12.04.412155

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2020 Dec 9:2020.12.04.412155. Originally published 2020 Dec 5. [Version 3] doi: 10.1101/2020.12.04.412155

Chromatin remodeling in peripheral blood cells reflects COVID-19 symptom severity

Nicholas S Giroux ^1,^#, Shengli Ding ^1,^#, Micah T McClain ^2,^3,^4,^#, Thomas W Burke ², Elizabeth Petzold ², Hong A Chung ¹, Grecia R Palomino ¹, Ergang Wang ¹, Rui Xi ¹, Shree Bose ⁵, Tomer Rotstein ¹, Bradly P Nicholson ⁶, Tianyi Chen ⁷, Ricardo Henao ², Gregory D Sempowski ⁸, Thomas N Denny ⁸, Emily R Ko ², Geoffrey S Ginsburg ², Bryan D Kraft ^2,³, Ephraim L Tsalik ^2,^3,⁴, Christopher W Woods ^2,^3,^4,^*, Xiling Shen ^1,^*

PMCID: PMC7724678 PMID: 33300002

Abstract

SARS-CoV-2 infection triggers highly variable host responses and causes varying degrees of illness in humans. We sought to harness the peripheral blood mononuclear cell (PBMC) response over the course of illness to provide insight into COVID-19 physiology. We analyzed PBMCs from subjects with variable symptom severity at different stages of clinical illness before and after IgG seroconversion to SARS-CoV-2. Prior to seroconversion, PBMC transcriptomes did not distinguish symptom severity. In contrast, changes in chromatin accessibility were associated with symptom severity. Furthermore, single-cell analyses revealed evolution of the chromatin accessibility landscape and transcription factor motif occupancy for individual PBMC cell types. The most extensive remodeling occurred in CD14+ monocytes where sub-populations with distinct chromatin accessibility profiles were associated with disease severity. Our findings indicate that pre-seroconversion chromatin remodeling in certain innate immune populations is associated with divergence in symptom severity, and the identified transcription factors, regulatory elements, and downstream pathways provide potential prognostic markers for COVID-19 subjects.

One sentence summary:

Chromatin accessibility in immune cells from COVID-19 subjects is remodeled prior to seroconversion to reflect disease severity.

Introduction

Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, manifests with highly variable symptom severity (1). Infected subjects demonstrate clinical trajectories that range from remaining asymptomatic to developing life-threatening illness. The development of specific antibodies (IgG) against SARS-CoV-2 marks an inflection point in a COVID-19 patient’s disease progression, indicating a transition from innate immunity to acquired immunity (2, 3). IgG seroconversion typically occurs within two weeks of symptom onset and roughly coincides with the time that patients without critical illness will see clinical improvement (4–6). However, limited data are available examining the COVID-19 peripheral blood immune response in the context of seroconversion.

Several recent publications described bulk or single cell RNA-sequencing (scRNA-seq) to profile transcriptomic responses in immune cells of subjects with COVID-19 (7–11). Specifically, they reported suppressed immune responses in subjects with mild symptoms, as indicated by deficient expression of Type I and III interferons (12). Subjects with more severe disease demonstrated up-regulation of pro-inflammatory factors, including IL-6 and TNF-alpha (13). The number of monocytes and associated IL-6, CCL2, and CCL8 production in the peripheral blood are also elevated in subjects with severe COVID-19 (14, 15). However, no investigation of the association between molecular profiles in PBMCs prior to seroconversion and symptom severity has been performed.

To test the hypothesis that the landscape of chromatin accessibility harbors biomarkers that define early molecular mechanisms underpinning divergent immunologic responses in SARS-CoV-2 infection, we performed bulk and single-cell RNA-seq and ATAC-seq on longitudinal PBMC samples from subjects before and after seroconversion.

Results

PBMCs were collected from healthy controls, uninfected close contacts (CC), and COVID-19 subjects and profiled using bulk and single-cell RNA-seq and ATAC-seq sequencing (Fig. 1A). PBMCs from healthy subjects (n=7) were collected before the pandemic started. Infection with SARS-CoV-2 was confirmed using polymerase chain reaction (PCR) on nasopharyngeal (NP) swab samples, and serology testing for IgG against the SARS-CoV-2 spike domain was performed for each COVID-19 subject on study collection days. COVID-19 subjects included in this cohort were out-patients, and molecular profiling was performed at three timepoints, corresponding to early (IgG−), mid, and late (IgG+) acute disease within a 14-day window (Fig. 1B). At the mid timepoint, IgG seroconversion was observed in only some subjects; by the late timepoint, all subjects were observed to be IgG+ (Fig. 1B). CC subjects (n=7) were profiled at three timepoints and were PCR negative and IgG− at all timepoints. Subject symptoms were recorded for 39 symptom categories, and the total score was used to stratify the subjects into mild symptom (MS, n=7; mean score = 12.9) and pronounced symptom (PS, n=7; mean score = 33.6) cohorts. Complete subject demographics and clinical metadata are summarized in Tables S1–2.

Figure 1. — **(A)** Summary of COVID-19 subject cohorts and multi-omic profiling experimental design. **(B)** Longitudinal timepoints available for COVID-19 subject cohort and IgG serological testing data. **(C)** Differential analysis of chromatin accessibility peaks from seropositive COVID-19 subjects identifies unique epigenetic biomarkers corresponding to symptom severity. Accessibility count data were normalized to reads per kilobase per million mapped reads and converted to z-scores for each group. **(D)** Differentially accessible peaks clustered by fuzzy c-means clustering with n=12. Membership indicates similarity of a peak to the centroid for each cluster. **(E)** Gene set enrichment analysis performed for peaks with membership > 50% for each cluster. Selected significant pathways plotted by -log(adj_p). **(F)** Differential gene expression of MS and PS subjects compared to a common healthy control baseline. **(G)** Accessible chromatin biomarkers that distinguish seronegative MS and PS subjects. Significant peaks have p < 0.05 and are plotted in pink. **(H)** ATAC-PCR enrichment of PS-specific markers LGALS17A and DEFA4 relative to internal control KIF26B. ATAC-seq libraries from seronegative MS (n=7) and PS (n=7) subjects were used as input. * p < 0.05 **(I)** Transcription factor footprinting analysis comparing motif flanking accessibility in MS and PS subjects compared to healthy controls. Outliers were determined by distance from BaGFoot center point. **(J)** BaGFoot comparing motif occupancies between MS and PS subjects. Elevated flanking accessibility and increased footprint depth indicates higher transcription factor occupancy at genome-wide motifs.

Samples from each COVID-19 (MS and PS) subject’s first IgG+ timepoint were compared with healthy controls and CC subjects to determine whether differential chromatin accessibility in PBMCs can stratify subjects by symptom severity (Fig. 1C). A total of 6168 significantly differentially accessible peaks were identified that distinguished each group in a pairwise comparison. Patterns of correlated chromatin peak accessibility emerged following analysis with soft clustering, indicating trends of increased accessibility that correspond with differential symptom severity (Fig. 1D, Fig. S1A). Clusters 3 and 6 were composed of peaks with decreased accessibility in healthy controls and increased accessibility following exposure (CC) or infection (MS and PS). Conversely, cluster 10 was composed of peaks with a negative correlation between accessibility and disease severity. This observed dependence on subject symptom severity suggested that the epigenome harbors biomarkers associated with COVID-19 severity. To evaluate whether the identified clusters contribute to disease variability, we performed gene set enrichment analysis (GSEA) for each cluster of peaks (Fig. 1E). Peaks with increased accessibility in MS and PS subjects were enriched in pathways related to interleukin signaling and regulation of cell differentiation and morphology. In contrast, healthy controls had the highest accessibility in peaks that were enriched for genes related to exocytosis, transendothelial migration, and vascular adhesion.

We compared bulk RNA-seq of PBMCs from the CC, MS, and PS cohorts at the earliest IgG− timepoint to the healthy controls. Activation of interferon response genes (IFI27, SIGLEC1, IFI44L) was observed in both seronegative MS and PS subjects compared to healthy controls (Fig. 1F, S1B) (9). However, these early transcriptomic differences do not reliably distinguish between MS and PS subjects (Fig. 1F).

Regulatory chromatin containing transcription factor motifs becomes accessible prior to downstream gene expression (16). To test whether chromatin changes can better distinguish disease severity, differential chromatin accessibility analysis was applied to the earliest seronegative samples from COVID-19 subjects compared to healthy controls. A set of 443 peaks of differentially accessible chromatin identified in seronegative subjects differentiates mild from more pronounced disease severity (Fig. 1G, S1C). As proof-of-principle, ATAC-PCR was performed to confirm enrichment of peaks annotated to LGALS17A and DEFA4 that were differentially accessible in PS subjects (Fig. 1H, S1D). Identification of an association between LGALS17A, a known interferon-stimulated gene, DEFA4, a marker of neutrophil activation in COVID-19 subjects, and disease severity confirmed the observations of the differential analysis (17, 18).

Motif occupancy analysis was applied to samples from the seronegative timepoint to identify regulatory transcription factors that bind newly accessible chromatin and poise gene expression. The seronegative timepoints from MS or PS subjects were compared to healthy controls using bivariate analysis to estimate the change in flanking accessibility and footprint depth at each motif (Fig. S1E–F). A group of transcription factors motifs with increased accessibility in PS subjects compared to MS subjects was identified by comparing samples from COVID-19 subjects before IgG seroconversion to a common baseline of healthy controls, in contrast to a second group of transcription factor motifs that demonstrated similar accessibility in both MS and PS subjects (Fig. 1I). The differential occupancies in the first group identified alternative regulation of myeloid activation pathways in MS vs. PS subjects. For example, KLF and CREB transcription factor families, which are known to regulate monocyte-macrophage polarization, have elevated motif occupancy in PS subjects compared to MS (20, 21). In the second group, CPEB1 displacement in both MS and PS subjects, marked by an increase in motif flanking accessibility and decrease in footprint depth, suggests involvement of the IL-6 and NFkB signaling pathways which was not observed in healthy controls (S1E–F) (19). Direct comparison of seronegative MS and PS subjects also demonstrated enrichment of KLF and CREB motif occupancy in PS subjects and served to further characterize differential regulation of immunity in each cohort (Fig. 1J). Furthermore, MS subjects were characterized by elevated AP-1 and C/EBP motif occupancies, which are known to negatively regulate IFN-gamma (22, 23).

The robust chromatin accessibility signature that was detected in the bulk ATAC-seq datasets confirms that PBMCs undergo extensive chromatin remodeling in response to SARS-CoV-2 infection. To understand how each cell type contributes to this signature and track the evolution of the chromatin landscape of each cell type during seroconversion, we performed single-cell ATAC-seq profiling of five subjects from each of the healthy, MS, and PS cohorts (Fig. 2A, Fig. S2A–F, Table S3). Cell type annotations were transferred from complementary single-cell gene expression libraries using canonical-correlation analysis, and expression of marker genes for each cell type was confirmed (Fig. S2G, Fig. S3A–E) (see Methods). Differential transcription factor motif occupancy analysis was applied to MS and PS PBMCs to identify shared regulatory mechanisms activated before seroconversion. Increased occupancy at transcription factor motifs including the proinflammatory C/EBP family and BACH1 marked the seronegative timepoint in both MS and PS subjects and further confirmed that the chromatin landscape is being remodeled prior to seroconversion (Fig. 2B) (24, 25). Differential accessibility in motifs of SMAD3, a known regulator of TGF-beta signaling, suggests a potential molecular driver that differentiates PS and MS subjects (26). Following seroconversion, both MS and PS subjects exhibit an increased accessibility at motifs of the transcription factor KDM2B, which is known to promote IL-6 production via chromatin remodeling (27).

Figure 2. — **(A)** UMAP plot of single-cell ATAC-seq datasets generated from healthy controls, uninfected CC, MS, and PS subjects. **(B)** BaGFoot analysis comparing transcription factor motif occupancies in PBMCs from MS and PS subjects collected prior to and after IgG seroconversion. (Red circle = average of all points; dark blue circle = 50% of all data; light blue circle = 75% of all data). Motifs with the top 5% change in flanking accessibility are plotted in red. **(C-E)** Supervised trajectory analysis using CD4+ mature T cells **(C)**, CD8+ T cells **(D)**, and CD14+ monocytes **(E)** collected from both MS and PS subjects. Differential transcription factor motif enrichment, chromatin accessibility, and gene expression were correlated with seroconversion. **(F)** Transcription factor regulators with correlated gene expression and transcription factor motif accessibility identified in CD14+ monocytes.

To identify the cell types with the most significant chromatin remodeling over time, we applied a global trajectory analysis to major cell types, including CD4+ mature T cells, CD8+ T cells, and CD14+ monocytes, represented in the scATAC-seq datasets (Fig. 2C–E). Similar analysis was also applied to CD4+ naïve T cells and NK cells, which contribute to the epigenetic signal to a lesser extent (Fig. S4A–B). Cells from MS and PS subjects were pooled into early (IgG−), mid, and late (IgG+) timepoints in pseudotime using a supervised trajectory algorithm (see Methods). Transcription factor motif enrichment, chromatin accessibility, and differential gene expression analyses were then performed for each individual cell population. A monotonic trend correlated with pseudotime was observed for each trajectory, demonstrating the progression of the host response in peripheral blood cells during development of acquired immunity. The shift in differential transcription factor motif and peak accessibility associated with pseudotime preceded changes in gene expression for all cells analyzed. We identified transcription factor activators, defined as genes with highly correlated expression and motif accessibility that act as robust multi-omic biomarkers that develop over time, in CD14+ monocytes (Fig. 2F). Gene activity scores, estimated using accessibility counts, were analyzed to identify chromatin biomarkers associated with seroconversion, which showed a clear transition in CD14+ monocytes (Fig. S4C).

Comparison of pooled PBMCs collected from MS and PS subjects identified specific peaksets enriched on the day of study enrollment (IgG−) versus a timepoint 2–4 weeks later (IgG+) (Fig. S5A, Fig. 3A top). A total of 5615 peaks were identified as uniquely accessible at one of the three longitudinal (early, mid, and late) timepoints (Fig. 3A bottom). A similar experimental approach was applied to each major cell type to detect those with extensive chromatin remodeling prior to seroconversion (Fig. 3B–C, S5B). The only cell types with enrichment of significant peaks at the early, IgG− timepoint were CD14+ monocytes (and, to a much lesser extent, CD4+ naïve T cells), which are the main contributor to the IgG− signature in the PBMCs shown in Fig. 3A–B. The remaining cell types, including CD4+ mature T cells, CD8+ T cells, and NK cells, were not characterized by peaks detectable prior to seroconversion. Transcription factor motifs enriched in peaks accessible prior to seroconversion had the highest occupancy in cells from the myeloid lineage, specifically CD14+ monocytes, dendritic cells, and CD16+ monocytes (Fig. 3D–E). In contrast, motifs enriched in peaks accessible after seroconversion lack elevated occupancy in monocytes, instead showing the highest activity in B cells and plasmablasts (Fig. S5C). The enrichment of accessible transcription factor motifs established early further supports the observation that CD14+ monocytes underwent the most chromatin remodeling among PBMCs prior to seroconversion.

Figure 3. — **(A)** Top: Differentially accessible chromatin peaksets between early and late acute infection timepoints in all cells collected from both MS and PS subjects. Significance is defined as adjusted p < 0.05 and absolute LFC > 0.5. Bottom: Chromatin peaks with differential accessibility associated with time in all cells collected from MS and PS subjects. **(B)** Chromatin peaks with differential accessibility between early and late timepoints in CD14+ monocytes collected from MS and PS subjects. **(C)** Chromatin peaks with differential accessibility between early and late timepoints in CD4+ mature T cells and CD8+ T cells collected from MS and PS subjects. **(D)** Transcription factor motifs enriched in peaks specific to either early (IgG−) or late (IgG+) timepoints. **(E)** Transcription factors motifs enriched at the early timepoint have footprints with increased occupancy in CD14+ monocytes, dendritic cells, and CD16+ monocytes. **(F)** Transcription factor activators and repressors with correlated gene expression and motif accessibility associated with seroconversion in all cells. **(G)** Activator accessibility specifically enriched in CD14+ monocytes. Accessibility was overlaid on a UMAP of all cell types. **(H)** Peak-to-gene linkages computed for all cells collected from seronegative MS and PS subjects. Genes with > 10 linkages are defined as domains of regulatory chromatin (DORCs). Labeled genes are known to be regulated by a super-enhancer. **(I)** DORCs regulated by super-enhancers have increased accessibility in CD14+ monocytes. Peak-to-gene linkages are plotted with a correlation cutoff of 0.5. **(J)** Differentially accessible chromatin peaksets in all cells collected from seronegative MS and PS subjects. Significance is defined as adjusted p < 0.05 and absolute LFC > 1. **(K)** Transcription factor motifs enriched in peaks specific to either seronegative MS or PS subjects. **(L)** Seronegative MS-specific transcription factors have increased occupancy m CD14+ monocytes, dendritic cells, and CD16+ monocytes. (M) Seronegative PS-specific transcription factors have increased occupancy in pDCs, plasmablasts, and B cells.

Our analysis further identified transcription factors acting as activators or repressors, which have a positive or negative correlation between motif accessibility and gene expression, respectively, within the context of seroconversion (Fig. 3F). chromVAR deviation scores and accessibility at these activator motifs revealed an enrichment in CD14+ monocytes and other myeloid cells, including dendritic cells and CD16+ monocytes (Fig. 3F–G, Fig. S5D–E). A minority of activators (4 of 23) instead have enriched activity in B or T cells, potentially serving to poise these cells for future activation (Fig. S5D–E). These observations confirm that CD14+ monocytes are marked by a transcription factor activity signature that was established prior to seroconversion and undergo extensive chromatin remodeling during infection. Domains of regulatory chromatin (DORCs) were identified in all cell types collected from both seronegative MS and PS subjects by counting the number of peak-to-gene linkages for each gene in the scATAC-seq datasets (16). A total of 1109 genes with greater than 10 peak-to-gene linkages were defined as DORC genes, and those known to be regulated by a super-enhancer were labeled (Fig. 3H). These DORCs showed cell type-specific profiles and appeared to have higher accessibility than gene expression in these cells, suggesting that these genes are being primed for activation (Fig. S5F). Accessibility at the DORCs regulated by a super-enhancer was enriched in CD14+ monocytes and other myeloid cells, including dendritic cells and CD16+ monocytes (Fig. 3I, S5G).

We then determined the chromatin accessibility changes and corresponding cell types that discriminate seronegative MS and PS subjects. First, specific peaksets and transcription factor motifs enriched in all cells from seronegative MS and PS subjects were identified (Fig. 3J–K). MS-specific transcription factors, including subunits of AP-1, reproduce many of the same observations from bulk accessibility data comparing MS and PS subjects, and the motifs have the highest occupancy in CD14+ monocytes (Fig. 1I–J, Fig. 3K–L). In contrast, PS-specific transcription factor motifs have the highest occupancy in plasmacytoid dendritic cells (pDCs), plasmablasts, and B cells (Fig. 3K, 3M). These observations suggest that alternative regulation of PBMC chromatin by adaptive and innate immunity mechanisms is established prior to seroconversion and can be leveraged as potential biomarkers of disease severity.

CD14+ monocytes were identified as a key cell population that both responds to extensive chromatin reprogramming before seroconversion and distinguishes between subjects with mild or more pronounced symptoms. By applying a supervised trajectory algorithm and ordering monocytes from seronegative subjects in pseudotime from MS to PS, a strong correlation between differential chromatin accessibility and disease severity was identified (Fig. 4A). A differential gene expression and transcription factor motif enrichment signature distinguished CD14+ monocytes collected from MS and PS subjects (Fig. 4A). One timepoint was also profiled from six additional subjects who presented with critical symptoms (CS) and required intensive care unit (ICU) care (Table S4). Similar trends were observed in this independent cohort of IgG+, hospitalized subjects in which up-regulation of gene expression was observed in interferon response, IL-1 signaling, and neutrophil degranulation pathways (Fig. S6A–B). Additionally, we identified differential transcription factor motif enrichment and chromatin accessibility that distinguished CS subjects from MS and PS subjects (Fig. S6C).

Figure 4. — **(A)** Supervised trajectory analysis using CD14+ monocytes collected from seronegative MS and PS subjects. Differential transcription factor motif enrichment, chromatin accessibility, and gene expression were correlated with symptom severity. **(B)** scRNA-seq UMAP of all CD14+ monocytes collected from seronegative MS and PS subjects. UMAP colored by disease severity (left) and cluster number (right). **(C)** Regulon activity computed for each scRNA-seq cluster using SCENIC. Activity of the top 10 regulons is plotted as a heatmap for each cluster of cells from the UMAP. Black box indicates clusters of interest from UMAP in 4D. **(D)** scATAC-seq UMAP of all CD14+ monocytes collected from seronegative MS and PS subjects. UMAP colored by disease severity (left) and cluster number (right). **(E)** Transcription factor motif enrichment of regulons identified in 4G plotted for each scATAC-seq cluster. **(F)** Correlation plot using regulon activity to link clusters between scRNA-seq (columns) and scATAC-seq (rows). Black box indicates clusters of interest from UMAP in 4F. **(G)** Gene set enrichment analysis for regulons down-regulated (KLF6, FOS, POU2F2, HMGB1) or up-regulated (MS: IRF7, IRF1, STAT1; PS: CEBPB, KLF3) in scRNA-seq clusters 2, 4, and 7.

We next identified sub-populations of CD14+ monocytes characterized by both transcriptomic and epigenetic biomarkers that distinguish seronegative MS and PS subjects. Single-cell gene expression data for CD14+ monocytes were processed using the uniform manifold and projection (UMAP) algorithm and labeled by either subject disease severity or cluster (Fig. 4B). Clusters 2 and 4 identified a unique MS-specific sub-population of CD14+ monocytes from MS subjects, whereas cluster 7 identified a PS-specific sub-population. The gene regulatory networks were inferred for each scRNA-seq cluster to describe the activity of regulons, including transcription factors and their downstream target genes. Down-regulation of the KLF6, FOS, and POU2F2 regulons was observed in both the MS- and PS-specific monocyte subpopulations (Fig. 4C). The MS-specific sub-population was marked by up-regulation of the IRF7, IRF1, and STAT1 regulons, whereas the PS-specific sub-population was marked by up-regulation of the CEBPB and KLF3 regulons (Fig. 4C). Single-cell accessibility data for CD14+ monocytes was processed using a similar approach to identify clusters with correlated regulon activity (Fig. 4D, S6D). Activity at the transcription factor motifs was estimated, and the correlations between scRNA-seq and scATAC-seq clusters were calculated in a pairwise manner (Fig. 4E–F, S6E). Differential gene expression between the MS- and PS-specific sub-populations identified interferon-stimulated gene activation as a distinguishing feature of symptom severity (Fig. S6F–G). In addition to these transcriptomic markers, GSEA was then performed for the transcription factors with differential activity and their downstream target genes (Fig. 4G). Pathways identified using this epigenetic approach suggest alternative regulation of the adaptive immune response between MS and PS subjects. Differential regulation of pathways related to the transition from innate immunity, marked by the implication of dectin-1 and the toll-like receptor cascade, to adaptive immunity, suggested by activation of T and B cell receptor signaling, were observed in these sub-populations (28–30). These observations confirm that chromatin accessibility remodeling corresponds to symptom severity, as first indicated by differential transcription factor activity.

Discussion

In this study, we identified an association between the evolution of the chromatin landscape in PBMCs and the severity of COVID-19 in a longitudinal cohort of subjects that underwent IgG seroconversion. We present evidence here that the epigenomes of PBMCs are remodeled extensively following infection by SARS-CoV-2 and reflect disease severity. Specifically, differential activity of transcription factors and chromatin accessibility prior to seroconversion were observed to distinguish disease severity, preceding the later transcriptional response. Upon analysis of each major cell type, we showed that CD14+ monocytes underwent extensive chromatin remodeling over time and harbor epigenetic biomarkers in seronegative subjects that distinguish disease severity. We further identified enrichment of DORC genes regulated by super-enhancers in CD14+ monocytes which play a role in priming active chromatin states and cell fate decisions and observed sub-populations of CD14+ monocytes with severity-specific transcription factor activities (Fig. S6H) (16). Similar analysis applied to tissue-resident immune cell populations may further elucidate the factors driving divergence of disease severity (31, 32). These observations will inform new hypotheses to characterize the immunological host response to SARS-CoV-2 infection and be used to assess new strategies for treatment.

We demonstrated proof-of-principle that an ATAC-PCR-based assay can potentially be used as an epigenetic test to stratify disease severity (33). Development of a host-response assay that leverages the highly sensitive epigenetic biomarkers established early during infection and capable of predicting disease severity has the potential to fill an unmet clinical need in the care of COVID-19 patients (34, 35). It will be important to apply this analysis to a longitudinal cohort of subjects who go on to develop worsening symptoms and require critical care in order to generate data for a prognostic epigenetic assay.

In summary, we propose that evolution of the chromatin landscape in the peripheral blood of COVID-19 subjects primes the immunological host response mediated by CD14+ monocytes and drives the observed divergence in disease severity. We also demonstrate that these changes precede transcription of pathways related to adaptive immunity, and these observations can potentially be used to inform clinical approaches to COVID-19 patient treatment.

Materials and Methods

IRB Approvals

Research protocols were approved by the relevant institutional IRBs and were performed in accordance with the Declaration of Helsinki. Written informed consent was obtained from all research participants or their legally authorized representatives (LAR).

Subjects and PBMC sample collection

Subjects with confirmed or suspected SARS-CoV-2 infection, or their close contacts, were identified either in the hospital (Duke University Medical Center, Durham, NC; Duke Regional Hospital, Durham, NC; or Durham VA Health Care System, Durham, NC) or outpatient setting, and enrolled into the Molecular and Epidemiological Study of Suspected Infection protocol (MESSI, IRB Pro00100241). Where possible, longitudinal samples and data were collected through convalescence. The CC group consisted of seven participants with a close contact to someone in the same household with confirmed COVID-19 disease, but CC subjects had negative SARS-CoV-2 PCR and serological testing for at least two months after exposure.

Eight subjects with mild symptoms (MS) had a total symptom score <20 on the day of enrollment (mean = 12.8). Seven subjects with more pronounced symptoms (PS) had a total symptom score >20 on the day of enrollment (mean = 33.6). Six subjects were hospitalized due to COVID-19 and were categorized as having critical symptoms (CS) including three with acute respiratory distress syndrome (ARDS) requiring intensive care unit (ICU) care (36). All MS and PS COVID-19 subjects were longitudinally sampled from enrollment to convalescent phase. Seven healthy controls were asymptomatic and enrolled prior to the COVID-19 pandemic.

SARS-CoV-2 IgG ELISA:

Antibody response testing was performed using the anti-SARS-CoV-2 IgG ELISA assay (EUROIMMUN Medizinische Labordiagnostika AG, Lübeck, Germany) according to manufacturer’s instructions. Test results were evaluated by calculating the ratio of the OD (optical density) of the test sample over the OD of the calibrator sample. Ratio of <0.8 was interpreted as negative and ratio of 1.1 or greater as positive (ratio of 0.8 to <1.1 as indeterminate and not utilized in phenotyping).

SARS-CoV-2 quantification by qRT-PCR:

Nasal swab Viral Transport Medium (VTM) was aliquoted and cryopreserved from study subjects to determine SARS-CoV-2 N1 gene copy number by RT-PCR to stratify subjects as COVID PCR positive or negative. Viral RNA was extracted from 140 uL of VTM according to manufacturer’s instructions (QiaAmp Viral RNA minikit). SARS-CoV-2 nucleocapsid (N1) and human RNase P (RPP30) RNA copies were determined using 5 μL of isolated RNA in the CDC-designed kit (CDC-006–00019, Revision: 03, Integrated DNA Technologies 2019-nCoV kit). Standard quantitative RT-PCR (TaqPath 1-step RT qPCR Master Mix, Thermofisher) was run with test RNA and gene-specific standard curves (2e5 copy/mL – 20 copy/mL). Regression analysis was used to determine gene copy number and corrected to report copies/mL of VTM. Samples with a Ct value less than 35 are called as COVID PCR Negative and samples greater than or equal to 35 are called COVID PCR Positive.

Collection of PBMCs

PBMCs were prepared using the Ficoll-Hypaque density gradient method. Whole blood was collected in ACD vacutainer tubes and processed within 8 hours. Blood was diluted 1:2 in PBS, layered onto the Ficoll-Hypaque in 50 ml conical tubes, and centrifuged at 420 g for 25 minutes. Buffy coat was collected and washed with D-PBS by centrifugation at 400 g for 10 minutes. Cell pellets were resuspended in D-PBS and washed again. PBMCs were assessed for viability and cell count using a Vi-Cell automated cell counter (Beckman-Coulter). PBMCs were adjusted to 10×106 cells/ml in cryopreservation media (90% FBS, 10% DMSO) and then aliquoted into cryopreservation vials on ice. Cells underwent controlled freezing at −80C using CoolCell LX (BioCision) for 12–24 hours and were then transferred to liquid nitrogen vapor phase.

RNA extraction, total RNA-seq, and analysis

For each sample, RNA was extracted from 300K cells using Zymo Direct-zol miniprep kit (Cat# R2051). RNA quality was assessed using Agilent DNA tape screen assay. The RNA Integrity Number (RIN) scores for all samples were > 7.0. Total RNA libraries were generated using NuGEN Ovation® SoLo RNA-Seq Library Preparation Kit (Cat# 0500–96).

FASTQ processing

FASTQ files were generated from the NovaSeq BCL outputs and quality was assessed with FASTQC(37). Eukaryotic rRNA sequences were removed using SortMeRNA, and the remaining reads were aligned against the hg19 human reference genome using STAR and the following commands: STAR –genomeDir /path/to/STARIndex/ --sjdbGTFfile /path/to/gene.gtf –readFilesIn /path/to/R1.fastq /path/toR2.fastq –runThreadN 8 –twopassMode Basic –outWigType bedGraph – outSAMtype BAM SortedByCoordinate –readFilesCommand zcat –outReadsUnmapped Fastx – outFileNamePrefix $sampleID. Following alignment, the gene count matrix was generated using featureCounts (38, 39).

Differential gene expression analysis

Differentially expressed genes were identified between subjects with different disease severity using the ‘wald’ test in DESeq2(40). Subject sex and RNA-seq library batch were added as variables to the design formula to account for expected technical variation in the counts. Ribosomal protein genes and genes not annotated as protein coding in Ensembl were filtered from the gene count matrix. False discovery rate adjustment was performed for the p-values using the ‘BH’ method, and log2 fold change shrinkage was performed using the ‘ashr’ method. A gene was defined as significantly differentially expressed if the adjusted p-value < 0.05 and the shrunken absolute log2 fold changes > 1. Results from DESeq2 were passed to the EnhancedVolcano package to generate volcano plots with the same p-value and log2 fold change thresholds(41).

ATAC-seq and analysis

Nuclei were extracted from frozen PBMCs. Briefly, 100K cells were spun down at 300 g for 5 minutes at 4°C. The supernatant was removed, and cells were mixed with 100 ul of lysis buffer (10mM NaCl, 3mM MgCl2, 10mM Tris-HCl pH7.4, 0.1% Tween-20, 0.1% Nonidet^™ P40) and lysed on ice for 4 minutes. Wash buffer (1 mL; 10mM NaCl, 3mM MgCl2, 10mM Tris-HCl pH7.4, 0.1% Tween20) was added before nuclei spin at 500 g for 5 minutes at 4°C. ATAC-seq libraries were generated as presented earlier (42). Briefly, transposition mix (25 μl 2× TD buffer, 2.5 μl transposase (Tn5, 100 nM final), 22.5 μl water) (Illumina Cat# 20031198) was added to the nuclear pellet. The reaction was incubated at 37 °C for 30 minutes. Samples were purified using Qiagen MinElute PCR Purification Kit (Qiagen Cat#28004). DNA fragments were PCR amplified for a total of 10–11 cycles. The resulting libraries were purified again using Qiagen MinElute PCR Purification Kit. The libraries were sequenced with an Illumina Novaseq 6000 S4 flow cell using 100 bp paired-end reads.

FASTQ processing

FASTQ files were generated from the NovaSeq BCL outputs and used as input to the ENCODE ATAC-seq pipeline (https://github.com/ENCODE-DCC/atac-seq-pipeline) using the MACS2 peak-caller with all default parameters. Output narrowPeak files and aligned BAM files were used for downstream analysis.

Differential chromatin accessibility analysis

Differential accessibility was calculated between groups of subjects with different disease severity using the TCseq package(43). A peak count matrix was generated using a set of consensus peaks and the following parameter values: filter.type = “raw”, filter.value = 30, samplePassfilter = 6. Peaks that were differentially expressed (p < 0.05) in seropositive subjects were used as input to the comparison of healthy controls, CC, MS, and PS subjects. Differentially accessible peaks were defined as having a false discovery rate adjusted p-value < 0.1 and absolute fold change > 0.5. Clustering was applied to the differentially accessible peaks using the ‘cm’ method and k = 12. Peaks with a cluster membership > 0.5 were annotated to the closest gene using the ChIPseeker package and UCSC hg19 annotation database (available through the TxDb.Hsapiens.UCSC.hg19.knownGene package) (44, 45). Protein-protein interactions were estimated using STRING and the network was trimmed using k-means clustering to remove genes with few interactions (46). Functional profiling was performed for these genes using g:Profiler (47).

Outputs from the ENCODE pipeline were also used for analysis with the DiffBind package to estimate differential accessibility between seronegative MS and PS subjects (48). A consensus peakset was generated requiring minOverlap = 0.9 in all samples. Peak count normalization was applied using the following parameter values: background = TRUE, method = DBA_EDGER, normalize = DBA_NORM_LIB, library = DBA_LIBSIZE_FULL, offsets = TRUE. Significant differentially accessible peaks were defined as having p<0.05.

Bivariate genomic footprinting analysis

Bivariate genomic footprinting analysis was performed using the bagfoot package and all default parameter values (49). NarrowPeak files for all samples were merged to generate a consensus peakset for motif enrichment analysis. BAM alignment files were merged using picard MergeSamFiles and indexed with samtools (http://broadinstitute.github.io/picard/).

scRNA-seq and analysis

Frozen PBMCs were thawed, and count and cell viability were measured by Countess II. The cell viability exceeded 80% for all samples except PBMC samples from CS subjects, which had viability between 70–80%. For single cell RNA-seq, 200K cells were aliquoted, spun down, resuspended in 30 ul PBS+0.04%BSA+0.2U/ul RNase inhibitor, and counted using Countess II. GEM generation, post GEMRT cleanup, cDNA amplification, and library construction were performed following 10X Genomics Single Cell 5’ v1 chemistry. Quality was assessed using Agilent DNA tape screen assay. Libraries were then pooled and sequenced using Illumina NovaSeq platform with the goal of reaching saturation or 20,000 unique reads per cell on average. Sequencing data were used as input to the 10x Genomics Cell Ranger pipeline to demultiplex BCL files, generate FASTQs, and generate feature counts for each library.

Dimensionality reduction and cell type annotation

Gene-barcode matrices generated using CellRanger count were analyzed using Seurat 3 with the default parameters unless otherwise specified (50). Cells with > 5% of reads mapping to the mitochondrial genome or > 2500 genes detected were removed from the analysis. Counts were log-normalized, and the top 2000 variable features were identified. Principal component analysis was performed using these variable genes, and the top 20 principal components were used for downstream analysis. UMAP dimensionality reduction was performed using the top 20 principal components identified using the Harmony package (51). Graph-based clustering was performed with resolution = 0.5. Cell types were inferred by using the DatabaseImmuneCellExpressionData() method from the SingleR package (52). Labels were confirmed by identification of differentially expressed genes using the FindAllMarkers() method.

Regulatory network inference

The scRNA-seq Seurat object was converted into a SingleCellExperiment and used as input to analysis with the SCENIC package (53). Cells from seronegative MS and PS subjects were re-clustered using Monocle 3, and the top 100 marker genes were computed for each cell partition (54). The standard workflow for running the SCENIC analysis was then performed using the count matrix for these marker genes as input (https://github.com/aertslab/SCENIC). Briefly, GENIE3 was used to identify regulons of transcription factors and their downstream regulatory targets with correlated co-expression, and AUCell was then used to score the activity of these regulons in each cluster. The ‘top10perTarget’ co-expression parameter value was used to prune the list of scored regulons.

scATAC-seq and analysis

PBMCs were thawed and aliquoted as mentioned above. Nuclei were extracted as for ATAC-seq. The single-cell suspensions of scATAC-seq samples were converted to barcoded scATAC-seq libraries using the Chromium Single Cell 5′ Library, Gel Bead and Multiplex Kit, and Chip Kit (10x Genomics). The Chromium Single Cell 5′ v2 Reagent (10x Genomics, 120237) kit was used to prepare single-cell ATAC libraries according to the manufacturer’s instructions. Quality was assessed using Agilent DNA tape screen assay. Libraries were then pooled and sequenced using Illumina NovaSeq platform with the goal of reaching saturation or 25,000 unique reads per nuclei on average. Sequencing data were used as input to the 10x Genomics Cell Ranger ATAC pipeline to demultiplex BCL files, generate FASTQs, and generate feature counts for each library.

scRNA-seq and scATAC-seq integration

Fragment file outputs generated using CellRanger ATAC count were analyzed using ArchR following the standard workflow and with default parameters unless otherwise specified (55). Cells with a transcription start site enrichment score < 4, cells with fewer than 1000 detected fragments, and putative doublets were removed from downstream analysis. Dimensionality reduction was computed using iterative latent semantic indexing (LSI), and batch effect correction was applied using Harmony. Graph-based clustering was performed using the FindClusters() method from Seurat 3 with resolution = 0.8. UMAP embeddings were calculated with the top 30 principal components from either LSI or Harmony. Constrained integration was performed using the addGeneIntegrationMatrix() method and scRNA-seq cell type annotations were used to label the identify of scATAC-seq clusters.

Feature and motif enrichment analysis

Peak calling was performed using MACS2 via the addReproduciblePeakSet() method in ArchR which uses pseudo-bulk replicates of cells grouped on a specific design variable. Differentially accessible peaks were identified between two groups and visualized using the ArchR methods getMarkerFeatures() and markerPlot(), respectively. Significance was defined as FDR <= 0.1 and absolute log2 fold change >= 0.5 unless otherwise specified. The ‘cisbp’ motif set was imported from TFBSTools using the ArchR addMotifAnnotations() method, and motif enrichment in differentially accessible peaks was performed using the peakAnnoEnrichment() method. Additionally, chromVAR deviation scores for these motifs were computed using the ArchR implementation (56). Motif footprinting was performed by measuring Tn5 insertions in genome-wide motifs and normalized by substracting the Tn5 bias from the footprinting signal.

Integrative analysis with scRNA-seq

The correlations between chromVAR transcription factor deviation scores and gene expression data were calculated using the ArchR method correlateMatrices() to identify activators and repressors.

Peak-to-gene linkages were calculated using the addPeak2GeneLinks() method in ArchR using a correlation cutoff of 0.5 and resolution = 1. This approach uses low-overlapping cell aggregates to reduce noise that arises from doing correlative analyses with sparse scATAC-seq datasets. Peak-to-gene linkages were plotted against peak accessibility at DORC genes for each cell type. DORC genes were defined as gene loci with > 10 peak-to-gene linkages, and these sites were used as input to the web tool Seanalysis to identify regulation by a known super-enhancer in peripheral blood cells (16, 57).

The activity of the top-ranked transcription factor regulators that were correlated with scRNA-seq clusters was estimated for each scATAC-seq cluster. These activities were used to calculate Pearson correlation coefficients between scATAC-seq clusters (C1-C8) and scRNA-seq clusters (C1-C11) were calculated to identify scATAC-seq clusters with similar regulatory network activity.

Supervised pseudotime trajectory analysis

Cellular trajectories were established in a low-dimensional space using LSI embeddings and a user-defined trajectory backbone. For this study, a rough ordering of 2–3 groups of cells specified with a design variable was provided to the ArchR method addTrajectory(). Then, a k-nearest neighbors algorithm was used to order cells based on the Euclidean distance of each cell to the nearest cluster’s centroid. Cells were then assigned pseudotime value estimates, and a heatmap was plotted using differential feature z-scores that were associated with the pseudotime trajectory.

ATAC-qPCR

The optimal primer regions for peaks of interests were designed by the ATACPrimerTool (APT) (https://github.com/ChangLab/ATACPrimerTool). The identification of the optimal primer regions and primer design were performed as previously described (33). In short, original .bam files and .bed files containing peak of interests were provided as inputs for the APT, and hg19 was used as the reference genome. Primers were designed by Primer3 Plus based on the APT identified optimal primer regions with the parameters listed in the original APT paper. The ATAC-qPCR was done by assembling a qPCR reaction containing 1ng ATAC-seq library, 125nM of forward and reverse primers and SYBR Green Master Mix with the following cycling conditions: 2 minutes at 98C, 40 cycles of 10 seconds at 98C, 20 seconds at 60C and 30 seconds at 72C. The two Universal Normalization Primers AK5 and KIF26B were used as internal control for the qPCR experiment.

Primers used:

Universal Normalization primers:

AK5 Forward primer: AGCGCGGAGACCACAG

AK5 Reverse Primer: CGGTGCAGCCCTCTTTC

KIF26B Forward primer: AAGCTCGGTGAAGGAGACAA

KIF26B Reverse primer: ACGAGGAAAGCGAGGGATAC

PS-enriched gene primers:

DEFA4 Forward primer: CACTGGGCTATGGAGGACTG

DEFA4 Reverse primer: AGCCGACTTCACTGCTCTG

LGALS17A Forward primer: GTGTGTGCTGGGATGTGACT

LGALS17A Reverse primer: CTGCTGTGTTGGGAGGAAAC

Supplementary Material

NIHPP2020.12.04.412155-supplement-1.pdf^{(4MB, pdf)}

Acknowledgments:

We thank Drs. Eric van Gieson and Thomas Thomou for their advice throughout the project. We also thank Drs. Andrew Macintyer and Thomas Oguin, and their research teams for running, for performing COVID IgG serology and PCR viral load testing in the Core Services units of the Duke Regional Biocontainment Laboratory. We thank Raul Louzuo, Christina Nix, Tyffany Locklear, Pam Isner, Anna Mazur, Jack Anderson, Maria Miggs, Aleah Bowie, Julie Steinbrink, Robert Rolfe, and Jorge Prado for their contributions.

Funding: This work was supported by NIH/NIAID (U01AI066569, UM1AI104681), the U.S. Defense Advanced Projects Agency (DARPA, N66001-09-C-2082 and HR0011-17-2-0069), the Veterans Affairs Health System, and Virology Quality Assurance (VQA) 75N93019C00015. BK: K08HL130557. XS: DARPA/USMRAA W911NF1920111, NIH R35GM122465. COVID-19 samples were processed under BSL2 with aerosol management enhancement or BSL3 in the Duke Regional Biocontainment Laboratory which received partial support for construction from NIH/NIAID (UC6AI058607).

Competing interests: MTM reports grants on biomarker diagnostics from the Defense Advanced Research Projects Agency (DARPA), National Institutes of Health (NIH), Sanofi, and the Department of Veterans Affairs. TWB reports grants from DARPA and is a consultant for Predigen; MTM, TWB, ELT, GSG, and CWW report patents pending on Molecular Methods to Diagnose and Treat Respiratory Infections. ELT reports grants on biomarker diagnostics from DARPA, the NIH/Antibacterial Resistance Leadership Group (ARLG) ; an ownership stake in Predigen; consultancy fees from bioMerieux; GSG reports an ownership stake in Predigen; CWW reports grants on biomarker diagnostics from DARPA, NIH/ARLG, Predigen, and Sanofi; and has received consultancy fees from bioMerieux, Roche, Biofire, Giner, and Biomeme. NSG declares no competing interests. GDS declares no competing interests. XS declares no competing interests.

Footnotes

Data and materials availability: The sequencing datasets and related clinical metadata tables will be made available via the Gene Expression Omnibus (GEO) repository prior to publication.

Supplementary Materials: Materials and Methods Tables S1-S4

Figures Sl-S6

References

1.Blanco-Melo D. et al. , Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19. Cell 181, 1036–1045.e1039 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Ibarrondo F. J. et al. , Rapid Decay of Anti–SARS-CoV-2 Antibodies in Persons with Mild Covid-19. New England Journal of Medicine 383, 1085–1087 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Bournazos S., Gupta A., Ravetch J. V., The role of IgG Fc receptors in antibody-dependent enhancement. Nature Reviews Immunology 20, 633–643 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Long Q.-X. et al. , Antibody responses to SARS-CoV-2 in patients with COVID-19. Nature Medicine 26, 845–848 (2020). [DOI] [PubMed] [Google Scholar]
5.Zhao J. et al. , Antibody Responses to SARS-CoV-2 in Patients With Novel Coronavirus Disease 2019. Clinical Infectious Diseases, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Ni L. et al. , Detection of SARS-CoV-2-Specific Humoral and Cellular Immunity in COVID-19 Convalescent Individuals. Immunity, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Wilk A. J. et al. , A single-cell atlas of the peripheral immune response in patients with severe COVID-19. Nature Medicine, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Maucourant C. et al. , Natural killer cell immunotypes related to COVID-19 disease severity. Science Immunology 5, eabd6832–eabd6832 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
9.McClain M. T. et al. , Dysregulated transcriptional responses to SARS-CoV-2 in the periphery support novel diagnostic approaches. medRxiv, 2020.2007.2020.20155507–20152020.20155507.20155520.20155507 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Mathew D. et al. , Deep immune profiling of COVID-19 patients reveals distinct immunotypes with therapeutic implications. Science 369, eabc8511–eabc8511 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Hadjadj J. et al. , Impaired type I interferon activity and inflammatory responses in severe COVID-19 patients. Science 369, 718–724 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Park A., Iwasaki A., Type I and Type III Interferons – Induction, Signaling, Evasion, and Application to Combat COVID-19. Cell Host & Microbe 27, 870–878 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Del Valle D. M. et al. , An inflammatory cytokine signature predicts COVID-19 severity and survival. Nature Medicine 26, 1636–1643 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Schulte-Schrepping J. et al. , Severe COVID-19 Is Marked by a Dysregulated Myeloid Cell Compartment. Cell, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Zhou Z. et al. , Heightened Innate Immune Responses in the Respiratory Tract of COVID-19 Patients. Cell Host & Microbe 27, 883–890.e882 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Ma S. et al. , Chromatin Potential Identified by Shared Single-Cell Profiling of RNA and Chromatin. Cell 183, 1103–1116.e1120 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
17.El-Diwany R. et al. , CMPK2 and BCL-G are associated with type 1 interferon–induced HIV restriction in humans. Science Advances 4, eaat0843–eaat0843 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Hemmat N. et al. , Neutrophils, Crucial, or Harmful Immune Cells Involved in Coronavirus Infection: A Bioinformatics Study. Frontiers in Genetics 11, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Groppo R., Richter J. D., CPEB Control of NF- B Nuclear Localization and Interleukin-6 Production Mediates Cellular Senescence. Molecular and Cellular Biology 31, 2707–2714 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Sweet D. R., Fan L., Hsieh P. N., Jain M. K., Krüppel-Like Factors in Vascular Inflammation: Mechanistic Insights and Therapeutic Potential. Frontiers in Cardiovascular Medicine 5, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Kapoor N. et al. , Transcription Factors STAT6 and KLF4 Implement Macrophage Polarization via the Dual Catalytic Powers of MCPIP. The Journal of Immunology 194, 6011–6023 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Meng Q., Raha A., Roy S., Hu J., Kalvakolanu D. V., IFN-γ-Stimulated Transcriptional Activation by IFN-γ-Activated Transcriptional Element-Binding Factor 1 Occurs via an Inducible Interaction with CAAAT/Enhancer-Binding Protein-β. The Journal of Immunology 174, 6203–6211 (2005). [DOI] [PubMed] [Google Scholar]
23.Li H., Gade P., Xiao W., Kalvakolanu D. V., The interferon signaling network and transcription factor C/EBP-beta. Cellular & molecular immunology 4, 407–418 (2007). [PMC free article] [PubMed] [Google Scholar]
24.Itoh-Nakadai A. et al. , A Bach2-Cebp Gene Regulatory Network for the Commitment of Multipotent Hematopoietic Progenitors. Cell Reports 18, 2401–2414 (2017). [DOI] [PubMed] [Google Scholar]
25.Patsalos A. et al. , The BACH1–HMOX1 Regulatory Axis Is Indispensable for Proper Macrophage Subtype Specification and Skeletal Muscle Regeneration. The Journal of Immunology 203, 1532–1547 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Delisle J. S. et al. , The TGF-β-Smad3 pathway inhibits CD28-dependent cell growth and proliferation of CD4 T cells. Genes & Immunity 14, 115–126 (2013). [DOI] [PubMed] [Google Scholar]
27.Zhou Q. et al. , KDM2B promotes IL-6 production and inflammatory responses through Brg1-mediated chromatin remodeling. Cellular & Molecular Immunology 17, 834–842 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Brown G. D., Dectin-1: a signalling non-TLR pattern-recognition receptor. Nature Reviews Immunology 6, 33–43 (2006). [DOI] [PubMed] [Google Scholar]
29.Beutler B. A., TLRs and innate immunity. Blood 113, 1399–1407 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Niu X. et al. , Longitudinal Analysis of T and B Cell Receptor Repertoire Transcripts Reveal Dynamic Immune Response in COVID-19 Patients. Frontiers in Immunology 11, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Merad M., Martin J. C., Pathological inflammation in patients with COVID-19: a key role for monocytes and macrophages. Nature Reviews Immunology 20, 355–362 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Jafarzadeh A., Chauhan P., Saha B., Jafarzadeh S., Nemati M., Contribution of monocytes and macrophages to the local tissue inflammation and cytokine storm in COVID-19: Lessons from SARS and MERS, and potential therapeutic interventions. Life Sciences 257, 118102–118102 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Yost K. E., Carter A. C., Xu J., Litzenburger U., Chang H. Y., ATAC Primer Tool for targeted analysis of accessible chromatin. Nature Methods 15, 304–305 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Côté A., Ternacle J., Pibarot P., Early prediction of the risk of severe coronavirus disease 2019: A key step in therapeutic decision making. EBioMedicine 59, 102948–102948 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Altschul D. J. et al. , A novel severity score to predict inpatient mortality in COVID-19 patients. Scientific Reports 10, 16726–16726 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Force T. A. D. T., Acute Respiratory Distress Syndrome. JAMA 307, (2012). [Google Scholar]
37.Andrews S., FastQC: A Quality Control Tool for High Throughput Sequence Data. Available, (2010).
38.Kopylova E., Noé L., Touzet H., SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics 28, 3211–3217 (2012). [DOI] [PubMed] [Google Scholar]
39.Dobin A. et al. , STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Love M. I., Huber W., Anders S., Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology 15, 550–550 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Blighe K., Rana S., Lewis M., EnhancedVolcano: Publication-ready volcano plots with enhanced colouring and labeling. R package.
42.Buenrostro J. D., Wu B., Chang H. Y., Greenleaf W. J., ATAC‐seq: A Method for Assaying Chromatin Accessibility Genome‐Wide. Current Protocols in Molecular Biology 109, (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Wu M., Gu L., TCseq: Time course sequencing data analysis. R package, (2020).
44.Yu G., Wang L.-G., He Q.-Y., ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015). [DOI] [PubMed] [Google Scholar]
45.Carlson M., Maintainer B. P., TxDb.Hsapiens.UCSC.hg19.knownGene: Annotation package for TxDb object(s). R package.
46.Szklarczyk D. et al. , STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Research 47, D607–D613 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Raudvere U. et al. , g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Research 47, W191–W198 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Ross-Innes C. S. et al. , Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature 481, 389–393 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Baek S., Goldstein I., Hager G. L., Bivariate Genomic Footprinting Detects Changes in Transcription Factor Activity. Cell Reports 19, 1710–1722 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Stuart T. et al. , Comprehensive Integration of Single-Cell Data. Cell 177, 1888–1902.e1821 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Korsunsky I. et al. , Fast, sensitive and accurate integration of single-cell data with Harmony. Nature Methods 16, 1289–1296 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Aran D. et al. , Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nature Immunology 20, 163–172 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Aibar S. et al. , SCENIC: single-cell regulatory network inference and clustering. Nature Methods 14, 1083–1086 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Trapnell C. et al. , The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nature Biotechnology 32, 381–386 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Granja J. M. et al. , ArchR: An integrative and scalable software package for single-cell chromatin accessibility analysis. bioRxiv, 2020.2004.2028.066498–062020.066404.066428.066498 (2020). [Google Scholar]
56.Schep A. N., Wu B., Buenrostro J. D., Greenleaf W. J., chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nature Methods 14, 975–978 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Qian F.-C. et al. , SEanalysis: a web tool for super-enhancer associated regulatory analysis. Nucleic Acids Research 47, W248–W255 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHPP2020.12.04.412155-supplement-1.pdf^{(4MB, pdf)}

[R1] 1.Blanco-Melo D. et al. , Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19. Cell 181, 1036–1045.e1039 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Ibarrondo F. J. et al. , Rapid Decay of Anti–SARS-CoV-2 Antibodies in Persons with Mild Covid-19. New England Journal of Medicine 383, 1085–1087 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Bournazos S., Gupta A., Ravetch J. V., The role of IgG Fc receptors in antibody-dependent enhancement. Nature Reviews Immunology 20, 633–643 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Long Q.-X. et al. , Antibody responses to SARS-CoV-2 in patients with COVID-19. Nature Medicine 26, 845–848 (2020). [DOI] [PubMed] [Google Scholar]

[R5] 5.Zhao J. et al. , Antibody Responses to SARS-CoV-2 in Patients With Novel Coronavirus Disease 2019. Clinical Infectious Diseases, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Ni L. et al. , Detection of SARS-CoV-2-Specific Humoral and Cellular Immunity in COVID-19 Convalescent Individuals. Immunity, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Wilk A. J. et al. , A single-cell atlas of the peripheral immune response in patients with severe COVID-19. Nature Medicine, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Maucourant C. et al. , Natural killer cell immunotypes related to COVID-19 disease severity. Science Immunology 5, eabd6832–eabd6832 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.McClain M. T. et al. , Dysregulated transcriptional responses to SARS-CoV-2 in the periphery support novel diagnostic approaches. medRxiv, 2020.2007.2020.20155507–20152020.20155507.20155520.20155507 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Mathew D. et al. , Deep immune profiling of COVID-19 patients reveals distinct immunotypes with therapeutic implications. Science 369, eabc8511–eabc8511 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Hadjadj J. et al. , Impaired type I interferon activity and inflammatory responses in severe COVID-19 patients. Science 369, 718–724 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Park A., Iwasaki A., Type I and Type III Interferons – Induction, Signaling, Evasion, and Application to Combat COVID-19. Cell Host & Microbe 27, 870–878 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] 13.Del Valle D. M. et al. , An inflammatory cytokine signature predicts COVID-19 severity and survival. Nature Medicine 26, 1636–1643 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Schulte-Schrepping J. et al. , Severe COVID-19 Is Marked by a Dysregulated Myeloid Cell Compartment. Cell, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] 15.Zhou Z. et al. , Heightened Innate Immune Responses in the Respiratory Tract of COVID-19 Patients. Cell Host & Microbe 27, 883–890.e882 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Ma S. et al. , Chromatin Potential Identified by Shared Single-Cell Profiling of RNA and Chromatin. Cell 183, 1103–1116.e1120 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] 17.El-Diwany R. et al. , CMPK2 and BCL-G are associated with type 1 interferon–induced HIV restriction in humans. Science Advances 4, eaat0843–eaat0843 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Hemmat N. et al. , Neutrophils, Crucial, or Harmful Immune Cells Involved in Coronavirus Infection: A Bioinformatics Study. Frontiers in Genetics 11, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Groppo R., Richter J. D., CPEB Control of NF- B Nuclear Localization and Interleukin-6 Production Mediates Cellular Senescence. Molecular and Cellular Biology 31, 2707–2714 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Sweet D. R., Fan L., Hsieh P. N., Jain M. K., Krüppel-Like Factors in Vascular Inflammation: Mechanistic Insights and Therapeutic Potential. Frontiers in Cardiovascular Medicine 5, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Kapoor N. et al. , Transcription Factors STAT6 and KLF4 Implement Macrophage Polarization via the Dual Catalytic Powers of MCPIP. The Journal of Immunology 194, 6011–6023 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Meng Q., Raha A., Roy S., Hu J., Kalvakolanu D. V., IFN-γ-Stimulated Transcriptional Activation by IFN-γ-Activated Transcriptional Element-Binding Factor 1 Occurs via an Inducible Interaction with CAAAT/Enhancer-Binding Protein-β. The Journal of Immunology 174, 6203–6211 (2005). [DOI] [PubMed] [Google Scholar]

[R23] 23.Li H., Gade P., Xiao W., Kalvakolanu D. V., The interferon signaling network and transcription factor C/EBP-beta. Cellular & molecular immunology 4, 407–418 (2007). [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Itoh-Nakadai A. et al. , A Bach2-Cebp Gene Regulatory Network for the Commitment of Multipotent Hematopoietic Progenitors. Cell Reports 18, 2401–2414 (2017). [DOI] [PubMed] [Google Scholar]

[R25] 25.Patsalos A. et al. , The BACH1–HMOX1 Regulatory Axis Is Indispensable for Proper Macrophage Subtype Specification and Skeletal Muscle Regeneration. The Journal of Immunology 203, 1532–1547 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Delisle J. S. et al. , The TGF-β-Smad3 pathway inhibits CD28-dependent cell growth and proliferation of CD4 T cells. Genes & Immunity 14, 115–126 (2013). [DOI] [PubMed] [Google Scholar]

[R27] 27.Zhou Q. et al. , KDM2B promotes IL-6 production and inflammatory responses through Brg1-mediated chromatin remodeling. Cellular & Molecular Immunology 17, 834–842 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Brown G. D., Dectin-1: a signalling non-TLR pattern-recognition receptor. Nature Reviews Immunology 6, 33–43 (2006). [DOI] [PubMed] [Google Scholar]

[R29] 29.Beutler B. A., TLRs and innate immunity. Blood 113, 1399–1407 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Niu X. et al. , Longitudinal Analysis of T and B Cell Receptor Repertoire Transcripts Reveal Dynamic Immune Response in COVID-19 Patients. Frontiers in Immunology 11, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] 31.Merad M., Martin J. C., Pathological inflammation in patients with COVID-19: a key role for monocytes and macrophages. Nature Reviews Immunology 20, 355–362 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] 32.Jafarzadeh A., Chauhan P., Saha B., Jafarzadeh S., Nemati M., Contribution of monocytes and macrophages to the local tissue inflammation and cytokine storm in COVID-19: Lessons from SARS and MERS, and potential therapeutic interventions. Life Sciences 257, 118102–118102 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Yost K. E., Carter A. C., Xu J., Litzenburger U., Chang H. Y., ATAC Primer Tool for targeted analysis of accessible chromatin. Nature Methods 15, 304–305 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] 34.Côté A., Ternacle J., Pibarot P., Early prediction of the risk of severe coronavirus disease 2019: A key step in therapeutic decision making. EBioMedicine 59, 102948–102948 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] 35.Altschul D. J. et al. , A novel severity score to predict inpatient mortality in COVID-19 patients. Scientific Reports 10, 16726–16726 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] 36.Force T. A. D. T., Acute Respiratory Distress Syndrome. JAMA 307, (2012). [Google Scholar]

[R37] 37.Andrews S., FastQC: A Quality Control Tool for High Throughput Sequence Data. Available, (2010).

[R38] 38.Kopylova E., Noé L., Touzet H., SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics 28, 3211–3217 (2012). [DOI] [PubMed] [Google Scholar]

[R39] 39.Dobin A. et al. , STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] 40.Love M. I., Huber W., Anders S., Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology 15, 550–550 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] 41.Blighe K., Rana S., Lewis M., EnhancedVolcano: Publication-ready volcano plots with enhanced colouring and labeling. R package.

[R42] 42.Buenrostro J. D., Wu B., Chang H. Y., Greenleaf W. J., ATAC‐seq: A Method for Assaying Chromatin Accessibility Genome‐Wide. Current Protocols in Molecular Biology 109, (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R43] 43.Wu M., Gu L., TCseq: Time course sequencing data analysis. R package, (2020).

[R44] 44.Yu G., Wang L.-G., He Q.-Y., ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015). [DOI] [PubMed] [Google Scholar]

[R45] 45.Carlson M., Maintainer B. P., TxDb.Hsapiens.UCSC.hg19.knownGene: Annotation package for TxDb object(s). R package.

[R46] 46.Szklarczyk D. et al. , STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Research 47, D607–D613 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R47] 47.Raudvere U. et al. , g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Research 47, W191–W198 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R48] 48.Ross-Innes C. S. et al. , Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature 481, 389–393 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R49] 49.Baek S., Goldstein I., Hager G. L., Bivariate Genomic Footprinting Detects Changes in Transcription Factor Activity. Cell Reports 19, 1710–1722 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R50] 50.Stuart T. et al. , Comprehensive Integration of Single-Cell Data. Cell 177, 1888–1902.e1821 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] 51.Korsunsky I. et al. , Fast, sensitive and accurate integration of single-cell data with Harmony. Nature Methods 16, 1289–1296 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R52] 52.Aran D. et al. , Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nature Immunology 20, 163–172 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R53] 53.Aibar S. et al. , SCENIC: single-cell regulatory network inference and clustering. Nature Methods 14, 1083–1086 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R54] 54.Trapnell C. et al. , The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nature Biotechnology 32, 381–386 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R55] 55.Granja J. M. et al. , ArchR: An integrative and scalable software package for single-cell chromatin accessibility analysis. bioRxiv, 2020.2004.2028.066498–062020.066404.066428.066498 (2020). [Google Scholar]

[R56] 56.Schep A. N., Wu B., Buenrostro J. D., Greenleaf W. J., chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nature Methods 14, 975–978 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R57] 57.Qian F.-C. et al. , SEanalysis: a web tool for super-enhancer associated regulatory analysis. Nucleic Acids Research 47, W248–W255 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

This is a preprint.

Chromatin remodeling in peripheral blood cells reflects COVID-19 symptom severity

Nicholas S Giroux

Shengli Ding

Micah T McClain

Thomas W Burke

Elizabeth Petzold

Hong A Chung

Grecia R Palomino

Ergang Wang

Rui Xi

Shree Bose

Tomer Rotstein

Bradly P Nicholson

Tianyi Chen

Ricardo Henao

Gregory D Sempowski

Thomas N Denny

Emily R Ko

Geoffrey S Ginsburg

Bryan D Kraft

Ephraim L Tsalik

Christopher W Woods

Xiling Shen

Abstract

One sentence summary:

Introduction

Results

Figure 1. Chromatin accessibility in peripheral immune cells distinguishes COVID-19 symptom severity.

Figure 2. Epigenetic signature in cells collected from COVID-19 subjects evolves with disease progression.

Figure 3. CD14+ monocytes undergo extensive chromatin remodeling prior to seroconversion and harbor severity-specific epigenetic biomarkers.

Figure 4. CD14+ monocytes distinguish seronegative MS and PS subjects.

Discussion

Materials and Methods

IRB Approvals

Subjects and PBMC sample collection

SARS-CoV-2 IgG ELISA:

SARS-CoV-2 quantification by qRT-PCR:

Collection of PBMCs

RNA extraction, total RNA-seq, and analysis

FASTQ processing

Differential gene expression analysis

ATAC-seq and analysis

FASTQ processing

Differential chromatin accessibility analysis

Bivariate genomic footprinting analysis

scRNA-seq and analysis

Dimensionality reduction and cell type annotation

Regulatory network inference

scATAC-seq and analysis

scRNA-seq and scATAC-seq integration

Feature and motif enrichment analysis

Integrative analysis with scRNA-seq

Supervised pseudotime trajectory analysis

ATAC-qPCR

Supplementary Material

Acknowledgments:

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases