Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Dec 2;15(12):e0242900. doi: 10.1371/journal.pone.0242900

In silico immune infiltration profiling combined with functional enrichment analysis reveals a potential role for naïve B cells as a trigger for severe immune responses in the lungs of COVID-19 patients

Yi-Ying Wu 1, Sheng-Huei Wang 2,3, Chih-Hsien Wu 4, Li-Chen Yen 5, Hsing-Fan Lai 4,6, Ching-Liang Ho 1, Yi-Lin Chiu 4,*
Editor: Mrinmoy Sanyal7
PMCID: PMC7710067  PMID: 33264345

Abstract

COVID-19, caused by SARS-CoV-2, has rapidly spread to more than 160 countries worldwide since 2020. Despite tremendous efforts and resources spent worldwide trying to explore antiviral drugs, there is still no effective clinical treatment for COVID-19 to date. Approximately 15% of COVID-19 cases progress to pneumonia, and patients with severe pneumonia may die from acute respiratory distress syndrome (ARDS). It is believed that pulmonary fibrosis from SARS-CoV-2 infection further leads to ARDS, often resulting in irreversible impairment of lung function. If the mechanisms by which SARS-CoV-2 infection primarily causes an immune response or immune cell infiltration can be identified, it may be possible to mitigate excessive immune responses by modulating the infiltration and activation of specific targets, thereby reducing or preventing severe lung damage. However, the extent to which immune cell subsets are significantly altered in the lung tissues of COVID-19 patients remains to be elucidated.

This study applied the CIBERSORT-X method to comprehensively evaluate the transcriptional estimated immune infiltration landscape in the lung tissues of COVID-19 patients and further compare it with the lung tissues of patients with idiopathic pulmonary fibrosis (IPF). We found a variety of immune cell subtypes in the COVID-19 group, especially naïve B cells were highly infiltrated. Comparison of functional transcriptomic analyses revealed that non-differentiated naïve B cells may be the main cause of the over-active humoral immune response. Using several publicly available single-cell RNA sequencing data to validate the genetic differences in B-cell populations, it was found that the B-cells collected from COVID-19 patients were inclined towards naïve B-cells, whereas those collected from IPF patients were inclined towards memory B-cells. Further differentiation of B cells between COVID-19 mild and severe patients showed that B cells from severe patients tended to be antibody-secreting cells, and gene expression showed that B cells from severe patients were similar to DN2 B cells that trigger extrafollicular response. Moreover, a higher percentage of B-cell infiltration seems associated with poorer clinical outcome. Finally, a comparison of several specific COVID-19 cases treated with targeted B-cell therapy suggests that appropriate suppression of naïve B cells might potentially be a novel strategy to alleviate the severe symptoms of COVID-19.

Introduction

COVID-19 (Coronavirus Disease-2019) caused by the SARS-CoV-2 (Severe Acute Respiratory Syndrome coronavirus-2) has rapidly spread to over 160 countries worldwide since the beginning of 2020 [13]. Nearly 40 million patients with COVID-19 and over 1,000,000 deaths have occurred up to now. At the same time, the rate of increase in the number of COVID-19 cases is gradually accelerating. Despite the tremendous efforts and resources spent by scientists and clinicians around the world trying to produce vaccines and explore antiviral drugs [4, 5], there is still no potent drug or effective clinical treatment for COVID-19 to date [6]. The clinical indications of COVID-19 disease are quite diverse, symptoms including fever, dry cough, loss of smell or taste, fatigue, diarrhea, conjunctivitis, and pneumonia have been reported [79]. About 80% of COVID-19 cases are asymptomatic or exhibit mild to moderate symptoms, about 15% of COVID-19 cases progress to pneumonia [10], and patients with severe pneumonia may die from acute respiratory distress syndrome (ARDS) or multi-organ failure [11, 12]. Moreover, further pulmonary fibrosis caused by SARS-CoV-2 infection-induced ARDS often result in irreversible impairment of lung function, which may leave permanent or semi-permanent damage even if patients recover from COVID-19 [13, 14].

The mechanism by which SARS-CoV-2 causes lung injury is only partially understood. It is believed that SARS-CoV-2 infection activates both innate and adaptive immune responses against the virus. However, excessive inflammatory innate responses and dysregulated adaptive host immune defense may cause harmful tissue damage at sites of SARS-CoV-2 entry, such as the lungs and bronchi, accelerating the process of acute lung injury (ALI) and ARDS. Worse, certain organs, including the lungs, may be irreversibly damaged after SARS-CoV-2 infection [15]. Regarding the association of ARDS with lung fibrosis, ARDS is thought to originate from plasma infiltration into the alveolar lumen due to persistent alveolar epithelial damage [16, 17]. Activation of the coagulation system in the plasma and production of proinflammatory cytokines and chemotactic factors leads to a massive influx of neutrophils, lymphocytes and monocytes/macrophages into the lungs, resulting in the dysregulated release of potent cytotoxic mediators associated with inflammation, which ultimately leads to damage to the lung endothelium and epithelial cells [18]. When epithelial and endothelial cells are damaged, inflammatory mediators are released to further recruit multiple types of inflammatory immune cell infiltrates, prompting fibroblasts to activate and migrate to the wound center while releasing collagen to remodel the extracellular matrix (ECM). Chronic inflammation and persistent repair can trigger an excessive accumulation of ECM components, which will lead to permanent fibrosis. Therefore, it is generally accepted that persistent inflammation due to infiltration of immune cells is a major contributor to pathological fibrosis in the lungs [19].

While maintaining immune defense against SARS-CoV-2 infection in COVID-19 patients is certainly important, the over-activated inflammatory response is also directly linked to poor prognosis for recovery [20]. If the mechanism by which SARS-CoV-2 infection primarily elicits the immune system response and immune cell infiltration can be found, it would be possible to alleviate the extent of ARDS and lung fibrosis by modulating the infiltration and activation of specific immune cells to attenuate the excessive immune response. We believe that understanding the infiltration status of immune cells in SARS-CoV-2 infected lung tissues is the key to access the originate that raised immune hyperactivation and is also a critical step in developing new therapeutic strategies for COVID-19.

Currently, clinical studies of COVID-19 have focused on the collection and analysis of peripheral blood or bronchoalveolar lavage fluid from patients [21, 22], the extent to which immune cell subsets in the lung tissue of COVID-19 patients are significantly altered remains unclear. The present study applied the CIBERSORT method developed by Newman et al. to comprehensively evaluate the simulated infiltration of 22 immune cells in the lung tissues of COVID-19 deceased patients and to further compare them with immune cells in the lung tissues of patients with idiopathic pulmonary fibrosis (IPF) [23]. By analyzing the immune cell infiltration, we found that multiple immune cell subtypes, especially naïve B cells, were highly infiltrated in the lung tissues of COVID-19 patients. Comparison of functional gene sets revealed that non-differentiated naïve B cells may be the main reason for the overactive humoral immune response. Further analysis of the defined B-cell population using single-cell RNA sequencing databases showed that the B-cells from COVID-19 patients not only tended to be naïve B-cells, but also tended to be antibody-secreting cells in patients with severe disease, and the proportion of B-cell infiltration seemed to correlate with the severity of the disease. We further compared several cases of specific COVID-19 with therapies targeting B cells and found that suppression of naïve B cells is likely to be a new strategy to alleviate the severe symptoms of COVID-19.

Material and methods

Evaluation of infiltrating immune cells using CIBERSORT-X

CIBERSORT-X is a powerful tool for simulating tissue immune infiltration in samples using computational algorithms [2325]. The application of CIBERSORT-X allows accurate quantification of the relative abundance or absolute scores of different immune cell types in complex gene expression mixtures. To characterize and calculate each immune cell subtype, CIBERSORT-X uses approximately 500 genes with consistent gene expression signatures. Here, we applied the original gene signature file LM22, which characterizes 22 immune cell subtypes, including B cells, T cells, natural killer cells, macrophages, dendritic cells, eosinophils, and neutrophils, and analyzed datasets from COVID-19 patient lung tissue and controls from the same database (NCBI GEO accession number: GSE150316), IPF patient lung tissues and healthy donor lung tissues (NCBI GEO accession numbers: GSE53845 and GSE124685) [26, 27]. After normalization of all sample data, the immune cell profiles were analyzed using the CIBERSORT-X web tool to calculate absolute scores, and the mean of each group’s immune cell profile was then computed using GraphPad software.

Identification of B cell specific signatures

For Naive B cell and memory B cell specific signatures, we used LM22 document to define the top 100 enriched genes in subtypes of Naive B cells and Memory B cells respectively, and then analyzed genes in the intersection or non-intersection by Venn diagram. The genes in non-intersection areas were defined as the specific gene set of the cell.

Single cell RNA sequencing data analysis

For single-cell RNA sequencing data analysis, Bio-turning Browser (BBrowser 2.6.22) software were utilized to download two COVID-19 single-cell sequencing data (GSE145926 published by Liao et al. and another single-cell RNA sequencing data published by Chua et al.) and one IPF single-cell sequencing data (GSE128033 published by Morse et al.) [22, 28, 29]. The annotation of immune cell clusters and disease severity was based on the original author-defined cell clusters and meta-data built in the downloaded data within BBrowser. The matrix of gene expression of mentioned B cells was extracted using BBrowser as well for further GSEA analysis.

Gene Set Enrichment Analysis (GSEA) and enrichment map visualization

GSEA is a computational method used to investigate whether a given whole gene expression profiling with user defined phenotype is significantly enriched in a set of gene sets [30]. The GSE53845, GSE124685, and GSE150316 databases were downloaded from NCBI and normalized before being input into the GSEA program. The BP:GO biological process (7530 gene sets) and the PID subset of CP (196 gene sets) from the C2CP canonical pathways were downloaded from MsigDB and used as functional gene sets [30]. Samples were categorized into “SARS-CoV-2 vs NegControl” and “IPF vs Healthy donor” according to the original database annotations respectively. P-value <0.05 and FDR <0.05 will be considered statistically significant. Enriched gene sets of GSEA identification on "GO:BP" and "PID pathways" were visualized via the Enrichment Map v3.2.1 plugin in Cytoscape 3.8 [31]. Enrichment maps represent the degree of enrichment of functional gene clusters for each disease group compared to control groups.

For single-cell GSEA analysis, the annotation from the original authors was used to distinguish whether the B cells in matrix were from patients with mild or severe disease. The defined B cell populations were then analyzed using the gene set of “C7:immunologic signature gene sets” describing the B cell lineage in GSEA with default settings. Those that were significantly enriched (p-value < 0.05, FDR < 0.05) in both databases were considered to be the predominant differences in B-cell populations from patients with mild or severe COVID-19 disease. Regarding gene set interpretation, since the C7:immunologic signature gene sets mostly compare the differential gene expression between the two types of immune cells, with "UP" denoting that the moderate is associated with the former and "DN" denoting that the moderate is associated with the latter, we used this principle to define the propensity of B cells in severe disease.

B cell infiltration and clinical status

For the analysis of B-cell infiltration and clinical status, all data were calculated based on the number of immune cells and clinical status of COVID-19 patients as quoted from S1 Data in the article published by Chua et al.

Statistical analysis

For CIBERSORT abs scores, the Student’s t-test was used for comparisons between the two groups with normally distributed data and the Mann-Whitney test was used for comparisons between the two groups with abnormally distributed data. Pearson correlation analysis was used to estimate the consistency among the 22 immune cell transcriptional estimated infiltration score distributions. Statistical analysis was performed using GraphPad Prism software (GraphPad Software). P <0.05 were regarded as statistically significant differences.

Results

Rationale and design of an in silico simulated immune cell infiltration profiling study

Nearly all of the patients who died from COVID-19 had severe lung tissue damage and pulmonary fibrosis [32]. On the other hand, mortality in IPF is generally the result of progressive fibrotic lung disease. We believe that comparing the gene expression profiling between lung tissues of COVID-19 and IPF will allow us to access the phenotypes that are specific to SARS-CoV-2 infection. Among these, differences in the level of immune cell infiltration are considered to be the most critical factor in the assessment of an over-active immune system. However, it is difficult to assess experimentally the infiltration of multiple immune cells in clinical. The first step is to obtain ethical approval and valuable COVID-19 lung samples, followed by tissue sampling and analysis in laboratories with adequate biosafety levels, and then staining and analysis of various immune cell populations with specific biomarkers. Moreover, the biomarkers for the analysis were limited, and the proportion of multiple immune cells could not be analyzed simultaneously.

In recent years, algorithms to precisely simulate the proportion of multiple immune cells infiltrating tissue samples using whole gene mapping have emerged and have been used in many studies [33, 34]. The simulation results were confirmed to correlate significantly with the proportion of actual immune cells in several studies [24, 35]. In this study, CIBERSORT was utilized for tissue immune cell infiltration scoring, which was based on 22 types of immune cell subsets profiling, and abs mode was performed to enable cross-database comparison [23]. For sample collection, we used the valuable COVID-19 patient organ RNA-sequencing whole gene expression results uploaded to the NCBI GEO database by Ting et al. as the target for analysis (SARS-CoV-2 infected lung tissue sample N = 16, Negative Control lung tissue sample N = 5). GSEA was then used to evaluate the enrichment score of functional gene sets associated with disease groups. The functional gene set significance filter was set at P-value <0.05, FDR <0.05. Further, Cytoscape Enrichment map was applied to visualize the GSEA results and perform related gene set clustering, the whole process is shown in Fig 1A.

Fig 1. Overall procedure of the study and transcriptional estimated infiltration of 22 types of immune cells in the lungs of COVID-19 patients.

Fig 1

(A) Diagram showing the comprehensive procedure of the study; (B) CIBERSORT abs score of 22 immune cells, colors represent the classification of the following: T cell lineage, B cell lineage, Myeloblast lineage, and other immune cells; (C) Similarity matrix representing the correlation among 22 immune cells in GSE150316.

In the lung tissue of COVID-19 patients and negative controls, 22 immune cell infiltration scores were calculated by CIBERSORT and visualized in Morpheus developed by Broad institute [36]. Comparison of the differences by T test showed that T cell CD8+, B cell plasma, monocyte, and Macrophage M1 were significantly increased in the lung tissues of COVID-19 patients (Fig 1B). Similarity matrix analysis of the distribution of 22 immune cells showed that B cell naïve, monocyte, T cell CD8+, B cell plasma, mast cell activated, and T cell CD4+ mem clustered together, indicating that the distribution of these immune cells was similar, and the percentage of infiltration was increased in the lung tissue of COVID-19 patients (Fig 1C).

Comparison of T-cell lineage transcriptional estimated infiltration in lung tissue between COVID-19 and IPF patients

In order to understand the detailed differences between the immune cell infiltration landscape in COVID-19 and IPF, CIBERSORT-X was carried out on two IPF databases (GSE124685 and GSE53845), and the results were divided into T cell lineage, B cell lineage, Myeloblast lineage, and other cells respectively.

T cell lineage analysis showed significant increases in CD8+, CD4+ naïve, and CD4+ memory activated in both SARS-CoV-2 infection and fibrotic lung tissues (Fig 2). In contrast, there was no consistent comparisons among CD4+ memory resting, regulatory T cell and γδ T cell subgroups.

Fig 2. In silico simulated T-cell lineage infiltration in lung tissue between COVID-19 and IPF patients.

Fig 2

*P<0.05, **P<0.01 and ***P<0.001, with comparisons indicated by brackets.

Related studies showed that CD8+ T cells significantly increased bronchoalveolar lavage fluid in SARS-CoV-2 infected patients [22], and CD4+ memory T cells have been reported to respond to viral spike protein after SARS infection [37]. In addition, both CD4+ naïve T cells and CD8+ T were reported to be significantly increased by mass cytometry (CyTOF) analysis [38].

Comparison of B-cell lineage transcriptional estimated infiltration in lung tissue between COVID-19 and IPF patients

The results of the B-cell lineage analysis showed that the CIBERSORT abs score of Plasma B cells was significantly increased in both COVID-19 and IPF patients (Fig 3). In contrast, there were significantly elevated naïve B cells in the lung tissue of SARS-CoV-2-infected patients and significantly elevated memory B cells in the lung tissue of IPF patients respectively, suggesting that SARS-CoV-2 infection is associated with the infiltration of naïve B cells rather than memory B cells in the lungs, a phenomenon exactly the opposite of the observations in the IPF. In addition, the infiltration of follicular T cells was significantly increased in one of the IPF databases and was not significantly different in COVID-19 group.

Fig 3. In silico simulated B-cell lineage infiltration in lung tissue between COVID-19 and IPF patients.

Fig 3

*P<0.05, **P<0.01 and ***P<0.001, with comparisons indicated by brackets.

Correlative studies have shown significantly increased expression of memory B cells, plasmablast and BAFF (B cell-activating factor of the TNF family) in lung tissue and peripheral blood of IPF patients [39, 40], where BAFF is considered important for the survival of plasma cells [41]. In addition, the proportion of T follicular helper cells in the peripheral blood of IPF patients increased significantly [42, 43]. In COVID-19 patients, recent studies have shown a significant increase in plasma cells and a significant decrease in naïve B cells in peripheral blood [21]. In an analysis of B cell compartment of SARS-CoV-2 infected patients, Nielsen et al. found that most of the B cells recruited to respiratory tracts in the early stage of infection lacked significant somatic mutation, suggesting that the recruited B cells were similar to naïve B cells [44].

Comparison of transcriptional estimated myeloblast lineage and other immune cell simulated infiltration in lung tissue between COVID-19 and IPF patients

The analysis of myeloblast lineage showed an increase in monocyte in the lung tissue of SARS-CoV-2 infected patients, and a decrease in the lung tissue of IPF patients (Fig 4). In common, the CIBERSORT-X abs score of M1 macrophage was significantly increased in both disease groups. The results of M0 and M2 macrophage infiltration do not vary significantly or are not consistent between the two IPF databases.

Fig 4. In silico simulated myeloblast lineage infiltration in lung tissue between COVID-19 and IPF patients.

Fig 4

*P<0.05, **P<0.01 and ***P<0.001, with comparisons indicated by brackets.

Monocytes and lung macrophages are thought to be involved in the pathogenesis of pulmonary fibrosis [45]. Previous studies have shown that higher monocyte counts observed in the peripheral blood of patients with IPF are significantly associated with poor prognosis [46]. Recent studies have shown that alveolar macrophages (AM), which are primarily involved in the pathogenesis of pulmonary fibrosis, arise more from in situ proliferation than from bone marrow supplementation [47, 48]. These studies suggest that during pulmonary fibrosis, cytokine released from immune cells may locally induce AM polarization to M1 or M2 subtypes and thus influence the progression of fibrosis [49]. The role of monocyte in the lungs of IPF patients remains to be defined.

On the other hand, a high degree of monocyte infiltration has been observed in the lung tissue of COVID-19 patients [22, 38], macrophage infiltration has also been reported recently [50]. The infiltration of monocytes in the lung may be activated by the immune response to form macrophages, which may ultimately promote acute inflammation and cause lung damage through increased M1-polarized macrophages [51]. Other types of immune cells were not significantly different in the lungs of COVID-19 patients, as shown in S1 Fig in S1 Data.

GO:BP analysis based on GSEA reveals specific enrichment of B-cell-mediated innate humoral responses in the lungs of COVID-19 patients

To understand the similarities and differences in the functional enrichment of gene sets in the lung tissues of COVID-19 and IPF patients compared to controls, the GO:BP (Gene Ontology Biological Processing) gene set was employed to evaluate a variety of biological responses, including immune responses.

Based on the GSEA results, Fig 5A lists the functional gene sets with NES scores greater than 2 (COVID-19: Red, IPF: Green; pre-screening criteria: nom. P < 0.05, FDR < 0.05). The purple text shows the set of genes that are co-enriched in COVID-19 or IPF patients, including Humoral immune response mediated by circulating immunoglobulin, B cell mediated immunity, complement activation, phagocytosis recognition, positive regulation of B cell activation and B cell receptor signalign pathway ranked first. Showing that an immune response caused by B-cell mediators may be a consistent phenomenon that causes damage to lung tissue in patients with COVID-19 or IPF. Green and red text represent the separately enriched gene sets in IPF or COVID-19 respectively. A number of gene sets related to lung fibrosis, including extracellular structure organization and collagen associated processes, were enriched in IPF. At the same time, a variety of genes related to B-cell proliferation, differentiation and maturation as well as adaptive immune response were also solely associated with IPF. Interestingly, none of these gene sets associated with B-cell maturation and differentiation were prominent in COVID-19, and instead were enriched for the innate immune response and FC receptor signaling pathway. This phenomenon is consistent with the results of the CIBERSORT-X analysis.

Fig 5. GO:BP functional enrichment map summary of GSEA in the lung tissues of COVID-19 and IPF patients.

Fig 5

(A) The top NES-ranked enrichment results of biological processing by gene ontology (GO) functional enrichment analysis in IPF or COVID-19 (NES > 2; FDR < 0.05). The normalized enrichment score (NES) was presented in the X-axis, and the names of enriched functional gene sets were shown in the Y-axis. The color gradation on the right side indicates false discovery rate (FDR). Size represents the number of genes in the gene set. Color of the text represents whether the gene set was enriched in both (purple) or in individual type of disease (COVID-19: Red, IPF: Green). (B) Venn diagram: two circles represent the intersection status of the top 100 genes from the Naive B cell (Red) and Memory B cell (Green) subtypes extracted from the LM22 immune cell gene signatures, respectively. The detailed list of genes in the intersection is shown in the table below. Enrichment map: Commonality of positive enrichments in the GO:BP gene sets after GSEA assessment using enrichment map visualization in COVID-19 (GSE150316, Red) and IPF (GSE124685 and GSE53845, Green). Single color: the node is positively correlated in only one disease condition; Mixed: the node is positively correlated in two or three databases. Diamond: Naïve or Memory B cell specific signature, genes in the signature were shown in the table below. Lines connecting nodes or diamonds represent the degree of overlapping between two gene sets. Criteria for enrichment significance screening: P-value <0.05, FDR <0.05.

To understand which genes affect the CIBERSORT-X algorithm more, we ranked the top 100 genes enriched in naive B cell or memory B cell within LM22 signature document and performed Venn diagram analysis (Fig 5B). There are 20 genes that are not involved in intersection in both types of B cell, we define them separately as “Naïve B cell specific signature” and “Memory B cell specific signature”. In addition, to systematically visualize the enriched gene set clusters, we performed the enrichment map app in Cytoscape and set the enriched gene cluster in COVID-19 as red and IPF as green. The visualized results showed that clusters belong to immune cell activation were divergent, while the other clusters were clearly polarized. For example, the chromatin centromere and sister segregation gene sets are enriched only in COVID-19, while others are enriched only in IPF. We further overlap the defined B cell specific signature with the visualized clusters, the graphical representation showed that memory B cell specific signature was mostly associated with multiple immune-related gene sets of IPF. In contrast, the naive B cell specific signature was associated with COVID-19 enriched immune gene sets. These results are consistent with the data from the CIBERSORT-X analysis of B cell lineage (Fig 3). That is, in the lungs of COVID-19 patients, undifferentiated naïve B cells may be the main immune cells that elicit humoral immune response, which has been observed in clinical practice [52, 53].

PID pathway analysis reveals distinct enrichment between immune cell signaling and migration responses and shared signaling pathways in damaged lung

The function of immune cells is thought to be associated with multiple signaling pathways. Through a comprehensive analysis of the enrichment of signaling pathways, not only the status of specific immune cells can be assessed, but also potential therapeutic targets can be identified. The PID (Pathway Interaction Database) pathway gene set is known for its accuracy in reflecting specific signal pathways, which helps us to precisely define the specific variation in COVID-19 group [54]. S2 Fig in S1 Data shows the visualized results of the GSEA analysis using PID pathway. In terms of the relevance of immune cell-associated signaling pathways, TCR signaling in Naïve CD4+/CD8+ T cells was significantly activated in both COVID-19 and IPF groups. In addition, CD40/CD40L signaling were negatively correlated only in SARS-CoV-2 infected lung tissues, which was suggested to be associated with the differentiation and proliferation of activated B cells in germinal center [55]. In the cluster of cytokine signaling, IL-1 and IL-6 signaling pathways were negatively correlated, while IL-4 and IL-12 signaling pathways were positively correlated, shows that IL-4 and IL-12 have more significant effects on COVID-19 lung tissue than other cytokines. Among the integrin associated interactions, β1, 3, and 5–7 integrin cell surface interaction were all negatively correlated in COVID-19 group. In contrast, β2 integrin (LFA-1) and α4β1 integrin were positively correlated, which are thought to be essential for B-cell activation and adhesion [56, 57]. Furthermore, among the many signaling pathways, the Insulin pathway showed a significant negative enrichment in both COVID-19 and IPF groups, the physiological significance of which needs to be further verified. Most of the other signaling pathways are unknown or require further study in relation to the immune response, all of which are listed in S3 Fig in S1 Data.

Differences in B-cell clustering subtypes between COVID-19 and IPF patients verified in single-cell RNA sequencing databases

CIBERSORT and similar deconvolution methods can be used to analyze the distribution of immune cells in the Bulk RNA-seq data, but the analysis of the signal pathways may not be accurate due to the overlap of activation between different cells. In order to further validate the phenomenon observed in bulk RNA-seq, we searched several publicly available single-cell RNA-sequencing databases and restricted the selecting criteria to those sampling from lung-associated body fluids or tissues. Chua et al. published single-cell RNA sequencing analysis of respiratory fluid samples from 20 patients with COVID-19 mild to severe disease, and Liao et al. published single-cell RNA sequencing analysis of bronchoalveolar lavage fluid (BALF) from 9 patients with COVID-19 mild to severe disease (GSE145926). On the other hand, Morse et al. published single-cell RNA sequencing analysis of lung tissues from IPF patients (GSE128033). According to the annotation provided by the authors, we circled B-cell populations from patients or healthy donors and evaluated the overall percentages of expression and the average standardized expression of each gene in the B-cell specific signatures. The average standardized expression of all genes was further analyzed (Fig 6). The results showed that genes in naive B cell specific signature was significantly higher in COVID-19 patient B cells, whereas genes in memory B cell specific signature were generally increased in IPF patient B cells, echoing our observations in bulk RNA-seq databases.

Fig 6. Validation of the phenotypes of B-cell populations in single-cell RNA sequencing of lung-associated body fluid samples from COVID-19 and IPF patients.

Fig 6

UMAP shows the visualized distribution of whole cells in single-cell RNA sequencing data from two COVID-19 databases (A) and IPF database (B). the “B-cell clusters” annotated by the original authors are shown in red, the unassigned cells are shown in gray. B-cell clusters are further colored red and blue to represent those from disease or healthy donors (HC) respectively. The proportion and average expression of genes related to Naive or Memory B cells in the B-cell population are shown as dot plots. Relative gene expression is shown in color and the overall B-cell expression gene percentage is shown in dot size (%). The mean Z scores of all genes’ expression in B cell populations from HC or disease groups are shown in Violin plot. Statistical significance was performed using Welch’s t-test approach. *: P < 0.05, **: P < 0.01, ***: P < 0.001. The actual P-values are shown under the asterisks as well.

B cells from patients with severe COVID-19 have genetic phenotype similar to antibody-secreting cells, and the percentage of B cell infiltration appears to associate with the severity of the disease

Given the results of the bulk RNA seq analysis, we were curious whether any specific distinctions existed between the B-cell lineage populations of patients with mild or severe COVID-19. Firstly, we divided the B cells from COVID-19 in two single-cell databases by disease severity and analyzed them by GSEA together with gene sets from C7 immune signature that belonged to B cell lineage, and then we cross match the two databases with the gene set enriched in moderate disease. The results showed that B cells from COVID-19 patients with severe symptoms had a tendency to be antibody secreting cells, such as plasma cells, plasmablast, IgM memory B cells, Follicular B cells, and unstimulated B cells compared to those from patients with mild symptoms (Fig 7A). As possessing both naïve B-cell and antibody-secreting B-cell properties, these cells are similar to the DN2 (Double negative) B-cells found in systemic lupus erythematosus proposed by Tipton et al. The researchers found that they are derived from naïve B cells and appear to be precursors of plasma cells [58]. The term double negative (DN) comes from the absence of immunoglobulin D and memory B-cell marker CD27 [59]. DN B cells are further classified into DN1(CXCR5+) and DN2(CXCR5-) by whether CXCR5 is expressed or not [60]. An increase in IRF4 and a decrease in IRF8 are thought to be associated with the promote naïve B cell differentiation into DN2 [61, 62]. The process of induced differentiation is accompanied by a decrease in TRAF5 and CD21 and an increase in CD11c [63]. These DN2 B cells are usually found in B-cell follicles rather than germinal centers and may therefore be genetically phenotypically different from GC-derived B cells. Taken together with the aforementioned observations, we hypothesized that B cells from the lungs of COVID-19 critically ill patients may be associated with DN2 B cells, which have been considered in recent years to be the main inducer of extrafollicular response [63, 64]. In-depth analysis of DN2 B-cell related genes revealed that the overall expression ratio and average expression of genes that were thought to be reduced in DN2 B cells, such as CD21, CXCR5, IRF8 and TRAF5, were lower in B cells from critically ill patients, especially in samples from BALF (Fig 7B). Further, we analyzed the ratio of B-cell infiltration and the clinical characteristics of the patients and showed that when B-cell infiltration > 1%, there was a longer hospitalization and ICU duration, a higher proportion of severe disease (66.7%), a higher proportion of no recovery (33.3%) and a higher proportion of multi-organ failure (55.6%), indicating an association between increased B-cell infiltration and severity of disease in COVID-19 patients (Fig 7C).

Fig 7. Assessing the phenotypic and clinical status differences of B-cell populations between mild and severe symptoms of COVID-19.

Fig 7

(A) Schematic diagram of the GSEA analyzing process for B cells of mild and severe COVID-19 patients in different databases. GSEA results were cross-referenced and the overlapping enriched data sets of the two databases were listed. The NES scores of each gene set are additionally represented by shades of red to represent the relative enrichment level. Those considered relevant to the B-cell population of patients with severe COVID-19 in the two immune cell comparisons are marked in red. Criteria for enrichment significance screening: P-value <0.05, FDR <0.25. (B) The dot plots represent the overall expression ratio and average expression of genes thought to be related to antibody-secreting cells and DN2 B cells. The overall expression percentage is represented by the size of the dots and the average expression is represented by the color. Welch’s t-test was performed to analyze the significance of gene expression in two conditions. **: P < 0.01, ***: P < 0.001. (C) Comparison of B-cell infiltration ratio and clinical status of patients with COVID-19. Box plots are shown for patients with B cell infiltration ratio < 1% (Black box) or > 1% (Red box) for duration of hospitalization and ICU (Days). Pie charts show the severity, recovery and complications of multi-organ failure in COVID-19 patients with different levels of B-cell infiltration.

Discussion

Based on in silico simulation of immune infiltration, this study reveals the impact of SARS-CoV-2 infection on the transcriptional estimated immune infiltration landscape of lung tissue. Significantly increased infiltration of CD4+/CD8+ T cells (Fig 2), Plasma cells (Fig 3) and M1 macrophage (Fig 4) was a common observation in patients with SARS-CoV-2 infection or IPF. It is believed that the increased infiltration of these immune cells is a major contributor to lung damage or fibrosis. However, the elevated naïve B cell infiltration (Fig 3), uncorrelated gene-sets of B cell proliferation and differentiation in the cluster of B cell mediated immune response (Fig 5), and the negative enrichment of CD40/CD40L signaling are all specific to the lung tissue of COVID-19 patients (S2 Fig in S1 Data). These results imply that the onset of the immune response in the lung tissue of COVID-19 patients may be correlated with elevated infiltration of naive B cells, which was further verified in single-cell RNA-seq databases (Figs 6 and 7).

The following hypothesis was developed based on the results of functional gene-set analysis combined with in silico immune infiltration profiling, which is similar to the context of influenza-specific B cell response [65]: SARS-CoV-2 infection may activate and promote the adhesion and accumulation of naïve B cells to the mediastinal lymph node through the activation of β2 integrin (LFA-1) and α4β1 integrin [57, 66], resulting in a decreased proportion of naïve B cells in the peripheral blood [67, 68]. The increased infiltration of naïve B cells activated by spike proteins of SARS-CoV-2 secretes large amounts of IgM to promote humoral immune response [65, 69]. Monocytes are recruited and differentiate to macrophage in response to the amplified humoral immune response [70]. Secreted IgM simultaneously activates the complement system and Fc receptor in dendritic cells and macrophage to increase antigen presentation and phagocytosis to facilitate innate and adaptive immunity [71, 72]. In addition, abundant spike protein from SARS-CoV-2 presented by antigen-presenting cell (APC) may leads to rapid induction of extrafollicular (EF) response through stimulating IL-12-dependent plasma cell differentiation in naïve B cell to produce more IgM [65, 73], instead of promoting B cell proliferation and differentiation by CD40/CD40L signaling mediated germinal center formation [74].

Based on this hypothesis, we believe that by suppressing the growth or migration of naïve B cells, it may help to reduce the excessive immune response caused by naïve B cells associated humoral immune response to reduce the risk of lung damage after SARS-CoV-2 infection, which have also been reported recently [75]. Remarkably, Quinti et al. reported several COVID-19 patients with primary antibody deficiencies (PAD) who clinically exhibited strikingly different extents of symptoms [76]. Five of the patients with Common Variable Iummune Deficiency (CVID) had severe COVID-19 symptoms. B cells in patients with CVID fail to differentiate into memory B cells, which maintain their properties similar to naïve B cells and continue to release IgM and IgG [77]. In contrast, 2 COVID-19 patients with agammaglobulinemia had mild symptoms and favorable outcome. These patients were congenitally deficient in B cells and plasma cells due to mutations in the gene encoding bruton kinase, which is essential for B cell survival [78]. This report increases the likelihood of our hypothesis that naïve B cells act as the trigger of severe respiratory and pulmonary symptoms of COVID-19.

In the clinical treatment of abnormal increases in serum IgM or neoplastic B cells, ibrutinib, which inhibits the activity of Bruton Kinase in the B cell receptor signaling pathway [79], is now commonly used to reduce the abnormal proliferation of B cells [80, 81]. Treon et al. reported that the use of ibrutinib may be useful to reduce lung damage from SARS-CoV-2 infection by suppressing the number of B cells [82]. Fingolimod, as an agonist of the S1P1/3 receptor, is thought to inhibit B cell egress out of lymph node through overstimulation of the B cell S1P1/3 receptor [83, 84]. Foerch et al. reported that multiple sclerosis patient with severe COVID-19 infection was being treated with Fingolimod. The patient improved rapidly right after appropriate therapy [85]. Integrin complexes have multiple implications in immune cell migration and viral infection. Immune cells use LFA-1 and α4β1 integrin for migration and adhesion [86, 87], which are also important for B cell [56, 57]. In addition, SARS-CoV-2 viral protein is known to bind to ACE2 or integrin heterodimers to facilitate virus entry and infection [88]. Borriello et al. reported on a COVID-19 patient who was using the α4β1 integrin targeted monoclonal antibody natalizumab [89], which functions to reduces B-cell migration by blocking α4β1 integrin and have significant effect in increasing circulating B cells [90, 91]. The patient improved significantly with appropriate treatment and no new symptoms developed or worsened. In terms of targeting interleukin, increased expression of IL-4 is thought to be associated with IPF, and dupilumab was also effective in asthma exacerbations, implying that inhibition of IL-4 may alleviate lung damage caused by SARS-CoV-2 by attenuating inflammation [92, 93]. Conversely, IL-12 is associated with the suppression of pulmonary fibrosis, and the clinical application in COVID-19 remains need to be further clarified [94, 95].

In summary, in silico simulated immune infiltration combined with gene function enrichment analysis provide us novel perspective on the immune system impact of SARS-CoV-2 infection. It is hoped that these findings will lead to new opportunities for the clinical treatment of COVID-19. All hypothetical mechanisms and locations of B cell targeted treatment acting are shown in Fig 8.

Fig 8. Diagrammatic representation illustrate the postulated role of naïve B cell in triggering humoral immune response in lung tissues of COVID-19 patients and the locations where B cell targeted therapeutic strategies function.

Fig 8

SARS-CoV-2 infection may activate and promote the adhesion and accumulation of naïve B cells to the mediastinal lymph node with β2 integrin (LFA-1) and α4β1 integrin. The increased infiltration of naïve B cells activated by spike proteins from SARS-CoV-2 secretes large amounts of IgM to promote extrafollicular response and humoral immune response. Monocytes are recruited and differentiate to macrophage in response to the robust humoral immune response. Secreted IgM simultaneously activates the complement system and Fc receptor in dendritic cells and macrophage to increase antigen presentation and phagocytosis to facilitate innate and adaptive immunity. On the other hand, high affinity or abundant SARS-CoV-2 presented by antigen-presenting cell (APC) such as dendritic cell may stimulate IL-12-dependent plasma cell differentiation in naïve B cells to produce more IgM, instead of promoting B cell proliferation and differentiation by CD40/CD40L signaling mediated germinal center formation. Ibrutinib inhibits B-cell growth by specifically inhibiting Bruton kinase, which is thought to be critical for the BCR signaling pathway; Natalizumab reduces B-cell migration by blocking α4β1 integrin. Fingolimod reduces B-cell egress out of the lymph node by stimulating S1PR1/3 to internalize the receptors.

Supporting information

S1 Data

(PDF)

Acknowledgments

We would like to thank Dr. Ting from Massachusetts General Hospital for uploading the valuable RNA-seq analysis results of organ samples from COVID-19 patients (NCBI GEO Access ID: GSE150316). We would also like to thank Dr. Kaminski at Yale University and Dr. Arron at Genentech, Inc. for uploading the valuable results of lung tissue sample analysis in IPF patients (NCBI GEO Access ID: GSE124685 and GSE53845).

Limitations of the study

We understand that computer modelling of immuno-infiltration has its limits and problems of over-interpretation, and further experimental validation is often required. However, in the midst of the COVID-19 pandemic, we are eager to provide any significant data that can be used for interpretation or to increase confidence in COVID-19 treatment. Given the support of grant funding, the difficulty of collecting samples from COVID-19 patients, the limitations of the research environment and resources, and the fact that time is of the essence, we are strong-minded to announce the data as soon as possible, in the hope that we can do our part to contribute to the treatment of COVID-19.

Data Availability

All gene expression profiling files are available from the NCBI GEO database or website of the original articles published by Chua et al. (GEO accession numbers: GSE53845, GSE128033, GSE124685, GSE145926, and GSE150316.).

Funding Statement

This work was supported by grants from Ministry of National Defense-Medical Affairs Bureau [MND-MAB-109-021 to YLC], Tri-Service General Hospital [TSGH-C108-044 and TSGH-E-109213 to YYW] and the Ministry of Science and Technology, Taiwan (R.O.C.) [MOST108-2311-B-016-001- and MOST109-2320-B-016-004- to YLC; MOST108-2635-B-016-001- to YYW; MOST108-2314-B-016-036- and MOST109-2314-B-016-050- to CLH].

References

  • 1.Zhu N., et al. , A Novel Coronavirus from Patients with Pneumonia in China, 2019. N Engl J Med, 2020. 382(8): p. 727–733. 10.1056/NEJMoa2001017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Li Q., et al. , Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia. N Engl J Med, 2020. 382(13): p. 1199–1207. 10.1056/NEJMoa2001316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Arshad Ali S., et al. , The outbreak of Coronavirus Disease 2019 (COVID-19)-An emerging global health threat. J Infect Public Health, 2020. 13(4): p. 644–646. 10.1016/j.jiph.2020.02.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Thanh Le T., et al. , The COVID-19 vaccine development landscape. Nat Rev Drug Discov, 2020. 19(5): p. 305–306. 10.1038/d41573-020-00073-5 [DOI] [PubMed] [Google Scholar]
  • 5.Yao X., et al. , In Vitro Antiviral Activity and Projection of Optimized Dosing Design of Hydroxychloroquine for the Treatment of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Clin Infect Dis, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Arabi Y.M., Fowler R., and Hayden F.G., Critical care management of adults with community-acquired severe respiratory viral infection. Intensive Care Med, 2020. 46(2): p. 315–328. 10.1007/s00134-020-05943-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Klopfenstein T., et al. , Diarrhea: An underestimated symptom in Coronavirus disease 2019. Clin Res Hepatol Gastroenterol, 2020. 10.1016/j.clinre.2020.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Chen T., et al. , Clinical characteristics and outcomes of older patients with coronavirus disease 2019 (COVID-19) in Wuhan, China (2019): a single-centered, retrospective study. J Gerontol A Biol Sci Med Sci, 2020. 10.1093/gerona/glaa089 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hjelmesaeth J. and Skaare D., Loss of smell or taste as the only symptom of COVID-19. Tidsskr Nor Laegeforen, 2020. 140(7). [DOI] [PubMed] [Google Scholar]
  • 10.Guan W.J., et al. , Clinical Characteristics of Coronavirus Disease 2019 in China. N Engl J Med, 2020. 382(18): p. 1708–1720. 10.1056/NEJMoa2002032 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Young B.E., et al. , Epidemiologic Features and Clinical Course of Patients Infected With SARS-CoV-2 in Singapore. JAMA, 2020. 10.1001/jama.2020.3204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Huang C., et al. , Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet, 2020. 395(10223): p. 497–506. 10.1016/S0140-6736(20)30183-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mo X., et al. , Abnormal pulmonary function in COVID-19 patients at time of hospital discharge. Eur Respir J, 2020. 10.1183/13993003.01217-2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hull J.H., Lloyd J.K., and Cooper B.G., Lung function testing in the COVID-19 endemic. Lancet Respir Med, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Spagnolo P., et al. , Pulmonary fibrosis secondary to COVID-19: a call to arms? Lancet Respir Med, 2020. 10.1016/S2213-2600(20)30222-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Matthay M.A. and Zemans R.L., The acute respiratory distress syndrome: pathogenesis and treatment. Annu Rev Pathol, 2011. 6: p. 147–63. 10.1146/annurev-pathol-011110-130158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Matthay M.A., Ware L.B., and Zimmerman G.A., The acute respiratory distress syndrome. J Clin Invest, 2012. 122(8): p. 2731–40. 10.1172/JCI60331 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zemans R.L., Colgan S.P., and Downey G.P., Transepithelial migration of neutrophils: mechanisms and implications for acute lung injury. Am J Respir Cell Mol Biol, 2009. 40(5): p. 519–35. 10.1165/rcmb.2008-0348TR [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wynn T.A., Cellular and molecular mechanisms of fibrosis. J Pathol, 2008. 214(2): p. 199–210. 10.1002/path.2277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Felsenstein S., et al. , COVID-19: Immunology and treatment options. Clin Immunol, 2020. 215: p. 108448 10.1016/j.clim.2020.108448 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wen Wen, S. W., Tang Hao, Le Wenqing, Zhang Xiaopeng, Zheng Yingfeng, Liu XiuXing, et al. Xiao Chuan-Le, Chen Wei, Wang Hongyang, Immune Cell Profiling of COVID-19 Patients in the Recovery Stage by Single-Cell Sequencing. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Liao M., et al. , Single-cell landscape of bronchoalveolar immune cells in patients with COVID-19. Nat Med, 2020. 10.1038/s41591-020-0901-9 [DOI] [PubMed] [Google Scholar]
  • 23.Newman A.M., et al. , Robust enumeration of cell subsets from tissue expression profiles. Nat Methods, 2015. 12(5): p. 453–7. 10.1038/nmeth.3337 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Chen B., et al. , Profiling Tumor Infiltrating Immune Cells with CIBERSORT. Methods Mol Biol, 2018. 1711: p. 243–259. 10.1007/978-1-4939-7493-1_12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Newman A.M., et al. , Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat Biotechnol, 2019. 37(7): p. 773–782. 10.1038/s41587-019-0114-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.DePianto D.J., et al. , Heterogeneous gene expression signatures correspond to distinct lung pathologies and biomarkers of disease severity in idiopathic pulmonary fibrosis. Thorax, 2015. 70(1): p. 48–56. 10.1136/thoraxjnl-2013-204596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.McDonough J.E., et al. , Transcriptional regulatory model of fibrosis progression in the human lung. JCI Insight, 2019. 4(22). 10.1172/jci.insight.131597 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chua R.L., et al. , COVID-19 severity correlates with airway epithelium-immune cell interactions identified by single-cell analysis. Nat Biotechnol, 2020. 38(8): p. 970–979. 10.1038/s41587-020-0602-4 [DOI] [PubMed] [Google Scholar]
  • 29.Morse C., et al. , Proliferating SPP1/MERTK-expressing macrophages in idiopathic pulmonary fibrosis. Eur Respir J, 2019. 54(2). 10.1183/13993003.02441-2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Subramanian A., et al. , Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A, 2005. 102(43): p. 15545–50. 10.1073/pnas.0506580102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Merico D., et al. , Enrichment map: a network-based method for gene-set enrichment visualization and interpretation. PLoS One, 2010. 5(11): p. e13984 10.1371/journal.pone.0013984 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.George P.M., Wells A.U., and Jenkins R.G., Pulmonary fibrosis and COVID-19: the potential role for antifibrotic therapy. Lancet Respir Med, 2020. 10.1016/S2213-2600(20)30225-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sturm G., et al. , Comprehensive evaluation of transcriptome-based cell-type quantification methods for immuno-oncology. Bioinformatics, 2019. 35(14): p. i436–i445. 10.1093/bioinformatics/btz363 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Yang X., et al. , Identification and validation of an immune cell infiltrating score predicting survival in patients with lung adenocarcinoma. J Transl Med, 2019. 17(1): p. 217 10.1186/s12967-019-1964-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Rohr-Udilova N., et al. , Deviations of the immune cell landscape between healthy liver and hepatocellular carcinoma. Sci Rep, 2018. 8(1): p. 6220 10.1038/s41598-018-24437-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.https://software.broadinstitute.org/morpheus/
  • 37.Libraty D.H., et al. , Human CD4(+) memory T-lymphocyte responses to SARS coronavirus infection. Virology, 2007. 368(2): p. 317–21. 10.1016/j.virol.2007.07.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wang W., et al. , High-dimensional immune profiling by mass cytometry revealed immunosuppression and dysfunction of immunity in COVID-19 patients. Cell Mol Immunol, 2020. 17(6): p. 650–652. 10.1038/s41423-020-0447-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Xue J., et al. , Plasma B lymphocyte stimulator and B cell differentiation in idiopathic pulmonary fibrosis patients. J Immunol, 2013. 191(5): p. 2089–95. 10.4049/jimmunol.1203476 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Heukels P., et al. , Enhanced Bruton’s tyrosine kinase in B-cells and autoreactive IgA in patients with idiopathic pulmonary fibrosis. Respir Res, 2019. 20(1): p. 232 10.1186/s12931-019-1195-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Benson M.J., et al. , Cutting edge: the dependence of plasma cells and independence of memory B cells on BAFF and APRIL. J Immunol, 2008. 180(6): p. 3655–9. 10.4049/jimmunol.180.6.3655 [DOI] [PubMed] [Google Scholar]
  • 42.Vinuesa C.G., et al. , Follicular Helper T Cells. Annu Rev Immunol, 2016. 34: p. 335–68. 10.1146/annurev-immunol-041015-055605 [DOI] [PubMed] [Google Scholar]
  • 43.Asai Y., et al. , Aberrant populations of circulating T follicular helper cells and regulatory B cells underlying idiopathic pulmonary fibrosis. Respir Res, 2019. 20(1): p. 244 10.1186/s12931-019-1216-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Nielsen Sandra C. A., Yang Fan, Hoh Ramona A., Jackson Katherine J. L., Roeltgen Katharina, Lee Ji-Yeun, et al. , B cell clonal expansion and convergent antibody responses to SARS-CoV-2. Infectious Diseases, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Byrne A.J., Maher T.M., and Lloyd C.M., Pulmonary Macrophages: A New Therapeutic Pathway in Fibrosing Lung Disease? Trends Mol Med, 2016. 22(4): p. 303–316. 10.1016/j.molmed.2016.02.004 [DOI] [PubMed] [Google Scholar]
  • 46.Teoh A.K.Y., et al. , Blood monocyte counts as a potential prognostic marker for idiopathic pulmonary fibrosis: analysis from the Australian IPF registry. Eur Respir J, 2020. 55(4). 10.1183/13993003.01855-2019 [DOI] [PubMed] [Google Scholar]
  • 47.Hashimoto D., et al. , Tissue-resident macrophages self-maintain locally throughout adult life with minimal contribution from circulating monocytes. Immunity, 2013. 38(4): p. 792–804. 10.1016/j.immuni.2013.04.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Guilliams M., et al. , Alveolar macrophages develop from fetal monocytes that differentiate into long-lived cells in the first week of life via GM-CSF. J Exp Med, 2013. 210(10): p. 1977–92. 10.1084/jem.20131199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Wynn T.A. and Vannella K.M., Macrophages in Tissue Repair, Regeneration, and Fibrosis. Immunity, 2016. 44(3): p. 450–462. 10.1016/j.immuni.2016.02.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Luca Carsana M., et al. , Pulmonary post-mortem findings in a series of COVID-19 cases from northern Italy: a two-centre descriptive study. The Lancet Infectious Diseases, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Cabrera-Benitez N.E., et al. , Mechanical ventilation-associated lung fibrosis in acute respiratory distress syndrome: a significant contributor to poor outcome. Anesthesiology, 2014. 121(1): p. 189–98. 10.1097/ALN.0000000000000264 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Ni L., et al. , Detection of SARS-CoV-2-Specific Humoral and Cellular Immunity in COVID-19 Convalescent Individuals. Immunity, 2020. 10.1016/j.immuni.2020.04.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Long Q.X., et al. , Antibody responses to SARS-CoV-2 in patients with COVID-19. Nat Med, 2020. 10.1038/s41591-020-0897-1 [DOI] [PubMed] [Google Scholar]
  • 54.Schaefer C.F., et al. , PID: the Pathway Interaction Database. Nucleic Acids Res, 2009. 37(Database issue): p. D674–9. 10.1093/nar/gkn653 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Ma D.Y. and Clark E.A., The role of CD40 and CD154/CD40L in dendritic cells. Semin Immunol, 2009. 21(5): p. 265–72. 10.1016/j.smim.2009.05.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Lu T.T. and Cyster J.G., Integrin-mediated long-term B cell retention in the splenic marginal zone. Science, 2002. 297(5580): p. 409–12. 10.1126/science.1071632 [DOI] [PubMed] [Google Scholar]
  • 57.Carrasco Y.R., et al. , LFA-1/ICAM-1 interaction lowers the threshold of B cell activation by facilitating B cell adhesion and synapse formation. Immunity, 2004. 20(5): p. 589–99. 10.1016/s1074-7613(04)00105-0 [DOI] [PubMed] [Google Scholar]
  • 58.Tipton C.M., et al. , Diversity, cellular origin and autoreactivity of antibody-secreting cell population expansions in acute systemic lupus erythematosus. Nat Immunol, 2015. 16(7): p. 755–65. 10.1038/ni.3175 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Wei C., et al. , A new population of cells lacking expression of CD27 represents a notable component of the B cell memory compartment in systemic lupus erythematosus. J Immunol, 2007. 178(10): p. 6624–33. 10.4049/jimmunol.178.10.6624 [DOI] [PubMed] [Google Scholar]
  • 60.Ehrhardt G.R., et al. , Discriminating gene expression profiles of memory B cell subpopulations. J Exp Med, 2008. 205(8): p. 1807–17. 10.1084/jem.20072682 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Nutt S.L., et al. , The generation of antibody-secreting plasma cells. Nat Rev Immunol, 2015. 15(3): p. 160–71. 10.1038/nri3795 [DOI] [PubMed] [Google Scholar]
  • 62.Xu H., et al. , Regulation of bifurcating B cell trajectories by mutual antagonism between transcription factors IRF4 and IRF8. Nat Immunol, 2015. 16(12): p. 1274–81. 10.1038/ni.3287 [DOI] [PubMed] [Google Scholar]
  • 63.Jenks S.A., et al. , Distinct Effector B Cells Induced by Unregulated Toll-like Receptor 7 Contribute to Pathogenic Responses in Systemic Lupus Erythematosus. Immunity, 2018. 49(4): p. 725–739 e6. 10.1016/j.immuni.2018.08.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Jenks S.A., et al. , Extrafollicular responses in humans and SLE. Immunol Rev, 2019. 288(1): p. 136–148. 10.1111/imr.12741 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Lam J.H. and Baumgarth N., The Multifaceted B Cell Response to Influenza Virus. J Immunol, 2019. 202(2): p. 351–359. 10.4049/jimmunol.1801208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Spaargaren M., et al. , The B cell antigen receptor controls integrin activity through Btk and PLCgamma2. J Exp Med, 2003. 198(10): p. 1539–50. 10.1084/jem.20011866 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Waffarn E.E. and Baumgarth N., Protective B cell responses to flu—no fluke! J Immunol, 2011. 186(7): p. 3823–9. 10.4049/jimmunol.1002090 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Waffarn E.E., et al. , Infection-induced type I interferons activate CD11b on B-1 cells for subsequent lymph node accumulation. Nat Commun, 2015. 6: p. 8991 10.1038/ncomms9991 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Choi Y.S. and Baumgarth N., Dual role for B-1a cells in immunity to influenza virus infection. J Exp Med, 2008. 205(13): p. 3053–64. 10.1084/jem.20080979 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Shi C. and Pamer E.G., Monocyte recruitment during infection and inflammation. Nat Rev Immunol, 2011. 11(11): p. 762–74. 10.1038/nri3070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Sorman A., et al. , How antibodies use complement to regulate antibody responses. Mol Immunol, 2014. 61(2): p. 79–88. 10.1016/j.molimm.2014.06.010 [DOI] [PubMed] [Google Scholar]
  • 72.Wang H., Coligan J.E., and Morse H.C. 3rd, Emerging Functions of Natural IgM and Its Fc Receptor FCMR in Immune Homeostasis. Front Immunol, 2016. 7: p. 99 10.3389/fimmu.2016.00099 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Dubois B., et al. , Critical role of IL-12 in dendritic cell-induced differentiation of naive B lymphocytes. J Immunol, 1998. 161(5): p. 2223–31. [PubMed] [Google Scholar]
  • 74.Huse K., et al. , Human Germinal Center B Cells Differ from Naive and Memory B Cells in CD40 Expression and CD40L-Induced Signaling Response. Cytometry A, 2019. 95(4): p. 442–449. 10.1002/cyto.a.23737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Woodruff Matthew, R. R., Cashman Kevin, Nguyen Doan, Ley Ariel, Kyu Shuya, Saini Ankur, et al. , Critically ill SARS-CoV-2 patients display lupus-like hallmarks of extrafollicular B cell activation. 2020. 32511635 [Google Scholar]
  • 76.Quinti I., et al. , A possible role for B cells in COVID-19? Lesson from patients with agammaglobulinemia. J Allergy Clin Immunol, 2020. 10.1016/j.jaci.2020.04.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Agematsu K., et al. , Absence of memory B cells in patients with common variable immunodeficiency. Clin Immunol, 2002. 103(1): p. 34–42. 10.1006/clim.2001.5197 [DOI] [PubMed] [Google Scholar]
  • 78.Vihinen M., Mattsson P.T., and Smith C.I., Bruton tyrosine kinase (BTK) in X-linked agammaglobulinemia (XLA). Front Biosci, 2000. 5: p. D917–28. [DOI] [PubMed] [Google Scholar]
  • 79.Davids M.S. and Brown J.R., Ibrutinib: a first in class covalent inhibitor of Bruton’s tyrosine kinase. Future Oncol, 2014. 10(6): p. 957–67. 10.2217/fon.14.51 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Raedler L.A., Imbruvica (Ibrutinib): First Drug Approved for the Treatment of Patients with Waldenstrom’s Macroglobulinemia. Am Health Drug Benefits, 2016. 9(Spec Feature): p. 89–92. [PMC free article] [PubMed] [Google Scholar]
  • 81.Drennan S., et al. , Ibrutinib Therapy Releases Leukemic Surface IgM from Antigen Drive in Chronic Lymphocytic Leukemia Patients. Clin Cancer Res, 2019. 25(8): p. 2503–2512. 10.1158/1078-0432.CCR-18-1286 [DOI] [PubMed] [Google Scholar]
  • 82.Treon S.P., et al. , The BTK inhibitor ibrutinib may protect against pulmonary injury in COVID-19-infected patients. Blood, 2020. 135(21): p. 1912–1915. 10.1182/blood.2020006288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Chun J. and Hartung H.P., Mechanism of action of oral fingolimod (FTY720) in multiple sclerosis. Clin Neuropharmacol, 2010. 33(2): p. 91–101. 10.1097/WNF.0b013e3181cbf825 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Huwiler A. and Pfeilschifter J., Sphingolipid signaling in renal fibrosis. Matrix Biol, 2018. 68–69: p. 230–247. 10.1016/j.matbio.2018.01.006 [DOI] [PubMed] [Google Scholar]
  • 85.Barzegar M., et al. , COVID-19 infection in a patient with multiple sclerosis treated with fingolimod. Neurol Neuroimmunol Neuroinflamm, 2020. 7(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Giblin P.A. and Lemieux R.M., LFA-1 as a key regulator of immune function: approaches toward the development of LFA-1-based therapeutics. Curr Pharm Des, 2006. 12(22): p. 2771–95. 10.2174/138161206777947731 [DOI] [PubMed] [Google Scholar]
  • 87.Verma N.K. and Kelleher D., Not Just an Adhesion Molecule: LFA-1 Contact Tunes the T Lymphocyte Program. J Immunol, 2017. 199(4): p. 1213–1221. 10.4049/jimmunol.1700495 [DOI] [PubMed] [Google Scholar]
  • 88.Hussein H.A., et al. , Beyond RGD: virus interactions with integrins. Arch Virol, 2015. 160(11): p. 2669–81. 10.1007/s00705-015-2579-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Borriello G. and Ianniello A., COVID-19 occurring during Natalizumab treatment: a case report in a patient with extended interval dosing approach. Mult Scler Relat Disord, 2020. 41: p. 102165 10.1016/j.msard.2020.102165 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Putzki N., et al. , Effects of natalizumab on circulating B cells, T regulatory cells and natural killer cells. Eur Neurol, 2010. 63(5): p. 311–7. 10.1159/000302687 [DOI] [PubMed] [Google Scholar]
  • 91.Saraste M., Penttila T.L., and Airas L., Natalizumab treatment leads to an increase in circulating CXCR3-expressing B cells. Neurol Neuroimmunol Neuroinflamm, 2016. 3(6): p. e292 10.1212/NXI.0000000000000292 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Patruno C., et al. , Dupilumab and COVID-19: What should we expect? Dermatol Ther, 2020: p. e13502 10.1111/dth.13502 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Huaux F., et al. , Dual roles of IL-4 in lung injury and fibrosis. J Immunol, 2003. 170(4): p. 2083–92. 10.4049/jimmunol.170.4.2083 [DOI] [PubMed] [Google Scholar]
  • 94.Wynn T.A., et al. , An IL-12-based vaccination method for preventing fibrosis induced by schistosome infection. Nature, 1995. 376(6541): p. 594–6. 10.1038/376594a0 [DOI] [PubMed] [Google Scholar]
  • 95.Keane M.P., et al. , IL-12 attenuates bleomycin-induced pulmonary fibrosis. Am J Physiol Lung Cell Mol Physiol, 2001. 281(1): p. L92–7. 10.1152/ajplung.2001.281.1.L92 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Mrinmoy Sanyal

7 Sep 2020

PONE-D-20-21528

In silico immune infiltration profiling combined with functional enrichment analysis reveals the specific role of naive B cells as a trigger for severe immune responses in the lungs of COVID-19 patients

PLOS ONE

Dear Dr. Chiu,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

The study is a very timely contribution by the authors to elucidate the etiology of lung damage in COVID-19 patients. In general, the manuscript is well written. The Reviewers and the Editor agree that this is an exciting study. We believe, addressing the Reviewer’s comments will significantly strengthen the manuscript.

Please submit your revised manuscript by Oct 22 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Mrinmoy Sanyal, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Wu et al. employed CIBERSORT method to investigate common and differences in the level of immune cell infiltration in lung tissues of COVID-19 patients compared with patients of Idiopathic Pulmonary Fibrosis (IPF). They determined that several immune cell sub-types, particularly naive B-cells, are highly infiltrated in COVID-19 patients. Besides that they reported several other important findings using CIBERSORT and gene set enrichment based analysis (Figs 2-6). Overall this is an interesting study. However, the study is suffering from lack of validation results. Authors have stated this limitation and addressed this major issue only by indicating that in the midst of COVID-19 pandemic, they are eager to provide this data that can be used for interpretation or to increase confidence in COVID-19 treatment. Although rapid data generation is a critical step during this pandemic situation; however, ideally we should not compromise with reproducibility, scientific rigor, and quality of the data.

Major comments

=============

1. It is understandable that the experimental validation of the major prediction results reported in this manuscript is a time consuming task; however, some levels of validation are required to determine the quality of the reported prediction results. Authors are suggested to evaluate single cell RNA-seq data and investigate whether the prediction results they obtained from the bulk RNA-seq data using the tool CIBERSORT are reproducible in the actual single cell resolution. A quick pubmed search shows, recently Liao et al. (Nature Medicine, May 2020; https://doi.org/10.1038/s41591-020-0901-9) published single cell landscape of bronchoalveolar immune cells extracted from moderate to sever patients with COVID-19 and healthy control and the data are publicly available (GSE128033 and GSM3660650). Authors are suggested to do a comprehensive literature search and check if this study or any additional studies could be used as the validation data. Furthermore, multiple single cell RNA-seq data on IPF (GSE94555 and GSE86618) are also available in the public repositories. Therefore, authors are suggested to take this opportunity and investigate whether their reported comparative results are reproducible in a completely independent validation data set.

2. If the single cell data are not suitable for the analysis because of some valid reasons, authors are suggested to at least evaluate independent RNA-seq data and examine whether the prediction results obtained from the training data are consistent in the independent validation data.

3. For the Gene set enrichment analysis FDR cut-off 0.25 was chosen to determine significantly enriched gene sets. Although this cut-off is not uncommon; however, a general trend is to use FDR < 0.05 to determine significance. I am wondering how much the results would be changed with the FDR < 0.05 compared with the current results. This is important to evaluate, particularly in the absence of validation data, because such a low cut-off value may introduce high-false positives.

Reviewer #2: Summary:

This is a timely manuscript presenting a comparative in silico analysis of estimated cell type frequencies and gene expression changes in the lungs of deceased COVID-19 patients and idiopathic pulmonary fibrosis patients. Using CIBERSORT to estimate cell subset abundancies from gene expression profiles, the authors find certain cell types, such as monocytes and naïve B cells, uniquely increased in fatal COVID-19 samples. Furthermore, gene set enrichment analysis revealed changes in functional and signaling pathways, including decreases in CD40/CD40L signaling and alterations in integrin expression. The authors suggest that these results implicate naïve B cells as a potential mediator of lung pathology in COVID-19. Overall, this manuscript presents original research and the conclusions are supported by the data. However, there are some concerns about the limitations of the study and presentation of the results that should be addressed prior to acceptance.

Major concerns:

1. The title overstates the findings and should be changed to reflect the fact that the role for naïve B cells in COVID-19 lung pathology is not proven by this analysis. A potential title could be: “In silico immune infiltration profiling combined with functional enrichment analysis [suggests a/reveals a potential] role for naïve B cells as a trigger for severe immune responses in the lungs of COVID-19 patients”.

2. A significant limitation which is not addressed is that all of the COVID-19 samples analyzed were from fatal cases. To understand whether naïve B cell infiltration plays a specific role in the pathology of severe/fatal COVID-19, it would be important to examine whether these changes also occur in mild/moderate cases where there is a lack of significant lung damage or fibrosis. If the samples/data are not available to make this comparison, this limitation should be raised in the discussion.

3. The authors state the following rationale for comparing COVID-19 and IPF: “Nearly all of the patients who died from COVID-19 had severe lung tissue damage and pulmonary fibrosis[29]. On the other hand, mortality in IPF is generally the result of progressive fibrotic lung disease.” If pulmonary fibrosis is a cause of mortality in both cases, it would make sense to look for common signatures rather than those unique to COVID-19. I think further clarification on the rationale here would be useful.

4. Figures 2-4: CIBERSORT can return relative cellular fractions instead of absolute score. These would be easier to interpret as an estimated percentage of total cells. In some cases the y axis scale changes significantly between datasets, making it harder to compare. Additionally, as a positive control it would be good to show that the relative cellular fractions of different subsets in the healthy samples match the approximate expected distribution of cell types in normal lung tissue. This could be included as a supplementary figure.

5. Figure 5: Given that there are ~7000 GO gene sets, an FDR<0.25 cutoff is quite relaxed and will likely result in many false positives. Do the results change if using a stricter cutoff such as 0.05?

6. Figure 5-6: In panel 5A, it appears the results from the two IPF datasets have been merged, but in panel 5B and Figure 6 they are separated. The authors should merge these results as well in panel 5B/Figure 6 to simplify the visualization and make a clearer message. Furthermore, the method of merging is not described. I would suggest running the enrichment analysis separately in each dataset and identifying gene sets that are consistently enriched in both datasets to produce the most robust results.

7. Figure 5B: The figure legend mentions “positive correlations”. If these are correlations, what are the variables being correlated? Instead, are these gene set enrichments? If so, the authors should change “positive correlations” to “positive enrichments” or “upregulated pathways versus healthy”. The legend also states that the edge represent overlap in genes between different gene sets. If this is so, how can an edge be dataset specific (colored), since the members of a gene set are predefined and not dataset dependent?

8. Figures 5/6: As is suggested by the differences in estimated cell frequencies in Figures 2-4, the changes in gene expression between COVID-19/IPF and healthy can be driven both by alterations in cell type abundancies and by transcriptional changes within a given cell type. The authors of CIBERSORT recently released a newer version, CIBERSORTx (Newman et al. Nat. Biotech. 2019) which allows estimation of cell type-specific gene expression within a mixture. I believe a valuable extension to the analysis presented would be to compare estimated gene expression between COVID-19/IPF and healthy within particular subsets of interest, such as naïve/memory B cells. This could provide further clarity, for example by revealing whether the upregulation of integrin pathways is occurring within naïve B cells of COVID-19 patients or if it is occurring in other cell types or simply due to changes in B cell frequencies.

9. Figure 6: The results section describing changes in various interleukin signaling pathways is not clear about the disease specificity. It should mention that these changes are common to IPF and COVID-19. Furthermore, IL-4 signaling, which is upregulated in COVID-19 (and IPF), is known to induce proliferation and class switching in B cells (Gascan et al., J. Exp. Med. 1991). This appears at odds with the finding of increased naïve B cells in COVID-19 samples. How do the authors explain this apparent dichotomy? This should be addressed.

Minor concerns:

1. Figure 5B: What does the size of the circles represent?

2. Figure 6: Are these enrichments significant? The cutoffs used to determine inclusion in the figure should be stated in the legend

3. Discussion: The authors refer to increased infiltration of different immune cells, it should be clarified that these are transcriptional estimates and not actual measurements of cell frequencies.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Dec 2;15(12):e0242900. doi: 10.1371/journal.pone.0242900.r002

Author response to Decision Letter 0


20 Oct 2020

Response to reviewer 1:

Dear reviewer, thank you very much for your valuable comments and suggestions on the direction of this paper, which have greatly help us improving the strength of evidence of our conclusions. Based on the suggestion, we attempted to search multiple public repositories and found the single-cell sequencing databases currently associated with COVID-19 lung-related immune cell infiltration provided by Chua et al. and Liao et al. (GSE145926), and the single-cell sequencing database associated with IPF lung immune cell infiltration published by Morse et al. (GSE128033). The GSE94555 and GSE86618 databases were also once included in our evaluation but were not used due to the small sample size. On the other hand, the GSE135893 IPF database contains a large number of single-cell analysis results, but due to the processing capacity of the equipment, it was not possible to analyze the relevant results of the B-cell population.

In order to achieve consistency with the CIBERSORT analysis to verify whether similar results occur in both the bulk RNA-seq and single-cell RNA-seq, and to allow for re-running the gene set clustering and visualization analysis with FDR < 0.05, we performed gene cluster overlap visualization analysis by matching the immune cell signature files used in CIBERSORT and converting them to Naïve or Memory specific gene sets. In addition, in order to reduce the complexity and improve the interpretation of the results, we polarized the enrichment results as red and green for COVID-19 or IPF respectively. The results of the single cell sequencing database and clinical situation analysis were added to Figure 6 and 7, while the PID-related analysis was included as supplementary data. For B-cell related signature analysis, since the number of gene sets has been filtered down to nearly 500, the use of FDR<0.05 may result in too few results, so please forgive us for adopting the more relaxed FDR<0.25 criterion in Figure 7A. The relevant amendments and new result paragraphs are described in the attached rebuttal letter.

Responses to Reviewer 2:

1. The title overstates the findings and should be changed to reflect the fact that the role for naïve B cells in COVID-19 lung pathology is not proven by this analysis. A potential title could be: “In silico immune infiltration profiling combined with functional enrichment analysis [suggests a/reveals a potential] role for naïve B cells as a trigger for severe immune responses in the lungs of COVID-19 patients”.

Response:

Many thanks to the valuable suggestion, we have corrected the title as follows: “In silico immune infiltration profiling combined with functional enrichment analysis reveals a potential role for naïve B cells as a trigger for severe immune responses in the lungs of COVID-19 patients.”

2. A significant limitation which is not addressed is that all of the COVID-19 samples analyzed were from fatal cases. To understand whether naïve B cell infiltration plays a specific role in the pathology of severe/fatal COVID-19, it would be important to examine whether these changes also occur in mild/moderate cases where there is a lack of significant lung damage or fibrosis. If the samples/data are not available to make this comparison, this limitation should be raised in the discussion.

Response:

We thank the reviewer for the suggestion. For moderate/severe case comparison, we analyzed two single cell RNA-seq databases that clearly indicate the severity of COVID-19, and further compared the phenotypic differences between B cells from different conditions and their degree of infiltration in relation to clinical status in Figure 7. The relevant amendments are described in the rebuttal letter.

3. The authors state the following rationale for comparing COVID-19 and IPF: “Nearly all of the patients who died from COVID-19 had severe lung tissue damage and pulmonary fibrosis[29]. On the other hand, mortality in IPF is generally the result of progressive fibrotic lung disease.” If pulmonary fibrosis is a cause of mortality in both cases, it would make sense to look for common signatures rather than those unique to COVID-19. I think further clarification on the rationale here would be useful.

Response:

We thank the reviewer for the suggestions and strongly agree that it is necessary and valuable to analyze the shared signatures of the two diseases. However, given the availability of raw data and the complex interactions of multiple immune cells, we are currently unable to analyze such a large amount of data and further expand the presentation of the results. For example, the data tentatively suggest that differences in macrophage are common to both diseases, a phenomenon also observed in our newly added single cell sequencing database, but a complete analysis of all macrophage-related gene set signatures and correlations would require more time, knowledge, and manpower to achieve. On the other hand, we did observe some possible disease-related findings, such as the suppression of insulin-associated signaling pathways in both diseases, which we believe may be related to the abnormal regulation of blood glucose in patients with pulmonary fibrosis, but decided to delete this discussion due to space limitation and lack of expertise. As such, we are grateful for your understanding of our decision to focus on the specific differences between the two diseases. Hopefully, the publication of this paper will allow more experts in various fields to conduct in-depth analysis on different faces.

4. Figures 2-4: CIBERSORT can return relative cellular fractions instead of absolute score. These would be easier to interpret as an estimated percentage of total cells. In some cases the y axis scale changes significantly between datasets, making it harder to compare. Additionally, as a positive control it would be good to show that the relative cellular fractions of different subsets in the healthy samples match the approximate expected distribution of cell types in normal lung tissue. This could be included as a supplementary figure.

Response:

Thank you very much for your suggestion and we couldn't agree with you more. We have evaluated the relative scores using Cibersort-X, but the presentation of the results may be affected by the over-representation of specific immune cells, which significantly reduces the impact of the analysis of the target cell clusters. On the other hand, there are other recent publications that analyzed the same database using different deconvolution strategies, and we decided not to include the presentation and discussion of relative cell scores in this paper due to the issue of repetitive publication.

Reference:

Cavalli, E., et al., Transcriptomic analysis of COVID19 lungs and bronchoalveolar lavage fluid samples reveals predominant B cell activation responses to infection. Int J Mol Med, 2020. 46(4): p. 1266-1273.

5. Figure 5: Given that there are ~7000 GO gene sets, an FDR<0.25 cutoff is quite relaxed and will likely result in many false positives. Do the results change if using a stricter cutoff such as 0.05?

Response:

Thanks to the reviewer's suggestion, we have recreated Figure 5 under stricter conditions, as described in the rebuttal letter.

6. Figure 5-6: In panel 5A, it appears the results from the two IPF datasets have been merged, but in panel 5B and Figure 6 they are separated. The authors should merge these results as well in panel 5B/Figure 6 to simplify the visualization and make a clearer message. Furthermore, the method of merging is not described. I would suggest running the enrichment analysis separately in each dataset and identifying gene sets that are consistently enriched in both datasets to produce the most robust results.

Response:

Thanks to the reviewer's suggestion, we have recreated Figure 5, which combines the IPF database results and makes the visualization easier to interpret. All corrections and results will be described below.

7. Figure 5B: The figure legend mentions “positive correlations”. If these are correlations, what are the variables being correlated? Instead, are these gene set enrichments? If so, the authors should change “positive correlations” to “positive enrichments” or “upregulated pathways versus healthy”. The legend also states that the edge represent overlap in genes between different gene sets. If this is so, how can an edge be dataset specific (colored), since the members of a gene set are predefined and not dataset dependent?

Response:

Thanks to the reviewer for the correction, this is indeed an error in our narration. On the one hand, the phenomenon of edge appearing colored no longer appears after updating the enrichment map, which may be a version-specific anomaly that causes us to misinterpret the results. The above error has been corrected after reproduction and is described below.

8. Figures 5/6: As is suggested by the differences in estimated cell frequencies in Figures 2-4, the changes in gene expression between COVID-19/IPF and healthy can be driven both by alterations in cell type abundancies and by transcriptional changes within a given cell type. The authors of CIBERSORT recently released a newer version, CIBERSORTx (Newman et al. Nat. Biotech. 2019) which allows estimation of cell type-specific gene expression within a mixture. I believe a valuable extension to the analysis presented would be to compare estimated gene expression between COVID-19/IPF and healthy within particular subsets of interest, such as naïve/memory B cells. This could provide further clarity, for example by revealing whether the upregulation of integrin pathways is occurring within naïve B cells of COVID-19 patients or if it is occurring in other cell types or simply due to changes in B cell frequencies.

Response:

We would like to thank Reviewer for his suggestion to re-analyze the results in Cibersort-X, but the data is too complex. For example, the specific genes screened by the software are currently unsupported by literature and are difficult to link to Naïve or memory B cells. Therefore, we used the LM22 immune cell gene signature from Cibersort to create specific signatures for Naïve or memory B cell, and then analyzed the bulk RNA-seq and single-cell RNA-seq to cross-validate the enrichment of the same signature in a wider variety of databases. Related gene expression is also specifically evaluated. We believe that such an approach would enhance the credibility of the results as well. All corrections and results have be described in the rebuttal letter.

9. Figure 6: The results section describing changes in various interleukin signaling pathways is not clear about the disease specificity. It should mention that these changes are common to IPF and COVID-19. Furthermore, IL-4 signaling, which is upregulated in COVID-19 (and IPF), is known to induce proliferation and class switching in B cells (Gascan et al., J. Exp. Med. 1991). This appears at odds with the finding of increased naïve B cells in COVID-19 samples. How do the authors explain this apparent dichotomy? This should be addressed.

Response:

Thanks to Reviewer's suggestion, we understand that it is difficult to directly relate the results of bulk RNA-seq samples to the effects of specific cell populations. However, it is possible that presenting the results of this kind of analysis may provide preliminary confirmation for other more in-depth studies, so we decided to present the results in supplementary data. On the other hand, we also tried to perform PID analysis in single cell RNA-seq, but the single B-cell diversity may be too divergent for simple dichotomy to obtain significant results, and thus the results are not presented. On the issue of IL-4 signaling, a report published this year indicated that the activation of IL-4 signaling may be related to the restore of DN2 B cell to naïve B cell. Although this result is somewhat contrary to our additional inference, since the role of DN2 B cells in autoimmunity and the detailed activation mechanism remain to be elucidated, the up-regulation of IL-4 signal may also be a potential inhibitory mechanism indirectly induced by the over-activation of DN2 B cells, and preserving the relevant data may help future research progress.

Reference:

Hsu, H.-C., et al., IL-4 synergizes with low-dose IL-2 to restore systemic lupus erythematosus B cells at the resting naive status. The Journal of Immunology, 2020. 204(1 Supplement): p. 218.8-218.8.

Minor concerns:

1. Figure 5B: What does the size of the circles represent?

Response:

Thanks to reviewer's correction, the size of the circle, which represents the number of genes in the gene set, is clearly indicated in the recreated figure.

2. Figure 6: Are these enrichments significant? The cutoffs used to determine inclusion in the figure should be stated in the legend

Response:

Thanks to reviewer's correction, all enrichments have been filtered for FDR < 0.05 and are noted specifically in the recreated diagram and legends.

3. Discussion: The authors refer to increased infiltration of different immune cells, it should be clarified that these are transcriptional estimates and not actual measurements of cell frequencies.

Response:

Thanks to the reviewer for the corrections, which have been made to the text.

Attachment

Submitted filename: Rebuttal letter v1.docx

Decision Letter 1

Mrinmoy Sanyal

6 Nov 2020

PONE-D-20-21528R1

In silico immune infiltration profiling combined with functional enrichment analysis reveals a potential role for naive B cells as a trigger for severe immune responses in the lungs of COVID-19 patients

PLOS ONE

Dear Dr. Chiu,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 21 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Mrinmoy Sanyal, PhD

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: (No Response)

Reviewer #2: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: Summary:

The authors have addressed most of my initial concerns. However, the updated analysis and results suggest that naïve B cells are not elevated in the BALF of severe COVID-19 patients relative to moderate cases, but that the B cells of severe patients may be enriched in antibody-secreting cells instead. This does not implicate naïve B cells in severe immune responses to COVID-19 and the conclusions of the manuscript need to be altered to be in agreement with these new findings.

Major concerns:

1. Figure 5B: How do the authors define ‘top gene’ in the memory v naïve comparison? Is this based on the top 100 most highly expressed genes from the LM22 signature for both the memory and naïve B cell subsets? This is not an ideal way to define a naïve vs memory specific signature, because it is not guaranteed to identify the most distinctive genes. The 20 genes unique to naïve could be rank 101-120 in memory, just missing the cutoff, for example. Therefore, the authors should validate their analyses by using an alternative memory vs naïve signature and see if they produce consistent results. One approach could be to use up/down DEG memory vs naïve B cell gene sets included in the C7:immunologic signature gene sets from MSigDB.

2. Figure 7: The the moderate versus severe B cell comparison here is interesting and informative. However, as the authors correctly point out, the results indicate that severe patients likely have increased plasma cells or other antibody-secreting cells relative to mild/moderate patients. These cells, as well as DN2 B cells, are not naïve. Therefore, this would suggest that naïve B cells are not involved in the pathology of severe COVID-19, but that potentially antibody-secreting cells are instead. This is at odds with the conclusions of the manuscript, including the title. Unless the authors have some alternative explanation as to why these results do not implicate antibody-secreting cells rather than naïve B cells in severe COVID-19, the conclusions of the manuscript need to be changed to better match these findings.

Minor concerns:

1. The end of the introduction mentions ‘anti-secretory cells’, is this supposed to be antibody-secreting cells (as in the abstract)?

2. Figure 5A: This panel might be easier to interpret if these results were separated into 3 plots (COVID-19, IPF, and common)

3. Figure 6: The Liao and Morse datasets both have ~40% B cells in control samples, but the Chua dataset has < 1% B cells in control samples. Why do these controls have such a low frequency of B cells? Are they appropriate controls?

4. Figure 7B: What is the source for these DN2 B cell gene markers? Are these differences significant?

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Dec 2;15(12):e0242900. doi: 10.1371/journal.pone.0242900.r004

Author response to Decision Letter 1


10 Nov 2020

Reviewer #2: Summary:

The authors have addressed most of my initial concerns. However, the updated analysis and results suggest that naïve B cells are not elevated in the BALF of severe COVID-19 patients relative to moderate cases, but that the B cells of severe patients may be enriched in antibody-secreting cells instead. This does not implicate naïve B cells in severe immune responses to COVID-19 and the conclusions of the manuscript need to be altered to be in agreement with these new findings.

Major concerns:

1. Figure 5B: How do the authors define ‘top gene’ in the memory v naïve comparison? Is this based on the top 100 most highly expressed genes from the LM22 signature for both the memory and naïve B cell subsets? This is not an ideal way to define a naïve vs memory specific signature, because it is not guaranteed to identify the most distinctive genes. The 20 genes unique to naïve could be rank 101-120 in memory, just missing the cutoff, for example. Therefore, the authors should validate their analyses by using an alternative memory vs naïve signature and see if they produce consistent results. One approach could be to use up/down DEG memory vs naïve B cell gene sets included in the C7:immunologic signature gene sets from MSigDB.

Response:

We really appreciate the suggestions, and we have indeed tried and considered a number of methods similar to this. We initially spent a lot of time searching the literature for a more representative gene-set of memory and naïve B-cell, but most of the literature only had one or two genes (e.g. CD27 for memory B cell) as representative of the B-cell population. Considering that we initially observed this phenomenon from the results of the Cibersort analysis and aimed to narrow down the results to focus on differences in expression of immune cell related genes in further single-cell sequencing databases, we finally decided to use the current method for the evaluation. There are two reasons for choosing the top 100 genes: firstly, in the LM22 document, the weighting (numbers of each gene in the LM22 document ) of the top 10 immune cell-specific genes is over 10,000 and drops to a few hundred after the top 100, and the weighting of the genes further down the list is insignificant for the software to determine the immune infiltration score of the immune cells. Secondly, among the immune cell-specific genes generated by this method, although there are indeed problems as you mentioned, most of them have more than twofold differences in the weight of genes in the LM22 naive and memory cell gene set, so we think that they still efficient to discriminate between the two types of B cells. Further, a number of genes are listed as naive or memory B-cell-specific biomarkers in the single-cell biomarker database (PanglaoDB, https://www.panglaodb.se/index.html) (Naive: ZNF286A, MEP1A, FCER2, UGT1A8, BEND5, BCL7A, P2RY14; Memory: TNFRSF13B, CD27, CD86, IL7, AIM2) , and most of the biomarker genes fall in the 50th to 100th rank, showing that this method is capable of accurately reflecting biomarker genes in specific immune cell type. In addition, we also used the same method to find specific genes representing memory and naive B cells in the MSigDB C7:Immunology signature gene set (GSE13411), with even fewer genes intersecting with those from single-cell biomarker database, and thus finally decided not to adopt it for analysis.

2. Figure 7: The moderate versus severe B cell comparison here is interesting and informative. However, as the authors correctly point out, the results indicate that severe patients likely have increased plasma cells or other antibody-secreting cells relative to mild/moderate patients. These cells, as well as DN2 B cells, are not naïve. Therefore, this would suggest that naïve B cells are not involved in the pathology of severe COVID-19, but that potentially antibody-secreting cells are instead. This is at odds with the conclusions of the manuscript, including the title. Unless the authors have some alternative explanation as to why these results do not implicate antibody-secreting cells rather than naïve B cells in severe COVID-19, the conclusions of the manuscript need to be changed to better match these findings.

Response:Thank you for your approval. With regard to the association of antibody-secreting cells with naive cells, we did not adequately describe the context in the current manuscript and have therefore corrected it as shown below:

“As possessing both naïve B-cell and antibody-secreting cell properties, these cells are similar to the DN2 B-cells found in systemic lupus erythematosus proposed by Tipton et al. The researchers found that they are derived from naïve B cells and appear to be precursors of plasma cells [1]. The term double negative (DN) comes from the absence of immunoglobulin D and memory B-cell marker CD27 [2]. DN B cells are further classified into DN1(CXCR5+) and DN2(CXCR5-) by whether CXCR5 is expressed or not [3]. An increase in IRF4 and a decrease in IRF8 are thought to be associated with the promote naïve B cell differentiation into DN2 [4, 5]. The process of induced DN2 B-cell differentiation is accompanied by a decrease in TRAF5 and CD21 and an increase in CD11c [6]. These DN2 B cells are usually found outsides of B-cell follicles rather than germinal centers and may therefore be genetically phenotypically different from GC-derived memory B cells.”

Regarding the difference between antibody-secreting cells (ASC) and naive cells, it is true that it is difficult to connect the two in previous definitions. However, recent studies investigating DN2 B cells that induce Systemic Lupus Erythematosus (SLE) indicate that these B cells are directly stimulated by follicular helper T cells and differentiate into plasma-like cells that secrete large amounts of antibodies while retaining properties of naïve cells to some extent [6]. These implications led us to try to assess whether the infiltrating B-cells in patients with severe COVID-19 have a tendency to be DN2 B-cells.

We understand that there is insufficient evidence to directly define these ASC-like naïve cells as DN2 B cells by inference for the further conclusion in this article, because there is no credible gene set or experiment to define these naïve B cells as DN2 B cells so far. Therefore, we decided to keep the initial “naive B-cells” in the title, which showed a similar tendency in both bulk RNA-seq and single cell sequencing results and included the postulated characteristics of DN2 B cell in the single cell sequencing analysis and discussion as a clue for further research.

Minor concerns:

1. The end of the introduction mentions ‘anti-secretory cells’, is this supposed to be antibody-secreting cells (as in the abstract)?

Response:

Thank you for correcting this typo, which we have corrected as follows:

“Further analysis of the defined B-cell population using single-cell RNA sequencing databases showed that the B-cells from COVID-19 patients not only tended to be naïve B-cells, but also tended to be antibody-secreting cells in patients with severe disease, and the proportion of B-cell infiltration seemed to correlate with the severity of the disease.”

2. Figure 5A: This panel might be easier to interpret if these results were separated into 3 plots (COVID-19, IPF, and common)

Response:

Thank you for your suggestion, we have recreated the following figure:

3. Figure 6: The Liao and Morse datasets both have ~40% B cells in control samples, but the Chua dataset has < 1% B cells in control samples. Why do these controls have such a low frequency of B cells? Are they appropriate controls?

Response:

With regard to the differences in B-cell infiltration in healthy donors, we suggest that this may be due to sampling methods. In the Liao and Morse datasets, samples were collected from patients' bronchoalveolar lavage fluid, whereas in Chua dataset, most samples were collected from respiratory mucosa by swab only. Therefore, in the Chua dataset, samples from healthy donors may have significantly different B-cell population due to lower mucosal secretions. We understand that this is an experimental limitation and that extreme differences in quantity may also cause bias in the results. However, this database is an important source of evidence for the extent of disease and immune cell infiltration in patients, and we felt it was necessary to perform consistent analyses across all databases, and therefore used these small numbers of B cells as controls.

4. Figure 7B: What is the source for these DN2 B cell gene markers? Are these differences significant?

Response:

Thank you for the consideration, these markers are based on the gene markers used in several widely cited papers discussing DN2 B-cells, so we trust that these markers are approved. The relevant genes and literature have been added to the context as follows:

“As possessing both naïve B-cell and antibody-secreting cell properties, these cells are similar to the DN2 (Double negative) B-cells found in systemic lupus erythematosus proposed by Tipton et al. The researchers found that they are derived from naïve B cells and appear to be precursors of plasma cells [1]. The term double negative (DN) comes from the absence of immunoglobulin D and memory B-cell marker CD27 [2]. DN B cells are further classified into DN1(CXCR5+) and DN2(CXCR5-) by whether CXCR5 is expressed or not [3]. An increase in IRF4 and a decrease in IRF8 are thought to be associated with the promote naïve B cell differentiation into DN2 [4, 5]. The process of induced DN2 B-cell differentiation is accompanied by a decrease in TRAF5 and CD21 and an increase in CD11c [6]. These DN2 B cells are usually found outsides of B-cell follicles rather than germinal centers and may therefore be genetically phenotypically different from GC-derived B cells.”

The relevant statistical difference calculations have been supplemented as shown in the figure below:

1. Tipton, C.M., et al., Diversity, cellular origin and autoreactivity of antibody-secreting cell population expansions in acute systemic lupus erythematosus. Nat Immunol, 2015. 16(7): p. 755-65.

2. Wei, C., et al., A new population of cells lacking expression of CD27 represents a notable component of the B cell memory compartment in systemic lupus erythematosus. J Immunol, 2007. 178(10): p. 6624-33.

3. Ehrhardt, G.R., et al., Discriminating gene expression profiles of memory B cell subpopulations. J Exp Med, 2008. 205(8): p. 1807-17.

4. Nutt, S.L., et al., The generation of antibody-secreting plasma cells. Nat Rev Immunol, 2015. 15(3): p. 160-71.

5. Xu, H., et al., Regulation of bifurcating B cell trajectories by mutual antagonism between transcription factors IRF4 and IRF8. Nat Immunol, 2015. 16(12): p. 1274-81.

6. Jenks, S.A., et al., Distinct Effector B Cells Induced by Unregulated Toll-like Receptor 7 Contribute to Pathogenic Responses in Systemic Lupus Erythematosus. Immunity, 2018. 49(4): p. 725-739 e6.

Attachment

Submitted filename: Rebuttal letter 2nd.docx

Decision Letter 2

Mrinmoy Sanyal

12 Nov 2020

In silico immune infiltration profiling combined with functional enrichment analysis reveals a potential role for naive B cells as a trigger for severe immune responses in the lungs of COVID-19 patients

PONE-D-20-21528R2

Dear Dr. Chiu,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Mrinmoy Sanyal, PhD

Academic Editor

PLOS ONE

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: (No Response)

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Acceptance letter

Mrinmoy Sanyal

17 Nov 2020

PONE-D-20-21528R2

In silico immune infiltration profiling combined with functional enrichment analysis reveals a potential role for naïve B cells as a trigger for severe immune responses in the lungs of COVID-19 patients

Dear Dr. Chiu:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Mrinmoy Sanyal

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Data

    (PDF)

    Attachment

    Submitted filename: Rebuttal letter v1.docx

    Attachment

    Submitted filename: Rebuttal letter 2nd.docx

    Data Availability Statement

    All gene expression profiling files are available from the NCBI GEO database or website of the original articles published by Chua et al. (GEO accession numbers: GSE53845, GSE128033, GSE124685, GSE145926, and GSE150316.).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES