Hematopoietic stem cell transplantation (HCT) is a potentially life-saving therapy but can lead to lung injury due to chemoradiation toxicity, infection, and immune dysregulation
We previously showed that bronchoalveolar lavage (BAL) transcriptomes representing pulmonary inflammation and cellular injury can phenotype post-HCT lung injury and predict mortality. To test whether peripheral blood might be a suitable surrogate for BAL, we compared 210 paired BAL and blood transcriptomes obtained from 166 pediatric patients with HCT at 27 hospitals. BAL and blood RNA abundance showed minimal correlation at the level of individual genes, gene set enrichment scores, imputed cell fractions, and T and B cell receptor clonotypes. Instead, we identified significant site-specific transcriptional programs. In BAL, pathways related to immunity, hypoxia, and epithelial mesenchymal transition were tightly coexpressed and linked to mortality. In contrast, in blood, expression of endothelial injury, DNA repair, and cellular metabolism pathways was associated with mortality. Integration of paired BAL and blood transcriptomes dichotomized patients into 2 groups with significantly different rates of hypoxia and clinical outcomes within 1 week of BAL. These findings reveal a compartmentalized injury response, where BAL and blood transcriptomes provide distinct but complementary insights into local and systemic mechanisms of post-HCT lung injury.
Keywords: Immunology, Inflammation, Pulmonology
Keywords: Bone marrow transplantation, Stem cell transplantation, Transcriptomics
In stem cell transplant patients, paired lung and peripheral blood transcriptomes show little correlation, instead revealing a compartmentalized response with distinct yet complementary gene expression signatures of post-transplant lung injury.
Introduction
Hematopoietic stem cell transplantation (HCT) combines high-dose chemotherapy and/or radiation with i.v. infusion of hematopoietic progenitor cells in order to eradicate and replace malignant or dysfunctional cellular lineages (1). While the number of HCT procedures has increased dramatically and outcomes have improved over time, safety remains a concern. Specifically, acute or chronic lung injury can develop in 20%–40% of patients due to chemotherapy toxicity, infections, and/or immune dysregulation (2, 3). Post-HCT lung injury can lead to impaired quality of life and premature death (4, 5). Treatments largely include supportive care, antiinfectives, and attempts at immune modulation where possible but are limited by a need for deeper understanding of the lung microenvironment after HCT (6).
Therefore, the Pediatric Transplant and Cell Therapy Consortium (PTCTC) undertook a broad cross-sectional study of patients undergoing bronchoalveolar lavage (BAL) as diagnostic evaluation for post-HCT lung injury. We measured pulmonary microbiomes and paired human lung gene expression in BAL samples and identified 4 lung injury subtypes with varying degrees of dysbiosis, infection, inflammation, and cellular injury (7). This approach challenged previous clinical diagnoses such as idiopathic pneumonia syndrome (IPS) by increasing detection of pathogens and clarifying dominant biologic processes in the lung. Importantly, clinical outcomes varied significantly across the 4 subtypes, providing a biology-guided framework for risk stratification in this population. For example, patients with depleted pulmonary microbiomes, upregulated T cell signaling, diminished alveolar macrophage activity, and signs of epithelial mesenchymal transition (EMT) had 3- to 4-fold higher in-hospital mortality than their counterparts.
It remains crucial to determine whether these processes are compartmentalized to the lung or are indicative of systemic disease, as this could affect the development of both precision medicine diagnostics and treatments. From a diagnostic standpoint, BAL is a medically invasive procedure that is limited to those with acceptably low illness severity and is not well suited for serial sampling to monitor disease evolution or treatment-response (8, 9). Thus, surrogates of lung biology obtained through noninvasive testing such as blood sampling could enable more nimble diagnosis and disease tracking. From a therapeutic standpoint, understanding whether and how these lung processes are reflected in or regulated by the blood compartment could inform whether eventual treatments ought to be delivered directly to the lung, through the blood, or both.
Thus, we undertook a systems biology approach to compare paired BAL and peripheral blood samples from pediatric patients with post-HCT lung injury. We questioned whether peripheral blood transcriptomes would correlate with key aspects of BAL transcriptomes that we had previously linked to disease evolution and clinical outcomes. Here we present an integrated approach to understanding BAL, blood, and combined signatures of post-HCT lung injury that advance our understanding of local and systemic disease processes after pediatric HCT.
Results
Patient characteristics
As previously described, the PTCTC SUP1601 study enrolled 229 pediatric patients with HCT who developed post-HCT lung injury and underwent 278 clinically indicated BAL procedures (7). Patients were enrolled across 32 children’s hospitals in the United States, Canada, and Australia between 2014 and 2022. For this study, paired peripheral blood was collected during BALs and 210 BAL-blood pairs in 166 patients from 27 hospitals were included for analysis (Figure 1). Patients varied broadly in age, sex, race, ethnicity and geography; transplant indication was most commonly hematologic malignancy followed by receipt of mostly bone marrow or peripheral blood allografts from a variety of donor types (Table 1). Post-HCT lung injury signs and symptoms included hypoxia, dyspnea, declining pulmonary function testing, and/or abnormal chest imaging and often developed in conjunction with other complications such as graft versus host disease (GVHD; Table 2). BAL was performed a median 131 days after HCT (IQR, 37–376), at which point the cohort displayed a median absolute neutrophil count (ANC) of 3,042 cells/μL (IQR, 1,620–5,511) and a median absolute lymphocyte count (ALC) of 482 cells/μL (IQR, 184–1,159); approximately half of the cohort required supplemental oxygen at the time of sample collection. Based on BAL clinical microbiology results, lung injury was classified as lower respiratory tract infection (95 of 210), nonpulmonary sepsis (4 of 210), or IPS (111 of 210). After each patient’s most recent BAL, 85 of 166 patients required intensive care (51%), 48 required ≥ 7 days of mechanical ventilation (29%), and 32 patients died prior to hospital discharge (19%).
Figure 1. Study design.
(A) Geographic location of participating children’s hospitals. (B) Patients were followed from the time of conditioning chemotherapy for the development of pulmonary symptoms. If bronchoscopy with bronchoalveolar lavage was planned for clinical reasons, patients were enrolled and BAL with a paired blood sample was collected. Patients were then followed clinically through hospital discharge.
Table 1. Patient characteristics.
Table 2. Event characteristics.
Contrasting transcriptomes in paired BAL and peripheral blood
We hypothesized that both BAL and peripheral blood could yield complementary insight into pulmonary disease states in this cohort. To contrast the information captured in each sample type, we first compared BAL and blood sample pairs. We observed tissue-specific differential gene expression, with 6,156 genes showing greater expression in BAL and 9,368 genes showing greater expression in blood (Figure 2A and Supplemental Data File 1; supplemental material available online with this article; https://doi.org/10.1172/jci.insight.194440DS1). For example, as expected, Surfactant Protein C (SFTPC) was expressed to a greater level in BAL, whereas Hemoglobin subunit Alpha 2 (HBA2) was expressed to a greater level in blood (Figure 2B). Many canonical lung-specific genes were detected in the blood of many patients (e.g., SFTPC 53/210, MUC5AC 92/210, LAMP3 180/210), albeit rarely above 1 read per million per sample.
Figure 2. Differential gene expression and coexpression in BAL and paired blood samples.
(A) Genes differentially expressed in BAL versus peripheral blood are shown. (B) Expression levels of select genes specific to lung (SFTPC, MUC5AC) and blood (HBA2, HBB). (C) Gene set enrichment scores to the 50 MSigDB Hallmark Pathways were calculated, and correlation between expression of each gene set was calculated within each body site and then contrasted to identify site-specific coregulation. Here we show unique coexpression of Hallmark Hypoxia and IFN-γ gene sets in BAL but not blood as well as unique coexpression of Hallmark DNA Repair and E2F targets in blood but not BAL. (D) Correlation of MSigDB Hallmark pathways within blood (top right triangle) and within BAL (bottom left triangle) are shown. See Supplemental Figure 1 for detailed labels. (E) BAL-blood correlation was calculated for expression levels of n = 7,169 protein-coding genes, and the distribution of correlation coefficients is plotted. Expression levels of CXCL8 in BAL and blood are shown as an example of minimal correlation. Gene set enrichment scores for the MSigDB Hallmark Inflammatory Response gene set measured in BAL and blood are also shown as an example of minimal BAL-blood correlation. BAL and peripheral blood cell type fractions were imputed using CIBERSORTx and reference atlases, and cell fractions across body sites were correlated using Spearman correlation coefficients, with neutrophils and CD8+ T cells shown as examples. T and B cell receptor clonotypes were measured using ImReP, and the number of unique clonotypes across body sites were correlated using Spearman correlation coefficients, with TRA shown as an example.
To understand the different regulatory networks unique to each body site, we next tested for tissue-specific differential gene coexpression. When assessing enrichment scores to the 50 MSigDB Hallmark gene sets (10), differential coexpression networks revealed highly site-specific interactions (Supplemental Data File 2). Of note, 39 gene sets were coexpressed in BAL but not blood; for example, Hallmark Hypoxia expression scores were correlated with Hallmark IFN-α and IFN-γ signaling, DNA repair signaling, and oxidative phosphorylation signaling in the lung (correlation ≥ 0.5) but not in the blood (correlation < 0.1, FDR-adjusted P < 0.05). In contrast, 9 gene sets were coexpressed in blood but not BAL; for example, Hallmark DNA Repair expression scores were correlated with early region 2 binding factor (E2F) signaling in the blood (correlation ≥ 0.5) but not in the lung (correlation < 0.1, FDR-adjusted P < 0.05; Figure 2C). Thus, overall, BAL gene networks showed much stronger coexpression than blood gene networks (Figure 2D and Supplemental Figure 1), suggesting synchronization of biological pathways in the lung.
Given the site-specific differences in expression levels and coexpression networks, we next sought to determine the degree of cross–body site transcriptome correlation. For any given gene, there was minimal correlation between BAL and blood expression levels (median Spearman rho = –0.005; IQR, –0.056 to –0.002; Supplemental Data File 3 and Figure 2E). As an example, the lack of BAL-blood correlation for CXCL8 gene expression is depicted in Figure 2E. Similar results were found when analyzing enrichment scores to broader gene networks in the MSigDB collection (Supplemental Data File 4 and Figure 2E) and when analyzing imputed cell fractions (Supplemental Data File 5 and Figure 2E). For example, the imputed fraction of neutrophils in BAL and blood sample pairs showed no correlation (Spearman correlation = –0.099, FDR-adjusted P = 0.520). Patient-level cell fractions in BAL and blood are depicted in Supplemental Figure 2 to demonstrate the lack of cross-site correlation. Of note, the blood ANC correlated with imputed blood neutrophil fraction (Spearman rho = 0.462, P < 0.001) but not BAL neutrophil fraction (Spearman rho=0.043, P = 0.549). We then calculated T and B cell receptor clonotypes using ImReP; we detected a median 16 TRA clonotypes in BAL (IQR, 8–31; range, 0–271) and 25 TRA clonotypes in blood (IQR, 13–42; range, 0–148) but no T-cell receptors alpha (TRA) clonotypes were shared across BAL-blood pairs (except 1 clonotype in 1 of 210 sample pairs). Similar results were found for T-cell receptors beta, gamma, and delta; immunoglobin heavy chain (IGH); kappa light chain (IGK); and lambda light chain (Supplemental Data File 6). Together, these data highlight the uniqueness of the biological information contained in both BAL and blood.
BAL and peripheral blood transcriptomes differ by survival outcome
To better understand site-specific and cross-site contributors to survival, we next contrasted transcriptomes in survivors and nonsurvivors. We again observed tissue-specific differential gene expression in nonsurvivors. In BAL, expression levels of 350 genes were associated with death; overall, nonsurvivors had increased expression of genes related to epithelial injury and hypoxia (e.g., SPP1, SFTPB, CEACAM6) and lower expression of genes related to innate immune signaling (e.g., CSF3R, CXCR1, IL1B, CXCL8; Supplemental Data File 7 and Figure 3A). In blood, expression levels of 656 genes were associated with death; nonsurvivors had increased expression of genes related to endothelial cell junction and smooth muscle activity (e.g., JCAD, PTGFR) and lower expression of genes related to lymphocyte signaling (e.g., JCHAIN, CCR7; Supplemental Data File 8 and Figure 3B). Genes associated with death in BAL and in blood showed minimal overlap (Figure 3C), again indicating different site-specific contributors to outcome. Notably, CEACAM6, LGMN, and BCAT1 showed increased expression in both BAL and blood of nonsurvivors, while CD74, MUC5B, HLA.DRB1 and HLA.DPB1, and CFD showed lower expression in both BAL and blood of nonsurvivors.
Figure 3. Differential gene expression by survival status.
(A) BAL gene expression differences in nonsurvivors. (B) Peripheral blood gene expression differences in nonsurvivors. (C) Overlap between BAL and blood genes associated with mortality.
In many diseases, genes maintain stable expression levels but exhibit altered coexpression patterns with other genes, suggesting that they influence pathobiology through network reorganization rather than heightened expression (11, 12). Therefore, we next tested for tissue-specific differential gene coexpression unique to nonsurvivors. In BAL, a network of 10,038 genes was coexpressed in nonsurvivors, but these genes were not coexpressed in survivors (Spearman rho ≥ 0.5 in nonsurvivors versus rho < 0.1 in survivors, FDR < 0.05; Supplemental Data File 9). Top hub genes included CEACAM6, which was coexpressed with 563 other genes related to platelet signaling and endoplasmic reticulum stress; CXCL17, which was coexpressed with 387 other genes related to immune signaling and hypoxia; and NFAM1, which was coexpressed with 1,441 genes related to EMT and lung morphogenesis (Figure 4A). In contrast, in blood, a network of 6,929 genes was correlated in blood of nonsurvivors but not survivors (Supplemental Data File 10). Top hub genes included ZNF707, which was coexpressed with 203 other genes representing cell cycle transitions and DNA repair, and GARRE1, which was coexpressed with 251 other genes representing heme metabolism (Figure 4B). These gene coregulatory networks complement the gene expression pathways noted above and accentuate the differences in biologic activity identified in BAL versus peripheral blood specimens.
Figure 4. Differential gene coexpression by survival status.
(A) Network of genes coexpressed in BAL of nonsurvivors (right) but not coexpressed in survivors (left). Examples of hubs genes (CEACAM6, CXCL17, NFAM1) are shown. Examples of differentially coexpressed genes linked to each hub gene are shown (e.g., CEACAM6-FN1 coexpression) to illustrate differential gene-expression. To the right, pathway enrichment for hub genes are shown. (B) Network of genes coexpressed in peripheral blood of nonsurvivors (right) but not coexpressed in survivors (left). Examples of hub genes (ZNF707, GARRE1, TMEM86B) are shown. Examples of differentially coexpressed genes linked to each hub gene are shown (e.g., ZNF707-BUB1B coexpression) to illustrate differential gene-expression. To the right, pathway enrichment for hub genes are shown.
Although we observed minimal BAL-blood transcriptome correlation in the overall cohort, we hypothesized that nonsurvivors might show greater cross-site correlation due to systemic illness and increased signaling across the alveolar-capillary membrane. However, we did not identify any gene sets that were differentially correlated between BAL and blood according to survivor status (all FDR-adjusted P > 0.1; Supplemental Data File 11). For example, Hallmark Hypoxia gene expression signaling in BAL and blood were minimally correlated in both survivors and nonsurvivors (P > 0.05), and these correlations were not significantly different (FDR-adjusted P = 0.994). This indicates that even the sickest patients do not show significant cross-site transcriptome correlation.
BAL and peripheral blood transcriptomes differ by lung injury subtype
We previously identified 4 subtypes of HCT-related lung injury based on BAL transcriptome-microbiome signatures (7, 13, 14). These subtypes varied not only in clinical outcomes but also in burden of infection, microbiome dysbiosis, inflammation, and cell injury (Figure 5A). Of note, lung injury Subtype 1 was defined by alveolar macrophage (AM) predominance, a replete and balanced microbiome, minimal signs of inflammation, and superior clinical outcomes. To better understand the biology of these lung injury subtypes, we next contrasted BAL and blood transcriptome pairs in reference to the best-performing Subtype 1, with results summarized in Figure 4 and Table 3 and described in detail below.
Figure 5. BAL and peripheral blood transcriptome correlates of post-HCT lung injury subtypes.
(A) Concept diagram for 4 post-HCT lung injury subtypes derived in the PTCTC SUP1601 cohort and validated in the University of Utrecht, Netherlands, cohort (58). (B) Example BAL and blood genes differentially expressed in lung injury subtypes 2, 3, and 4 relative to subtype 1. (C) Differentially expressed BAL and blood genes underwent pathway analysis, and pathways identified in BAL and blood gene are quantified to show greater overall differences detected in BAL as opposed to paired blood.
Table 3. Overview of HCT lung injury subtypes.
Subtype 2.
We previously showed that patients with Subtype 2 had higher rate of pulmonary bacterial infections and low in-hospital mortality. In BAL, Subtype 2 patients differentially expressed 2,027 genes (|LFC| ≥ 1, FDR-adjusted P < 0.05), including higher expression of genes related to granulocyte and inflammasome pathways (e.g., CXCR1, IL1R2, S100A8) and lower expression of genes related to macrophage and lymphocyte signaling (e.g., CCL18, APOE, HLA.DRA; Figure 5B). Supporting this, cell deconvolution showed an increase in BAL neutrophil fraction and a decrease in the AM fraction relative to Subtype 1 (Supplemental Data File 12). Patients with Subtype 2 had unique BAL coexpression of a network of 9,181 genes including top hubs NLRP3, CXCR1, and ALPL, which together showed enrichment for immune activation, cell-cell adhesion, platelet activity, and extracellular matrix interactions (Supplemental Data File 13). In peripheral blood, Subtype 2 patients differentially expressed 1,061 genes, including higher expression of genes related to thyroid activity (e.g., CALCA, TPO, DIO2) and lower expression of genes related to heme metabolism (e.g., HBD, HBG2) with no differences in blood cell fractions noted (Supplemental Data File 14). These patients also had greater blood coexpression of a network of 8,347 genes, including top hubs NUP210, CDH13, and COL6A6, which together represent DNA replication, oxidative phosphorylation, and cellular metabolism (Supplemental Data File 15). Comparison of enriched pathways in BAL and blood showed only 1 overlapping gene set: Hallmark Heme Signaling was lower in both BAL and blood of patients with Subtype 2. These data illustrate that, in patients with Subtype 2, pulmonary bacterial infections and a pulmonary neutrophil response are associated with systemic signs of increased blood cell metabolic activity without clear shifting of the fractions of blood cell types.
Subtype 3.
In contrast, patients with Subtype 3 showed a high rate of pulmonary microbiome depletion as well as high mortality. In BAL, Subtype 3 patients differentially expressed 8,166 genes, including higher expression of genes related to fibroblast activation, myogenesis, and nitric oxide signaling (e.g., FGF3, BMP1, NOS) and lower expression of genes related to macrophage signaling (e.g., CD74, FTL, HLA.DRA) and response to hypoxia (e.g., HIF1A, HMOX1; Figure 5C). In conjunction, cell deconvolution showed an increase in BAL CD4+ and CD8+ lymphocyte fractions and a decrease in the AM fraction relative to Subtype 1 (Supplemental Data File 16). Patients with Subtype 3 had unique BAL coexpression of a network of 6,724 genes, including top hubs SIGLEC1, PTPRC, and HADHB, which together showed enrichment for collagen deposition, integrin interactions, and EMT (Supplemental Data File 17). In peripheral blood, Subtype 3 patients differentially expressed 574 genes, including higher expression of genes related to collagen deposition and EMT (e.g. COL4A1, LAMA5, TIMP3), with no differences in blood cell fractions noted (Supplemental Data File 18). These patients also had greater blood coexpression of 9,269 genes, including top hubs ANO6, GPD2, and TLR8, which together represented DNA repair, T cell signaling, and heme metabolism (Supplemental Data File 19). Unique to Subtype 3, comparison of enriched pathways in BAL and blood showed substantial overlap for genes related to EMT and endothelial activation. These data illustrate that, in patients with Subtype 3, pulmonary microbiome depletion, T cell activation, and profibrotic signaling coocur with systemic signs of EMT.
Subtype 4.
Finally, patients with Subtype 4 showed both commensal microbiome depletion and viral infection, again with high mortality rates. In BAL, patients with Subtype 4 differentially expressed 6,252 genes, including higher expression of genes related to NK/T cell activity (e.g. IL2, KLRF1, IFNG, IFNA6; Figure 5B), β-defensins (DEFB114, DEFB110), and EMT (COL11A1, MMP27), and lower expression of genes related to AM signaling (e.g., MARCO, FTH1, HLA.C), neutrophil signaling (e.g., MYD88, TREM2), and airway and alveolar epithelial function (e.g., SPRR3, MUC5B, SFTPB). In conjunction, cell deconvolution showed an increase in BAL CD4+ and CD8+ lymphocyte fractions and a decrease in the AM fraction relative to Subtype 1 (Supplemental Data File 20). Patients with Subtype 4 had unique BAL coexpression of a network of 10,223 genes including top hubs SLC38A6, ETFDH, and TAF2, which together showed enrichment for collagen deposition, ankyrin interactions, and EMT (Supplemental Data File 21). In peripheral blood, patients with Subtype 4 differentially expressed only 156 genes, including weak overlap with IFN-α/β pathways (e.g., OAS3, IFIT1) and modest increase in blood CD4+ lymphocyte fraction (Supplemental Data File 22). These patients also had greater blood coexpression of 6,908 genes including top hubs PRR16, SCML2, and TRPC6, which together represented DNA repair, oxidative phosphorylation, and aerobic respiration (Supplemental Data File 23). Comparison of enriched pathways in BAL and blood showed no overlapping gene sets. These data illustrate that, in patients with Subtype 4, pulmonary microbiome depletion, T cell activation, and epithelial injury response coocur with nonspecific systemic signs of increased metabolic activity, but BAL and blood transcriptomes overall showed minimal overlap.
Integrated BAL and blood transcriptome signatures
Given that paired BAL and blood transcriptomes were not correlated, we next questioned whether they would complement each other in understanding patient illness and clinical outcomes. By inputting normalized BAL and blood transcriptomes into multi-omics factor analysis (MOFA) followed by dimensionality reduction (Uniform Manifold Approximation and Projection [UMAP]), we detected 2 clusters of patients (Figure 6A). Cluster A consisted largely of patients from the lowest-risk lung injury subtype 1, whereas Cluster B consisted largely of patients from the sicker lung injury Subtypes 2, 3, and 4 (Figure 6B). Patients in Cluster B were twice as likely to have required oxygen immediately prior to BAL (50.3% versus 29.1%; 66 of 131 versus 23 of 79; P = 0.003; Figure 6C) and, when analyzing the most recent encounter for each patient, were twice as likely to be dead or require ongoing mechanical ventilation 7 days after BAL (37.0% versus 17.2%; 40 of 108 versus 10 of 58, P = 0.008; Figure 6D). Consistent with the biologic themes identified by other methods above, Cluster B differentially expressed 9,784 genes in BAL (|LFC| ≥ 1, FDR-adjusted P < 0.05) representing AM depletion, lymphocyte activation, and epithelial injury, as well as 696 genes in blood, representing EMT and endothelial activation (Supplemental Data Files 24 and 25). Thus, this unified multi-omics approach demonstrates that post-HCT lung injury can be conceptualized as 2 large groups, with group B harboring mostly lung injury Subtypes 2, 3, and 4 and showing greater illness severity and worse clinical outcomes.
Figure 6. Integrated BAL and blood transcriptomic signatures reveal 2 large patient groups.
(A) Paired BAL and blood transcriptomes underwent multi-omics factor analysis (MOFA) followed by dimensionality reduction (UMAP) and k-means clustering to show 2 groups of patients. (B) Post-HCT lung injury subtype was mapped onto the 2 integrated transcriptome clusters, showing that most patients from subtypes 2, 3, and 4 mapped to Cluster B. (C and D) Approximately twice as many patients in Cluster B required oxygen prior to BAL sampling, and twice as many patients died or required ongoing mechanical ventilation within 7 days of BAL sampling.
Discussion
In this study, we compared 210 BAL and blood transcriptome pairs from pediatric patients with HCT with lung injury. We identified minimal cross-site transcriptome correlation at the level of individual gene expression, gene set enrichment scores, cell type fractions, and T and B cell receptor clonotypes. Instead, we uncovered unique site-specific transcriptome networks, suggesting body site compartmentalization of injury response. BAL signatures of mortality reflected immune perturbations, epithelial injury, and hypoxia signaling. Instead of mirroring the pulmonary compartment, peripheral blood transcriptomes showed broad signatures of endothelial injury, DNA repair, neurohormonal activation, and altered cellular metabolism, thus expanding our understanding of the systemic processes involved in post-HCT lung injury.
Other investigators have identified a lack of BAL-blood correlation for cell fractions and cytokines in patients with COVID-19, asthma, and HIV-related lung disease, although transcriptomic comparisons are sparse (15–21). Data in patients with HCT are rare, although Omdahl et al. (22) showed in a nonhuman primate model of GVHD that peripheral and lung-specific T cells have notably different TCR repertoires and even different site-specific T cell transcriptomes when TCRs are shared. While the lung is well known to have a robust local immune repertoire, during lung injury, there is degradation of the alveolar-capillary membrane with transmigration of bone marrow–derived immune cells, including neutrophils and monocytes, into the alveolar space (23, 24); thus, we were surprised to see the starkly absent cross-site correlation. Since the peripheral blood represents a composite of circulating blood cells as well as signals from all tissues in the body, it is possible that the effect of lung disease on the blood signal was diluted by extrapulmonary processes. It is also possible that poor post-HCT systemic immune function and/or exogeneous immunosuppression limited systemic immune responses to pulmonary processes. Overall, this strongly suggests that transcriptome measurements obtained from peripheral blood cannot be used as a proxy for analogous pulmonary processes, and it challenges the current clinical paradigm of assessing pulmonary inflammation using blood-based biomarkers. Whether more durable markers of pulmonary processes, such as protein biomarkers, can be detected in blood to guide pulmonary diagnosis and treatment remains an ongoing question in the field (25–27). However, the prognostic and predictive utility of such approaches in post-HCT GVHD lend optimism to this search (28–30).
While an overlap between BAL and blood pathways of disease was minimal, we did note a few exceptions. A top candidate gene related to post-HCT lung injury in our data was CEACAM6, which was upregulated in both BAL and blood of nonsurvivors and also tightly coregulated with BAL expression of numerous genes related to platelet signaling and endoplasmic reticulum stress. CEACAM6 is expressed on the cell surface of respiratory epithelium, including airway secretory cells and alveolar epithelial cells (31); is upregulated after numerous forms of lung injury; and serves as an intercellular adhesion, antiapoptotic, and surfactant protective molecule (32–36). Future studies addressing the biological role and biomarker utility of CEACAM6 are warranted.
In addition, we identified a shared BAL-blood transcriptomic signature in lung injury subtype 3, where both the BAL and blood showed upregulation of pathways related to EMT and collagen biosynthesis. Post-HCT fibrosis is a feature of sclerotic GVHD as well as post-HCT bronchiolitis obliterans syndrome and is notoriously difficult to diagnose in the lungs due to the insidious onset and late imaging findings (37–39). Thus our findings could suggest a pathway where detection in peripheral blood might provide insight into pulmonary processes, which could be leveraged for improved diagnostics. Given the lack of tissue for histologic examination, it remains unclear if these patients truly had pulmonary fibrosis, but this can be explored in the future. In addition, it remains unclear whether the detection of this signal in blood represents a synergized systemic response or is simply “leakage” of the pulmonary signal into the blood compartment. Future work is needed to define and characterize the possibility of pulmonary and extrapulmonary fibrosis in this population.
Given these findings, it is likely that the field of post-HCT lung injury will need not only pulmonary-targeted diagnostics, but pulmonary-targeted therapeutics as well. Especially given the serious off-target effects of many immunomodulatory medicines, inhaled therapies possibly coupled with nanoparticles for durable delivery hold high promise for local effect with the least off-target toxicity (40–42). However, it remains unclear whether targeting the identified blood-specific pathways associated with disease could actually improve pulmonary outcomes. In support of this possibility, it has recently been shown that targeting intestinal health can improve respiratory health, perhaps through the gut-lung axis (43, 44).
In summary, by comparing 210 paired BAL-blood transcriptomes obtained after pediatric HCT, we identified surprisingly little cross-site correlation in gene expression. Instead, we identified unique site-specific signatures of disease, suggesting compartmentalization of injury-response. These findings strongly support the need for pulmonary-based diagnostics and therapeutics and also question the exclusive use of peripheral blood testing to guide clinical care in patients with lung injury post-HCT.
Methods
Sex as a biological variable.
Our study included male and female human participants. Since BAL and blood samples were paired, each patient served as their own control. Similar findings are reported for both sexes.
Patients.
As previously described (7), participating pediatric centers screened all patients with HCT preparing to undergo clinically indicated bronchoscopic BAL for diagnostic assessment of pulmonary disease (NCT02926612).
Biospecimen collection.
Bronchoscopy and BAL were performed at the discretion of the treating team using local institutional protocols. All BAL samples were obtained by pediatric pulmonologists trained in fiberoptic bronchoscopy with anesthesia provided by anesthesiologists or critical care physicians. Lavage protocol was not dictated by the study but typically involved 3–6 aliquots of 10 mL sterile saline inserted into diseased areas of the lung as determined by preceding chest imaging or physical exam. Percent of lavage returned was not routinely documented, and lavage aliquots were typically pooled by the clinical team immediately after collection. After aliquoting for clinical testing, excess lavage remained unfractionated and was placed immediately on dry ice, stored at –70ºC until processing. Blood was collected during the BAL procedure, typically within 30 minutes of the lavage; 2.5mL whole blood was collected directly into a PAXgene tube, which was inverted 5–10 times and stored per the manufacturer’s instructions at –70°C until processing.
Clinical protocols and data collection.
Clinical microbiologic testing was determined by the treating team and typically included culture for bacteria, fungus, and acid-fast bacteria; multiplex PCR for respiratory viruses; galactomannan antigen; and cytology for Pneumocystis jirovecii. Additional molecular diagnostics such as PCR for atypical bacteria or fungi were used at the discretion of the site. After BAL, supportive care protocols were determined by the treating team. Patient demographics, medical history, and transplant-specific data were documented by trained study coordinators at each site. The most recent ANC and ALC measured clinically prior to BAL were documented. Results of clinical microbiologic testing on BAL were documented and not considered complete until 4 weeks after collection. Patients were followed until hospital discharge with no loss to follow-up.
RNA extraction.
BAL underwent RNA extraction as previously described (7). Briefly, 200 μL of BAL was combined with 200 μL DNA/RNA Shield (Zymo) and 0.5 mm glass bashing beads (Omni) for 5 cycles of 25 seconds bashing at 30 Hz, with 60 seconds of rest on ice between each cycle (TissueLyser II, Qiagen). Subsequently, samples were centrifuged at 16,000g for 10 minutes at 4°C, and the supernatant was used for column-based RNA extraction with DNase treatment according to the manufacturer’s recommendations (Zymo ZR-Duet DNA/RNA MicroPrep Kit). RNA extraction of PAXgene tubes was performed for this study. Briefly, 1.7 mL of peripheral blood mixed with PAXgene reagent underwent 10 minutes of centrifugation at 1,200g, was washed with PBS, recentrifuged for 10 minutes, and the pellet was combined with 100 μL DNA/RNA Shield, treated with Proteinase K, and underwent a magnetic bead–based extraction with DNase treatment according to the manufacturer’s recommendations (Zymo Quick-RNA Magbead Kit).
RNA-Seq.
BAL underwent RNA-Seq as previously described using the New England Biolabs Ultra II RNA Library Prep Kit (7). Both BAL and peripheral blood RNA underwent sequencing library preparation using miniaturized protocols adapted from the New England Biolabs Ultra II RNA Library Prep Kit (45). Reagents were dispensed using the Echo 525 (Labcyte) and underwent Ampure-XP bead cleaning on a Beckman Coulter Biomek NXP instrument. The peripheral blood libraries received treatment with rRNA and globin-depleting FastSelect reagent. Libraries underwent 19 cycles of PCR amplification and size selection to a target 300–700 nucleotides (nt), and they were pooled to facilitate approximately even depth of sequencing. Twenty-five picograms (pg) of External RNA Controls Consortium (ERCC) pooled standards were spiked in to each sample after RNA extraction and before library preparation to serve as internal positive controls (Thermo Fisher Scientific, 4456740). In addition, to identify contamination in laboratory reagents and the laboratory environment, each batch contained 2 samples of 200 μL sterile water and 6–8 samples of 200 μL HeLa cells taken from a laboratory stock and processed identically to patient samples, in order to account for laboratory- and reagent-introduced contamination. These samples were processed at the same time as the patient BAL samples in order to use the same lot of reagents and minimize batch effect on control samples. Samples were first sequenced at shallow depth on an Illumina iSeq instrument; n = 4 samples were removed due to low sequencing quality, and the remaining samples were pooled across lanes of an Illumina NovaSeq 6000 or NovaSeq X instrument and sequenced to a target depth of 40 million read-pairs with sequencing read length of 125 nt. Resultant fastq files underwent alignment to hg38 (STAR), with mitochondrial, ribosomal, and non-protein-coding transcripts excluded, leading to detection of a median 18,341 protein-coding transcripts per BAL (IQR, 17,052–18,755) and a median 14,146 transcripts per blood sample (IQR, 13,339–14,560). The dataset was then further subset for samples with > 50,000 total reads to protein-coding genes and genes present in > 25% of samples (9 blood samples were removed).
Statistics.
Genes differentially expressed by body site were identified by fitting negative binomial generalized linear models to body site with patient grouping as a random effect (edgeR) (46). Genes differentially coexpressed by body site were analyzed at the pathway level by first creating gene set enrichment scores to the MSigDB Hallmark pathways (gsva) (47), calculating gene set–gene set correlation within each site, and then contrasting the correlations across BAL and blood (DGCA) (48). Gene expression correlation across body sites was assessed by subjecting gene counts to variance stabilizing transformation (DESeq2, vst) (49) and then calculating genewise Spearman correlations using paired values. This analysis was repeated using gene set enrichment scores (gsva). We then used CIBERSORTx (50) to impute cell type fractions in BAL using the Travaglini et al. lung cell atlas (51) and in blood using the Schulte-Schrepping et al. blood cell atlas (52). We then tested for correlation between BAL and blood cell fractions using Spearman correlations. ImReP was used to identify T and B cell receptor repertoires (53). All analyses involving ≥10 comparisons were subjected to FDR adjustment to address multiple-hypothesis testing, and statistical significance was assessed at the level of FDR-adjusted P < 0.05.
Contrasts were then repeated by survival status (using each patient’s most recent encounter) and by lung injury subtype. Here, site-specific gene-to-gene correlation networks were created by identifying genes correlated in 1 body site (Spearman rho ≥ 0.5) and not correlated in the other body site (Spearman rho < 0.1) with a significant contrast of FDR-adjusted P < 0.05 (DGCA). Gene networks were plotted using Cytoscape v3.10.3 (54). Genes involved in each gene network were then ranked by Hub Score (igraph) (55) to identify most central genes, and were summarized at the pathway level using gene set enrichment (ClusterProfiler) (56). We integrated BAL and blood transcriptomes by applying variance-stabilizing transformation (vst, DESeq2) (49) to BAL and blood transcriptomes followed by MOFA (57), dimensionality reduction (UMAP), and k-means clustering (cluster). Data were visualized using volcano plots (EnhancedVolcano), box-and-whisker violin plots (ggplot), heatmaps (pheatmap), and gene set enrichment plots (ClusterProfiler).
Study approval.
Patients or their guardians were approached prospectively for consent under local IRB approval at each site (UCSF IRB nos. 14-13546 and 16-18908) in accordance with the Declaration of Helsinki, and permission was obtained to collect leftover BAL fluid as well as paired blood.
Data availability.
Raw sequencing files and instructions to request download are available under controlled access on NIH dbGaP (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001684.v3.p1). Individual-level data are available indefinitely. Code and processed anonymized individual-level data files are available on GitHub (https://github.com/zinterm/pedBMT_BALseq; commit ID 4beb6dd). Values for all data points in graphs are reported in the Supporting Data Values file.
Author contributions
Conceptualization: MSZ, CCD, EMP, and JLD. Methodology: MSZ, CCD, EMP, and JLD. Investigation: EMP, EE, MYM, GR, MRS, JB, HK, JM, PJS, CMR, JJA, PLM, CH, EMK, MQ, HAA, AKK, GDEC, JRH, JSK, KG, RH, CND, TCQ, PC, NRL, JCF, KMM, PS, TBM, BH, AAM, DBD, MPH, GAY, MAP, CCD, JLD, MSZ. Visualization: MSZ, EMP, and JLD. Funding acquisition: MSZ, JLD, MAP, and CCD. Project administration: MSZ, JLD, and CCD. Supervision: MSZ and JLD. Writing – original draft: MSZ, CCD, EMP, and JLD. Writing – review & editing: EMP, EE, MYM, GR, MRS, JB, HK, JM, PJS, CMR, JJA, PLM, CH, EMK, MQ, HAA, AKK, GDEC, JRH, JSK, KG, RH, CND, TCQ, PC, NRL, JCF, KMM, PS, TBM, BH, AAM, DBD, MPH, GAY, MAP, CCD, JLD, MSZ
Supplementary Material
Acknowledgments
This work has been supported by NIH grant K23HL146936 (MSZ), NIH grant R03HL171423 (MSZ), NIH grant K12HD000850 (MSZ), American Thoracic Society grant (MSZ), Pediatric Transplantation and Cell Therapy Foundation grant (MSZ), National Marrow Donor Program Amy Strelzer Manasevit Grant (MSZ), NIH grant F31CA271571 (MYM), Gateway Foundation grant (HAA), St. Baldrick’s Foundation Grant (HA-A), NIH grant P30CA008748 (JSK), NIH grant P30CA040214 (MAP), Chan Zuckerberg Biohub (JLD), and NIH grant UG1HL069254 (MAP). This work is the result of NIH funding, in whole or in part, and is subject to the NIH Public Access Policy. Through acceptance of this federal funding, the NIH has been given a right to make the work publicly available in PubMed Central.
Version 1. 07/22/2025
In-Press Preview
Version 2. 09/09/2025
Electronic publication
Footnotes
Conflict of interest: MSZ discloses consulting and advisory board work (Roche, DelveBio). CCD discloses consulting and advisory board work (Jazz Pharmaceuticals; Alexion Inc.). JJA discloses consulting and advisory board work (AscellaHealth, Takeda). TCQ discloses speaker bureau, consulting and advisory board work (Alexion AstraZeneca Rare Disease, Jazz Pharmaceuticals). HAA discloses research support (Adaptive). MAP discloses consulting and advisory board work (Novartis, Garuda, Autolous, Pfizer, Cargo, BlueBird Bio, Vertex) and research support (Miltenyi, Adaptive). JLD discloses salary support and research support (Chan Zuckerberg Biohub).
Copyright: © 2025, Pearce et al. This is an open access article published under the terms of the Creative Commons Attribution 4.0 International License.
Reference information: JCI Insight. 2025;10(17):e194440.https://doi.org/10.1172/jci.insight.194440.
Contributor Information
Emma M. Pearce, Email: emma.pearce@ucsf.edu.
Erica Evans, Email: erica.evans@ucsf.edu.
Madeline Y. Mayday, Email: madeline.mayday@yale.edu.
Gustavo Reyes, Email: gustavo.reyes@ucsf.edu.
Miriam R. Simon, Email: miriam.simon@ucsf.edu.
Jacob Blum, Email: Jacob.Blum@tufts.edu.
Hanna Kim, Email: hanna.kim@yale.edu.
Jessica Mu, Email: jessica.mu@ucsf.edu.
Peter J. Shaw, Email: peter.shaw@health.nsw.gov.au.
Courtney M. Rowan, Email: coujohns@iu.edu.
Jeffery J. Auletta, Email: jaulett3@nmdp.org.
Paul L. Martin, Email: paul.martin@duke.edu.
Caitlin Hurley, Email: caitlin.hurley@stjude.org.
Erin M. Kreml, Email: ekreml@phoenixchildrens.com.
Muna Qayed, Email: Muna.Qayed@choa.org.
Hisham Abdel-Azim, Email: habdelazim@chla.usc.edu.
Amy K. Keating, Email: amy_keating@dfci.harvard.edu.
Geoffrey D.E. Cuvelier, Email: gcuvelier@cancercare.mb.ca.
Janet R. Hume, Email: jrhume@umn.edu.
James S. Killinger, Email: killingj@mskcc.org.
Kamar Godder, Email: kamar.godder@fda.hhs.gov.
Rabi Hanna, Email: hannar2@ccf.org.
Christine N. Duncan, Email: christine_duncan@dfci.harvard.edu.
Troy C. Quigg, Email: troy.quigg@spectrumhealth.org.
Paul Castillo, Email: castillopa@ufl.edu.
Nahal R. Lalefar, Email: nahal.lalefar@ucsf.edu.
Julie C. Fitzgerald, Email: fitzgeraldj@email.chop.edu.
Kris M. Mahadeo, Email: kris.mahadeo@duke.edu.
Prakash Satwani, Email: ps2087@cumc.columbia.edu.
Theodore B. Moore, Email: tmoore@mednet.ucla.edu.
Benjamin Hanisch, Email: bhanisch@childrensnational.org.
Aly Abdel-Mageed, Email: aly.mageed@spectrum-health.org.
Dereck B. Davis, Email: ddavis7@umc.edu.
Michelle P. Hudspeth, Email: hudspeth@musc.edu.
Greg A. Yanik, Email: gyanik@umich.edu.
Michael A. Pulsipher, Email: michael.pulsipher@hsc.utah.edu.
Christopher C. Dvorak, Email: christopher.dvorak@ucsf.edu.
Joseph L. DeRisi, Email: joe@derisilab.ucsf.edu.
Matt S. Zinter, Email: Matt.Zinter@ucsf.edu.
References
- 1.D’Souza A, et al. Current use of and trends in hematopoietic cell transplantation in the United States. Biol Blood Marrow Transplant. 2020;26(8):e177–e182. doi: 10.1016/j.bbmt.2020.04.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Panoskaltsis-Mortari A, et al. An official American Thoracic Society research statement: noninfectious lung injury after hematopoietic stem cell transplantation: idiopathic pneumonia syndrome. Am J Respir Crit Care Med. 2011;183(9):1262–1279. doi: 10.1164/rccm.2007-413ST. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gower WA, et al. Pulmonary dysfunction in pediatric hematopoietic stem cell transplant patients: non-infectious and long-term complications. Pediatr Blood Cancer. 2007;49(3):225–233. doi: 10.1002/pbc.21060. [DOI] [PubMed] [Google Scholar]
- 4.Zinter MS, et al. Pediatric hematopoietic cell transplant patients who survive critical illness frequently have significant but recoverable decline in functional status. Biol Blood Marrow Transplant. 2018;24(2):330–336. doi: 10.1016/j.bbmt.2017.10.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zinter MS, et al. Intensive care risk and long-term outcomes in pediatric allogeneic hematopoietic cell transplant recipients. Blood Adv. 2024;8(4):1002–1017. doi: 10.1182/bloodadvances.2023011002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tamburro RF, et al. Pulmonary complications of pediatric hematopoietic cell transplantation. A National Institutes of Health workshop summary. Ann Am Thorac Soc. 2021;18(3):381–394. doi: 10.1513/AnnalsATS.202001-006OT. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zinter MS, et al. Pathobiological signatures of dysbiotic lung injury in pediatric patients undergoing stem cell transplantation. Nat Med. 2024;30(7):1982–1993. doi: 10.1038/s41591-024-02999-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Qualter E, et al. A comparison of bronchoalveolar lavage versus lung biopsy in pediatric recipients after stem cell transplantation. Biol Blood Marrow Transplant. 2014;20(8):1229–1237. doi: 10.1016/j.bbmt.2014.04.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Elbahlawan L, et al. A critical care and transplant-based approach to acute respiratory failure after hematopoietic stem cell transplantation in children. Biol Blood Marrow Transplant. 2016;22(4):617–626. doi: 10.1016/j.bbmt.2015.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Liberzon A, et al. The molecular signatures database (MSigDB) hallmark gene set collection. Cell Syst. 2015;1(6):417–425. doi: 10.1016/j.cels.2015.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Faner R, et al. Network analysis of lung transcriptomics reveals a distinct B-cell signature in emphysema. Am J Respir Crit Care Med. 2016;193(11):1242–1253. doi: 10.1164/rccm.201507-1311OC. [DOI] [PubMed] [Google Scholar]
- 12.Chu J-H, et al. Gene coexpression networks reveal novel molecular endotypes in alpha-1 antitrypsin deficiency. Thorax. 2021;76(2):134–143. doi: 10.1136/thoraxjnl-2019-214301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Zinter MS, et al. The pulmonary metatranscriptome prior to pediatric HCT identifies post-HCT lung injury. Blood. 2021;137(12):1679–1689. doi: 10.1182/blood.2020009246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zinter MS, et al. Pulmonary microbiome and gene expression signatures differentiate lung function in pediatric hematopoietic cell transplant candidates. Sci Transl Med. 2022;14(635):eabm8646. doi: 10.1126/scitranslmed.abm8646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Saris A, et al. Distinct cellular immune profiles in the airways and blood of critically ill patients with COVID-19. Thorax. 2021;76(10):1010–1019. doi: 10.1136/thoraxjnl-2020-216256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Zhu A, et al. Robust mucosal SARS-CoV-2-specific T cells effectively combat COVID-19 and establish polyfunctional resident memory in patient lungs. Nat Immunol. 2025;26(3):459–472. doi: 10.1038/s41590-024-02072-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Shanthikumar S, et al. Cross-tissue, age-specific flow cytometry reference for immune cells in airway and blood of children. J Allergy Clin Immunol. 2025;155(3):1002–1014. doi: 10.1016/j.jaci.2024.11.018. [DOI] [PubMed] [Google Scholar]
- 18.De Brabander J, et al. Persistent alveolar inflammatory response in critically ill patients with COVID-19 is associated with mortality. Thorax. 2023;78(9):912–921. doi: 10.1136/thorax-2023-219989. [DOI] [PubMed] [Google Scholar]
- 19.Dehoux MS, et al. Compartmentalized cytokine production within the human lung in unilateral pneumonia. Am J Respir Crit Care Med. 1994;150(3):710–716. doi: 10.1164/ajrccm.150.3.8087341. [DOI] [PubMed] [Google Scholar]
- 20.Adel-Patient K, et al. A comprehensive analysis of immune constituents in blood and bronchoalveolar lavage allows identification of an immune signature of severe asthma in children. Front Immunol. 2021;12:700521. doi: 10.3389/fimmu.2021.700521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Muema DM, et al. Contrasting inflammatory signatures in peripheral blood and bronchoalveolar cells reveal compartment-specific effects of HIV infection. Front Immunol. 2020;11:864. doi: 10.3389/fimmu.2020.00864. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Omdahl KI, et al. Organ-specific microenvironments drive divergent T cell evolution in acute graft-versus-host disease. Sci Transl Med. 2025;17(783):eads1298. doi: 10.1126/scitranslmed.ads1298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Matthay MA. Resolution of pulmonary edema. Thirty years of progress. Am J Respir Crit Care Med. 2014;189(11):1301–1308. doi: 10.1164/rccm.201403-0535OE. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Matthay MA, et al. The acute respiratory distress syndrome. J Clin Invest. 2012;122(8):2731–2740. doi: 10.1172/JCI60331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bhargava M, et al. Proteome profiling in lung injury after hematopoietic stem cell transplantation. Biol Blood Marrow Transplant. 2016;22(8):1383–1390. doi: 10.1016/j.bbmt.2016.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Bowler RP, et al. New strategies and challenges in lung proteomics and metabolomics. An official American Thoracic Society workshop report. Ann Am Thorac Soc. 2017;14(12):1721–1743. doi: 10.1513/AnnalsATS.201710-770WS. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Sparks R, et al. A unified metric of human immune health. Nat Med. 2024;30(9):2461–2472. doi: 10.1038/s41591-024-03092-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Levine JE, et al. A prognostic score for acute graft-versus-host disease based on biomarkers: a multicentre study. Lancet Haematol. 2015;2(1):e21–e29. doi: 10.1016/S2352-3026(14)00035-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Paczesny S. Biomarkers for posttransplantation outcomes. Blood. 2018;131(20):2193–2204. doi: 10.1182/blood-2018-02-791509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Akahoshi Y, et al. Novel MAGIC composite scores using both clinical symptoms and biomarkers best predict treatment outcomes of acute GVHD. Blood. 2024;144(9):1010–1021. doi: 10.1182/blood.2024025106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Guo M, et al. Guided construction of single cell reference for human and mouse lung. Nat Commun. 2023;14(1):4566. doi: 10.1038/s41467-023-40173-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wu C-Y, et al. CEACAM6 as a novel therapeutic target to boost HO-1-mediated antioxidant defense in COPD. Am J Respir Crit Care Med. 2023;207(12):1576–1590. doi: 10.1164/rccm.202208-1603OC. [DOI] [PubMed] [Google Scholar]
- 33.Gonzales LW, et al. Expression of carcinoembryonic cell adhesion molecule 6 and alveolar epithelial cell markers in lungs of human infants with chronic lung disease. J Histochem Cytochem. 2015;63(12):908–921. doi: 10.1369/0022155415603768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Lin S-E, et al. Expression of human carcinoembryonic antigen-related cell adhesion molecule 6 and alveolar progenitor cells in normal and injured lungs of transgenic mice. Physiol Rep. 2015;3(12):e12657. doi: 10.14814/phy2.12657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kolla V, et al. Carcinoembryonic cell adhesion molecule 6 in human lung: regulated expression of a multifunctional type II cell protein. Am J Physiol Lung Cell Mol Physiol. 2009;296(6):L1019–L1030. doi: 10.1152/ajplung.90596.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Shikotra A, et al. A CEACAM6-high airway neutrophil phenotype and CEACAM6-high epithelial cells are features of severe asthma. J Immunol. 2017;198(8):3307–3317. doi: 10.4049/jimmunol.1600606. [DOI] [PubMed] [Google Scholar]
- 37. Nieder ML, et al. National Cancer Institute-National Heart, Lung and Blood Institute/pediatric blood and marrow transplant consortium First International Consensus Conference on late effects after pediatric hematopoietic cell transplantation: long-term organ damage and dysfunction. Biol Blood Marrow Transplant. 2011;17(11):1573–1584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Jagasia MH, et al. National Institutes of Health consensus development project on criteria for clinical trials in chronic graft-versus-host disease: I. The 2014 diagnosis and staging working group report. Biol Blood Marrow Transplant. 2015;21(3):389–401. doi: 10.1016/j.bbmt.2014.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Bergeron A, et al. Noninfectious lung complications after allogeneic haematopoietic stem cell transplantation. Eur Respir J. 2018;51(5):1702617. doi: 10.1183/13993003.02617-2017. [DOI] [PubMed] [Google Scholar]
- 40.Ghanem R, et al. The (re)emergence of aerosol delivery: treatment of pulmonary diseases and its clinical challenges. J Control Release. 2025;379:421–439. doi: 10.1016/j.jconrel.2025.01.017. [DOI] [PubMed] [Google Scholar]
- 41.Fernández-García R, Fraguas-Sánchez AI. Nanomedicines for pulmonary drug delivery: overcoming barriers in the treatment of respiratory infections and lung cancer. Pharmaceutics. 2024;16(12):1584. doi: 10.3390/pharmaceutics16121584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Berkenfeld K, et al. Formulation strategies, preparation methods, and devices for pulmonary delivery of biologics. Eur J Pharm Biopharm. 2024;204:114530. doi: 10.1016/j.ejpb.2024.114530. [DOI] [PubMed] [Google Scholar]
- 43.Haak BW, et al. Impact of gut colonization with butyrate-producing microbiota on respiratory viral infection following allo-HCT. Blood. 2018;131(26):2978–2986. doi: 10.1182/blood-2018-01-828996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Harris B, et al. Gut microbiota predict pulmonary infiltrates after allogeneic hematopoietic cell transplantation. Am J Respir Crit Care Med. 2016;194(4):450–463. doi: 10.1164/rccm.201507-1491OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Mayday MY, et al. High-throughput echo library prep protocol. Protocols.io. 2021. [DOI]
- 46.Robinson MD, et al. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hänzelmann S, et al. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7. doi: 10.1186/1471-2105-14-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.McKenzie AT, et al. DGCA: a comprehensive R package for differential gene correlation analysis. BMC Syst Biol. 2016;10(1):106. doi: 10.1186/s12918-016-0349-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Love MI, et al. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Steen CB, et al. Profiling Cell type abundance and expression in bulk tissues with CIBERSORTx. Methods Mol Biol. 2020;2117:135–157. doi: 10.1007/978-1-0716-0301-7_7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Travaglini KJ, et al. A molecular cell atlas of the human lung from single-cell RNA sequencing. Nature. 2020;587(7835):619–625. doi: 10.1038/s41586-020-2922-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Schulte-Schrepping J, et al. Severe COVID-19 is marked by a dysregulated myeloid cell compartment. Cell. 2020;182(6):1419–1440. doi: 10.1016/j.cell.2020.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Mandric I, et al. Profiling immunoglobulin repertoires across multiple human tissues using RNA sequencing. Nat Commun. 2020;11(1):3126. doi: 10.1038/s41467-020-16857-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Csárdi G, et al. igraph for R: R interface of the igraph library for graph theory and network analysis. https://zenodo.org/records/14736815 Accessed July 22, 2025.
- 56.Yu G, et al. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Argelaguet R, et al. Multi-omics factor analysis-a framework for unsupervised integration of multi-omics data sets. Mol Syst Biol. 2018;14(6):e8124. doi: 10.15252/msb.20178124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Zinter MS, et al. Pathobiological signatures of dysbiotic lung injury in pediatric patients undergoing stem cell transplantation. Nat Med. 2024;30(7):1982–1993. doi: 10.1038/s41591-024-02999-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Raw sequencing files and instructions to request download are available under controlled access on NIH dbGaP (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001684.v3.p1). Individual-level data are available indefinitely. Code and processed anonymized individual-level data files are available on GitHub (https://github.com/zinterm/pedBMT_BALseq; commit ID 4beb6dd). Values for all data points in graphs are reported in the Supporting Data Values file.









