Skip to main content
PLOS One logoLink to PLOS One
. 2022 Dec 8;17(12):e0278296. doi: 10.1371/journal.pone.0278296

Whole blood RNA-seq demonstrates an increased host immune response in individuals with cystic fibrosis who develop nontuberculous mycobacterial pulmonary disease

Miguel Dario Prieto 1,2, Jiah Jang 2, Alessandro N Franciosi 1,2, Yossef Av-Gay 3, Horacio Bach 3, Scott J Tebbutt 1,2,4, Bradley S Quon 1,2,*
Editor: Abdelwahab Omri5
PMCID: PMC9731410  PMID: 36480571

Abstract

Background

Individuals with cystic fibrosis have an elevated lifetime risk of colonization, infection, and disease caused by nontuberculous mycobacteria. A prior study involving non-cystic fibrosis individuals reported a gene expression signature associated with susceptibility to nontuberculous mycobacteria pulmonary disease (NTM-PD). In this study, we determined whether people living with cystic fibrosis who progress to NTM-PD have a gene expression pattern similar to the one seen in the non-cystic fibrosis population.

Methods

We evaluated whole blood transcriptomics using bulk RNA-seq in a cohort of cystic fibrosis patients with samples collected closest in timing to the first isolation of nontuberculous mycobacteria. The study population included patients who did (n = 12) and did not (n = 30) develop NTM-PD following the first mycobacterial growth. Progression to NTM-PD was defined by a consensus of two expert clinicians based on reviewing clinical, microbiological, and radiological information. Differential gene expression was determined by DESeq2.

Results

No differences in demographics or composition of white blood cell populations between groups were identified at baseline. Out of 213 genes associated with NTM-PD in the non-CF population, only two were significantly different in our cystic fibrosis NTM-PD cohort. Gene set enrichment analysis of the differential expression results showed that CF individuals who developed NTM-PD had higher expression levels of genes involved in the interferon (α and γ), tumor necrosis factor, and IL6-STAT3-JAK pathways.

Conclusion

In contrast to the non-cystic fibrosis population, the gene expression signature of patients with cystic fibrosis who develop NTM-PD is characterized by increased innate immune responses.

Introduction

Cystic fibrosis (CF) is a life-limiting genetic condition more prevalent in white populations (affecting approximately 1 in 3,000–4,000 live newborns) [1, 2]. People living with CF experience chronic respiratory symptoms and a steady decline in lung function leading to an increased risk of death or the need for a lung transplant [2, 3]. CF patients suffer from persistent and repetitive infections, including an increased lifetime risk of infection and disease caused by nontuberculous mycobacteria (NTM). According to data from the United States of America CF registry, around 12% of CF individuals in a given year have a positive culture for NTM, mostly of species belonging to the Mycobacterium avium (MAC) and Mycobacterium abscessus (MABs) complexes [3]. The clinical course of NTM infection in patients with CF is variable, ranging from an isolated positive culture to NTM pulmonary disease (NTM-PD) [4, 5]. The latter is diagnosed in a patient with two or more positive cultures of the same NTM, clinical deterioration, and characteristic radiological findings after excluding more common causes of the deteriorated status [4]. In CF, NTM-PD has been associated with an accelerated decline in lung function, and sputum positivity for NTM may affect eligibility for lung transplants [48].

Clinically, NTM infection in CF has been linked to specific age groups, clinical history of allergic bronchopulmonary aspergillosis, infection with Aspergillus spp., and chronic exposure to immunomodulatory drugs like macrolides and steroids [914]. However, fewer studies have evaluated risk factors for the progression from NTM infection to NTM-PD in the CF population. Caverly et al. showed that changes in the microbiome composition, including relative abundance of the Rhotia taxonomic unit, were associated with developing NTM-PD (n = 25) [15]. Furthermore, based on a prospective cohort of 96 CF patients with positive NTM culture, Martiniano et al. showed that those who progressed to NTM-PD had lower baseline Forced Expiratory Volume in 1 second (FEV1) and a faster lung function decline in the year before NTM infection [5]. While the role of the host immune response in the development of NTM-PD remains unclear in CF, various genetic polymorphisms have been reported in association with the diagnosis of NTM-PD in the non-CF population [16]. Additionally, Cowman et al. compared whole blood transcriptomics of patients with chronic obstructive pulmonary disease or non-CF bronchiectasis with and without a diagnosis of NTM-PD. Using microarray technology, they found reduced expression of genes involved in lymphocyte effector functions and interferon-γ production in those who developed NTM-PD. Also, they reported that those with NTM-PD and poor survival after diagnosis had a positive enrichment (higher expression) of innate immunity pathways compared to those with NTM-PD and better outcomes [17]. No prior studies have studied the relationship between host immune response to NTM and the development of NTM-PD in CF. In this exploratory study, we hypothesized that patients with CF who develop NTM-PD would have intrinsic downregulation of gene expression in interferon-γ and immune-related pathways. Thus, we evaluated patterns of gene expression associated with NTM-PD in a cohort of CF patients using whole-blood transcriptomics.

Methods

Study population and clinical data

This study is a secondary data analysis using blood samples and data from the “CF Biomarker” cohort approved by the University of British Columbia-Providence Health Care ethics review board (H12-00835). The local ethics board also approved the secondary analysis (H20-00117). Patients in the parent cohort were recruited following informed consent at the St. Paul’s Hospital Adult CF Clinic (Vancouver, Canada) between January 2012 and December 2019. In the current study, we included participants who consented to the future use of their samples and data, had at least one positive respiratory culture for NTM, and had a whole blood RNA sample available (PAXgene® stored at -70°C). Lung transplant recipients and subjects without a definite diagnosis of CF were excluded [18]. We preferentially selected blood samples taken during clinically stable periods and closest to the first positive growth of NTM. We did not limit blood sampling to within a specified time frame in relation to NTM infection or the development of NTM-PD as we expected an intrinsic downregulation of gene expression in immune-related pathways in patients with CF who develop NTM-PD.

Clinical and demographic data analysis

Demographic characteristics, anthropometric measurements, pulmonary function tests, CFTR genotyping results, and microbiology laboratory reports were extracted from the clinical charts using a pre-defined case report form. Clinical and demographic data were extracted starting from 2 years before the recorded time of first NTM isolation for each patient (as far back as May 2001) and follow-up data was reviewed for the development of NTM-PD or not up until September 2022. Participants were censored at the time of NTM-PD diagnosis, lung transplant, or death. The diagnosis of NTM-PD was defined independently by two expert CF clinicians, based on current CF NTM guidelines, and disagreements were resolved by consensus [4]. Lung function measurements (forced expiratory volume in 1 second [FEV1]) were standardized according to the 2012 Global Lung Function Initiative equation [19]. Clinical and demographic differences were compared according to the development or not of NTM-PD using univariable statistics in R studio (build 443) and R version 4.1.3 with the tidyverse package collection [2022]. Statistical comparisons used parametric and non-parametric tests depending on the skewness of data and the type of variable analyzed, the significance level was defined at alpha < 0.05. Plots were produced using the ggplot and ggpubr packages [23, 24].

RNA extraction and RNA sequencing experiments

The PAXgene Blood micro-RNA Kit was used for RNA extraction (in five batches) following the manufacturer’s instructions (QIAGEN) and omitting the DNA depletion steps [25]. The quality of the extracted RNA was evaluated using the 260/280nm ratio in a NanoDrop spectrophotometer; the mean concentration was 117.4 ± 73.2 (SD) ng/uL. The RNA Integrity Number (RIN) was evaluated using a 2100 Bioanalyzer instrument from Agilent. A RIN ≥ 7 was required for sequencing; two samples failed bioanalyzer quality control and were re-extracted. Poly-A RNA-seq libraries were prepared in a single batch with a starting concentration of 250 ng per sample. The library preparation was performed using the strand-specific Nextera NEB mRNA kit with adapters AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC and AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT. The NovaSeq 6000 S4 PE100 (Illumina) platform with automatic base calling (RTA3) and an initial concentration of 200 pM per library was used for sequencing. Samples were multiplexed in a single flow cell after dual barcoding. The sequencing run generated 150 bp paired-end reads, the lowest mean Phred+33 score per file was 36/40. The median number of reads per sample was 65 x 106 (range: 36-151 x 106). All transcriptomic data from this study can be found in NCBI Gene Expression Omnibus with accession GSE20516. The scripts to reproduce our results and additional metadata are deposited in the Dryad Digital Repository (doi:10.5061/dryad.np5hqbzx2).

RNA-seq data analysis

Alignment and quantification of raw reads (FASTQ format) were performed in a High-Performance computational cluster (CentOS 7 Linux). Fast-QC was used to evaluate the quality of raw reads before and after alignment [26]. Untrimmed reads were aligned to the primary human assembly GRCh38.p13 v38 (May 21, 2021) from GENCODE using STAR v 2.7 [27, 28]. We produced gene-level quantification (bam format) in RSEM v1.3.3 with default parameters [29]. Quality control of alignment was performed using Picard tools, and quality reports were summarized with MultiQC [30, 31]. The raw count data was imported to R version 4.1.1 using tximport [32] and annotated with the Ensembl database version 104 of Bioconductor [33, 34]. Globin subunits and genes showing abnormally large expression counts (>7*107) were filtered out before analysis; prior to filtering, 68–93% of reads per sample were overrepresented. An outlier sample was detected in exploratory principal component analysis (CFB2006) and removed from the dataset. Count data were corrected for extraction batches using the ComBat-seq algorithm [35]. Differential gene expression of the group with vs. without NTM-PD was performed using DESeq2 (v 1.3.2) with a cut-off FDR of 0.3. No minimum fold-change threshold was defined to increase the recovery of differentially expressed genes in this exploratory cohort [36]. Finally, gene set enrichment analysis of the molecular signatures database (human hallmark pathways) was performed with the fgsea package using the fold change ranked results from DESeq2 [3740]. The cell-specific population compositions were estimated and explored using gene expression deconvolution in cibersortX against the complete blood cell counts (CBC) measured in patients at the time of sample procurement. The deconvolution process used the LM22 (22 blood immune cell types) signature matrix, bulk mode batch correction, and a hundred iterations [41]. Reporting of high-throughput sequencing methods follows the MINISEQE recommendations [42].

Results

Study cohort selection

Among 189 participants included in the CF biomarkers study, 53 had one or more positive airway cultures for NTM, and 42 of these fulfilled all eligibility criteria and were included in the study. Exclusion criteria included unclear CF diagnosis (n = 1), lung transplant recipient (n = 1), and no whole blood RNA sample available (n = 9). The diagnosis of NTM-PD was defined independently by two expert clinicians with a Cohen’s kappa of 90%. Overall, 12 out of the 42 (29%) participants progressed to NTM-PD during clinical follow-up (up to September 2022). Among the 30 patients who did not develop NTM-PD, 22 spontaneously cleared the NTM from their respiratory cultures (12 after a single positive culture and 10 after multiple cultures), 3 had chronic colonization (NTM positive on all cultures) and 5 had intermittent positive cultures for NTM. The median follow-up time was 83.6 months (IQR 43–136), and the median interval between the first NTM growth and diagnosis of NTM-PD was 14.8 months (IQR 3–30). Patients who did not develop NTM-PD were followed for a minimum of 20.7 months after the first isolation of an NTM. Most patients were infected with M. avium complex (n = 22 in total; n = 7 progressed to NTM-PD) or M. abscessus complex (n = 12 in total; n = 5 progressed to NTM-PD). The remaining mycobacterial species included M. gordonae (n = 3), M. fortuitum (n = 2), M. cosmeticum, M. peregrinum, and M. simiae (n = 1 each), and none of these developed NTM-PD.

Clinical and demographic characteristics of the cohort

Included participants were predominantly male (n = 29, 69%), with a median age of 25 years, and 83% had at least one copy of F508del. Table 1 summarizes the demographic and clinical characteristics of the study cohort at baseline (first NTM growth). No differences at baseline were found in exposure to azithromycin (prior three months) or oral steroids (prior month) between patients who did vs. did not progress to NTM-PD. Furthermore, baseline clinical characteristics, including body mass index, FEV1, and sputum microbiology, were not significantly different between the NTM-PD groups. No significant differences were found in co-infection rates at baseline for Pseudomonas aeruginosa, Burkholderia spp., Stenotrophomonas maltophilia, or Aspergillus spp. At baseline, there were also no differences in demographic or clinical factors (age, sex, body mass index, ppFEV1, genotype, or co-morbidities) between patients infected with MAC or MABs (S1 Table). The median time window between the first NTM growth and whole blood RNA collection was 20.8 months (range between -15.2 and 171 months as 5/42 samples were collected before first NTM growth). Six patients were sampled after therapy for NTM-PD (median of 49 months before sampling).

Table 1. Demographic and clinical characteristics of included patients at baseline, stratified by NTM-PD status.
Total (n = 42) No NTM-PD (n = 30) NTM-PD (n = 12)
Age—median (IQR) 28 (22–37) 30 (23–37) 24 (21.8–34.8)
Females—n (%) 13 (31.0) 9 (30.0) 4 (33.3)
Genotype—n (%)
 F508del/F508del 17 (40.5) 11 (36.7) 5 (41.7)
 F508del/other 18 (42.9) 13 (43.3) 6 (50.0)
 Others 7 (16.7) 6 (20.0) 1 (8.3)
Pancreatic insufficiency—n (%) 33 (78.6) 23 (76.7) 10 (83.3)
CF diabetes—n (%) 10 (23.8) 7 (23.3) 3 (25.0)
Body mass indexamedian (IQR) 22 (20.4–23.4) 22 (20.7–23.6) 22 (20.0–22.6)
FEV1 [% predicted]amedian (IQR) 80 (63.7–89.3) 80 (63.9–89.2) 80 (61.6–89.5)
Oral steroids exposure—n (%) a 3 (7.1) 2 (6.7) 1 (8.3)
Macrolide exposure—n (%) a 14 (33.3) 11 (36.7) 3 (25.0)
Follow-up time in monthsbmedian(IQR) 83.6 (43–136) 102 (75–139) 14.8 (3–30)
Sputum positivity—n (%) c
Pseudomonas aeruginosa 15 (39) 12 (41) 3 (33)
Aspergillus spp. 12 (32) 8 (27) 4 (44)
Stenotrophomonas maltophilia 7 (18) 4 (14) 3 (33)
Burkholderia spp. 3 (8) 2 (7) 1 (11)

a n = 41, missing data for 1 patient.

b Follow-up time after first positive culture and until the patient developed NTM-PD, died, had a lung transplant, was lost to follow-up, or the study was closed (September 2022).

c n = 38, missing data for 4 patients.

IQR: interquartile range.

There were no significant differences at baseline in the absolute number of leukocytes, neutrophils, monocytes, or lymphocytes between NTM-PD groups (S1 Fig). A principal component analysis of all white blood cell parameters explained more than 50% of the variance in the first two components but did not separate NTM groups.

Quality assessment of RNA-seq data

Quality control of raw reads did not show bias in nucleotide distribution, sequencing depth, or base calling quality. At least 94.6% of the reads per sample aligned to the reference genome, and at least 49.1% of them aligned to protein-coding regions, see S2 Fig. During exploratory principal component analysis of normalized and batch corrected gene counts, we found no particular grouping by extraction batch, biological sex, or mycobacterial species (after removing the CFB2006 outlier). A principal component analysis of normalized counts did not show a graphical difference according to the timing of sample procurement for RNAseq after dividing the cohort into three groups: before the first positive NTM culture, after the first isolation without progression to NTM-PD, and after the first isolation with progression to NTM-PD (S3 Fig). A PermANOVA test of this distance matrix mas not significant (p = 0.29) [43]. CybersortX deconvolution of sequencing counts showed a good prediction of the initial white blood cell counts. We observed Pearson correlations greater than 0.8 for lymphocyte and neutrophil counts and a moderate correlation (r = 0.4) for monocyte counts (S4 Fig).

Evaluation of whole blood gene expression of candidate genes of NTM-PD in CF

We explored the candidate genes identified by Cowman et al. in our cohort. Among the 213 gene transcripts associated with NTM-PD in their cohort, only two genes (interleukin-2 receptor subunit Beta—IL-2RB and granzyme K—GZMK) were differentially expressed in our study population (FDR < 0.3) [17]. We also explored the mean expression of the differentially expressed genes with the highest fold change in the Cowman cohort. We found no apparent trends or statistical differences using univariable analysis. S5 Fig shows the mean expression of 13 of the top 15 features reported by Cowman et al. in our NTM-PD groups. We also evaluated the genes associated with poorer survival in those with NTM-PD among Cowman’s cohort, as a potentially similar clinical course to the development of NTM-PD in CF. Still, none of the 215 genes associated with decreased survival was found as differentially expressed in our cohort according to the adjusted p-value. Furthermore, only 6 out of these 215 genes had an unadjusted p-value below 0.05.

Whole blood gene expression of individuals who developed NTM-PD in our CF cohort

Differential gene expression with DESeq2 identified 111 differentially expressed genes (DEG) at an FDR cut-off of < 0.3. The top 30 DEG are summarized in Table 2 in descending order of adjusted p-value. The Bland–Altman, and Volcano plots in Fig 1 summarize the differential gene expression analysis results. We used gene set enrichment analysis of the molecular signatures database to explore the biological implications of these gene expression results. The results showed significant positive enrichment (higher expression in those who developed NTM-PD) in pathways involved in the interferon-α response (Adjusted p = 2.5 e-09), interferon-γ response (Adjusted p = 2.5 e-09), and IL6-JAK-STAT3 signaling (Adjusted p = 1.88 e-05). Table 3 includes all enriched pathways with an adjusted p-value below 0.001.

Table 2. Top 30 differentially expressed genes in those who did vs. did not develop NTM-PD.

SYMBOL Adjusted p value stat DESCRIPTION
TBC1D3H <0.001 -10.849 TBC1 domain family member 3H
TCL1A 0.054 4.426 TCL1 family AKT coactivator A
CD177 0.054 4.477 CD177 molecule
NA 0.054 4.504 novel transcript, antisense to TCL1A
RN7SL731P 0.069 4.322 RNA, 7SL, cytoplasmic 731, pseudogene
ARHGEF25 0.070 -4.279 Rho guanine nucleotide exchange factor 25
VPREB3 0.084 4.187 V-set pre-B cell surrogate light chain 3
FADS3 0.084 4.174 fatty acid desaturase 3
MKRN3 0.089 -4.133 makorin ring finger protein 3
SCN3A 0.095 4.053 sodium voltage-gated channel alpha subunit 3
CLEC17A 0.095 4.067 C-type lectin domain containing 17A
NA 0.095 4.063 novel transcript, antisense to HS3ST1
PAK6 0.095 -4.023 p21 (RAC1) activated kinase 6
NA 0.095 4.016 POM121 membrane glycoprotein-like 1 pseudogene
CACNA1C-AS1 0.109 -3.967 CACNA1C antisense RNA 1
CTSW 0.117 -3.934 cathepsin W
NA 0.117 -3.919 novel transcript
NIBAN3 0.121 3.897 niban apoptosis regulator 3
DDX11L2 0.136 -3.856 DEAD/H-box helicase 11 like 2 (pseudogene)
RPL13P12 0.150 3.820 ribosomal protein L13 pseudogene 12
RBPMS2 0.158 -3.784 RNA binding protein, mRNA processing factor 2
NA 0.158 -3.783 novel transcript
RIMBP2 0.163 3.753 RIMS binding protein 2
NA 0.163 -3.753 zinc finger protein 726 pseudogene 1
GZMK 0.176 -3.660 granzyme K
KLRC1 0.176 -3.653 killer cell lectin like receptor C1
CLDN12 0.176 -3.698 claudin 12
CMBL 0.176 3.687 carboxymethylenebutenolidase homolog
KRT72 0.176 -3.720 keratin 72
SERF1A 0.176 3.709 small EDRK-rich factor 1A

Fig 1. Volcano plot and MA plot of DESeq2 differential expression analysis.

Fig 1

(A) Bland-Altman plot of normalized mean counts and shrunk (apeglm) Log2 fold-changes. The differentially expressed genes are in red, and the top 15 features are labeled. (B) Volcano plot of differential gene expression analysis with cut-offs of p< 0.001 and Log2 fold-change larger than absolute 0.5. Highlighted in red are genes with higher transcription levels in patients who progressed to NTM-PD, in green are those with lower expression in the same group.

Table 3. Gene set enrichment analysis of whole blood gene expression in those who did vs. did not develop NTM-PD.

Pathway p-value FDR NES Size
Interferon α response <0.001 <0.001 2.98 96
Interferon γ response <0.001 <0.001 2.65 199
IL-6-JAK-STAT3 signaling <0.001 <0.001 2.05 81
Heme metabolism <0.001 <0.001 2.04 188
Tumor necrosis factor-α signaling via NFK-β <0.001 <0.001 1.74 198
Inflammatory response <0.001 <0.001 1.66 194
Protein secretion <0.001 <0.001 1.53 86
Oxidative phosphorylation <0.001 <0.001 1.51 188
E2F targets <0.001 <0.001 -1.60 193

Enrichment of molecular signatures database 3.0, accessed in May 2022. NES: normalized enrichment score, positive scores represent enrichment in patients that developed NTM-PD. FDR: false discovery rate.

Discussion

The main finding from this exploratory study is that CF individuals who developed NTM-PD after initial growth of NTM had increased expression of genes involved in pathways related to interferon responses, tumor necrosis factor-α production, and IL6-JAK-STAT3 signaling [17]. This contrasts with our expectation as we hypothesized that defective cellular immunity, as reflected by decreased gene expression in host immune response pathways, would confer an increased risk of persistent NTM infection and subsequent development of pulmonary disease.

While our results may seem contradictory to the findings reported by Cowman et al., which found defective T lymphocyte and interferon-γ immune responses in patients with NTM-PD, individuals with NTM-PD in their study were compared to control subjects with respiratory disease (COPD or bronchiectasis) but without NTM. As such, the decreased immune responses could have conferred susceptibility to NTM-PD in their study involving subjects without other known host immune-related risk factors for NTM. The airways of individuals with CF already create a hospitable environment for NTM growth and infection; therefore, reduced protective immunity in this setting could play a less significant role.

In our study, we compared CF individuals with NTM infection who did vs. did not progress to NTM-PD, and a pro-inflammatory innate immune response characterized the latter. Indeed, much of the lung damage that occurs in CF results from the deleterious effects of an overexuberant inflammatory response to infection, as the host is incapable of controlling microbial growth due to intrinsic host defense defects related to CFTR dysfunction [44, 45]. Our results demonstrating increased immune activity in patients with NTM-PD are consistent with a recent study that compared the bronchoalveolar lavage fluid of CF patients with vs. without NTM infection and found that those with NTM infection had increased immune cells and inflammation [46]. Moreover, regions of the lung with increased inflammation correlated with greater lung tissue damage based on CT scans [46].

Interestingly, several of the top differentially expressed genes found in the Cowman cohort were also downregulated in our cohort. Among them, granzyme K and interleukin 2 receptor β were differentially expressed in our cohort, see S5 Fig. Both genes are primarily expressed by lymphocytes but their relationship with susceptibility to NTM-PD remains unclear. Granzyme K is an effector protein involved in the cytotoxic activity of CD8+ lymphocytes, while interleukin 2 participates in the expansion of T cells. Despite the decreased expression of these genes, the pro-inflammatory response involving innate immune pathways related to interferons and tumor necrosis factor predominated in our enrichment results. The functional significance and the cells responsible for this increased production need to be further elucidated.

Our study has several limitations that should be acknowledged. The parent study was not initially designed for the discovery of new biomarkers or to evaluate the pathological process leading to NTM-PD in patients with CF. Instead, we performed a secondary exploratory analysis with existing samples to confirm if a gene expression pattern associated with NTM-PD was also present in CF patients. As such, we cannot ascertain the temporal relationship between infection with NTM and the gene expression profiles because the time window between NTM positivity and whole blood sampling was variable (median of 20 months). Based on preliminary data, we did not control for this factor as we expected to identify permanent intrinsic deficits in host defense associated with developing NTM-PD. Future studies will benefit from regular blood sampling in the period between the first NTM growth and the development of NTM-PD to determine if the innate immune response ramps up over time. Furthermore, additional factors that may affect the progression to NTM-PD, like the virulence factors of infecting NTM and the lung microbiome composition, should be explored. The gene expression results are unlikely to be biased by the initial proportion of cell populations because baseline CBC data showed no significant differences, and the deconvolution of gene expression data corroborated the distribution of cell populations in the CBC. The interval between the diagnosis of NTM-PD and sampling may also affect the results. Interestingly, a sensitivity analysis excluding 6 samples from patients sampled after starting treatment for NTM-PD displayed a similar pattern of enrichment to our main results (S2 Table). Finally, our cohort represents mostly patients naïve to CFTR modulator therapy as access to lumacaftor/ivacaftor and tezacaftor/ivacaftor was very limited during the study period in Canada due to lack of public and private reimbursement in most cases and elexacaftor/tezacaftor/ivacaftor was not reimbursed until September 2022. Only four patients, including one who progressed to NTM-PD, were receiving modulator therapy when the blood sample was collected (1 on ivacaftor, 2 on lumacaftor/ivacaftor, and 1 on tezacaftor/ivacaftor). As CFTR modulator therapy may change the lung microenvironment and affect the host susceptibility to mycobacterial infection and subsequent development of disease, our results may have limited generalizability to a CFTR modulator treated population but can serve as a baseline comparison for future studies in cohorts receiving these highly effective therapies [47].

In conclusion, our cohort shows that gene expression in patients who develop NTM-PD displays an exaggerated expression of genes that participate in innate immune responses. Overall, this study provides novel insights into the host response to NTM infection in the presence of CFTR dysfunction. The results of this exploratory cohort are valuable for future biomarker discovery and can serve as a validation dataset for other research groups.

Supporting information

S1 Table. Demographic and clinical characteristics of patients infected with MAC or MABs at the time of first positive NTM culture.

(DOCX)

S2 Table. Gene set enrichment analysis of whole blood gene expression in those who did vs. did not develop NTM-PD, excluding 6 samples taken after diagnosis of NTM-PD.

(DOCX)

S1 Fig. Absolute whole blood cell population counts in samples used for RNAseq.

Boxplots with the distribution of absolute values for leukocytes, neutrophils, lymphocytes, and monocytes in complete blood counts at the time of blood sample procurement. No significant differences using the unpaired Wilcoxon rank-sum test.

(TIF)

S2 Fig. Assignment of reads to genomic regions produced by Picard tools.

Coding: protein-coding region. UTR: untranscribed region. Intronic: intronic region. Intergenic: intergenic region. Ribosomal: mapping to ribosomal RNA or proteins. PF not aligned: bases that passed the quality filter and were not aligned.

(TIF)

S3 Fig. Principal component analysis of normalized and batch corrected count data.

Separated by the timing of blood sample procurement for RNAseq into three categories: before first positive NTM culture, after first positive NTM culture without progression to NTM-PD, and after first positive NTM culture with progression to NTM-PD. No apparent separation is seen graphically. A PermANOVA test shows no significant differences in the distance matrices by group (p = 0.29).

(TIFF)

S4 Fig. CibersortX deconvolution of cell percentages compared to ground truth reference values in CBC.

(TIF)

S5 Fig. Mean expression values per NTM-PD group in top differentially expressed genes in Cowman analysis.

The p-values represent the results of independent Wilcoxon tests. The feature TRGC1 (T Cell Receptor Gamma Constant 1) was a duplicated probe in the initial analysis, while the uncharacterized locus FLJ45825 was not detected in our dataset. PMS2P1: PMS1 homolog 2, Mismatch Repair System Component Pseudogene 1. CRTAM: Cytotoxic and regulatory T cell Molecule. IFNG: Interferon Gamma. GZMK: Granzyme K. PZP: PZP alpha-2-macroglobulin like. XCL2: X-C motif chemokine ligand 2. AK5: Adenylate kinase 5. FCRL3: Fc receptor Like 3. PPIH: Peptidylprolyl isomerase H. A2M: Alpha-2-macroglobulin. TIGIT: T cell immunoreceptor with Ig and ITIM domains. MUC12: Mucin 12, cell surface associated.

(TIF)

Acknowledgments

The authors would like to acknowledge the feedback and support of Kang Dong in the review of the bioinformatics pipeline employed for analysis. This research was supported in part through computational resources and services provided by Advanced Research Computing at the University of British Columbia.

Data Availability

The RNA sequencing data, raw reads, and normalized counts were deposited in the Gene Expression Omnibus repository of NCBI (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE205161, access token: kxsteiucbhebbkp). The dataset will be publicly available without restriction once we finish the publication process. The scripts for analysis of clinical, demographic, and genomic data have been deposited with the rest of the necessary data sets in dryad (https://datadryad.org/stash/share/FZ1DbrVy1KzveU5UQ_UbjqG-IrfFSouFSpspXhdnghg). While completing the submission process, the data is accessible by request of the reviewers. Once published, it will be made available without restrictions.

Funding Statement

Franciossi AN is supported by a Research Trainee Award (RT-2020-0493) from the Michael Smith Foundation for Health Research (https://www.msfhr.org/). Quon BS is supported by a Michael Smith Foundation for Health Research Scholar Award, with no reference number. Av-Gay Y is supported by the Cystic Fibrosis Canada Foundation, with no reference number (https://www.cysticfibrosis.ca/). The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Sanders DB, Fink AK. Background and Epidemiology. Pediatric Clinics of North America. 2016;63: 567–584. doi: 10.1016/j.pcl.2016.04.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.O’Sullivan BP, Freedman SD. Cystic fibrosis. The Lancet. 2009;373: 1891–1904. doi: 10.1016/S0140-6736(09)60327-5 [DOI] [PubMed] [Google Scholar]
  • 3.Foundation CF. Cystic Fibrosis Foundation Patient Registry 2019 Annual Data Report. Cystic Fibrosis Foundation; 2020.
  • 4.Floto RA, Olivier KN, Saiman L, Daley CL, Herrmann JL, Nick JA, et al. US Cystic Fibrosis Foundation and European Cystic Fibrosis Society consensus recommendations for the management of non-tuberculous mycobacteria in individuals with cystic fibrosis. Thorax. 2016;71: i1–i22. doi: 10.1136/thoraxjnl-2015-207360 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Martiniano SL, Sontag MK, Daley CL, Nick JA, Sagel SD. Clinical significance of a first positive nontuberculous mycobacteria culture in cystic fibrosis. Annals of the American Thoracic Society. 2014;11: 36–44. doi: 10.1513/AnnalsATS.201309-310OC [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Qvist T, Taylor-Robinson D, Waldmann E, Olesen HV, Hansen CR, Mathiesen IH, et al. Comparing the harmful effects of nontuberculous mycobacteria and Gram negative bacteria on lung function in patients with cystic fibrosis. Journal of Cystic Fibrosis. 2016;15: 380–385. doi: 10.1016/j.jcf.2015.09.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Charles R Esther J, Esserman DA, Gilligan P, Kerr A, Noone PG, Esther CR, et al. Chronic Mycobacterium abscessus infection and lung function decline in cystic fibrosis. Journal of cystic fibrosis: official journal of the European Cystic Fibrosis Society. 2010;9: 117–123. doi: 10.1016/j.jcf.2009.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Tissot A, Thomas MF, Corris PA, Brodlie M. NonTuberculous Mycobacteria infection and lung transplantation in cystic fibrosis: a worldwide survey of clinical practice. BMC pulmonary medicine. 2018;18. doi: 10.1186/s12890-018-0635-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Catherinot E, Roux AL, Vibet MA, Bellis G, Ravilly S, Lemonnier L, et al. Mycobacterium avium and Mycobacterium abscessus complex target distinct cystic fibrosis patient subpopulations. Journal of Cystic Fibrosis. 2013;12: 74–80. doi: 10.1016/j.jcf.2012.06.009 [DOI] [PubMed] [Google Scholar]
  • 10.Daley CL, Iaccarino JM, Lange C, Cambau E, Wallace RJ, Andrejak C, et al. Treatment of nontuberculous mycobacterial pulmonary disease: An official ATS/ERS/ESCMID/IDSA clinical practice guideline. European Respiratory Journal. 2020;56. doi: 10.1183/13993003.00535-2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Viviani L, Harrison MJ, Zolin A, Haworth CS, Floto RA. Epidemiology of nontuberculous mycobacteria (NTM) amongst individuals with cystic fibrosis (CF). Journal of Cystic Fibrosis. 2016;15: 619–623. doi: 10.1016/j.jcf.2016.03.002 [DOI] [PubMed] [Google Scholar]
  • 12.Levy I, Grisaru-Soen G, Lerner-Geva L, Kerem E, Blau H, Bentur L, et al. Multicenter cross-sectional study of nontuberculous mycobacterial infections among cystic fibrosis patients, Israel. Emerging infectious diseases. 2008;14: 378–384. doi: 10.3201/eid1403.061405 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Coolen N, Morand P, Martin C, Hubert D, Kanaan R, Chapron J, et al. Reduced risk of nontuberculous mycobacteria in cystic fibrosis adults receiving long-term azithromycin. Journal of Cystic Fibrosis. 2015;14: 594–599. doi: 10.1016/j.jcf.2015.02.006 [DOI] [PubMed] [Google Scholar]
  • 14.Olivier KN, Weber DJ, Lee JH, Handler A, Tudor G, Molina PL, et al. Nontuberculous mycobacteria. II: nested-cohort study of impact on cystic fibrosis lung disease. American journal of respiratory and critical care medicine. 2003;167: 835–840. doi: 10.1164/rccm.200207-679OC [DOI] [PubMed] [Google Scholar]
  • 15.Caverly LJ, Zimbric M, Azar M, Opron K, LiPuma JJ. Cystic fibrosis airway microbiota associated with outcomes of nontuberculous mycobacterial infection. ERJ Open Research. 2021;7: 00578–02020. doi: 10.1183/23120541.00578-2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lipner EM, Garcia BJ, Strong M. Network analysis of human genes influencing susceptibility to mycobacterial infections. PLoS ONE. 2016;11. doi: 10.1371/journal.pone.0146585 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Cowman SA, Jacob J, Hansell DM, Kelleher P, Wilson R, Cookson WOC, et al. Whole-blood gene expression in pulmonary nontuberculous mycobacterial infection. American Journal of Respiratory Cell and Molecular Biology. 2018;58: 510–518. doi: 10.1165/rcmb.2017-0230OC [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Farrell PM, White TB, Ren CL, Hempstead SE, Accurso F, Derichs N, et al. Diagnosis of Cystic Fibrosis: Consensus Guidelines from the Cystic Fibrosis Foundation. Journal of Pediatrics. 2017;181: S4–S15.e1. doi: 10.1016/j.jpeds.2016.09.064 [DOI] [PubMed] [Google Scholar]
  • 19.Quanjer PH, Stanojevic S, Cole TJ, Baur X, Hall GL, Culver BH, et al. Multi-ethnic reference values for spirometry for the 3–95-yr age range: the global lung function 2012 equations. European Respiratory Journal. 2012;40: 1324–1343. doi: 10.1183/09031936.00080312 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2022. https://www.R-project.org/
  • 21.Wickham H, Averick M, Bryan J, Chang W, McGowan L, François R, et al. Welcome to the Tidyverse. Journal of Open Source Software. 2019;4: 1686. doi: 10.21105/joss.01686 [DOI] [Google Scholar]
  • 22.RStudio Team. RStudio: Integrated Development Environment for R. Boston, MA: RStudio, PBC.; 2020. http://www.rstudio.com/
  • 23.Kassambara A. ggpubr: “ggplot2” Based Publication Ready Plots. 2020. https://CRAN.R-project.org/package=ggpubr
  • 24.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York; 2016. https://ggplot2.tidyverse.org
  • 25.PreAnalytix. PAXgene® Blood miRNA Kit Handbook. 2020. https://www.qiagen.com/us/resources/download.aspx?id=fa752f15-2496-4805-ac3a-a1b541135dea&lang=en
  • 26.Andrews S, Krueger F, Segonds-Pichon A, Biggins L, Krueger C, Wingett S. FastQC. A Quality Control Tool for High Throughput Sequence Data [Online]. Babraham, UK; 2012.
  • 27.Frankish A, Diekhans M, Ferreira AM, Johnson R, Jungreis I, Loveland J, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic acids research. 2019;47: D766–D773. doi: 10.1093/nar/gky955 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29: 15–21. doi: 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC bioinformatics. 2011;12. doi: 10.1186/1471-2105-12-323 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016;32: 3047–3048. doi: 10.1093/bioinformatics/btw354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Picard toolkit. Broad Institute, GitHub repository. Broad Institute; 2019. https://broadinstitute.github.io/picard/
  • 32.Soneson C, Love MI, Robinson MD. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Research. 2015;4: 1521. doi: 10.12688/f1000research.7563.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, et al. Orchestrating high-throughput genomic analysis with Bioconductor. Nature Methods. 2015;12: 115–121. doi: 10.1038/nmeth.3252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Rainer J, Gatto L, Weichenberger CX. ensembldb: an R package to create and use Ensembl-based annotation resources. Bioinformatics. 2019. doi: 10.1093/bioinformatics/btz031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhang Y, Parmigiani G, Johnson WE. ComBat-seq: batch effect adjustment for RNA-seq count data. NAR Genomics and Bioinformatics. 2020;2. doi: 10.1093/nargab/lqaa078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15: 1–21. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Korotkevich G, Sukhov V, Sergushichev A. Fast gene set enrichment analysis. bioRxiv. 2019. doi: 10.1101/060012 [DOI] [Google Scholar]
  • 38.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences. 2005;102: 15545–15550. doi: 10.1073/pnas.0506580102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Liberzon A, Birger C, Thorvaldsdóttir H, Ghandi M, Mesirov JP, Tamayo P. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell systems. 2015;1: 417. doi: 10.1016/j.cels.2015.12.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27: 1739–1740. doi: 10.1093/bioinformatics/btr260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Newman AM, Steen CB, Liu CL, Gentles AJ, Chaudhuri AA, Scherer F, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nature Biotechnology. 2019;37: 773–782. doi: 10.1038/s41587-019-0114-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Brazma A, Ball C, Bumgarner R, Furlanello C, Miller M, Quackenbush J, et al. MINSEQE: Minimum Information about a high-throughput Nucleotide SeQuencing Experiment—a proposal for standards in functional genomic data reporting. 2012 [cited 31 Oct 2022].
  • 43.Zapala MA, Schork NJ. Multivariate regression analysis of distance matrices for testing associations between gene expression patterns and related variables. Proc Natl Acad Sci U S A. 2006;103: 19430–19435. doi: 10.1073/pnas.0609333103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gannon AD, Darch SE. Same game, different players: Emerging pathogens of the CF lung. mBio. 2021;12: 1–13. doi: 10.1128/mBio.01217-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Bernut A, Dupont C, Ogryzko NV, Neyret A, Herrmann JL, Floto RA, et al. CFTR Protects against Mycobacterium abscessus Infection by Fine-Tuning Host Oxidative Defenses. Cell reports. 2019;26: 1828–1840.e4. doi: 10.1016/j.celrep.2019.01.071 [DOI] [PubMed] [Google Scholar]
  • 46.Hayes Don Jr, Shukla RK, Cheng Y, Gecili E, Merling MR, Szczesniak RD, et al. Tissue-localized immune responses in people with cystic fibrosis and respiratory nontuberculous mycobacteria infection. JCI Insight. 2022. doi: 10.1172/jci.insight.157865 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Ricotta EE, Prevots DR, Olivier KN. CFTR modulator use and risk of nontuberculous mycobacteria positivity in cystic fibrosis, 2011–2018. ERJ Open Research. 2022;8. doi: 10.1183/23120541.00724-2021 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Abdelwahab Omri

4 Oct 2022

PONE-D-22-24141Whole blood RNA-Seq demonstrates an increased host immune response in individuals with cystic fibrosis who develop nontuberculous mycobacterial pulmonary diseasePLOS ONE

Dear Dr. Prieto,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Nov 18 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Abdelwahab Omri, Pharm B, Ph.D, Laurentian University 

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. 

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is a retrospective study evaluating whole blood gene expression using bulk RNA-seq in a cohort of cystic fibrosis patients with samples collected closest in timing to the first isolation of nontuberculous mycobacteria. The study population included patients who did (N=12) and did not (n-30) develop NTM-PD following first nontuberculous mycobacteria growth.

1. It is not stated in the inclusion/exclusion criteria that people on CFTR modulator therapy were excluded from analysis. However, this is mentioned in the discussion. Please add this to the exclusion criteria. Interestingly, the vast majority of patients were eligible for E/T/I therapy based on having one or more copies of F508del. I presume the timing of data abstraction preceded approval for E/T/I in Canada. Clarification of these details would be helpful, given most people meeting eligibility in the US (and some other locations) are likely on highly effective modulator therapy, skewing the presented results to a population that is not representative of most of North America. Please include some discussion of this issue in the manuscript.

2. Please better define the patients who did not develop NTM-PD. What percentage were intermittently colonized (NTM not recovered from every AFB culture, but rather intermittently) vs chronically colonized (NTM recovery with every AFB culture) vs spontaneously cleared infection (transient or chronic colonization, but clear after one year of negative cultures)?

3. Please define the data abstraction timeframe?

4. What was the total length of time that patients were followed to determine if they met criteria for NTM-PD? Some patients will be intermittently positive for years before conversion to NTM disease. Did some patients that were initially without NTM-PD found to have PD after data analysis was completed?

5. Per the manuscript, blood sampling was not limited to a specified timeframe in relation to NTM infection, but closest to the first positive growth. Presumptively these blood samples were collected exclusively after the first NTM recovery, but that is not explicitly stated. Please clarify. Additionally, 6/12 (50%) patients were sampled after initiation of NTM-specific therapy. Additional details surrounding the timing would be helpful.

6. In your analysis were there any differences between M. abscessus or M. avium patients? The n was very small, so statistical differences may not be observed. However, were trends observed? Please better define these data.

7. Table 1. Please define “follow up time”? Is this the total time a patient was followed post first + NTM culture?

Reference is missing in lines 270-273.

8. Have you compared bulk RNA-seq prior to first positive NTM isolate compared to the two groups?

Thank you for allowing be to participate in the review process.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Dec 8;17(12):e0278296. doi: 10.1371/journal.pone.0278296.r002

Author response to Decision Letter 0


12 Nov 2022

Re: Manuscript PONE-D-22-24141

Dear Dr. Omri,

We want to thank you and your group of reviewers for reviewing our manuscript “Whole blood RNA-Seq demonstrates an increased host immune response in individuals with cystic fibrosis who develop nontuberculous mycobacterial pulmonary disease” and providing meaningful feedback to improve it. We sought to address all the recommendations and provide clarifications when necessary. First, we addressed the journal requirements and then proceeded to respond to the comments posed by reviewer #1; the relevant changes associated to the reviewer comment are specified in the revised version of the manuscript without track changes.

Journal Requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Response: We have reviewed the manuscript document and made the necessary corrections to the format and content to closely follow the requirements of PLOS ONE. The filenames of all documents, including figures and supplementary figures, have also been reviewed and adjusted according to the recommendations of the journal.

2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability - loc-unacceptable-data-access-restrictions.

Response: Thank you for highlighting this important point, we have corrected the data availability statement to accurately describe that the data is available for review during the peer review process and will be made available without restrictions once the manuscript finishes its publication cycle.

The scripts necessary to reproduce the results, as well as the complete metadata of the samples have been deposited in the Dryad Digital Repository (https://doi.org/10.5061/dryad.np5hqbzx2) and reviewers can access it using the link below.

https://datadryad.org/stash/share/FZ1DbrVy1KzveU5UQ_UbjqG-IrfFSouFSpspXhdnghg

Also, the raw sequencing reads and the curated reads are available in the NCBI Gene Expression Omnibus (GEO) and Sequence Read Archive (SRA) platforms with the accession code GSE205161. Reviewers can access the data using the authorized reviewer token kxsteiucbhebbkp.

We apologize for the misunderstanding about data availability and reiterate our commitment to making the results fully reproducible and the datasets freely available using these repositories.

3. In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability - loc-recommended-repositories.

c) We will update your Data Availability statement on your behalf to reflect the information you provide.

Response: There are no restrictions to the use of the data and the metadata contains no sensitive information or direct identifiers of participants. The datasets also had linkable data including the date of birth, postal code, and city of residence removed before upload. The sequencing data is available in the NCBI GEO and SRA platforms with the accession code GSE205161 and the reviewers can access it using the token kxsteiucbhebbkp. The scripts to reproduce the results starting from the raw reads as well as the contextual metadata (clinical and demographic variables) are available in the Dryad Digital Repository(https://datadryad.org/stash/share/FZ1DbrVy1KzveU5UQ_UbjqG-IrfFSouFSpspXhdnghg).

Reviewer #1 comments to the author:

This is a retrospective study evaluating whole blood gene expression using bulk RNA-seq in a cohort of cystic fibrosis patients with samples collected closest in timing to the first isolation of nontuberculous mycobacteria. The study population included patients who did (N=12) and did not (n-30) develop NTM-PD following the first nontuberculous mycobacteria growth.

1. It is not stated in the inclusion/exclusion criteria that people on CFTR modulator therapy were excluded from the analysis. However, this is mentioned in the discussion. Please add this to the exclusion criteria. Interestingly, the majority of patients were eligible for E/T/I therapy based on having one or more copies of F508del. I presume the timing of data abstraction preceded approval for E/T/I in Canada. Clarification of these details would be helpful, given most people meeting eligibility in the US (and some other locations) are likely on highly effective modulator therapy, skewing the presented results to a population that is not representative of most of North America. Please include some discussion of this issue in the manuscript.

Response: Thank you for highlighting the relevance of using CFTR modulator therapy and its possible impact on our cohort. The use of CFTR modulator therapy was not an exclusion criterion for our study but given the limited public reimbursement for CFTR modulators in Canada, only four participants (including one who developed NTM-PD) were receiving CFTR modulator therapy (1 on ivacaftor, 2 on lumacaftor/ivacaftor, and 1 on tezacaftor/ivacaftor) when the blood sample was taken. ETI was not approved in Canada until June 2021 and the first patient was reimbursed in September 2022.

We have clarified the reasons behind the low use of CFTR modulators and expanded upon the implications of our results in terms of the limited generalizability to a CFTR modulator treated population (Page 19, lines 320 to 330)

2. Please better define the patients who did not develop NTM-PD. What percentage were intermittently colonized (NTM not recovered from every AFB culture, but rather intermittently) vs chronically colonized (NTM recovery with every AFB culture) vs spontaneously cleared infection (transient or chronic colonization, but clear after one year of negative cultures)?

Response: We decided to divide our population into those who developed or did not NTM-PD as it represents a similar classification to the one observed in the non-CF population. Among the 30 patients who did not develop NTM-PD, 22 can be classified as spontaneous clearance (12 of them cleared after a single positive culture and 10 after multiple positive cultures), 3 had chronic colonization with NTM isolated in a majority of their respiratory cultures and 5 had intermittent colonization. The classification of these patients has been added to the manuscript (Results section, lines 170 to 173) and is available in the clinical and demographic dataset uploaded to the Dryad Digital Repository.

3. Please define the data abstraction timeframe

Response: Thank you for this suggestion, we have now added the timeframe of data collection from the clinical charts. For each participant, we collected data starting from two years before the time of the first isolation (the earliest date was May 2001) and until September 2022 for follow-up of NTM status (disease or not). We have now added this important information in the Methods section, lines 104 to 107 and Results section, lines 168 and 169.

4. What was the total length of time that patients were followed to determine if they met the criteria for NTM-PD? Some patients will be intermittently positive for years before conversion to NTM disease. Did some patients that were initially without NTM-PD found to have PD after data analysis was completed?

Response: Thank you for the appropriate observation. The follow-up time of patients was variable and went from the time of the first positive culture for Mycobacteria spp. until the patient developed NTM-PD, died, or had a lung transplant (lines 107 to 109 of the Methods section). The shortest follow-up periods were seen in patients who developed NTM-PD quickly after the first isolation of an NTM. Patients who did not develop NTM-PD were followed for a minimum of 20.7 months after the first isolation of NTM (Table 1 on Page 11 and lines 173 to 176 on the Results section).

To make sure that our participants who did not develop NTM-PD up until December 2020 remained disease free, we went back to the clinical charts and reviewed their clinical status between December 2020 (when the data abstraction was initially finalized) and September 2022. None of these patients progressed to NTM-PD in the additional follow-up period. We have included this information in the Methods section, lines 104 to 107.

5. Per the manuscript, blood sampling was not limited to a specified timeframe in relation to NTM infection, but closest to the first positive growth. Presumptively these blood samples were collected exclusively after the first NTM recovery, but that is not explicitly stated. Please clarify. Additionally, 6/12 (50%) patients were sampled after initiation of NTM-specific therapy. Additional details surrounding the timing would be helpful.

Response: Our study was a secondary analysis of an ongoing cohort, and we could not control the timing of sampling around the first isolation of an NTM. Thus, we decided to take the blood sample available closest (before or after) to the time when the participant first had a positive NTM culture. As we hypothesized that an inherently decreased expression of immune response genes was associated with the development of NTM-PD, we did not restrict the sampling window. We clarified this additional information in lines 192 to 194 on page 10 of the Results section.

As mentioned by the reviewer, half of the patients with NTM-PD had a sample taken after starting specific antimycobacterial therapy. These patients were sampled between 2 and 79 months after starting treatment due to limitations in blood sample availability in our repository. We suspected that patients who developed NTM-PD would have a similar gene expression signature to the one seen in the non-CF population, representing intrinsic defects in immunity. Thus, we did not control for the variability in sampling time. We conducted a sensitivity analysis excluding the samples taken after the diagnosis of NTM-PD and found a similar pattern of gene set enrichment (S2 Table; Discussion section, lines 317 to 320).

These limitations in sampling and their possible impact on our results are acknowledged in the Discussion section (lines 302 to 314). We also disclose the need for prospective studies with a more systematic sampling scheme to accurately establish the temporal relationship between infection and the observed patterns of gene expression.

6. In your analysis were there any differences between M. abscessus or M. avium patients? The n was very small, so statistical differences may not be observed. However, were trends observed? Please better define these data.

Response: We thank you for the comment and agree that CF patients affected with either Mycobacterium abscessus (MABs) or Mycobacterium avium (MAC) complexes have different characteristics. In our study, we did not separate by type of mycobacteria due to the limited sample size. However, we do observe that only those infected with either MAC or MABs in our cohort progressed to NTM-PD. Furthermore, we explored clinical and demographic differences among the participants infected with MAC (n=21) or MABs (n=12). No statistically significant differences were seen in age at baseline, baseline ppFEV1, body mass index, biological sex, genotype, or co-morbidities (diabetes or pancreatic insufficiency). We have outlined this information in the Results section (lines 190 to 192 and S1 Table).

7. Table 1. Please define “follow up time”? Is this the total time a patient was followed post first + NTM culture? Reference is missing in lines 270-273.

Response: We have now clarified what “follow-up time” means in the footnote of Table 1 (page 11). Also, we have added the missing reference in line 288 (previously lines 270-273) of the Discussion.

8. Have you compared bulk RNA-seq prior to first positive NTM isolate compared to the two groups?

Response: Thank you for your comment. As part of our quality control, we have now performed a Principal Component Analysis (PCA) separating the normalized counts according to the timing of the blood sample procurement (used for RNAseq) into three groups: before culture, after culture without NTM-PD progression, and after culture with NTM-PD progression. The representation of this PCA can be found in S3 Fig. This plot shows no graphical separation of clusters in the first and second dimensions of the PCA. To further validate this finding, we evaluated if the distance matrix of the PCA had divergence according to the window between the first NTM isolation and blood sample collection. No statistically significant differences were found among the three groups.

We have added this information to lines 213 to 217 of the Results section and the PCA plot can be seen in a newly cited S3 Fig.

Attachment

Submitted filename: 20221111-reviewers_response.docx

Decision Letter 1

Abdelwahab Omri

15 Nov 2022

Whole blood RNA-Seq demonstrates an increased host immune response in individuals with cystic fibrosis who develop nontuberculous mycobacterial pulmonary disease

PONE-D-22-24141R1

Dear Dr. Miguel Darío Prieto,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter, and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Abdelwahab Omri, Pharm B, Ph.D, Laurentian University

Academic Editor

PLOS ONE

Acceptance letter

Abdelwahab Omri

1 Dec 2022

PONE-D-22-24141R1

Whole blood RNA-Seq demonstrates an increased host immune response in individuals with cystic fibrosis who develop nontuberculous mycobacterial pulmonary disease

Dear Dr. Prieto:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Abdelwahab Omri

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Demographic and clinical characteristics of patients infected with MAC or MABs at the time of first positive NTM culture.

    (DOCX)

    S2 Table. Gene set enrichment analysis of whole blood gene expression in those who did vs. did not develop NTM-PD, excluding 6 samples taken after diagnosis of NTM-PD.

    (DOCX)

    S1 Fig. Absolute whole blood cell population counts in samples used for RNAseq.

    Boxplots with the distribution of absolute values for leukocytes, neutrophils, lymphocytes, and monocytes in complete blood counts at the time of blood sample procurement. No significant differences using the unpaired Wilcoxon rank-sum test.

    (TIF)

    S2 Fig. Assignment of reads to genomic regions produced by Picard tools.

    Coding: protein-coding region. UTR: untranscribed region. Intronic: intronic region. Intergenic: intergenic region. Ribosomal: mapping to ribosomal RNA or proteins. PF not aligned: bases that passed the quality filter and were not aligned.

    (TIF)

    S3 Fig. Principal component analysis of normalized and batch corrected count data.

    Separated by the timing of blood sample procurement for RNAseq into three categories: before first positive NTM culture, after first positive NTM culture without progression to NTM-PD, and after first positive NTM culture with progression to NTM-PD. No apparent separation is seen graphically. A PermANOVA test shows no significant differences in the distance matrices by group (p = 0.29).

    (TIFF)

    S4 Fig. CibersortX deconvolution of cell percentages compared to ground truth reference values in CBC.

    (TIF)

    S5 Fig. Mean expression values per NTM-PD group in top differentially expressed genes in Cowman analysis.

    The p-values represent the results of independent Wilcoxon tests. The feature TRGC1 (T Cell Receptor Gamma Constant 1) was a duplicated probe in the initial analysis, while the uncharacterized locus FLJ45825 was not detected in our dataset. PMS2P1: PMS1 homolog 2, Mismatch Repair System Component Pseudogene 1. CRTAM: Cytotoxic and regulatory T cell Molecule. IFNG: Interferon Gamma. GZMK: Granzyme K. PZP: PZP alpha-2-macroglobulin like. XCL2: X-C motif chemokine ligand 2. AK5: Adenylate kinase 5. FCRL3: Fc receptor Like 3. PPIH: Peptidylprolyl isomerase H. A2M: Alpha-2-macroglobulin. TIGIT: T cell immunoreceptor with Ig and ITIM domains. MUC12: Mucin 12, cell surface associated.

    (TIF)

    Attachment

    Submitted filename: 20221111-reviewers_response.docx

    Data Availability Statement

    The RNA sequencing data, raw reads, and normalized counts were deposited in the Gene Expression Omnibus repository of NCBI (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE205161, access token: kxsteiucbhebbkp). The dataset will be publicly available without restriction once we finish the publication process. The scripts for analysis of clinical, demographic, and genomic data have been deposited with the rest of the necessary data sets in dryad (https://datadryad.org/stash/share/FZ1DbrVy1KzveU5UQ_UbjqG-IrfFSouFSpspXhdnghg). While completing the submission process, the data is accessible by request of the reviewers. Once published, it will be made available without restrictions.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES