ABSTRACT
The microbial metagenome in cystic fibrosis (CF) airways was investigated by whole-genome shotgun sequencing of total DNA isolated from nasal lavage samples, oropharyngeal swabs, and induced sputum samples collected from 65 individuals with CF aged 7 to 50 years. Each patient harbored a personalized microbial metagenome unique in microbial load and composition, the exception being monocultures of the most common CF pathogens Staphylococcus aureus and Pseudomonas aeruginosa from patients with advanced lung disease. The sampling of the upper airways by nasal lavage uncovered the fungus Malassezia restricta and the bacterium Staphylococcus epidermidis as prominent species. Healthy and CF donors harbored qualitatively and quantitatively different spectra of commensal bacteria in their sputa, even in the absence of any typical CF pathogen. If P. aeruginosa, S. aureus, or Stenotrophomonas maltophilia belonged to the trio of the most abundant species in the CF sputum metagenome, common inhabitants of the respiratory tract of healthy subjects, i.e., Eubacterium sulci, Fusobacterium periodonticum, and Neisseria subflava, were present only in low numbers or not detectable. Random forest analysis identified the numerical ecological parameters of the bacterial community, such as Shannon and Simpson diversity, as the key parameters that globally distinguish sputum samples from CF and healthy donors.
IMPORTANCE Cystic fibrosis (CF) is the most common life-limiting monogenetic disease in European populations and is caused by mutations in the CFTR gene. Chronic airway infections with opportunistic pathogens are the major morbidity that determines prognosis and quality of life in most people with CF. We examined the composition of the microbial communities of the oral cavity and upper and lower airways in CF patients across all age groups. From early on, the spectrum of commensals is different in health and CF. Later on, when the common CF pathogens take up residence in the lungs, we observed differential modes of depletion of the commensal microbiota in the presence of S. aureus, P. aeruginosa, S. maltophilia, or combinations thereof. It remains to be seen whether the implementation of lifelong CFTR (cystic fibrosis transmembrane conductance regulator) modulation will change the temporal evolution of the CF airway metagenome.
KEYWORDS: airways, cystic fibrosis, metagenomics
INTRODUCTION
Cystic fibrosis (CF) is a severe monogenic trait with autosomal recessive inheritance that is caused by mutations in the CFTR gene (1). The basic defect of impaired chloride and bicarbonate secretion predisposes the individual to chronic airway infections (2). The CF lungs typically harbor polymicrobial communities with dozens of bacterial species, viruses, and fungi, with Staphylococcus aureus and Pseudomonas aeruginosa being the most prevalent hallmark pathogens (3–10).
The composition of the bacterial communities in CF airways has so far mainly been studied by 16S rRNA gene amplicon sequencing (5–8, 10–32). Alternatively, whole-genome shotgun (WGS) metagenome sequencing can be performed (33), where host and microbial DNA is extracted and sequenced from biological specimens with no or low amplification. Microbial communities are then determined and defined via reference-based alignment and analysis of the sequenced nonhost DNA. WGS metagenomics is “taxonomically agnostic” and can identify fungi, archaea, eubacteria, and DNA viruses, if proper reference genome sequences are available for the alignment.
The focus of past microbial metagenome studies investigating the lower CF airway has been on small cohorts of CF adolescents and adults who naturally expectorate relatively large volumes of respiratory secretions (34–48). We have optimized our protocols so that deep microbial metagenome sequencing has become feasible with samples retrieved from all age groups (3, 49, 50). This achievement laid the foundation to conduct a representative survey of the airway microbiome of the current CF patient population across all age groups.
In this study, we compared the microbial metagenome of the oral cavity, the sinonasal compartment, and lower airway. The study revealed that the CF host and the corresponding disease status were the major determinants of the quantitative taxonomic profile of the CF airway metagenome. Each CF patient over the age of 6 years harbored a personalized microbial metagenome, which was unique in microbial load and composition. Notable exceptions to this observation were monocultures of the most common CF pathogens, S. aureus and P. aeruginosa, in patients with advanced lung disease. The host-driven individual signature was shared by the oral cavity and lower airway.
RESULTS
Patients.
Nasal lavage samples, oropharyngeal swabs, and induced sputa (Fig. S1) were collected on one to seven occasions within a period of up to 25 months from 45 exocrine pancreatic-insufficient (PI) and 20 exocrine pancreatic-sufficient (PS) individuals with CF (Table 1; also, see Table S1 in the supplemental material). PI patients carry two severe disease-causing CFTR mutations in the CFTR gene that confer no or only minimal CFTR (cystic fibrosis transmembrane conductance regulator) function, whereas PS patients harbor at least one CFTR allele that confers residual CFTR activity in the range of 10 to 20% of the wild-type level and is typically associated with a milder course of CF disease (1, 51). The distribution of lung function and disease centiles of the study cohort matched with the age- and gender-corrected centile distribution of the patient registry of the European Cystic Fibrosis Society (52) (Fig. 1), implying that our study cohort is representative of the disease severity of the current CF patient population. The patient population consisted of 34 children and adolescents (6 to 17 years) and of 31 adults (18 to 50 years) (Data Sets S1 and S2). Induced sputum samples and oropharyngeal swabs were also collected from 157 schoolchildren, adolescents, and adults who, like the CF patient cohort, were living in Central Lower Saxony in Germany (Table S2).
TABLE 1.
Characteristic | Value for group |
||
---|---|---|---|
All (n = 65) | PI (n = 45) | PS (n = 20) | |
No. (%) of females | 31 (48) | 21 (47) | 10 (50) |
Mean age (SD) (yrs) | 21.4 (10.3) | 22.1 (10.9) | 19.6 (8.6) |
No. (%) in age group | |||
7–17 yrs | 34 (52) | 22 (49) | 12 (60) |
18–27 yrs | 13 (20) | 8 (18) | 5 (25) |
28–50 yrs | 18 (28) | 15 (33) | 3 (15) |
FEV1 centilea | 44 (27–75; 3–99) | 32 (15–68; 3–99) | 50 (39–76; 13–99) |
BMI centilea | 48 (26–70; 6–99) | 40 (21–73; 7–99) | 50 (30–61; 6–91) |
Disease centilea | 52 (28–71; 9–94) | 42 (24–72; 9–94) | 60 (34–71; 15–84) |
Median (inner quartile; range).
Microbial metagenome of nasal lavage samples, oropharyngeal swabs, and induced sputum of the CF cohort.
Genomic DNA isolated from nasal lavage samples, oropharyngeal swabs, and induced sputum was subjected to high-throughput shotgun metagenome sequencing. After processing, trimming, and quality filtering, the primary reads were mapped onto the human reference genome and a microbial pangenome of completely sequenced genomes (see Materials and Methods) (Fig. 2). The normalized read numbers were deposited in the local database (Data Sets S1 and S2) representing the number of reads per million base pairs of reference genome per million reads (RPMM) of species and higher taxonomic ranks. Selected samples were reinvestigated by quantitative PCR (qPCR).
The microbial metagenome was dominated by bacteria, which on average made up more than 99% of microbial reads (Data Sets S1 and S2). The sampling of the upper airways by nasal lavage uncovered two prominent bacterial species, i.e., predominantly Staphylococcus epidermidis and to a lesser extent Cutibacterium (Propionibacterium) acnes. In two of three patient samples, they accounted for 10% to more than 90% of the microbial community (Fig. 3 and 4A). S. epidermidis and C. acnes are common members of the skin flora (53), but are not typical inhabitants of healthy nasal conchae. In CF, however, the inherited basic defect predisposes to rhinosinusitis, inflamed nasal passages, or a stuffy nose (1, 54, 55) which apparently triggers the upper airway colonization with these skin commensals. The representatives of a healthy nasal microbiome (56, 57), including Streptococcus, Prevotella, Veillonella, Rothia, and Neisseria species, were the major constituents of diversity, each having a share of a few percent. The typical CF pathogens S. aureus and P. aeruginosa were present in the majority of nasal lavage samples in similar quantities as C. acnes. In most samples, they were only minor members of the community. Only in one-fifth of samples was the relative abundance higher than 10%.
Oropharyngeal swabs recovered a similar pattern of commensal Firmicutes, Fusobacteria, Bacteroidetes, and Actinobacteria species from healthy controls and PI and PS patients (Fig. 3 and 4B). The typical CF pathogens S. aureus, P. aeruginosa, and Haemophilus influenzae were present in relatively low numbers, albeit more abundant and more prevalent in PI patients’ than PS patients’ samples.
This difference between PI and PS CF was even more pronounced in induced sputum, which has been demonstrated to be an appropriate proxy for the respiratory secretions that line the conducting lower airway (31, 58). The metagenome of the PI CF cohort was dominated by S. aureus and P. aeruginosa, consistent with our common knowledge of CF microbiology derived from culture-dependent diagnostics. The PS CF lower-airway bacteriome was between those of PI CF patients and healthy controls (Fig. 3 and 4C).
Only 0.1% of the DNA reads per sample were on average assigned to fungi and molds. For technical reasons, the contribution of the mycobiome to the CF airway metagenome was most likely underestimated, because, first, our DNA extraction protocol was not adapted to the lysis of rigid fungal cell walls and, second, a representative reference mycobiome could not be compiled from the databases due to the continuing paucity of completely sequenced fungal genomes. Applying the tool EukDetect (59), we identified 10 fungal species altogether in 36 sputa from 3 of 20 PS patients and from 14 of 45 PI patients (Table 2). Consistent with the CF literature, Aspergillus fumigatus and Exophilia dermatitis were most often detected. More than one fungus was detected in samples from six patients. Numerous nasal lavage samples contained the sinus-associated fungi Malassezia restricta and Malassezia globosa (Table 2) (60).
TABLE 2.
Sample and fungal species | No. of: |
|
---|---|---|
CF donors | Samples | |
Nasal lavage | ||
Bradysia coprophila | 1 | 1 |
Cyberlindnera jadinii | 1 | 1 |
Malassezia globosa | 2 | 2 |
Malassezia restricta | 22 | 22 |
Induced sputum | ||
Aspergillus fumigatus | 5 | 6 |
Candida albicans | 4 | 6 |
Candida dubliniensis | 1 | 1 |
Candida glabrata | 2 | 3 |
Clavispora lusitaniae | 2 | 2 |
Cyberlindnera jadinii | 1 | 1 |
Exophiala dermatitidis | 5 | 7 |
Saccharomyces cerevisiae | 3 | 3 |
Scedosporium apiospermum | 1 | 3 |
Scedosporium boydii | 2 | 2 |
Beta diversity.
We performed nonmetric multidimensional scaling to assess the effect of sample type (nasal lavage samples, oropharyngeal swabs, and induced sputa) and clinical variables on the microbial respiratory tract community structure. Therefore, we fitted metadata onto the ordination axes, including age group, gender, pancreatic state, type and donor of sample, lung function (FEV1 [forced expiratory volume in the first second] percentile), disease and BMI percentile. Neither gender nor pancreatic state had a significant effect on the microbial community structures (Table 3). Conversely, lung function was a discriminatory parameter for the ordination (Table 3). Moreover, the individual CF host shaped the microbial community structure. Samples that had been repetitively collected from the same patient over a period of up to 2 years could be discerned from other donors’ samples and clustered together (Table 3; Fig. 5A). In other words, longitudinal intrapatient were more similar than cross-sectional interpatient samples, even across ecological niches in the CF airway habitat (Fig. 5A). This personalized metagenome signature was detected in oropharyngeal swabs and induced sputa (Fig. 5B), confirming previous reports that the CF airway should be considered an open habitat with extensive exchange between upper and lower airways (55). However, the personalized signature in terms of Bray-Curtis dissimilarity indices was not observed in metagenome samples of the nasal conchae (Fig. 5B).
TABLE 3.
Parameter | Goodness of fit |
|
---|---|---|
r 2 | P c | |
Sample type (nasal lavage, oropharyngeal swab, induced sputum) | 0.14 | <0.01** |
Pancreatic state | 0.03 | >0.05 |
Age group (7–17, 18–27, 28–50 yrs) | 0.05 | >0.05 |
Gender | 0.01 | >0.05 |
Serial samples from the same donor | 0.75 | <0.01** |
FEV1 centile | 0.22 | <0.01** |
Disease centile | 0.19 | <0.01** |
BMI centile | 0.11 | <0.05* |
R, metaMDS, k = 3, no autotransform.
A permutation test (R envfit, number of permutations = 999) was used for plotting metadata variables onto the ordination and obtaining their significance with Benjamini-Hochberg adjustment of P values. The stress value of the nonmetric multidimensional scaling ordination (<0.2) indicated a good representation in reduced dimensions.
*, P < 0.05; **, P < 0.01.
Microbial load and disease severity (Fig. 6).
Next, we allocated the individual bacterial metagenome data sets of nasal lavage samples (Fig. 6A and B), oropharyngeal swabs (Fig. 6C and D), and induced sputum (Fig. 6E and F) to the patient’s age and disease centile differentiated by PI (Fig. 6A, C, and E) and PS (Fig. 6B, D, and F) status. Size and color in Fig. 6 visualize the bacterial load and the relative contribution of the three dominant phyla, Actinobacteria, Firmicutes, and Proteobacteria. Each metagenome was unique, as indicated by singular combinations of size and color, the exception being the monocultures of S. aureus and P. aeruginosa (Data Sets S1 and S2). Firmicutes were the dominant bacteria in most CF metagenomes of nasal lavage samples, oropharyngeal swabs, and induced sputum. Since shotgun sequencing of all extracted DNA had been performed without depleting the intracellular human DNA, the load of bacterial cells per human cell could be estimated (3). The human genome is approximately about 1,000-fold larger than the average bacterial genome; hence, a contribution of 1% of bacterial reads of total corresponds to about 10 bacterial cells per human cell in a sample (3). Bacterial load in the samples varied by more than 3 orders of magnitude, between 0.1 and 400 bacterial cells per human cell, and increased in the order nasal lavage samples < oropharyngeal swabs < sputum. In contrast to our expectation but consistent with previous reports (13, 61), no association between microbial load and disease severity was observed.
The lower-airway bacterial community network.
The composition of the patient’s bacterial metagenome, which is visualized in Fig. 6 by its individual color, was resolved in Fig. 4 at the level of the 30 dominant bacterial species. The spectrum of induced sputa as a surrogate for the lower conducting airway ranged from diverse communities of commensals to monocultures of the CF lead pathogen P. aeruginosa. To differentiate the diversity of the lower CF airway bacterial community at higher resolution, subsamples with or without H. influenzae, S. aureus, P. aeruginosa, and Stenotrophomonas maltophilia were compared with the benchmark of a cumulative metagenome of 157 induced sputa from healthy subjects aged 10 to 59 years living in the same geographic region as the CF cohort (Table 4; Fig. 7).
TABLE 4.
Speciesa | Depletion in presence of dominant CF pathogen(s)b |
||||
---|---|---|---|---|---|
SA | PA | SM | PA and SM | PA and SA | |
Eubacterium sulci | ↓ | ↓ | ↓ | ↓ | ↓ |
Fusobacterium periodonticum | ↓ | ↓ | ↓ | ↓ | ↓ |
Neisseria subflava | ↓ | ↓ | ↓ | ↓ | ↓ |
Rothia aeria | ↓ | ↓ | ↓ | ||
Streptococcus pneumoniae | ↓ | ↓ | ↓ | ||
Streptococcus pseudopneumoniae | ↓ | ↓ | ↓ | ||
Atopobium parvulum | ↓ | ↓ | ↓ | ||
Gemella spp. | ↓ | ↓ | ↓ | ↓ | |
Prevotella spp. | ↓ | ↓ | |||
Veillonella spp. | ↓ | ↓ | |||
Streptococcus sanguinis | ↓ | ||||
Streptococcus cristatus | ↓ |
Only species with consistent patterns in all subgroup samples are included.
SA, S. aureus; PA, P. aeruginosa; SM, S. maltophilia.
Figure 7 displays the absolute abundance of the 50 most common bacterial species in the respective CF subgroup at a logarithmic scale. As expected for a progressive lung disease, all sputa devoid of any typical CF pathogen were collected from patients in the youngest age group. Typical members of the oral microbiome were detected, but the individual patterns of commensals differed from that of the average healthy control (Fig. 7A). The other extreme of no or very few commensal bacteria was seen in a few sputum metagenomes which were dominated by either P. aeruginosa and/or S. aureus or Stenotrophomonas maltophilia. If any of these three pathogens belonged to the trio of the most abundant species in the CF sputum metagenome, common inhabitants of the upper respiratory tract present in all samples of healthy subjects—i.e., Eubacterium sulci, Fusobacterium periodonticum, and Neisseria subflava—were present only in low numbers or not detectable in the CF sputa. Likewise, the presence of S. aureus or P. aeruginosa as the dominant taxon was associated with the suppression of Rothia aeria and Haemophilus parainfluenzae.
Dominance of S. aureus was moreover associated with the reduction of the Streptococcus mitis group, including Streptococcus pneumoniae and Streptococcus pseudopneumoniae, and dominance of P. aeruginosa accompanied suppression of the genera Gemella, Prevotella, and Veillonella. These differential modes of depletion of commensals by CF pathogens are qualitatively summarized in Table 4 and shown by sample in Fig. 7B and Data Sets S1 and S2. We moreover noted that microbial communities cocolonized by S. aureus and P. aeruginosa typically had one species as the prime and the other one as only a minor member (Fig. 7B).
Next, we applied machine learning by the random forest method (62) to elucidate the key parameters among 294 microbial and host metadata that globally discern samples from CF and healthy donors (Fig. 8). The numerical ecological parameters of the bacterial community (Shannon diversity, Simpson diversity, Pielou’s evenness indices, and species number) were identified as the major classifiers of sputum specimens followed by a large number of bacterial species (Fig. 8). Conversely, as indicated by large out-of-bag errors, none of the host and microbial input variables reliably differentiated samples from PI and PS patients and oropharyngeal swabs from healthy and CF subjects from each other.
Inflammation and microbiota in CF sputum.
Next, we sought associations between the patient’s lung function, inflammation and bacterial metagenome. Sputum specimens utilized for metagenome sequencing were examined for inflammatory mediators that are known to be diagnostic for the intensity of inflammation of CF lungs (63). Sputum levels were determined for the proinflammatory cytokines interleukins 1β, 6 and 8, the neutrophilic enzymes myeloperoxidase and elastase, matrix metalloproteinase 9, and metallopeptidase inhibitor 1, involved in the reorganization of the extracellular matrix (Data Set S3). According to principal-component analysis, the CF host’s lung function explained about 10% and the levels of the selected inflammation markers explained about 18% of the variance of the data sets within the groups of PI and PS CF patients (Tables 5 and 6). The remaining variance of the components was assigned to the abundance of the tested bacterial species.
TABLE 5.
Parameter | Contribution (%) of observation in dimension |
||||||||
---|---|---|---|---|---|---|---|---|---|
F1 | F2 | F3 | F4 | F5 | F6 | F7 | F8 | F9 | |
Streptococcus mitis | 9.1 | 0.9 | 1.4 | 2.6 | 0.9 | 0.5 | 0.3 | 0.5 | 0.4 |
Streptococcus gordonii | 8.0 | 0.0 | 0.0 | 0.1 | 0.1 | 1.8 | 1.0 | 4.8 | 0.1 |
Neisseria meningitidis | 4.6 | 2.4 | 1.7 | 0.4 | 0.0 | 0.2 | 2.6 | 0.1 | 2.6 |
Haemophilus parainfluenzae | 5.1 | 2.3 | 1.7 | 0.2 | 0.0 | 0.1 | 5.4 | 0.2 | 3.2 |
Rothia mucilaginosa | 3.4 | 0.5 | 0.1 | 0.2 | 0.0 | 7.3 | 5.6 | 1.3 | 0.5 |
NE | 4.0 | 1.8 | 2.7 | 3.0 | 1.7 | 4.2 | 1.1 | 0.4 | 0.9 |
Streptococcus oralis | 7.0 | 0.1 | 1.8 | 3.5 | 0.0 | 1.1 | 1.1 | 0.4 | 0.1 |
Streptococcus pseudopneumoniae | 8.3 | 1.0 | 1.3 | 3.4 | 1.1 | 1.3 | 0.1 | 0.5 | 0.1 |
Streptococcus pneumoniae | 7.7 | 0.8 | 1.3 | 4.2 | 1.1 | 1.6 | 0.0 | 0.7 | 0.0 |
Staphylococcus epidermidis | 2.8 | 0.2 | 0.5 | 5.6 | 7.4 | 0.0 | 0.7 | 1.1 | 0.1 |
Cutibacterium acnes | 2.9 | 1.2 | 0.0 | 0.1 | 9.2 | 0.4 | 2.3 | 1.9 | 1.8 |
Streptococcus salivarius | 2.8 | 5.8 | 0.0 | 6.2 | 2.5 | 3.8 | 1.8 | 0.1 | 0.1 |
Actinomyces oris | 2.5 | 5.1 | 1.8 | 1.6 | 0.3 | 3.1 | 5.4 | 0.9 | 0.8 |
Actinomyces meyeri | 2.2 | 4.5 | 2.2 | 2.0 | 0.1 | 4.0 | 6.4 | 0.3 | 0.7 |
Streptococcus parasanguinis | 3.3 | 8.6 | 0.2 | 2.8 | 0.7 | 0.7 | 0.5 | 0.7 | 0.5 |
TIMP1 | 0.3 | 12.2 | 0.0 | 2.2 | 0.0 | 5.7 | 1.8 | 1.0 | 0.3 |
Streptococcus sanguinis | 1.0 | 11.2 | 0.3 | 0.8 | 0.0 | 5.3 | 2.8 | 1.0 | 0.0 |
FEV1 | 0.7 | 9.3 | 0.1 | 0.5 | 0.3 | 4.7 | 5.5 | 2.8 | 3.2 |
FVC | 0.6 | 7.5 | 0.6 | 4.3 | 0.2 | 1.3 | 3.1 | 0.5 | 2.1 |
Stenotrophomonas maltophilia | 1.5 | 3.3 | 0.0 | 10.7 | 1.8 | 1.4 | 0.0 | 8.2 | 0.1 |
Age | 0.6 | 7.6 | 0.6 | 8.1 | 0.0 | 0.4 | 0.1 | 1.5 | 1.1 |
Haemophilus influenzae | 0.9 | 0.2 | 8.2 | 0.3 | 1.5 | 0.0 | 0.0 | 0.0 | 0.0 |
Streptococcus thermophilus | 2.2 | 2.1 | 6.5 | 0.1 | 1.1 | 8.1 | 0.2 | 2.3 | 1.4 |
Rothia dentocariosa | 1.2 | 0.6 | 10.4 | 3.9 | 0.6 | 5.4 | 6.9 | 0.6 | 0.8 |
MMP9 | 1.1 | 0.7 | 9.9 | 3.1 | 0.8 | 4.9 | 5.8 | 0.9 | 1.9 |
IL-1β | 1.1 | 0.7 | 11.5 | 2.5 | 0.7 | 6.2 | 7.1 | 0.6 | 0.3 |
Streptococcus anginosus | 0.0 | 0.4 | 8.1 | 0.1 | 0.5 | 7.1 | 8.5 | 0.1 | 0.7 |
MPO | 1.1 | 0.6 | 3.8 | 0.1 | 0.0 | 3.0 | 5.4 | 12.3 | 0.2 |
Pseudomonas aeruginosa | 1.2 | 0.6 | 9.6 | 0.2 | 0.4 | 2.8 | 0.2 | 7.0 | 0.1 |
Fusobacterium nucleatum | 0.7 | 0.6 | 2.6 | 2.4 | 7.9 | 2.2 | 0.5 | 6.6 | 1.4 |
Veillonella parvula | 0.1 | 0.8 | 3.2 | 0.0 | 19.9 | 0.1 | 0.0 | 0.4 | 7.9 |
Achromobacter xylosoxidans | 1.2 | 0.7 | 0.1 | 3.4 | 7.0 | 4.2 | 0.1 | 0.4 | 1.0 |
Atopobium parvulum | 1.7 | 2.6 | 1.8 | 4.8 | 7.3 | 1.7 | 0.6 | 3.7 | 1.9 |
IL-6 | 0.4 | 0.0 | 0.5 | 1.8 | 5.4 | 0.5 | 1.7 | 2.6 | 20.5 |
MEF25 | 0.6 | 1.9 | 0.0 | 0.1 | 2.9 | 0.8 | 0.6 | 10.2 | 2.6 |
Staphylococcus aureus | 1.6 | 0.8 | 0.6 | 2.7 | 1.1 | 0.7 | 2.0 | 16.9 | 1.0 |
IL-8 | 0.3 | 0.0 | 0.0 | 0.1 | 0.9 | 0.6 | 4.4 | 1.6 | 31.6 |
Bifidobacterium longum | 0.8 | 0.0 | 1.7 | 0.0 | 0.5 | 0.3 | 0.2 | 0.1 | 6.4 |
Prevotella melaninogenica | 0.2 | 0.3 | 0.3 | 0.5 | 0.8 | 1.1 | 3.1 | 0.7 | 0.1 |
Variability (%)b | 18.7 | 13.0 | 8.6 | 8.5 | 6.8 | 6.1 | 5.9 | 5.4 | 4.3 |
Cumulative (%)c | 18.7 | 31.8 | 40.4 | 48.9 | 55.7 | 61.8 | 67.8 | 73.2 | 77.5 |
The selected clinical parameters are age and the lung function parameters FEV1, FVC, and MEF25. The selected host defense molecules are IL-1β, IL-6, IL-8, neutrophilic elastase (NE), myeloperoxidase (MPO), metalloproteinase 9 (MMP9), and metallopeptidase inhibitor 1 (TIMP1).
Explained variance by the respective principal component.
Sequential sum of explained variance sorted in ascending order.
TABLE 6.
Parameter | Contribution (%) of observation in dimension |
||||
---|---|---|---|---|---|
F1 | F2 | F3 | F4 | F5 | |
Actinomyces meyeri | 0.3 | 0.2 | 1.7 | 18.1 | 0.2 |
IL-8 | 2.6 | 2.2 | 0.3 | 8.2 | 0.2 |
Tannerella forsythia | 2.5 | 1.4 | 0.4 | 9.2 | 1.7 |
Staphylococcus aureus | 0.7 | 0.0 | 1.7 | 7.9 | 3.4 |
Bifidobacterium longum | 1.7 | 0.0 | 0.0 | 5.1 | 0.0 |
Streptococcus anginosus | 0.6 | 0.3 | 0.1 | 4.8 | 1.9 |
Achromobacter xylosoxidans | 0.8 | 1.2 | 1.4 | 3.7 | 2.2 |
MPO | 0.7 | 0.0 | 9.5 | 0.2 | 1.3 |
Fusobacterium nucleatum | 2.5 | 1.1 | 3.1 | 4.2 | 0.1 |
Streptococcus pneumoniae | 0.3 | 1.0 | 4.7 | 4.1 | 4.1 |
MMP9 | 1.8 | 0.8 | 3.4 | 0.1 | 11.2 |
TIMP1 | 0.2 | 2.9 | 4.0 | 0.6 | 12.2 |
Streptococcus gordonii | 0.3 | 10.6 | 0.6 | 1.0 | 1.1 |
Pseudomonas aeruginosa | 0.2 | 8.2 | 0.9 | 3.1 | 0.0 |
Haemophilus influenzae | 0.6 | 8.0 | 0.1 | 3.0 | 0.6 |
Prevotella denticola | 0.6 | 7.0 | 0.0 | 2.7 | 0.5 |
Streptococcus sanguinis | 0.3 | 10.0 | 0.3 | 3.4 | 0.8 |
NE | 1.3 | 7.1 | 3.3 | 1.0 | 0.3 |
Streptococcus thermophilus | 5.8 | 0.3 | 4.4 | 0.1 | 0.4 |
Streptococcus salivarius | 5.5 | 0.0 | 5.7 | 0.3 | 0.2 |
Streptococcus parasanguinis | 5.5 | 0.0 | 5.7 | 0.3 | 0.2 |
Actinomyces oris | 5.5 | 0.0 | 5.8 | 0.3 | 0.2 |
Atopobium parvulum | 5.5 | 0.0 | 5.8 | 0.3 | 0.2 |
FVC | 5.4 | 0.0 | 5.8 | 0.3 | 0.2 |
MEF25 | 4.8 | 0.3 | 6.9 | 0.0 | 0.0 |
Age | 4.7 | 0.5 | 3.4 | 0.6 | 0.2 |
Streptococcus oralis | 4.7 | 0.5 | 2.9 | 0.0 | 0.0 |
Rothia mucilaginosa | 4.5 | 0.7 | 3.1 | 0.1 | 0.1 |
Prevotella melaninogenica | 4.4 | 1.2 | 1.7 | 0.0 | 0.9 |
Veillonella parvula | 4.4 | 1.3 | 0.0 | 0.0 | 10.3 |
Cutibacterium acnes | 4.1 | 2.9 | 0.3 | 0.7 | 4.3 |
Streptococcus mitis | 4.0 | 4.3 | 1.1 | 0.6 | 0.8 |
FEV1 | 5.0 | 0.9 | 1.6 | 4.6 | 0.1 |
IL-6 | 3.0 | 0.1 | 0.8 | 4.7 | 5.6 |
Rothia dentocariosa | 1.2 | 0.7 | 0.0 | 0.9 | 4.6 |
Haemophilus parainfluenzae | 1.0 | 1.9 | 0.1 | 0.0 | 19.5 |
IL-1β | 2.9 | 0.5 | 1.6 | 0.0 | 0.0 |
Variability (%)b | 28.3 | 17.8 | 14.2 | 10.3 | 9.2 |
Cumulative (%)c | 28.3 | 46.1 | 60.2 | 70.5 | 79.7 |
The selected clinical parameters are age and the lung function parameters FEV1, FVC, and MEF25. The selected host defense molecules are interleukins IL-1β, IL-6, and IL-8, neutrophilic elastase (NE), myeloperoxidase (MPO), metalloproteinase 9 (MMP9), and metallopeptidase inhibitor 1 (TIMP1).
Explained variance by the respective principal component.
Sequential sum of explained variance sorted in ascending order.
DISCUSSION
CF airway ecology has been investigated by the 16S rRNA gene profile of bacterial taxa (5–8, 10–26, 61) and by WGS sequencing of the metagenome (3, 34–36, 38–48, 50). A common rationale for metagenome sequencing is the identification of the genetic content and the functional potential of the microbiota, particularly if most taxa of the habitat of interest are still unknown (33, 64). In the case of the human airway, however, completely sequenced reference genomes are already available for almost all bacterial taxa that reside in this habitat. The functional potential of the microbiome can thus be more reliably explored from annotated reference genomes than from metagenome sequence data sets, which, because of their insufficient coverage, will preclude genome assembling, at least from the rare species in the community (65). Instead, as described in this report, untargeted airway metagenome sequencing has value in the determination of the abundance of microbes (3) and the sensitive and specific identification of poorly cultivatable, rare, and closely related species (65, 66). The quantitative taxonomic profile of core and rare species can then be exploited for a comprehensive evaluation of the microbial community.
The published CF airway metagenome studies focused primarily on adolescents and adults with established lung disease characterized by the vicious circle of infection, inflammation, and airway remodeling (1, 67). Our metagenome data sets presented in this report and the recently published report on CF infants (50) cover all age groups. Since the disease centile distribution in our cohort matches that of the European registry (52), the results of this single-center study should be representative of the current CF population.
The microbial community was found to be rather loosely structured during the first year of life. During the second and third years, we identified stable microbial communities that were indistinguishable from those of a healthy infant (50). However, by the age of 4 years, the CF disease-specific signature of the lung metagenome started to emerge. By comparing the individual CF metagenomes of older patients (8 to 60 years) in this study with the average metagenome of healthy non-CF persons, it became evident that the spectrum of commensal bacteria in CF sputa was qualitatively and quantitatively different from that in sputa from healthy donors, even in the absence of any typical CF pathogen (Fig. 7A). Since sampling, sequencing, data processing, and evaluation of CF and non-CF specimens were performed at the same site, we assume that the methodological and technical bias is acceptable and that the divergent spectrum of commensals in the respiratory secretion of healthy and CF people reflects differential living conditions for the microbial community in CF and non-CF conducting airways.
The basic defect in CF, which from early on leads to mucus obstruction, anomalous pH of the airway surface liquid, a compromised innate, and an activated adaptive immunity (2), modifies the airway microbiome when people with CF are apparently healthy, have normal spirometry, and do not harbor any of the CF lead pathogens. Later, when the common CF pathogens take up residence in the lungs, we observed differential modes of depletion of the commensal microbiota in the presence of S. aureus, P. aeruginosa, S. maltophilia, or combinations thereof (Fig. 7). Variable CF habitats and divergent microbial lifestyles and microbe-microbe and host-microbe interactions should account for the differential evolution of the CF airway metagenome, which finally merges to uniform bacterial monocultures in end-stage lung disease. To the best of our knowledge, the varied depletion of commensals in the CF airway metagenome has yet not been reported. Since the loss or persistence of selected taxa could represent biomarkers of dysbiosis and its degree of severity, we encourage our peers to execute a replication study at another site. It will be of particular interest to learn whether combination therapy with effective CFTR modulators (68, 69) could prevent or slow the fragmentation of the CF airway metagenome in preschool children, when its composition is closest to that in healthy lungs (50, 65).
In summary, from early on, the CF airway metagenome evolves differently from that of the healthy non-CF population. Consistent with this conclusion, microbial ecology and selected bacterial taxa were the key classifiers that differentiated CF from non-CF sputa by the random forest algorithm (Fig. 8). Likewise, the individual profile of a patient’s sputum metagenome was more strongly shaped by the bacteria than by the CF host’s lung function and his local inflammatory response (Tables 5 and 6).
Besides these more general implications of this work, there are some specific findings worth mentioning, because they enrich or modify our understanding of CF airway microbiology. First, if the typical CF pathogens S. aureus and P. aeruginosa belonged to the trio of most abundant species (1, 8), only one of them was among the most abundant. Although culture-dependent diagnostics often identifies both taxa, the metagenome data told us that the dominant species is typically more abundant than the other by 1 order of magnitude or more. Coexistence of and interaction between S. aureus and P. aeruginosa are less prevalent than deduced from detection rates documented in registries. S. aureus and P. aeruginosa were present in nasal lavage samples but were far less abundant than in sputum, and hence, they may not play a central role in the nasal metagenome. The sinuses and nose are suspected to represent the gateway for the CF pathogens, from which they invade the lower airway (55, 70). This scenario has been substantiated by numerous independent studies. However, within the upper CF airway, corynebacteria and skin commensals are the key residents. With the exception of H. influenzae, the oral cavity was devoid of typical CF pathogens, consistent with our knowledge that the buccal epithelium does not express CFTR and thus is not affected in CF.
The lungs are the major site of chronic infections. The Gram-negative CF pathogens were more prevalent in the lung microbiomes of PI than in those of PS patients. Conversely, the constituents of a healthy lung flora, i.e., Veillonella, Prevotella, and Streptococcus species were more common in PS than in PI lungs. These findings confirm the registry data based on culture-dependent diagnostics showing that the amount of CFTR activity is the key determinant for the microbial colonization of CF lungs (1, 71); i.e., PI patients carry CFTR mutation genotypes which confer null or minimal CFTR activity, whereas PS patients harbor CFTR genotypes which confer some residual CFTR activity, about 10 to 20% of wild type (51, 70, 72).
Our metagenome study has some limitations. First, RNA viruses are not yet identified by DNA-based pipelines. Second, due to technical constraints of the protocols, microbes with easy-to-lyse and hard-to-lyse cell walls, such as Tenericutes and mycobacteria, were not recovered in representative amounts (3, 49). Third, metagenome data are presented from a single CF center. In principle, a multicenter study would have been more informative; however, generating high-quality data would have been logistically challenging. We performed sampling and processing according to in-house standard operating procedures (SOPs). Samples were shock-frozen to −80°C within less than 30 s, stored at −80°C, and processed using meticulous precautions (49, 50). We are hence confident that our data reflect the metagenomes of the airway compartments. Sputum reflects not only the lung microbiota but also to some extent the oral microbiota. Bronchoalveolar lavage principally bypasses the oropharynx (73), but according to our experience, the procedure is prone to microbial contamination by the equipment and the lavage fluid. Alternatively, saliva and sputum have been sampled in parallel to estimate the relative proportions of the lung and oral microbiome in a sputum sample (5). As in this work, we will continue to use induced sputum (58) collected according to a SOP of the U.S. CF Foundation (74). The sequential sampling after 3-min cycles of inhalation of hypertonic saline yields specimens that from the second portion onward are virtually free of oral contaminants.
In conclusion, each CF patient carries a personalized microbiome in his airway that after birth is initially unstable, but subsequently develops into a stable close-to-healthy network, which then gradually declines and becomes increasingly fragmented from early school age on. The spectrum of commensals is different in health and CF, and it remains to be seen whether the CF microbiomes will change their temporal characteristics with the implementation of lifelong CFTR modulation from early on.
MATERIALS AND METHODS
Cohorts.
Individuals with CF were recruited from the CF clinic at Hannover Medical School, Germany. All patients had been regularly seen at the CF clinic since the age of diagnosis. All patients were clinically stable at the day of sampling and had not experienced any pulmonary exacerbation (75) in the preceding 4 weeks. The diagnosis of CF had been made by the detection of two disease-causing mutations in the CFTR gene (51) and elevated chloride concentrations in the Gibson-Cooke pilocarpine iontophoresis sweat test (76) or a Sermet score in the CF range of nasal transepithelial potential difference measurements (77, 78) and/or chloride secretory responses in the CF range of intestinal current measurements (79). Exocrine pancreatic status was assessed by the fecal elastase 1 test (80). The exocrine pancreas-insufficient (PI) patients were either homozygous for the most common CF mutation, p.Phe508del (51), or compound heterozygous for p.Phe508del and a loss-of-function class 1 mutation (51). The exocrine pancreas-sufficient (PS) subjects were either compound heterozygous for a PI- and a PS-conferring CFTR mutation or homozygous for a PS mutation (51). Information about the patients’ lung function, anthropometry, comorbidities, and treatment was extracted from the patients’ electronic files.
The healthy cohort included 84 males and 73 females of European descent living in the Hannover region. The age of the control subjects varied from 10 to 59 years (median, 24 years; inner quartiles, 16 to 37 years). All subjects were free from pulmonary medical conditions, acute infection, and other known illness or chronic disease at the time of enrollment. Prior to the collection of induced sputum and oropharyngeal swabs, the participants were asked to fill out a questionnaire regarding their height, weight, smoking status, allergies, living conditions, physical activity, and nutrition (Table S2). The cohort consisted of 8 smokers and 149 nonsmokers. The median body mass index (BMI) was 22.3 kg/m2 (inner quartiles, 19.4 to 24.9 kg/m2; range, 14.7 to 35.9 kg/m2).
The study was approved by the ethics committee of the Medizinische Hochschule Hannover (no. 1510-2012; no. 9299_BO_K_2020) and by the ethics committee of the Jena University Hospital (no. 4323-02/15). Informed consent was obtained from all adult study participants and from legal guardians in the case of minors.
Lung function.
Spirometry was performed and forced expiratory volume in the first second (FEV1), vital capacity (VCmax), and mid-expiratory flow at 25% of VCmax (MEF25) were determined according to American Thoracic Society (ATS)/European Respiratory Society standards (ERS) standards (81). Percent predicted results were based on equations of the global lung initiative (82).
CF disease centiles.
Patients’ BMI and FEV1 values were ascertained on sample collection day and at all visits to the CF clinic at half-year intervals prior to and after taking the sample. Average values were mapped onto the age- and gender-corrected centile distribution of the 2013 edition of the CF European registry (52). Next, the FEV1 and BMI centiles were combined into a disease centile, which gives equal weight to lung function and anthropometry:
The issue of whether the disease centile distribution of the study cohort recruited from the CF clinic Hannover matched that of the current European CF patient population was tested as follows. If a population of n CF patients follows the theoretical centile distribution, one can allocate n equidistant rank numbers to these individuals. If one determines all possible rank number differences in this cohort, the rank number difference of 1 will occur n − 1 times, but the maximal rank number difference (n − 1) will occur only once. In other words, the frequency of rank number difference will linearly decrease with increasing rank number difference. To compare the disease centile distribution of the m individuals of the study cohort with the theoretical centile distribution, the centiles from 1 to 100 were divided into m equal intervals, representing the m rank numbers. The centile of each individual was converted into a rank number, and for each individual, the rank number differences with all other members of the cohort were counted. The plot of the counts versus rank number difference was then compared with the expected linear decline from a rank number difference of 0 to a rank number difference of m (Fig. 1).
Sampling from CF patients and healthy non-CF controls.
Samples were collected at regular visits of the CF outpatient clinic or prior to the first dose of a 14-day course of elective intravenous antipseudomonal chemotherapy. The upper airways were explored by nasal lavage (63): First, the nose was cleared of excessive mucus and crusts by forceful exsufflation/blowing into a handkerchief. Then, the seated or standing patient extended the neck approximately 30° from the horizontal while 10 mL of isotonic saline was instilled in a nasal side during elevation of the soft palate and holding of breath. After 10 s, the subject bent forward and gently rinsed the nasal lavage fluid into a sterile specimen cup. The fluid was decanted into a precooled tube, shock frozen, and stored at −80°C. The oral cavity was sampled with cotton swabs close to the oropharynx without inducing a cough. The cotton tip was immediately clipped with scissors into a precooled Eppendorf tube and shock frozen at −80°C. Induced sputum was collected by four 3-min cycles during which the participant breathed aerosolized 6% hypertonic saline according to SOP 530.00 of the Cystic Fibrosis Foundation Therapeutics (CFFT) Therapeutics Development Network (TDN) (74). Sputum expectorated after the second, third, and fourth cycles was collected, shock frozen within less than 30 s at −80°C, and then stored at −80°C.
Control samples.
To control for the performance of sequencing, a mock community of equal amounts of genomic GC-rich P. aeruginosa, AT-rich Streptococcus salivarius, and GC≈AT Escherichia coli DNA was sequenced.
Measurement of cytokines and chemokines (83).
The concentrations of the soluble inflammatory mediators interleukin 1β (IL-1β), IL-6, and IL-8, matrix metalloproteinase 9 (MMP-9), and tissue inhibitor of metalloproteinases 1 (TIMP-1) were quantified via bead-based multiplexed assay (xMAP; Luminex, Austin, TX, USA) by using Milliplex MAP kits (human high-sensitivity HS TCMAG-28K kit, human MMP panel 2 HMMP2MAG-55K kit, and human TIMP panel 1 HTMP1MAG-54K kit; all from Merck Millipore, Darmstadt, Germany). Neutrophil elastase (NE) levels were measured by enzyme-linked immunosorbent assay (DEH3311; Demeditec Diagnostics GmbH, Kiel, Germany). Assays were performed following the manufacturers’ instructions.
Processing of samples.
(i) Induced sputum. Fresh or thawed samples were diluted 1:5 with ice-cold 97.5% phosphate-buffered saline–2.5% mercaptoethanol (vol/vol) and incubated on ice for 2 h with shaking. The suspension was centrifuged (15 min; 3,800 × g; 10°C), and the pellet was dried for 10 s, dissolved at 4°C in 10 mL double-distilled water, and then incubated on ice for 15 min on a rocker switch. This cycle of centrifugation, drying, and incubation in distilled water was repeated twice. The pellet was dissolved in 1 mL 0.1 RDD buffer (Qiagen, Hilden) and incubated in two 0.5-mL aliquots with 60 U DNase I for 90 min at 30°C under shaking (350 rpm). The solutions were combined, diluted with 40 mL DNase buffer and centrifuged (15 min; 3,800 × g; 10°C). The pellets were washed three times with 10 mL SE buffer (75 mM NaCl, 25 mM EDTA, pH 7.4) each by centrifugation, then dissolved in 0.5 mL SE buffer, and pelleted again (10 min; 12,000 × g; 10°C). Subsequently, genomic DNA was purified according to the “hard-to-lyse-bacteria” protocol with the NucleoSpin tissue kit (Macherey-Nagel, Düren, Germany). DNA was stored at 4°C in Tris-EDTA buffer. The yield of double-stranded DNA was determined with a Qubit 1.0 fluorimeter with the Qubit dsDNA BR assay kit (Q32850; Agilent Technologies, Santa Clara, CA). This protocol was found to be an acceptable compromise to obtain some nonstoichiometric amounts of hard-to-lyse mycobacteria and fungi without losing all easy-to-lyse mycoplasms.
(ii) Swabs. Cells were removed from the swab tips by 15 min incubation in SE buffer with intermittent vortex mixing. After centrifugation (15 min; 14,000 × g), the DNA was extracted from the dissolved pellet by the hard-to-lyse-bacteria protocol with the NucleoSpin tissue kit.
(iii) Nasal lavage fluid. Cells were collected by centrifugation (15 min; 14,000 × g). Then DNA was extracted from the dissolved pellet by the hard-to-lyse-bacteria protocol with the NucleoSpin tissue kit.
Library preparation and DNA sequencing.
For all samples collected from people with CF and oropharyngeal swabs collected from healthy people, fragment libraries were prepared with a NEBNext Ultra II DNA library preparation kit for Illumina (E7645) and NEBNext unique dual index primer pairs (E64405), with a total of six to eight PCR cycles. The Illumina NextSeq 500/550 platform was used for sequencing (high-output kit v2.5; 75 cycles, single-end reads; no. 20024906; Illumina, San Diego, CA). For the induced sputa collected from healthy subjects, fragment libraries were prepared with the Nextera DNA Flex library preparation kit (no. 20018704; Illumina) with index adapter set B (no. 20027214) and subsequent PCR amplification. Sequencing was performed on the Illumina NovaSeq platform (NovaSeq 6000 S1 reagent kit v1.5; 300 cycles, paired-end reads; Illumina no. 20028317).
Taxonomic classification of the airway microbial community.
The whole-metagenomic-sequencing alignment pipeline Wochenende (version 2.0.0) was applied to the taxonomic classification of microbial community members (84). For each sample, raspir (version 1.0.2) was run to extract microbial species with uniform read distributions toward the reference genome (66). EukDetect enabled an assessment of the community of fungi and moulds from our metagenome samples (59). Read counts were RPMM normalized, accounting for variations in both sequencing depth across samples (reads per million reads) and chromosome length (reads per million reference bases) (Data Sets S1 and S2) (84). Alternatively, bacterial load was quantified by the ratio of bacterial cells to human cells (3) or semiquantitative qPCR for selected cases.
qPCR.
qPCR was performed to determine independent of metagenome copy number estimates, the amount of DNA of different bacterial species and human DNA in the metagenome samples. qPCR targets were selected for human DNA and six bacterial species (S. aureus, P. aeruginosa, H. influenzae, Prevotella melaninogenica, Rothia mucilaginosa, and Veillonella atypica), which had been indicated as being present in many metagenome samples by the full sequencing approach. Target loci and the respective primers were selected after literature screening, choosing targets that were described to have been used in qPCR approaches for distinctive species identification and quantification from mixed DNA samples (see Table S3 for a list of targets, primer sequences, and references). qPCRs using SYBR green for product quantification were performed with the QuantiFast SYBR green RT-PCR kit (Qiagen) on a 7500 fast real-time PCR system (Applied Biosciences). The corresponding software (Life Technologies 7500 software v2.3) was used for monitoring fluorescence curves, determination of cycle threshold (CT) values, calculation of ΔCT values from samples and negative controls, and melting curve analyses of the products. Experiments were performed in 96-well plates with a 20-μL reaction volume per well. All qPCRs were done with 10 ng DNA template, 40 amplification cycles, and 60°C as the annealing temperature (see Table S3 for the exact reaction conditions and corresponding PCR program). Each reaction was executed at least in duplicate on the same 96-well plate to allow CT value determination from more than one well. Controls containing distilled water instead of template DNA were included in each 96-well plate for all primer pairs used in the respective plate. After the qPCR, the products were inspected by gel electrophoresis using 4% agarose gels. Primer pairs were initially tested in positive- and negative-control reactions (with DNA of single species as templates) in order to optimize primer concentrations but also to sort out primer pairs that generated interfering amounts of nonspecific products.
Overall, 21 different metagenome DNA samples were used as templates in qPCRs. For all of them, reactions were done with primers targeting human DNA and primers for one to six bacterial species, depending on the preceding sequencing results for these samples. In addition, qPCRs were performed using artificial DNA mixes as templates, containing human DNA and DNA of two to four bacterial species in various proportions (refer to Table S3 for composition). Besides serving as additional controls, these reactions were also applied for assigning CT values to known DNA amounts or copy numbers, respectively, of a species in a qPCR template.
A detailed overview of all tested template/primer combinations is shown in Table S4.
Statistics.
For comparing two independent groups and more than two groups, the nonparametric Mann–Whitney U test and the Kruskal-Wallis rank test were applied, respectively. For two groups, the effect size r was calculated, which is the Mann-Whitney U test statistics divided by the square-root of the sample size. For more than two groups, the epsilon-squared effect size and the corresponding confidence intervals (CI) were obtained. The Conover-Iman test with the Benjamini-Hochberg adjustment (85) was used for multiple comparisons between group levels. R statistical software was used for data analyses, including the rcompanion package for statistical testing (86).
Principal-component analysis was performed with XLSTAT BASE. MetaPhlAn2 (87, 88) was used for taxonomic classification of normalized sequence data and for the construction of heat maps of the most abundant species. Bray-Curtis dissimilarity indices were obtained for nonmetric multidimensional scaling (89) (without autotransform adjustment, k = 3, stress < 0.2). A permutation test (envfit [89], permutations = 1,000) was used to establish relationships between the nonmetric multidimensional ordination axes and metadata variables.
We applied the random forest algorithm (62) to rank the importance of variables for differentiating samples originating from CF patients and healthy non-CF subjects. To reduce the random forest regression model’s complexity (62, 90), a Boruta wrapper algorithm (91) was applied beforehand, extracting uncorrelated, nonrandom, and nonredundant features from our high-dimensional data set. Two hundred ninety-four input variables were selected for the classification, including numerical ecological parameters of the bacterial and fungal community (Shannon diversity, Simpson diversity, Pielou’s evenness indices, and species number), subject’s metadata (gender and age), and bacterial and fungal metagenome data at the species level. The Boruta algorithm generated shadow attributes by randomly shuffling the original input variables and iteratively comparing the random with the corresponding original variables, so that nonrandom features of importance for the prediction were distinguishable from random feature assignments (65, 91). Outcome of the out-of-bag (OOB) estimates of error for random forest classifications were repeated 100 times with different seeds set for the classification and Boruta feature selection. All data analysis was performed in R (86).
Data availability.
The metagenome sequencing data sets have been deposited in the European Nucleotide Archive (ENA) (study accession no. PRJEB38221, PRJEB52317, and PRJEB52822) hosted by the European Bioinformatics Institute (EMBL-EBI) (www.ebi.ac.uk). The pipeline Wochenende is available at https://github.com/MHH-RCUG/nf_wochenende. The data integration tool Haybaler is at https://github.com/MHH-RCUG/haybaler. Our reference masking tool Blacklister is at https://github.com/colindaven/blacklister. The reference database is publicly available from https://drive.google.com/drive/folders/1q1btJCxtU15XXqfA-iCyNwgKgQq0SrG4?usp=sharing. The STORMS checklist for this article is available from Zenodo (10.5281/zenodo.7053226).
ACKNOWLEDGMENTS
We cordially thank the patients for their participation and the CF physicians at the Hannover CF clinic, namely, Anna Maria Dittrich, Christian Dopfer, Sibylle Junge, and Annette Sauer-Heilborn, for patient care. Philippe Chouvarine provided bioinformatics support during the initiation period of the study. Technical support by Patrick Chhatwal, Julia Hentschel, Joshua Schulte, Angela Schulz, and Samira Mielke is gratefully acknowledged.
This work was supported by the German Federal Ministry of Education and Research (82DZL002A1) for the German Center for Lung Research (DZL), Disease Area Cystic Fibrosis, and the Deutsche Forschungsgemeinschaft (SFB900, projects A2 and Z1, project no. 158989968). K.P., M.-M.P., and P.M.L. were members of the Ph.D. program of the Center for Infection Biology (ZIB) of Hannover Medical School. M.-M.P. was a scholar of the Studienstiftung des deutschen Volkes. M.G. received a stipend from the Else-Kröner-Fresenius Stiftung within the frame of the Klin-StrucMed program of Hannover Medical School.
J.G.M. and B.T. conceived the study. R.M., M.-M.P., A.T., L.W., and B.T. collected the samples. C.A. assayed the proinflammatory parameters. S.H., M.-M.P., A.T., M.D., and L.W. processed the samples for sequencing. J.K. performed qPCR. K.P., M.-M.P., S.G., and P.M.L. wrote scripts and algorithms. K.P., M.-M.P., M.G., S.G., I.R., P.M.L., J.K., and B.T. performed the in silico analysis, interpreted the outcome, and analyzed the association between metagenome and clinical status. K.P., M.-M.P., M.G., J.K., and B.T. wrote the manuscript. All authors reviewed the manuscript and approved the final version.
Footnotes
Supplemental material is available online only.
Contributor Information
Burkhard Tümmler, Email: tuemmler.burkhard@mh-hannover.de.
Joanna B. Goldberg, Emory University School of Medicine
REFERENCES
- 1.Bell SC, Mall MA, Gutierrez H, Macek M, Madge S, Davies JC, Burgel PR, Tullis E, Castaños C, Castellani C, Byrnes CA, Cathcart F, Chotirmall SH, Cosgriff R, Eichler I, Fajac I, Goss CH, Drevinek P, Farrell PM, Gravelle AM, Havermans T, Mayer-Hamblett N, Kashirskaya N, Kerem E, Mathew JL, McKone EF, Naehrlich L, Nasr SZ, Oates GR, O'Neill C, Pypops U, Raraigh KS, Rowe SM, Southern KW, Sivam S, Stephenson AL, Zampoli M, Ratjen F. 2020. The future of cystic fibrosis care: a global perspective. Lancet Respir Med 8:65–124. doi: 10.1016/S2213-2600(19)30337-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Stoltz DA, Meyerholz DK, Welsh MJ. 2015. Origins of cystic fibrosis lung disease. N Engl J Med 372:351–362. doi: 10.1056/NEJMra1300109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Moran Losada P, Chouvarine P, Dorda M, Hedtfeld S, Mielke S, Schulz A, Wiehlmann L, Tümmler B. 2016. The cystic fibrosis lower airways microbial metagenome. ERJ Open Res 2:E00096-2015. doi: 10.1183/23120541.00096-2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Muhlebach MS, Hatch JE, Einarsson GG, McGrath SJ, Gilipin DF, Lavelle G, Mirkovic B, Murray MA, McNally P, Gotman N, Davis TS, Wolfgang MC, Gilligan PH, McElvaney NG, Elborn JS, Boucher RC, Tunney MM. 2018. Anaerobic bacteria cultured from cystic fibrosis airways correlate to milder disease: a multisite study. Eur Respir J 52:1800242. doi: 10.1183/13993003.00242-2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lu J, Carmody LA, Opron K, Simon RH, Kalikin LM, Caverly LJ, LiPuma JJ. 2020. Parallel analysis of cystic fibrosis sputum and saliva reveals overlapping communities and an opportunity for sample decontamination. mSystems 5:e00296-20. doi: 10.1128/mSystems.00296-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.O'Toole GA. 2018. Cystic fibrosis airway microbiome: overturning the old, opening the way for the new. J Bacteriol 200:e00561-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Caverly LJ, LiPuma JJ. 2018. Cystic fibrosis respiratory microbiota: unraveling complexity to inform clinical practice. Expert Rev Respir Med 12:857–865. doi: 10.1080/17476348.2018.1513331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Blanchard AC, Waters VJ. 2019. Microbiology of cystic fibrosis airway disease. Semin Respir Crit Care Med 40:727–736. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bevivino A, Bacci G, Drevinek P, Nelson MT, Hoffman L, Mengoni A. 2019. Deciphering the ecology of cystic fibrosis bacterial communities: towards systems-level integration. Trends Mol Med 25:1110–1122. doi: 10.1016/j.molmed.2019.07.008. [DOI] [PubMed] [Google Scholar]
- 10.Khanolkar RA, Clark ST, Wang PW, Hwang DM, Yau YCW, Waters VJ, Guttman DS. 2020. Ecological succession of polymicrobial communities in the cystic fibrosis airways. mSystems 5:e00809-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Fodor AA, Klem ER, Gilpin DF, Elborn JS, Boucher RC, Tunney MM, Wolfgang MC. 2012. The adult cystic fibrosis airway microbiota is stable over time and infection type, and highly resilient to antibiotic treatment of exacerbations. PLoS One 7:e45001. doi: 10.1371/journal.pone.0045001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Rogers GB, Shaw D, Marsh RL, Carroll MP, Serisier DJ, Bruce KD. 2015. Respiratory microbiota: addressing clinical questions, informing clinical practice. Thorax 70:74–81. doi: 10.1136/thoraxjnl-2014-205826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Coburn B, Wang PW, Diaz Caballero J, Clark ST, Brahma V, Donaldson S, Zhang Y, Surendra A, Gong Y, Elizabeth Tullis D, Yau YC, Waters VJ, Hwang DM, Guttman DS. 2015. Lung microbiota across age and disease stage in cystic fibrosis. Sci Rep 5:10241. doi: 10.1038/srep10241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mika M, Korten I, Qi W, Regamey N, Frey U, Casaulta C, Latzin P, Hilty M, SCILD study group . 2016. The nasal microbiota in infants with cystic fibrosis in the first year of life: a prospective cohort study. Lancet Respir Med 4:627–635. doi: 10.1016/S2213-2600(16)30081-9. [DOI] [PubMed] [Google Scholar]
- 15.Frayman KB, Armstrong DS, Carzino R, Ferkol TW, Grimwood K, Storch GA, Teo SM, Wylie KM, Ranganathan SC. 2017. The lower airway microbiota in early cystic fibrosis lung disease: a longitudinal analysis. Thorax 72:1104–1112. doi: 10.1136/thoraxjnl-2016-209279. [DOI] [PubMed] [Google Scholar]
- 16.Zemanick ET, Wagner BD, Robertson CE, Ahrens RC, Chmiel JF, Clancy JP, Gibson RL, Harris WT, Kurland G, Laguna TA, McColley SA, McCoy K, Retsch-Bogart G, Sobush KT, Zeitlin PL, Stevens MJ, Accurso FJ, Sagel SD, Harris JK. 2017. Airway microbiota across age and disease spectrum in cystic fibrosis. Eur Respir J 50:1700832. doi: 10.1183/13993003.00832-2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Muhlebach MS, Zorn BT, Esther CR, Hatch JE, Murray CP, Turkovic L, Ranganathan SC, Boucher RC, Stick SM, Wolfgang MC. 2018. Initial acquisition and succession of the cystic fibrosis lung microbiome is associated with disease progression in infants and preschool children. PLoS Pathog 14:e1006798. doi: 10.1371/journal.ppat.1006798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Boutin S, Depner M, Stahl M, Graeber SY, Dittrich SA, Legatzki A, von Mutius E, Mall M, Dalpke AH. 2017. Comparison of oropharyngeal microbiota from children with asthma and cystic fibrosis. Mediators Inflamm 2017:5047403. doi: 10.1155/2017/5047403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Acosta N, Heirali A, Somayaji R, Surette MG, Workentine ML, Sibley CD, Rabin HR, Parkins MD. 2018. Sputum microbiota is predictive of long-term clinical outcomes in young adults with cystic fibrosis. Thorax 73:1016–1025. doi: 10.1136/thoraxjnl-2018-211510. [DOI] [PubMed] [Google Scholar]
- 20.Caverly LJ, Lu J, Carmody LA, Kalikin LM, Shedden K, Opron K, Azar M, Cahalan S, Foster B, VanDevanter DR, Simon RH, LiPuma JJ. 2019. Measures of cystic fibrosis airway microbiota during periods of clinical stability. Ann Am Thorac Soc 16:1534–1542. doi: 10.1513/AnnalsATS.201903-270OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kirst ME, Baker D, Li E, Abu-Hasan M, Wang GP. 2019. Upper versus lower airway microbiome and metagenome in children with cystic fibrosis and their correlation with lung inflammation. PLoS One 14:e0222323. doi: 10.1371/journal.pone.0222323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Voronina OL, Ryzhova NN, Kunda MS, Loseva EV, Aksenova EI, Amelina EL, Shumkova GL, Simonova OI, Gintsburg AL. 2020. Characteristics of the airway microbiome of cystic fibrosis patients. Biochemistry (Mosc) 85:1–10. doi: 10.1134/S0006297920140011. [DOI] [PubMed] [Google Scholar]
- 23.Cuthbertson L, Walker AW, Oliver AE, Rogers GB, Rivett DW, Hampton TH, Ashare A, Elborn JS, De Soyza A, Carroll MP, Hoffman LR, Lanyon C, Moskowitz SM, O'Toole GA, Parkhill J, Planet PJ, Teneback CC, Tunney MM, Zuckerman JB, Bruce KD, van der Gast CJ. 2020. Lung function and microbiota diversity in cystic fibrosis. Microbiome 8:45. doi: 10.1186/s40168-020-00810-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Raghuvanshi R, Vasco K, Vázquez-Baeza Y, Jiang L, Morton JT, Li D, Gonzalez A, DeRight Goldasich L, Humphrey G, Ackermann G, Swafford AD, Conrad D, Knight R, Dorrestein PC, Quinn RA. 2020. High-resolution longitudinal dynamics of the cystic fibrosis sputum microbiome and metabolome through antibiotic therapy. mSystems 5:e00292-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Garcia-Nuñez M, Garcia-Gonzalez M, Pomares X, Montón C, Millares L, Quero S, Prina E, Asensio O, Bosque M, Capilla S, Cuevas O, Monsó E. 2020. The respiratory microbiome in cystic fibrosis: compartment patterns and clinical relationships in early stage disease. Front Microbiol 11:1463. doi: 10.3389/fmicb.2020.01463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Heirali A, Thornton C, Acosta N, Somayaji R, Laforest LI, Storey D, Rabin H, Waddell B, Rossi L, Arrieta MC, Surette M, Parkins MD. 2020. Sputum microbiota in adults with CF associates with response to inhaled tobramycin. Thorax 75:1058–1064. doi: 10.1136/thoraxjnl-2019-214191. [DOI] [PubMed] [Google Scholar]
- 27.Lucas SK, Feddema E, Boyer HC, Hunter RC. 2021. Diversity of cystic fibrosis chronic rhinosinusitis microbiota correlates with different pathogen dominance. J Cyst Fibros 20:678–681. doi: 10.1016/j.jcf.2021.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hampton TH, Thomas D, van der Gast C, O'Toole GA, Stanton BA. 2021. Mild cystic fibrosis lung disease is associated with bacterial community stability. Microbiol Spectr 9:e00029-21. doi: 10.1128/Spectrum.00029-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Widder S, Zhao J, Carmody LA, Zhang Q, Kalikin LM, Schloss PD, LiPuma JJ. 2022. Association of bacterial community types, functional microbial processes and lung disease in cystic fibrosis airways. ISME J 16:905–914. doi: 10.1038/s41396-021-01129-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Sosinski LM, Cm H, Neugebauer KA, Ghuneim LJ, Guzior DV, Castillo-Bahena A, Mielke J, Thomas R, McClelland M, Conrad D, Quinn RA. 2021. A restructuring of microbiome niche space is associated with elexacaftor-tezacaftor-ivacaftor therapy in the cystic fibrosis lung. J Cyst Fibros 21:996–1005. doi: 10.1016/j.jcf.2021.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Weiser R, Oakley J, Ronchetti K, Tame JD, Hoehn S, Jurkowski TP, Mahenthiralingam E, Forton JT. 2022. The lung microbiota in children with cystic fibrosis captured by induced sputum sampling. J Cyst Fibros 21:1006–1012. doi: 10.1016/j.jcf.2022.01.006. [DOI] [PubMed] [Google Scholar]
- 32.Frey DL, Bridson C, Dittrich S, Graeber SY, Stahl M, Wege S, Herth F, Sommerburg O, Schultz C, Dalpke A, Mall MA, Boutin S. 2022. Changes in microbiome dominance are associated with declining lung function and fluctuating inflammation in people with cystic fibrosis. Front Microbiol 13:885822. doi: 10.3389/fmicb.2022.885822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Quince C, Walker AW, Simpson JT, Loman NJ, Segata N. 2017. Shotgun metagenomics, from sampling to analysis. Nat Biotechnol 35:833–844. doi: 10.1038/nbt.3935. [DOI] [PubMed] [Google Scholar]
- 34.Lim YW, Schmieder R, Haynes M, Willner D, Furlan M, Youle M, Abbott K, Edwards R, Evangelista J, Conrad D, Rohwer F. 2013. Metagenomics and metatranscriptomics: windows on CF-associated viral and microbial communities. J Cyst Fibros 12:154–164. doi: 10.1016/j.jcf.2012.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hauser PM, Bernard T, Greub G, Jaton K, Pagni M, Hafen GM. 2014. Microbiota present in cystic fibrosis lungs as revealed by whole genome sequencing. PLoS One 9:e90934. doi: 10.1371/journal.pone.0090934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lim YW, Evangelista JS, 3rd, Schmieder R, Bailey B, Haynes M, Furlan M, Maughan H, Edwards R, Rohwer F, Conrad D. 2014. Clinical insights from metagenomics analysis of sputum samples from patients with cystic fibrosis. J Clin Microbiol 52:425–437. doi: 10.1128/JCM.02204-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Quinn RA, Whiteson K, Lim YW, Zhao J, Conrad D, LiPuma JJ, Rohwer F, Widder S. 2016. Ecological networking of cystic fibrosis lung infections. NPJ Biofilms Microbiomes 2:4. doi: 10.1038/s41522-016-0002-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Bacci G, Mengoni A, Fiscarelli E, Segata N, Taccetti G, Dolce D, Paganin P, Morelli P, Tuccio V, De Alessandri A, Lucidi V, Bevivino A. 2017. A different microbiome gene repertoire in the airways of cystic fibrosis patients with severe lung disease. Int J Mol Sci 18:1654. doi: 10.3390/ijms18081654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Feigelman R, Kahlert CR, Baty F, Rassouli F, Kleiner RL, Kohler P, Brutsche MH, von Mering C. 2017. Sputum DNA sequencing in cystic fibrosis: non-invasive access to the lung microbiome and to pathogen details. Microbiome 5:20. doi: 10.1186/s40168-017-0234-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Nelson MT, Pope CE, Marsh RL, Wolter DJ, Weiss EJ, Hager KR, Vo AT, Brittnacher MJ, Radey MC, Hayden HS, Eng A, Miller SI, Borenstein E, Hoffman LR. 2019. Human and extracellular DNA depletion for metagenomic analysis of complex clinical infection samples yields optimized viable microbiome profiles. Cell Rep 26:2227–2240.E5. doi: 10.1016/j.celrep.2019.01.091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Whelan FJ, Waddell B, Syed SA, Shekarriz S, Rabin HR, Parkins MD, Surette MG. 2020. Culture-enriched metagenomic sequencing enables in-depth profiling of the cystic fibrosis lung microbiota. Nat Microbiol 5:379–390. doi: 10.1038/s41564-019-0643-y. [DOI] [PubMed] [Google Scholar]
- 42.Hahn A, Whiteson K, Davis TJ, Phan J, Sami I, Koumbourlis AC, Freishtat RJ, Crandall KA, Bean HD. 2020. Longitudinal associations of the cystic fibrosis airway microbiome and volatile metabolites: a case study. Front Cell Infect Microbiol 10:174. doi: 10.3389/fcimb.2020.00174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Nelson MT, Wolter DJ, Eng A, Weiss EJ, Vo AT, Brittnacher MJ, Hayden HS, Ravishankar S, Bautista G, Ratjen A, Blackledge M, McNamara S, Nay L, Majors C, Miller SI, Borenstein E, Simon RH, LiPuma JJ, Hoffman LR. 2020. Maintenance tobramycin primarily affects untargeted bacteria in the CF sputum microbiome. Thorax 75:780–790. doi: 10.1136/thoraxjnl-2019-214187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Bacci G, Taccetti G, Dolce D, Armanini F, Segata N, Di Cesare F, Lucidi V, Fiscarelli E, Morelli P, Casciaro R, Negroni A, Mengoni A, Bevivino A. 2020. Untargeted metagenomic investigation of the airway microbiome of cystic fibrosis patients with moderate-severe lung disease. Microorganisms 8:1003. doi: 10.3390/microorganisms8071003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.de Almeida OGG, Capizzani CPDC, Tonani L, Grizante Barião PH, da Cunha AF, De Martinis ECP, Torres L, von Zeska Kress MR. 2020. The lung microbiome of three young Brazilian patients with cystic fibrosis colonized by fungi. Front Cell Infect Microbiol 10:598938. doi: 10.3389/fcimb.2020.598938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Dmitrijeva M, Kahlert CR, Feigelman R, Kleiner RL, Nolte O, Albrich WC, Baty F, von Mering C. 2021. Strain-resolved dynamics of the lung microbiome in patients with cystic fibrosis. mBio 12:e02863-20. doi: 10.1128/mBio.02863-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Neerincx AH, Whiteson K, Phan JL, Brinkman P, Abdel-Aziz MI, Weersink EJM, Altenburg J, Majoor CJ, Maitland-van der Zee AH, Bos LDJ. 2021. Lumacaftor/ivacaftor changes the lung microbiome and metabolome in cystic fibrosis patients. ERJ Open Res 7:e0731-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Ghuneim LJ, Raghuvanshi R, Neugebauer KA, Guzior DV, Christian MH, Schena B, Feiner JM, Castillo-Bahena A, Mielke J, McClelland M, Conrad D, Klapper I, Zhang T, Quinn RA. 2022. Complex and unexpected outcomes of antibiotic therapy against a polymicrobial infection. ISME J 16:2065–2075. doi: 10.1038/s41396-022-01252-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wiehlmann L, Pienkowska K, Hedtfeld S, Dorda M, Tümmler B. 2017. Impact of sample processing on human airways microbial metagenomes. J Biotechnol 250:51–55. doi: 10.1016/j.jbiotec.2017.01.001. [DOI] [PubMed] [Google Scholar]
- 50.Pust MM, Wiehlmann L, Davenport C, Rudolf I, Dittrich AM, Tümmler B. 2020. The human respiratory tract microbial community structures in healthy and cystic fibrosis infants. NPJ Biofilms Microbiomes 6:61. doi: 10.1038/s41522-020-00171-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Sosnay PR, Siklosi KR, Van Goor F, Kaniecki K, Yu H, Sharma N, Ramalho AS, Amaral MD, Dorfman R, Zielenski J, Masica DL, Karchin R, Millen L, Thomas PJ, Patrinos GP, Corey M, Lewis MH, Rommens JM, Castellani C, Penland CM, Cutting GR. 2013. Defining the disease liability of variants in the cystic fibrosis transmembrane conductance regulator gene. Nat Genet 45:1160–1167. doi: 10.1038/ng.2745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Boëlle PY, Viviani L, Busson PF, Olesen HV, Ravilly S, Stern M, Assael BM, Barreto C, Drevinek P, Thomas M, Krivec U, Mei-Zahav M, Vibert JF, Clement A, Mehta A, Corvol H, French CF Modifier Gene Study Investigators, European CF Registry Working Group . 2012. Reference percentiles for FEV(1) and BMI in European children and adults with cystic fibrosis. Orphanet J Rare Dis 7:64. doi: 10.1186/1750-1172-7-64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Fournière M, Latire T, Souak D, Feuilloley MGJ, Bedoux G. 2020. Staphylococcus epidermidis and Cutibacterium acnes: two major sentinels of skin microbiota and the influence of cosmetics. Microorganisms 8:1752. doi: 10.3390/microorganisms8111752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Mainz JG, Gerber A, Arnold C, Baumann J, Baumann I, Koitschev A. 2015. Rhinosinusitis bei Mukoviszidose. HNO 63:809–820. doi: 10.1007/s00106-015-0069-7. [DOI] [PubMed] [Google Scholar]
- 55.Møller ME, Alanin MC, Grønhøj C, Aanæs K, Høiby N, von Buchwald C. 2017. Sinus bacteriology in patients with cystic fibrosis or primary ciliary dyskinesia: a systematic review. Am J Rhinol Allergy 31:293–298. doi: 10.2500/ajra.2017.31.4461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Mainz JG, Naehrlich L, Schien M, Käding M, Schiller I, Mayr S, Schneider G, Wiedemann B, Wiehlmann L, Cramer N, Pfister W, Kahl BC, Beck JF, Tümmler B. 2009. Concordant genotype of upper and lower airways P aeruginosa and S. aureus isolates in cystic fibrosis. Thorax 64:535–540. doi: 10.1136/thx.2008.104711. [DOI] [PubMed] [Google Scholar]
- 57.Dickson RP, Erb-Downward JR, Freeman CM, McCloskey L, Falkowski NR, Huffnagle GB, Curtis JL. 2017. Bacterial topography of the healthy human lower respiratory tract. mBio 8:e02287-16. doi: 10.1128/mBio.02287-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Ronchetti K, Tame JD, Paisey C, Thia LP, Doull I, Howe R, Mahenthiralingam E, Forton JT. 2018. The CF-Sputum Induction Trial (CF-SpIT) to assess lower airway bacterial sampling in young children with cystic fibrosis: a prospective internally controlled interventional trial. Lancet Respir Med 6:461–471. doi: 10.1016/S2213-2600(18)30171-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lind AL, Pollard KS. 2021. Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing. Microbiome 9:58. doi: 10.1186/s40168-021-01015-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Gelber JT, Cope EK, Goldberg AN, Pletcher SD. 2016. Evaluation of Malassezia and common fungal pathogens in subtypes of chronic rhinosinusitis. Int Forum Allergy Rhinol 6:950–955. doi: 10.1002/alr.21777. [DOI] [PubMed] [Google Scholar]
- 61.Zhao J, Schloss PD, Kalikin LM, Carmody LA, Foster BK, Petrosino JF, Cavalcoli JD, VanDevanter DR, Murray S, Li JZ, Young VB, LiPuma JJ. 2012. Decade-long bacterial community dynamics in cystic fibrosis airways. Proc Natl Acad Sci USA 109:5809–5814. doi: 10.1073/pnas.1120577109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Breiman L. 2001. Random forests. Machine Learning 45:5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
- 63.Hentschel J, Müller U, Doht F, Fischer N, Böer K, Sonnemann J, Hipler C, Hünniger K, Kurzai O, Markert UR, Mainz JG. 2014. Influences of nasal lavage collection-, processing- and storage methods on inflammatory markers—evaluation of a method for non-invasive sampling of epithelial lining fluid in cystic fibrosis and other respiratory diseases. J Immunol Methods 404:41–51. doi: 10.1016/j.jim.2013.12.003. [DOI] [PubMed] [Google Scholar]
- 64.Frioux C, Singh D, Korcsmaros T, Hildebrand F. 2020. From bag-of-genes to bag-of-genomes: metabolic modelling of communities in the era of metagenome-assembled genomes. Comput Struct Biotechnol J 18:1722–1734. doi: 10.1016/j.csbj.2020.06.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Pust MM, Tümmler B. 2021. Bacterial low-abundant taxa are key determinants of a healthy airway metagenome in the early years of human life. Comput Struct Biotechnol J 20:175–186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Pust MM, Tümmler B. 2021. Identification of core and rare species in metagenome samples based on shotgun metagenomic sequencing, Fourier transforms and spectral comparisons. ISME Communications 1:2. doi: 10.1038/s43705-021-00010-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Tang AC, Turvey SE, Alves MP, Regamey N, Tümmler B, Hartl D. 2014. Current concepts: host-pathogen interactions in cystic fibrosis airways disease. Eur Respir Rev 23:320–332. doi: 10.1183/09059180.00006113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Middleton PG, Mall MA, Dřevínek P, Lands LC, McKone EF, Polineni D, Ramsey BW, Taylor-Cousar JL, Tullis E, Vermeulen F, Marigowda G, McKee CM, Moskowitz SM, Nair N, Savage J, Simard C, Tian S, Waltz D, Xuan F, Rowe SM, Jain R, VX17-445–102 Study Group . 2019. Elexacaftor-tezacaftor-ivacaftor for cystic fibrosis with a single Phe508del allele. N Engl J Med 381:1809–1819. doi: 10.1056/NEJMoa1908639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Tümmler B (ed). 2022. Mutation-specific therapies in cystic fibrosis. Uni-Med, Bremen, Germany. [Google Scholar]
- 70.Whiteson KL, Bailey B, Bergkessel M, Conrad D, Delhaes L, Felts B, Harris JK, Hunter R, Lim YW, Maughan H, Quinn R, Salamon P, Sullivan J, Wagner BD, Rainey PB. 2014. The upper respiratory tract as a microbial source for pulmonary infections in cystic fibrosis. Parallels from island biogeography. Am J Respir Crit Care Med 189:1309–1315. doi: 10.1164/rccm.201312-2129PP. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Koch C, Cuppens H, Rainisio M, Madessani U, Harms H, Hodson M, Mastella G, Navarro J, Strandvik B, McKenzie S, Investigators of the ERCF . 2001. European Epidemiologic Registry of Cystic Fibrosis (ERCF): comparison of major disease manifestations between patients with different classes of mutations. Pediatr Pulmonol 31:1–12. doi:. [DOI] [PubMed] [Google Scholar]
- 72.McCague AF, Raraigh KS, Pellicore MJ, Davis-Marcisak EF, Evans TA, Han ST, Lu Z, Joynt AT, Sharma N, Castellani C, Collaco JM, Corey M, Lewis MH, Penland CM, Rommens JM, Stephenson AL, Sosnay PR, Cutting GR. 2019. Correlating cystic fibrosis transmembrane conductance regulator function with clinical features to inform precision treatment of cystic fibrosis. Am J Respir Crit Care Med 199:1116–1126. doi: 10.1164/rccm.201901-0145OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Jorth P, Ehsan Z, Rezayat A, Caldwell E, Pope C, Brewington JJ, Goss CH, Benscoter D, Clancy JP, Singh PK. 2019. Direct lung sampling indicates that established pathogens dominate early infections in children with cystic fibrosis. Cell Rep 27:1190–1204.E3. doi: 10.1016/j.celrep.2019.03.086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.CFFT Therapeutics Development Network. 2010. SOP 530.00. Sputum induction using the Nouvag nebulizer with medication cup. Cystic Fibrosis Foundation, Bethesda, MD. [Google Scholar]
- 75.Bilton D, Canny G, Conway S, Dumcius S, Hjelte L, Proesmans M, Tümmler B, Vavrova V, De Boeck K. 2011. Pulmonary exacerbation: towards a definition for use in clinical trials. Report from the EuroCareCF Working Group on outcome parameters in clinical trials. J Cyst Fibros 10(Suppl 2):S79–S81. doi: 10.1016/S1569-1993(11)60012-X. [DOI] [PubMed] [Google Scholar]
- 76.CLSI. 2009. Sweat testing: sample collection and quantitative chloride analysis. Approved guideline, 3rd ed. CLSI document C34-A3. Clinical and Laboratory Standards Institute, Wayne, PA. [Google Scholar]
- 77.Rowe SM, Clancy JP, Wilschanski M. 2011. Nasal potential difference measurements to assess CFTR ion channel activity. Methods Mol Biol 741:69–86. doi: 10.1007/978-1-61779-117-8_6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Sermet-Gaudelus I, Girodon E, Sands D, Stremmler N, Vavrova V, Deneuville E, Reix P, Bui S, Huet F, Lebourgeois M, Munck A, Iron A, Skalicka V, Bienvenu T, Roussel D, Lenoir G, Bellon G, Sarles J, Macek M, Roussey M, Fajac I, Edelman A. 2010. Clinical phenotype and genotype of children with borderline sweat test and abnormal nasal epithelial chloride transport. Am J Respir Crit Care Med 182:929–936. doi: 10.1164/rccm.201003-0382OC. [DOI] [PubMed] [Google Scholar]
- 79.Derichs N, Sanz J, Von Kanel T, Stolpe C, Zapf A, Tümmler B, Gallati S, Ballmann M. 2010. Intestinal current measurement for diagnostic classification of patients with questionable cystic fibrosis: validation and reference data. Thorax 65:594–599. doi: 10.1136/thx.2009.125088. [DOI] [PubMed] [Google Scholar]
- 80.Leeds JS, Oppong K, Sanders DS. 2011. The role of fecal elastase-1 in detecting exocrine pancreatic disease. Nat Rev Gastroenterol Hepatol 8:405–415. doi: 10.1038/nrgastro.2011.91. [DOI] [PubMed] [Google Scholar]
- 81.Miller MR, Hankinson J, Brusasco V, Burgos F, Casaburi R, Coates A, Crapo R, Enright P, van der Grinten CP, Gustafsson P, Jensen R, Johnson DC, MacIntyre N, McKay R, Navajas D, Pedersen OF, Pellegrino R, Viegi G, Wanger J, ATS/ERS Task Force . 2005. Standardisation of spirometry. Eur Respir J 26:319–338. doi: 10.1183/09031936.05.00034805. [DOI] [PubMed] [Google Scholar]
- 82.Quanjer PH, Stanojevic S, Cole TJ, Baur X, Hall GL, Culver BH, Enright PL, Hankinson JL, Ip MS, Zheng J, Stocks J, ERS Global Lung Function Initiative . 2012. Multi-ethnic reference values for spirometry for the 3–95-yr age range: the global lung function 2012 equations. Eur Respir J 40:1324–1343. doi: 10.1183/09031936.00080312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Janhsen WK, Arnold C, Hentschel J, Lehmann T, Pfister W, Baier M, Böer K, Hünniger K, Kurzai O, Hipler UC, Mainz JG. 2016. Colonization of CF patients' upper airways with S. aureus contributes more decisively to upper airway inflammation than P. aeruginosa. Med Microbiol Immunol 205:485–500. doi: 10.1007/s00430-016-0463-0. [DOI] [PubMed] [Google Scholar]
- 84.Rosenboom I, Scheithauer T, Friedrich FC, Pörtner S, Pust MM, Sifakis K, Wehrbein T, Rosenhahn B, Wiehlmann L, Chhatwal P, Tümmler B, Davenport CF. 2022. Wochenende – modular and flexible alignment-based shotgun metagenome analysis. BMC Genomics 23:748. doi: 10.1186/s12864-022-08985-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Pounds S, Cheng C. 2006. Robust estimation of the false discovery rate. Bioinformatics 22:1979–1987. doi: 10.1093/bioinformatics/btl328. [DOI] [PubMed] [Google Scholar]
- 86.Mangiafico S. 2016. An R companion for the handbook of biological statistics. rcompanion.org/rcompanion/.
- 87.Asnicar F, Weingart G, Tickle TL, Huttenhower C, Segata N. 2015. Compact graphical representation of phylogenetic data and metadata with GraPhlAn. Peer J 3:e1029. doi: 10.7717/peerj.1029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Segata N, Waldron L, Ballarini A, Narasimhan V, Jousson O, Huttenhower C. 2012. Metagenomic microbial community profiling using unique clade-specific marker genes. Nat Methods 9:811–814. doi: 10.1038/nmeth.2066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Dixon P. 2003. VEGAN, a package of R functions for community ecology. J Veg Sci 14:927–930. doi: 10.1111/j.1654-1103.2003.tb02228.x. [DOI] [Google Scholar]
- 90.Liaw A, Wiener M. 2002. Classification and regression by randomForest. R News 2:18–22. [Google Scholar]
- 91.Kursa MB, Rudnicki WR. 2010. Feature selection with the Boruta package. J Stat Softw 36:1–13. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The metagenome sequencing data sets have been deposited in the European Nucleotide Archive (ENA) (study accession no. PRJEB38221, PRJEB52317, and PRJEB52822) hosted by the European Bioinformatics Institute (EMBL-EBI) (www.ebi.ac.uk). The pipeline Wochenende is available at https://github.com/MHH-RCUG/nf_wochenende. The data integration tool Haybaler is at https://github.com/MHH-RCUG/haybaler. Our reference masking tool Blacklister is at https://github.com/colindaven/blacklister. The reference database is publicly available from https://drive.google.com/drive/folders/1q1btJCxtU15XXqfA-iCyNwgKgQq0SrG4?usp=sharing. The STORMS checklist for this article is available from Zenodo (10.5281/zenodo.7053226).