Proper management of polymicrobial infections in patients with cystic fibrosis (CF) has extended their life span. Information about the composition and dynamics of each patient’s microbial community aids in the selection of appropriate treatment of pulmonary exacerbations. We propose the cystic fibrosis rapid response (CFRR) as a fast approach to determine viral and microbial community composition and activity during CF pulmonary exacerbations. The CFRR potential is illustrated with a case study in which a cystic fibrosis fatal exacerbation was characterized by the presence of shigatoxigenic Escherichia coli. The incorporation of the CFRR within the CF clinic could increase the life span and quality of life of CF patients.
KEYWORDS: Shiga toxins, clinical metagenomics, cystic fibrosis, metabolomics, metatranscriptome
ABSTRACT
Pulmonary exacerbations are the leading cause of death in cystic fibrosis (CF) patients. To track microbial dynamics during acute exacerbations, a CF rapid response (CFRR) strategy was developed. The CFRR relies on viromics, metagenomics, metatranscriptomics, and metabolomics data to rapidly monitor active members of the viral and microbial community during acute CF exacerbations. To highlight CFRR, a case study of a CF patient is presented, in which an abrupt decline in lung function characterized a fatal exacerbation. The microbial community in the patient’s lungs was closely monitored through the multi-omics strategy, which led to the identification of pathogenic shigatoxigenic Escherichia coli (STEC) expressing Shiga toxin. This case study illustrates the potential for the CFRR to deconstruct complicated disease dynamics and provide clinicians with alternative treatments to improve the outcomes of pulmonary exacerbations and expand the life spans of individuals with CF.
INTRODUCTION
Cystic fibrosis (CF) is a recessive genetic disease in which defects or deficits in the cystic fibrosis transmembrane conductance regulator (CFTR) protein result in disease phenotypes of the pancreas, sweat glands, and reproductive, respiratory, and digestive systems (1). In the lungs of individuals with CF, mucociliary clearance is impaired, which promotes chronic polymicrobial infections (2). Antibiotic treatments and proper disease management have extended the average life span of CF patients; nevertheless, these polymicrobial lung infections are still the primary cause of morbidity and mortality (3). Common bacteria that colonize CF lungs over the long-term include Pseudomonas aeruginosa, Staphylococcus aureus, Haemophilus influenzae, Burkholderia cepacia complex, Rothia mucilaginosa, and Streptococcus spp. (4–7), but every CF individual presents a unique microbial community that changes over time (8–10). This highlights the need to characterize the microbial communities in each CF individual.
Microbial community dynamics in CF lungs follow the climax attack model (CAM) (11, 12), in which a climax community is acclimated to the host and dominates during stable periods and a transient attack community is associated with exacerbations. Attack communities are virulent and either colonize the CF lungs from an external source or are already present in the CF lungs and become active during exacerbations. In the CAM, attack communities lead to cystic fibrosis pulmonary exacerbations (CFPEs), declines in lung function, and eventually death. Preventing CFPE relies on quickly identifying attack viral and microbial communities and the genes that they carry and express, such as those encoding specific toxins (13), to efficiently tailor antimicrobial therapies.
Here we propose the cystic fibrosis rapid response (CFRR), a strategy for determining microbial dynamics during CFPE. This strategy is a personalized multi-omics approach that uses viromes (14), metagenomes, metatranscriptomes (15), and metabolomes (7, 16) from longitudinal samples to monitor the whole microbial community, particularly its active members and their metabolic products. Using the CFRR to obtain personalized taxonomic and functional profiles of the lung microbial communities would provide clinicians with comprehensive information about each patient’s viral and microbial ecosystem. This information allows clinicians to generate testable hypotheses, test those hypotheses using standard clinical tests, and propose specific clinical interventions (e.g., precisely targeted antibiotic therapy) to improve CFPE outcomes.
The ability to generate multi-omic data sets and analyze large amounts of data in a clinically relevant time frame (i.e., ≤48 h) makes the CFRR approach applicable in CF clinical practice, especially in clinics closely related to research institutions. It requires access to a sequencing instrument, a mass spectrometer, computational resources, and specialized personnel in each one of these areas. In an optimal situation, the time between sample collection and data interpretation is 30 h for metabolomes (17), 38 h for metagenomes and metatranscriptomes, and 48 h for viromes. These times are expected to shorten as technologies improve. The rapid decrease in sequencing costs (18) and incorporation of sequencing cores within hospitals (19) will increase CF patients’ accessibility to the CFRR in the foreseeable future.
A case study is presented to demonstrate the potential of the CFRR strategy. A 37-year-old male CF patient (CF01) was monitored over a 2-year period with metagenomes, metatranscriptomes, and metabolomes. Integrating the information from these sources led to the identification of an attack community in which a strain of Escherichia coli that likely produced Shiga toxin was detected during a fatal exacerbation.
RESULTS
Patient CF01 fatal exacerbation expedited monitoring: metatranscriptomes and metabolomes.
An overall decline in lung function was observed in patient CF01 during his last year of life, and four CFPEs were reported. In the last month of life, 10% of the predicted median forced expiratory volume in 1 s (FEV1) was lost (Fig. 1A). During the last exacerbation, the patient was hospitalized at the intensive care unit (ICU) for 7 days and then died. The fatal exacerbation was characterized by severe lung tissue damage (Fig. 1D; see also Table S1A in the supplemental material), an increase in white blood cell counts (Fig. 1B and Table S1B), and a general decline in health. During the fatal exacerbation, clinical microbiology laboratory cultures from sputum samples tested positive for P. aeruginosa, Stenotrophomonas maltophilia, Aspergillus terreus, and yeast (Fig. 1C and Table S1C). Treatment alternated between the antibiotics aztreonam and azithromycin, in addition to a sulfonamide and a quinolone; at the ICU, colistin and meropenem were administered (Table S1D), but no improvement was observed.
The CFRR strategy was launched to rapidly identify the cause of the CFPE. Sputum samples were collected 7 and 8 days before death (samples D-7 and D-8). In samples D-7 and D-8, active members of the microbial community were determined using metatranscriptomics. In sample D-8, small-molecule profiles (using metabolomics) were characterized, and a total DNA metagenome was sequenced.
Metatranscriptomics data from sample D-8 showed that the most abundant microbial rRNAs belonged to the genera Bacillus (29.9%), Escherichia-Shigella (23.9%), Streptococcus (11.6%), Salmonella (6.9%), and Lactococcus (4.4%), among other genera (23.3%) (Fig. S1A). The microbial mRNA composition was dominated by the genus Pseudomonas (97.1%), followed by Stenotrophomonas (1.9%) and Escherichia (0.07%) (Fig. S1B). At species-level resolution, the most abundant bacterial genomes (based on total RNA) (Fig. 2) were Bacillus sp., shigatoxigenic E. coli (STEC), Salmonella enterica serovar Infantis, P. aeruginosa, and S. maltophilia. Enterobacterial phage SP6, Pseudomonas phages, and Stenotrophomonas phage S1 were also detected. Two members of the phylum Ascomycota were identified: Candida albicans and Aspergillus fumigatus. Metagenomics data of sample D-8 identified Pseudomonas (98.5%) as the dominant bacterial genus (Fig. S5A).
The presence of Escherichia-Shigella in the lungs of a CF patient is unusual, and thus, a detailed analysis was performed to further resolve the taxonomy at the strain level. Strain-level analysis identified that E. coli present in patient CF01’s lungs was most closely related to the genome of E. coli (STEC) B2F1. This strain typically carries the Shiga toxin 1 and Shiga toxin 2 genes, both of which were identified in the metatranscriptomes (Fig. 3B and C). Furthermore, the Shiga toxin receptor globotriaosylceramide (Gb3) was detected in the metabolome from sample D-8 (Fig. 3A). This suggests that Shiga toxin and its Gb3 target were being produced in the lungs of patient CF01. Gb3 is produced in human cells by Gb3 synthase, which adds a sugar to a lactosylceramide molecule. Ceramide is produced by sphingomyelinase (SMase) in the host cell or by the action of bacterially encoded SMase (see Fig. 5B). The gene that encodes a P. aeruginosa secreted SMase, the hemolytic phospholipase C (PlcH) (20), was detected in the sample D-8 metatranscriptome (Fig. S2B).
In a longitudinal metabolomics data set, Gb3 was highly abundant (P < 0.0001) in sample D-8 but was in low abundance in the prior samples (Fig. 3A). The Gb3 precursor lactosylceramide (18:1/16:0) (21) and its ceramide donor sphingomyelin (18:1/16:0) (22) were abundant in all samples throughout the longitudinal data set (Fig. 3A and Table S2A). These data demonstrate that Gb3 precursors were present for at least a year before the fatal exacerbation, but Gb3 was produced in significantly high quantities 8 days before death (sample D-8).
Gb3 levels positively correlate with Shiga toxin levels (23), although the mechanism behind this positive correlation is not clear. Gb3 is the only known functional receptor for Shiga toxins (24), and Shiga toxins induce reorganization of lipids in the epithelial cell’s membrane. Shiga toxin B can bind up to 15 Gb3 molecules (25), and this binding results in the aggregation of Gb3 in lipid rafts. The aggregation of Gb3 in lipid rafts promotes a negative membrane curvature and internalization of Shiga toxin (26). The spatial distribution of Gb3 in the cell membrane has a regulatory role in its presentation (27); thus, higher recruitment of Gb3 in lipid rafts may induce the production of more Gb3.
Antibiotic resistance genes were detected in the metatranscriptomes of samples D-8 and D-7. Transcripts encoding all the protein components were identified for two RND-type multidrug exporters, MexGHI-OpmD (28) and MexA-MexB-OprM (29), previously described in Pseudomonas, as well as the tetracycline efflux pump Tet(C), previously described in Achromobacter. Transcripts encoding several beta-lactamases were identified, such as TEM-116, PDC-3, OXA-50, and BEL-3 (30), which are typically found in Pseudomonas, and CTX-M-21 (31), which is usually found in Enterobacteriaceae. Transcripts encoding enzymes that are involved in resistance to macrolide, aminoglycoside, lincosamide, diaminopyrimidine, and glycopeptide antibiotics were detected; these enzymes were previously described in Pseudomonas, Achromobacter, Escherichia, Streptomyces, Paenibacillus, Clostridium, and Morganella (Table S3A).
A partial P. aeruginosa genome sequence was recovered by assembling reads from the fatal exacerbation metatranscriptomes (samples D-8 and D-7) into contigs and then mapping those contigs to the P. aeruginosa PAO1 reference genome (Fig. S2A). In the resulting P. aeruginosa CF01 contigs, 38 genes related to resistance to antibiotics and toxic compounds were identified (Table S3B). Two prophages were also identified in the assembled P. aeruginosa CF01 contigs (samples D-8 and D-7); one was complete, and the second one was a partial prophage (Fig. S2C and D).
Bacterial small-molecule profiles before and during fatal exacerbation.
Longitudinal metabolomic data from patient CF01’s historical samples and fatal exacerbation sample D-8 were compared to metabolic profiles from six pathogenic bacterial isolates previously detected in CF sputum (P. aeruginosa VVP172, Enterococcus sp. strain VVP100, E. coli VVP427, Streptococcus sp. strain VVP047, Stenotrophomonas sp. strain VVP327, and S. aureus VVP270). The goal was to identify metabolites produced by pathogenic bacteria and track how changes in their abundances might have preceded the fatal exacerbation. Metabolites from these pathogens were consistently detected throughout the longitudinal samples. In sample D-8, there was an increase (P < 0.001) in the number of metabolites that matched P. aeruginosa VVP172, E. coli VVP427, Streptococcus sp. VVP047, and S. aureus VVP270 (Fig. S3 and Table S2B).
Active members of the microbial community during a stable period and the fatal exacerbation.
Analysis of metatranscriptomes from a stable period 10 and 9 months before the fatal exacerbation event (samples D-303 and D-279) identified several differences between this stable period and the fatal exacerbation. First, the phylum Firmicutes was the most active phylum during the stable period, whereas the phylum Proteobacteria was the most active during exacerbation (Fig. 4A). Second, samples from the stable period showed an active microbial community that was more even and diverse than the community in exacerbation samples (Fig. 4D). Third, transcripts from Pseudomonas were detected at very low levels in stable samples (average relative abundance, 3%) but at high levels in exacerbation samples (average relative abundance, 37%) (Fig. S2A). Fourth, the percentages of unclassified sequences were higher in stable samples D-303 and D-279 (40.9% and 39.0%) than in exacerbation samples D-8 and D-7 (27.6% and 17.0%). Fifth, a higher fractional abundance of bacteriophages was detected in the fatal exacerbation samples than in the stable ones. Enterobacterial phage SP6, several Pseudomonas phages (Fig. 4B), and sarcoma viruses (Fig. 4C) were the dominant viruses in samples D-8 and D-7.
Microbial community dynamics during a nonfatal exacerbation.
Two years before the fatal exacerbation, patient CF01’s lung function declined faster than in previous years (Fig. S4A). The rate of lung function change in the last 2 years of life was −9.75 FEV1%/year (Fig. S4C). The overall rate of lung function change during patient CF01’s last 14 years of life was −1.39 FEV1%/year. During a 2-year period of 4 and 3 years before death, the rate of lung function change was 1.30 FEV1%/year (Fig. S4B).
During the 2-year period leading up to the fatal exacerbation, seven exacerbation events were reported, and sputum samples were periodically screened for fungi and bacteria at the clinical microbiology laboratory (Table S1C). P. aeruginosa was detected in all samples. Six months before the fatal exacerbation, S. maltophilia was detected, and during the last 2 months of life, Enterobacter cloacae was detected. A. terreus was detected in two samples in the last 6 months of life. Yeast was detected in all screened samples, except for the final exacerbation samples. Based on this information, several antibiotics were prescribed to manage the exacerbations (Table S1D); these included monobactams, macrolides, quinolones, beta-lactams, sulfonamides, and a cationic polypeptide.
Two years before patient CF01’s death, metagenomics was used to monitor the microbial composition of the respiratory tract during an exacerbation event, the subsequent antibiotic treatment (samples D-724 to D-718), and a stable period that followed (samples D-409 and D-286) (Fig. S5A). The bacterial genera that best differentiated between samples collected during periods of antibiotic treatment (D-722 to D-718) and no antibiotic treatment (D-724 and D-723) were Rothia, Campylobacter, Veillonella, and Prevotella (Fig. S6). The antibiotics prescribed during this exacerbation were a fluoroquinolone (ciprofloxacin) and a tetracycline (doxycycline). Clinical microbiology laboratory tests performed on sample D-719 were positive for P. aeruginosa, Pseudomonas fluorescens, A. fumigatus, and yeast (Table S1C). Exacerbation and stable samples had Streptococcus phages, Staphylococcus phages, and Pseudomonas phages, whereas only exacerbation samples had a Shiga toxin-converting phage (Fig. S5B), and stable samples had higher abundances of herpesviruses (Fig. S5D).
DISCUSSION
The unusually fast decline of patient CF01 led to the implementation of the CFRR. During patient CF01’s fatal exacerbation, E. coli mRNA, rRNA, and metabolites were detected, which demonstrated not only the presence but also the activity of shigatoxigenic E. coli. The identification of a shigatoxigenic E. coli strain is supported by rRNA (36,590 unique rRNA sequences in metatranscriptome D-8), mRNA (1,412 E. coli mRNA reads in metatranscriptome D-8 and 11 partial mRNA reads with 60% identity to STEC BRF1), and metabolites (10 metabolome spectra matched to E. coli in sample D-8). The presence of STEC in the lungs of a CF patient was alarming, as this strain causes severe damage to the lung epithelium (32, 33). Moreover, interactions between Shiga toxin and the host epithelium were inferred from metabolomes. The molecule Gb3, the receptor for Shiga toxin, showed an increase of 3 orders of magnitude during the fatal exacerbation (sample D-8), compared to previous samples.
Altogether, these multi-omics data support the following model of microbial dynamics that caused patient CF01’s death. At the beginning of the fatal exacerbation, STEC produced Shiga toxin that remained inside the bacterial cells. Later in the exacerbation, STEC’s cell membranes were disrupted, and the Shiga toxin was released (Fig. 5B). This release may have been triggered by the action of the cationic polypeptide colistin (34). Next, the toxin was taken up by lung epithelial cells through the host membrane receptor globotriaosylceramide (35) (Fig. 5C). Inside the lung epithelial cells (32), Shiga toxin inhibited host translation by blocking the ribosomes, thereby inducing cell death, necrosis, and an acute inflammatory response (32, 36, 37). The immune response and lung tissue damage were evident in the chest X rays and the increase in white blood cells (samples D-8 and D-1) (Fig. 1D).
During the fatal exacerbation, STEC led the attack community that ultimately destabilized the climax community, a phenomenon previously reported in CF exacerbations (11); this resulted in declines of evenness (diversity index that quantifies how equal the community is [38]) and diversity (the number of different species in a community [39]), a switch from a community dominated by Firmicutes to one dominated by Proteobacteria, and transcription of enterobacterial and Pseudomonas bacteriophages and sarcoma viruses. This event was followed by a Pseudomonas and Stenotrophomonas bloom, characterized by active transcription, as both rRNA and mRNA were detected, as was an increase in their metabolites. Pseudomonas was the most active member of the microbial community, with an mRNA abundance of 97%, followed by 1.92% for Stenotrophomonas mRNA. Bacillus was either lysed or dormant, as only rRNA was detected. A feature that may have contributed to the success of Pseudomonas was its resistance to multiple antibiotics, as detected by the transcription of over 38 antibiotic resistance genes. This scenario is congruent with the one described by the clinical laboratory, as cultures positive for Pseudomonas and Stenotrophomonas were reported during the fatal exacerbation.
Additional dynamics such as bacteriophage induction may have happened during the fatal exacerbation, as active transcription was detected from enterobacterial phage SP6 and Pseudomonas bacteriophages. Bacteriophage induction is known to play a role in the control of bacterial populations in CF lungs (40).
CFRR for polymicrobial infection management, the importance of historical samples, and a fast sample-to-result strategy.
The CFRR emerged from the need to investigate the cause of acute exacerbations. The power of the CFRR is shown in the information obtained for the patient CF01 case study. The CFRR is ideal for medical centers closely associated with research facilities where the equipment is available. However, as technologies improve and become more accessible, the CFRR could be implemented within the clinic.
A key component of the CFRR strategy is the comparison between acute exacerbations and stable periods. Because CF microbial communities are heterogeneous, a baseline needs to be determined for each patient. Longitudinal samples are essential to identify the changes in the microbial community and metabolites during acute exacerbations.
In the presented patient CF01 case study, historical samples were essential to differentiate the attack community that led to a fatal exacerbation from the attack community associated with a nonfatal exacerbation. The increase in Gb3 abundance during patient CF01’s fatal exacerbation (Fig. 3) was detected by comparing its abundances in historical samples. In the case of metabolites, a baseline is necessary for each CF patient because for many compounds, the basal levels are not known. Accumulation of ceramides and sphingomyelins is observed in CF lungs (41). In particular, levels of sphingomyelins, ceramides, and lactosylceramide are significantly higher in CF lungs than in non-CF ones (42).
A challenging component of the CFRR is the collection and storage of historical samples. Sputum samples intended for virome, metagenome, and metabolome (43) analyses are stable if stored at −20°C or −80°C. Metatranscriptomes are prone to RNA degradation, and sputum collection intended for this purpose requires RNA stabilization prior to −20°C or −80°C storage. Given these considerations, each patient can be provided with a non-thaw-cycle −20°C freezer where individual raw sputum samples can be stored for viromes, metagenomes, and metabolomes (see Fig. S7A in the supplemental material). Sputum samples for metatranscriptomes can be collected during the patient’s visit to the CF clinic, where immediately after collection, the RNA integrity is preserved by adding TRIzol or RNAlater. RNA should then be extracted as soon as possible. A sampling scheme in which a higher resolution of samples is desired close to an acute exacerbation and fewer samples are desired far away from the exacerbation event is proposed (Fig. S7).
Historical samples collected by the patient at home or during routine visits to the clinic are a valuable resource in the event of an acute exacerbation. In these cases, historical samples would be processed along with those from acute exacerbations in the CFRR pipeline (Fig. 6), and valuable information would be obtained in less than 48 h. This information is then analyzed by a multidisciplinary scientific team along with the clinician to (i) validate the multi-omics findings with approved clinical tests and (ii) identify appropriate therapeutic options.
The information presented by the CFRR to the clinician is more detailed than that provided by classical clinical microbiology. A clear understanding of how this information is obtained and the exploratory nature of the findings needs to be considered when interpreting the results. Discussion among clinicians and experts on the benefits and limitations of each omics approach is essential to identify the elements causing CF acute exacerbations and then select the course of action to prevent a fatality. The final treatment decision is always in the hands of the clinician, who evaluates the different lines of evidence for each finding and considers the cost-to-benefit ratio of possible therapeutic interventions. The application of the CFRR in a clinical context gives CF patients the opportunity for a better outcome based on an informed treatment decision. Another consideration when implementing the CFRR in the clinic is the availability of financial resources to perform the multi-omics strategy on exacerbation and historical samples.
Considerations about implementing the cystic fibrosis rapid response.
This was a retrospective study in which the patient’s treatment was not modified based on the presented meta-omics results. The course of action of the CFRR strategy is to provide information to clinicians so that they can evaluate and confirm the findings before proceeding with pertinent treatment modifications.
In the case of patient CF01’s fatal exacerbation, the information obtained from the CFRR strategy could have informed the course of action of the treatment with the following modifications: (i) use of different antibiotics, since the mechanism of action of colistin results in liberation of the bacterial cell contents, such as Shiga toxin, and (ii) administration of neutralizing antibodies against Shiga toxin. Colistin is a cationic polypeptide that disrupts the cell membrane of Gram-negative bacteria through a detergent-like mechanism, and it is often used in the treatment of multidrug-resistant exacerbation in patients with CF (44).
In the presented case study, only metatranscriptomes, metabolomes, and metagenomes were used to elucidate the cause of a fatal exacerbation. In future CFRR case studies, the use of viromes could be incorporated. The combination of metagenomes and viromes allows the identification of viral induction events, for example, of prophages carrying toxins. Shigatoxigenic phages are capable of lysogenic conversion (45, 46), and in the case of patient CF01’s fatal exacerbation, an early detection of Shiga toxin in the viromes of historical samples could have provided valuable information about the coding potential of the viral community.
Time is crucial during the management of CF exacerbations. The estimated execution time of the CFRR in an ideal situation with specialized staff working 24/7 is 48 h. Each step has room for improvement that would shorten the execution times. For example, real-time direct sequencing, such as Oxford Nanopore, can eventually be used for CFRR metagenomes, metatranscriptomes, and viromes. These technologies provide genomic information as it is being sequenced (47, 48), which will be ideal for the CFRR once sample preparation and data analysis are optimized for human DNA removal (49) and once large amounts of sputum starting material (400 ng of DNA needed for a Nanopore run) are no longer necessary for DNA sequencing.
Combining data from multiple omics sources enabled the identification of shigatoxigenic E. coli as the likely cause of patient CF01’s fatal exacerbation. Although these omics data were not used to alter clinical treatment of patient CF01, future applications of the CFRR are expected to provide information that is essential for improving therapy, e.g., antibiotic resistance predictions and gene expression in major attack community pathogens. Although each individual’s CF community is unique, these methods will allow for the observation of overarching trends within and between patients, for example, a loss in diversity in acute exacerbations.
MATERIALS AND METHODS
Clinical data.
Sample collection procedures and access to clinical data were approved by the institutional review boards (IRBs) of the University of California San Diego (UCSD) (HRPP 081510), and San Diego State University (IRB approval number 1711018R). Clinical microbiology, hematology, and X rays were performed during the normal care of the patient at the UCSD medical center. Spirometry tests were used to calculate the percentage of predicted FEV1 as previously described (50). Clinical status (exacerbation or stable) was determined by the clinician. Lung function dynamics were modeled using splines and linear model fitting as previously described (51).
Metagenome and metatranscriptome shotgun sequencing.
Sputum samples were collected by expectoration in a sterile cup and processed for metagenomes or metatranscriptomes as previously described (52). Metagenome libraries were constructed using a Nextera DNA library preparation kit. Metatranscriptome libraries were constructed using a TruSeq RNA library preparation kit. All libraries were sequenced on the Illumina GAIIx platform. Metatranscriptomes D-7 and D-8 were prepared using a modified procedure to obtain rRNA and mRNA in a single sequencing step, where half of the sample was depleted of rRNA using a Ribo-Zero gold kit (15) and total RNA was extracted from the other half. Both fractions were pooled in a proportion of 4:1, and a single Illumina library was then constructed and sequenced.
Sequencing data processing.
Quality filtering and dereplication were done using PRINSEQ (53) (-min_qual_mean 20 -derep 1245 -lc_method entropy -lc_threshold 50 -ns_max_p 1 -out_bad null). Cloning vector sequences were removed using SMALT (-y 0.8 -x) with 80% identity against the UniVec database (54); possible sources of cloning vector sequences are reagents used in the library preparation (55, 56). Human genome sequences were removed using BLASTn (E value of 0.1) against the human reference genome GRCh38. Metagenome and metatranscriptome data sets presented in this study are summarized in Table S1E in the supplemental material. Microbial taxonomy assignments at the genus level were made from BLASTn against the nucleotide (NT) database (E value of 0.001; the hit with the lowest E value out of 10 hits was kept) for metagenomes and KAIJU (57) for metatranscriptomes. Viral assignments were made by mapping reads against the viral reference genome database (NCBI RefSeq, release 87) using SMALT (58) with 80% identity. Fractional abundances were calculated using FRAP as previously described (59) and expressed per million reads. After quality filtering and removal of reads that mapped to the human genome, metatranscriptome D-8 reads were compared to the SILVA SSU database using BLASTn with an E value cutoff of 0.001, and taxonomy was assigned at the genus level using the best hit from 10,000 subsample replicates. Nonribosomal reads were compared to the NCBI NT database using BLASTn with an E value cutoff of 0.001. The best hit was selected and used to assign bacterial taxonomy at the genus level. Species-level assignments were determined by the genome that recruited the most reads for each genus at either the rRNA (Bacillus, Escherichia, and Salmonella) or mRNA (Pseudomonas and Stenotrophomonas) level. The bacterial genome with more hits in the BLASTn analysis was selected as the closest strain and used as the reference genome. rRNA and mRNA reads were mapped against each one of the reference genomes using SMALT with identity cutoffs of 60% and 80%, respectively, and the results were visualized using Anvi’o (60).
Reads from metatranscriptomes D-8 and D-7 were together assembled de novo using SPADES (61), and all resultant contigs were compared to the NT database using BLASTn with an E value cutoff of 0.001; taxonomies were assigned using MEGAN6 (62). Contigs identified as Pseudomonas in all metatranscriptomes were separately mapped to the reference genome of P. aeruginosa PAO1 using SMALT with an identity cutoff of 80%. Pseudomonas contigs (n = 4965; total of 2,686,355 bp) were annotated using PATRIC (63); genes identified by subsystem classification as encoding resistance to antibiotics and toxic compounds are summarized in Table S3A in the supplemental material. All contigs were screened for antibiotic resistance genes using the Resistance Gene Identifier implemented in the CARD database (30). All perfect and strict hits were retained, as was any hit with an identity of ≥80%. Metatranscriptome D-8 and D-7 reads were mapped to the proteins Shiga-like toxin subunit A and subunit B using BLASTx with an E value cutoff of 0.001 and an identity of 60%. Fragment recruitment plots were generated using custom python scripts.
Sample comparison.
Random forest, a nonparametric statistical method, was used to determine the bacterial genera that best differentiated between (i) antibiotic treatment and no antibiotic treatment in the metagenomes and (ii) stable and exacerbation states in the metatranscriptomes. The importance of each variable was assessed using the R implementation of the algorithm random forest (64), using 2,000 trees.
The R package vegan (65) was used with the metatranscriptomes to calculate Pielou’s evenness using Shannon diversity.
Metabolomics.
Liquid chromatography-tandem mass spectrometry (LC-MS/MS) metabolomics data were generated from sputum sample D-8 and compared to those of a set of 15 samples routinely collected from the previous 426 days. Metabolite extraction (ethyl acetate and methanol), LC-MS/MS methods, and data analysis were performed as described previously (16). Data from these same sputum samples have been reported previously (16), but the metabolites reported here were not presented in that study, making these data novel (MassiVE data set MSV000079444).
Metabolomics data processing.
Metabolomics data were analyzed using molecular networking (66) and Global Natural Products Social Molecular Networking (GNPS) (67). Molecular networking parameters were altered for this study and are as follows: cosine minimum of 0.7, 6 minimum matched peaks for spectral clustering, and precursor mass and fragment ion mass tolerance of 0.1 Da. Molecular networks were visualized using Cytoscape software (68). Molecules were annotated by searching the GNPS libraries, and specific metabolites of interest were searched for using the MS1 parent mass and then compared to the Metlin MS/MS spectral libraries (69). Area under the curve abundances of metabolites in the LC-MS/MS data were calculated using mzMine 2 software (70), using selected masses. The parameters of the feature finding were as follows: minimum time span of 0.05 min, minimum feature height of 2, and m/z tolerance of 0.05 m/z or 15.0 ppm. The chromatograms were deconvoluted, isotope peaks were grouped, and the peaks were aligned with the same ion mass tolerance and a retention time tolerance of 1 min. The final matrix of features was gap filled. All metabolite annotations based on spectral alignment are considered level 2 according to proposed minimum reporting standards for metabolomics (71).
Isolates of CF pathogens P. aeruginosa VVP172, Enterococcus sp. VVP100, Escherichia coli VVP427, Streptococcus sp. VVP047, Stenotrophomonas maltophilia VVP327, and Staphylococcus aureus VVP270 were obtained from the UCSD Center for Advanced Laboratory Medicine. These isolates were grown in artificial sputum medium according to a method described previously (12), and their metabolomes were extracted using sequential extraction with ethyl acetate and methanol (the same method as for the sputum samples described in reference 16). The LC-MS/MS data were generated with the same protocols as those for the sputum samples, and the data were uploaded to GNPS. The MS/MS data from these bacterial isolates were used individually as a reference for searching for matching spectra in patient CF01’s longitudinal sputum data. Spectral matching parameters were as follows: parent and fragment mass tolerance of 0.1, minimum matched peaks of 6, cosine of 0.7, and minimum spectral count of 3 in the data set. Spectral matches between a sputum sample file and a bacterial isolate were summed for each sample for each bacterium and plotted to identify metabolite matches through the longitudinal data sets from pathogens known to be present in patient CF01 from clinical culture history (it must be noted these isolates were obtained from CF patients in the same clinic as patient CF01 but not from patient CF01). It is unknown if specific bacterial molecules were detected.
Data availability.
Sequencing data are available at the SRA under accession number SRP173673 (72). Metabolomics data are available on GNPS with MassiVE data set MSV000079444 (73). The resulting FASTA files are available in the NCBI Sequence Read Archive (SRA) with the following accession numbers: SAMN10605049 to SAMN10605062 (n = 12).
ACKNOWLEDGMENTS
We are grateful for the support received from Argonne National Laboratories staff members for their time and access to their large-scale systems, in particular Tomas Brettin, Rick Stevens, Ross Overbeek, and Robert Olson. We are thankful to Ty Roach, Nate Robinett, Douglas Naliboff, Sandi Calhoun, and Mark Little for critical discussion of this work.
This work was supported by the Spruance Foundation. A.G.C.G. was supported by CONACyT and UCMEXUS. We declare no competing interests.
Experiments were performed by Y.W.L., S.B., and R.A.Q. Data collection, analysis, and interpretation were performed by A.G.C.G., R.A.Q., T.B., V.A.C., R.E., D.C., R.H., and D.J.C. The manuscript, figures, and tables were prepared by A.G.C.G., H.M., R.A.Q., D.J.C., and F.R. Experimental design and data interpretation were performed by D.J.C., P.D., R.E., R.A.Q., F.R., Y.W.L., and A.G.C.G.
Footnotes
Citation Cobián Güemes AG, Lim YW, Quinn RA, Conrad DJ, Benler S, Maughan H, Edwards R, Brettin T, Cantú VA, Cuevas D, Hamidi R, Dorrestein P, Rohwer F. 2019. Cystic fibrosis rapid response: translating multi-omics data into clinically relevant information. mBio 10:e00431-19. https://doi.org/10.1128/mBio.00431-19.
Contributor Information
Vanessa Sperandio, UT Southwestern Med Center Dallas.
Ryan Hunter, University of Minnesota.
Heather Bean, Arizona State University.
REFERENCES
- 1.Knowles MR, Drumm M. 2012. The influence of genetics on cystic fibrosis phenotypes. Cold Spring Harb Perspect Med 2:a009548. doi: 10.1101/cshperspect.a009548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Laguna TA, Wagner BD, Williams CB, Stevens MJ, Robertson CE, Welchlin CW, Moen CE, Zemanick ET, Harris JK. 2016. Airway microbiota in bronchoalveolar lavage fluid from clinically well infants with cystic fibrosis. PLoS One 11:e0167649. doi: 10.1371/journal.pone.0167649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Alexander BM, Petren EK, Rizvi S, Fink A, Ostrenga J, Sewall A, Loeffler D. 2016. Cystic Fibrosis Foundation patient registry annual data report. Cystic Fibrosis Foundation, Bethesda, MD. [Google Scholar]
- 4.Surette MG. 2014. The cystic fibrosis lung microbiome. Ann Am Thorac Soc 11:S61–S65. doi: 10.1513/AnnalsATS.201306-159MG. [DOI] [PubMed] [Google Scholar]
- 5.LiPuma JJ. 2010. The changing microbial epidemiology in cystic fibrosis. Clin Microbiol Rev 23:299–323. doi: 10.1128/CMR.00068-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lim YW, Schmieder R, Haynes M, Furlan M, Matthews TD, Whiteson K, Poole SJ, Hayes CS, Low DA, Maughan H, Edwards R, Conrad D, Rohwer F. 2013. Mechanistic model of Rothia mucilaginosa adaptation toward persistence in the CF lung, based on a genome reconstructed from metagenomic data. PLoS One 8:e0064285. doi: 10.1371/journal.pone.0064285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Whiteson KL, Meinardi S, Lim YW, Schmieder R, Maughan H, Quinn R, Blake DR, Conrad D, Rohwer F. 2014. Breath gas metabolites and bacterial metagenomes from cystic fibrosis airways indicate active pH neutral 2,3-butanedione fermentation. ISME J 8:1247–1258. doi: 10.1038/ismej.2013.229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Whelan FJ, Heirali AA, Rossi L, Rabin HR, Parkins MD, Surette MG. 2017. Longitudinal sampling of the lung microbiota in individuals with cystic fibrosis. PLoS One 12:e0172811. doi: 10.1371/journal.pone.0172811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zhao J, Schloss PD, Kalikin LM, Carmody LA, Foster BK, Petrosino JF, Cavalcoli JD, VanDevanter DR, Murray S, Li JZ, Young VB, LiPuma JJ. 2012. Decade-long bacterial community dynamics in cystic fibrosis airways. Proc Natl Acad Sci U S A 109:5809–5814. doi: 10.1073/pnas.1120577109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lim YW, Evangelista JS, Schmieder R, Bailey B, Haynes M, Furlan M, Maughan H, Edwards R, Rohwer F, Conrad D. 2014. Clinical insights from metagenomic analysis of sputum samples from patients with cystic fibrosis. J Clin Microbiol 52:425–437. doi: 10.1128/JCM.02204-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Conrad D, Haynes M, Salamon P, Rainey PB, Youle M, Rohwer F. 2013. Cystic fibrosis therapy: a community ecology perspective. Am J Respir Cell Mol Biol 48:150–156. doi: 10.1165/rcmb.2012-0059PS. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Quinn RA, Whiteson K, Lim Y, Salamon P, Bailey B, Mienardi S, Sanchez SE, Blake D, Conrad D, Rohwer F. 2015. A Winogradsky-based culture system shows an association between microbial fermentation and cystic fibrosis exacerbation. ISME J 9:1024–1038. doi: 10.1038/ismej.2014.234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gallant CV, Raivio TL, Olson JC, Woods DE, Storey DG. 2000. Pseudomonas aeruginosa cystic fibrosis clinical isolates produce exotoxin A with altered ADP-ribosyltransferase activity and cytotoxicity. Microbiology 146:1891–1899. doi: 10.1099/00221287-146-8-1891. [DOI] [PubMed] [Google Scholar]
- 14.Willner D, Haynes MR, Furlan M, Hanson N, Kirby B, Lim YW, Rainey PB, Schmieder R, Youle M, Conrad D, Rohwer F. 2012. Case studies of the spatial heterogeneity of DNA viruses in the cystic fibrosis lung. Am J Respir Cell Mol Biol 46:127–131. doi: 10.1165/rcmb.2011-0253OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lim YW, Schmieder R, Haynes M, Willner D, Furlan M, Youle M, Abbott K, Edwards R, Evangelista J, Conrad D, Rohwer F. 2013. Metagenomics and metatranscriptomics: windows on CF-associated viral and microbial communities. J Cyst Fibros 12:154–164. doi: 10.1016/j.jcf.2012.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Quinn RA, Lim YW, Mak TD, Whiteson K, Furlan M, Conrad D, Rohwer F, Dorrestein P. 2016. Metabolomics of pulmonary exacerbations reveals the personalized nature of cystic fibrosis disease. PeerJ 4:e2174. doi: 10.7717/peerj.2174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Quinn RA, Navas-Molina JA, Hyde ER, Song J, Vázquez-Baeza Y, Humphrey G, Gaffney J, Minich JJ, Melnik AV, Herschend J, Dereus J, Durant A, Dutton RJ, Khosroheidari M, Green C, Da Silva R, Dorrestein P, Knight R. 2016. From sample to multi-omics conclusions in under 48 hours. mSystems 1:e00038-16. doi: 10.1128/mSystems.00038-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.National Human Genome Research Institute. 2018. DNA sequencing costs: data from the NHGRI Genome Sequencing Program (GSP). NIH, Bethesda, MD. [Google Scholar]
- 19.Deurenberg RH, Bathoorn E, Chlebowicz MA, Couto N, Ferdous M, García-Cobos S, Kooistra-Smid AMD, Raangs EC, Rosema S, Veloo ACM, Zhou K, Friedrich AW, Rossen JWA. 2017. Application of next generation sequencing in clinical microbiology and infection prevention. J Biotechnol 243:16–24. doi: 10.1016/j.jbiotec.2016.12.022. [DOI] [PubMed] [Google Scholar]
- 20.Vasil ML, Stonehouse MJ, Vasil AI, Wadsworth SJ, Goldfine H, Bolcome RE III, Chan J. 2009. A complex extracellular sphingomyelinase of Pseudomonas aeruginosa inhibits angiogenesis by selective cytotoxicity to endothelial cells. PLoS Pathog 5:e1000420. doi: 10.1371/journal.ppat.1000420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Sandvig K, Dyve Lingelem AB, Skotland T, Bergan J. 2012. Shiga toxins: properties and action on cells, p 1085–1107. In Alouf J, Ladant D, Popoff MR (ed), The comprehensive sourcebook of bacterial protein toxins, 4th ed Elsevier, Waltham, MA. [Google Scholar]
- 22.Obrig TG, Seaner RM, Bentz M, Lingwood CA, Boyd B, Smith A, Narrow W. 2003. Induction by sphingomyelinase of Shiga toxin receptor and Shiga toxin 2 sensitivity in human microvascular endothelial cells. Infect Immun 71:845–849. doi: 10.1128/IAI.71.2.845-849.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Boyd B, Tyrrell G, Maloney M, Gyles C, Brunton J, Lingwood C. 1993. Alteration of the glycolipid binding specificity of the pig edema toxin from globotetraosyl to globotriaosyl ceramide alters in vivo tissue targetting and results in a verotoxin l-like disease in pigs. J Exp Med 177:1745–1753. doi: 10.1084/jem.177.6.1745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Aigal S, Claudinon J, Römer W. 2015. Plasma membrane reorganization: a glycolipid gateway for microbes. Biochim Biophys Acta 1853:858–871. doi: 10.1016/j.bbamcr.2014.11.014. [DOI] [PubMed] [Google Scholar]
- 25.Ling H, Boodhoo A, Hazes B, Cummings MD, Armstrong GD, Brunton JL, Read RJ. 1998. Structure of the Shiga-like toxin I B-pentamer complexed with an analogue of its receptor Gb3. Biochemistry 2960:1777–1788. doi: 10.1021/bi971806n. [DOI] [PubMed] [Google Scholar]
- 26.Betz J, Bielaszewska M, Thies A, Humpf H, Dreisewerd K, Karch H, Kim KS, Friedrich AW, Müthing J. 2011. Shiga toxin glycosphingolipid receptors in microvascular and macrovascular endothelial cells: differential association with membrane lipid raft microdomains. J Lipid Res 52:618–634. doi: 10.1194/jlr.M010819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Lingwood CA, Manis A, Mahfoud R, Khan F, Binnington B, Mylvaganam M. 2010. New aspects of the regulation of glycosphingolipid receptor function. Chem Phys Lipids 163:27–35. doi: 10.1016/j.chemphyslip.2009.09.001. [DOI] [PubMed] [Google Scholar]
- 28.Aendekerk S, Ghysels B, Cornelis P, Baysse C. 2002. Characterization of a new efflux pump, MexGHI-OpmD, from Pseudomonas aeruginosa that confers resistance to vanadium. Microbiology 148:2371–2381. doi: 10.1099/00221287-148-8-2371. [DOI] [PubMed] [Google Scholar]
- 29.Li XZ, Nikaido H, Poole K. 1995. Role of MexA-MexB-OprM in antibiotic efflux in Pseudomonas aeruginosa. Antimicrob Agents Chemother 39:1948–1953. doi: 10.1128/AAC.39.9.1948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Jia B, Raphenya AR, Alcock B, Waglechner N, Guo P, Tsang KK, Lago BA, Dave BM, Pereira S, Sharma AN, Doshi S, Courtot M, Lo R, Williams LE, Frye JG, Elsayegh T, Sardar D, Westman EL, Pawlowski AC, Johnson TA, Brinkman FSL, Wright GD, McArthur AG. 2017. CARD 2017: expansion and model-centric curation of the comprehensive antibiotic resistance database. Nucleic Acids Res 45:D566–D573. doi: 10.1093/nar/gkw1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Saladin M, Cao VTB, Lambert T, Donay JL, Herrmann JL, Ould-Hocine Z, Verdet C, Delisle F, Philippon A, Arlet G. 2002. Diversity of CTX-M β-lactamases and their promoter regions from Enterobacteriaceae isolated in three Parisian hospitals. FEMS Microbiol Lett 209:161–168. doi: 10.1016/S0378-1097(02)00484-6. [DOI] [PubMed] [Google Scholar]
- 32.Uchida H, Kiyokawa N, Taguchi T, Horie H, Fujimoto J, Takeda T. 1999. Shiga toxins induce apoptosis in pulmonary epithelium-derived cells. J Infect Dis 180:1902–1911. doi: 10.1086/315131. [DOI] [PubMed] [Google Scholar]
- 33.Bergan J, Dyve Lingelem AB, Simm R, Skotland T, Sandvig K. 2012. Shiga toxins. Toxicon 60:1085–1107. doi: 10.1016/j.toxicon.2012.07.016. [DOI] [PubMed] [Google Scholar]
- 34.Gupta S, Govil D, Kakar PN, Prakash O, Arora D, Das S, Govil P, Malhotra A. 2009. Colistin and polymyxin B: a re-emergence. Indian J Crit Care Med 13:49–53. doi: 10.4103/0972-5229.56048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gallegos KM, Conrady DG, Karve SS, Gunasekera TS, Herr AB, Weiss AA. 2012. Shiga toxin binding to glycolipids and glycans. PLoS One 7:e0030368. doi: 10.1371/journal.pone.0030368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Obrig TG. 2010. Escherichia coli Shiga toxin mechanisms of action in renal disease. Toxins (Basel) 2:2769–2794. doi: 10.3390/toxins2122769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Melton-Celsa AR. 2014. Shiga toxin (Stx) classification, structure, and function. Microbiol Spectr 2:EHEC-0024-2013. doi: 10.1128/microbiolspec.EHEC-0024-2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Pielou EC. 1979. Biogeography. Wiley, New York, NY. [Google Scholar]
- 39.Shannon CE, Weaver W. 1949. The mathematical theory of communication. University of Illinois, Urbana, IL. [Google Scholar]
- 40.James CE, Davies EV, Fothergill JL, Walshaw MJ, Beale CM, Brockhurst MA, Winstanley C. 2015. Lytic activity by temperate phages of Pseudomonas aeruginosa in long-term cystic fibrosis chronic lung infections. ISME J 9:1391–1398. doi: 10.1038/ismej.2014.223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Seitz AP, Grassmé H, Edwards MJ, Pewzner-Jung Y, Gulbins E. 2015. Ceramide and sphingosine in pulmonary infections. Biol Chem 396:611–620. doi: 10.1515/hsz-2014-0285. [DOI] [PubMed] [Google Scholar]
- 42.Quinn RA, Phelan VV, Whiteson KL, Garg N, Bailey BA, Lim YW, Conrad DJ, Dorrestein PC, Rohwer FL. 2016. Microbial, host and xenobiotic diversity in the cystic fibrosis sputum metabolome. ISME J 10:1483–1498. doi: 10.1038/ismej.2015.207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Wandro S, Carmody L, Gallagher T, LiPuma JJ, Whiteson K. 2017. Making it last: storage time and temperature have differential impacts on metabolite profiles of airway samples from cystic fibrosis patients. mSystems 2:e00100-17. doi: 10.1128/mSystems.00100-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, Sajed T, Johnson D, Li C, Sayeeda Z, Assempour N, Iynkkaran I, Liu Y, MacIejewski A, Gale N, Wilson A, Chin L, Cummings R, Le D, Pon A, Knox C, Wilson M. 2018. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res 46:D1074–D1082. doi: 10.1093/nar/gkx1037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Moons P, Faster D, Aertsen A. 2013. Lysogenic conversion and phage resistance development in phage exposed Escherichia coli biofilms. Viruses 5:150–161. doi: 10.3390/v5010150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Krüger A, Lucchesi PMA. 2015. Shiga toxins and stx phages: highly diverse entities. Microbiology 161:451–462. doi: 10.1099/mic.0.000003. [DOI] [PubMed] [Google Scholar]
- 47.Greninger AL, Naccache SN, Federman S, Yu G, Mbala P, Bres V, Stryke D, Bouquet J, Somasekar S, Linnen JM, Dodd R, Mulembakani P, Schneider BS, Muyembe-Tamfum J-J, Stramer SL, Chiu CY. 2015. Rapid metagenomic identification of viral pathogens in clinical samples by real-time nanopore sequencing analysis. Genome Med 7:99. doi: 10.1186/s13073-015-0220-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Schmidt K, Mwaigwisya S, Crossman LC, Doumith M, Munroe D, Pires C, Khan AM, Woodford N, Saunders NJ, Wain J, O’Grady J, Livermore DM. 2017. Identification of bacterial pathogens and antimicrobial resistance directly from clinical urines by nanopore-based metagenomic sequencing. J Antimicrob Chemother 72:104–114. doi: 10.1093/jac/dkw397. [DOI] [PubMed] [Google Scholar]
- 49.Gu W, Crawford ED, O’Donovan BD, Wilson MR, Chow ED, Retallack H, DeRisi JL. 2016. Depletion of abundant sequences by hybridization (DASH): using Cas9 to remove unwanted high-abundance species in sequencing libraries and molecular counting applications. Genome Biol 17:41. doi: 10.1186/s13059-016-0904-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Hankinson IL, Odencrantz JR, Fedan KB. 1999. Spirometric reference values from a sample of the general U.S. population. Am J Respir Crit Care Med 159:179–187. doi: 10.1164/ajrccm.159.1.9712108. [DOI] [PubMed] [Google Scholar]
- 51.Conrad D, Bailey B, Hardie JA, Bakke PS, Eagan TML, Aarli BB. 2017. Median regression spline modeling of longitudinal FEV1 measurements in cystic fibrosis (CF) and chronic obstructive pulmonary disease (COPD) patients. PLoS One 12:e0190061. doi: 10.1371/journal.pone.0190061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Lim YW, Haynes M, Furlan M, Robertson CE, Harris JK, Rohwer F. 2014. Purifying the impure: sequencing metagenomes and metatranscriptomes from complex animal-associated samples. J Vis Exp 2014:52117. doi: 10.3791/52117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Schmieder R, Edwards R. 2011. Quality control and preprocessing of metagenomic datasets. Bioinformatics 27:863–864. doi: 10.1093/bioinformatics/btr026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.National Center for Biotechnology Information. 2016. The UniVec database. National Center for Biotechnology Information, Bethesda, MD. [Google Scholar]
- 55.National Center for Biotechnology Information. 2016. Contamination in sequence databases. National Center for Biotechnology Information, Bethesda, MD. [Google Scholar]
- 56.Woyke T, Sczyrba A, Lee J, Rinke C, Tighe D, Clingenpeel S, Malmstrom R, Stepanauskas R, Cheng J-F. 2011. Decontamination of MDA reagents for single cell whole genome amplification. PLoS One 6:e26161. doi: 10.1371/journal.pone.0026161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Menzel P, Ng KL, Krogh A. 2016. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat Commun 7:11257. doi: 10.1038/ncomms11257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Wellcome Sanger Institute. 2010. SMALT manual. Wellcome Sanger Institute, Hinxton, United Kingdom. [Google Scholar]
- 59.Cobián Güemes AG, Youle M, Cantú VA, Felts B, Nulton J, Rohwer F. 2016. Viruses as winners in the game of life. Annu Rev Virol 3:197–214. doi: 10.1146/annurev-virology-100114-054952. [DOI] [PubMed] [Google Scholar]
- 60.Eren AM, Esen ÖC, Quince C, Vineis JH, Morrison HG, Sogin ML, Delmont TO. 2015. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ 3:e1319. doi: 10.7717/peerj.1319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Huson DH, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, Ruscheweyh HJ, Tappu R. 2016. MEGAN Community Edition—interactive exploration and analysis of large-scale microbiome sequencing data. PLoS Comput Biol 12:e1004957. doi: 10.1371/journal.pcbi.1004957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Wattam AR, Davis JJ, Assaf R, Boisvert S, Brettin T, Bun C, Conrad N, Dietrich EM, Disz T, Gabbard JL, Gerdes S, Henry CS, Kenyon RW, Machi D, Mao C, Nordberg EK, Olsen GJ, Murphy-Olson DE, Olson R, Overbeek R, Parrello B, Pusch GD, Shukla M, Vonstein V, Warren A, Xia F, Yoo H, Stevens RL. 2017. Improvements to PATRIC, the all-bacterial bioinformatics database and analysis resource center. Nucleic Acids Res 45:D535–D542. doi: 10.1093/nar/gkw1017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Liaw A, Wiener M. 2002. Classification and regression by randomForest. R News 2–3:18–22. [Google Scholar]
- 65.Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D. 2017. vegan: community ecology package. 2.4-3. [Google Scholar]
- 66.Watrous J, Roach P, Alexandrov T, Heath BS, Yang JY. 2012. Mass spectral molecular networking of living microbial colonies. Proc Natl Acad Sci U S A 109:E1743–E1752. doi: 10.1073/pnas.1203689109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wang M, Carver JJ, Phelan VV, Sanchez LM, Garg N, Peng Y, Nguyen DD, Watrous J, Kapono CA, Luzzatto-Knaan T, Porto C, Bouslimani A, Melnik AV, Meehan MJ, Liu W-T, Crüsemann M, Boudreau PD, Esquenazi E, Sandoval-Calderón M, Kersten RD, Pace LA, Quinn RA, Duncan KR, Hsu C-C, Floros DJ, Gavilan RG, Kleigrewe K, Northen T, Dutton RJ, Parrot D, Carlson EE, Aigle B, Michelsen CF, Jelsbak L, Sohlenkamp C, Pevzner P, Edlund A, McLean J, Piel J, Murphy BT, Gerwick L, Liaw C-C, Yang Y-L, Humpf H-U, Maansson M, Keyzers RA, Sims AC, Johnson AR, Sidebottom AM, Sedio BE, et al. 2016. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat Biotechnol 34:828–837. doi: 10.1038/nbt.3597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. 2003. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Smith CA, O’Maille G, Want EJ, Qin C, Trauger SA, Brandon TR, Custodio DE, Abagyan R, Siuzdak G. 2005. METLIN: a metabolite mass spectral database. Ther Drug Monit 27:747–751. doi: 10.1097/01.ftd.0000179845.53213.39. [DOI] [PubMed] [Google Scholar]
- 70.Pluskal T, Castillo S, Villar-Briones A, Oresic M. 2010. MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinformatics 11:395. doi: 10.1186/1471-2105-11-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Sumner LW, Amberg A, Barrett D, Beale MH, Beger R, Daykin CA, Fan TW-M, Fiehn O, Goodacre R, Griffin JL, Hankemeier T, Hardy N, Harnly J, Higashi R, Kopka J, Lane AN, Lindon JC, Marriott P, Nicholls AW, Reily MD, Thaden JJ, Viant MR. 2007. Proposed minimum reporting standards for chemical analysis: Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI). Metabolomics 3:211–221. doi: 10.1007/s11306-007-0082-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Rohwer F. 2016. Cystic fibrosis case studies. National Center for Biotechnology Information SRA https://www.ncbi.nlm.nih.gov/sra/?term=SRP173673 (accession no. SRP173673).
- 73.Rohwer F, Dorrestein P. 2019. GNPS CF1 4y longitudinal. GNPS http://proteomics2.ucsd.edu/ProteoSAFe/result.jsp?task=0e8c1c0bc22745519be9d7147e74eff4&view=advanced_view#{} (data set MSV000079444).
- 74.Brasfield D, Hicks G, Soong S, Peters J, Tiller R. 1980. Evaluation of scoring system of the chest radiograph in cystic fibrosis. Am J Roentgenol 134:1195–1198. doi: 10.2214/ajr.134.6.1195. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing data are available at the SRA under accession number SRP173673 (72). Metabolomics data are available on GNPS with MassiVE data set MSV000079444 (73). The resulting FASTA files are available in the NCBI Sequence Read Archive (SRA) with the following accession numbers: SAMN10605049 to SAMN10605062 (n = 12).