Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2024 Jan 11;19(1):e0290062. doi: 10.1371/journal.pone.0290062

The salivary microbiota of patients with acute lower respiratory tract infection–A multicenter cohort study

Matthew B Rogers 1, Ashley Harner 1, Megan Buhay 1, Brian Firek 1, Barbara Methé 1, Alison Morris 1, Octavia M Peck Palmer 1, Susan B Promes 2, Robert L Sherwin 3, Lauren Southerland 4, Alexandre R Vieira 1, Sachin Yende 1, Michael J Morowitz 1,5, David T Huang 1,*
Editor: Benjamin M Liu6
PMCID: PMC10783762  PMID: 38206940

Abstract

The human microbiome contributes to health and disease, but the oral microbiota is understudied relative to the gut microbiota. The salivary microbiota is easily accessible, underexplored, and may provide insight into response to infections. We sought to determine the composition, association with clinical features, and heterogeneity of the salivary microbiota in patients with acute lower respiratory tract infection (LRTI). We conducted a multicenter prospective cohort study of 147 adults with acute LRTI presenting to the emergency department of seven hospitals in three states (Pennsylvania, Michigan, and Ohio) between May 2017 and November 2018. Salivary samples were collected in the emergency department, at days 2–5 if hospitalized, and at day 30, as well as fecal samples if patients were willing. We compared salivary microbiota profiles from patients to those of healthy adult volunteers by sequencing and analyzing bacterial 16-rRNA. Compared to healthy volunteers, the salivary microbiota of patients with LRTI was highly distinct and strongly enriched with intestinal anaerobes such as Bacteroidaceae, Ruminococcaceae, and Lachnospiraceae (e.g., mean 10% relative abundance of Bacteroides vs < 1% in healthy volunteers). Within the LRTI population, COPD exacerbation was associated with altered salivary microbiota composition compared to other LRTI conditions. The largest determinant of microbiota variation within the LRTI population was geography (city in which the hospital was located).

Introduction

The human microbiota consists of a diverse and abundant community of microbial cells dwelling within each individual. Recent data suggest connections between perturbations of the respiratory microbiota and respiratory disease. Studies in this area have examined chronic respiratory illnesses in outpatients, such as asthma and idiopathic pulmonary fibrosis, or critically ill patients receiving mechanical ventilation [15]. However, there are limited data on patients presenting to the emergency department (ED) with acute lower respiratory tract infection (LRTI). Dysbiosis of the upper respiratory tract microbiota has been found in patients presenting to the ED with pneumonia [6], and a preponderance of a Haemophilus species has been reported in patients with chronic lung conditions hospitalized with pneumonia [7]. Acute LRTI is a common cause of ED visits, incurs a significant healthcare burden, and is an important patient population to investigate. Profiling their microbiota could provide real time, actionable, and personalized information that could be used to improve clinical outcomes for LRTI patients.

Within the oropharynx and airway, most published studies have examined sputum samples or tracheal aspirates, but the salivary microbiota is mostly underexplored [8]. Prior studies of saliva have been limited by sample size and single center design, and it is unclear whether the salivary microbiome varies with acute disease, patient characteristics, and geography [913] This knowledge gap is potentially important because saliva represents an easily accessible source for investigation and the development of diagnostic tools [14,15]. Further, past studies have indicated that the lung microbiota, which cannot be studied noninvasively, overlaps with the oral microbiota [16]. Therefore it is possible that analysis of saliva could provide important information about the lung microbiota [17], and in turn could provide information about LRTI etiology. We therefore sought to explore in a prospective multicenter cohort, the composition of the salivary microbiota in patients presenting with clinically diagnosed acute LRTI, association with baseline clinical features and outcomes, and heterogeneity across geographic regions.

Materials and methods

We conducted an investigator-initiated, multicenter prospective cohort study of adult patients with suspected acute LRTI presenting to the EDs of seven hospitals in southwestern and central Pennsylvania, southern Michigan, and central Ohio between May 2017 and November 2018. We collected a longitudinal series of saliva samples and corresponding fecal samples and associated clinical data to microbiota analyses as described below. The University of Pittsburgh and all site institutional review boards approved the protocol. Thermo Fisher Scientific provided an unrestricted grant to fund the study, received enrollment updates and reviewed a manuscript draft, but had no role in study design or manuscript writing.

Study subjects

Eligible participants were 18 years or older and had a clinical diagnosis in the ED of acute LRTI (< 28 days duration). For broad applicability we sought to enroll routine cases of uncomplicated LRTI commonly seen in the ED. We therefore excluded patients receiving antibiotics prior to ED presentation, on chronic dialysis, receiving vasopressors or mechanical ventilation in the ED, with known severe immunosuppression, with accompanying non-respiratory infection, known empyema, or metastatic cancer, or who had surgery in the past seven days, were incarcerated, homeless, or previously enrolled in this study within the past 30 days. Fecal samples were obtained by self-swabbing and patients could opt out of providing fecal samples.

We categorized the initial diagnosis of LRTI into final diagnoses of acute exacerbation of chronic obstructive pulmonary disease (COPD), asthma exacerbation, acute bronchitis, community-acquired pneumonia, and other, using the classification schema of a recent large LRTI trial [18]. This cohort was enrolled prior to the SARS-Cov-2 pandemic.

Clinical procedures

We asked participants to provide a saliva sample and a fecal sample (or a rectal swab if the patient preferred) at three timepoints—in the ED, on days 2–5 if hospitalized, and on day 30. If a patient was still hospitalized at day 30, we obtained samples in the hospital. Site coordinators sent these locally obtained samples to the University of Pittsburgh. For patients discharged from the ED or hospitalized and discharged prior to day 30, we provided discharge sample collection kits with instructions, pre-labeled sample collection supplies, and a prepaid, pre-addressed return envelope to the University of Pittsburgh. Samples were collected into Zymoresearch DNA/RNA Shield Collection tubes to avoid growth of bacteria while samples were transported at room temperature.

To encourage participant compliance with obtaining and mailing the day 30 samples and to gather post-discharge health status information, we contacted patients by phone at days 15 and 30. We compensated participants for each timepoint swabs that were obtained (ED, day 2–5, day 30), provided a study calendar with reminders for the phone calls and day 30 sample collection, required sites to collect multiple phone contacts and to notify us of patient discharge dates within 48 hours of discharge, and provided a text messaging option for patients who preferred texting versus phone calls. For saliva collection, subjects were instructed to refrain from using mouth wash, brushing their teeth, or flossing for 12 hours before sample collection. Samples were not stored for more than 20 days at ambient temperature, and all samples were shipped within 20 days of collection. Each center was sent highly detailed collection protocols in order to standardize collection methods. S1 Appendix contains illustrated details of saliva sample collection procedures. S2 Appendix details procedures for fecal sample collection. S3 Appendix is an inventory of the collection kit and contains further information on storage and shipping of samples.

All experimental protocols were approved by the University of Pittsburgh Institutional Review Board (STUDY20070139: MAPLE) and were carried out according to these guidelines. All patients or their authorized representatives provided written informed consent and indicated willingness to provide salivary samples at all pre-defined study timepoints.

We obtained clinical data via chart review by site research staff, and phone calls at days 15 and 30 by coordinating center staff. For adverse outcomes, we used the composite outcome used in a recent large trial [18] of endotracheal intubation, administration of vasopressors, renal failure, lung abscess/empyema, development of pneumonia in patients presenting with non-pneumonia LRTI, rehospitalization or death.

Healthy reference population of individuals without LRTI

For a reference group of individuals without LRTI, we used sequencing data from saliva samples collected from 290 patients without chronic health conditions enrolled in the Dental Registry and DNA Repository (DRDR) from the School of Dental Medicine at the University of Pittsburgh [19]. In brief, this project has the approval of the University of Pittsburgh Institutional Review Board and all participants provided written informed consent. Biospecimens were linked to patients’ electronic health record (EHR) data, permitting analysis of associations between oral microbiota data obtained from DNA extracted from the specimens with dental and medical conditions. All data were de-identified, and biospecimens were linked to EHRs using a unique study number rather than personal identifying information. The microbiotas of these patients were then sequenced at the Center for Medicine and the Microbiome (CMM) at the University of Pittsburgh (https://www.microbiome.pitt.edu). Henceforth we refer to the DRDR cohort as the ‘healthy’ group, and the study cohort as the LRTI group. To adjust for the effect of age on microbiota composition we excluded 13 DRDR patients under 18 years old, leaving 276 subjects / samples in our analysis.

Microbiome procedures

DNA extraction

DNA from LRTI patient samples was extracted using the Qiagen DNeasy PowerSoil HTP DNA Isolation kit. Briefly, for the saliva, fecal or rectal swab samples, the swab heads were cut off directly into individual wells of the bead plate and stored overnight at -80°C. The next day the Bead Solution and Solution C1 were added, and the plates were incubated at 65°C for 10 minutes in a bead bath. The homogenization step was performed on a Retsch Oscillating Mill MM400 with 96-well plate adaptors for 7.5 minutes at a frequency of 20Hz. The plates were rotated 180° and the cycle was repeated. The Solution C2 and C3 steps were combined (200 μl of each added) to improve DNA yield. All remaining steps followed the manufacturer’s centrifugation protocol.

DNA from the DRDR (healthy) samples were brought to room temperature and centrifuged for 5 minutes at 10,000xg. After the supernatant was removed 1 ml of extraction buffer (10mM Tris-HCL pH 7.8, 5mM EDTA, 0.5% SDS) was added to the buccal cell pellet and thoroughly mixed. A 5ul solution of proteinase K was added and the samples were incubated in a 56°C water bath overnight. After removal, the samples were vortexed and 500 ul of 10M Ammonium Acetate was added. Samples were inverted for 3 minutes and centrifuged at 21,000xg for 15 minutes. The supernatant was then transferred to two eppendorfs with an equal volume of cold isopropanol and shaken vigorously. Samples were then incubated at -20°C for a minimum of 30 minutes, after which they were centrifuged at 10,000xg for 20 minutes at 4°C. The supernatant was poured off and 1 ml of cold 70% EtOH was added, and samples were inverted 3 times. Samples were then centrifuged at 10,000xg for 5 minutes at 4°C. The supernatant was poured off and samples were allowed to air dry before 100 ul of TE buffer was added. Samples were kept in a 4°C refrigerator for 2–3 days before ensuring that the DNA was completely dissolved by reading the concentrations of the samples.

16S-rRNA gene amplicon PCR and sequencing

PCR amplification of the small subunit ribosomal RNA gene (16S-rRNA) was performed in triplicate 25 μl reactions. Amplicons were produced utilizing primers adapted for the Illumina MiSeq. Amplicons target the V4 region and primers utilized either the Illumina adapter, primer pad and linker (forward primer) or Illumina adaptor, Golay barcode, primer pad and linker (reverse primer) followed by a sequence targeting a conserved region of the bacterial 16S-rRNA gene as described by Caporaso, et al [20]. Five extraction controls were sequenced alongside the experimental samples.

Microbiota analysis

Paired-end amplicon sequence reads were demultiplexed using QIIME2 package [21]. The first 22 bps of the reads were trimmed, and both forward and reverse reads were truncated at 180 bp at which point we observed a decline in the average phred score of randomly sampled reads. The DADA2 [22] algorithm, executed within the QIIME2 package [21] was used to predict amplicon sequence variants (ASVs). ASVs were aligned using MAFFT [23], and a phylogenetic tree was generated using Fasttree. Finally a naive-bayes feature classifier was trained on the V4 region of 16-rRNA using the Greengenes database (v 13.5). Taxonomic assignment was then carried out on the ASV sequences using the feature classifier. ASV table, taxonomy table and phylogenetic tree were then loaded into R using the Phyloseq [24] import_biom function. Samples with amplicon counts less than 4,000 were removed from the analysis. The five extraction controls included in the sample preparation were searched against the Greengenes (13.5) database using BlastN [25]. Most of the extraction controls (4/5) had less than 10 reads. Only one had a read-pair matching, which matched an unknown bacterium distantly related to Veillonella. We could not find an ASV matching this amplicon in the rest of the samples. One extraction control contained 860 paired end reads. Along with common soil bacteria, this sample also included an abundance of reads matching human gut bacteria Bacteroidaceae and Enterobacteriaceae. This control was only associated with 3 patients and 5 samples. We treated these samples as contaminated and removed them from downstream analysis.

To adjust for the well documented effects of age on microbiota composition we excluded DRDR patients that were below the lower range of LRTI patients (18 years old; 13 patient samples).

Alpha diversity and beta diversity calculations were done using both Vegan [26] and Phyloseq [24]. Alpha diversity in its simplest form accounts for the richness of different ASVs within a single environment (for example, the number of bacterial species in a saliva sample from an individual patient). A more informative alpha diversity metric is the Shannon index, which quantifies not only ASV richness, but also distribution of ASV abundances. High values of Shannon diversity indicate a high ASV richness and even distribution of ASVs. Beta diversity accounts for differences between environments (for example, differences in microbial species abundance and phylogenetic distance between samples from different patients). Beta diversity is typically measured as distances between samples based on abundances of ASVs and may also include phylogenetic distances (Unifrac metric).

All statistical analyses of the microbiota data were done in R using the libraries Vegan [26], Phyloseq [24], ANCOM [27], ANCOM-BC [28] and LOCOM [29]. We wrote additional custom code for comparisons against the healthy samples. All code, ASV table, tree, ASV sequences, phylogenetic tree, control sequence data and metadata are available at https://github.com/MattRogers2/.

Statistical analyses

To compare the composition of the salivary microbiota in our LRTI study group versus healthy controls, we collected saliva samples from all timepoints, and compared alpha diversities (Shannon metric) using a Wilcoxon rank-sum test. We calculated beta diversity distances between samples using the weighted-unifrac metric [30] implemented in the Phyloseq R library [24], then observed whether samples clustered as discrete groups using principal coordinate analysis (PCoA), and tested the significance of variance between these group differences using the Permutational Multivariate Analysis of Variance test (ADONIS2) encoded in the vegan R library. As some subjects had samples taken at up to three time points we stratified the ADONIS2 permutations for each time point to adjust for multiple samples per subject. Due to significant differences in age differences in the LRTI and healthy cohorts, we included age as an interacting term in the ADONIS2 analysis comparing cohorts. Taxonomic composition was compared between groups using the Analysis of Composition of Microbiomes (ANCOM) [27] R package, Analysis of Composition of Microbiomes with Bias Correction (ANCOM-BC) [28] and logistic compositional analysis (LOCOM) [29].

We looked for differences in the salivary microbiota profiles associated with a composite endpoint of adverse outcomes within 30 days of enrollment, including hospital re-admission, development of pneumonia in a patient without pneumonia at enrollment, empyema, organ failure, endotracheal intubation, vasopressor requirement, renal failure, and death [18].

Due to a limited number of subjects with samples beyond a baseline sample, we chose to focus our comparisons on samples taken at the baseline timepoint for clinical variable comparisons. We performed a multivariable ADONIS2 analysis of 24 categorical variables taken from patient demographics and clinical features. We conducted a separate ADONIS2 analysis including the five LRTI diagnoses. ADONIS2 analyses were repeated while stratifying permutations on city in which the hospitals were located. To examine heterogeneity of the salivary microbiota across cities, we performed PCoA and pairwise ADONIS2 comparisons on the hospital city locations. We also performed multiple chi-squared analyses between clinical variables and the hospital location city to verify independence of patient clinical features from hospital location. Finally, a correlogram was generated from patient metadata and outcomes using the Pearson metric and the R corrplot library. For all statistical tests, we rejected null hypotheses above a p-value of 0.01.

Results and discussion

Results

Clinical cohort

We enrolled 150 patients with 3 requesting withdrawal, for a final analysis cohort of 147 patients. Patients were middle age (mean 51 years), 61.9% female, and 60.5% Black; 30.6% received antibiotics in the ED and 47.6% were admitted to hospital (Table 1). For saliva samples, 145 patients (98.6%) provided samples in the ED, 48 of the 70 patients admitted to hospital (68.6%) provided a day 2–5 sample, and 50 patients (34.0%) provided a day 30 sample. For fecal samples, 39 patients (26.5%) provided samples in the ED, 21 of the 70 patients admitted to hospital (32.9%) provided a day 2–5 sample, and 50 patients (34.0%) provided a day 30 sample. Fourteen patients experienced an adverse outcome within 30 days of enrollment. The mean age of the 290 patients from the healthy cohort was 35 years old, with a male to female ratio of 1.6:1.

Table 1. Clinical and demographic data collected from subjects participating in both the LRTI study and DRDR studies.
Characteristics Total (n = 147) Healthy (n = 290) Healthy > 18 yrs old
(n = 277)
Age–yr 51 ± 18.2 35±15.5 36.3± 15.1
Male sex–no. (%) 56 (38.1%) 142 (49%) 137 (49.4%)
Race/ethnicity–no. (%)
White 56 (38.1%) 212(73%) 208 (75.0%)
Black 89 (60.5%) 42(14.4%) 36 (12.9%)
Hispanic 3 (2.0%) 5(1.7%) 4 (1.4%)
Asian 0 23(7.9%) 21 (7.5%)
Other/Undisclosed 0 8(2.7%) 8 (2.8%)
Comorbidities§
Charlson comorbidity score 1.4 ± 1.2
Current smoker–no. (%) 55 (37.4%)
COPD 65 (44.2%)
Asthma 67 (45.6%)
Home medications–no. (%)
Home oxygen 18 (12.2%)
Oral corticosteroids 28 (19.0%)
Inhaled corticosteroids 46 (31.3%)
Inhaled long-acting bronchodilators 41 (27.9%)
Leukotriene receptor
antagonists
11 (7.5%)
Symptoms
Duration—days 4 ± 4.9
Cough–no. (%) 107 (72.8%)
Dyspnea–no. (%) 126 (85.7%)
Sputum production–no. (%) 63 (42.9%)
Chest discomfort–no. (%) 73 (49.7%)
Chills–no. (%) 46 (31.3%)
Clinical findings
Temperature (degrees Celsius) 36.9 ± 0.5
Heart rate 90.0 ± 14.8
Respiratory rate 20.0 ± 4.6
Mean arterial pressure (mmHg) 96.2 ± 13.7
Oxygen saturation (%) 96.0 ± 3.1
Rhonchi—no. (%) 17 (11.6%)
Wheezing—no. (%) 74 (50.3%)
Final diagnoses—no. (%) ††
Asthma 67 (45.6%)
COPD 63 (42.9%)
Acute bronchitis 22 (15.0%)
CAP 29 (19.7%)
Other 6 (4.1%)
Hospitalized–no. (%)^ 70 (47.6%)

COPD—chronic obstructive pulmonary disease; CAP—community-acquired pneumonia.

* Plus-minus values are means ± SD.

‡ Home medications defined as medications taken by patient in last 7 days.

†† 12 patients had CAP and COPD, 10 patients had CAP and asthma.

^ Mean length of stay was 5.2 days (SD 4.9).

Microbiota composition of saliva samples from study subjects differ from healthy samples in diversity and taxonomic composition

Alpha diversity was significantly higher in saliva samples from patients with LRTI relative to saliva from healthy volunteers (mean 3.56 for LRTI saliva samples vs 3.38 for healthy saliva, p-value < 0.01) (Fig 1A). Saliva and fecal samples from study subjects and saliva samples from healthy volunteers each formed a discrete cluster (Fig 1B). Differences between all groups were significant (ADONIS2, R2 = 0.164, S1B Table). Age was a significant, albeit weak contributor to variation between cohorts (R2 = 0.008). The greatest difference between body site / cohorts was found between LRTI fecal samples and healthy saliva samples (R2 = 0. 202, p-value < 0.001). As expected, variation between LRTI samples and healthy samples was also considerable (R2 = 0.126, p-value < 0.001). Relative to variation between saliva samples from the two cohorts, variation between LRTI fecal samples and LRTI saliva samples was unexpectedly lower than expected (R2 = 0.056, p-value < 0.001). Statistics for these analyses are shown in S1A, S1B and S1C Table.

Fig 1. Alpha diversity and beta diversity plots.

Fig 1

(A) Scatterplot of Alpha diversity values (Shannon metric) across groups. Mean values for each group are shown as a central red dot with non-parametric confidence limits indicated by red lines. (B) PCoA plot of weighted unifrac beta diversity distances between LRTI saliva samples, LRTI fecal samples and healthy saliva samples. Large circles indicate centroid of each group.

The taxonomic composition of most saliva samples from both cohorts largely resembled published profiles of the typical saliva microbiota, with members of Pasteurella, Spirochaetes and Streptococcus abundantly represented (Fig 2A). However, ANCOM also identified multiple taxonomic families that were differentially abundant between the healthy and LRTI saliva samples. Saliva samples from LRTI study subjects were enriched with prominent members of the normal colonic microbiota, notably the Bacteroidaceae, Ruminococcaceae and Verrucomicrobiaceae families (Fig 2B). Each taxon was relatively abundant in LRTI patient samples, but virtually absent in the healthy group. LRTI saliva samples were also enriched in the family Alcaligenaceae (which contains pathogens such as Bordetella species) and were depleted of common saliva commensals such as Micrococcaceae (Fig 2C).

Fig 2. Taxonomic composition of saliva samples by cohort and body site.

Fig 2

A) Common bacterial families by relative abundance shared between saliva samples from the healthy cohort and the LRTI cohort. B) Differentially abundant families predicted by ANCOM that are enriched in the LRTI group versus the healthy group. C) Equivalent to (B) but with families that are enriched in the healthy group. Taxa with red labels are predicted to be enriched in the LRTI group by LOCOM, while those with blue font are predicted to be depleted by LOCOM.

Analysis of taxonomic composition using LOCOM broadly agreed with the ANCOM results (S1 Fig) but some taxa that appeared to be of similar abundance were predicted to be depleted in LOCOM (see blue text in Figs 2A and S1).

Association of clinical features and outcome with the salivary microbiota

Multivariable analysis of baseline variables revealed that the variable with the greatest effect size was the city from which the sample was taken (R2 = 0.154, p-value = 0.001; S1D and S1E Table). This finding will be discussed further below. As a result of this unexpected variation in beta-diversity between city location, we stratified the ADONIS2 analysis on city of sampling variable to remove confounding effects that city location may have (S1F and S1G Table).

In the ADONIS2 analysis stratified on hospital city, we found significant variation between the salivary microbiota of baseline samples from subjects that had an LRTI classification of COPD exacerbation (R2 = 0.019, S1G Table, Fig 3). Overall, each of these variables had a minimal effect size relative to the effect of the city in which the hospitals were located. We did not find a significant difference in the salivary microbiota of patients who experienced an adverse outcome and patients who did not (R2 = 0.010, p-value = 0.171)

Fig 3. PCoA plots of baseline saliva samples by clinical variables.

Fig 3

PcoA plot of samples from subjects experiencing a COPD exacerbation, red points indicate a positive diagnosis of COPD exacerbation. COPD exacerbation was the only clinical finding that was predicted to be significantly different by ADONIS multivariable analysis (p-value < 0.01).

We performed a similar ADONIS2 analysis on baseline fecal samples using the same clinical and demographic variables. Unlike the baseline saliva samples we did not observe a significant difference in geographic location despite a modest effect size (R2 = 0.128, p-value = 0.0397). This finding may also have been due to a limited number of fecal samples (34 samples), with most (21/34) collected in Pittsburgh hospitals. Unlike saliva samples, we found no significant variation in the fecal microbiota of baseline samples when grouped by outcome or clinical/demographic variables (S1H and S1I Table).

ANCOM predicted only minor differences in taxonomic variation in saliva samples from a patient with COPD exacerbation after adjusting for hospital city location (Fig 4). In samples from subjects with COPD exacerbation, ANCOM predicted a reduced abundance of oral commensals from the families Fusobacteriaceae and Spirochaetaceae.

Fig 4. Variations in taxonomic composition by clinical variable.

Fig 4

Differences in taxonomic family composition between patients that experienced an outcome of COPD exacerbation and those that did not. City of hospital site was used as a co-variate to correct for taxonomic families that varied between hospital sites. Differentially abundant families are represented as boxplot graphs, coloured by taxon, shown in the shared legend. The COPD exacerbation diagnosis is displayed on the x-axis, and relative abundances are plotted along the y-axis.

Variation of salivary microbiota by geographic location

The variable with the greatest effect size in our comparisons of initial salivary samples within the LRTI group was the city in which the sample was collected (ADONIS2; P-value<0.001, R2 = 0.185). (S1D Table and Fig 5). Multiple pairwise ADONIS2 tests showed differences between the healthy group and all other LRTI city groups were significant at p-values < 0.001. Similarly, differences between the Pittsburgh hospitals and all other sites were significant at p-values < 0.001(S1J Table), and pairwise comparisons comparisons between all cities except Hershey vs. Detroit were significant. In addition to differences in beta-diversity between sites, we also found that Shannon alpha diversity of the Pittsburgh samples was significantly lower than the other sites, though not when compared against the healthy group. (Wilcoxon rank-sum test, p-value <0.01) (results not shown).

Fig 5. PCoA plot of Weighted Unifrac distances between all saliva samples.

Fig 5

Samples are coloured by city, each point indicates an individual sample, large points indicate the position the centroid of each hospital site group.

The microbiota composition of samples from each city compared against Pittsburgh samples also varied. Saliva samples from Pittsburgh consistently had more oral commensals, including bacteria from the Streptococcaceae, Micrococcaceae and Leptotrichiaceae families. In contrast, samples from Pittsburgh hospitals had a comparative absence of gut-associated bacteria (Ruminococcaceae, Lachnospiraceae, Bacteroidaceae) compared against those from other cities (S2 Fig), though Pittsburgh LRTI patient saliva samples were still enriched with gut associated bacteria compared to healthy controls (S3 Fig).

Clinical features of the patient populations also varied across cities (S1K Table). A total of eight clinical or demographic features failed a chi-squared test of independence when tested against city location. Most prominently of these was race of the subject (chi-squared test, p-adjusted = 6.22e-07), with only 5% white patients in Detroit, compared to 46% in Pittsburgh, and 87% in Hershey. A correlogram of clinical metadata (S4 Fig) indicated that patient metadata positively correlated with Pittsburgh as a collection city included chills, cough and a LRTI classification of community associated pneumonia or acute bronchitis. Inversely correlated was the recent use of oral or inhaled corticosteroids and steroids administered in the ED. Taken together these data indicate that patients in Pittsburgh suffered a higher rate of CAP and bronchitis. Conversely clinical features from patients in Detroit included significantly higher use of oral corticosteroid use, a history of asthma and a LRTI classification of exacerbation of asthma.

Discussion

We found that the salivary microbiota of patients presenting to the ED with acute LRTI had unique characteristics compared to healthy controls, clinical features were associated with the salivary microbiota, and that the salivary microbiota varied significantly by geography.

The high abundance of gut microbiota in the salivary microbiota of LRTI subjects was the most notable finding, and similar to recent findings of gut flora within the lung microbiota of patients with acute respiratory distress syndrome [31]. It has been suggested that this shift in microbiota results from translocation during acute illness [32]. Past studies have generally examined the lower respiratory tract rather than saliva, but one study examining the salivary microbiota in respiratory disorders also observed a high abundance of intestinal taxa (Lachnospiraceae) in salivary samples from patients with chronic COPD [12]. These findings are notable as a core microbiome principle is site specificity, meaning that microbial communities from different body sites, such as the mouth, lungs, and gastrointestinal tract, are generally distinct in a healthy state. An alternative explanation could be that lung microbiota dysbiosis predisposes an individual to an LRTI. A final possibility is that there may be a bi-directional relationship between LRTI and dysbiosis. Of note, salivary and oral microbiota dysbiosis have been found associated with not only respiratory disorders [33], but also multiple systemic disorders [34], including head and neck cancer [35], autoimmune disorders [36], and Alzheimer’s disease [37], and these relationships continue to be explored. With further study on greater numbers of individuals over extended periods of time, it may be possible to establish the presence of gut microbiota in saliva as a marker of respiratory disorders [38]. Future longitudinal studies on patients with LRTI may help shed some light on directionality of the dysbiosis / LRTI relationship.

We also found that patients who experienced a clinical outcome of COPD exacerbation were significantly different in terms of beta-diversity, though with a low effect size. Taken together, these findings provide proof of concept that salivary microbiota can differ with clinical features.

Samples from patients with a classification of COPD exacerbation also had a slight reduction in oral commensal families, including Fusobacteriaceae and Spirochaetaceae. Fusobacteriaceae and Spirochaetaceae are both contributors to dental caries and oral plaque formation, but are also highly prevalent members of the oral microbiota [39,40]. One previous study has also shown the Spirochaete genus Treponema to be enriched in healthy individuals compared against individuals with COPD [41].

The largest determinant of the salivary microbiota was the geographic location of the hospital. Similar results with fecal samples have been published and our finding is consistent with past studies demonstrating the geographic impact on microbiota composition [42]. Taxonomic composition was also noted to vary between cities, with the composition of samples from Pittsburgh most closely ressembling a healthy oral microbiota, though still enriched in gut bacteria when contrasted with the healthy cohort (also located in Pittsburgh). Other large studies with samples collected across multiple centers have not observed geographic variation in the salivary microbiota [43]. Site variation in sample collection and processing may have contributed to these differences, though we provided standardized instructions across sites. Finally, it is highly possible that city specific differences in subject populations are contributing to apparent differences in geography. Regardless, it is important to consider the geographic location of study participants when designing microbiota studies because a positive result from a single-center study may not be generalizable to other populations.

Limitations

Gut microbiota in saliva samples can indicate contamination during processing and sequencing of samples, but the absence of gut flora in saliva samples from the healthy subjects argues against this possibility. Further, past studies have indicated that extraction methodology has little impact on the composition of sampled salivary microbiota [44]. Less than ten percent of the cohort experienced an adverse outcome, limiting our ability to associate the salivary microbiota with an adverse outcome. Finally, we note that there is a significant difference in the mean and age ranges of the healthy subjects compared against the LRTI patients. Further studies on the salivary microbiome of healthy individuals will be required to determine if our observations of an enrichment of gut microbiota in the salivary microbiome of LRTI patients are the result of a difference in ages between the healthy and LRTI group.

In conclusion, we found the salivary microbiota of patients with acute LRTI differed from healthy controls mainly due to an enrichment with bacteria typically associated with gut flora. We further found a large degree of variation occurred between cities in which the samples were collected and that patients with a LRTI classification of COPD exacerbation differed significantly from other patients with LRTI. These findings offer preliminary insights into future investigations of the salivary microbiota in acute illness. With further knowledge of the immune role played by the host in LRTI associated dysbiosis and a thorough metagenomic analysis of bacterial pathways future investigations could determine how to best use an easily accessible biospecimen to classify respiratory infection, understand pathophysiology, and identify potentially clinically actionable biomarkers of infection.

Supporting information

S1 Fig. Scatter plot of differentially abundant taxa between LRTI group and control group predicted by LOCOM method.

Taxonomic families appear along the and points are distributed along the X-axis according to effect size. Size of point indicates mean relative abundance of each family.

(DOCX)

S2 Fig. Volcano plot of differentially abundant taxa between Pittsburgh baseline saliva samples and those of other cities.

Taxa predicted by ANCOM-BC to be either enriched in other cities compared against Pittsburgh. Taxa enriched in the above cities appear to the right of the dotted line and vary along the x-axis according to their log-fold difference in abundance (those to the right are higher in the compared city; those to the left are higher in Pittsburgh). The y-axis shows the -log10 (FDR) value, taxa above 0.05 significance are labelled. Blue and red shading indicate whether the family is typical is the oral or gut microbiota respectively.

(DOCX)

S3 Fig. Volcano plot of differentially abundant taxa between the healthy cohort and baseline saliva samples from the Pittsburgh LRTI cohort and those of other cities.

Taxa enriched in the above cities appear to the right of the dotted line and vary along the x-axis according to their log-fold difference in abundance (those to the right are higher in the compared city; those to the left are higher in the healthy cohort). The y-axis shows the -log10 (FDR)value, taxa above 0.05 significance are labelled. Blue and red shading indicate whether the family is typical is the oral or gut microbiota, respectively.

(DOCX)

S4 Fig. Correlogram of patient metadata and LRTI types showing relationships between variables.

Size of circles indicate strength of correlation by R2 value. Shading of circle also indicates strength of correlation by R2 value, blue shading indicates correlation, red shading indicates inverse correlation. Variables are clustered by R2 value using hclust. Only associations with a p-value < 0.05 (Pearson metric) are shown.

(DOCX)

S1 Table. Results of statistical tests.

(DOCX)

S1 Appendix. Illustrated procedures for collecting saliva samples.

(DOCX)

S2 Appendix. Illustrated procedures for collecting fecal samples.

(DOCX)

S3 Appendix. Inventory of each collection kit and information on storage and shipping of samples.

(DOCX)

Acknowledgments

Kourtney Wofford, Elizabeth Gimbel, Adam Fitch, Caroline Pidro, Vanessa M. Jackson, Meghan Allwes, Lauren Porter.

Site coordinators: Nancy Campbell, BSN, RN, Viera lab: Kathleen Deeley.

Abbreviations

CAP

Community acquired pneumonia

COPD

Chronic obstructive pulmonary disease

ED

Emergency Department

LRTI

Lower respiratory tract infection

ASV

Amplicon Sequence Variant

PCoA

Principal Coordinates Analysis

FDR

False discovery rate

Data Availability

Raw sequencing reads from this study have been deposited on the Genbank SRA database (SRR18910915-SRR18910984, Bioproject PRJNA830485). ASV tables, ASV sequences, sample metadata, and phylogenetic trees and analysis code is deposited on GitHub (https://github.com/MattRogers2/MAPLE.git).

Funding Statement

DTH received an award from Thermo Fisher to conduct this research (Grant 1#0052116).The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

References

  • 1.Huang YJ, Nelson CE, Brodie EL, Desantis TZ, Baek MS, Liu J, et al. Airway microbiota and bronchial hyperresponsiveness in patients with suboptimally controlled asthma. J Allergy Clin Immunol. 2011;127:372–381.e1-3. doi: 10.1016/j.jaci.2010.10.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Han MK, Zhou Y, Murray S, Tayob N, Noth I, Lama VN, et al. Lung microbiome and disease progression in idiopathic pulmonary fibrosis: an analysis of the COMET study. Lancet Respir Med. 2014;2:548–56. doi: 10.1016/S2213-2600(14)70069-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Dickson RP, Erb-Downward JR, Martinez FJ, Huffnagle GB. The Microbiome and the Respiratory Tract. Annu Rev Physiol. 2016;78:481–504. doi: 10.1146/annurev-physiol-021115-105238 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Haak BW, Brands X, Davids M, Peters-Sengers H, Kullberg RFJ, van Houdt R, et al. Bacterial and Viral Respiratory Tract Microbiota and Host Characteristics in Adults With Lower Respiratory Tract Infections: A Case-Control Study. Clin Infect Dis Off Publ Infect Dis Soc Am. 2022;74:776–84. doi: 10.1093/cid/ciab568 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Man WH, Clerc M, de Steenhuijsen Piters WAA, van Houten MA, Chu MLJN, Kool J, et al. Loss of Microbial Topography between Oral and Nasopharyngeal Microbiota and Development of Respiratory Infections Early in Life. Am J Respir Crit Care Med. 2019;200:760–70. doi: 10.1164/rccm.201810-1993OC [DOI] [PubMed] [Google Scholar]
  • 6.de Steenhuijsen Piters WAA, Huijskens EGW, Wyllie AL, Biesbroek G, van den Bergh MR, Veenhoven RH, et al. Dysbiosis of upper respiratory tract microbiota in elderly pneumonia patients. ISME J. 2016;10:97–108. doi: 10.1038/ismej.2015.99 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wootton DG, Cox MJ, Gloor GB, Litt D, Hoschler K, German E, et al. A Haemophilus sp. dominates the microbiota of sputum from UK adults with non-severe community acquired pneumonia and chronic lung disease. Sci Rep. 2019;9:2388. doi: 10.1038/s41598-018-38090-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Takeshita T, Nakano Y, Kumagai T, Yasui M, Kamio N, Shibata Y, et al. The ecological proportion of indigenous bacterial populations in saliva is correlated with oral health status. ISME J. 2008;3:65–78. doi: 10.1038/ismej.2008.91 [DOI] [PubMed] [Google Scholar]
  • 9.Belstrøm D, Fiehn N-E, Nielsen CH, Kirkby N, Twetman S, Klepac-Ceraj V, et al. Differences in bacterial saliva profile between periodontitis patients and a control cohort. J Clin Periodontol. 2014;41:104–12. doi: 10.1111/jcpe.12190 [DOI] [PubMed] [Google Scholar]
  • 10.Espuela-Ortiz A, Lorenzo-Diaz F, Baez-Ortega A, Eng C, Hernandez-Pacheco N, Oh SS, et al. Bacterial salivary microbiome associates with asthma among african american children and young adults. Pediatr Pulmonol. 2019;54:1948–56. doi: 10.1002/ppul.24504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lazarevic V, Manzano S, Gaïa N, Girard M, Whiteson K, Hibbs J, et al. Effects of amoxicillin treatment on the salivary microbiota in children with acute otitis media. Clin Microbiol Infect. 2013;19:E335–42. doi: 10.1111/1469-0691.12213 [DOI] [PubMed] [Google Scholar]
  • 12.Lin M, Li X, Wang J, Cheng C, Zhang T, Han X, et al. Saliva Microbiome Changes in Patients With Periodontitis With and Without Chronic Obstructive Pulmonary Disease. Front Cell Infect Microbiol. 2020;10:124. doi: 10.3389/fcimb.2020.00124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zaura E, Brandt BW, Teixeira de Mattos MJ, Buijs MJ, Caspers MPM, Rashid M-U, et al. Same Exposure but Two Radically Different Responses to Antibiotics: Resilience of the Salivary Microbiome versus Long-Term Microbial Shifts in Feces. mBio. 2015;6:e01693–01615. doi: 10.1128/mBio.01693-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Acharya A, Chan Y, Kheur S, Jin LJ, Watt RM, Mattheos N. Salivary microbiome in non-oral disease: A summary of evidence and commentary. Arch Oral Biol. 2017;83:169–73. doi: 10.1016/j.archoralbio.2017.07.019 [DOI] [PubMed] [Google Scholar]
  • 15.Dong T, Santos S, Yang Z, Yang S, Kirkhus NE. Sputum and salivary protein biomarkers and point-of-care biosensors for the management of COPD. The Analyst. 2020;145:1583–604. doi: 10.1039/c9an01704f [DOI] [PubMed] [Google Scholar]
  • 16.Segal LN, Clemente JC, Tsay J-CJ, Koralov SB, Keller BC, Wu BG, et al. Enrichment of the lung microbiome with oral taxa is associated with lung inflammation of a Th17 phenotype. Nat Microbiol. 2016;1:16031. doi: 10.1038/nmicrobiol.2016.31 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Krone CL, Oja AE, van de Groep K, Sanders EAM, Bogaert D, Trzciński K. Dried Saliva Spots: A Robust Method for Detecting Streptococcus pneumoniae Carriage by PCR. Int J Mol Sci. 2016;17:343. doi: 10.3390/ijms17030343 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Huang DT, Yealy DM, Filbin MR, Brown AM, Chang C-CH, Doi Y, et al. Procalcitonin-Guided Use of Antibiotics for Lower Respiratory Tract Infection. N Engl J Med. 2018;379:236–49. doi: 10.1056/NEJMoa1802670 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bezamat M, Harrison B, Zhou Y, Glickman KM, Telles V, Guirguis C, et al. Phenome-Wide Scan Finds Potential Orofacial Risk Markers for Cancer. Sci Rep. 2020;10:4869. doi: 10.1038/s41598-020-61654-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N, et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 2012;6:1621–4. doi: 10.1038/ismej.2012.8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol. 2019;37:852–7. doi: 10.1038/s41587-019-0209-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods. 2016;13:581–3. doi: 10.1038/nmeth.3869 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.McMurdie PJ, Holmes S. phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PloS One. 2013;8:e61217. doi: 10.1371/journal.pone.0061217 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10. doi: 10.1016/S0022-2836(05)80360-2 [DOI] [PubMed] [Google Scholar]
  • 26.Dixon P. VEGAN, a package of R functions for community ecology. J Veg Sci. 2003;14:927–30. [Google Scholar]
  • 27.Mandal S, Van Treuren W, White RA, Eggesbø M, Knight R, Peddada SD. Analysis of composition of microbiomes: a novel method for studying microbial composition. Microb Ecol Health Dis. 2015;26:27663. doi: 10.3402/mehd.v26.27663 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lin H, Peddada SD. Analysis of compositions of microbiomes with bias correction. Nat Commun. 2020;11:3514. doi: 10.1038/s41467-020-17041-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Hu Y, Satten GA, Hu Y-J. LOCOM: A logistic regression model for testing differential abundance in compositional microbiome data with false discovery rate control. Proc Natl Acad Sci U S A. 2022;119:e2122788119. doi: 10.1073/pnas.2122788119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lozupone C, Lladser ME, Knights D, Stombaugh J, Knight R. UniFrac: an effective distance metric for microbial community comparison. ISME J. 2011;5:169–72. doi: 10.1038/ismej.2010.133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Dickson RP, Singer BH, Newstead MW, Falkowski NR, Erb-Downward JR, Standiford TJ, et al. Enrichment of the lung microbiome with gut bacteria in sepsis and the acute respiratory distress syndrome. Nat Microbiol. 2016;1:16113. doi: 10.1038/nmicrobiol.2016.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lozupone CA, Stombaugh JI, Gordon JI, Jansson JK, Knight R. Diversity, stability and resilience of the human gut microbiota. Nature. 2012;489:220–30. doi: 10.1038/nature11550 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Dong J, Li W, Wang Q, Chen J, Zu Y, Zhou X, et al. Relationships Between Oral Microecosystem and Respiratory Diseases. Front Mol Biosci. 2021;8:718222. doi: 10.3389/fmolb.2021.718222 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Varoni EM, Rimondini L. Oral Microbiome, Oral Health and Systemic Health: A Multidirectional Link. Biomedicines. 2022;10:186. doi: 10.3390/biomedicines10010186 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hayes RB, Ahn J, Fan X, Peters BA, Ma Y, Yang L, et al. Association of Oral Microbiome With Risk for Incident Head and Neck Squamous Cell Cancer. JAMA Oncol. 2018;4:358–65. doi: 10.1001/jamaoncol.2017.4777 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Nikitakis NG, Papaioannou W, Sakkas LI, Kousvelari E. The autoimmunity-oral microbiome connection. Oral Dis. 2017;23:828–39. doi: 10.1111/odi.12589 [DOI] [PubMed] [Google Scholar]
  • 37.Loughman A, Adler CJ, Macpherson H. Unlocking Modifiable Risk Factors for Alzheimer’s Disease: Does the Oral Microbiome Hold Some of the Keys? J Alzheimers Dis JAD. 2023;92:1111–29. doi: 10.3233/JAD-220760 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Shah FA, Pike F, Alvarez K, Angus D, Newman AB, Lopez O, et al. Bidirectional relationship between cognitive function and pneumonia. Am J Respir Crit Care Med. 2013;188:586–92. doi: 10.1164/rccm.201212-2154OC [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Thurnheer T, Karygianni L, Flury M, Belibasakis GN. Fusobacterium Species and Subspecies Differentially Affect the Composition and Architecture of Supra- and Subgingival Biofilms Models. Front Microbiol. 2019;10. doi: 10.3389/fmicb.2019.01716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wu Y-W, Rho M, Doak TG, Ye Y. Oral Spirochetes Implicated in Dental Diseases Are Widespread in Normal Human Subjects and Carry Extremely Diverse Integron Gene Cassettes. Appl Environ Microbiol. 2012;78:5288–96. doi: 10.1128/AEM.00564-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Melo-Dias S, Valente C, Andrade L, Marques A, Sousa A. Saliva as a non-invasive specimen for COPD assessment. Respir Res. 2022;23:16. doi: 10.1186/s12931-022-01935-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.He Y, Wu W, Zheng H-M, Li P, McDonald D, Sheng H-F, et al. Regional variation limits applications of healthy gut microbiome reference ranges and disease models. Nat Med. 2018;24:1532–5. doi: 10.1038/s41591-018-0164-x [DOI] [PubMed] [Google Scholar]
  • 43.Beck JM, Schloss PD, Venkataraman A, Twigg H, Jablonski KA, Bushman FD, et al. Multicenter Comparison of Lung and Oral Microbiomes of HIV-infected and HIV-uninfected Individuals. Am J Respir Crit Care Med. 2015;192:1335–44. doi: 10.1164/rccm.201501-0128OC [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lim Y, Totsika M, Morrison M, Punyadeera C. The saliva microbiome profiles are minimally affected by collection method or DNA extraction protocols. Sci Rep. 2017;7:8523. doi: 10.1038/s41598-017-07885-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. Scatter plot of differentially abundant taxa between LRTI group and control group predicted by LOCOM method.

Taxonomic families appear along the and points are distributed along the X-axis according to effect size. Size of point indicates mean relative abundance of each family.

(DOCX)

S2 Fig. Volcano plot of differentially abundant taxa between Pittsburgh baseline saliva samples and those of other cities.

Taxa predicted by ANCOM-BC to be either enriched in other cities compared against Pittsburgh. Taxa enriched in the above cities appear to the right of the dotted line and vary along the x-axis according to their log-fold difference in abundance (those to the right are higher in the compared city; those to the left are higher in Pittsburgh). The y-axis shows the -log10 (FDR) value, taxa above 0.05 significance are labelled. Blue and red shading indicate whether the family is typical is the oral or gut microbiota respectively.

(DOCX)

S3 Fig. Volcano plot of differentially abundant taxa between the healthy cohort and baseline saliva samples from the Pittsburgh LRTI cohort and those of other cities.

Taxa enriched in the above cities appear to the right of the dotted line and vary along the x-axis according to their log-fold difference in abundance (those to the right are higher in the compared city; those to the left are higher in the healthy cohort). The y-axis shows the -log10 (FDR)value, taxa above 0.05 significance are labelled. Blue and red shading indicate whether the family is typical is the oral or gut microbiota, respectively.

(DOCX)

S4 Fig. Correlogram of patient metadata and LRTI types showing relationships between variables.

Size of circles indicate strength of correlation by R2 value. Shading of circle also indicates strength of correlation by R2 value, blue shading indicates correlation, red shading indicates inverse correlation. Variables are clustered by R2 value using hclust. Only associations with a p-value < 0.05 (Pearson metric) are shown.

(DOCX)

S1 Table. Results of statistical tests.

(DOCX)

S1 Appendix. Illustrated procedures for collecting saliva samples.

(DOCX)

S2 Appendix. Illustrated procedures for collecting fecal samples.

(DOCX)

S3 Appendix. Inventory of each collection kit and information on storage and shipping of samples.

(DOCX)

Data Availability Statement

Raw sequencing reads from this study have been deposited on the Genbank SRA database (SRR18910915-SRR18910984, Bioproject PRJNA830485). ASV tables, ASV sequences, sample metadata, and phylogenetic trees and analysis code is deposited on GitHub (https://github.com/MattRogers2/MAPLE.git).


Articles from PLOS ONE are provided here courtesy of PLOS

RESOURCES