Skip to main content
American Journal of Respiratory Cell and Molecular Biology logoLink to American Journal of Respiratory Cell and Molecular Biology
letter
. 2018 Nov;59(5):648–652. doi: 10.1165/rcmb.2018-0068LE

BAL Fluid Metaproteome in Acute Respiratory Failure

Pratik D Jagtap 1,*, Kevin J Viken 2,*, James Johnson 3, Thomas McGowan 3, Kathryn M Pendleton 2, Timothy J Griffin 1, Ryan C Hunter 2, Joel D Rudney 4, Maneesh Bhargava 2,
PMCID: PMC6236685  PMID: 30382775

To the Editor:

Studies have shown that lung microbiota are altered from the healthy state during respiratory disease. Gaining insights into the interactions between the host and the microbiome is of paramount importance to increase our understanding of both health and disease. A limitation of gene marker analysis to infer phylogenetic information about microbial communities (1, 2) is that sequenced DNA could originate from inactive cells. Recent techniques such as presorting live microbes (3), stable-isotope probing (4), and RNA sequencing (5) have emerged as tools for providing insights into active community members. Metaproteomics detects proteins expressed by bacterial communities as potential markers of microbial activity (6). Given the limitations of DNA sequencing and culture-dependent methods for identifying respiratory pathogens, we sought to develop a mass spectrometry (MS)-based proteomics and bioinformatics pipeline for determining the composition of active lung microbial communities.

We previously reported findings from quantitative studies of BAL fluid (BALF) from patients with lung injury after hematopoietic stem cell transplantation (idiopathic pneumonia syndrome, n = 12; infectious lung injury, n = 18) (7) or acute respiratory distress syndrome (n = 36) (8). The majority of the patients had widespread lung damage and were on antibiotics. These studies used isobaric tagging to quantify host human proteins. For the proof-of-concept study described here, we reanalyzed this dataset using spectral counting to provide an estimate of microbial peptide abundance. The spectral datasets for hematopoietic stem cell transplantation (8) and acute respiratory distress syndrome (9) were combined for our analysis. The RAW files containing combined spectrum-level data were searched against a custom protein sequence database (described below). We processed 173 MS RAW files containing 926,102 tandem mass spectra (MS/MS) using sequential analytical workflows within the Galaxy-P platform (Figure 1). The raw data for the analyses of this dataset have been deposited in the ProteomeXchange Consortium with the dataset identifier PXD008273.

Figure 1.

Figure 1.

Metaproteomics workflow in Galaxy-P to identify bacterial peptides in BAL fluid. In the first workflow (steps A–C), a database was generated (A) and peak processing RAW files were converted to mzml and MGF files and searched with ProteinPilot version 5.0.0, 1654 revision: 1656. (B). The outputs from these steps were used for a database search (C) to generate a list of both human and bacterial peptide-spectrum matches (PSMs) (Table E1). In the second workflow (steps D–F), bacterial PSMs were parsed out by eliminating human-origin peptides from the list (D). The microbial peptides were subjected to BLAST-P analysis (E) against the NCBInr database to generate an output for subsequent MEGAN6 analysis. MEGAN software uses a bitscore threshold to assign matches. Any match that has a bitscore below the threshold is not used to assign taxonomy. If all matches for a taxonomy or function are below the threshold, it is marked as unassigned. Peptides that do not have a match because of an absence of sequences in the InterPro2GO mapping file are termed “No Hits” (F).

Workflow 1

We created a protein sequence database containing 2,837,537 human and microbial peptides expected in BALF by merging five publically available databases: 1) the Human UniProt database, 2) contaminant proteins from the common Repository of Adventitious Proteins, 3) the Oral Human Microbiome Project Reference Genome Sequence Data database, 4) the Human Oral Microbiome database, and 5) the Human Microbiome Project gastrointestinal tract database. This workflow created peaklists from the MS/MS data for matching to the customized protein sequence database. Controlling for a false discovery rate of ≤5%, 235,602 peptide MS/MS spectra matched to peptide sequences in this database.

Workflow 2

Peptide-spectrum matches (PSMs) from workflow 1 that did not match Human UniProt, decoy, or contaminant species were used to identify microbial sequences. These PSMs were analyzed with Megan6 to extract phylogenetic and functional information (9). We identified 776 microbial PSMs (426 distinct bacterial peptide sequences) that were further examined with Megan6 (Table E1 in the data supplement).

We identified 63 genera (Figure E1), 41 of which were reported in at least one prior metagenome study in sputum or BALF (Table 1). Twenty of the 27 genera that were identified with at least two peptides were also reported in prior metagenomics studies (Table 1). These included several known pulmonary pathogenic genera, including Burkholderia, Klebsiella, Listeria, and Staphylococcus. Clinical records for these patients identified normal respiratory flora and six other microorganisms, four of which were detected by our metaproteomic approach. Nine genera, including Pyramidobacter, Actinomyces, and Streptococcus, were detected from at least 20 clinical samples (Table 1). These microbial peptides mapped to several Gene Ontology annotations, including regulation of transcription, protein translation, trafficking, protein folding, nucleic acid metabolism, amino acid metabolism, carbohydrate metabolism, glycolysis, gluconeogenesis, fatty acid oxidation, redox response, cellular response to stress, and cell adhesion (Figure E2).

Table 1.

Microbial Organisms Identified at the Genus Level with Spectral Identification Statistics, Detection across Clinical Samples in this Study, and Prior Lung Microbiome Studies

Genus Number of Peptide-Spectrum Matches (Peptides) Total Number of Datasets* Total Number of Patients in Whom the Peptide Was Detected Prior Gene Marker Studies Reporting These Microbiota in the Respiratory Tract
Streptococcus 9 (3) 7 42 (1, 1012, 1416)
Parabacteroides 9 (2) 5 30 (10, 11)
Actinomyces 8 (8) 8 44 (1, 10, 11)
Prevotella 8 (6) 6 35 (1, 10, 11, 12, 14, 16)
Haemophilus 6 (4) 1 6 (1, 11, 15)
Cardiobacterium 6 (2) 4 19 (11)
Streptomyces 5 (2) 3 17  
Lactobacillus 4 (4) 4 23 (10, 11)
Enterococcus 3 (3) 3 18 (11)
Anaerofustis 3 (3) 3 18  
Hungatella 3 (3) 3 17  
Paenibacillus 3 (3) 3 16 (10)
Pseudomonas 3 (3) 3 11 (11, 16)
Delftia 3 (2) 2 12 (11)
Pseudoflavonifractor 3 (2) 2 12  
Neisseria 3 (2) 2 11 (1, 10, 11, 14, 16)
Eubacterium 3 (2) 1 6 (11)
Anaerotruncus 3 (1) 1 6  
Pyramidobacter 21 (2) 11 65  
Clostridium 2 (2) 2 12 (10, 11)
Bifidobacterium 2 (2) 2 12 (11, 15)
Ruminococcus 2 (2) 2 12  
Yersinia 2 (2) 1 6 (11)
Succinatimonas 2 (2) 1 6  
Burkholderia 2 (1) 3 18 (11)
Klebsiella 2 (1) 2 12  
Providencia 2 (1) 2 12  
Rothia 2 (1) 1 6 (1012)
Edwardsiella 2 (1) 1 6  
Fusobacterium 19 (4) 2 12 (1, 1012, 16)
Bacteroides 14 (11) 5 29 (10)
Oribacterium 12 (4) 4 20 (1, 10, 11)
Capnocytophaga 10 (6) 5 29 (10, 11)
Staphylococcus 1 (1) 1 6 (11, 15, 16)
Scardovia 1 (1) 2 10 (11)
Catonella 1 (1) 1 6 (10, 11)
Treponema 1 (1) 1 6 (10, 11)
Megasphaera 1 (1) 1 6 (1012, 16)
Anaerococcus 1 (1) 1 6 (11, 15)
Porphyromonas 1 (1) 1 6 (10, 11, 15)
Eikenella 1 (1) 1 6 (11)
Ochrobactrum 1 (1) 1 6 (11)
Alistipes 1 (1) 1 6 (10)
Escherichia 1 (1) 1 6 (10)
Turicibacter 1 (1) 1 6 (10)
Catenibacterium 1 (1) 1 6  
Listeria 1 (1) 1 6  
Paenisporosarcina 1 (1) 1 6  
Turicella 1 (1) 1 6  
Tyzzerella 1 (1) 1 6  
Peptostreptococcus 1 (1) 1 5 (10, 11)
Selenomonas 1 (1) 1 5 (10, 11, 16)
Mycoplasma 1 (1) 1 5 (11)
Shuttleworthia 1 (1) 1 5 (11)
Blautia 1 (1) 1 5  
Collinsella 1 (1) 1 5  
Dorea 1 (1) 1 5  
Kytococcus 1 (1) 1 5  
Oxalobacter 1 (1) 1 5  
Sanguibacter 1 (1) 1 5  
Atopobium 1 (1) 1 4 (10, 11)
Desulfovibrio 1 (1) 1 4 (11)
Parascardovia 1 (1) 1 4 (11)
*

Eight-plex runs with one or more peptide spectral matches.

Genera identified in clinical culture data.

There is growing evidence of a complex interplay between respiratory microbes and host in both health and disease. Thus far, most of the studies evaluating microbial diversity in the lung have used marker gene analysis of 16S rRNA–encoding genes (1, 1012), which does not reveal the host–microbe interactions. Our pipeline demonstrates the feasibility of identifying microbial peptides, bacterial taxa, and associated biological functions activated by the microbial community in the distal lung.

Importantly, a functional analysis may suggest that the microbiota we identified were metabolically active (Table E1, Figure E3). Many of the proteins we detected are known to be involved in energy production, protein transcription, or protein assembly. For instance, the most abundant protein recovered, triose-phosphate isomerase, is a key enzyme in the glycolytic pathway. We also discovered members of the heat shock protein 70 family, whose constituents chaperone protein folding. Protein regulators of transcription, including transcription factor Helix-turn-helix (HTH) DNA binding motif and members of the Repressor, Orf, Kinase (ROK) family, were also recovered.

Although our pipeline serves as an important proof of concept, we acknowledge several key areas in need of improvement. To increase detection of microbial proteins within a host sample, enrichment of BALF microbiota and/or sample fractionation coupled with the use of increasingly sensitive mass spectrometers could be pursued. To improve our coverage of lower-abundance proteins, we depleted medium- and high-abundance proteins (13)—a strategy that could remove other proteins bound to the high-abundance proteins. Another limitation of the field is the absence of databases of respiratory microbial peptides for spectral matching. A high-quality metagenomic sequencing database is critical, as peptide sequence identification relies on the fidelity of predicted protein sequences. An approach using simultaneous whole-metagenome sequencing from the same sample could substantially increase the detection of bacterial peptides for metaproteomic studies. Such advances could propel metaproteomics to a point where it rivals the depth of microbial characterization offered by 16S rRNA sequencing, with the added benefit of providing a functional snapshot of the system. Although the use of a pooled spectral dataset does not allow absolute quantification of bacterial peptides from individual patients, our bioinformatics pipeline provides a framework for obtaining such information in the future.

These current limitations notwithstanding, the workflow for identification of BALF microbiota provides a promising framework for integrating gene marker and metaproteomic studies. The development of such bioinformatics pipelines is a critical step toward gaining insight into the distal lung microenvironment and the microbial functional state, as well as the interplay between microbial communities and the host. Improved knowledge about the host–pathogen dynamic may provide the key to unlock more individualized care. The proof-of-concept work described here is a step toward achieving these exciting possibilities.

Acknowledgments

Acknowledgment

The authors thank the Minnesota Supercomputing Institute and the Center for Mass Spectrometry and Proteomics at the University of Minnesota for assistance in this work.

Footnotes

Supported by an American Heart Association Scientist Development Grant (12SDG8160000) (M.B.) and the University of Minnesota CTSI KL2 Scholars Program (KL2 RR0333182) (M.B.). The Galaxy-P project is supported by National Science Foundation grant DBI-1458524 (T.J.G.) and National Institutes of Health grant U24CA199347 (T.J.G.).

Author Contributions: Study conception and design: P.D.J., K.J.V., and M.B. Acquisition of data: P.D.J., K.J.V., and M.B. Analysis and interpretation of data: P.D.J., K.J.V., J.J., T.M., K.M.P., T.J.G., R.C.H., J.D.R., and M.B. Drafting or revision of the manuscript: P.D.J., K.J.V., J.J., T.M., K.M.P., T.J.G., R.C.H., J.D.R., and M.B. Final approval of the manuscript: P.D.J., K.J.V., J.J., T.M., K.M.P., T.J.G., R.C.H., J.D.R., and M.B.

This letter has a data supplement, which is accessible from this issue’s table of contents at www.atsjournals.org.

Author disclosures are available with the text of this letter at www.atsjournals.org.

References

  • 1.Morris A, Beck JM, Schloss PD, Campbell TB, Crothers K, Curtis JL, et al. Lung HIV Microbiome Project. Comparison of the respiratory microbiome in healthy nonsmokers and smokers. Am J Respir Crit Care Med. 2013;187:1067–1075. doi: 10.1164/rccm.201210-1913OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Pendleton KM, Erb-Downward JR, Bao Y, Branton WR, Falkowski NR, Newton DW, et al. Rapid pathogen identification in bacterial pneumonia using real-time metagenomics. Am J Respir Crit Care Med. 2017;196:1610–1612. doi: 10.1164/rccm.201703-0537LE. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Rebets Y, Lupoli T, Qiao Y, Schirner K, Villet R, Hooper D, et al. Moenomycin resistance mutations in Staphylococcus aureus reduce peptidoglycan chain length and cause aberrant cell division. ACS Chem Biol. 2014;9:459–467. doi: 10.1021/cb4006744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Neufeld JD, Dumont MG, Vohra J, Murrell JC. Methodological considerations for the use of stable isotope probing in microbial ecology. Microb Ecol. 2007;53:435–442. doi: 10.1007/s00248-006-9125-x. [DOI] [PubMed] [Google Scholar]
  • 5.Urich T, Lanzén A, Qi J, Huson DH, Schleper C, Schuster SC. Simultaneous assessment of soil microbial community structure and function through analysis of the meta-transcriptome. PLoS One. 2008;3:e2527. doi: 10.1371/journal.pone.0002527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ram RJ, Verberkmoes NC, Thelen MP, Tyson GW, Baker BJ, Blake RC, II, et al. Community proteomics of a natural microbial biofilm. Science. 2005;308:1915–1920. [PubMed] [Google Scholar]
  • 7.Bhargava M, Viken KJ, Dey S, Steinbach MS, Wu B, Jagtap PD, et al. Proteome profiling in lung injury after hematopoietic stem cell transplantation. Biol Blood Marrow Transplant. 2016;22:1383–1390. doi: 10.1016/j.bbmt.2016.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Bhargava M, Viken K, Wang Q, Jagtap P, Bitterman P, Ingbar D, et al. Bronchoalveolar lavage fluid protein expression in acute respiratory distress syndrome provides insights into pathways activated in subjects with different outcomes. Sci Rep. 2017;7:7464. doi: 10.1038/s41598-017-07791-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Huson DH, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, et al. Megan community edition - interactive exploration and analysis of large-scale microbiome sequencing data. PLoS Comput Biol. 2016;12:e1004957. doi: 10.1371/journal.pcbi.1004957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Charlson ES, Bittinger K, Haas AR, Fitzgerald AS, Frank I, Yadav A, et al. Topographical continuity of bacterial populations in the healthy human respiratory tract. Am J Respir Crit Care Med. 2011;184:957–963. doi: 10.1164/rccm.201104-0655OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hogan DA, Willger SD, Dolben EL, Hampton TH, Stanton BA, Morrison HG, et al. Analysis of lung microbiota in bronchoalveolar lavage, protected brush and sputum samples from subjects with mild-to-moderate cystic fibrosis lung disease. PLoS One. 2016;11:e0149998. doi: 10.1371/journal.pone.0149998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dickson RP, Erb-Downward JR, Freeman CM, McCloskey L, Beck JM, Huffnagle GB, et al. Spatial variation in the healthy human lung microbiome and the adapted island model of lung biogeography. Ann Am Thorac Soc. 2015;12:821–830. doi: 10.1513/AnnalsATS.201501-029OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Bhargava M, Higgins L, Wendt CH, Ingbar DH. Application of clinical proteomics in acute respiratory distress syndrome. Clin Transl Med. 2014;3:34. doi: 10.1186/s40169-014-0034-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zimmermann A, Zissel G, Müller-Quernheim J, Hofmann S, Schreiber S, Fischer A. Are bronchoalveolar lavages a good source for microbial profiling? Differences between throat and bronchoalveolar lavage microbiomes. J Med Microbiol. 2015;64:948–951. doi: 10.1099/jmm.0.000112. [DOI] [PubMed] [Google Scholar]
  • 15.Brown PS, Pope CE, Marsh RL, Qin X, McNamara S, Gibson R, et al. Directly sampling the lung of a young child with cystic fibrosis reveals diverse microbiota. Ann Am Thorac Soc. 2014;11:1049–1055. doi: 10.1513/AnnalsATS.201311-383OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Scher JU, Joshua V, Artacho A, Abdollahi-Roodsaz S, Öckinger J, Kullberg S, et al. The lung microbiota in early rheumatoid arthritis and autoimmunity. Microbiome. 2016;4:60. doi: 10.1186/s40168-016-0206-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from American Journal of Respiratory Cell and Molecular Biology are provided here courtesy of American Thoracic Society

RESOURCES