Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2020 Dec 17;223(Suppl 3):S246–S256. doi: 10.1093/infdis/jiaa655

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© The Author(s) 2021. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

PMC Copyright notice

Figure 5. — Bootstrapped ElasticNet-identified predictors of lung function. Machine learning models were trained using varying input datasets. A, 1000-fold bootstrapping and (B) leave-one-out cross-validation (LOOCV) were used to generate prediction error (MSE) ranges across feature subsets. Models trained on all of the data showed lower error compared to other feature subsets. Adding 16S pathogen quantitation decreased model error. Models trained on all 16S data outperformed models using only 16S pathogen quantitation (P < .01, t test). Regardless of input features, models trained on the full sample set (black points) were greater than median LOOCV MSEs (boxplots). C, Coefficient ranges for train/test (black points) and bootstrapped models (boxplots) trained on standardized input datasets (blue, metadata; orange, 16S pathogens; yellow, 16S other taxa) show consistency between both machine learning strategies. Both cases selected Pseudomonas and Achromobacter as negative predictors. Abbreviations: BMI, body mass index; CF, cystic fibrosis; MSE, mean squared error; ns, not significant. **P < .01; ****P < .0001.