2020 Aug 25;11(4):e01527-20. doi: 10.1128/mBio.01527-20

Copyright © 2020 Pincus et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG 5. Performance of the random forest algorithm in predicting P. aeruginosa virulence from accessory genomic content when intermediate-virulence isolates (middle third of estimated mLD₅₀ values) were removed. (A) Cumulative distribution function of estimated mLD₅₀ values after removal of intermediate-virulence isolates. Isolates with estimated mLD₅₀ values less than the median value in the complete training set (red dashed line) were designated high virulence, and the remainder were designated low virulence. (B) Nested 10-fold cross-validation performance of the random forest model, including accuracy, sensitivity, specificity, positive predictive value (PPV), area under the receiver operating characteristic curve (AUC), and F1 score. The results for each cross-validation fold are shown in black, with the mean and 95% confidence interval of each statistic indicated in red. (C) Learning curve showing the change in mean training accuracy (red line) and cross-validation accuracy (green line) with increasing training set size. Shading indicates the 95% confidence interval. Performance at each training set size was assessed by nested 10-fold cross-validation.
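The labeling rule in panel A and the threshold-based statistics in panel B can be illustrated with a minimal Python sketch. It is not the authors' code: the function names `label_extremes` and `binary_metrics` are hypothetical, the middle third is assumed to be removed by rank (so it is only approximately one third when the number of isolates is not divisible by three), and the mLD₅₀ values in the usage note are made up. AUC, the random forest itself, and the nested cross-validation loop are left out.

```python
from statistics import median


def label_extremes(mld50):
    """Drop the middle third of values (by rank) and label the rest.

    Returns (value, label) pairs: label 1 = high virulence (value below the
    median of the COMPLETE set), label 0 = low virulence.
    Assumption: "middle third" is taken by rank, not by value range.
    """
    cutoff = median(mld50)            # median of the complete set
    ranked = sorted(mld50)
    n = len(ranked)
    lo, hi = n // 3, n - n // 3       # rank bounds of the removed middle block
    kept = ranked[:lo] + ranked[hi:]
    return [(v, 1 if v < cutoff else 0) for v in kept]


def _ratio(num, den):
    """Return num / den, or NaN when the denominator is zero."""
    return num / den if den else float("nan")


def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity, specificity, PPV, and F1 for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return {
        "accuracy": _ratio(tp + tn, len(pairs)),
        "sensitivity": _ratio(tp, tp + fn),
        "specificity": _ratio(tn, tn + fp),
        "ppv": _ratio(tp, tp + fp),
        "f1": _ratio(2 * tp, 2 * tp + fp + fn),
    }
```

For example, `label_extremes([6.0, 6.5, 7.0, 7.2, 7.5, 7.8, 8.0, 8.5, 9.0])` keeps the three lowest and three highest illustrative values and labels the lowest three as high virulence, because a lower mLD₅₀ corresponds to a more virulent isolate.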