Figure - PMC


2016 Jun 27;6:28484. doi: 10.1038/srep28484


Copyright © 2016, Macmillan Publishers Limited

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/


(A) Comparison of the classification error of the random forest (RF) model with that of a guessing baseline, which always predicts the majority class of the training data set. The boxplots are based on the results from 500 bootstrap samples. The three horizontal lines of each box represent the first, second (median) and third quartiles, respectively, with the whiskers extending to 1.5 times the inter-quartile range (IQR). RF achieves significantly lower classification error. (B) Predictive power of individual genera assessed by the Boruta feature selection algorithm. Blue boxplots correspond to the minimum, average and maximum Z scores of shadow genera, which are shuffled versions of real genera introduced to the RF classifier and act as benchmarks for detecting truly predictive genera. Red, yellow and green represent genera rejected, deemed suggestive and confirmed by Boruta selection, respectively. (C) Heatmap based on the abundance of Boruta-selected genera. Hierarchical clustering (Euclidean distance, complete linkage) shows that MS samples tend to cluster together.
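
The guessing baseline and the boxplot statistics described for panel (A) can be illustrated with a minimal sketch. It is not the authors' code and rests on several assumptions: each bootstrap resample serves as the training set and the samples left out of it (out-of-bag) as the test set, the whiskers follow the common Tukey convention of ending at the most extreme data point within 1.5 IQR of the box, and the class counts are made up. The function names `majority_guess_errors` and `boxplot_summary` are hypothetical.

```python
import random
from collections import Counter
from statistics import quantiles


def majority_guess_errors(labels, n_boot=500, seed=0):
    """Classification error of a majority-class guess over bootstrap resamples.

    Assumption (not stated in the caption): the resample is the training
    set and the out-of-bag samples form the test set.
    """
    rng = random.Random(seed)
    n = len(labels)
    errors = []
    for _ in range(n_boot):
        # Draw n indices with replacement: the bootstrap training set.
        train_idx = [rng.randrange(n) for _ in range(n)]
        in_bag = set(train_idx)
        test_idx = [i for i in range(n) if i not in in_bag]
        if not test_idx:
            continue  # no out-of-bag samples; vanishingly rare for n > 10
        # The guess is the most frequent class in the training resample.
        majority = Counter(labels[i] for i in train_idx).most_common(1)[0][0]
        wrong = sum(labels[i] != majority for i in test_idx)
        errors.append(wrong / len(test_idx))
    return errors


def boxplot_summary(values):
    """Quartiles plus whisker ends within 1.5 IQR of the box (Tukey style)."""
    q1, median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": min(v for v in values if v >= low_fence),
        "whisker_high": max(v for v in values if v <= high_fence),
    }


# Hypothetical class counts, chosen only for illustration.
toy_labels = ["MS"] * 20 + ["control"] * 30
guess_errors = majority_guess_errors(toy_labels)
guess_summary = boxplot_summary(guess_errors)
```

Running the same resampling loop with a trained classifier in place of the majority guess would give the second distribution that the panel compares against this baseline.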