Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2020 Mar 10;18:612–621. doi: 10.1016/j.csbj.2020.02.022

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© 2020 The Authors

This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

PMC Copyright notice

Fig. 6 — Machine learning using a broad range of features leads to high classification performance in human. (A) Heatmaps showing accuracy metrics for the performance evaluation of essential gene classification in human. Five ML approaches were used (Generalised linear model [GLM], Support-Vector Machines [SVM)], Neural Network [NNET], Random Forests [RF], and Extreme Gradient Boosting [XGB]). The performance was measured for the training and testing sets. Features were generated following our new approach including a broad range of features (All Features) and the performance was benchmarked to protein sequence features only (protr). The algorithm with the best performance (highest harmonic mean) was XGB, indicated by a green frame. (B) Receiver operating characteristic and Precision-Recall curves from XGB classifier measured on the testing sets. Using all features (red) yielded distinctively better results compared to the results from the protein sequence features only (grey). Results from a random classification is indicated by dotted lines. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)