Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2023 Aug 25;7(9):e939. doi: 10.1097/HS9.0000000000000939

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

Copyright © 2023 the Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the European Hematology Association.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial License 4.0 (CCBY-NC), where it is permissible to download, share, remix, transform, and buildup the work provided it is properly cited. The work cannot be used commercially without permission from the journal.

PMC Copyright notice

Figure 4. — ALLCatchR predicts sample blast counts, patient’s sex, and immunophenotype based on the gene expression data. (A) For GMALL (n = 302), MLL (n = 282), and RCH/PM (n = 77), sample blast counts obtained by cytology or flow cytometry were available. GMALL and MLL cohorts were separately used for training 2 classifiers in a 10-fold cross-validation scheme with the same machine learning algorithms used for subtype prediction. GMALL and MLL classifiers were validated on each other, and both were validated on the RCH/PM data. Best performing methods in terms of the RSME on the training data are shown. Training 2 classifiers on independent data sets allowed for the validation on each other and both were combined for final predictions. Blast count predictions had a good correlation to measured counts, that is, rho = 0.590 in GMALL and rho = 0.771 in MLL. Moreover, predicting MLL samples with the classifier trained on GMALL achieved a similar performance as the classifier trained on MLL samples and vice versa. (B) Because both GMALL and MLL classifiers had a good performance and were generalizable, predictions from both are combined in ALLCatchR. (C) Subclassifiers for immunophenotype and patient’s sex were developed using SVM linear and ranger machine learning models, respectively. An immunophenotype classifier was trained on GMALL samples (n = 413 common-B/pre-B and n = 66 pro-B) and validated on MLL data (n = 168 common-B/pre-B and n = 64 pro-B) with available EGIL immunophenotypes. A patient sex classifier was trained on n = 357 GMALL samples (female = 165; male = 192) analogous to the subtype classifier. For validation n = 1892 St Jude samples with known sex (female = 850; male = 1042) were used. Corresponding accuracies, sensitivities, and specificities are shown for these subclassifiers. BCP-ALL = B-cell precursor acute lymphoblastic leukemia; RSME = root mean squared error.