. 2026 Apr 16;6(6):101217. doi: 10.1016/j.xgen.2026.101217

© 2026 The Author(s)

This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

Performance of ProtoCloud on cell type classification

(A–C) Benchmarking of cell type annotation performance. We compared ProtoCloud against Seurat V4,²⁷ scANVI,²⁶ CellTypist,⁸ scPoli,⁶¹ TOSICA,⁶⁰ SIMS,⁶² scGPT,³⁹ and scBERT³⁷ on cell type annotation across eight datasets ranging from approximately 10,000 to 400,000 cells. The x axis shows the datasets; the number in parentheses is the number of rare cell types in each dataset. Downward-pointing arrows indicate values below 0.5. Error bars show the standard error of the mean. Evaluation metrics are (A) accuracy, (B) macro F1 score, and (C) Cohen’s kappa coefficient. Each metric was averaged over five runs with different random seeds, with 80% of the data used for training and 20% for validation in each run.
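
For readers unfamiliar with the three metrics in (A–C), the following is a minimal sketch of how they are conventionally defined, together with the standard error of the mean used for the error bars. It is not the authors' evaluation code: the function names and the toy labels are illustrative assumptions, and the standard textbook definitions are used (macro F1 as the unweighted mean of per-class F1, so each rare cell type counts as much as an abundant one).

```python
from collections import Counter
from math import sqrt
from statistics import mean, stdev


def accuracy(y_true, y_pred):
    """Fraction of cells whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    pairs = list(zip(y_true, y_pred))
    scores = []
    for c in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == c and p == c for t, p in pairs)
        fp = sum(t != c and p == c for t, p in pairs)
        fn = sum(t == c and p != c for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return mean(scores)


def cohen_kappa(y_true, y_pred):
    """Agreement corrected for chance: (p_o - p_e) / (1 - p_e)."""
    n = len(y_true)
    p_o = accuracy(y_true, y_pred)
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / n**2
    return (p_o - p_e) / (1 - p_e) if p_e != 1 else 0.0


def sem(values):
    """Standard error of the mean across repeated runs (e.g. five seeds)."""
    return stdev(values) / sqrt(len(values))


# Toy labels (hypothetical, not from the paper); "NK" plays the rare type.
y_true = ["T", "T", "T", "B", "B", "NK"]
y_pred = ["T", "T", "B", "B", "B", "T"]
print(accuracy(y_true, y_pred), macro_f1(y_true, y_pred), cohen_kappa(y_true, y_pred))
```

In the toy example the single misclassified rare cell drives its class F1 to zero, which pulls macro F1 well below accuracy. This is why macro F1 is the more sensitive of the three metrics to rare cell types.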

(D) Macro F1 score rank distributions across methods and datasets over all experimental repetitions.
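
A rank distribution like the one in (D) can be sketched as follows: within each (dataset, repetition) pair, methods are ranked by macro F1, and the ranks are then pooled per method. The caption does not state how ties are handled, so this sketch assumes tied methods share the best available rank; the method names and scores are hypothetical placeholders, not results from the paper.

```python
from collections import defaultdict


def rank_methods(scores):
    """Rank methods within one run: 1 = highest macro F1.

    `scores` maps method name -> macro F1. Tied methods share the
    lowest rank number (an assumption; the caption does not specify).
    """
    ordered = sorted(scores.values(), reverse=True)
    return {method: ordered.index(s) + 1 for method, s in scores.items()}


def rank_distribution(runs):
    """Pool per-run ranks into a list of ranks for each method."""
    dist = defaultdict(list)
    for scores in runs:
        for method, rank in rank_methods(scores).items():
            dist[method].append(rank)
    return dict(dist)


# Two hypothetical (dataset, seed) runs with placeholder method names.
runs = [
    {"method_a": 0.90, "method_b": 0.80, "method_c": 0.80},
    {"method_a": 0.70, "method_b": 0.95, "method_c": 0.60},
]
print(rank_distribution(runs))
```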

(E and F) Model performance analysis under varying conditions using the PBMC10K dataset.³ (E) Validation accuracy as the proportion of perturbed labels in the training set increases from 0% to 20%. (F) Validation accuracy as the training data ratio decreases from 0.8 to 0.1.
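
The two stress tests in (E) and (F) can be sketched as below. The caption does not describe the exact procedure, so both helpers rest on stated assumptions: `perturb_labels` replaces a fixed proportion of training labels with a different class chosen uniformly at random, and `subsample_indices` draws a uniform (unstratified) subset of cells for training. The function names and toy labels are hypothetical.

```python
import random


def perturb_labels(labels, proportion, seed=0):
    """Return a copy of `labels` with `proportion` of entries changed.

    Each selected label is replaced by a different class drawn uniformly
    at random (an assumed noise model, not taken from the paper).
    """
    classes = sorted(set(labels))
    if len(classes) < 2:
        raise ValueError("need at least two classes to perturb labels")
    rng = random.Random(seed)
    n_flip = round(proportion * len(labels))
    noisy = list(labels)
    for i in rng.sample(range(len(labels)), n_flip):
        noisy[i] = rng.choice([c for c in classes if c != labels[i]])
    return noisy


def subsample_indices(n_cells, ratio, seed=0):
    """Pick round(ratio * n_cells) cell indices uniformly at random."""
    rng = random.Random(seed)
    return sorted(rng.sample(range(n_cells), round(ratio * n_cells)))


# Toy training labels (hypothetical): 50 T cells and 50 B cells.
labels = ["T"] * 50 + ["B"] * 50
noisy = perturb_labels(labels, proportion=0.2, seed=1)
train_idx = subsample_indices(len(labels), ratio=0.1, seed=1)
print(sum(a != b for a, b in zip(labels, noisy)), len(train_idx))
```

Because every perturbed label is forced to differ from the original, the realised noise rate equals the requested proportion exactly. A stratified variant of the subsampling step would better preserve rare cell types at low training ratios.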