Skip to main content
. 2023 May 22;14:2899. doi: 10.1038/s41467-023-38569-4

Table 1.

Distribution of WSI patches across four different institutions

Training sets
Clients
Label C1 C2 C3 C4
Healthy 1195 1363 1727 1517
Tumor 1143 1363 1210 1324
Sum 2338 2726 2937 2841
Test set
Clients
Label C1 C2 C3 C4 Sum
Healthy 322 338 107 344 1111
Tumor 374 338 624 537 1873
Sum 696 676 731 881 2984

Class labels (healthy vs. tumor-containing) are defined at the WSI level, and are exactly balanced within each Client’s overall dataset. Patches in train and test sets are from non-overlapping patients, which forces some imbalance in class labels. The test sets are merged into a single multi-centric dataset for model evaluation.