TABLE II.
The p-values of two-sample t-tests (significance level 0.05) comparing our method with other methods in terms of F1 score. Each column corresponds to one source-target domain pair, with the source stain listed first and the target stain second. The first, second, third and fourth panels represent our method’s variants, proposed UDA methods, proposed SSDA methods with 5% gold-standard annotations available in each target dataset, and the target-only models, respectively, for the two radii (r = 16 and r = 8) used to define gold-standard regions.
| Panel | Method | r = 16: H&E → DAPI | r = 16: H&E → IHC | r = 16: DAPI → H&E | r = 16: DAPI → IHC | r = 16: IHC → H&E | r = 16: IHC → DAPI | r = 8: H&E → DAPI | r = 8: H&E → IHC | r = 8: DAPI → H&E | r = 8: DAPI → IHC | r = 8: IHC → H&E | r = 8: IHC → DAPI |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Baseline | 2×10⁻⁶ | 0.081 | 0.095 | 0.198 | 1.9×10⁻³ | 1.2×10⁻⁵ | 10⁻⁶ | 0.014 | 0.357 | 0.012 | 0.015 | 1.3×10⁻⁵ |
| 1 |  | 0.018 | 5.9×10⁻³ | 0.0 | 10⁻⁶ | 0.0 | 8.8×10⁻³ | 1.3×10⁻³ | 10⁻⁶ | 0.0 | 3×10⁻⁶ | 0.0 | 1.1×10⁻³ |
| 1 |  | 1.8×10⁻³ | 10⁻⁶ | 0.114 | 0.442 | 5×10⁻⁶ | 1.5×10⁻³ | 4.3×10⁻⁵ | 3×10⁻⁶ | 1.8×10⁻³ | 0.05 | 2×10⁻⁶ | 1.9×10⁻⁴ |
| 1 |  | 1.5×10⁻³ | 0.0 | 5×10⁻⁶ | 0.442 | 0.0 | 2.3×10⁻³ | 9.2×10⁻³ | 0.0 | 0.0 | 4×10⁻⁶ | 0.0 | 0.011 |
| 2 |  | 0.546 | 3.1×10⁻³ | 0.077 | 0.703 | 0.598 | 0.572 | 0.248 | 0.035 | 0.018 | 0.327 | 0.856 | 0.319 |
| 2 |  | - | - | - | - | - | - | - | - | - | - | - | - |
| 3 |  | 4.3×10⁻⁵ | 7.7×10⁻⁴ | 1.1×10⁻⁵ | 1.4×10⁻³ | 0.071 | 1.7×10⁻⁵ | 5.7×10⁻⁵ | 0.594 | 4×10⁻⁶ | 5.6×10⁻³ | 0.150 | 4.6×10⁻⁵ |
| 3 |  | 6×10⁻⁶ | 10⁻⁶ | 9×10⁻⁶ | 0.222 | 0.042 | 3×10⁻⁶ | 5×10⁻⁶ | 1.5×10⁻⁴ | 6×10⁻⁶ | 0.068 | 0.072 | 4×10⁻⁶ |
| 4 | 5%-target | 2.4×10⁻³ | 0.847 | 1.5×10⁻⁵ | 0.012 | 0.576 | 2.1×10⁻⁵ | 2.1×10⁻³ | 0.059 | 5×10⁻⁶ | 0.036 | 0.672 | 4×10⁻⁶ |
| 4 | Full-target | 0.0 | 0.0 | 0.0 | 1.2×10⁻⁴ | 2×10⁻⁶ | 0.0 | 0.0 | 0.0 | 0.0 | 3.3×10⁻⁵ | 10⁻⁶ | 0.0 |
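
As a rough illustration of how a single entry of this kind could be obtained, the sketch below runs a two-sample t-test on two sets of F1 scores and checks the result against the 0.05 significance level. It is only a minimal sketch under stated assumptions: the pooled-variance (Student) form of the test is assumed because the source does not say which variant was used, the F1 values are hypothetical placeholders rather than results from the paper, and the function names `t_pdf` and `two_sample_t_test` are made up for this example.

```python
import math
from statistics import mean, variance


def t_pdf(x, dof):
    """Density of Student's t distribution with `dof` degrees of freedom."""
    log_norm = (math.lgamma((dof + 1) / 2) - math.lgamma(dof / 2)
                - 0.5 * math.log(dof * math.pi))
    return math.exp(log_norm - (dof + 1) / 2 * math.log1p(x * x / dof))


def two_sample_t_test(a, b, steps=2000):
    """Pooled-variance two-sample t-test (assumed variant).

    Returns the t statistic and the two-sided p-value.
    """
    n_a, n_b = len(a), len(b)
    dof = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / dof
    t = (mean(a) - mean(b)) / math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    # Probability mass in [0, |t|] via Simpson's rule (steps must be even).
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, dof) + t_pdf(upper, dof)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, dof)
    central_mass = total * h / 3
    # Two-sided p-value: mass outside [-|t|, |t|].
    p = max(0.0, 1.0 - 2.0 * central_mass)
    return t, p


# Hypothetical per-run F1 scores (NOT taken from the paper).
f1_ours = [0.86, 0.88, 0.85, 0.87, 0.89]
f1_other = [0.80, 0.82, 0.79, 0.83, 0.81]

t_stat, p_value = two_sample_t_test(f1_ours, f1_other)
print(f"t = {t_stat:.3f}, p = {p_value:.2e}, significant = {p_value < 0.05}")
```

Under this reading, a table entry below 0.05 would indicate a statistically significant difference in F1 score for that source-target pair, while a larger entry would not. The numerical integration loses precision for extremely small p-values, so a statistics library would be preferable for anything beyond illustration.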