TABLE 4.
Evaluating the impact of key components through ablation on PMC-15M and NIH ChestX-ray14.
| Model | PMC-15M dataset | NIH ChestX-ray14 dataset | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | Recall | F1 score | AUC | Accuracy | Recall | F1 score | AUC | |
| w./o. Disentangled Temporal Representation | 87.83 0.02 | 84.74 0.02 | 85.20 0.02 | 90.45 0.03 | 89.55 0.02 | 86.92 0.02 | 87.04 0.03 | 90.41 0.02 |
| w./o. Ontology-Aware Alignment | 88.96 0.03 | 85.33 0.02 | 86.45 0.02 | 91.14 0.02 | 89.11 0.03 | 87.24 0.02 | 87.68 0.02 | 91.27 0.03 |
| w./o. Causal-Aware Refinement | 88.42 0.02 | 86.27 0.02 | 86.38 0.03 | 91.72 0.02 | 90.14 0.02 | 88.20 0.02 | 88.51 0.03 | 92.02 0.02 |
| Ours | 89.74 0.02 | 86.95 0.02 | 87.84 0.03 | 92.61 0.02 | 91.86 0.02 | 89.33 0.02 | 90.07 0.03 | 93.15 0.02 |
The values in bold are the best values.