[Preprint]. 2023 Aug 2:arXiv:2308.01451v1. [Version 1]

Table 1.

Prediction results with and without erroneous exploitation of identifiability. Best Prediction for UKB is reported for 1350 subjects in the training set, whereas for all other datasets it is reported with the maximum number of training subjects available.

Dataset	Task	Null Model	Best Prediction	Double-Dipping Prediction	Unintentional Improvement
Dataset	Task	Null Model	Best Prediction	UKB	Sex (Accuracy)	0.528	0.82 ± 0.02	0.89 ± 0.006	7%
UKB	Age (RMSE)	7.68	5.92 ± 0.03	4.97 ± 0.50	12.4%
PNC	WRAT (RMSE)	14.6	14.45 ± 0.92	11.0 ± 0.28	23.6%
Fibro	Diagnosis (Accuracy)	0.51	0.61 ± 0.17	0.86 ± 0.038	25%
BSNIP	Diagnosis (Accuracy)	0.5	0.78 ± 0.05	0.89 ± 0.04	11%