Table 5.
Mean XGBoost model performance (RMSE and R², each with SD) for five predicted CpG methylation levels, using different sets of genetic features. Performance is reported on the training set and on an independent test set.
| XGBoost | 1. GWAS of 75% of samples, 1.1. before B-H correction: individual CpG probes (1.1.1, 1.1.3–1.1.5, 1.1.7) | 1. GWAS of 75% of samples, 1.1. before B-H correction: combined (1.1.9) | 1. GWAS of 75% of samples, 1.2. after B-H correction: individual CpG probes (1.2.1, 1.2.3–1.2.5, 1.2.7) | 1. GWAS of 75% of samples, 1.2. after B-H correction: combined (1.2.9) | 2. GTEx | 3. 41.5 Mb–43.6 Mb |
|---|---|---|---|---|---|---|
| Mean RMSE train set (SD) | 0.00794 (0.00847) | 0.0117 (0.00723) | 0.0308 (0.0192) | 0.0298 (0.0192) | 0.0184 (0.00825) | 0.0279 (0.0199) |
| Mean R² train set (SD) | 0.939 (0.0766) | 0.745 (0.261) | 0.180 (0.108) | 0.246 (0.128) | 0.420 (0.322) | 0.368 (0.206) |
| Mean RMSE test set (SD) | 0.0308 (0.0201) | 0.0302 (0.0178) | 0.0295 (0.0164) | 0.0296 (0.0174) | 0.0302 (0.0194) | 0.0291 (0.0176) |
| Mean R² test set (SD) | 0.0164 (0.138) | 0.0211 (0.0689) | 0.0470 (0.147) | 0.0528 (0.135) | 0.0569 (0.0948) | 0.100 (0.100) |
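
Each cell in the table is a mean (SD) of a per-probe metric. The following minimal Python sketch shows one way such a summary could be computed. It is not the authors' code. It assumes that R² is defined as 1 − SS_res/SS_tot (which can be negative on held-out data) and that the SD is the sample standard deviation across probes; the source specifies neither. The function names, the probe labels, and the toy values are hypothetical and are not data from the study.

```python
import math
import statistics


def rmse(y_true, y_pred):
    """Root-mean-square error between observed and predicted values."""
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(y_true))


def r2(y_true, y_pred):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def summarize(per_probe):
    """Return (mean, sample SD) of RMSE and R2 across probes.

    per_probe maps a probe label to a (y_true, y_pred) pair.
    """
    rmses = [rmse(t, p) for t, p in per_probe.values()]
    r2s = [r2(t, p) for t, p in per_probe.values()]
    return {
        "rmse": (statistics.mean(rmses), statistics.stdev(rmses)),
        "r2": (statistics.mean(r2s), statistics.stdev(r2s)),
    }


# Made-up toy values for illustration only (not data from the study).
toy = {
    # Predictions match the observations exactly.
    "probe_a": ([0.10, 0.20, 0.30, 0.40], [0.10, 0.20, 0.30, 0.40]),
    # Predictions are constant at the observed mean.
    "probe_b": ([0.10, 0.20, 0.30, 0.40], [0.25, 0.25, 0.25, 0.25]),
}
summary = summarize(toy)
print("RMSE mean (SD): %.4f (%.4f)" % summary["rmse"])
print("R2 mean (SD): %.4f (%.4f)" % summary["r2"])
```

Under this definition, a model that only predicts the observed mean scores an R² of zero, and a worse one scores below zero. That is one way a mean test-set R² near zero can come with an SD larger than the mean, as in several columns of the table.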