Figure 2.
Dataset splits used in the general training process. (1) UK Biobank (UKB) subjects are grouped by ancestry: European, African, East Asian, and South Asian. (2) The large European set is split into sibling and non-sibling sets. (3) The non-sibling set is further split into training sets and cross-validation (c.v.) folds. (4) Final predictors are tested on the remaining groups.
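
The four steps in the caption can be sketched in code. This is an illustrative sketch only, not the authors' pipeline. The function name `split_cohort`, the record keys `id`, `ancestry`, and `has_sibling`, the ancestry label `"EUR"`, and the default of five folds are all hypothetical. The caption does not state the cross-validation scheme, so plain k-fold is assumed here, and "the remaining groups" is read as the European sibling set plus the non-European ancestry groups.

```python
import random


def split_cohort(subjects, n_folds=5, seed=0):
    """Split subject records following steps (1)-(4) of the caption.

    `subjects` is a list of dicts with hypothetical keys
    'id', 'ancestry', and 'has_sibling' (assumed, not from the source).
    Returns (folds, test_sets).
    """
    # Step 1: group subjects by ancestry label.
    groups = {}
    for s in subjects:
        groups.setdefault(s["ancestry"], []).append(s)

    # Step 2: split the European set into sibling and non-sibling sets.
    european = groups.get("EUR", [])
    siblings = [s for s in european if s["has_sibling"]]
    non_siblings = [s for s in european if not s["has_sibling"]]

    # Step 3: build (training set, validation fold) pairs from the
    # non-sibling set. Plain k-fold is an assumption of this sketch.
    rng = random.Random(seed)
    shuffled = non_siblings[:]
    rng.shuffle(shuffled)
    folds = []
    for k in range(n_folds):
        val = shuffled[k::n_folds]  # every n_folds-th subject, offset k
        val_ids = {s["id"] for s in val}
        train = [s for s in shuffled if s["id"] not in val_ids]
        folds.append((train, val))

    # Step 4: everything not used for training is kept for final testing
    # (assumed reading of "the remaining groups").
    test_sets = {name: members for name, members in groups.items()
                 if name != "EUR"}
    test_sets["EUR_siblings"] = siblings
    return folds, test_sets
```

Keeping the sibling set out of both training and cross-validation means no subject used to fit or tune a predictor appears in any test set.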