Table 2.
Average accuracy of sex prediction in the tested datasets
Data seta | Five-fold cross validation | Twofold cross validation | ||
---|---|---|---|---|
Accuracyb | Number of misclassified samplesc | Accuracy | Number of misclassified samplesd | |
Sim 5 | 1.000 | 0 | 1.000 | 0 |
Sim 50 | 0.600 | 1120 | 0.599 | 2804 |
Sim rand | 0.995 | 12 | 0.995 | 34 |
Sim real | 0.998 | 12 | 0.997 | 19 |
Real data | 0.979 | 5 | 0.977 | 15 |
aSim 5 = simulated data with a 5% missing rate for all SNPs; Sim 50 = simulated data with a 50% missing rate for all SNPs; Sim rand = simulated data with a random error rate in the range of 5 to 50%; Sim real = simulated data with a random error rate in the range of 5 to 10% for 5 SNPs and 10 to 50% for 10 SNPs; Real data = data from the Finnish Rainbow Trout Breeding Program
bAccuracy calculated as NCorrect/NTotal, where NCorrect is the number of correct predictions and NTotal is the total number of predicted samples
cNumber of test samples: simulated data—2802; real data—272
dNumber of test samples: simulated data—7005; real data—681