Skip to main content
. 2024 Dec 30;56:79. doi: 10.1186/s12711-024-00944-0

Table 2.

Average accuracy of sex prediction in the tested datasets

Data seta Five-fold cross validation Twofold cross validation
Accuracyb Number of misclassified samplesc Accuracy Number of misclassified samplesd
Sim 5 1.000 0 1.000 0
Sim 50 0.600 1120 0.599 2804
Sim rand 0.995 12 0.995 34
Sim real 0.998 12 0.997 19
Real data 0.979 5 0.977 15

aSim 5 = simulated data with a 5% missing rate for all SNPs; Sim 50 = simulated data with a 50% missing rate for all SNPs; Sim rand = simulated data with a random error rate in the range of 5 to 50%; Sim real = simulated data with a random error rate in the range of 5 to 10% for 5 SNPs and 10 to 50% for 10 SNPs; Real data = data from the Finnish Rainbow Trout Breeding Program

bAccuracy calculated as NCorrect/NTotal, where NCorrect is the number of correct predictions and NTotal is the total number of predicted samples

cNumber of test samples: simulated data—2802; real data—272

dNumber of test samples: simulated data—7005; real data—681