Skip to main content
[Preprint]. 2023 May 2:2023.05.01.538953. [Version 2] doi: 10.1101/2023.05.01.538953

Fig. 4. Impact of training dataset size on classification accuracy.

Fig. 4.

(A) Improved performance of PrimateAI-3D with increasing number of common human and primate variants in the training dataset (x-axis). Performance of each dataset (y-axis) was divided by the maximum performance observed across all training dataset sizes. (B) Cumulative fractions of all possible human synonymous (grey) and missense (green) variants observed as common variants in 234 primate species, including human (allele frequency > 0.1%). Each point shows the average of ten permutations, calculated with a different random ordering of the list of primate species each time.