Table 5. Genotype inference task.
Test Case | Chrom. | Inversion | Present Genotypes | Clusters | Balanced Accuracy |
---|---|---|---|---|---|
Single | D. mel. 2L | In(2L)t | 3 | 3 | 93.3% |
Single | D. mel. 2R | In(2R)NS | 3 | 3 | 94.4% |
Single | D. mel. 3R | In(3R)Mo | 3 | | 60.7% |
Single | D. mel. 3R | In(3R)p | 3 | | 43.3% |
Single | D. mel. 3R | In(3R)K | 3 | | 55.0% |
Multiple | 150 An. gam. and col. 2L | 2La | 2 | 3 | 66.7% |
Multiple | 81 An. gam. 2L | 2La | 2 | 2 | 100.0% |
Multiple | 34 An. gam. and col. 2L | 2La | 3 | 4 | 100.0% |

We evaluated clustering by how accurately it inferred inversion genotypes. Known inversion genotypes were taken from the original papers describing the data [17, 37–39]. Agreement between the known genotypes and the cluster labels was measured using balanced accuracy.

*Could not resolve multiple, mutually-exclusive inversions.
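
As an illustration of the metric, the following minimal sketch computes balanced accuracy (the unweighted mean of per-genotype recall) from known genotypes and cluster labels. It assumes that each cluster is assigned the most common known genotype among its members; this mapping rule, the function name `balanced_accuracy`, and the toy data are illustrative assumptions and are not taken from the evaluation described here.

```python
from collections import Counter, defaultdict


def balanced_accuracy(genotypes, clusters):
    """Mean per-genotype recall after mapping clusters to genotypes.

    Assumption (not from the paper): each cluster is labeled with the
    most common known genotype among the samples assigned to it.
    """
    # Count the known genotypes observed inside each cluster.
    members = defaultdict(Counter)
    for genotype, cluster in zip(genotypes, clusters):
        members[cluster][genotype] += 1

    # Majority vote: map each cluster label to a single genotype.
    cluster_to_genotype = {
        cluster: counts.most_common(1)[0][0]
        for cluster, counts in members.items()
    }
    predicted = [cluster_to_genotype[c] for c in clusters]

    # Recall for each known genotype, then the unweighted mean.
    totals = Counter(genotypes)
    hits = Counter(g for g, p in zip(genotypes, predicted) if g == p)
    recalls = [hits[g] / totals[g] for g in totals]
    return sum(recalls) / len(recalls)


# Toy data (hypothetical): ten samples, three genotypes, three clusters.
known = ["std/std"] * 4 + ["std/inv"] * 4 + ["inv/inv"] * 2
labels = [0, 0, 0, 1, 1, 1, 1, 1, 2, 2]
score = balanced_accuracy(known, labels)
print(f"{score:.1%}")
```

In the toy data, one standard homozygote falls into the heterozygote cluster, so the per-genotype recalls are 3/4, 4/4, and 2/2. Because every genotype contributes equally to the mean, a rare genotype that is poorly recovered lowers the score as much as a common one would.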