Table 3.
SL prediction task | AUPRC |
AUROC |
||||
---|---|---|---|---|---|---|
ETNA | MUNK | ETNA | MUNK | |||
Sce Spo | Sce Spo | Sce Spo | Sce Spo | Sce Spo | Sce Spo | |
Sce + Spo Sce | 0.757 | 0.681 | 0.655 | 0.753 | 0.700 | 0.650 |
Sce + Spo Spo | 0.773 | 0.606 | 0.554 | 0.744 | 0.581 | 0.571 |
Sce Spo | 0.762 | - | - | 0.741 | - | - |
Spo Sce | 0.688 | 0.533 | 0.543 | 0.648 | 0.499 | 0.431 |
Sce Hsa | Sce Hsa | Sce Hsa | Sce Hsa | Sce Hsa | Sce Hsa | |
Sce + Hsa Sce | 0.781 | 0.675 | 0.652 | 0.774 | 0.683 | 0.677 |
Sce + Hsa Hsa | 0.738 | 0.485 | 0.553 | 0.803 | 0.396 | 0.582 |
Sce Hsa | 0.650 | - | - | 0.687 | - | - |
Hsa Sce | 0.686 | 0.552 | 0.642 | 0.684 | 0.545 | 0.664 |
For each SL prediction task, indicates SL pairs in A were used for training to predict SL pairs in B (e.g. Sce + Spo Sce indicates that SL gene pairs from both Sce and Spo were used for training, and a set of held out SL gene pairs in Sce were used for evaluation). When SL pairs for two organisms are used for training, equal numbers of positives were used (i.e. SL pairs were subsampled for the larger of the two organisms). Notably, ETNA is able to predict genetic interactions across species well using only interactions reported in the other (Sce Spo, Spo Sce, Hsa Sce, and Sce Hsa). Each of the gold standards has balanced positives and negatives (prior = 0.5). For prediction tasks based on Sce examples, MUNK failed to converge after several days of training so we were unable to calculate prediction performance.