Skip to main content
. 2021 Mar 1;3(1):lqab009. doi: 10.1093/nargab/lqab009

Table 2.

Quality assessment metrics. The set of predicted bins and expected bins (gold standard) are, respectively, represented by Inline graphic and Inline graphic. Each predicted bin Inline graphic is assigned to its best mapping expected bin Inline graphic before computing purity, contamination and completeness. Inline graphic equals to the number of genes in the predicted bin Inline graphic and Inline graphic represents the set of expected bins that do not correspond to the best mapping of any predicted bin and are associated with a completeness of 0. NMI corresponds to the Mutual Information (MI) normalized by the maximum of the unconditional entropies of the expected and predicted bins Inline graphic and Inline graphicInline graphic represents the definition of MI where the normalized number of shared genes between a given predicted bin Inline graphic and an expected bin Inline graphic is defined as Inline graphic; and the normalized number of genes in Inline graphic and in Inline graphic are Inline graphic and Inline graphic.

Metric Formula Reference
Best mapping expected bin Inline graphic (26)
Purity Inline graphic (26)
Contamination Inline graphic (26)
Completeness Inline graphic (26)
Average purity Inline graphic (26)
Average completeness Inline graphic (26)
High-Quality (HQ) bins Inline graphic (52)
Normalized Mutual Information (NMI) Inline graphic , with Inline graphic, and Inline graphic (51)