Skip to main content
. 2024 Mar 29;88(5):50. doi: 10.1007/s00285-024-02058-z

Table 3.

Correlation between the precision of predictions of g-annotations to human proteins in an alignment A versus g’s p-value in A according to Sect. 2.4

Thresh N Pearson Pearson p σ’s Spearman Spearman p σ’s
10-2 56,470 - 0.371 9.7×10-1939 94.8 - 0.579 1.6×10-6126 168.6
10-4 49,270 - 0.378 1.5×10-1774 90.7 - 0.618 2.8×10-6551 174.3
10-8 36,570 - 0.431 2.1×10-1793 92.2 - 0.681 8.1×10-6802 177.6
10-16 25,871 - 0.449 8.1×10-1411 80.9 - 0.688 5.9×10-5013 152.5
10-32 16,272 - 0.473 9.9×10-1011 68.5 - 0.721 2.8×10-3801 132.8
10-64 7688 - 0.621 1.1×10-1037 69.4 - 0.827 2.6×10-3580 128.9
10-128 4500 - 0.711 4.1×10-990 67.8 - 0.737 3.1×10-1154 73.2

The “thresh” column specifies the upper bound on the p-value of g in a particular alignment A before A’s g-annotation predictions are included in that row; N is the number of (A, g) pairs that result, across all alignments and GO terms with human proteins as targets. The “Pearson” column is the correlation between (a) the fraction of g-annotation predictions that are validated in alignment A and (b) the p-value of g in the alignment A that produced the predictions; the “Pearson p” column is the p-value of the Pearson correlation of the previous column; and the σ’s column is the number of standard deviations represented by the Pearson p. The last three columns duplicate the previous three, but for the Spearman correlation. The correlations are negative because the prediction precision increases as p-value decreases, as expected (note: the Pearson and Spearman p’s technically decrease in significance as N decreases, though they remain highly significant throughout.)