Figure 2. Large numbers of correct inferences are only observed when the fraction of correct inferences is very low.
We show the results of inferring interactions from S. cerevisiae (SC), C. elegans (CE), D. melanogaster (DM), H. sapiens (HS), S, Pombe (SP), and M. musculus (MM) to the first four of those species. (A) Number of correct interolog inferences across species, for different blastp
-value cut-offs. (B) Fraction of all inferences that are observed in the interactions of the target species,
. (C) The likelihood ratio
that an inference is correct. This indicates how much better it is to use the inferences than to select random pairs of proteins in the target species that have homologs in the source species interactome. (A) and (B) together indicate that it is only at lax
-values that one makes significant numbers of correct inferences, but this is a very small fraction of the total number of inferences made at these
-values. The S. cerevisiae data-set coverage is significantly higher than that of other species, so one obtains larger values for inferences to S. cerevisiae.
