Statistics of datasets used in this study. The first column is the name of the organism, the second column – the number of protein-coding genes in its genome, Ngenes, the third column – the number of proteins for which we found at least one paralogous partner, the fourth column is the percentage of proteins with at least one paralog, the fifth column – the total number of distinct BLAST hits generated before we applied subsequent filtering, the sixth column – the number of paralogous pairs included in Na(p), and the seventh column – in Nd(p).