Table 2.
59 gene clusters analyzed in our experiment
| # | Gene cluster ID | Size | PQ-tree | |||
|---|---|---|---|---|---|---|
| 1 | 19876 | 1.000 | 0.655 | 0.500 | 3 | ((0− [1− 2− 3−]) 4+) |
| 2 | 21344 | 0.905 | 0.615 | 0.439 | 4 | ([0− 1−] [2+ 3+]) |
| 3 | 19877 | 0.997 | 0.915 | 0.592 | 3 | ([0+ 1+ 2+] 3+) |
| 4 | 27180 | 0.828 | 0.828 | 0.436 | 3 | ([0− 1− 2−] 3+) |
| 5 | 14602 | 0.978 | 0.985 | 0.654 | 4 | (0− [1− 2− 3− 4−]) |
| 6 | 23340 | 0.974 | 0.809 | 0.684 | 3 | (0− (1+ [2+ 3+])) |
| 7 | 19853 | 0.967 | 0.825 | 0.703 | 3 | (0− ([1+ 2+ 3+] 4+)) |
| 8 | 14790 | 0.994 | 0.994 | 0.808 | 3 | ([0− 1− 2− 3−] 4− 5−) |
| 9 | 26244 | 0.971 | 0.996 | 0.817 | 3 | (0− [1− 2− 3−]) |
| 10 | 26243 | 0.903 | 0.909 | 0.757 | 4 | (([0− 1−] 2−) 3−) |
| 11 | 26476 | 0.925 | 0.998 | 0.791 | 3 | (0+ [1+ 2+ 3+]) |
| 12 | 15297 | 0.991 | 0.945 | 0.866 | 3 | (0− ([1− 2−] [3− 4+])) |
| 13 | 22866 | 0.845 | 0.600 | 0.722 | 5 | ([0+ 1+] [2+ 3+]) |
| 14 | 26238 | 0.785 | 0.807 | 0.663 | 7 | ([0− 1−] 2− 3−) |
| 15 | 18995 | 0.770 | 0.621 | 0.681 | 9 | ((0+ [1+ 2+]) 3+) |
| 16 | 28119 | 0.559 | 0.559 | 0.478 | 5 | (0− 1− 2− 3−) |
| 17 | 25371 | 0.983 | 0.942 | 0.907 | 4 | (0− ([1− 2−] 3−)) |
| 18 | 26231 | 0.974 | 0.802 | 0.909 | 4 | ((0+ [1+ 2+]) 3+) |
| 19 | 14796 | 0.760 | 0.686 | 0.700 | 9 | (0− 1− [2− 3−] 4−) |
| 20 | 20007 | 0.999 | 0.999 | 0.960 | 3 | (0− 1− 2− [3− 4−]) |
| 21 | 22299 | 0.929 | 0.929 | 0.891 | 3 | (0− 1− 2− 3−) |
| 22 | 19255 | 0.948 | 0.940 | 0.918 | 5 | ([0− 1−] [2− 3−] 4−) |
| 23 | 20764 | 0.880 | 0.987 | 0.852 | 3 | ([0+ 1+ 2+] 3+) |
| 24 | 21553 | 0.982 | 0.982 | 0.961 | 3 | (0− [1− 2−] 3− 4−) |
| 25 | 27851 | 1.000 | 1.000 | 0.982 | 3 | (0+ [1+ 2+] 3+) |
| 26 | 27427 | 0.866 | 0.866 | 0.866 | 3 | (0+ [1+ 2+ 3+]) |
| 27 | 15467 | 1.000 | 1.000 | 1.000 | 3 | ([0− 1− 2− 3−] 4− 5−) |
| 28 | 19610 | 0.998 | 0.998 | 0.998 | 3 | (0− ([1− 2−] 3− 4−)) |
| 29 | 20078 | 0.982 | 0.945 | 0.982 | 3 | (0+ (1+ [2+ 3+])) |
| 30 | 22181 | 0.912 | 0.940 | 0.912 | 3 | (([0+ 1+] 2+) 3+) |
| 31 | 25612 | 0.997 | 0.888 | 0.997 | 4 | ([0− 1−] [2− 3−]) |
| 32 | 27177 | 0.945 | 0.945 | 0.945 | 3 | (0− [1+ 2+ 3+]) |
| 33 | 8962 | 0.933 | 0.933 | 0.933 | 3 | (0− 1− [2− 3− 4− 5− 6−]) |
| 34 | 27530 | 0.999 | 0.974 | 0.999 | 3 | [[0− 1−] 2− 3−] |
| 35 | 19256 | 0.779 | 0.737 | 0.798 | 8 | (0− 1− 2− 3−) |
| 36 | 21317 | 0.974 | 0.974 | 0.995 | 3 | (0+ 1+ [2+ 3+ 4+]) |
| 37 | 19852 | 0.890 | 0.821 | 0.919 | 6 | (0− [1− 2−] 3−) |
| 38 | 27250 | 0.967 | 0.967 | 0.997 | 3 | ([0+ 1+] [2+ 3+]) |
| 39 | 30976 | 0.963 | 1.000 | 0.996 | 3 | [0− 1+ [2+ 3+]] |
| 40 | 28245 | 0.715 | 0.715 | 0.775 | 3 | (0− 1− 2− 3−) |
| 41 | 26257 | 0.933 | 0.988 | 0.999 | 3 | ((0+ [1+ 2+]) 3+) |
| 42 | 14839 | 0.860 | 0.912 | 0.928 | 8 | ([0− 1−] 2− 3− 4−) |
| 43 | 27207 | 0.748 | 0.749 | 0.824 | 7 | (0− [1− 2−] 3+) |
| 44 | 12685 | 0.917 | 0.917 | 0.994 | 3 | ([0− 1− 2−] 3− 4− 5−) |
| 45 | 21268 | 0.876 | 0.876 | 0.956 | 4 | ([0+ 1+] [2+ 3+]) |
| 46 | 23083 | 0.832 | 0.916 | 0.912 | 5 | (([0− 1−] 2−) 3−) |
| 47 | 25597 | 0.810 | 0.817 | 0.892 | 5 | (0+ [1+ 2+] 3+) |
| 48 | 19005 | 0.902 | 0.997 | 0.997 | 3 | ((0+ 1+ 2+) 3+) |
| 49 | 15035 | 0.623 | 0.671 | 0.732 | 4 | ((0+ ([1+ 2+] 3+)) 4+) |
| 50 | 28547 | 0.828 | 0.828 | 0.961 | 7 | (0− 1− 2− 3−) |
| 51 | 25375 | 0.866 | 0.866 | 1.000 | 3 | ([0− 1−] 2− 3−) |
| 52 | 20332 | 0.860 | 0.860 | 0.997 | 5 | (0+ 1+ [2+ 3+]) |
| 53 | 26364 | 0.839 | 0.869 | 0.988 | 5 | (0+ 1+ 2+ 3+) |
| 54 | 19909 | 0.818 | 0.818 | 0.968 | 6 | (0− [1− 2−] 3−) |
| 55 | 20601 | 0.586 | 0.586 | 0.756 | 5 | (0− 1− 2− 3−) |
| 56 | 15391 | 0.784 | 0.784 | 0.999 | 3 | (0− (1− [2− 3− [4− 5−]])) |
| 57 | 14573 | 0.501 | 0.501 | 0.722 | 4 | (0− 1− (2− [3− 4−]) 5−) |
| 58 | 25554 | 0.721 | 0.971 | 0.971 | 3 | (([0+ 1+] 2+) 3+) |
| 59 | 19526 | 0.543 | 0.471 | 0.814 | 4 | ((0− [1− 2−]) 3−) |
For each gene cluster (row in the table) we report the Pearson correlation versus the corresponding instance-abundance measure for each of the three genome rearrangement measures compared in our experiment: our proposed measure (), signed break-point distance () as in Definition 8 and CREx reversals distance () [28]. The column denoted “Size” gives the number of genes in the gene cluster, and the column denoted “PQ-tree” gives the corresponding PQ-tree representing the gene cluster. The functional category annotations for each cluster are given in Tables 3 and 4