Appendix 1—table 4. Average percentage of matches to various BPPS subgroup (BSG) patterns for haloacid dehalogenase sequences assigned to SFLD subgroup SG1135.
BSG | SG1135 | % matches to each BSG pattern for SG1135 sequences*: | Mean score† vs others | new‡ | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
ID | # seqs | # seqs | 15 | 16 | 25 | 3 | 11 | 14 | 17 | 18 | 24 | 23 | In SG1135 | In BSG | SGs |
15 | 1417 | 63 | 69 | 35 | 32 | 51 | 8 | 42 | 19 | 56 | 22 | 20 | 63 | 64 | ? |
16 | 1008 | 1 | 43 | 40 | 52 | 52 | 12 | 40 | 40 | 52 | 24 | 8 | 110 | 14 | error |
25 | 387 | 311 | 48 | 37 | 84 | 57 | 13 | 36 | 43 | 45 | 27 | 10 | 80 | 309 | yes |
3 | 5447 | 4500 | 49 | 25 | 35 | 90 | 15 | 37 | 20 | 38 | 20 | 9 | 148 | 132 | ? |
11 | 287 | 3 | 33 | 29 | 37 | 51 | 93 | 32 | 17 | 32 | 16 | 8 | 44 | 632 | yes |
14 | 1066 | 32 | 49 | 27 | 25 | 37 | 9 | 77 | 18 | 47 | 19 | 13 | 42 | 130 | yes |
17 | 518 | 288 | 41 | 39 | 41 | 42 | 8 | 30 | 88 | 48 | 25 | 6 | 55 | 321 | yes |
18 | 1455 | 950 | 58 | 34 | 28 | 44 | 9 | 50 | 16 | 88 | 20 | 15 | 43 | 169 | yes |
24 | 217 | 107 | 42 | 35 | 26 | 44 | 9 | 32 | 20 | 42 | 92 | 4 | 53 | 293 | yes |
23 | 227 | 125 | 62 | 32 | 23 | 41 | 4 | 32 | 10 | 36 | 14 | 90 | 45 | 403 | yes |
root | n.a. | 3040 | 46 | 37 | 41 | 56 | 12 | 40 | 29 | 46 | 23 | 10 | n.a. | n.a. | n.a. |
*Average percentage of matches to the pattern residues for their assigned BSG among the SG1135 sequences. The highest percentages (bold) correspond to the highest percentage in each row.
†The mean pairwise BLAST scores of the BPPS-assigned sequences against the remaining sequences either in SG1135 or in the BSG for that row. The highest scores in each row are bold. (See Appendix 1—table 2.).
‡A ‘yes’ in this column indicates that the SG1135 sequences assigned to the BSG in that row likely correspond to a subgroup distinct from SG1135; ‘?’ indicates a possible subcategory of SG1135; ‘error’ indicates a BPPS misclassification.