Table 2. Host Range Substitution Hotspots in φ6 P3 Protein^a.
Substitution | This Study | Duffy | Ferris | N | Frequency^b | Combined Frequency^b |
G5S | 0 | 0 | 2 | 2 | 1.3% | 54.7% |
E8K | 23 | 4 | 1 | 28 | 18.7% | |
E8G | 28 | 9 | 5 | 42 | 28.0% | |
E8D | 4 | 0 | 0 | 4 | 2.7% | |
E8A | 5 | 1 | 0 | 6 | 4.0% | |
Q130R | 7 | 0 | 0 | 7 | 4.7% | 16.7% |
A133V | 0 | 9 | 0 | 9 | 6.0% | |
D145G | 0 | 0 | 3 | 3 | 2.0% | |
N146S | 0 | 0 | 6 | 6 | 4.0% | |
D533A | 0 | 0 | 1 | 1 | 0.7% | 14.0% |
D535N | 0 | 0 | 1 | 1 | 0.7% | |
D554G | 3 | 1 | 8 | 12 | 8.0% | |
D554A | 1 | 0 | 1 | 2 | 1.3% | |
D554V | 0 | 0 | 1 | 1 | 0.7% | |
D554N | 0 | 0 | 2 | 2 | 1.3% | |
L555F | 1 | 0 | 1 | 2 | 1.3% | |
Others | 9 | 6 | 7 | 22 | 14.7% | 14.7% |
Total | 81 | 30 | 39 | 150 | 100% | 100% |
^a Amino acid substitutions that lie close together in the primary sequence are grouped together. We combine data from this study with two other studies of φ6 host range expansion (Duffy and Ferris). N is the total number of times a substitution was observed across all three studies. Frequency is the percentage of all observed substitutions (N = 150) accounted for by a particular substitution. Combined frequency is the percentage of all observed substitutions that fall within a particular region of the primary sequence. The Others category includes substitutions found outside the substitution hotspots.
^b Some frequencies are rounded to the nearest tenth of a percent.
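The frequency columns follow directly from the counts, so they can be checked with a few lines of arithmetic. The sketch below is a minimal illustration in Python, not part of the original analysis: the counts are copied from the table, while the region labels (`residues 5-8` and so on) and the variable names are hypothetical, chosen here only to mirror the row grouping.

```python
# Counts per substitution as (this study, Duffy, Ferris), copied from Table 2.
counts = {
    "G5S": (0, 0, 2), "E8K": (23, 4, 1), "E8G": (28, 9, 5),
    "E8D": (4, 0, 0), "E8A": (5, 1, 0),
    "Q130R": (7, 0, 0), "A133V": (0, 9, 0), "D145G": (0, 0, 3),
    "N146S": (0, 0, 6),
    "D533A": (0, 0, 1), "D535N": (0, 0, 1), "D554G": (3, 1, 8),
    "D554A": (1, 0, 1), "D554V": (0, 0, 1), "D554N": (0, 0, 2),
    "L555F": (1, 0, 1),
    "Others": (9, 6, 7),
}

# Hypothetical region labels that follow the grouping of rows in the table.
regions = {
    "residues 5-8": ["G5S", "E8K", "E8G", "E8D", "E8A"],
    "residues 130-146": ["Q130R", "A133V", "D145G", "N146S"],
    "residues 533-555": ["D533A", "D535N", "D554G", "D554A",
                         "D554V", "D554N", "L555F"],
    "Others": ["Others"],
}

# N for each substitution is its count summed over the three studies.
n = {name: sum(per_study) for name, per_study in counts.items()}
total = sum(n.values())

# Frequency: share of all observed substitutions, rounded to 0.1%.
freq = {name: round(100 * k / total, 1) for name, k in n.items()}

# Combined frequency: share of all substitutions falling in each region.
combined = {
    region: round(100 * sum(n[name] for name in members) / total, 1)
    for region, members in regions.items()
}

for region, members in regions.items():
    print(f"{region}: {combined[region]}%")
    for name in members:
        print(f"  {name}: N={n[name]}, {freq[name]}%")
```

Computing the combined frequency from the summed counts, rather than by adding the already rounded per-substitution percentages, avoids accumulating rounding error within a region.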