Table 4. Coding sequences specific to Haemophilus influenzae biogroup aegyptius strain F3031 at 10 loci*.
Locus no. and F3031 ID | G+C, % | % Identity | Predicted product | Species harboring closest homologue |
---|---|---|---|---|
1, phage region 1† | ||||
HIBPF00881 | 41.2 | 60 | Conserved hypothetical protein | Neisseria meningitidis |
HIBPF00900 | 37.6 | 68 | Plasmid maintenance system killer | Haemophilus parasuis |
HIBPF00910 | 40.2 | 82 | Plasmid maintenance system antidote protein | Neisseria gonorrhoeae |
HIBPF01110 | 40.2 | 70 | Conserved hypothetical protein | H. parasuis |
HIBPF01260 | 38.5 | NA | Unknown protein, no known homologs | NA |
2, phage region 3† | ||||
HIBPF04833 | 40.1 | NA | Unknown protein, no known homologs | NA |
HIBPF04834 | 37.6 | 54 | Conserved hypothetical protein | H. parasuis |
3 | ||||
HIBPF05360 | 27.5 | NA | Unknown protein, no known homologs | NA |
4, phage region 4† | ||||
HIBPF09220 | 41.2 | 55 | Conserved hypothetical protein | Haemophilus haemolyticus |
HIBPF09642 | 31.2 | NA | Unknown protein, no known homologs | NA |
HIBPF09722 | 31.9 | 80 | Conserved hypothetical protein | H. parasuis |
5, phage region 5† | ||||
HIBPF13250 | 44.1 | 75 | Conserved hypothetical protein | H. parasuis |
6 | ||||
HIBPF16620 | 37.5 | 65 | Adenine-specific methyltransferase (pseudo) | Mannheimia haemolytica |
HIBPF16630 | 38.1 | 71 | HNH endonuclease | M. haemolytica |
7 | ||||
HIBPF17711 | 32.6 | 53 | Conserved hypothetical protein | Escherichia coli |
HIBPF17712 | 26.5 | 49 | Conserved hypothetical protein | N. meningitidis |
8 | ||||
HIBPF18000 | 28.1 | 52 | DNA methyltransferase | Macrococcus caseolyticus |
HIBPF18010 | 26.7 | 56 | DNA methyltransferase | M. caseolyticus |
HIBPF18040 | 30.3 | 44 | Restriction endonuclease | M. caseolyticus |
9 | ||||
HIBPF19140 | 35.3 | 100 | HadA trimeric autotransporter adhesin | Previously identified in HaeBPF |
HIBPF20030 | 36.1 | 77 | Antibiotic biosynthesis monooxygenase | Aggregatibacter aphrophilus |
*Putative product based on closest homologue in public databases, shown by percent amino acid identity. Percentage G+C content given for each coding sequence. ID, identification; NA, not applicable. †Genes occur within regions of bacteriophage.