Table 2.
AH2 genome annotation
Gene | Start | End | Putative function | Strand | Predicted ribosome binding site and start codon | Length (amino acids) | Closest relative | Alignment region (amino acids) | Percent identity | Source | GenBank accession number |
---|---|---|---|---|---|---|---|---|---|---|---|
1 |
619 |
1035 |
unknown |
- |
AAGGAAAcgacATG |
138 |
hypothetical protein Nazgul32 |
12-130/130 |
29 |
Burkholderia phage BcepNazgul |
NP_918966.1 |
2 |
1073 |
1423 |
unknown |
- |
AGGGGGGAAcggccATG |
116 |
conserved hypothetical protein |
1-116/116 |
72 |
Burkholderia multivorans CGD1 |
ZP_03586942.1 |
3 |
1501 |
1818 |
unknown |
- |
GGATTActgaccATG |
105 |
family 2 glycosyl transferase |
292-387/387 |
32 |
Haloterrigena turkmenica DSM 5511 |
YP_003404522.1 |
4 |
1809 |
2024 |
unknown |
+ |
GAGAAAtagagATG |
71 |
mobilization protein mbeA |
190-237/325 |
37 |
Escherichia coli E128010 |
EFZ49597.1 |
5 |
2021 |
2578 |
unknown |
- |
AGGGGttacatcATG |
185 |
hypothetical protein Nazgul06 |
88-158/330 |
44 |
Burkholderia phage BcepNazgul |
NP_919015.1 |
6 |
2728 |
2877 |
unknown |
- |
AGGTGcaaaaATG |
49 |
hypothetical protein BoklE_20935 |
6-38/38 |
48 |
Burkholderia oklahomensis EO147 |
ZP_02357945.1 |
7 |
2874 |
3002 |
unknown |
- |
AGGGGcgatcATG |
42 |
polysaccharide deacetylase |
21-60/287 |
35 |
Bacillus mycoides Rock3-17 |
ZP_04156726.1 |
8 |
3071 |
3325 |
unknown |
- |
AAAGAgctATG |
84 |
major facilitator superfamily MFS_1 |
131-209/467 |
37 |
Burkholderia gladioli BSR3 |
YP_004349464.1 |
9 |
3322 |
3579 |
unknown |
- |
GGAGTAtccgccATG |
85 |
hypothetical protein Plabr_1809 |
308-361/603 |
31 |
Planctomyces brasiliensis DSM 5305 |
YP_004269441.1 |
10 |
3663 |
3911 |
unknown |
- |
GGGGGTAtgacATG |
82 |
HAD-superfamily hydrolase |
70-119/268 |
38 |
Methanosphaerula palustris E1-9c |
YP_002465429.1 |
11 |
3913 |
4314 |
unknown |
- |
AGGGGGAGtaacggccATG |
133 |
hypothetical protein Nazgul09 |
1-129/141 |
59 |
Burkholderia phage BcepNazgul |
NP_919018.1 |
12 |
4320 |
4805 |
unknown |
- |
AGGGGttacatcATG |
161 |
hypothetical protein Nazgul10 |
1-151/160 |
74 |
Burkholderia phage BcepNazgul |
NP_919019.2 |
13 |
4846 |
5454 |
unknown |
- |
AAAAAGGGGtttttgacATG |
202 |
194 gene product |
101-187/188 |
43 |
Salmonella phage PVP-SE1 |
YP_004894001.1 |
14 |
6021 |
6302 |
unknown |
+ |
AAGGAGcaatcATG |
93 |
hypothetical protein Nazgul13 |
3-93/93 |
41 |
Burkholderia phage BcepNazgul |
NP_919022.1 |
15 |
6311 |
6550 |
unknown |
+ |
AGGCGGtcgtATG |
79 |
hypothetical protein BDB_mp60418 |
1-67/67 |
45 |
blood disease bacterium R229 |
CCA83252.1 |
16 |
6707 |
7015 |
unknown |
+ |
ACACGAcaccATG |
102 |
hypothetical protein MC7420_4162 |
43-84/88 |
45 |
Microcoleus chthonoplastes PCC 7420 |
ZP_05027813.1 |
17 |
7012 |
7218 |
unknown |
+ |
GAAGGtgccggcATG |
68 |
hypothetical protein Cy51472DRAFT_4929 |
53-81/152 |
45 |
Cyanothece sp. ATCC 51472 |
ZP_08976132.1 |
18 |
7215 |
8069 |
unknown |
+ |
AGGAAAGgaaATG |
284 |
hypothetical protein TK90_2682 |
5-175/177 |
45 |
Thioalkalivibrio sp. K90mix |
YP_003494636.1 |
19 |
8123 |
8407 |
unknown |
+ |
GAGAAGGcacacacATG |
94 |
GTP-binding protein |
150-232/1016 |
29 |
Gemmata sp. Wa1-1 |
AAX07516.1 |
20 |
8499 |
9128 |
DNA polymerase III β subunit |
+ |
GAACGGTGAGcttATG |
209 |
hypothetical protein Nazgul21 |
24-216/237 |
24 |
Burkholderia phage BcepNazgul |
NP_918955.1 |
21 |
9149 |
9343 |
unknown |
+ |
AGGAGAAAGgagATG |
64 |
hypothetical protein R2APBS1DRAFT_0277 |
9-63/344 |
31 |
Rhodanobacter sp. 2APBS1 |
ZP_08951135.1 |
22 |
9346 |
9645 |
unknown |
+ |
GGGGGTAtctgaccATG |
99 |
hypothetical protein PFL_2108 |
3-63/70 |
33 |
Pseudomonas fluorescens Pf-5 |
YP_259216.1 |
23 |
9642 |
9938 |
unknown |
+ |
GGAGGGtcaTTG |
98 |
aspA gene product |
38-122/317 |
32 |
Rhodospirillum centenum SW |
YP_002297975.1 |
24 |
9935 |
10171 |
unknown |
+ |
GGGGcttggcgtATG |
78 |
hypothetical protein Nazgul19 |
18-97/97 |
39 |
Burkholderia phage BcepNazgul |
NP_919028.2 |
25 |
10256 |
10711 |
pyrophosphohydrolase |
+ |
AAGGAAAggacATG |
151 |
hypothetical protein BCAS0549 |
15-139/140 |
60 |
Burkholderia cenocepacia J2315 |
YP_002153936.1 |
26 |
10720 |
10977 |
unknown |
+ |
GAGGccggccATG |
85 |
hypothetical protein AGRO_3677 |
208-273/300 |
41 |
Agrobacterium sp. ATCC 31749 |
ZP_08529674.1 |
27 |
11082 |
12074 |
unknown |
+ |
AGGAGAAatcGTG |
330 |
hypothetical protein |
8-95/113 |
48 |
Escherichia phage vB_EcoM_ECO1230-10 |
ADE87960.1 |
28 |
12101 |
13075 |
transcriptional regulator |
+ |
AAGGAAccgacATG |
324 |
hypothetical protein Pnap_4317 |
25-252/342 |
45 |
Polaromonas naphthalenivorans CJ2 |
YP_973341.1 |
29 |
13078 |
13497 |
unknown |
+ |
GCTGACGAtctctgaccATG |
139 |
hypothetical protein SCHCODRAFT_69044 |
549-631/848 |
33 |
Schizophyllum commune H4-8 |
XP_003030158.1 |
30 |
13574 |
13768 |
transcriptional regulator |
+ |
AGGGAtttttcATG |
64 |
hypothetical protein APT_2164 |
9-65/75 |
53 |
Acetobacter pasteurianus NBRC 101655 |
GAB28674.1 |
31 |
13768 |
14031 |
transcriptional regulator |
+ |
AAGCGGAGccgtcctgATG |
87 |
hypothetical protein Bcep1808_2468 |
2-85/86 |
73 |
Burkholderia vietnamiensis G4 |
YP_001120302.1 |
32 |
14064 |
14450 |
Vsr endonuclease |
- |
GGAGGAatgATG |
128 |
DNA mismatch endonuclease Vsr |
15-141/141 |
65 |
Methylocella silvestris BL2 |
YP_002360880.1 |
33 |
14450 |
15025 |
excinuclease |
- |
AACAGAGttgcagcGTG |
191 |
Excinuclease ABC C subunit domain protein |
3-183/192 |
58 |
Pseudomonas syringae pv. lachrymans str. M301315 |
EGH83133.1 |
34 |
15038 |
15892 |
restriction endonuclease |
- |
GGCAAAGGtcgccgcATG |
284 |
conserved hypothetical protein |
1-285/285 |
70 |
Ralstonia solanacearum CMR15 |
CBJ36134.1 |
35 |
15889 |
17031 |
cytosine methylase |
- |
AGGGGGttcgcGTG |
380 |
DNA-cytosine methyltransferase |
1-385/385 |
66 |
Ralstonia solanacearum CMR15 |
CBJ36133.1 |
36 |
17107 |
17199 |
unknown |
+ |
ACGAAGccttgcttaATG |
30 |
resistance-nodulation-cell division acriflavin:proton (H+) antiporter |
850-868/1014 |
68 |
Bacillus pumilus SAFR-032 |
YP_001486844.1 |
37 |
17511 |
18842 |
integrase |
+ |
GAAGGAGGtcttgtagcactgATG |
443 |
chorismate mutase family protein |
1-362/386 |
62 |
Phaeobacter gallaeciensis BS107 |
ZP_02147383.1 |
38 |
18990 |
19412 |
unknown |
+ |
AAGGAGGAatcATG |
140 |
hypothetical protein Dda3937_00584 |
60-163/163 |
40 |
Dickeya dadantii 3937 |
YP_003882998.1 |
39 |
19462 |
20001 |
unknown |
- |
GGAGAttttcATG |
179 |
hypothetical protein PcarcW_20243 |
68-197/198 |
67 |
Pectobacterium carotovorum subsp. carotovorum WPP14 |
ZP_03833564.1 |
40 |
20034 |
20264 |
Rz1 |
- |
GGAGGAcgccATG |
76 |
hypothetical protein BURPS668_A2333 |
27-81/81 |
62 |
Burkholderia pseudomallei 668 |
YP_001063327.1 |
41 |
20277 |
20588 |
Rz |
- |
AGGGGGccgtATG |
103 |
hypothetical protein ORF004 |
2-101/101 |
35 |
Pseudomonas phage 73 |
YP_001293411.1 |
42 |
20585 |
21091 |
lysin |
- |
AAGGAGAAGAacaGTG |
168 |
hypothetical protein HMPREF0005_02034 |
1-161/163 |
60 |
Achromobacter xylosoxidans C54 |
EFV83908.1 |
43 |
21088 |
21339 |
holin |
- |
GAAGGGGtggacccgaccATG |
83 |
conserved exported hypothetical protein |
1-83/85 |
35 |
blood disease bacterium R229 |
CCA83792.1 |
44 |
21336 |
21665 |
unknown |
- |
AAGGGGccagaagATG |
109 |
hypothetical protein HDEF_1702 |
3-87/92 |
31 |
Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) |
YP_002924457.1 |
45 |
21807 |
22121 |
unknown |
- |
AAGGAGAAAtcacATG |
104 |
hypothetical protein PPL19_05085 |
1-103/161 |
53 |
Pseudomonas psychrotolerans L19 |
ZP_09283635.1 |
46 |
22133 |
23731 |
tail fiber protein |
- |
GGAACGtggacATG |
532 |
hypothetical protein Bpse112_32291 |
69-240/282 |
45 |
Burkholderia pseudomallei 112 |
ZP_02502292.1 |
47 |
23809 |
26178 |
tail assembly protein |
- |
AGAGGAAGAcaaATG |
789 |
hypothetical protein HCH_05649 |
2-727/728 |
34 |
Hahella chejuensis KCTC 2396 |
YP_436732.1 |
48 |
26175 |
26375 |
tail assembly protein |
- |
GGGGGCAAgaaATG |
66 |
hypothetical protein HCH_05650 |
4-67/71 |
50 |
Hahella chejuensis KCTC 2396 |
YP_436733.1 |
49 |
26372 |
26608 |
tail assembly protein |
- |
GAGGActgatcATG |
78 |
putative transmembrane protein |
7-82/82 |
47 |
Rhodobacter sp. SW2 |
ZP_05845047.1 |
50 |
26618 |
27418 |
tail assembly protein |
- |
AGGGGGAtcaaacaATG |
266 |
hypothetical protein HCH_05652 |
1-268/269 |
39 |
Hahella chejuensis KCTC 2396 |
YP_436735.1 |
51 |
27415 |
29100 |
tail assembly protein |
- |
AAGAAGAtcacTTG |
561 |
hypothetical protein HCH_05654 |
35-560/563 |
32 |
Hahella chejuensis KCTC 2396 |
YP_436736.1 |
52 |
29097 |
30158 |
unknown |
- |
GACGAGGtttgaaATG |
353 |
hypothetical protein D11S_2171 |
1-326/327 |
23 |
Aggregatibacter actinomycetemcomitans D11S-1 |
YP_003256741.1 |
53 |
30160 |
31122 |
unknown |
- |
GAGCGAGGcataacGTG |
320 |
hypothetical protein XALc_0225 |
1-194/307 |
35 |
Xanthomonas albilineans GPE PC73 |
YP_003374757.1 |
54 |
31124 |
35860 |
tail tape measure |
- |
GGACTGAAcggaaATG |
1578 |
phage tape measure protein |
1-109, 452-1680/1683 |
33 |
Sinorhizobium meliloti AK83 |
YP_004548730.1 |
55 |
35853 |
36538 |
tail protein |
- |
AAGGGGGCGagcATG |
228 |
pre-tape measure frameshift protein G-T |
1-242/243 |
34 |
Burkholderia phage BcepNazgul |
NP_918998.2 |
56 |
36098 |
36538 |
tail protein |
- |
AAGGGGGCGagcATG |
146 |
hypothetical protein Sinme_1368 |
4-126/142 |
34 |
Sinorhizobium meliloti AK83 |
YP_004548729.1 |
57 |
36549 |
37337 |
unknown |
- |
GAGGAAtcaatcATG |
262 |
hypothetical protein Sinme_1367 |
1-257/262 |
45 |
Sinorhizobium meliloti AK83 |
YP_004548728.1 |
58 |
37385 |
37897 |
minor tail protein |
- |
GAGGAAAGtataATG |
170 |
hypothetical protein Sinme_1366 |
7-177/177 |
50 |
Sinorhizobium meliloti AK83 |
YP_004548727.1 |
59 |
37897 |
38517 |
unknown |
- |
GACGCAGGtttgccgacATG |
206 |
hypothetical protein Nazgul55 |
5-198/205 |
49 |
Burkholderia phage BcepNazgul |
NP_918988.2 |
60 |
38514 |
38873 |
unknown |
- |
GAGGCGcgtgATG |
119 |
hypothetical protein Sinme_1364 |
3-120/125 |
38 |
Sinorhizobium meliloti AK83 |
YP_004548725.1 |
61 |
38886 |
39134 |
unknown |
- |
AAAGGAAccatcATG |
82 |
hypothetical protein Nazgul57 |
1-38/85 |
47 |
Burkholderia phage BcepNazgul |
NP_918990.1 |
62 |
39205 |
40233 |
major capsid protein |
- |
AAGGAGAAAGcaaaATG |
342 |
capsid protein E |
2-343/346 |
50 |
Burkholderia phage BcepNazgul |
NP_918991.1 |
63 |
40290 |
40688 |
decorator protein |
- |
AGGAGAAccatcATG |
132 |
decorator protein D |
4-123/131 |
49 |
Burkholderia phage BcepNazgul |
NP_918992.1 |
64 |
40743 |
42071 |
prohead protease |
- |
AGGACCAGAAccaATG |
442 |
prohead protease ClpP |
4-427/434 |
53 |
Burkholderia phage BcepNazgul |
NP_918994.2 |
65 |
42068 |
43591 |
portal protein |
- |
GGAAcccgtcgATG |
507 |
phage portal protein |
57-554/559 |
59 |
Staphylococcus phage SA1 |
ACZ55505.1 |
66 |
43736 |
43960 |
head-tail joining protein |
- |
GGACAAcactATG |
74 |
head-tail joining protein Lambda W |
13-76/76 |
56 |
Burkholderia phage BcepNazgul |
NP_918996.1 |
67 |
44097 |
46076 |
terminase large subunit |
- |
AAGAcctcgATG |
659 |
terminase large subunit TerL |
44-677/677 |
58 |
Burkholderia phage BcepNazgul |
NP_918997.2 |
68 |
46210 |
46803 |
terminase small subunit |
- |
GAAGGTGAtagcgATG |
197 |
TerS |
9-179/222 |
49 |
Burkholderia phage BcepNazgul |
NP_918999.1 |
69 |
46796 |
46990 |
transcriptional regulator |
- |
AGGAGTAcggtATG |
64 |
aminoglycoside phosphotransferase |
423-473/487 |
29 |
Frankia sp. EUN1f |
ZP_06416368.1 |
70 |
47047 |
47736 |
repressor |
- |
GAAAGGCAAGGcagcagcATG |
229 |
hypothetical protein Rvan_1213 |
14-180/242 |
36 |
Rhodomicrobium vannielii ATCC 17100 |
YP_004011581.1 |
71 |
47833 |
49446 |
helicase |
- |
ACGAcctcctgcgATG |
537 |
helicase |
11-507/522 |
52 |
Burkholderia phage BcepNazgul |
NP_919000.2 |
72 |
49443 |
49745 |
resolvase |
- |
GAAAGGAGGAttcactGTG |
100 |
conserved phage protein |
15-103/108 |
55 |
Burkholderia phage BcepNazgul |
NP_919001.2 |
73 |
49742 |
51796 |
DNA polymerase |
- |
ACGTcaccATG |
684 |
hypothetical protein ORF026 |
48-670/683 |
45 |
Pseudomonas phage 73 |
YP_001293433.1 |
74 |
51875 |
52609 |
single-stranded DNA binding protein |
- |
AAAGGTGAcaaaaATG |
244 |
conserved phage protein |
4-186/198 |
35 |
Staphylococcus phage SA1 |
ACZ55548.1 |
75 |
52655 |
53995 |
Cas4 superfamily exonuclease |
- |
GATCctctcgaccccATG |
446 |
conserved phage protein |
8-448/454 |
48 |
Burkholderia phage BcepNazgul |
NP_919005.2 |
76 |
54140 |
54538 |
unknown |
- |
GGAGAAatcATG |
132 |
hypothetical protein RUMHYD_01446 |
1-120/122 |
26 |
Blautia hydrogenotrophica DSM 10507 |
ZP_03782010.1 |
77 |
54718 |
55017 |
Cro |
+ |
AACGGAGAtcacaATG |
99 |
hypothetical protein Nazgul73 |
5-90/97 |
31 |
Burkholderia phage BcepNazgul |
NP_919007.1 |
78 | 55054 | 57534 | primase | + | GGAGGGgcaATG | 826 | DR0530-like primase | 1-843/843 | 49 | Burkholderia phage BcepNazgul | NP_919008.2 |