TABLE 1.
| ORF label | Gene name | Directionᵇ | Left end position no. | Right end position no. | No. of amino acids | Function of deduced protein | e value | % Identityᵃ | Accession no. and sequence similarityᶜ |
|---|---|---|---|---|---|---|---|---|---|
| attL | | | 1 | 11 | | | | | |
| L01 | int | − | 30 | 1343 | 437 | Integrase | 0.0 | 70 (All) | BAB19626; putative integrase, prophage VT1-Sa (E. coli) |
| | | | | | | | 0.0 | 70 (All) | AAG57258; putative integrase, prophage CP-933V (E. coli) |
| L02 | xis | − | 1399 | 1635 | 78 | Excisionase | 3e−16 | 53 (66) | NP_049462; putative excisionase, phage 933W (E. coli) |
| | | | | | | | 3e−16 | 53 (66) | NP_050501; putative excisionase, prophage VT2-Sa (E. coli) |
| L03 | | − | 1794 | 2159 | 121 | | 1e−22 | 63 (80) | BAB36434; hypothetical protein (E. coli RIMD 0509952) |
| | | | | | | | 2e−13 | 60 (60) | AAG57256; hypothetical protein, prophage CP-933V (E. coli) |
| L04 | | − | 2068 | 2511 | 147 | | 7e−04 | 34 (100) | NP_050134; hypothetical protein, prophage φadh (Lactobacillus gasseri) |
| L05 | | − | 2480 | 3307 | 275 | Protease | e−124 | 79 (All) | NP_047914; putative serine protease (Y. pestis) |
| L06 | | − | 3669 | 4307 | 212 | | | | |
| L07 | | − | 4247 | 4450 | 67 | | | | |
| L08 | | − | 4632 | 4997 | 121 | | | | |
| L09 | | + | 5106 | 5285 | 59 | | 5e−06 | 78 (28) | AAG55461; hypothetical protein, prophage CP-933M (E. coli) |
| L10 | | − | 5575 | 5991 | 138 | Repressor | 1e−06 | 37 (69) | AAC74643; DicA repressor (E. coli) |
| L11 | cI | − | 6095 | 6751 | 218 | CI repressor | 1e−45 | 42 (All) | NP_059606; repressor C2, phage P22 (S. enterica) |
| L12 | | + | 6863 | 7090 | 75 | | | | |
| L13 | | − | 7104 | 7775 | 223 | | | | |
| L14 | roi | + | 7876 | 8583 | 235 | DNA binding | 1e−32 | 55 (136) | NP_112063; DNA-binding protein Roi, phage HK620 (E. coli) |
| | | | | | | | 5e−32 | 52 (140) | AAD04652; Roi, phage H-19B (E. coli) |
| L15 | | + | 8636 | 9460 | 274 | Nucleic acid binding | 5e−20 | 31 (194) | B82549; hypothetical protein XF2506 (X. fastidiosa) |
| | | | | | | | 5e−14 | 33 (127) | AAG55918; putative antirepressor, prophage CP-933N (E. coli) |
| | | | | | | | 8e−14 | 38 (118) | NP_046925; antirepressor AntB, phage N15 (E. coli) |
| L16 | | + | 9457 | 10116 | 219 | | 9e−31 | 48 (153) | S34345; hypothetical protein 179 (Shigella flexneri) |
| | | | | | | | 1e−12 | 43 (112) | AAG54596; putative regulator, prophage CP-933I (E. coli) |
| L17 | | + | 10334 | 11260 | 308 | DNA binding | 9e−31 | 42 (196) | AAK16983; hypothetical protein, prophage CP-933P (E. coli) |
| | | | | | | | 2e−28 | 33 (227) | AAG55470; hypothetical protein, prophage CP-933M (E. coli) |
| L18 | P | + | 11391 | 12149 | 252 | DNA replication | 2e−24 | 31 (207) | BAB12748; DNA replication protein DnaC (Buchnera aphidicola) |
| | | | | | | | 1e−21 | 27 (210) | AAG55471; DNA replication factor, prophage CP-933M (E. coli) |
| L19 | | + | 12146 | 13537 | 463 | Helicase | 6e−87 | 39 (447) | NP_059611; helicase, phage P22 (S. enterica) |
| | | | | | | | 2e−86 | 39 (451) | NP_037740; gp55, prophage HK97 (E. coli) |
| L20 | | + | 13549 | 13791 | 80 | | | | |
| L21 | Q | + | 13791 | 14171 | 126 | Late antiterminator | 6e−25 | 50 (119) | AAG55890; Q antiterminator, prophage CP-933N (E. coli) |
| | | | | | | | 7e−24 | 47 (120) | O48429; antitermination protein Q, prophage H-19B (E. coli) |
| | | | | | | | 3e−23 | 44 (126) | CAB39299; antitermination protein Q, phage 933W (E. coli) |
| L22 | | + | 14266 | 15495 | 409 | | 6e−04 | 32 (127) | CAA22431; putative chromatin assembly factor (Schizosaccharomyces pombe) |
| L23 | | + | 15711 | 15908 | 65 | | 5e−29 | 98 (63) | AAG56133; hypothetical protein, prophage CP-933O (E. coli) |
| | | | | | | | 4e−28 | 96 (63) | BAB35615; hypothetical protein (E. coli RIMD 0509952) |
| | | | | | | | 1e−19 | 97 (49) | AAG55892; hypothetical protein, prophage CP-933N (E. coli) |
| L24 | dam | + | 16059 | 17117 | 352 | DNA methylase | 0.0 | 86 (All) | AAG56134; adenine methyltransferase, prophage CP-933O (E. coli) |
| | | | | | | | 0.0 | 86 (All) | BAB35203; DNA methylase (E. coli RIMD 0509952) |
| | | | | | | | 2e−81 | 55 (282) | NP_046948; adenine-specific methylase, phage N15 (E. coli) |
| | ileZ | + | 17158 | 17233 | | tRNA | | | |
| | argO | + | 17335 | 17411 | | tRNA | | | |
| L25 | stxA2e | + | 17502 | 18461 | 319 | rRNA N-glycosidase | 0.0 | 99 (All) | CAA57173; Stx2e, A subunit (E. coli) |
| L26 | stxB2e | + | 18474 | 18737 | 87 | Receptor binding | 3e−44 | 100 (All) | CAA57176; Stx2e, B subunit (E. coli) |
| L27 | | − | 18788 | 18985 | 65 | | 2e−28 | 96 (61) | CAC05562; hypothetical protein (E. coli T4/97) |
| | | | | | | | 5e−25 | 96 (55) | CAC05572; hypothetical protein (E. coli H.I.8) |
| L28 | S | + | 19114 | 19548 | 144 | Holin | 8e−06 | 36 (86) | AAF80841; ORF 89, prophage D3 (Pseudomonas aeruginosa) |
| | | | | | | | 0.002 | 28 (123) | BAA36235; holin ORF 9, phage φCTX (P. aeruginosa) |
| L29 | S | + | 19538 | 19813 | 91 | Holin | 0.005 | 34 (85) | H83531; hypothetical protein PA0909 (P. aeruginosa) |
| | | | | | | | 0.049 | 32 (86) | BAA36236; holin ORF 10, phage φCTX (P. aeruginosa) |
| L30 | R | + | 19816 | 20193 | 125 | Endolysin | 3e−25 | 48 (All) | CAA09701; endolysin gp 19, phage PS3 (S. enterica serovar Typhimurium) |
| | | | | | | | 3e−12 | 38 (119) | AAC38580; peptidoglycan lytic enzyme (Listeria monocytogenes) |
| L31 | | + | 20136 | 20351 | 71 | | | | |
| L32 | | + | 20630 | 20785 | 51 | | | | |
| L33 | | + | 20985 | 21332 | 115 | | | | |
| L34 | | + | 21394 | 21744 | 116 | | 1e−37 | 62 (All) | CAB58450; hypothetical protein ORF 7 (Xenorhabdus nematophilus) |
| | | | | | | | 2e−11 | 39 (111) | AAB59284; putative holin, prophage φ105 (B. subtilis) |
| cos | | | 21806 | 21815 | | | | | |
| L35 | | + | 21860 | 22330 | 156 | Terminase small subunit | 2e−11 | 31 (138) | AAG50266; hypothetical protein, phage GMSE-1 (Sodalis) |
| | | | | | | | 2e−07 | 31 (96) | AAG55950; hypothetical protein, prophage CP-933C (E. coli) |
| | | | | | | | 2e−07 | 31 (96) | BAB35020; terminase small subunit (E. coli RIMD 0509952) |
| L36 | | + | 22345 | 24057 | 570 | Terminase large subunit | e−178 | 77 (293) | P75978; hypothetical protein YmfN (E. coli) |
| | | | | | | | e−141 | 45 (All) | NP_061498; terminase, phage D3 (P. aeruginosa) |
| L37 | | + | 24069 | 24251 | 60 | | 2e−27 | 95 (All) | P75979; hypothetical protein YmfR (E. coli) |
| L38 | | + | 24251 | 25492 | 413 | Portal protein | 5e−63 | 84 (135) | P75980; hypothetical protein YmfO (E. coli) |
| | | | | | | | 4e−52 | 33 (372) | NP_108600; head portal protein (M. loti) |
| | | | | | | | 1e−44 | 32 (375) | NP_037699; portal protein, phage HK97 (E. coli) |
| L39 | | + | 25470 | 26120 | 216 | Prohead protease | 2e−24 | 40 (172) | AAF13181; putative prohead protease (Rhodobacter capsulatus) |
| | | | | | | | 2e−18 | 35 (196) | NP_037665; head maturation protease, prophage HK022 (E. coli) |
| L40 | | + | 26134 | 27357 | 407 | Major capsid protein | 2e−73 | 36 (All) | NP_108602; phage major capsid protein gp36 (M. loti) |
| | | | | | | | 2e−58 | 34 (All) | AAF27364; phage φC31 gp36-like protein (H. influenzae) |
| L41 | | + | 27404 | 27727 | 107 | | 6e−04 | 29 (104) | AAG55948; hypothetical protein, prophage CP-933C (E. coli) |
| L42 | | + | 28067 | 28630 | 187 | | | | |
| L43 | | + | 28627 | 29190 | 187 | | | | |
| L44 | | + | 29358 | 30854 | 498 | Tail sheath protein | 5e−78 | 35 (All) | P44233; putative tail sheath protein, prophage FLUMU (H. influenzae) |
| | | | | | | | 7e−75 | 35 (All) | NP_050643; sheath protein gpL, phage Mu (E. coli) |
| L45 | | + | 30854 | 31210 | 118 | | | | |
| L46 | | + | 31210 | 31539 | 109 | | | | |
| L47 | | + | 31624 | 33573 | 649 | Tail length determinator | 9e−41 | 26 (493) | F82769; tail protein XF0730 (X. fastidiosa) |
| | | | | | | | 3e−13 | 22 (473) | NP_046782; putative tail length determinator gpT, phage P2 (E. coli) |
| L48 | | + | 33589 | 34980 | 463 | DNA binding | 1e−23 | 26 (450) | P71389; DNA circulation protein, prophage FLUMU (H. influenzae) |
| | | | | | | | 3e−8 | 23 (479) | NP_050647; DNA circulation protein N, phage Mu (E. coli) |
| L49 | | + | 34977 | 36032 | 351 | | 4e−40 | 30 (All) | NP_050648; tail protein P, phage Mu (E. coli) |
| L50 | | + | 36032 | 36565 | 177 | Baseplate assembly | 3e−12 | 31 (170) | AAF41502; putative baseplate assembly protein V (Neisseria meningitidis) |
| | | | | | | | 3e−11 | 26 (174) | NP_050649; baseplate assembly protein gp45, phage Mu (E. coli) |
| L51 | | + | 36571 | 36984 | 137 | | 4e−14 | 40 (130) | BAB38409; hypothetical protein (E. coli RIMD 0509952) |
| | | | | | | | 8e−11 | 41 (112) | NP_050650; gp46, phage Mu (E. coli) |
| L52 | | + | 36977 | 38059 | 360 | | 3e−78 | 55 (260) | P75981; hypothetical protein YmfP, prophage E14 (E. coli) |
| | | | | | | | 1e−29 | 31 (330) | NP_050651; gp47, phage Mu (E. coli) |
| L53 | | + | 38059 | 38649 | 196 | | 2e−59 | 55 (194) | P75982; hypothetical protein YmfQ, prophage E14 (E. coli) |
| | | | | | | | 1e−05 | 26 (182) | NP_050652; gp48, phage Mu (E. coli) |
| L54 | | + | 38636 | 39634 | 332 | Tail fiber protein | 2e−33 | 39 (247) | A42463; hypothetical protein Bcv (Shigella boydii) |
| | | | | | | | 7e−20 | 31 (305) | NP_046775; putative tail fiber protein gpH, phage P2 (E. coli) |
| L55 | | + | 39637 | 40185 | 182 | Tail fiber assembly | 3e−37 | 40 (All) | C42463; hypothetical protein B177 (S. boydii) |
| | | | | | | | 2e−29 | 36 (All) | NP_050654; tail fiber assembly protein U, phage Mu (E. coli) |
| L56 | | + | 40209 | 41681 | 490 | Tail fiber protein | 2e−20 | 32 (235) | BAB35654; putative tail fiber protein (E. coli RIMD 0509952) |
| | | | | | | | 3e−20 | 32 (235) | AAK16943; putative tail fiber protein, prophage CP-933P (E. coli) |
| L57 | | + | 41678 | 42244 | 188 | | 5e−29 | 35 (All) | AAF63233; ORF 191A, prophage P-EibA (E. coli) |
| L58 | | + | 42399 | 42575 | 58 | | | | |
| attR | | | 42565 | 42575 | | | | | |

ᵃ Numbers in parentheses represent the whole number of amino acids from which the sequence identity is calculated. All, whole length identity. Empty fields in the table indicate that no homologous sequences were available.
ᵇ −, lower strand; +, upper strand.
ᶜ The GenBank database was used for homolog searches.