Table 1. Putative Open Reading Frames deduced from SOCP genome sequences and their predicted functions.
ORF | Start | End | Strand | Length [aa] a | IP b | Mw [kDa] c | Putative RBS d and start codon | Putative function | Best hit with Blast ‘Locus tag’ | ORF in Cp-1 | # of identical aa/ size of the alignment [% aa identity] | Length [aa] | E-value | Accession number [GenBank] e |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 376 | 642 | + | 88 | 4.4 | 10.4 | AAAGGAGAAAGAAAACATG | Hypothetical protein | Cp-1, p01 | 1 | 88/88[100%] | 88 | 3E-57 | NP_044813.1 |
2 | 657 | 947 | + | 96 | 6.6 | 11.7 | AAAGGAGATAATAAAAATG | Hypothetical protein | Cp-1, p02 | 2 | 96/96[100%] | 96 | 5E-61 | NP_044814.1 |
3 | 1074 | 1346 | + | 90 | 4.7 | 10.5 | AAAGGAGTAAAAGCACTTG | Hypothetical protein | Cp-1, p03 | 3 | 63/64[98%] | 64 | 3E-36 | NP_044815.1 |
4 | 1351 | 2043 | + | 230 | 10.1 | 26.7 | AAGGGGTGTAATTAAATG | Terminal protein | Cp-1, p04 | 4 | 229/230[99%] | 230 | 7E-165 | NP_044816.1 |
5 | 2040 | 2609 | + | 189 | 4.6 | 22.1 | AAAGAAGCGAGGGAAGAAGTG | DNA polymerase | Cp-1, p05 | 5 | 178/179[99%] | 568 | 3E-123 | NP_044817.1 |
6 | 2681 | 3745 | + | 354 | 6.8 | 40.8 | AATCTTAGATGAAAAGGTG | DNA polymerase | Cp-1, p05 | 5 | 341/354[96%] | 568 | 0 | NP_044817.1 |
7 | 3702 | 4148 | + | 148 | 9.2 | 17.6 | AAAGGGGGTACGCTGATTTATG | Hypothetical protein | Cp-1, p06 | 6 | 148/148[100%] | 148 | 1E-100 | NP_044818.1| |
8 | 4141 | 4653 | + | 170 | 4.8 | 18.9 | AAACGGAGATAAACAAAATG | Hypothetical protein | Cp-1, p07 | 7 | 170/170[100%] | 170 | 5E-118 | NP_044819.1 |
9 | 4875 | 5165 | + | 96 | 4.2 | 10.5 | AAAGGAGAGGGCTATG | Scaffolding protein | Cp-1, p08 | 8 | 95/96[99%] | 96 | 4E-59 | NP_044820.1 |
10 | 5409 | 6506 | + | 365 | 5.4 | 41.7 | AAGAGGGAGAAGAATAGAATG | Major head protein | Cp-1, p09 | 9 | 352/365[96%] | 365 | 0 | NP_044821.1 |
11 | 6563 | 7576 | + | 337 | 5.3 | 39.5 | AAAGGGGACTAAATG | Connector protein | Cp-1, p11 | 10 | 337/337[100%] | 337 | 0 | NP_044823.1 |
12 | 7563 | 8210 | + | 215 | 5.1 | 24.7 | AAAAGGAGGGGACAATCATTG | Collar protein | Cp-1, p12 | 11 | 192/194[99%] | 194 | 2E-137 | NP_044824.1 |
13 | 8223 | 8807 | + | 194 | 8.6 | 22.8 | AAAGGTGTATAGATG | Hypothetical protein | Cp-1, p13 | 12 | 194/194[100%] | 194 | 5E-140 | NP_044825.1 |
14 | 8804 | 9118 | + | 104 | 5.8 | 11.9 | AAAGAGGACATGAAAACCTATG | Hypothetical protein | Cp-1, p14 | 13 | 104/104[100%] | 104 | 7E-67 | NP_044826.1 |
15 | 9102 | 9989 | + | 295 | 4.9 | 32.9 | AAAAAGAGGTAGAAACAAATG | Hypothetical protein | Cp-1, p15 | 14 | 295/295[100%] | 295 | 0 | NP_044827.1 |
16 | 10011 | 10787 | + | 258 | 5.0 | 29.4 | AAAGGATTTTAAAACATG | Hypothetical protein | Cp-1, p16 | 15 | 192/207[93%] | 288 | 4E-132 | NP_044828.1 |
17 | 10787 | 11548 | + | 253 | 6.0 | 28.4 | AGGAGGTATCTAATG | Hypothetical protein | Cp-1, p18 | 16 | 253/253[100%] | 253 | 0 | NP_044830.1 |
18 | 11515 | 13077 | + | 520 | 5.7 | 59.0 | AAAGTCGGGTCAATG | Tail protein | Cp-1, p19 / Cp-1, p20 | 17 18 | 210/224[94%]/ 236/237[99%] | 230 /237 | 6e-149/ 2E-164 | NP_044831.1 / NP_044832.1 |
19 | 13081 | 14919 | + | 612 | 4.8 | 67.5 | AAAGGGTAAACAATG | Tail protein | Cp-1, p21 | 19 | 582/583[99%] | 586 | 0 | NP_044833.1 |
20 | 14993 | 16039 | + | 348 | 7.6 | 40.8 | AAATGGTACAATCCGCAGAAAATG | Encapsidation protein | Cp-1, p23 | 20 | 348/348[100%] | 360 | 0 | NP_044835.1 |
21 | 16029 | 16433 | + | 134 | 7.9 | 15.5 | AGGTTATCAATCATG | Holin protein | Cp-1, p24 | 21 | 134/134[100%] | 134 | 1E-89 | NP_044836.1 |
22 | 16433 | 17452 | + | 339 | 4.6 | 39.2 | AAAGGAGAAAAGAAATAATG | Lysozyme | Cp-1, p25 | 22 | 339/339[100%] | 339 | 0 | NP_044837.1 |
23 | 17480 | 17896 | - | 139 | 5.62 | 15.8 | AAAACGTAGGGGGTTAATACTATG | Hypothetical protein | SP058_00395 | 24/65[37%] | 234 | 6E+00 | YP_008239483.1 | |
24 | 17901 | 18143 | - | 80 | 9.9 | 9.4 | AAATTGAGGTATTAAGAAAATG | Hypothetical protein | Cp-1, p26 [orfc] | c | 80/80[100%] | 80 | 5E-50 | NP_044838.1 |
25 | 18148 | 18423 | - | 91 | 5.1 | 10.9 | AAGGGACGGTTACTAGATG | Hypothetical protein | AGR66263 | 14/44[32%] | 410 | 8E-01 | gb|AGR66263.1 | |
26 | 18424 | 18693 | - | 89 | 7.7 | 10.8 | ACAAATAGGAGGGTAAACATG | Hypothetical protein | Cp-1, p27[orfb] | b | 89/89[100%] | 89 | 5E-58 | NP_044839.1 |
27 | 18704 | 18973 | - | 89 | 5.0 | 10.2 | AAAGAGGTATAACAAAATG | Hypothetical protein | Cp-1, p28 [orfa] | a | 60/61[98%] | 62 | 7E-35 | NP_044840.1 |
a Number of amino acids (aa),
b IP, isoelectric point and
c MM, molecular mass.
d RBS, ribosomal binding site. Bases in bold correspond to nucleotides identical to the RBS consensus; lowercase indicates.