2015 Feb 18;10(2):e0118807. doi: 10.1371/journal.pone.0118807

Table 1. Putative Open Reading Frames deduced from SOCP genome sequences and their predicted functions.

| ORF | Start | End | Strand | Length [aa] a | IP b | Mw [kDa] c | Putative RBS d and start codon | Putative function | Best hit with Blast (locus tag) | ORF in Cp-1 | # of identical aa / size of the alignment [% aa identity] | Best hit length [aa] | E-value | Accession number [GenBank] e |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 376 | 642 | + | 88 | 4.4 | 10.4 | AAAGGAGAAAGAAAACATG | Hypothetical protein | Cp-1, p01 | 1 | 88/88 [100%] | 88 | 3E-57 | NP_044813.1 |
| 2 | 657 | 947 | + | 96 | 6.6 | 11.7 | AAAGGAGATAATAAAAATG | Hypothetical protein | Cp-1, p02 | 2 | 96/96 [100%] | 96 | 5E-61 | NP_044814.1 |
| 3 | 1074 | 1346 | + | 90 | 4.7 | 10.5 | AAAGGAGTAAAAGCACTTG | Hypothetical protein | Cp-1, p03 | 3 | 63/64 [98%] | 64 | 3E-36 | NP_044815.1 |
| 4 | 1351 | 2043 | + | 230 | 10.1 | 26.7 | AAGGGGTGTAATTAAATG | Terminal protein | Cp-1, p04 | 4 | 229/230 [99%] | 230 | 7E-165 | NP_044816.1 |
| 5 | 2040 | 2609 | + | 189 | 4.6 | 22.1 | AAAGAAGCGAGGGAAGAAGTG | DNA polymerase | Cp-1, p05 | 5 | 178/179 [99%] | 568 | 3E-123 | NP_044817.1 |
| 6 | 2681 | 3745 | + | 354 | 6.8 | 40.8 | AATCTTAGATGAAAAGGTG | DNA polymerase | Cp-1, p05 | 5 | 341/354 [96%] | 568 | 0 | NP_044817.1 |
| 7 | 3702 | 4148 | + | 148 | 9.2 | 17.6 | AAAGGGGGTACGCTGATTTATG | Hypothetical protein | Cp-1, p06 | 6 | 148/148 [100%] | 148 | 1E-100 | NP_044818.1 |
| 8 | 4141 | 4653 | + | 170 | 4.8 | 18.9 | AAACGGAGATAAACAAAATG | Hypothetical protein | Cp-1, p07 | 7 | 170/170 [100%] | 170 | 5E-118 | NP_044819.1 |
| 9 | 4875 | 5165 | + | 96 | 4.2 | 10.5 | AAAGGAGAGGGCTATG | Scaffolding protein | Cp-1, p08 | 8 | 95/96 [99%] | 96 | 4E-59 | NP_044820.1 |
| 10 | 5409 | 6506 | + | 365 | 5.4 | 41.7 | AAGAGGGAGAAGAATAGAATG | Major head protein | Cp-1, p09 | 9 | 352/365 [96%] | 365 | 0 | NP_044821.1 |
| 11 | 6563 | 7576 | + | 337 | 5.3 | 39.5 | AAAGGGGACTAAATG | Connector protein | Cp-1, p11 | 10 | 337/337 [100%] | 337 | 0 | NP_044823.1 |
| 12 | 7563 | 8210 | + | 215 | 5.1 | 24.7 | AAAAGGAGGGGACAATCATTG | Collar protein | Cp-1, p12 | 11 | 192/194 [99%] | 194 | 2E-137 | NP_044824.1 |
| 13 | 8223 | 8807 | + | 194 | 8.6 | 22.8 | AAAGGTGTATAGATG | Hypothetical protein | Cp-1, p13 | 12 | 194/194 [100%] | 194 | 5E-140 | NP_044825.1 |
| 14 | 8804 | 9118 | + | 104 | 5.8 | 11.9 | AAAGAGGACATGAAAACCTATG | Hypothetical protein | Cp-1, p14 | 13 | 104/104 [100%] | 104 | 7E-67 | NP_044826.1 |
| 15 | 9102 | 9989 | + | 295 | 4.9 | 32.9 | AAAAAGAGGTAGAAACAAATG | Hypothetical protein | Cp-1, p15 | 14 | 295/295 [100%] | 295 | 0 | NP_044827.1 |
| 16 | 10011 | 10787 | + | 258 | 5.0 | 29.4 | AAAGGATTTTAAAACATG | Hypothetical protein | Cp-1, p16 | 15 | 192/207 [93%] | 288 | 4E-132 | NP_044828.1 |
| 17 | 10787 | 11548 | + | 253 | 6.0 | 28.4 | AGGAGGTATCTAATG | Hypothetical protein | Cp-1, p18 | 16 | 253/253 [100%] | 253 | 0 | NP_044830.1 |
| 18 | 11515 | 13077 | + | 520 | 5.7 | 59.0 | AAAGTCGGGTCAATG | Tail protein | Cp-1, p19 / Cp-1, p20 | 17 / 18 | 210/224 [94%] / 236/237 [99%] | 230 / 237 | 6E-149 / 2E-164 | NP_044831.1 / NP_044832.1 |
| 19 | 13081 | 14919 | + | 612 | 4.8 | 67.5 | AAAGGGTAAACAATG | Tail protein | Cp-1, p21 | 19 | 582/583 [99%] | 586 | 0 | NP_044833.1 |
| 20 | 14993 | 16039 | + | 348 | 7.6 | 40.8 | AAATGGTACAATCCGCAGAAAATG | Encapsidation protein | Cp-1, p23 | 20 | 348/348 [100%] | 360 | 0 | NP_044835.1 |
| 21 | 16029 | 16433 | + | 134 | 7.9 | 15.5 | AGGTTATCAATCATG | Holin protein | Cp-1, p24 | 21 | 134/134 [100%] | 134 | 1E-89 | NP_044836.1 |
| 22 | 16433 | 17452 | + | 339 | 4.6 | 39.2 | AAAGGAGAAAAGAAATAATG | Lysozyme | Cp-1, p25 | 22 | 339/339 [100%] | 339 | 0 | NP_044837.1 |
| 23 | 17480 | 17896 | - | 139 | 5.62 | 15.8 | AAAACGTAGGGGGTTAATACTATG | Hypothetical protein | SP058_00395 | | 24/65 [37%] | 234 | 6E+00 | YP_008239483.1 |
| 24 | 17901 | 18143 | - | 80 | 9.9 | 9.4 | AAATTGAGGTATTAAGAAAATG | Hypothetical protein | Cp-1, p26 [orfc] | c | 80/80 [100%] | 80 | 5E-50 | NP_044838.1 |
| 25 | 18148 | 18423 | - | 91 | 5.1 | 10.9 | AAGGGACGGTTACTAGATG | Hypothetical protein | AGR66263 | | 14/44 [32%] | 410 | 8E-01 | AGR66263.1 |
| 26 | 18424 | 18693 | - | 89 | 7.7 | 10.8 | ACAAATAGGAGGGTAAACATG | Hypothetical protein | Cp-1, p27 [orfb] | b | 89/89 [100%] | 89 | 5E-58 | NP_044839.1 |
| 27 | 18704 | 18973 | - | 89 | 5.0 | 10.2 | AAAGAGGTATAACAAAATG | Hypothetical protein | Cp-1, p28 [orfa] | a | 60/61 [98%] | 62 | 7E-35 | NP_044840.1 |

a Length, number of amino acids (aa).

b IP, isoelectric point.

c Mw, molecular mass.

d RBS, ribosomal binding site. Bases in bold correspond to nucleotides identical to the RBS consensus; lowercase indicates.
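
The Start, End and Length [aa] columns of Table 1 are linked by simple arithmetic, as are the two numbers in the identity column and the percentage beside them. The following is a minimal sketch of both calculations. It assumes that Start and End are 1-based inclusive nucleotide coordinates and that the interval includes the stop codon; the table does not state either point. The function names `orf_length_aa` and `identity_percent` are hypothetical helpers introduced here for illustration.

```python
def orf_length_aa(start: int, end: int) -> int:
    """Amino-acid length of an ORF spanning start..end.

    Assumptions (not stated in the table): coordinates are 1-based and
    inclusive, and the span includes the stop codon, so one codon is
    subtracted from the codon count.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("ORF span is not a multiple of three nucleotides")
    return nucleotides // 3 - 1


def identity_percent(identical: int, aligned: int) -> float:
    """Percent identity as identical residues over alignment size."""
    return 100.0 * identical / aligned


# (ORF, Start, End, listed Length [aa]) copied from Table 1
rows = [
    (1, 376, 642, 88),
    (3, 1074, 1346, 90),
    (16, 10011, 10787, 258),
]

for orf, start, end, listed in rows:
    print(orf, orf_length_aa(start, end), listed)

# ORF 3 aligns 63 identical residues over 64 positions
print(round(identity_percent(63, 64), 1))
```

For the three rows used here, the computed lengths match the listed Length [aa] values, and 63/64 works out to about 98.4%, in line with the 98% shown for ORF 3.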