TABLE 2.
ORFs encoded on B. burgdorferi 297 cp18-2
ORFa | Alternate name | Family | bp
|
Orientationb | Potential RBSc | No. of amino acids | Mol wt | pI | PSORT results
|
TMpred results
|
Borrelia paralogous family distribution
|
||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5′ | 3′ | Leader | Leader | Location | No. of transmembrane domains | Position | Score | DNA typed | Paralog locatione | ||||||||
1 | 154 | 161 | 415 | 255 | F | 5′ truncated | 85 | 9,880 | 3.98 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S | |
2 | 155 | 415 | 648 | 234 | F | AACGAA-11 | 78 | 9,343 | 6.59 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S | |
3 | 156 | 659 | 1945 | 1,287 | F | TGAGG-5 | 429 | 47,854 | 9.5 | N | Ce | 1 | 134–153 | 919f | C | L, M, N, O, P, Q, R, S | |
4 | 157 | 2188 | 2649 | 462 | F | None | 154 | 17,557 | 8.93 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S | |
5 | 159 | 2656 | 3603 | 948 | F | AAAGGA-6 | 316 | 36,059 | 7.96 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S | |
6 | 160 | 3624 | 4175 | 552 | F | AAAGGG-14 | 184 | 20,764 | 6.23 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S | |
7 | 2.9 orf5 | 139 | 4208 | 4537 | 330 | F | None | 110 | 12,848 | 5.68 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S |
8 | 2.9 orf6 | 140 | 4537 | 5412 | 876 | F | ATGGAGTT-6 | 292 | 33,272 | 5.02 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S |
9 | 2.9 orf7 | 141 | 5421 | 6023 | 602 | F | ATGGATAG-9 | 201 | 22,731 | 9.83 | N | Ce | 1 | 50–67 | 684f | C/L | A, L, M, N, O, P, Q, R, S |
10 | 2.9 orf8 | 142 | 6036 | 6845 | 809 | F | AGGGGGG-4 | 270 | 30614 | 5.64 | N | C | 0 | NA | NA | C/L | A, L, M, N, O, P, Q, R, S |
11 | blyA | 109 | 6925 | 7125 | 201 | F | AAAGGATT-5 | 67 | 7,477 | 9.71 | Y, uncleaved | IM | 1 | 18–36 | 2876 | C | L, M, N, O, P, Q, R, S |
12 | blyB | 111 | 7132 | 7476 | 344 | F | AAAGAGA-7 | 115 | 13,200 | 5.59 | N | C | 0 | NA | NA | C | L, M, N, O, P, Q, R, S |
13 | 2.9 orfC | 143 | 7469 | 7801 | 333 | F | AAGGAAAG-8 | 111 | 12,698 | 0.18 | Y, uncleaved | IM | 0 | NA | NA | C/L | A, G, L, M, N, O, P, Q, R, S |
14 | 2.9 orfD | 143 | 7797 | 8145 | 349 | F | AAAGAGG-5 | 117 | 13,616 | 9.22 | Y, cleavedf | P/OMf | 1 | 5–22 | 1702 | C/L | A, G, L, M, N, O, P, Q, R, S |
15 | bdrB | 80 | 8255 | 8719 | 465 | F | ATAAGGGAA-5 | 155 | 17,853 | 4.94 | N | IM | 1 | 132–154 | 2639 | C/L | F, G, H, L, M, N, O, P, Q, R, S |
16 | rep- | NA | 8712 | 8371 | 342 | R | AAGTGAGG-16 | 114 | 10.38 | Y, cleavedg | IM | 3 | 3–23 | 629 | NA | ||
32–50 | 2037 | ||||||||||||||||
72–90 | 1786 | ||||||||||||||||
17 | mlp9 | 113 | 8828 | 9460 | 633 | F | AAAGGAG-5 | 211 | 24,081 | 5.04 | Y, uncleavedh | IMh | 1 | 3–19 | 1336 | C | L, M, N, O, P, Q, R, S |
18 | 36kDa | 161 | 10563 | 9547 | 1,016 | R | AACGGAGG-6 | 339 | 39,970 | 7.91 | N | C | 0 | NA | NA | C | C, L, M, N, O, P, Q, R, S |
19 | orfA | 57 | 10943 | 12040 | 1,098 | F | AACGGAGA-6 | 366 | 44,309 | 9.65 | N | C | 0 | NA | NA | C/L/X | X, A, C, D, E, F, G, H, I, L, M, N, O, P, Q, R, S, T, U |
20 | orf2 | 50 | 12054 | 12617 | 563 | F | TATGAGG-8 | 188 | 22,580 | 9.66 | N | C | 0 | NA | NA | C/ | A, B, C, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S |
21 | orfC/parA | 49 | 12596 | 13348 | 753 | F | CAAGGA-11 | 251 | 29,091 | 8.8 | N | Ce | 1 | 139–154 | 711e | C/L | A, B,C, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S |
22 | orf3 | 32 | 13393 | 13950 | 558 | F | TAAGGAG-5 | 186 | 22,011 | 9.51 | N | C | 0 | NA | NA | C/L/X | X, A, B, C, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, U |
23 | bdrA | 80 | 13963 | 14484 | 522 | F | TAAGAGGA-4 | 174 | 20,268 | 5.58 | N | IM | 1 | 147–167 | 2324 | C/L | F, G, H, L, M, N, O, P, Q, R, S |
24 | orf8 | 165 | 15022 | 16050 | 1,029 | F | AACGGAGG-6 | 343 | 39,666 | 5.15 | N | C | 0 | NA | NA | C | C, L, M, N, O, P, Q, R, S |
25 | orf7 | 165 | 16072 | 16350 | 278 | F | AATGGAG-9 (L) | 93 | 10,848 | 9.51 | N | Ce | 1 | 69–89 | 833e | C | C, L, M, N, O, P, Q, R, S |
26 | orf10 | 144 | 16405 | 16827 | 423 | F | AGGAGAGG-6 | 141 | 16,745 | 9.61 | N | Ce | 1 | 52–75 | 999e | C/L | A, L, M, N, O, P, Q, R, S |
27 | orf6 | 96 | 17612 | 16839 | 774 | R | AAAGGAGAGG-5 | 258 | 30,270 | 9.9 | N | Ce | 1 | 74–90 | 550e | C/L | C, H, L, M, N, O, P, Q, R, S |
28 | p21 | 162 | 18043 | 18597 | 555 | F | TATGGAGA-6 | 185 | 20,747 | 7.19 | Y, cleaved | Lipoprotein | 1 | 9–26 | 1632 | C | L, N, P, R |
29 | elpB2 | 163 | 18628 | 19845 | 1,218 | F | AGGGGAGA-6 | 406 | 46,539 | 5.02 | Y, uncleaved | IMh | 1 | 6–21 | 1262 | C | F, L, N, O, P, Q |
30 | 114 | 19998 | 20387 | 390 | F | AAAAG-12 (L) | 130 | 15,661 | 9.64 | Y, uncleaved | IM | 1 | 1–20 | 803 | C | L, N, O, P, Q, R | |
31 | 115 | 20747 | 21169 | 423 | F | TTAGGAGA-6 | 141 | 16,214 | 9.22 | N | C | 0 | NA | NA | C | L, M, N, O, P, Q, R, S |
Potential pseudogenes are indicated in boldface.
F, forward; R, reverse.
Number following each potential ribosome binding site (RBS) indicates the distance to the AUG start codon. ORFs with putative leucine start codons are indicated by (L).
Refers to the type of genetic element on which ORFs belonging to the designated paralogous family have been described in B31: C, circular plasmid; L, linear plasmid; X, chromosome (BBGD).
Discrepancy between the presence of a predicted transmembrane domain (TMpred) and cytoplasm localization (PSORT) most likely due to relatively low (<1,000) TMpred score.
Experimental studies demonstrated that the 2.9 OrfD leader peptide is not cleaved (39).
Atypical leader peptide.
Mlp leader peptide has been shown to be lipid modified in vitro (39). ElpB2 contains appropriate lipid modification signal sequences.