TABLE 5.
Predicted intrinsic transcription terminators
Familya | Positionb | Sequencec | Tetraloopd | Locuse | Gene |
---|---|---|---|---|---|
1 | 39856 | ATTTATTTGAATAAAAGGGTTGCTTCGGCAACCCTTTTTGCGTATTATAGC | UUCG | 066 | h |
1 | 54582 | AAATAATTAAAGAAAAGGGTTGCTTCGGCAGCCCTTTTCTCGTATTATAGC | UUCG | 087 | h |
1 | 68356 | TGGATATTAATCAGAAAGGTAGCTTCGGCTACCTTTTTTCGTTTTCAGAGC | UUCG | 116 | h |
1 | 109856 | AAATAACCAAAGAAAGGGGTTGCTTCGGCAGCCCTTTTTGCGTATTATAGC | UUCG | 199 | h |
1 | 139742 | AAAAATCGTAAATTTAGGGTTGCTTCGGCAGCCCTTTTTTCGTATTATAGC | UUCG | 264 | nadV |
1 | 226081# | GAAACAGACAACAAAGGGGACTCTTCGGAGTCCCTTTTTGCGTTTATAGAC | UUCG | 363 | 23 |
1 | 233033(−) | TTAACCAACTGTTAAAGGGTGACTTCGGTCGCCCTTTTTTCGTTTACGTCT | UUCG | 373 | h |
2 | 9426 | TTAAACACTGTTAAGAAAGCGCCTTCGGGCGCTTTTTTTGTTTTTGTCGGA | UUCG | 012 | nrdD |
2 | 72687 | TTTCAAAGAACAAGAATAGCGCCTTCGGGGCGCTCTTTATGAGGTAAACATG | UUCG | 122 | h |
2 | 90272 | TAACACAACGAACACAAAGCGCCTACGGGCGCTTTTTTTATGCAAGGAAGG | UACG | 154 | h |
2 | 158804 | CGAAACTCAATTAACAAAGCGCCTCCGGGCGCTTTTTTTGTTTAGAACTCC | UCCG | 293 | ch |
3 | 21144 | TATTCTTGAAGAAAAGGGTTGACTGCGGTCAGCCCTTTCGCGTATTATAGC | (UGCG) | 037 | h |
3 | 59696 | CGAATAATGAAAAAAGGGTTGACAGATGTCAGCCCTTTCGTGTATTCTACG | (AGAU) | 097 | h |
3 | 97797 | AAAAGTAATAAAAAAGGGTTGACAGATGTCAGCCCTTTCGCGTATTATACA | (AGAU) | 169 | h |
3 | 100191 | CGTAATTGAAGAAAAGGGTTGACGCTTGTCAGCCCTTTCTTGTATTATCTC | (GCUU) | 174 | h |
3 | 121374 | AAAACTTTTCGAAAAGGGTTGACTACGGTCAACCCTTTCTTGTATTATAGC | UACG | 231 | h |
3 | 148817 | TTTCAATAAAAATAAGGTTGACCATCTTGGTCAACCTCTTTTCTTGGGCGA | 277 | nrdA | |
3 | 150224 | ATAATTAAAGAAAAGGGTTGAACAGATGTCAGCCCTTTTTTGTATGCTTCG | (AGAU) | 279 | nrdC |
4 | 31635 | GTTCAAGGGTTAATAAAAGGGGCGAAAGCCCCTTTTTTCGTATAAATACTT | GAAA | 057 | 30 |
4 | 48888 | ACTGGTTTTAATGAATAAGGGGCATTCGCCCCTTTTATTTGAGGAATACAC | (AUUC) | 080 | regA |
4 | 62011(−)# | CTACAGACCATAAAAAAAGGGGCGAATGCCCCTTTTTTTATTCTTTTTCGC | (GAAU) | 103 | h |
4 | 154605 | ATTTGGGGTAACGCGGTGGGCACGTAAGTGCCCTTTTATTCCCTTTGAAGT | GUAA | 284 | h |
5 | 35862 | CCTGTAATCTCTCATTTAGCCCCGAAAGGGGCTTTTTTAGGTTTGAGCCGG | GAAA | 061 | ch |
5 | 184096(−) | GTTAGACACAAGCAATAAGCCCCTATCGGGGCTTTTTTTGTATCTGAATCT | (UAUC) | 325 | 3 |
5 | 238328(−) | TACGCAATGCAATAAGAAAGCCCTTCGGGGCTTTTTTTATACGCGAAGCAA | UUCG | 383 | 35 |
6 | 32575# | TCGATTGAATAAGAAAAGGGAGCAAATTGCTCCCTTTTTTGATCATGGTGT | 058 | h | |
6 | 76557# | AGTATAGTTACACAAAAGGGAGCTAATTGCTCCCTTTTTGCTATTCATCAT | 130 | h | |
6 | 87419# | ACGCAGAAAATAAAAAAGGGAGCATTCGCTCCCTTTTTGCTATTTGTTTTT | (AUUC) | 149 | h |
6 | 143423# | AATGGTGAAATGAGAAAGGGAGCTTAGTGCTCCCTTTTTTATAACCAATAA | 271 | h | |
6 | 236858# | ATAATAGATACGAAAAAGGGAGCAATTTGCTCCCTTTTTTATTAGTCAGAA | 378 | uvsW | |
7 | 65690 | ATCACAGACGCAATTTAAGCCGCATTTCGCGGCTTTTTGAGGTTCATATGA | 111 | h | |
7 | 229959# | AACGCCATAATAGAAAAGCCCGCATTTAGCGGGCTTTTTTGTGTCTACGAG | 368 | h | |
7 | 231296(−) | CCTGATTGATAATTAAAAAGCGCATTTTGCGCTTTTTTTGTATCTATTGGT | 371 | h | |
8 | 78629 | GAAGATGAAAACAACAAAGCCCCTTAATTGGGGCTTTTTTATGGGTGAAAG | 134 | h | |
8 | 107118 | ACTATTGACTTATCCAGAGCCCCTTAATTGGGGCTTTTTTAGGTCTGAAGA | 194 | h | |
8 | 126416 | AGACTGGCGTTAATAAAAGCCCCTTAATTGGGGCTTTTTTAGGTTTGATGA | 243 | h | |
8 | 143013 | GACTTGAATCACATAAAAGCCCCTTAATTGGGGCTTTTTTCGTTTTAGCGG | 270 | h | |
9 | 3179 | CGATTAATGAATGGAAAGGGGCTTAACAGCCCCTTTGTTCTCTCTATAGGA | UAAC | 005 | 32 |
9 | 46933 | CTAGATTTTGTAAGTTACGGGGCGTAACAGCCCCTTTTTTTGAATATTTTA | 077 | 45 | |
10 | 5657 | CTAATACCCACGAGACCCGTTGATTTGTCAGCGGGTCTTTTAGCCATTTGA | (UUUG) | 008 | uvsX |
10 | 209948 | TCAATGCTAATAAACGGGTGAGATTGATTTCTCACCCTTTTTTGTATGAGC | 348 | 12 | |
11 | 61307 | AATAGTTGAATAAAAAGGGTTGCACTGTGCAGCCCTTTTCTATATCATGAG | 101 | h | |
11 | 128903 | ATTAATTAAAGAAAAGTGGTTGCTCTATGCAGCCACTTTCTGTATTATAGC | 249 | h | |
Unrelated | 13732 | CTTCGACTAATTAAAATGCCTCGTTTTCGGGGCTTTTTTATGTTTATCGTA | UUUU | 020 | h |
85042(−) | GTCTTTCCAATGTAGAACCCGTCTTCTCGACGGGTTACCACATGAATCTCA | 146 | segD | ||
117954 | CCATCGTATGTTTTTTAAGACGCTTCGGCGTCTTTTTTTCGGTTTGAAGAA | UUCG | 221 | h | |
132112 | TCCGATGCAGACAACGTCCGTGCCATGTGTATGGACGTTGTTCGTTTATTG | 255 | h | ||
138122 | ACAGCTTAATTGAAATGTGCAGATTCGTCTGCACATTTTGTTTAATAACGA | UUCG | 263 | h | |
160380# | ACGAATAAAGAAAAGGGAGAGTACAACTGTTACTCTCCCTTTTTTTGGTCA | 296 | |||
167720(−) | ATAAGGCAAATCGTAAAGGGCGCAAATAGCGCCCTTTTTTACGGCATAATA | 299 | ch | ||
168272(−) | TTTAAATAAGTGATAAGGGCGGCATTGTGCTGCCCTTTTTTGTACAGAGGT | 300 | ch | ||
171605# | TAAGCTATAAAAGAAAAGCCCCGTTTTTGGGGCTTTTTTGTTTAATGTGAG | UUUU | 305 | h | |
179037(−)# | AAGAAAATCGCAGAAAGTTGAAGTTTTTTTCAATTTTCTGCGAAAATACCG | UUUU | 316 | h | |
194945(−) | TTAAATAATCAAAACGGGGCGGCTATGTCGCCCCTTTGTTTCACACGAAAT | 339 | h | ||
196440 | ATCGTCTTTAATGAACAGCCCGCCATTGTGCGGGCTTTTTTGTGCATTAAA | 340 | 25 | ||
203021 | AAGTAAACAACTCATTAGGCGCGTGTTCGCGCCTTTCTTGGAGATTAAGAT | (UGUU) | 343 | 8 | |
215638 | ATGCTTATTTTAACGCTGTGTGCAGCTCGTACGCAGCGTTTGTCTATTTTA | 353 | h |
Terminator family of related sequence (80% identity over 80% of the sequence).
Genome position of the first base of the hairpin. (−), present on the minus strand; #, likely to function on both strands, located between convergently transcribed genes.
Underlined bases are predicted to be in the helix. G:U pairs are allowed.
Terminator-stabilizing tetraloop sequences, when present (those in parentheses are not commonly observed).
Locus refers to the preceding 5′ KVP40 CDS; name refers to the orthologous phage T4 gene aligning with the 5′ locus, except nadV. ch, conserved hypothetical; h, hypothetical.