TABLE 1.
Spot no.a | N-terminal sequenceb | Identityc |
---|---|---|
1 | AXMTG AQYIE SLRKL | 4-Hydroxyphenylacetate-3-hydroxylase |
2 | GFKSD IEIAQ EAKPQ | Formate tetrahydrofolate dehydrogenase |
3 | MKQLM TGNEA IARGA | Indolepyruvate ferredoxin oxidoreductase |
4 | MKLAV IGSGT MGSGI VQTFA | 3-Hydroxybutyryl-CoA dehydrogenase |
Numbers refer to circled spots in Fig. 4.
X denotes unidentified amino acid.
Genes were identified by searching for a high-score match of the N-terminal sequence in the C. difficile strain 630 sequence database (http://www.sanger.ac.uk/Projects/C_difficile/) using the BLAST algorithm. These sequence data were produced by the Clostridium difficile Sequencing Group at the Sanger Center and can be obtained from ftp://ftp.sanger.ac.uk/pub/pathogens/cd/CD.dbs. The found genome segments were further analyzed using the ORF finder at http://www.ncbi.nlm.nih.gov/, and the ORFs in the contigs were identified using the BLAST algorithm.