2017 Dec 15;84(1):e01739-17. doi: 10.1128/AEM.01739-17

TABLE 1.

Distribution of cah and cah homologs in Shiga toxin-producing Escherichia coli^a

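Columns: Strain; Serotype; GenBank accession no.; cah gene (positions) or locus tag (corresponding protein length in aa); cah % identity; cah homolog gene (locus tag or gene positions) (corresponding protein length in aa); cah homolog % identity
Rows that begin with a gene name or locus tag instead of a strain name list an additional cah copy or cah homolog for the strain in the row above.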
EDL933 O157:H7 CP008957 Z1211 (949) 100 ND ND
Z1651 (949) 100
Sakai O157:H7 NC_002695 ECs1396 (949) 100 ND ND
TW14359 O157:H7 CP001368 ECSP_1319 (949) 100 ND ND
EC4115 O157:H7 CP001164 ECH74115_1395 (949) 100 ND ND
Xuzhou21 O157:H7 NC_017906 CDCO157_RS07230 (949) 100 ND ND
SS17 O157:H7 CP008805 SS17_4978 (778) 100 ND ND
SS52 O157:H7 NZ_CP010304 SS52_1399 (949) 100 ND ND
WS4202 O157:H7 NZ_CP012802 AO055_11995 (949) 100 ND ND
SRCC 1675 O157:H7 CP015023 AR439_25245 (949) 100 ND ND
JEONG-1266 O157:H7 CP014314 JEONG1266_13650 (949) 100 ND ND
FRIK944 O157:H7 CP016625 A9L45_07745 (949) 100 ND ND
FRIK2069 O157:H7 CP015846 A8V30_07330 (949) 100 ND ND
FRIK2455 O157:H7 CP015843 A8V32_07345 (949) 100 ND ND
FRIK2533 O157:H7 CP015842 A8V31_07325 (949) 100 ND ND
644-PT8 O157:H7 CP015831 cah (positions 3635954–3633105) (949) 98.5 caha (positions 141895–139049) (948) 89.3
cah (positions 3980603–3977754) (949) 95.2
180-PT54 O157:H7 CP015832 cah (positions 3382829–3379980) (949) 98.5 caha (positions 141895–139049) (948) 89.3
cah (positions 3724938–3722089) (949) 95.2
28RC1 O157:H7 CP015020 ARC77_26080 (949) 93.8 ND ND
1130 O157:H7 NZ_CP017434 A4C50_17255 (650) 100 ND ND
2149 O157:H7 NZ_CP017436 A4C44_17255 (650) 100 ND ND
2159 O157:H7 NZ_CP017438 A4C45_17280 (650) 100 ND ND
3384 O157:H7 NZ_CP017440 A4C38_17050 (949) 100 ND ND
4276 O157:H7 NZ_CP017442 A4C51_17225 (650) 100 ND ND
8368 O157:H7 NZ_CP017444 A4C39_17025 (949) 100 ND ND
9234 O157:H7 NZ_CP017446 A4C47_17250 (250) 100 ND ND
RM13514 O145:H28 CP006027 ECRM13514_1299 (128) 100 cahb (ECRM13514_5355) (1,039) 68.5
RM12581 O145:H28 CP007136 ECRM12581_6385 (128) 100 cahb (ECRM12581_26320) (1,039) 68.5
RM13516 O145:H28 CP006262 ECRM13516_RS06350 (430) 100 cahb (ECRM13516_5248) (1,039) 70.1
cahb (ECRM13516_5033) (1,026) 69.5
RM12761 O145:H28 CP007133 ECRM12761_6150 (430) 100 cahb (ECRM12761_25570) (1,039) 70.1
cahb (ECRM12761_24525) (1,026) 69.5
11368 O26:H11 NC_013361 ECO26_1353 (949) 100 caha (ECO26_RS17785) (948) 89.5
cahb (ECO26_RS28925) (1,039) 68.0
cahb (ECO26_RS15170) (1,039) 67.9
FORC_028 O26:H11 CP012693 FORC28_2696 (949) 100 caha (FORC28_1596) (948) 89.7
cahb (FORC28_5338) (1,039) 68.0
cahb (FORC28_2128) (1,039) 67.9
08-00022 O136:H16 CP013662 CP48_05340 (949) 100 caha (CP48_18180) (948) 88.0
09-00049 O168:H- CP015228 GJ12_07155 (949) 100 ND ND
12009 O103:H2 NC_013353 ND ND caha (ECO103_RS25970) (948) 88.4
caha (ECO103_RS18885) (606) 83.9
2009EL-2050 O104:H4 CP003297 ND ND caha (O3M_00325) (948) 89.3
cahb (O3M_04255) (1,039) 68.7
caha (positions 3144849–3146880) (192) 74.6
2009EL-2071 O104:H4 CP003301 ND ND caha (O3O_25300) (948) 89.3
cahb (O3O_21435) (1,039) 68.7
caha (positions 3201209–3203240) (192) 74.6
2011C-3493 O104:H4 CP003289 ND ND caha (O3K_00310) (948) 89.3
cahb (O3K_04220) (1,039) 68.7
caha (positions 3151328–3153359) (192) 74.6
C227-11 O104:H4 CP011331 ND ND caha (AAF13_23370) (948) 89.3
cahb (AAF13_00880) (1,039) 68.7
caha (positions 2459518–2461549) (192) 74.6
94-3024 O104:H21 CP009106 ND ND cahb (HW43_19650) (1,039) 68.9
2013C-4465 O55:H7 CP015241 ND ND caha (A5955_13015) (948) 90.0
06-00048 O36:H- CP015229 ND ND caha (GJ11_19470) (948) 88.8
CFSAN004176 O145:H- CP014583 ND ND ND ND
CFSAN004177 O145:H- CP014670 ND ND ND ND
11128 O111:H- NC_013364 ND ND ND ND
RM9387 O104:H7 NZ_CP009104 ND ND ND ND
2009C-3133 O119:H4 CP013025 ND ND ND ND
2012C-4227 O165:H25 CP013029 ND ND ND ND
GB089 O168:H- CP013663 ND ND ND ND
2011C-3911 O-:H- CP015240 ND ND ND ND
^a The complete genomes of STEC that were available in GenBank as of September 2016 were used to create a database to search for cah or cah homologs. BLAST was performed in Geneious 8.1.8 using EDL933 cah as the query. The retrieved cah and cah homolog sequences were translated using the bacterial translation codon table. The locus tag of each coding DNA sequence (CDS) is based on the original genome annotation. If a locus tag is not available for a cah or cah homolog, the CDS is indicated by the name designated in this study (cah, caha, or cahb), followed by the genomic position of the corresponding gene in parentheses. Items in bold represent genes carrying a mutation in their coding region. The overall mutation rate was 31.3% for cah, 29.4% for caha, and 13.3% for cahb. ND, not detected by BLAST search.
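The translation and percent-identity steps described in the footnote can be illustrated with a minimal Python sketch. This is not the Geneious/BLAST workflow used in the study; it is a simplified stand-in under stated assumptions. The function names `translate_cds` and `percent_identity` are hypothetical, the codon assignments are those of the NCBI bacterial table (translation table 11), alternative start codons are read as Met only at the first position, and identity is computed over two sequences that are assumed to be already aligned to equal length.

```python
from itertools import product

# NCBI translation table 11 (bacterial): amino acids listed in the order
# TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest, bases ordered TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; read as Met at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(cds):
    """Translate a CDS (hypothetical helper), stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def percent_identity(aligned_a, aligned_b):
    """Identical columns divided by alignment length, as a percentage.

    Assumes both inputs are already aligned ('-' for gaps) and equally long.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    print(translate_cds("ATGGCTAAATAA"))
    print(percent_identity("MAKT", "MAKS"))
```

As a consistency check on the table itself: the cah CDS of strain 644-PT8 at positions 3635954–3633105 spans 2,850 nucleotides, i.e., 950 codons including the stop codon, which matches the listed protein length of 949 aa.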