Skip to main content
PLOS One logoLink to PLOS One
. 2021 Oct 28;16(10):e0258753. doi: 10.1371/journal.pone.0258753

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

Ariel Amadio 1,#, James L Bono 2,#, Matías Irazoqui 1, Mariano Larzábal 3, Wanderson Marques da Silva 3, María Florencia Eberhardt 1, Nahuel A Riviere 3, David Gally 4, Shannon D Manning 5, Angel Cataldi 3,*
Editor: Mark Eppinger6
PMCID: PMC8553066  PMID: 34710106

Abstract

Cattle are the main reservoir of Enterohemorrhagic Escherichia coli (EHEC), with O157:H7 the distinctive serotype. EHEC is the main causative agent of a severe systemic disease, Hemolytic Uremic Syndrome (HUS). Argentina has the highest pediatric HUS incidence worldwide with 12–14 cases per 100,000 children. Herein, we assessed the genomes of EHEC O157:H7 isolates recovered from cattle in the humid Pampas of Argentina. According to phylogenetic studies, EHEC O157 can be divided into clades. Clade 8 strains that were classified as hypervirulent. Most of the strains of this clade have a Shiga toxin stx2a-stx2c genotype. To better understand the molecular bases related to virulence, pathogenicity and evolution of EHEC O157:H7, we performed a comparative genomic analysis of these isolates through whole genome sequencing. The isolates classified as clade 8 (four strains) and clade 6 (four strains) contained 13 to 16 lambdoid prophages per genome, and the observed variability of prophages was analysed. An inter strain comparison show that while some prophages are highly related and can be grouped into families, other are unique. Prophages encoding for stx2a were highly diverse, while those encoding for stx2c were conserved. A cluster of genes exclusively found in clade 8 contained 13 genes that mostly encoded for DNA binding proteins. In the studied strains, polymorphisms in Q antiterminator, the Q-stx2A intergenic region and the O and P γ alleles of prophage replication proteins are associated with different levels of Stx2a production. As expected, all strains had the pO157 plasmid that was highly conserved, although one strain displayed a transposon interruption in the protease EspP gene. This genomic analysis may contribute to the understanding of the genetic basis of the hypervirulence of EHEC O157:H7 strains circulating in Argentine cattle. This work aligns with other studies of O157 strain variation in other populations that shows key differences in Stx2a-encoding prophages.

Introduction

Enterohemorrhagic Escherichia coli (EHEC), with cattle as the primary animal reservoir, is a globally important foodborne pathogen capable of causing hemorrhagic colitis and hemolytic uremic syndrome (HUS) in humans. EHEC infection in humans can occur via ingestion of food products of bovine origin such as meat, dairy, or by contact with fecal contaminated fruits, vegetables and water sources [1]. Due to its low infectious dose (<100 CFU) and severe clinical outcomes, EHEC infections are considered a serious public health concern. While numerous EHEC serotypes are linked to human infections, strains of serotype O157:H7 cause more severe clinical symptoms, and even HUS, than other EHEC serotypes [2, 3]. HUS caused by EHEC O157:H7 has been reported worldwide [4], with the highest incidence in Argentina [5]. The assessment of molecular characteristics governing pathogenicity and virulence of Argentine EHEC isolates is therefore essential.

The key virulence determinant in EHEC O157:H7 is Shiga toxin (Stx) production, which directly contributes to HUS. Stx genes are encoded on lambdoid phages and EHEC strains can contain one or two Stx subtypes including type 1 (Stx1) or type 2 (Stx2) [6]. Multiple stx variants, stx1 (stx1a, stx1c, and stx1d) and stx2 (stx2a, stx2b, stx2c, stx2d, stx2e, stx2f, stx2g [7] stx2h [8] stx2i [9]) and stx2k [10]) have been reported. According to epidemiological studies, Stx2-producing strains and especially Stx2a, are associated with more severe cases of infection than Stx1-producing strains [1113].

Another central virulence attribute of EHEC O157:H7 is the pathogenicity island encoding the locus of enterocyte effacement (LEE) [14, 15], which contains genes for a type 3 secretion system (T3SS) and intimin (eae), a gene critical for the histological attaching and effacing (A/E) lesions characteristic of EHEC O157:H7. In addition, other important genes involved in EHEC O157 virulence are within the pO157 plasmid [16]. Other factors not yet characterized, however, may also be essential for the full virulence of EHEC O157:H7.

According to phylogenetic studies, EHEC O157 can be divided into three main lineages and nine clades [17, 18]. Various clinical cases on multiple continents and countries have been associated with the clade 8 EHEC strains (I/II lineage). For instance, in Argentina a high prevalence of clade 8 strains was evident in cattle and humans [19, 20]. In the USA, clade 8 strains have been associated with more severe human disease, which led researchers to define it as hypervirulent [18]. Although clade 8 strains had higher Stx2a expression levels relative to strains from other clades, along with unique genetic features [11, 21, 22], to date the factors associated with the increased virulence and Stx2a production are not completely understood. Interestingly, the clade 8 EHEC O157:H7 isolates obtained from bovines in Argentina had a certain degree of genetic variability and displayed variable virulence in in vitro and in vivo assays [20].

To better understand the molecular bases related to virulence, pathogenicity and evolution of Argentinian EHEC O157:H7, we performed a comparative genomic analysis of these isolates through whole genome sequencing. Notably, we identified conserved regions, insertions/deletions (in/dels) and inversions, as well as variations within core and accessory genes. We also report on the distribution of key virulence genes and the presence of lambdoid phages and plasmids, further highlighting the diversity of the analyzed EHEC O157:H7 isolates.

Material and methods

Strains

Eight EHEC O157:H7 isolates previously recovered from cattle of the central Humid Pampas of Argentina between 2002 to 2011, were examined in this study (Table 1). Isolates were previously classified into clades by evaluating a subset of 23 single nucleotide polymorphisms (SNPs) [20]from the original 96 SNPs shown to differentiate each clade [18] following annotation (Table 1). In addition, six reference EHEC O157:H7 genomes were included in the analysis: EDL933 accession number CP008957 [23], Sakai (NC_002695, [24]), TW14359 (NC_013008, [25]), TW14588 (NZ_CM000662, [26]), 644-PT8 (NZ_CP015831, [27]), SS52 (NZ_CP010304, [28]). The bacteria were grown at 37°C on Luria-Bertani (LB, Difco Laboratories, USA) agar plates or aerobically in LB broth.

Table 1. Strains with clade assigned and stx genotypes.

Strain Clade stx subtypes
RafaelaII 8 stx2a, stx2c
Vac07.1 8 stx2a, stx2c
Balcarce24.2  6 stx2a, stx2c
9.1Anguil 8 stx2c
7.1Anguil 6 stx2a, stx2c
Balcarce14.2 8 stx2a, stx2c
438–99 6 stx2c
146N4 6 stx1, stx2c

DNA preparation and PacBio whole-genome sequencing

An aliquot of 250 μl of an overnight culture of the different EHEC O157:H7 strains was added to 20 mL of LB and incubated at 37°C with shaking for 3 ½ h. The bacteria were then harvested for DNA extraction using Genomic-Tip 100/g (Qiagen Inc Valencia, CA. DNA (10 ug/ml) were sheared to a targeted size of 30 kb using a g-TUBE (Corvaris, Woburn, MA) and subsequently concentrated using 0.45X volume of AMPure PB magnetic beads (Pacific Biosciences, Menlo Park, CA) according to the manufacturer’s protocol.

Sequencing libraries were created using 5 μg of sheared, concentrated DNA and the PacBio SMRTbell Template Prep Kit 1.0, according to the manufacturer’s protocol. Each library was sequenced using the RS II sequencing platform (Pacific Biosciences, PacBio) with the P6/C4 sequencing chemistry and the 360 min data collection protocol.

PacBio sequence assembly into closed circularized genomes and annotation

PacBio reads were assembled using HGAP3 (SMRTanalysis Version 3.0, Pacific Biosciences) and the resulting contigs were imported into Geneious software (Biomatters, Ltd., Auckland, New Zealand). If present, overlapping sequences on the ends of the contigs were removed from the 5’ and 3’ ends to generate circularized chromosomes and plasmids using the software Geneious (Biomatters, Ltd.). The closed chromosomes were reoriented to start with the putative origin of replication using Ori-Finder 2 [29]. The closed chromosomes and plasmids were polished twice for accuracy using the RS_Resequencing 1.0 protocol in SMRTanalysis and by mapping error corrected PacBio reads to the chromosomes and plasmids using Geneious software (Biomatters, Ltd.). Genome sequences are deposited in GeneBank (BioProject PRJNA280853). Automatic annotations were performed using Prokka [30] against a reference set of six O157:H7 genomes as primary annotations described in (see the Strain section in M&M). Protein sequences were extracted from those genomes and clustered at 95% identity using cd-hit [31] to create a trusted source for annotation for Prokka.

Whole genome comparison

Genomes were compared using BLAST+ [32] and visualized in Artemis Comparison Tool software [33]. Comparison files were generated online at WebACT [34] or locally.

Prophage prediction and extraction of virulence genes

Phaster webserver was used to predict putative prophage sequences in the bacterial genomes [35]. Subsequently, prophage sequences, including Stx prophages, were extracted from the bacterial genome sequences using Artemis [36]. Only prophages identified as complete and questionable by Phaster software were considered for the analysis. Prophages were named R1 to Rn as they appear from 5´to 3´ in the linear representation of the chromosome (with origin of replication at the 5’ end). For example, RafaelaII-R2 means it is the second phage identified in strain RafaelaII.

Prophage comparisons were carried out using Blast+ [32] as follows: each prophage sequence was extracted and used as a query for comparison with all prophages from the same strain (Intra-strain) or from the other strains (Inter-strain). The output was processed with custom Perl scripts (https://github.com/arielamadio/Ecoli_parsing_data) to obtain an average of identity for the High Scoring Pairs (HSPs) and the coverage for each prophage as the ‘qcovus’ parameter (query coverage per unique subject). To have an overall estimation of phage similarity across strains, prophage comparisons were performed with settings Query Coverage Per Subject 80% and not restricting the other parameters (query Coverage Per Unique Subject, Average Identity and HSPs number). These criteria were applied in inter-strain prophage diversity section. To classify the prophages into families by similarity, comparisons were performed with hits Query Coverage Per Subject 80% and query Coverage Per Unique Subject 90% and not restricting the other parameters (Average Identity and HSPs number). Also, a maximum tolerance of 8kb between query and subject sizes was selected.

An even stricter relatedness criterion (query coverage 80% and Query coverage Per Unique Subject 90% and gaps and inversions < 4kb) was established to identify distributed identical prophages. For the nomenclature of these identical prophages, the name of the strains with the identical prophage is followed by the letter and number of the family at the end of the name. For example, TW14359-R3 denotes the third phage in TW14359 and belongs to family R and would be considered identical to Vac07.1-R3, RafaelaII-R3 and Balcarce14.2-R3 (S1 Table).

Virulence gene sequences were identified using Virulence Finder 2.0 [37], with a nucleotide identity threshold of 90% and a minimum length of 60%. The virulence genes considered in this analysis were those encoded on the LEE (tir, eae, espA, espB and espF) or those that coded for toxins (stx2 and stx1), effectors (tccP, espJ, nleA, nleB1, nleB2 and nleC), stress resistance proteins (gadA and gadB) and the adhesin iha [37].

Prediction of clusters of orthologous genes and pan-genomic analysis

Roary [38] was used to identify the pan-genome from the genomes of strains sequenced in this work and the set of references used for annotation. All genomes were re-annotated with Prokka (with a reference set of proteins) previous to running the pangenome pipeline to standardize the annotations.

Comparisons of gene content between clades were performed using “difference” method in Roary and by specifying the groups to compare. A tree was constructed with the alignment of the core genome using RAxML [39] with GTRGAMMA as model and 1000 bootstraps. Clade assignment by SNP analysis were determined with NucDiff [40] by comparing the complete genomes with the strain Sakai as a reference. Then, the 23 SNPs corresponding to the genotyping panel [18] were extracted from the comparison output to subsequently infer the strain genotypes and clades.

Shiga-toxin detection and cytotoxicity in Vero cells

Stx expression was determined for filtered culture supernatants with or without mitomycin C induction, by using RIDASCREEN Kit Verotoxin enzyme immunoassay (R-Biopharm Latin America) and then semi-quantified, as previously described [20]. To measure Vero cell cytotoxicity, the strains were cultured overnight at 37°C in 5 ml of LB broth and the culture was centrifuged at 200 rpm. The supernatant was filtered (0.22 μm filters) and subsequently assayed for cytotoxicity on Vero cells, as previously described [20].

Results and discussion

General genomic features of the EHEC O157:H7 isolates

Complete chromosomes and plasmids were obtained for all but one of the eight analysed strains (Table 2). The chromosome of Strain 438–99 assembled as two contigs that could not be resolved further. Chromosome sizes and gene content of the sequenced strains ranged from 5415530 bp to 5750490 bp and 5193 ORFs to 5599 ORFs, for strains 7.1Anguil and 9.1Anguil, respectively. All the strains characterized here and the reference strain TW14359 belong to lineage I/II, except for the reference strain EDL933, which is from lineage I [41, 42]. Whole genome multiple alignments showed a high level of conservation around the origin of replication and variation around the replication terminus. Most of the In/Dels and inversions were associated with the presence of lambdoid prophages (Fig 1), and located mostly around the terminus of replication of the genome. EHEC O157:H7 7.1Anguil had a large, inverted region comprising 5 Mb, except at the origin of genome replication. This inversion can also be interpreted as involving the origin of replication, but that is against the convention of linear representation of genomes using ori as the first nucleotide. Two approaches followed to assess and confirm the presence of this inversion. The first approach consisted of a manual inspection of long reads, whereas the second involved performing long PCR with two pairs of primers located outside of the inversion (and the repeated region) (S1 Fig) to amplify a 6kb region on both sides of the inversion point. Importantly, the use of PacBio long read sequencing technology allowed us to reconstruct complete finished circular chromosomes and plasmids for most of the analysed isolates, with the subsequent possibility of including genome features, such as phage content and position, in the analysis.

Table 2. Chromosome and plasmid characteristics from the sequenced strains and two previously published strains.

Tw14359 (cl8)a RafaelaII (cl8) Vac07.1 (cl8) Balcarce24.2 (cl8) 9.1Anguil (cl8) Balcarce14.2 (cl6) 7.1Anguil (cl6) 146N4 (cl6) 438–99 (cl6) EDL933 (cl3)
Chromosome size (bp) 5528136 5544602 5425421 5457079 5415130 5666506 5750490 5528657 5653150 5547323
Number of Genes 5253 5348 5219 5224 5193 5479 5599 5318 5465 5675
Coding bases 4728279 4857984 4752591 4781259 4745568 4952364 5011443 4844244 4946871 4819848
genes per kb 0.95 0.964 0.961 0.957 0.958 0.966 0.973 0.961 0.966 1.023
(bases per gene): (1052) (1036) (1039) (1044) (1042) (1034) (1027) (1039) (1034) (977)
Gene average length 900 908 910 915 913 903 895 910 905 849
coding percentage 85,5 87,6 87,5 87,6 87,6 87,3 87,1 87,6 87,5 86,8
GC percentage 51,63 51,57 51,56 51,67 51,6 51,51 51,53 51,64 51,48 51,56
rRNA bases (number) 32078 (22) 31966 (22) 31964 (22) 31964 (22) 31964 (22) 36515 (25) 31963 (22) 31962 (22) 31964 (22) 32223 (22)
tRNA bases (number) 8246 (106) 8272 (106) 8042 (103) 8042 (107) 8195 (105) 9055 (116) 8425 (108) 7965 (102) 8337 (107) 7579 (101)
lambdoid phages 15 14 13 14 13 15 15 16 16 14
Plasmids 94,601bp (pO157) 94,756bp (pO157) 94,989bp (pO157) 95,209bp (pO157) 3,306bp 94567bp (pO157) 94,595bp (pO157) 96510bp (pO157) 55,027bp 95343bp (pO157) 7,323bp 3,306bp 91,463bp (pO157) 56,932bp 92,076bp (pO157)

acl = clades

Fig 1. Schematic lineal representation of aligned EHEC O157 genomes.

Fig 1

The origin of replication is placed at the 5´and 3’end. Lambdoid prophages are represented as boxes of blue color, except for those carrying stx genes. In those cases, stx1, stx2a and stx2c prophages were indicated in yellow, red and light blue boxes, respectively. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear).

The strains 146N4 and 438–99 from clade 6 presented several reorganizations in the terminus region of the chromosome relative to other strains. The reorganized regions were contained between prophages R11 and R12 in 146N4 and R9 to R11 in 438–99. The chromosomal sequence is conserved in the inverted regions among all strains with the difference in the chromosomal architecture. On the other hand, 7.1 Anguil, has an inversion that isn’t not associated with a phage region but with a ribosomal RNA region. This large inversion the sub-region between R12-R13 from 438–99 is moved to the 5’ of the regions for all strains. This sub-region, however, is inverted between 146N4 and 438–99 and is different from that of other strains. In addition, 438–99, EDL933 and 7.1Anguil shared the orientation of this sub-region, whereas 146N4 shared the direction of the remaining strains (Fig 1).

The number of inserted prophages detected by Phaster, varied from 13 to 16 per genome. Lambdoid prophages were located between 1.2 Mb to 4.60 Mb of the genome coordinates. 149 prophages were identified in the strains, including the reference strains (S1 Table). The number and sequence variability of lambdoid prophages among our EHEC O157:H7 strains supports previous reports [11, 12, 4346]. The order of prophages in the genome has some conservation, as they occupy main preferential integration sites. Most of the sequenced strains had two Stx-converting lambdoid prophages, except EHEC O157:H7 438–99 that had one Stx2c-converting lambdoid prophage, and 9.1Anguil (clade 8) that had probably lost the Stx2a prophage. Stx2a encoding prophages were integrated at argW tRNA in clade 8 strains and 7.1Anguil, whereas it was integrated at yehV/mirA and wrbA in Balcarce24.2 and EDL933, respectively. The gene content of prophages encoding for stx2a are hypervariable. No stx2a containing prophage exhibited more than 70% conservation among the analysed genomes.

There was no genomic inversion larger than 50 kb among the clade 8 strains, in contrast to those of the strains of other clades. E. coli inversions are located around the terminus of replication, a region that seems prone to rearrangements [47] and Fitzgerald, personal communication. The 7.1Anguil strain showed the largest inversion in rRNA genes, which is a well described event in E. coli [48]. By contrast, the locations of the inversions for the remaining strains were restricted to lambdoid phages, which have an ad-hoc machinery of recombination that promotes inversions [49].

Prophage diversity

Inter-strain prophage diversity: some strains contained ubiquitous prophages with high level of similarity in relation to prophages from all strains. Balcarce24.2 and 7.1Anguil had five prophages with similarity hits in all strains whereas some clade 8 strains (Vac07.1, Tw14359, 9.1Anguil) had four conserved prophages (S2 Fig).

Intra-strain prophage diversity similarities only occurred with less than 85% coverage and 85% identity, as determined by blastn with all the prophages from a given strain. This is expected due to the immunity system of lambdoid phages [50]. However, a set of prophages had similarities covering between 70% to 75% of their sizes in other prophages in the same strain. For example, in the strains Vac07.1, Tw14359 and Balcarce14.2, prophages R13 and R6 were similar, whereas in 9.1Anguil, prophage R12 was similar to R5. Similarly, intra-strain prophage similarity was observed with R14 and R5 in 438–99; R13 and R4 in EDL933, R13 and R10 in RafaelaII and R12 and R5 in Balcarce24.2. Also, 438–99 showed two adjacent prophages (R6 and R7) with high level of similarity, which did not occur in any other strain. 438–99 was the strain with the highest level of intra strain phage similarity. Strains with less intra strain phage similarity were Balcarce14.2 and RafaelaII strains (S2 Table).

In order to identify highly related prophages across the strains, stricter relatedness parameters were used. Including visual inspection for large gaps and inversions in dot plots of paired genomes. The analysis identified 21 families with two to ten prophages in each family that accounted for 120 prophages (S1 Table). The remaining 29 prophages were unique to a particular strain. The families with 6 to 10 prophages are described in (S3 Table). Beside these families there are another 10 smaller families composed of 2 to 5 prophages. Interestingly, in some families, prophages have relevant metabolic or regulatory genes that are located 5’ or 3’ of prophage genes but are located between the predicted attL and attR integration sites (S3 Table).

Unique prophages

Some prophages are unique to a single strain (with no other hits above the established cut off) (S1 Table, S2 Fig). Strains with more unique prophages are 146N4 (six) and Balc24.2 (four). Clade 8 strains tend to have a lower number of unique prophages, but this may be due to the higher frequency of clade 8 strains in the study.

Stx2 encoding prophages

The diversity of Stx2a encoding prophages is reflected by the fact that prophages R2 from TW14359, Vac07.1 and Balcarce14.2 belong to family B, R3 from Balcarce24.2 belongs to family C and R2 from RafaelaII, R12 from EDL933 and R5 from 7.1Anguil are unique. This result strongly differed from that of prophages encoding stx2c that belongs to family E. In contrast to the stx2a containing prophage, the prophages containing the stx2c gene were highly conserved among the studied strains. All nine stx2c prophages showed an average similarity of 97.7% ±2.13%, whereas the seven stx2a prophages displayed an average similarity of 60.5% ± 6.3%.

Identical prophages

We determined if the same prophage is integrated in the genome of different strains using a stricter relatedness criterion than that for grouping the prophages in families (S1 Table), We identified 16 shared types of prophages (identical prophages in sequence and size present in more than one strain).

The most distributed shared types in this set of strains were TW14359F6 (in all ten strains), TW14359E5 (eight strains) as well as TW14359J10 TW14359K11 and TW14359M13 (all present in seven strains). With these prophage sequences, we performed megablast against nr NCBI database with highly restrictive search parameters coverage 98%, identity 99% (S4 Table). The shared types most frequently found were Tw14359N14, Tw14359J10; Tw14359A1; Tw14359F6 with 179, 175, 170 and 160 hits, respectively followed by other less represented shared type prophages in the nr database. This method does not allow to determining the size of the targeted prophages in the nr database but indicates that the sequence of the prophages studied here are found as complete prophages in other genomes.

As reported by other authors [11, 12, 4346], extreme phage diversity was observed probably because lambdoid phages are equipped with specialized recombination machinery as Exo, Bet, Gam and other proteins [50]. However, in spite of this general diversity they could be grouped by sequence alignments in families of high similarity and even in individual identical sequences shared by different genomes. A tendency was observed that shorter prophages are likely to be associated with higher clustering (S3 Fig), as previously described by Shaaban et al. [46]. Conversely, Larger prophages (>80 kb) were overrepresented among the unique prophages.

Pan genomic analysis reveals regions exclusive to clade 8 strains

The genomes of all sequenced strains and the six strains used as references had 4,415 core genes and a pangenome of 7,881 genes. Roary was used to identify a set of genes only present in the clade 8 genomes. Although no long contiguous exclusive fragments common to all clades 8 strains were detected in the analysis, a short contiguous gene cluster of ten genes was exclusive to this clade. In RafaelaII strain (used as a prototype clade 8), this cluster corresponds to genes from RafaelaII_01805 to 17 and showed 100% identity and conserved synteny in the other clade 8 strains studied here. This segment is located in R3 prophage (family C) of RafaelaII strain (Table 3).

Table 3. A ten gene region exclusive to clade 8 strains.

Genea Annotationb BlastP descriptions Location (start codon base)d Orthologue in EHEC O157 TW14359e
RafaelaII_01805 hypothetical protein ASCH domain containing protein 1834934 ECSP_2983
RafaelaII_01806 hypothetical protein N-acetyl transferase 1835289 ECSP_2982
RafaelaII_01807 helix-turn-helix transcriptional regulator helix-turn-helix transcriptional regulator 1836431 ECSP_2981
RafaelaII_01808 Antirepressor helix-turn-helix transcriptional regulator 1837225 ECSP_2980
RafaelaII_01812 ren protein ren protein 1837225 ECSP_2973
RafaelaII_01813 multidrug efflux protein SMR multidrug resistance protein 1840145 ECSP_2972
RafaelaII_01814 DLP12 prophage recombinase Recombinase family protein 1840734 ECSP_2971
RafaelaII_01815 kinase inhibitor Far kinase inhibitor ybcL 1842725 ECSP_2970
RafaelaII_01816 DNA-binding transcriptional regulator helix-turn-helix transcriptional regulator 1843286 ECSP_2969
RafaelaII_01817 hypothetical protein DNA base flipping protein. NinB superfamily 1844298 ECSP_2968

aGene name from RafaelaII.

bAnnotation as obtained by Prokka.

cBlastp top hit against Genbank nr database.

dLocation in RafaelaII chromosome.

eName of orthologue in E. coli O157:H7 strain TW14359.

The pangenome analysis, even using a small number of genomes, was consistent with previous studies. For instance, the 4415 core genes are very similar in number to the 4369 core-genome identified in 185 strains from the UK [51]. A thorough analysis of the hypervirulent clade 8 strains to search for exclusive regions revealed only one short region that encode for DNA binding proteins, putative RNA binding domains and a ren protein involved in phage exclusion and protection from RecAB recombinase [52]. The product of many of these genes may have a regulatory effect of prophage gene expression as three of the genes are annotated as transcriptional regulators.

SNPs/Phylogeny

A tree constructed using all genes corresponding to the core genome of the analysed and reference strains retrieved four major clusters (Fig 2). One clustered contained reference strains Sakai, EDL933 and TW14588, from clades 1, 3 and 2, respectively. The second cluster contained clade 6 strains 7.1Anguil, Balcarce24.2 and the non-stx2a 438–99 and 146N4. A third cluster consisted of clade 8 isolates Vac07.1 and RafaelaII. A fourth cluster comprised clade 8 strains 9.1Anguil, Balcarce14.2 and reference strains TW14359 and SS52.

Fig 2. Phylogenetic tree constructed with the core genome from the Argentinian strains and previously published strains using RAxML.

Fig 2

Bootstrap values are located at each branch point.

Plasmid pO157

The isolates contained the megaplasmid pO157 that showed a high degree of conservation (Fig 3). EDL933 pO157 plasmid was smaller (92 kb), whereas the pO157 from other strains ranged from 94.5 to 96.5 kb. This longer length compared to pO157-EDL933 is due to a transposon and IS elements insertions after espP and before the lipid A modification enzyme genes cluster (Fig 3). Other genetic differences also related to the insertion of transposons in pO157 include the pO157 from strain 146N4. It had two transposons inserted at position 80800 at the 5´of the espP gene that resulted in two EspP proteins: EspP1 comprising the first 287 N-terminal residues and EspP2 composed of the last 1080 C-terminal fragment. These two fragments are probably non-functional as the segmentation interrupted the Peptidase S6 superfamily domain. EspP has been implicated in inhibiting complement activation, which allows the progress of EHEC infection [53, 54]. In EPEC, EspC, the orthologous of EspP, is related to epithelial cell severe alterations [55]. The loss of a functional EspP protein suggests that this strain my not cause disease in humans or may cause a milder form.

Fig 3. Alignment of pO157 plasmids from EHEC O157 isolates.

Fig 3

Plasmid are represented in lineal forms with the ORF colored blue. The tagA gene was oriented at the 5´end to allow for comparison of plasmid architecture. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear).

Identification of additional plasmids

Two strains, 7.1Anguil and 438–99, also contained a 55 kb conjugative plasmid, p55 (Table 2). Both plasmids shared 99% similarity with 96% coverage according to blastn. These plasmids were similar to pMCR-1 from EHEC O157:H7 strain 2017C-4109 that carries the mrc-1 gene that mediates colistin resistance [56]. The mcr1 gene was absent from plasmid p55 from strains 7.1Anguil and 438–99 strains. However, the sequence 5’ and 3’ to the gene were conserved in p55 and pMCR.1 (Fig 4). The annotation of the ORFs of p55 was related mostly to hypothetical proteins and the conjugation machinery. Other plasmids highly similar to p55 from E. coli are TP114 from E. coli K12 (MF521836); pChi7122-3 from avian enteropathogenic E. coli (APEC) 7122 O78:K80:H9 (FR851304); AR_0011 plasmid tig00013784_pilon (CP024858; plasmid pF7386-2 from EHEC O157:H7 (CP038361); an unnamed plasmid from E. coli O177:H21 (CP016547) and plasmid pMRSN346355_65.5 (CP018124.).

Fig 4. Alignment of the p55 plasmid of strains 7.1Anguil and 146N4 with a related pMCR1plasmid.

Fig 4

Plasmid are represented in lineal forms with blue and red arrows indicating genes. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear).

In addition, strain 146N4 carried a 7 kb plasmid, p7, which contained genes for colicin D synthesis, lysis protein for colicin release and mobilization genes. This plasmid was highly similar to the plasmid pF1273-2 (accession number CP038377). A small 3.3 kb plasmid that was also detected in Balcarce24.2 and 146N4. This plasmid was identical to pOSAK [5759] and carried the toxin endoribonuclease lsoA, an antitoxin lsoB genes and the mobilisation gene mobA2.

Inference of distinctive virulence functions based on gene content and polymorphisms

In a previous study, we had described the behaviour of the same set of EHEC O157:H7 strains from cattle in virulence correlated assays, such as Vero cell cytotoxicity, Red Blood Cell (RBS) lysis produced by the T3SS, adherence to different cell lines and lethality, weight loss and tissue damage in inoculated mice [20]. The following section refers to an analysis of some of the genes involved in these pathogenic processes.

Shiga toxin encoding prophages and determinants of stx2 expression

We comprehensively evaluated the Stx prophages incorporated into each EHEC genome. The strains of clade 8 and 6 had prophages encoding stx2a and are frequently the second lambdoid prophage in the genome (R2, counting from the replication of origin) Fig 5). Stx toxin production was analyzed in strains carrying the stx2a operon (Fig 5) in order to associate gene content of the encoding prophages with Stx production.

Fig 5. Alignment of prophages carrying stx2a, the red arrows represent genes of the stx2a operon; the grey arrows indicate hypothetical proteins, and the blue arrows refer to genes encoding for protein with assigned function.

Fig 5

Some relevant genes of lambda phage are marked. Artemis comparison tool was used to indicate regions of similarity (red), inversions (blue) and In/dels (clear) or no homology (no lines). Columns at the right show cytotoxicity in Vero cells, by ELISA with or without mitomicin and O and P protein subtype.

Interestingly, the Stx2a prophage from RafaelaII (Fig 5), is a putative defective phage because it lacked the genes for endolysin, terminase, portal protein (a major capsid protein) and the tail fiber protein. All of these proteins are related to the lytic machinery and the bacteriophage body. While one of the isolates, 9.1Anguil (clade 8) had no Stx2a phage. The phage may have been lost before isolation, as suggested by its low Stx2 activity to Vero cells and by the fact that there was a complete argW tRNA, the integration site of Stx2a prophage.

Two strains, 9.1Anguil and RafaelaII, are of particular interest because they either lack the stx2a containing prophages or the phage is defective. Sequence analyses indicated that 9.1Anguil does not have a stx2a containing prophage. This is different from most the clade 8 strains as they carry both stx2a and stx2c containing prophages [18]. We presume that there was a precise excision of the stx2a containing prophage in 9.1Anguil that didn’t leave any phage sequence in the chromosome when it excised. However, we can’t rule out that 9.1Anguil had never had a stx2a containing prophage which would make it an interesting strain for further study. With only the stx2c gene, Stx2 production of this strain was extremely low. This finding suggests that almost all the production of Stx2 in stx2a-stx2c strains is due to the stx2a containing phage, as previously demonstrated [60], and that the low expression of stx2c gene is not due to a repression exerted by the stx2a prophage. In turn, RafaelaII (stx2a-stx2c) contains a defective stx2a containing prophage that lacks the genes encoding for essential components of the phage body and lytic machinery. However, this strain produces high levels of Stx in culture supernatants, as shown by ELISA and verocytotoxicity assay. This observation suggests a complete prophage is not needed for Stx2 production as long as the regulatory genes are present.

Previously published research demonstrated that different alleles of the Q antiterminator gene affect the expression level of the stx2a operon [61]. Stx2a prophages that carry the Q933 allele are related to high expression, while those with Q21 [62] and Q111H- [60]. are linked to low expression. All the strains studied here had the Q933 allele in the Stx2a prophage as determined by in silico PCR with primers described by Olavesen et al. [63]. However, the analysed strains presented an important variation in Stx2 production level. The analysis of the Q gene sequences showed previously undescribed polymorphisms in the Q933 allele of Q gene. These Q933 polymorphisms are evident in two strains, Balcarce24.2 and 7.1Anguil (Fig 6A). Interestingly, 7.1Anguil had the highest cytotoxicity to Vero cell line. Also, strains 7.1Anguil and Balcarce24.2 shared other mutations in the pR’-tRNA region Q-Stx2A intergenic region, in comparison to the other strains, including the reference strains (Fig 6B). The detected non-synonymous mutations in Q genes that may change the function of the protein as well as the SNPs in the intergenic region between the Q and stx2 genes deserves more research to assess the effect of these substitutions on stx expression.

Fig 6.

Fig 6

(A). amino acid substitutions (shaded in grey) in the Q antiterminator gene, (B) SNPs in Q-stx2A intergenic region. EDL933 sequence is used as reference. Dots represent nucleotide that are identical to the reference. SNPs are represented by the substituting nucleotide. A tRNA present in the intergenic region is shaded in grey.

As replication proteins O and P from lambdoid prophages encoding stx2 have been implicated in Shiga toxin expression [11], we subsequently assessed these genes, which are located upstream of stx2a. Most of the strains had the γ allele of O and P genes, except Balcarce24.2 that carry the δ allele and 7.1Anguil that possesses a novel δ allele variant that we called δ–like allele (Fig 5). This variant of O and P allele has not been described previously.

Interestingly, most of the strains of our study that have stx2a genes have the subtype γ, except strains Balcarce24.2 (clade 6) and 7.1Anguil (clade 6). The O-protein δ-like subtype from the highest Stx producer 7.1Anguil has seven amino acid substitutions in relation to the canonical δ subtype, whereas the P-protein from δ-like subtype has eight substitutions concentrated in the N-terminal end. The role of these substitutions in O and P function remain to be elucidated. The O- and P-proteins, which have been proposed to have primase and replicase activity, respectively, are the only proteins from a lambdoid phage implicated in replication as the DNA polymerase and other constituents of the replication machinery are provided by the host.

Identification and variation in virulence genes

The LEE genes eae, espA, espB, espF and iha adhesin were 100% conserved among the strains. Notably, iha was absent from 438–99 due to a deletion event in that strain.

Our analysis detected two tir alleles of a previously described T/A SNP at base 255 [64], seven of the Argentinian strains had the T allele while two strains had the A allele (Table 4). The effector tccp1 had three alleles with six strains forming one allele while the other alleles had two and one strain, respectively. The allele (AB253545) with two strains encoded for a protein that is 47-amino acid longer in relation to that encoded for the other two alleles which are synonymous (Table 4). Effector espJ had three alleles: The most common allele was found in eight of the ten strains and contained a non-sense mutation that created a protein 39 aa shorter at the N-terminal end. The other alleles contained a synonymous mutation (Table 4). The effector nleA had two synonymous mutations. The nleB1, nleB2, nleC1 gadA and gadB gene sequences were 100% conserved among strains (Table 4). In all strains, tccp2 was a pseudogene encoding for a non-functional protein.

Table 4. According to Virulence Finder alleles are named for its accession numbera.

aaccession number corresponds to those informed by virulence finder, rows in yellow mean that all the strains have 100% nucleotide identity, rows in green and red indicate two non- synonymous alleles, whereas rows in green and different grey tone refer to three alleles, being two of them (those in grey) synonymous.

LEE genes polymorphisms

LEE encodes for a T3SS, which is essential in EHEC virulence. The search for polymorphisms of LEE is relevant because we have previously observed important variations in T3SS activity in the same set of strains [20]. For this analysis, we aligned all nucleotide sequences of LEE from all the strains studied here in search of SNPs and In/Dels. Polymorphisms of LEE genes described in the Virulence Finder section will not be described to avoid redundancy.

In all strains, LEE was located in the same genomic position, between rorf1_2 and an IS66 family transposase gene located after the last gene of LEE, espF. The LEE sequence was highly conserved among all isolates. Five strains (RafaelaII, Vac07.1, Balcarce24.2, 7.1Anguil and 146N4) presented a substitution at nucleotide (nt) 8797 of LEE, which corresponded to the intergenic region comprised between genes etgA-grlR and where the etgA and grlR promoters should be located. The strain 7.1Anguil strain showed a unique polymorphism in espA: a non-synonymous mutation (AACxAAA) producing a NxK change at aa 111.

Among different EHEC O157 strains, LEE locus is much more conserved than prophages and even more than other genome regions [17]. The SNP analysis of the LEE region detected a substitution, between etgA-grlR genes, in five strains. The intergenic region is 195 bases in length and is the promoter region for both genes as the grlR gene is transcribed in the downstream direction while the etgA genes is transcribed in the upstream direction. GrlR is an important regulatory protein that forms a complex with GrlA regulatory protein [65]. This regulatory system forms a bicistronic operon that coordinates the LEE regulation, where GrlA is the positive regulator and GrlR acts negatively. In addition, other studies showed that the GrlR-GrlA system also regulates the expression of flagella and enterohemolysin in EHEC [6668]. The etgA gene encodes for a peptidoglycan lytic enzyme with muramidase activity, an enzyme that contributes with T3SS needle to go through the peptidoglycan layer of the bacteria so it can be ready to penetrate the host epithelial cell [69].

Conclusions

In conclusion, clade 8 strains showed conserved genomic structure to each other while other clade strains show inversions and in/dels (>50 kb) with respect to clade 8 strains and, in accordance with previous studies, lambdoid prophage varied among strains, i.e. those encoding for stx2a were extremely polymorphic, in opposition to those encoding for stx2c A cluster of genes presents exclusively in clade 8 contains genes encoding mostly for DNA binding proteins. Stx expression is generally increased in clade 8 strains. However, there are important variations of Stx expression among clade 8 strains that may be associated to the gene content of stx2a-prophages and nucleotide substitutions in the promoter region on of stx2 operon.

Supporting information

S1 Fig. Long PCR analysis of 7.1Anguil strain inversion.

A) Evidence and graphic representation primers for combinatorial PCR to evaluate inversion. B) In silico prediction of amplicons using assembled genomes. C) Agarose gel of amplified fragments. All primers pairs were tested on 2 isolates from 7.1Anguil and one from RafaelaIIA.

(PDF)

S2 Fig. Circular genome representation of prophage similarity.

Prophage similarity among genomes of the analyzed strains. Each prophage is represented by a different color on each chromosome. On each circle, genome used as query is showed larger than the subject genomes. Lines starts on query prophages and end on every subject prophage hit showing at least 80% coverage. Size of prophages are represented in scale. For this representation Circos was used.

(PDF)

S3 Fig. Prophage size vs frequency.

Sizes of prophage families are represented in a scatter graph against the frequency of finding a given prophage family in the 10 genome studied. R2 parameter and the tendency formula were calculated using excel. To determine the relationship between two variables Rho Spearman calculator was used (https://www.socscistatistics.com/tests/spearman/default2.aspx). rho was -0.4341. Shorter prophages were associated with higher clustering (p<0.01).

(JPG)

S1 Table. Identified prophages.

Excel sheets represent all identified prophages. Sheet All: all prophages identified, family or unique, position in the genome, and size data are presented. Sheet family: prophages belonging to families. Sheet of unique prophages: Shared sheet: unique shared types prophages data are presented.

(XLSX)

S2 Table. Intra strain prophage similarity.

Each sheet represents a different strain and query, subject, Query Coverage Per Subject, Query Coverage Per Unique Subject, Average Identity and HSPs Number BLAST+ parameters are presented.

(XLSX)

S3 Table. Main prophage families having > 6 prophages.

(DOCX)

S4 Table. Blastn of shared types and unique prophages.

Shared types and Unique prophages described here were searched against nr.

(XLSX)

Acknowledgments

The authors would like to thank Sandy Fryda-Bradley and the USMARC core sequencing facility for excellent technical assistance. The mention of a trade name, proprietary product, or specific equipment does not constitute a guarantee or warranty by the USDA and does not imply approval to the exclusion of other products that might be suitable. USDA is an equal opportunity provider and employer. AA, ML WMdS MFE and AC are CONICET fellows. MI and NAR holds a CONICET fellowship. We thank Marina Palermo for helping in Stx ELISA determination and Valeria Rocha for her valuable technical help.

Data Availability

All sequence data was deposited in GenBank. All raw sequencing experiments were deposited in SRA. Scripts used are available at https://github.com/arielamadio/Ecoli_parsing_data Strain BioSample Accession SRA Accession Balcarce_14.2 SAMN17295281 CP076243-CP076244 SRR14419381,SRR14419382 Vac07.1 SAMN17295280 CP076241-CP076242 SRR14419379,SRR14419380 146N4 SAMN17295279 CP076237-CP076240 SRR14419377,SRR14419378 9.1_Anguil SAMN17295278 CP076235-CP076236 SRR14419375,SRR14419376 Balcarce_24.2 SAMN17295277 CP076245-CP076247 SRR14419387,SRR14419388 7.1_Anguil SAMN03470769 CP076232-CP076234 SRR14419385,SRR14419386 Rafaela_II SAMN03470766 CP076230-CP076231 SRR14419383,SRR14419384 438/99 SAMN17295282 JAHCTZ000000000 SRR14419389,SRR14419390.

Funding Statement

Fondo para la Investigación Científica y Tecnológica Award Number: PICT 2016 0795 | Recipient: Angel Cataldi https://www.argentina.gob.ar/ciencia/agencia/fondo-para-la-investigacion-cientifica-y-tecnologica-foncyt Fondo para la Investigación Científica y Tecnológica Award Number: PICT-2015-1845 | Recipient: Ariel Amadio https://www.argentina.gob.ar/ciencia/agencia/fondo-para-la-investigacion-cientifica-y-tecnologica-foncyt Agricultural Research Service Award Number: Project Number 3040-42000-017-00-D | Recipient: James L Bono https://www.ars.usda.gov/ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Nguyen Y, Sperandio V. Enterohemorrhagic E. coli (EHEC) pathogenesis. Frontiers in cellular and infection microbiology. Frontiers Media SA; 2012. p. 90. doi: 10.3389/fcimb.2012.00090 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hedican EB, Medus C, Besser JM, Juni BA, Koziol B, Taylor C, et al. Characteristics of O157 versus Non-O157 shiga toxin-producing escherichia coli infections in minnesota, 2000–2006. Clin Infect Dis. 2009;49: 358–364. doi: 10.1086/600302 [DOI] [PubMed] [Google Scholar]
  • 3.Tseng M, Sha Q, Rudrik JT, Collins J, Henderson T, Funk JA, et al. Increasing incidence of non-O157 Shiga toxin-producing Escherichia coli (STEC) in Michigan and association with clinical illness. Epidemiol Infect. 2016;144: 1394–1405. doi: 10.1017/S0950268815002836 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Yang SC, Lin CH, Aljuffali IA, Fang JY. Current pathogenic Escherichia coli foodborne outbreak cases and therapy development. Archives of Microbiology. Springer Verlag; 2017. pp. 811–825. doi: 10.1007/s00203-017-1393-y [DOI] [PubMed] [Google Scholar]
  • 5.Boletín Integrado de Vigilancia N_ 344–SE. Available online: http://www.msal.gob.ar/images/stories/boletines/Boletin-Integrado-De-Vigilancia-N344-SE3.pdf (accessed on 28 March 2018).
  • 6.Strockbine NA, Marques LRM, Newland JW, Smith HW, Holmes RK, O’brien AD. Two toxin-converting phages from Escherichia coli O157:H7 strain 933 encode antigenically distinct toxins with similar biologic activities. Infect Immun. 1986;53: 135–140. doi: 10.1128/iai.53.1.135-140.1986 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Scheutz F, Teel LD, Beutin L, Piérard D, Buvens G, Karch H, et al. Multicenter evaluation of a sequence-based protocol for subtyping Shiga toxins and standardizing Stx nomenclature. J Clin Microbiol. 2012;50: 2951–2963. doi: 10.1128/JCM.00860-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Bai X, Fu S, Zhang J, Fan R, Xu Y, Sun H, et al. Identification and pathogenomic analysis of an Escherichia coli strain producing a novel Shiga toxin 2 subtype. Sci Rep. 2018;8. doi: 10.1038/s41598-018-25233-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Martin CC, Svanevik CS, Lunestad BT, Sekse C, Johannessen GS. Isolation and characterisation of Shiga toxin-producing Escherichia coli from Norwegian bivalves. Food Microbiol. 2019;84. doi: 10.1016/j.fm.2019.103268 [DOI] [PubMed] [Google Scholar]
  • 10.Yang X, Bai X, Zhang J, Sun H, Fu S, Fan R, et al. Escherichia coli strains producing a novel Shiga toxin 2 subtype circulate in China. Int J Med Microbiol. 2020;310: 151377. doi: 10.1016/j.ijmm.2019.151377 [DOI] [PubMed] [Google Scholar]
  • 11.Ogura Y, Mondal SI, Islam MR, Mako T, Arisawa K, Katsura K, et al. The Shiga toxin 2 production level in enterohemorrhagic Escherichia coli O157:H7 is correlated with the subtypes of toxin-encoding phage. Sci Rep. 2015;5: 16663. doi: 10.1038/srep16663 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dallman TJ, Ashton PM, Byrne L, Perry NT, Petrovska L, Ellis R, et al. Applying phylogenomics to understand the emergence of Shiga-toxin-producing Escherichia coli O157:H7 strains causing severe human disease in the UK. Microb Genomics. 2015;1. doi: 10.1099/mgen.0.000029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Boerlin P, McEwen SA, Boerlin-Petzold F, Wilson JB, Johnson RP, Gyles CL. Associations between virulence factors of Shiga toxin-producing Escherichia coli and disease in humans. J Clin Microbiol. 1999;37: 497–503. doi: 10.1128/JCM.37.3.497-503.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.McDaniel TK, Kaper JB. A cloned pathogenicity island from enteropathogenic Escherichia coli confers the attaching and effacing phenotype on E. coli K-12. Mol Microbiol. 1997;23: 399–407. doi: 10.1046/j.1365-2958.1997.2311591.x [DOI] [PubMed] [Google Scholar]
  • 15.Elliott SJ, Wainwright LA, McDaniel TK, Jarvis KG, Deng YK, Lai LC, et al. The complete sequence of the locus of enterocyte effacement (LEE) from enteropathogenic Escherichia coli E2348/69. Molecular Microbiology. Mol Microbiol; 1998. pp. 1–4. doi: 10.1046/j.1365-2958.1998.00783.x [DOI] [PubMed] [Google Scholar]
  • 16.Lim JY, Yoon JW, Hovde CJ. A brief overview of Escherichia coli O157:H7 and its plasmid O157. Journal of Microbiology and Biotechnology. Korean Society for Microbiolog and Biotechnology; 2010. pp. 1–10. doi: 10.4014/jmb.0908.08007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zhang Y, Laing C, Steele M, Ziebell K, Johnson R, Benson AK, et al. Genome evolution in major Escherichia coli O157:H7 lineages. BMC Genomics. 2007;8: 121. doi: 10.1186/1471-2164-8-121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Manning SD, Motiwala AS, Springman AC, Qi W, Lacher DW, Ouellette LM, et al. Variation in virulence among clades of Escherichia coli O157:H7 associated with disease outbreaks. Proc Natl Acad Sci U S A. 2008;105: 4868–4873. doi: 10.1073/pnas.0710834105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Pianciola L, D’Astek BA, Mazzeo M, Chinen I, Masana M, Rivas M. Genetic features of human and bovine Escherichia coli O157:H7 strains isolated in Argentina. Int J Med Microbiol. 2016;306: 123–130. doi: 10.1016/j.ijmm.2016.02.005 [DOI] [PubMed] [Google Scholar]
  • 20.Amigo N, Mercado E, Bentancor A, Singh P, Vilte D, Gerhardt E, et al. Clade 8 and clade 6 strains of Escherichia coli O157:H7 from cattle in Argentina have hypervirulent-like phenotypes. PLoS One. 2015;10. doi: 10.1371/journal.pone.0127710 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hirai S, Yokoyama E, Wakui T, Ishige T, Nakamura M. Enterohemorrhagic Escherichia coli O157 subclade 8b strains in Chiba Prefecture, Japan, produced larger amounts of Shiga toxin 2 than strains in subclade 8a and other clades. PLoS One. 2018;13. doi: 10.1371/journal.pone.0191834 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Neupane M, Abu-Ali GS, Mitra A, Lacher DW, Manning SD, Riordan JT. Shiga toxin 2 overexpression in Escherichia coli O157:H7 strains associated with severe human disease. Microb Pathog. 2011;51: 466–470. doi: 10.1016/j.micpath.2011.07.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Latif H, Li HJ, Charusanti P, Palsson B, Aziz RK. A gapless, unambiguous genome sequence of the enterohemorrhagic Escherichia coli O157:H7 strain EDL933. Genome Announc. 2014;2: 821–835. doi: 10.1128/genomeA.00821-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hayashi T, Makino K, Ohnishi M, Kurokawa K, Ishii K, Yokoyama K, et al. Complete genome sequence of enterohemorrhagic Eschelichia coli O157:H7 and genomic comparison with a laboratory strain K-12. DNA Res. 2001;8: 11–22. doi: 10.1093/dnares/8.1.11 [DOI] [PubMed] [Google Scholar]
  • 25.Kulasekara BR, Jacobs M, Zhou Y, Wu Z, Sims E, Saenphimmachak C, et al. Analysis of the genome of the Escherichia coli O157:h7 2006 spinach-associated outbreak isolate indicates candidate genes that may enhance virulence. Infect Immun. 2009;77: 3713–3721. doi: 10.1128/IAI.00198-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Escherichia coli O157:H7 str. TW14588 chromosome, whole genome shotgun—Nucleotide—NCBI. [cited 5 May 2021]. Available: https://www.ncbi.nlm.nih.gov/nuccore/CM000662
  • 27.Cowley LA, Dallman TJ, Fitzgerald S, Irvine N, Rooney PJ, McAteer SP, et al. Short-term evolution of Shiga toxin-producing Escherichia coli O157:H7 between two food-borne outbreaks. Microb genomics. 2016;2: e000084. doi: 10.1099/mgen.0.000084 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Katani R, Cote R, Garay JAR, Li L, Arthur TM, DebRoy C, et al. Complete genome sequence of SS52, a strain of Escherichia coli O157:H7 recovered from supershedder cattle. Genome Announc. 2016;3. doi: 10.1128/genomeA.01569-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Luo H, Zhang CT, Gao F. Ori-Finder 2, an integrated tool to predict replication origins in the archaeal genomes. Front Microbiol. 2014;5: 482. doi: 10.3389/fmicb.2014.00482 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Seemann T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics. 2014;30: 2068–2069. doi: 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
  • 31.Li W, Godzik A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22: 1658–1659. doi: 10.1093/bioinformatics/btl158 [DOI] [PubMed] [Google Scholar]
  • 32.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: Architecture and applications. BMC Bioinformatics. 2009;10. doi: 10.1186/1471-2105-10-421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J. ACT: The Artemis comparison tool. Bioinformatics. 2005;21: 3422–3423. doi: 10.1093/bioinformatics/bti553 [DOI] [PubMed] [Google Scholar]
  • 34.Abbott JC, Aanensen DM, Rutherford K, Butcher S, Spratt BG. WebACT—An online companion for the Artemis Comparison Tool. Bioinformatics. 2005;21: 3665–3666. doi: 10.1093/bioinformatics/bti601 [DOI] [PubMed] [Google Scholar]
  • 35.Arndt D, Marcu A, Liang Y, Wishart DS. PHAST, PHASTER and PHASTEST: Tools for finding prophage in bacterial genomes. Brief Bioinform. 2018;20: 1560–1567. doi: 10.1093/bib/bbx121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, et al. Artemis: Sequence visualization and annotation. Bioinformatics. 2000;16: 944–945. doi: 10.1093/bioinformatics/16.10.944 [DOI] [PubMed] [Google Scholar]
  • 37.Joensen KG, Scheutz F, Lund O, Hasman H, Kaas RS, Nielsen EM, et al. Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli. J Clin Microbiol. 2014;52: 1501–1510. doi: 10.1128/JCM.03617-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: Rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31: 3691–3693. doi: 10.1093/bioinformatics/btv421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30: 1312–1313. doi: 10.1093/bioinformatics/btu033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Khelik K, Lagesen K, Sandve GK, Rognes T, Nederbragt AJ. NucDiff: In-depth characterization and annotation of differences between two sets of DNA sequences. BMC Bioinformatics. 2017;18: 1–14. doi: 10.1186/s12859-016-1414-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Vidovic S, Korber DR. Prevalence of Escherichia coli O157 in Saskatchewan cattle: Characterization of isolates by using random amplified polymorphic DNA PCR, antibiotic resistance profiles, and pathogenicity determinants. Appl Environ Microbiol. 2006;72: 4347–4355. doi: 10.1128/AEM.02791-05 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Steele M, Ziebell K, Zhang Y, Benson A, Konczy P, Johnson R, et al. Identification of Escherichia coli O157:H7 genomic regions conserved in strains with a genotype associated with human infection. Appl Environ Microbiol. 2007;73: 22–31. doi: 10.1128/AEM.00982-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Eppinger M, Mammel MK, Leclerc JE, Ravel J, Cebula TA. Genomic anatomy of Escherichia coli O157:H7 outbreaks. Proc Natl Acad Sci U S A. 2011;108: 20142–20147. doi: 10.1073/pnas.1107176108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Kyle JL, Cummings CA, Parker CT, Quiñones B, Vatta P, Newton E, et al. Escherichia coli Serotype O55:H7 Diversity Supports Parallel Acquisition of Bacteriophage at Shiga Toxin Phage Insertion Sites during evolution of the O157:H7 lineage. J Bacteriol. 2012;194: 1885–1896. doi: 10.1128/JB.00120-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ohnishi M, Kurokawa K, Hayashi T. Diversification of Escherichia coli genomes: Are bacteriophages the major contributors? Trends in Microbiology. Elsevier Current Trends; 2001. pp. 481–485. doi: 10.1016/s0966-842x(01)02173-4 [DOI] [PubMed] [Google Scholar]
  • 46.Shaaban S, Cowley LA, McAteer SP, Jenkins C, Dallman TJ, Bono JL, et al. Evolution of a zoonotic pathogen: investigating prophage diversity in enterohaemorrhagic Escherichia coli O157 by long-read sequencing. Microb genomics. 2016;2: e000096. doi: 10.1099/mgen.0.000096 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.El Kafsi H, Loux V, Mariadassou M, Blin C, Chiapello H, Abraham AL, et al. Unprecedented large inverted repeats at the replication terminus of circular bacterial chromosomes suggest a novel mode of chromosome rescue. Sci Rep. 2017;7: 1–11. doi: 10.1038/s41598-016-0028-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Hill CW, Harnish BW. Inversions between ribosomal RNA genes of Escherichia coli. Proc Natl Acad Sci U S A. 1981;78: 7069–7072. doi: 10.1073/pnas.78.11.7069 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Iguchi A, Iyoda S, Terajima J, Watanabe H, Osawa R. Spontaneous recombination between homologous prophage regions causes large-scale inversions within the Escherichia coli O157:H7 chromosome. Gene. 2006;372: 199–207. doi: 10.1016/j.gene.2006.01.005 [DOI] [PubMed] [Google Scholar]
  • 50.Casjens SR, Hendrix RW. Bacteriophage lambda: Early pioneer and still relevant. Virology. Academic Press Inc.; 2015. pp. 310–330. doi: 10.1016/j.virol.2015.02.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Lupolova N, Dallman TJ, Matthews L, Bono JL, Gally DL. Support vector machine applied to predict the zoonotic potential of E. coli O157 cattle isolates. Proc Natl Acad Sci U S A. 2016;113: 11312–11317. doi: 10.1073/pnas.1606567113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Toothman P, Herskowitzt I. Rex-dependent exclusion of lambdoid phages I. Prophage requirements for exclusion. Virology. 1980;102: 133–146. doi: 10.1016/0042-6822(80)90076-8 [DOI] [PubMed] [Google Scholar]
  • 53.Yen YT, Tsang C, Cameron TA, Ankrah DO, Rodou A, Stathopoulos C. Importance of conserved residues of the serine protease autotransporter β-domain in passenger domain processing and β-barrel assembly. Infect Immun. 2010;78: 3516–3528. doi: 10.1128/IAI.00390-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Orth D, Ehrlenbach S, Brockmeyer J, Khan AB, Huber G, Karch H, et al. EspP, a serine protease of enterohemorrhagic Escherichia coli, impairs complement activation by cleaving complement factors C3/C3b and C5. Infect Immun. 2010;78: 4294–4301. doi: 10.1128/IAI.00488-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Navarro-Garcia F, Serapio-Palacios A, Vidal JE, Isabel Salazar M, Tapia-Pastrana G. EspC promotes epithelial cell detachment by enteropathogenic Escherichia coli via sequential cleavages of a cytoskeletal protein and then focal adhesion proteins. Infect Immun. 2014;82: 2255–2265. doi: 10.1128/IAI.01386-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Lindsey RL, Batra D, Smith P, Patel PN, Tagg KA, Garcia-Toledo L, et al. PacBio Genome Sequences of Escherichia coli Serotype O157:H7, Diffusely Adherent E. coli, and Salmonella enterica Strains, All Carrying Plasmids with an mcr-1 Resistance Gene. Microbiol Resour Announc. 2018;7. doi: 10.1128/mra.01025-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Sharma VK, Stanton TB. Characterization of a 3.3-kb plasmid of Escherichia coli O157:H7 and evaluation of stability of genetically engineered derivatives of this plasmid expressing green fluorescence. Vet Microbiol. 2008;132: 421–427. doi: 10.1016/j.vetmic.2008.05.016 [DOI] [PubMed] [Google Scholar]
  • 58.Makino K, Ishii K, Yasunaga T, Hattori M, Yokoyama K, Yutsudo CH, et al. Complete nucleotide sequences of 93-kb and 3.3-kb plasmids of an enterohemorrhagic Escherichia coli O157:H7 derived from Sakai outbreak. DNA Res. 1998;5: 1–9. doi: 10.1093/dnares/5.1.1 [DOI] [PubMed] [Google Scholar]
  • 59.Rusconi B, Sanjar F, Koenig SSK, Mammel MK, Tarr PI, Eppinger M. Whole genome sequencing for genomics-guided investigations of Escherichia coli O157:H7 outbreaks. Front Microbiol. 2016;7. doi: 10.3389/fmicb.2016.00985 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Fitzgerald SF, Beckett AE, Palarea-Albaladejo J, McAteer S, Shaaban S, Morgan J, et al. Shiga toxin sub-type 2a increases the efficiency of Escherichia coli O157 transmission between animals and restricts epithelial regeneration in bovine enteroids. PLoS Pathog. 2019;15: e1008003. doi: 10.1371/journal.ppat.1008003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.LeJeune JT, Abedon ST, Takemura K, Christie NP, Sreevatsan S. Human Escherichia coli O157:H7 genetic marker in isolates of bovine origin. Emerg Infect Dis. 2004;10: 1482–1485. doi: 10.3201/eid1008.030784 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Matsumoto M, Suzuki M, Takahashi M, Hirose K, Minagawa H, Ohta M. Identification and epidemiological description of enterohemorrhagic Escherichia coli O157 strains producing low amounts of Shiga toxin 2 in Aichi Prefecture, Japan. Jpn J Infect Dis. 2008;61: 442–445. Available: https://pubmed.ncbi.nlm.nih.gov/19050350/ [PubMed] [Google Scholar]
  • 63.Olavesen KK, Lindstedt BA, Løbersli I, Brandal LT. Expression of Shiga toxin 2 (Stx2) in highly virulent Stx-producing Escherichia coli (STEC) carrying different anti-terminator (q) genes. Microb Pathog. 2016;97: 1–8. doi: 10.1016/j.micpath.2016.05.010 [DOI] [PubMed] [Google Scholar]
  • 64.Bono JL, Keen JE, Clawson ML, Durso LM, Heaton MP, Laegreid WW. Association of Escherichia coli O157:H7 tir polymorphisms with human infection. BMC Infect Dis. 2007; 7: 98. doi: 10.1186/1471-2334-7-98 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Padavannil A, Jobichen C, Mills E, Velazquez-Campoy A, Li M, Leung KY, et al. Structure of GrlR-GrlA complex that prevents GrlA activation of virulence genes. Nat Commun. 2013;4: 1–10. doi: 10.1038/ncomms3546 [DOI] [PubMed] [Google Scholar]
  • 66.Jobichen C, Fernandis AZ, Velazquez-Campoy A, Leung KY, Mok YK, Wenk MR, et al. Identification and characterization of the lipid-binding property of GrlR, a locus of enterocyte effacement regulator. Biochem J. 2009;420: 191–199. doi: 10.1042/BJ20081588 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Iyoda S, Koizumi N, Satou H, Lu Y, Saitoh T, Ohnishi M, et al. The GrlR-GrlA regulatory system coordinately controls the expression of flagellar and LEE-encoded type III protein secretion systems in enterohemorrhagic Escherichia coli. J Bacteriol. 2006;188: 5682–5692. doi: 10.1128/JB.00352-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Iyoda S, Honda N, Saitoh T, Shimuta K, Terajima J, Watanabe H, et al. Coordinate control of the locus of enterocyte effacement and enterohemolysin genes by multiple common virulence regulators in enterohemorrhagic Escherichia coli. Infect Immun. 2011;79: 4628–4637. doi: 10.1128/IAI.05023-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.García-Gómez E, Espinosa N, de la Mora J, Dreyfus G, González-Pedrajo B. The muramidase EtgA from enteropathogenic Escherichia coli is required for efficient type III secretion. Microbiology. 2011;157: 1145–1160. doi: 10.1099/mic.0.045617-0 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Mark Eppinger

2 Jul 2021

PONE-D-21-18023

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

PLOS ONE

Dear Dr. Cataldi,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process by both reviewers.

Quality of the figures was borderline when sent out for initial review and I would appreciate if you could include higher quality figures and label them accordingly. Please also address the concerns raised by reviewer 2 in regards to data/software availability. Considering the limited number of isolates analyzed conclusions should not be overstated. As suggested by reviewer 2 "Results/Discussion" could be shortened to avoid redundancies without hurting the overall information content of this manuscript.

Please submit your revised manuscript by Aug 16 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Mark Eppinger

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. 

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

4. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

5. Please upload a new copy of Figures 1 to 6 as the detail is not clear. Please follow the link for more information: https://blogs.plos.org/plos/2019/06/looking-good-tips-for-creating-your-plos-figures-graphics/" https://blogs.plos.org/plos/2019/06/looking-good-tips-for-creating-your-plos-figures-graphics/

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This study reports a comparative genomic analysis of eight EHEC strains isolated from cattle in Argentina, that were previously classified into clades based on SNPs. These strains were sequenced using PacBio technology and used adequate bioinformatic software. They made comparative analysis founding virulence genes related to LEE genomic island, toxin effectors and stress resistance. Among novelty information, the authors found prophage distribution between strains which is relatively different to previous reports and were capable to group into families. Its findings are very interesting and give new insights into diversity of this important pathotype.

I only have a few observations and suggestions to make your manuscript better understood:

1. “Pathogenicity island” is a no longer used term, is preferred “genomic Island” even if it harbors virulence genes, my suggestion is to change the terms.

2. When you mentioned “geneious” you are referring to the software so, you should specify it in order to avoid confusion particularly to novel readers into bioinformatics

3. Shiga-toxin detection and cytotoxicity in Vero cells is a well performed assay and according to previous bibliography and provides important information but it seems to be not related with the title of the manuscript, later in text, the importance of this assay arises so I suggest to justify it when you introduce the results of the experiment

4. All the figures are blurry even if you download it as .tif format, I´m not sure if they are formatted in the correct specification. It makes difficult to follow the text referring to the figure.

5. Lines 229-230. You are talking about cytotoxicity referring to Figure 1, but it is an Artemis map not a toxicity graph

6. Lines 370-375. You are indicating panel (A) and (B) but they are not indicated in the figure, please correct it

Reviewer #2: This manuscript presents a genomic comparison of eight E. coli O157:H7 strains, isolated from the same region of Argentina. Comparative genomic tools were applied to screen for genomic features well-described by this field, including clade typing, prophage analysis, stx2 genotyping, and virulence gene identification. The authors identify characteristics reported before in similar studies, including diverse stx2a prophage but less diversity in stx2c prophage, polymorphisms in stx2-associated genes, and genomic rearrangements. While most of their findings were expected, the experiments were performed properly and for the most part (see below), their interpretations were supported by the data.

Major comments;

1. With hundreds of O157:H7 sequenced genomes available, this seems like a missed opportunity not to perform a larger comparison with other clade 8's. This would add much more value to the paper.

2. In the absence of a deeper genomic analysis, the phrase “sheds light about the genetic basis of hypervirulence” must be removed. This study does not report any experiments that provide new insights into in vivo virulence potential.

3. It appears a decent amount of prophage analysis was done with a “custom Perl script”. Without access to the script, the reader cannot repeat any this portion of the study. This should be made available on GitHub or similar site.

4. What statistical analysis was done to make the “larger prophages” conclusion in lines 261-262?

5. Line 363: Did you test whether this strain makes Shiga toxin using your ELISA? If not, explain in the paper why.

6. The Discussion is a reiteration of the Results. These two sections should be combined.

7. Many of the Figures, even when downloaded via the link provided, are really blurry.

Minor comments:

1. Lines 34-35: sentence fragment

2. Line 66: stx2h and 2i have been reported as well

3. Line 224: temper this language, as there may be additional prophage that weren’t identified by Phaster.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Edwin Barrios-Villa

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Oct 28;16(10):e0258753. doi: 10.1371/journal.pone.0258753.r002

Author response to Decision Letter 0


24 Aug 2021

Mark Eppinger

Academic Editor

PLOS ONE

21 August 2021

Ref: PONE-D-21-18023

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

Please find attached a revised version of our manuscript: “Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle”. We thank the reviewers for their comments and suggestions and have modified the manuscript accordingly. Please find below a point-by-point response to each of the concerns raised by the reviewers.

We feel that these changes have significantly enhanced our manuscript, and hope that you will now find it acceptable for publication in Plos One.

Yours sincerely

Dr. Angel Cataldi

Responses to the academic editor:

Comment/Question (C/Q): Quality of the figures was borderline when sent out for initial review and I would appreciate if you could include higher quality figures and label them accordingly.

Answer (A): We apologize for the low quality of the figures when the pdf to submit was ready. Now we have uploaded high definition tiff figures.

C/Q: Please also address the concerns raised by reviewer 2 in regards to data/software availability.

A: all genome data is now available.

C/Q Considering the limited number of isolates analyzed conclusions should not be overstated. As suggested by reviewer 2 "Results/Discussion" could be shortened to avoid redundancies without hurting the overall information content of this manuscript.

A: As suggested Results and Discussion were combined in a single section. This combination removed much redundancy. Part of the text from the discussion of the original manuscript submitted is distributed in lines 204-207, 233-236, 240-249, 312-317, 320-327, 356-360, 418-432, 444-447, 459-466, and 501-512 of the revised clean manuscript.

Journal Requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

A; formatted as requested

2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match.

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

A: done as suggested

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

A: The data is already deposited in GenBank under the BioProject that we’ve provided as private. We now added the accession number for each genome in the manuscript and they will be immediately released as a publication appears. Therefore, we fully comply with PLOS ONE and NCBI policy for Data Availability.

In the revised version accession number of the genomes (the table below) was placed below Acknowledgements section.

Strain BioProject BioSample Accession Study SRA Accession

Balcarce_14.2 PRJNA280853 SAMN17295281 CP076243-CP076244 SRP057321 SRR14419381,SRR14419382

Vac07.1 PRJNA280853 SAMN17295280 CP076241-CP076242 SRP057321 SRR14419379,SRR14419380

146N4 PRJNA280853 SAMN17295279 CP076237-CP076240 SRP057321 SRR14419377,SRR14419378

9.1_Anguil PRJNA280853 SAMN17295278 CP076235-CP076236 SRP057321 SRR14419375,SRR14419376

Balcarce_24.2 PRJNA280853 SAMN17295277 CP076245-CP076247 SRP057321 SRR14419387,SRR14419388

7.1_Anguil PRJNA280853 SAMN03470769 CP076232-CP076234 SRP057321 SRR14419385,SRR14419386

Rafaela_II PRJNA280853 SAMN03470766 CP076230-CP076231 SRP057321 SRR14419383,SRR14419384

438/99 PRJNA280853 SAMN17295282 JAHCTZ000000000 SRP057321 SRR14419389,SRR14419390

4. We note that you have included the phrase “data not shown” in your manuscript.

A: we replaced the first data no shown (line 274 of the revised clean version) by S Table 3, and the second (line 470) was deleted, as text is itself self-explanatory.

5. Please upload a new copy of Figures 1 to 6 as the detail is not clear. Please follow the link for more information: https://blogs.plos.org/plos/2019/06/looking-good-tips-for-creating-your-plos-figures-graphics/" https://blogs.plos.org/plos/2019/06/looking-good-tips-for-creating-your-plos-figures-graphics/

A; new more defined figures were uploaded

Reviewers' comments and Reviewer's Responses to Questions

Reviewer #1: This study reports a comparative genomic analysis of eight EHEC strains isolated from cattle in Argentina, that were previously classified into clades based on SNPs. These strains were sequenced using PacBio technology and used adequate bioinformatic software. They made comparative analysis founding virulence genes related to LEE genomic island, toxin effectors and stress resistance. Among novelty information, the authors found prophage distribution between strains which is relatively different to previous reports and were capable to group into families. Its findings are very interesting and give new insights into diversity of this important pathotype.

I only have a few observations and suggestions to make your manuscript better understood:

1. “Pathogenicity island” is a no longer used term, is preferred “genomic Island” even if it harbors virulence genes, my suggestion is to change the terms.

A: corrected as requested

2. When you mentioned “geneious” you are referring to the software so, you should specify it in order to avoid confusion particularly to novel readers into bioinformatics

A: Yes, we are referring to software Geneious. We believed it was clear that was a product because we added the company in brackets, but we added the word “software” to clarify.

3. Shiga-toxin detection and cytotoxicity in Vero cells is a well performed assay and according to previous bibliography and provides important information but it seems to be not related with the title of the manuscript, later in text, the importance of this assay arises so I suggest to justify it when you introduce the results of the experiment

A: We acknowledge the comment of the reviewer and we add this sentence in lines 401-403 (of the revised clean version) “Stx toxin production was analyzed in strains carrying stx2a operon (Fig 5) in order to associate gene content of the encoding prophages with Stx production”

4. All the figures are blurry even if you download it as .tiff format, I´m not sure if they are formatted in the correct specification. It makes difficult to follow the text referring to the figure.

A: The reviewer is right. More defined figures were uploaded for the revised version

5. Lines 229-230. You are talking about cytotoxicity referring to Figure 1, but it is an Artemis map not a toxicity graph

W: we apologize for the mistake. We deleted the sentence about Verocitotoxicity assays and as this information (line 229-230) is redundant with that in Lines 363-368. We prefer to leave it as it is in 363-368 (lines 405-410 in the revised version)

6. Lines 370-375. You are indicating panel (A) and (B) but they are not indicated in the figure, please correct it

A: corrected as indicated

Reviewer #2: This manuscript presents a genomic comparison of eight E. coli O157:H7 strains, isolated from the same region of Argentina. Comparative genomic tools were applied to screen for genomic features well-described by this field, including clade typing, prophage analysis, stx2 genotyping, and virulence gene identification. The authors identify characteristics reported before in similar studies, including diverse stx2a prophage but less diversity in stx2c prophage, polymorphisms in stx2-associated genes, and genomic rearrangements. While most of their findings were expected, the experiments were performed properly and for the most part (see below), their interpretations were supported by the data.

Major comments;

1. With hundreds of O157:H7 sequenced genomes available, this seems like a missed opportunity not to perform a larger comparison with other clade 8's. This would add much more value to the paper.

A: We understand that a larger dataset of E. coli genomes is available, however, most of the comparisons made in the present study included some phenotypic information of the analyzed strains or manual comparisons (like alignments) of the data that couldn’t be performed in a dataset of thousands of genomes. We chose to sequence the genomes to completion and use complete closed STEC O157:H7 genomes from NCBI in our analysis because we wanted to compare all aspects of genome diversity including chromosome architecture, complete phage content, plasmids and other mobile genome elements to identify any features that might associate with observed phenotype difference between the Argentinean strains and also those used in the comparison. Such detailed resolution is not available with draft genome sequences, which are the majority on public databases. However, we believe that the present study could be a start point to think how to upgrade comparisons made to include all deposited E. coli genomes and we intend to do this in future studies.

2. In the absence of a deeper genomic analysis, the phrase “sheds light about the genetic basis of hypervirulence” must be removed. This study does not report any experiments that provide new insights into in vivo virulence potential.

A: the sentence was changed to “This genomic analysis may contribute to the understanding of the genetic basis of the hypervirulence of EHEC O157:H7 strains circulating in Argentine cattle”.

3. It appears a decent amount of prophage analysis was done with a “custom Perl script”. Without access to the script, the reader cannot repeat any this portion of the study. This should be made available on GitHub or similar site.

A: The Perl script was only used to parse BLAST comparisons files. But as the reviewer suggested, was uploaded to GitHub repository to facilitate reproduction. It is now available at https://github.com/arielamadio/Ecoli_parsing_data and this was also added to the manuscript.

4. What statistical analysis was done to make the “larger prophages” conclusion in lines 261-262?

A: the text about larger prophages associated to unique prophages was moved the end of the Prophage diversity section in lines. We added a text: A tendency was observed that shorter prophages are likely to be associated with higher clustering (S Fig 3), as previously described by Shaaban et al [43]. Conversely, larger prophages (>80 kb) were overrepresented among the unique prophages (line 307-309 in the revised clean version).

The association of prophage size with clustering was determined by Spearman Rho test that the association was statistically significant.

5. Line 363: Did you test whether this strain makes Shiga toxin using your ELISA? If not, explain in the paper why.

A: Yes, in ELISA E. coli O157:H7 9.1 Anguil appears as a very low producer while E. coli O157:H7 RafII is a strong producer as shown in our previous paper (Amigo et al, PLoS One. 2015;10. doi:10.1371/journal.pone.0127710, Fig S1).

6. The Discussion is a reiteration of the Results. These two sections should be combined.

W: As indicated by thee reviewer the two sections were combined.

7. Many of the Figures, even when downloaded via the link provided, are really blurry.

W. new more defined figures were uploaded

Minor comments:

1. Lines 34-35: sentence fragment

A: corrected as suggested. Sentence was fragmented to: Clade 8 strains that were classified as hypervirulent. Most of the strains of this clade have a Shiga toxin stx2a-stx2c genotype

2. Line 66: stx2h and 2i have been reported as well

A: corrected as suggested to stx2 (stx2a, stx2b, stx2c, stx2d, stx2e, stx2f, stx2g, stx2h and stx2i)

3. Line 224: temper this language, as there may be additional prophage that weren’t identified by Phaster.

A: Corrected to The number of inserted prophages detected by Phaster, varied from 13 to 16 per genome

Attachment

Submitted filename: Rebuttal letter.docx

Decision Letter 1

Mark Eppinger

17 Sep 2021

PONE-D-21-18023R1

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

PLOS ONE

Dear Dr. Cataldi,

Thank you for submitting your manuscript to PLOS ONE. 

Please address the comment raised by one of the reviewers in regards to the discussion of stx2 suballeles and references.

"The authors added "stx2h" and "stx2i" as previously suggested, but use an incorrect 2012 reference (these variants were first reported around 2018-2019).  Please see doi:10.1038/s41598-018-25233-x and doi: https:doi.org/10.4315/ 0362-028X.JFP-18-291."

Also, a more recent article reports on stx2k (Yang, X. et (2020). Escherichia coli strains producing a novel Shiga toxin 2 subtype circulate in China. Int J Med Microbiol 310, 151377.).

Please submit your revised manuscript by Nov 01 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Mark Eppinger

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments (if provided):

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

********** 

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

********** 

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

********** 

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

********** 

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

********** 

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: As mentioned in past review, this study worth to be communicated. All recommendations have been addressed

Reviewer #2: The authors added "stx2h" and "stx2i" as previously suggested, but use an incorrect 2012 reference (these variants were first reported around 2018-2019). Please see doi:10.1038/s41598-018-25233-x and doi: https:doi.org/10.4315/ 0362-028X.JFP-18-291.

********** 

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Oct 28;16(10):e0258753. doi: 10.1371/journal.pone.0258753.r004

Author response to Decision Letter 1


30 Sep 2021

Mark Eppinger

Academic Editor

PLOS ONE

30 September 2021

Ref: PONE-D-21-18023 R1

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

Please find attached a revised (R2) version of our manuscript: “Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle”. We thank the reviewers for their comments and suggestions and have modified the manuscript accordingly. Please find below a point-by-point response to each of the concerns raised by the reviewers.

We feel that these changes have significantly enhanced our manuscript, and hope that you will now find it acceptable for publication in Plos One.

Yours sincerely

Dr. Angel Cataldi

Responses to the academic editor:

Comment/Question (C/Q):"The authors added "stx2h" and "stx2i" as previously suggested, but use an incorrect 2012 reference (these variants were first reported around 2018-2019). Please see doi:10.1038/s41598-018-25233-x and doi: https:doi.org/10.4315/ 0362-028X.JFP-18-291."

Answer (A): the two reference were added in line 67 as [8] and [9].

C7Q: Also, a more recent article reports on stx2k (Yang, X. et (2020). Escherichia coli strains producing a novel Shiga toxin 2 subtype circulate in China. Int J Med Microbiol 310, 151377.).

A: The reference was added as [10] in line 67

. Review Comments to the Author

C7Q: Reviewer #1: As mentioned in past review, this study worth to be communicated. All recommendations have been addressed

A: questions of Reviewer 2 have been addressed under Responses to the academic editor:

Attachment

Submitted filename: Rebuttal letter.docx

Decision Letter 2

Mark Eppinger

5 Oct 2021

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

PONE-D-21-18023R2

Dear Dr. Cataldi,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Mark Eppinger

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Mark Eppinger

19 Oct 2021

PONE-D-21-18023R2

Genomic analysis of shiga toxin-containing Escherichia coli O157:H7 isolated from Argentinean cattle

Dear Dr. Cataldi:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Mark Eppinger

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Long PCR analysis of 7.1Anguil strain inversion.

    A) Evidence and graphic representation primers for combinatorial PCR to evaluate inversion. B) In silico prediction of amplicons using assembled genomes. C) Agarose gel of amplified fragments. All primers pairs were tested on 2 isolates from 7.1Anguil and one from RafaelaIIA.

    (PDF)

    S2 Fig. Circular genome representation of prophage similarity.

    Prophage similarity among genomes of the analyzed strains. Each prophage is represented by a different color on each chromosome. On each circle, genome used as query is showed larger than the subject genomes. Lines starts on query prophages and end on every subject prophage hit showing at least 80% coverage. Size of prophages are represented in scale. For this representation Circos was used.

    (PDF)

    S3 Fig. Prophage size vs frequency.

    Sizes of prophage families are represented in a scatter graph against the frequency of finding a given prophage family in the 10 genome studied. R2 parameter and the tendency formula were calculated using excel. To determine the relationship between two variables Rho Spearman calculator was used (https://www.socscistatistics.com/tests/spearman/default2.aspx). rho was -0.4341. Shorter prophages were associated with higher clustering (p<0.01).

    (JPG)

    S1 Table. Identified prophages.

    Excel sheets represent all identified prophages. Sheet All: all prophages identified, family or unique, position in the genome, and size data are presented. Sheet family: prophages belonging to families. Sheet of unique prophages: Shared sheet: unique shared types prophages data are presented.

    (XLSX)

    S2 Table. Intra strain prophage similarity.

    Each sheet represents a different strain and query, subject, Query Coverage Per Subject, Query Coverage Per Unique Subject, Average Identity and HSPs Number BLAST+ parameters are presented.

    (XLSX)

    S3 Table. Main prophage families having > 6 prophages.

    (DOCX)

    S4 Table. Blastn of shared types and unique prophages.

    Shared types and Unique prophages described here were searched against nr.

    (XLSX)

    Attachment

    Submitted filename: Rebuttal letter.docx

    Attachment

    Submitted filename: Rebuttal letter.docx

    Data Availability Statement

    All sequence data was deposited in GenBank. All raw sequencing experiments were deposited in SRA. Scripts used are available at https://github.com/arielamadio/Ecoli_parsing_data Strain BioSample Accession SRA Accession Balcarce_14.2 SAMN17295281 CP076243-CP076244 SRR14419381,SRR14419382 Vac07.1 SAMN17295280 CP076241-CP076242 SRR14419379,SRR14419380 146N4 SAMN17295279 CP076237-CP076240 SRR14419377,SRR14419378 9.1_Anguil SAMN17295278 CP076235-CP076236 SRR14419375,SRR14419376 Balcarce_24.2 SAMN17295277 CP076245-CP076247 SRR14419387,SRR14419388 7.1_Anguil SAMN03470769 CP076232-CP076234 SRR14419385,SRR14419386 Rafaela_II SAMN03470766 CP076230-CP076231 SRR14419383,SRR14419384 438/99 SAMN17295282 JAHCTZ000000000 SRR14419389,SRR14419390.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES