Abstract
Homologous recombination is a major force mechanism driving bacterial evolution, host adaptability, and acquisition of novel virulence traits. Pectobacterium parmentieri is a plant bacterial pathogen distributed worldwide, primarily affecting potatoes, by causing soft rot and blackleg diseases. The goal of this investigation was to understand the impact of homologous recombination on the genomic evolution of P. parmentieri. Analysis of P. parmentieri genomes using Roary revealed a dynamic pan-genome with 3,742 core genes and over 55% accessory genome variability. Bayesian population structure analysis identified 7 lineages, indicating species heterogeneity. ClonalFrameML analysis displayed 5,125 recombination events, with the lineage 4 exhibiting the highest events. fastGEAR analysis identified 486 ancestral and 941 recent recombination events ranging from 43 bp to 119 kb and 36 bp to 13.96 kb, respectively, suggesting ongoing adaptation. Notably, 11% (412 genes) of the core genome underwent recent recombination, with lineage 1 as the main donor. The prevalence of recent recombination (double compared to ancient) events implies continuous adaptation, possibly driven by global potato trade. Recombination events were found in genes involved in vital cellular processes (DNA replication, DNA repair, RNA processing, homeostasis, and metabolism), pathogenicity determinants (type secretion systems, cell-wall degrading enzymes, iron scavengers, lipopolysaccharides (LPS), flagellum, etc.), antimicrobial compounds (phenazine and colicin) and even CRISPR-Cas genes. Overall, these results emphasize the potential role of homologous recombination in P. parmentieri's evolutionary dynamics, influencing host colonization, pathogenicity, adaptive immunity, and ecological fitness.
Keywords: genomic evolution, homologous recombination, Pectobacterium parmentieri, pathogenicity, CRISPR-Cas
Significance.
This study explores the influence of homologous recombination on the genomic evolution of the globally distributed plant pathogen Pectobacterium parmentieri, characterized by highly heterogenous strains from various global locations. Our findings reveal diverse recombinogenic patterns within the core genomes of P. parmentieri isolates, notably in genomic loci associated with important cell functions, pathogenicity determinants, and CRISPR-Cas genes. These findings highlight the role of homologous recombination in shaping the genomes of P. parmentieri and impacting its phytopathogenic lifestyle. Additionally, the data suggest a potential role of recombination in the ecological adaptation of this species across different climates, providing insights into the worldwide presence of P. parmentieri. This study represents a pioneering exploration of the impact of homologous recombination on the dynamic evolutionary genomics of the soft rot-causing bacterium P. parmentieri.
Introduction
In various organisms, including plant bacteria, homologous recombination constitutes an essential genetic mechanism (González-Torres et al. 2019). Plant bacteria engage in homologous recombination through a process known as transformation, enabling them to uptake foreign DNA from the surrounding environment. The process involves the exchange and recombination of genetic material between 2 similar DNA sequences, leading to the emergence of new gene combinations and potential changes in phenotype (Kuzminov 2011; González-Torres et al. 2019). For homologous recombination to occur, both ends of the recombined fragment must exhibit a high degree of DNA sequence similarity. As a result, this molecular mechanism primarily occurs between strains of the same species and taxonomically related bacteria (David et al. 2017; Jibrin et al. 2018; González-Torres et al. 2019). The need for a certain level of identity explains why the highest rates of homologous recombination are found in sections of the core genome where sequence similarities are preserved (Everitt et al. 2014; Dixit et al. 2015; Jibrin et al. 2018).
Homologous recombination in bacterial pathogens is of great importance for genetic diversity and adaptation, as it enables the acquisition of new traits and the evolution of these microorganisms. Through homologous recombination, bacteria can acquire new genes and alleles that confer advantages, such as the rapid spread of traits enabling pathogen survival, overcome host defense mechanism, and establishing successful infections (David et al. 2017; Jibrin et al. 2018; González-Torres et al. 2019). During the recombination process, bacteria can exchange genes involved in virulence, antibiotic resistance, and environmental adaptation, leading to the emergence of new pathogenic variants (Prokchorchik et al. 2020). Importantly, homologous recombination has been shown to accelerate evolution by up to a hundred times when different strains are present in the same host (Didelot et al. 2016). This genetic diversity helps plant bacterial pathogens in colonizing a range of hosts, evade plant defenses, and thrive in various environments (Vanhove et al. 2019).
In addition, homologous recombination functions as the main method for DNA repair in plant bacteria (David et al. 2017). It aids in repairing DNA damage caused by environmental factors, such as breaks or mutations (González-Torres et al. 2019). Plant bacteria may repair their genomes and maintain genomic integrity by exchanging genetic information with a homologous undamaged sequence (Potnis et al. 2019). This ensures their existence and aids in their ability to adapt to shifting environmental factors (Jibrin et al. 2018). Besides its role in DNA repair, recombination has also been observed to help in the deletion of deleterious mutations and the introduction of beneficial mutations during DNA synthesis, acting as a positive side effect leading to an energy source (David et al. 2017). Finally, some studies have found that bacteria involved engage in homologous recombination to delete mobile genetic elements from their genomes (David et al. 2017). In summary, homologous recombination in plant bacterial pathogens is a key process that drives genetic diversity, adaptation, DNA repair, and the acquisition of new traits. Therefore, understanding the mechanisms and dynamics of this process can have profound implications for developing effective strategies to manage and control bacterial pathogens, as well as for optimizing crop health and productivity in agricultural settings.
Pectobacterium parmentieri is classified as a pectinolytic rod-shaped, facultatively anaerobic bacterium belonging to the genus Pectobacterium within the Pectobacteriace family (Adeolu et al. 2016). This species was formerly known as P. wasabiae, the horse radish pathogen, and was later assigned as new species (Khayi et al. 2016). Like other members within the Pectobacterium genus, P. parmentieri is a plant-pathogenic bacterium causing soft rot and black leg diseases (Nykyri et al. 2012; Charkowski 2018; Zoledowska, et al. 2018a). The bacterium is widely distributed across the globe, having been reported in several countries, including Belgium, France, Finland, Germany, Norway, Switzerland, The Netherlands, Canada, South Africa, New Zealand, Malaysia, Poland, Spain, Turkey, China, and different states within the United States (Pitman et al. 2010; De Boer et al. 2012; Ngadze et al. 2012; Pasanen et al. 2013; Moleleki et al. 2017; Suárez et al. 2017; Zoledowska, et al. 2018b; Arizala et al. 2019; Cao et al. 2021; Ge et al. 2021). The strains of this species have been isolated from various environments and sources, including plants, soil, water, and plant debris. In a recent study, the metabolic modeling of the Finland strain SCC3193 revealed that P. parmentieri can adapt and survive in either soil or rhizosphere (Zoledowska et al. 2019). Although most P. parmentieri strains deposited in the NCBI GenBank database have been isolated mainly from potato (accessed on 2022 September 14), there are different surveys reporting the isolation of this bacterium from other hosts such as cabbage, tomato, eggplant, onions, carrot, maize, sugar beet, calla lily, sweet potato, and star of Bethlehem, impacting both yield and quality (Zoledowska et al. 2018b).
The ability of P. parmentieri to cause soft rot diseases and establish a successful infection depends on the production of plant cell wall-degrading enzymes, namely cellulases, pectinases, and proteases (Nykyri et al. 2012; Zoledowska et al. 2018a; Arizala and Arif 2019). These enzymes are delivered via the type I (TISS) or II (TIISS) secretion systems, allowing the pathogen to macerate plant tissues, and spread throughout the host tissues (Nykyri et al. 2012; Zoledowska et al. 2018a; Arizala and Arif 2019; Arif et al. 2022). Importantly, due to the absence of the type III (T3SS) secretion system, studies have reported that P. parmentieri might be less virulent compared to other Pectobacterium species (Kim et al. 2009; Nykyri et al. 2012; Arizala and Arif 2019). However, other research studies have found highly pathogenic P. parmentieri strains on potato plants and tubers (Ge et al. 2021), demonstrating that the lack of T3SS in this bacterium is not a detrimental pathogenicity determinant for disease development (Kim et al. 2009).
Genomic analysis has revealed that this bacterium has a relatively large genome size, with approximately 4.6 to 5.6 million base pairs (Mbp), containing approximately 4,400 to 5,400 predicted protein-coding genes (Nykyri et al. 2012; Khayi et al. 2015; Zoledowska et al. 2018a, 2018b; Arizala and Arif 2019). The P. parmentieri genome typically comprises a single circular chromosome, with few strains harboring a plasmid (Zoledowska et al. 2018a, 2018b). Numerous genes involved in important biological functions, including metabolism, replication, transcription, and translation have been found on the chromosome (Nykyri et al. 2012; Zoledowska et al. 2018a, 2018b). In addition, the genome contains many coding regions linked to pathogenicity, including those responsible for producing pectinolytic enzymes, extracellular enzymes, iron scavengers, toxins, protein effectors, and specialized secretion systems (Nykyri et al. 2012; Zoledowska et al. 2018a, 2018b; Arizala and Arif 2019). Together, these factors contribute to the infection and colonization of this pathogen inside its host plants.
Comparative genomic analyses have provided valuable insights into the genetic variations within the P. parmentieri species and their implications for host specificity and pathogenicity (Nykyri et al. 2012; Zoledowska et al. 2018a; Arizala and Arif 2019). Zoledowska et al. (2018a), reported several genes found to be part of the dispensable genome, indicating a high genomic plasticity across the P. parmentieri strains. These findings could be associated with the widespread distribution of this bacterium across the globe, its broad host range, and its rapid adaptation to thrive in different climatic zones exhibiting different temperatures and humidity ranges (Zoledowska et al. 2018b). On the other hand, widely associated genome studies have shown that P. parmetieri has horizontally acquired gene regions crucial for pathogenicity and environment adaptation (Nykyri et al. 2012; Zoledowska et al. 2018a; Arizala and Arif 2019). The French and type strain RNS 08.42.1A (CFBP 8475T), possesses several quorum sensing genes that seem to be acquired thorough horizontal gene transfer (HGT) (Khayi et al. 2015). Moreover, the genome of the Finland strain SCC3193, which has been used as a model for studies for decades, was observed to harbor different genome islands, some of which contain important pathogenicity determinants. These outcomes provide a glimpse of the impact of gene transfer in the evolution of this phytopathogen.
No study has been conducted regarding the role of homologous recombination and its implication for the evolution of P. parmentieri. Considering this background, the primary objective of the present research article was to examine the importance of homologous recombination and its potential contribution to the evolutionary relationships, wide-strain genomic diversity, pathogenicity, host adaptation, and immunity of P. parmentieri. To our knowledge, this is a pioneering research that delves deeply into the study of homologous recombination across the core genome of the soft rot bacterium P. parmentieri.
Results and Discussion
Pan Genome and Population Structural Analyses Reveal High Diversity Within Strains of P. parmentieri
In total 17 complete and 15 draft genomes belonging to different P. parmentieri strains, isolated from various time periods and locations, were retrieved from the NCBI GenBank database and used in the analyses (supplementary table S1, Supplementary Material online). The pan genome analysis conducted in Roary identified 3,742 and 71 genes in the core (genes present in 99% of strains) and soft core (genes present in 95% to 99% of strains) genomes, respectively (Fig. 1a). The accessory genome composed of the cloud and shell genomes, represented 55% of the pan-genome (8,433 genes). The shell genome (genes present in 15% to 94% of strains) and cloud genome (genes present in less than 15% of strains) accounted for 19% (1,605 genes) and 36% (3,015 genes) of the pan genome, respectively (Fig. 1a). The higher number of genes identified in the accessory genome compared to the core/soft core genomes highlights the relevance of cloud and shell genes in the genetic diversity within the P. parmentieri species. Comparable results were found in a previous pan-genome study performed with 15 P. parmentieri strains, where 1,468 genes were part of the accessory genome and 1,847 were unique genes while the core genome was scarcely bigger than 50% of the pan-genome. Overall, these outcomes reflect an open pan-genome within P. parmentieri, which, as previously pointed (Zoledowska et al. 2018a), might be linked to the high genome plasticity and adaptation of this species to diverse environmental regions. Indeed, the presence of a large number of genes in the accessory genome (found also in our analysis) has been highlighted as the potential basis for the widespread diffusion of P. parmentieri (Zoledowska et al. 2018b) as the pathogen has been reported in numerous geographic locations worldwide, including, Finland, New Zealand, Canada, China, Turkey, South Africa, and several states within the USA.
Fig. 1.
Pan-core genome and population structure analyses in P. parmentieri. a) Pie chart displays the proportion of core, soft-core, shell, and cloud genes in the 32 P. parmentieri genomes analyzed in the Roary pipeline. b) ML tree depicting the population structure analysis of 32 P. parmentieri strains. The phylogenetic tree was built using iTOL v6 (https://itol.embl.de/) and the tips of each branch were annotated and colored according to the level clusters identified using RhierBAPS and for matching the colors used by fastGEAR.
To examine the population structure within the 32 P. parmentieri strains, a clustering analysis was conducted using a hierarchical approach with Bayesian Analysis of Population Structure (BAPS, Cheng et al. 2013) implemented in R (RhierBAPS, Tonkin-Hill et al. 2018). RhierBAPS identified 7 lineages (clusters) among the P. parmentieri strains (Fig. 1b). The first lineage consisted of strains CFIA1002 and SS09 isolated in 2007 and 2017 in Canada: Alberta and Pakistan, Punjab, respectively. Lineage 2 encloses the New York (USA) strains NY1584A, NY1587A, and NY1532B isolated in 2016. Lineage 3 comprises one New York (USA) strain NY1712A isolated in 2017 and 3 Poland strains, 2 collected in 2013 (IFB5441, IFB5432) and IFB5623 isolated in 2014. Interestingly, lineage 4 was the most heterogenous, composed of 4 isolates which formed like-outgroups subclades within the lineages 3, 5, and 6. Specifically, strains WPP163 and WC19161 isolated in 2004 (USA, Wisconsin) and 2019 (China, Hohhot), respectively, grouped separately but closely to the isolates assigned to the lineage 3, whereas IFB5597 isolated in Poland in 2014 grouped closely to the strains of lineage 6. The South Korea strain HC isolated in 2016 was distinctively separated from all lineages and clustered close to strains of lineage 5. Lineage 5, on the other hand, was integrated by 6 isolates, including 2 Poland strains isolated in 2014 (IFB5604, IFB5619), a Belgium strain (IFB5485) isolated in 2012, QK-5 isolated in 2019 in China (QingKou), the Russian strain PB20 isolated in 2014 in the Moscow region, and strain IPO1955 isolated in 2002 (unknown location). Like lineage 5, lineage 6 is constituted by 6 isolates, including 5 New York strains (NY1588A, NY1548A, NY1585A, NY1540A, and NY1533B) isolated in 2016 and the Finland strain SCC3193 isolated in the early 1980s, used as a model for different molecular biology studies concerning Pectobacterium over many decades (Nykyri et al. 2012; Zoledowska et al. 2019). This strain was previously observed to cluster far away from the other P. parmentieri strains in a pan-genome tree due to the high number of unique genes present in this isolate (Zoledowska et al. 2018a). Here, we found that SCC3193 grouped together with strain NY1588A in a sub-clade within the lineage 6. Finally, the lineage 7 harbors 7 strains, including the type strain RNS 08-42-1Aᵀ isolated in France in 2008, 4 Poland strains isolated in 2013 (IFB5427, IFB5408) and 2014 (IFB5605, IFB5626), strain IFB5486 isolated from Belgium in 2012 and strain NY1722A isolated in New York in 2017. Except for strain IFB5427, which was isolated from weed, all other strains were isolated from either symptomatic tuber or potato stem.
Overall, all strains assigned to each of the 7 lineages grouped differently regardless of their geographic origin and isolation year (Fig. 1b), correlating with former phylogeographic analysis of this pathogen (Zoledowska et al. 2018b). This highly heterogenous pattern exhibited in the P. parmentieri population was suggested to be a consequence of high geographical mobility (Zoledowska et al. 2018b). Similarly, an elevated level of genetic diversity has been observed in the population structure analyses of the causal agent of tomato bacterial canker, Clavibacter michiganensis, as result of presumably multiple introductions events, where the international tomato seed trade has been highlighted as the leading force (Thapa et al. 2017; Ansari et al. 2019). Likewise, in P. parmentieri, the potato trading business appears to be one of the major contributors to the remarkably heterogeneity population of this pathogen and its global expansion.
Reconstructed Core-genome Genealogy and Rates of Homologous Recombination
To assess recombination events in the core genome of P. parmentieri and reconstruct its genealogy, a recombination analysis using the ClonalFrameML (Didelot and Wilson 2015) was carried out employing a maximum likelihood tree as starter. ClonalFrameML detected 5,125 recombination events, with 3,094 occurring in the nodes that gave origin to the different lineages identified by RhierBAPS, and 2,031 accounted for recombination regions across the distinct P. parmentieri strains used in this study (Fig. 2). The ClonalFrameML algorithm estimated the following homologous recombination rates (Table 1): the average length of recombined fragment (δ) was 83.456 bp, the average divergence between donor and recipient (ν) displayed a value of 0.0685 and the ratio of recombination to mutation rate was (R/θ) 0.236. This data shows that mutations have occurred 4.24 times more frequently compared to recombination. However, since each recombination event introduced an average of δν = 5.71 substitutions, the impact of recombination over mutation (r/m) was 1.348 times higher than mutation, emphasizing the relevance of recombination in P. parmentieri evolution.
Fig. 2.
Reconstructed genealogy and inferred homologous recombination events in the core genome of 32 p. parmentieri strains are depicted in the graphic. The graphic depicts the phylogenomic relationships among the 32 isolates, along with locations of inferred recombination and substitutions events. Reconstructed ML phylogenetic tree is depicted on the left; a color-coded vertical bar based on the lineages (clades) identified in RHierBAPS is shown in the center; inferred recombination events are illustrated in the right frame according to each position across the core genome (x-axis) and for each branch of the tree (y-axis). Unlabeled branches indicate ancestral lineages in the phylogeny. Dark blue horizontal lines pinpoint homologous recombination events, while vertical lines represent reconstructed substitutions or mutations. Light blue background color in the frame refers to no substitution or invariant polymorphic sites. White lines mark the absence of homoplasy, whereas the range of redness lines, from yellow to red, represent an increase of homoplasy levels. Inference of recombination analysis was conducted in ClonalFrameML using an RaxML tree with 1,000 bootstraps as a guide tree. Arrows indicate the nodes with the highest recombination events, while pink framed squares point to the most recombinant strains as inferred by ClonalFrameML. supplementary fig. S1, Supplementary Material online shows the position and names of all nodes as assigned by ClonalFrameML.
Table 1.
Recombination rates calculated in the core genome of P. parmentieri using ColnalFrameML
| Recombination parameters | Mean values inferred in P. parmentieri |
|---|---|
| δ = length of recombined fragments | 84.026 |
| ν = average divergence between donor and recipient, divergence of DNA imported by recombination | 0.0662275 |
| R/θ = ratio of rates of mutation and recombination | 0.24401 |
| r/m = (R/θ) x δ x ν = ratio of relative effects of recombination and mutation | 1.3578 |
In general, recombination events were detected in the ancestors of each lineage (Fig. 2, indicated with a red arrow). Supplementary fig. S1, Supplementary Material online shows the positions and numbers of the nodes assigned by ClonalFrameML. The highest numbers of recombination events were observed in specific nodes, with node 48 having the most (343 events), followed by node 56 (341 events), node 42 (339 events), node 40 (338 events), node 37 (317 events), node 45 (292 events), and node 50 (150 events). Moreover, we observed distinct degrees of homoplasy (Fig. 2, represented by yellow to red lines) across the lineages, seemingly imported as a consequence of mutation events occurring on the main branches of the genealogy tree. A similar effect has been reported in Staphylococcus aureus (Didelot and Wilson 2015). Intriguingly, all P. parmentieri strains assigned to the lineage 4 (Fig. 2, marked in a pink square) were detected as the most recombinant isolates in the clonal phylogeny. The Poland strain IFB5597 led the list with 320 recombination events, featuring recombinant fragments ranging from 2,283 to 9 bp. The South Korean strain HC ranked second in the list, exhibiting 311 recombination regions with sizes ranging 1,995 to 3 bp. Following, the American strain WPP163, isolated in Wisconsin, presented 306 recombination events with fragment lengths ranging from 1,273 to 5 bp. Finally, the Chinese strain WC19161 displayed 298 recombinant events with size ranging from 2,094 to 5 bp. Within the lineage 7, the Poland isolates IFB5626 and IFB5605 exhibited the highest number of recombinant events, with 72 and 60 inferred recombinant sites ranging from 1,584 to 4 bp and 5,490 to 6 bp, respectively. Additionally, the Belgium strain IFB5486 showed 57 recombinant sites with a maximum and minimum lengths of 885 and 4 bp, respectively, whereas the type strain RNS 08-42-1Aᵀ, isolated from France, and the Poland strains IFB5408 and IFB5427, along with NY1722A from New York, appeared as non-recombinant strains. Regarding lineage 5, the strains IFB5619 (Poland) and IPO1955 (unknown origin) showed 70 and 95 recombination events, with size ranging from 935 to 6 bp and 1,335 to 8 bp, respectively. In contrast, the strains IFB5604 (from Poland) and QK-5 (from China) solely presented 1 and 10 recombination regions of 48 bp and 2,152 to 25 bp, respectively, while PB20 (from Russia) and IFB5604 (from Poland) did not show any recombination. In lineage 6, only 2 strains NY1588A and the model organism SCC3193, presented recombination events of 26 and 33, with fragments sizes oscillating between 5,490 to 88 bp and 1,016 to 6 bp, respectively.
On the other hand, the New York NY1712A and Poland IFB5432 strains clustering in lineage 3 displayed 36 and 29 recombinant regions with lengths ranging from 624 to 36 bp and 780 to 2 bp, respectively, while no recombination events were inferred in strains IFB5623 and IFB5441. All strains in lineage 2 have undergone through homologous recombination, thus recombination numbers of 68, 58, and 51 were observed in the New York strains NY1584A, NY1587A, and NY1532B, with fragments sizes of 997 to 6 bp, 1,009 to 6 bp and 927 to 3 bp, respectively. In the case of lineage 1, the Canadian CFIA1002 (74 recombination events) and Pakistani strain SS90 (51 recombination events) strains showed recombinant sizes ranging from 1,261 to 4 bp and 606 to 5 bp, respectively. Altogether, these results indicate that the frequency of recombination varies across the strains. Related results were observed in recombination studies among Xylella fastidiosa subspecies (Vanhove et al. 2019). It seems reasonable that, similar to X. fastidiosa subspecies, due to geographic isolation, the introduction of foreign P. parmentieri isolates into new regions where other strains were already coexisting has contributed to homologous recombination events, leading to the emergence of novel strains.
Screening of Recombinant Genes Inferred in the P. parmentieri Strains and Ancestral Nodes in the Genealogy Tree
All recombinant fragments predicted by ClonalFrameML in either each strain or the ancestral nodes in the genealogy tree were scanned and tracked using a customized BLASTN tool. Genes related to several vital cell functions, such as DNA repair, RNA methylation, RNA processing and degradation, pyrimidine and purine conversions, nitrate and nitrite ammonification, biosynthesis of amino acids (methionine, alanine, cysteine, glutamine, glutamate, aspartate, asparagine, lysine), utilization of D-galacturonate, D-glucoronate, and D-ribose, uptake and utilization of lactose and galactose, phosphate metabolism, potassium homeostasis, biosynthesis of vitamins (thiamin, biotin, folate, pyridoxin), fatty acid biosynthesis, glycogen, glycerate and phosphate metabolism, protein chaperones, and response to stress showed to be affected by homologous recombination in the analysis (supplementary tables S2 to S44, Supplementary Material online). These findings suggest that homologous recombination may play a role in essential cellular processes for P. parmentieri. Indeed, homologous recombination has been highlighted to be fundamental for completion of DNA replication (Syeda et al. 2014); moreover, homologous recombination has showed to participate in the repair of DNA double-stranded breaks (DSBs) by incorporating foreign DNA as a template (Li and Heyer 2008, Wright et al. 2018). Interestingly, recombination was also found to occur within the genes involved in the Clustered Regularly Interspaced Short palindromic Repeats (CRISPR)-Cas (CRISPR-associated proteins) system. The CRISPR-Cas system confers an innate and adaptive immunity in many bacteria by defending the bacterial cell against bacteriophages or other exogenous invaders (Koonin and Makarova 2013; Makarova et al. 2015). In the case of P. parmentieri, 2 types of CRISPR-Cas systems have been reported: subtype I-F and subtype I-E (Arizala and Arif 2019). In this research, recombination fragments were detected in the cas genes encoding the CRISPR-associated (Cas) proteins Cas1 (subtype I-F) and Cas3 (subtype I-E) with maximum sizes of 1,009 and 469 bp, respectively. Strains WPP163 and WC19161 exhibited recombination signals in both genes cas1 and cas3 (supplementary tables S27 to S28, Supplementary Material online). While strains HC and NY1587A showed recombination only in cas1 (supplementary tables S29 and S38, Supplementary Material online). Conversely, in the strains IFB5597, NY1712A, and NY1584A, homologous recombination events were detected solely in cas3 (supplementary tables S26 and S36 to S39, Supplementary Material online). Likewise, in the genealogy tree (Fig. 2), recombination events within both cas1 and cas3 genes were also found in the ancestral nodes 37, 40, and 56, while nodes 41, 42, 48, and 53 showed recombination only in cas3 gene, and node 45, and 50 exhibited recombination only in cas1 (supplementary tables S4 to S22, Supplementary Material online). Altogether, this data suggests that homologous recombination within cas1 and cas3 derived from the main ancestral node 56 and was transmitted during evolution to the different lineage nodes, and finally, to the P. parmentieri strains mentioned above.
In line with our findings, previous studies have reported HGT events in CRISPR-Cas regions (Tyson and Banfield 2008; Chakraborty et al. 2010). Arizala and Arif (2019) described the acquisition of an entire type III-A CRISPR-Cas system in Pectobacterium aroidearum PC1 from Serratia species through HGT mechanism. The transfer of cas genes between identical loci surrounding CRISPR regions has been reported in Escherichia coli and Salmonella enterica genomes, suggesting that CRISPR-associated regions might serve as hot spots for recombination of cas genes (Touchon and Rocha 2010). Additionally, homologous recombination has been implicated in facilitating this genetic exchange of cas genes (Touchon and Rocha 2010). Subsequently, studies identified widespread microscale recombination events in several cas genes, highlighting the dynamic evolution of cas genes beyond cas1 (Takeuchi et al. 2012). Building on these earlier investigations, the homologous recombination detected in cas1 and cas3 in P. parmentieri in our study supports previous hypotheses about the coevolutionary competition of the CRISPR-Cas system as an immune defense mechanism against rapidly evolving viruses (Takeuchi et al. 2012; González-Torres et al. 2019).
Strikingly, recombination events consistently appeared in genes integral to critical pathogenicity determinants, such as the type VI secretion system (T6SS), iron uptake systems (siderophore aerobactin, ferric citrate, and enterobactin), LPS, plant cell wall degrading enzymes (CWDE), type IVb fimbrial low-molecular-weight protein/tight adherence protein (Flp/Tad) pilus, citrate metabolism, butanediol metabolism, flagella (supplementary tables S2 to S43, Supplementary Material online). These factors collectively play pivotal roles in the infection process, bacterial colonization, and the progression of soft rot/black leg diseases in Pectobacterium sp. (Nykyri et al. 2012; Charkowski 2018; Arizala and Arif 2019; Arif et al. 2022). This underscores the significant impact of homologous recombination on the evolution of several virulence-associated factors across the core genome of P. parmentieri.
The T6SS comprises 15 to 23 proteins, with 13 highly conserved proteins believed to be essential for its function (Shyntum et al. 2019). Although numerous functions such as virulence, bacterial competition, and host interaction have been attributed to the T6SS (Morgado and Vicente 2022), a recent study has suggested its involvement in biofilm formation in Pseudomonas aeruginosa (Chen et al. 2020). Within our analysis, we found recombination regions in 10 T6SS-related genes (supplementary tables S2 to S28 and S38, Supplementary Material online), including: impA encoding for the T6SS component TssA (ImpA) in the nodes 33 (252 bp) and 46 (98 bp), impC encoding for the T6SS component TssC (ImpC/VipB) in the strains IFB5597 (113 bp) and WC19161 (113 bp), impE encoding for the T6SS associated component TagJ (ImpE) in the node 49 (144 bp), impF encoding for the T6SS lysozyme-like component TssE in the nodes 40 (76 bp), 42 (261 bp), 45 (114 bp), 48 (375 bp), 56 (261 bp) and in the strains IFB5597 (364 bp), WPP163 (76 bp), and WC19161 (129 bp), vasA ecoding for the T6SS component TssF (ImpG/VasA) in the strain IFB5597 (30 bp), vasC encoding for the T6SS fork head associated domain protein ImpI/VasC in the node 44 (141 bp), vasD encoding for the T6SS secretion lipoprotein TssJ (VasD) in the node 37 (59 bp), vasK encoding for the T6SS component TssM (IcmF/VasK) in the node 44 (93 bp), vasL encoding for the Type VI secretion-related protein VasL in the nodes 46 (63 bp) and 37 (79 bp) as well as in the strain NY1587A (138 bp), stk1 encoding for the T6SS Serine/threonine protein kinase PpkA in the node 52 (104 bp). Additionally, ClonalFrameML detected recombination events in the genes encoding for the products haemolysin-coregulated protein (Hcp) in the nodes 40 (306 bp) and 43 (48 bp), in the strains HC (12 bp) and SS90 (93 bp), and the valine-glycine repeat protein (VgrG) in the nodes 36 (220 bp) and 40 (220 bp) (supplementary tables S3 to S9 and S29 and S33, Supplementary Material online). Both Hcp and VgrG are considered as major effector proteins injected from T6SS into host cells (Pukatzki et al. 2006; Mattinen et al. 2007). Importantly, the ImpC protein, a T6SS element showing recombination in our analysis, has been identified as a crucial component of the T6SS, playing a role in virulence and participating in the intracellular host response in Pseudomonas syringae pv. tomato (Sarris et al. 2010). Moreover, T6SS gene clusters are often found within predicted genomic islands, suggesting the potential of this system for HGT between bacteria (Morgado and Vicente 2022). Nykyri et al. (2012) reported 2 T6SS in the genome of the Finland strain P. parmentieri SCC3193, with one located on a genomic island. Our findings further underscore that in P. parmentieri, homologous recombination seems to be another important mechanism in the evolution of the T6SS core genes. Consistent with our findings, frequent homologous recombination has been observed in Vibrio cholera and is considered a major contributor to the high diversity of T6SS effector genes (Kirchberger et al. 2017). Comprehensive studies are warranted to establish a conclusive understanding of the specific role of homologous recombination in the evolution of the T6SS in P. parmentieri.
Interestingly, ClonalFrameML inferred recombination events in 3 different LPS encoding gene clusters. Polysaccharides are widely recognized bacterial virulence factors, serving various functions, including attachment inside the cell host, protection from plant toxins and adverse environmental conditions, and favoring host colonization (Toth et al. 2006). Concerning the enterobacterial common antigen (ECA), recombination was predicted in 5 genes: wzxE (lipid III flippase) found in the nodes 36 (647 bp), 37 (145 bp), 39 (647 bp), 44 (1050 bp), 46 (1115 bp), 52 (2152 bp), 57 (1238 bp), 58 (2152 bp), 61 (1245 bp) and in the strains IFB5597 (345 bp), WPP163 (345 bp), WC19161 (2094bp), SCC3193 (512 bp), QK-5 (2152 bp), NY1588A (2162 bp), and IFB5432 (445), rffG (dTDP (deoxythymidine diphosphate)-glucose 4,6-dehydratase) found in the nodes 37 (309 bp), 39 (422 bp), 46 (524 bp), 52 (524 bp), 57 (522 bp), 58 (522 bp), 61 (357 bp), and in the strains IFB5597, WPP163, WC19161, NY1588A, and IFB5432, wecC (UDP (uridine diphosphate)-N-acetyl-D-mannosamine dehydrogenase) and wecA (undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphate transferase) found merely in the nodes 56 (110 bp) and 37 (65 bp), respectively (supplementary tables S3 to S43, Supplementary Material online). The ECA plays a major role in bacterial physiology, influencing the host immune response and participating in host-pathogen interactions (Rai and Mitchell 2020). Beyond ECA, the exopolysaccharide (EPS) and O-antigen cluster also underwent recombination in 2 genes (supplementary tables S18 to S37, Supplementary Material online): gnd (6-phosphogluconate dehydrogenase, decarboxylating) with recombined fragments observed in the nodes 52 (504 bp), 57 (465 bp), 58 (504 bp), 61 (183 bp) and in the isolates IFB5597 (336 bp), WC19161 (715 bp), IFB5626 (1335 bp), IPO1955 (1335 bp), QK-5 (321 bp) and NY1588A (1378 bp), and wza (putative polysaccharide export protein YccZ precursor) found in the nodes 52 (1129 bp) and 58 (1094 bp) and in the strains WC19161 (1079 bp), QK-5 (1094 bp), NY1588A (1100 bp). The lipo-oligo/polysaccharide cluster (LOS/LPS) presented recombination in 3 genes (supplementary tables S4 to S32, Supplementary Material online): waaC (lipopolysaccharide core heptosyltransferase I) found in the nodes 37 (411 bp), 40 (238 bp), 42 (505 bp), 43 (116 bp), 46 (799 bp), 50 (117 bp), 53 (105 bp), 56 (372 bp) and in the strains IFB5597 (233 bp), WC19161 (115 bp), IPO1955 (505 bp) and IFB5619 (435 bp), rfaD (ADP (adenosine diphosphate)-L-glycero-D-manno-heptose-6-epimerase) found in 2 nodes 37 (96 bp) and 50 (81 bp), and waaL1 (O-antigen ligase) found only in the node 43 (196 bp). Our results show that homologous recombination actively contributes to the evolution of polysaccharide clusters, facilitating genetic exchange within the P. parmentieri population. Consistent with our findings, recombinant alleles in LPS genes have been described in evolutionary studies of plant pathogens such as Xylella fastidiosa, Xanthomonas euvesicatoria, and X. perforans (Jibrin et al. 2018; Potnis et al. 2019). Moreover, we speculate that the observed recombination homologous in different LPS-genes might be involved with the recently described heterogeneity within the LPS structure of certain P. parmentieri strains (Ossowska et al. 2022).
The ability to acquire iron constitutes a relevant virulence determinant during bacterial pathogenesis, especially in iron-deficient environments due to metal ion restriction adopted by the host as a nutritional immunity strategy (Palmer and Skaar 2016; Mosbahi et al. 2018). Recombination events were predicted in genetic regions associated with 3 different iron uptake systems: the ferric citrate (fecIRABCDE) transport system, the siderophore aerobactin (iuc) and the siderophore enterobactin (ent). In the ferric citrate cluster (supplementary tables S4, S6 to S16 and S39, Supplementary Material online), the gene fecD, encoding for the iron (III) dicitrate transport system permease protein FecD, manifested recombination in the nodes 37 (33 bp), 40 (78 bp), 41 (684 bp), 42 (54 bp), 48 (342 bp), 50 (209 bp), and only in the isolate NY1584A (54 bp). In pathogenic Gram-negative bacteria, this system is formed inside plant tissues and enabling long-distance iron transport (Franza and Expert 2013; Gorshkov et al. 2021). On the other hand, the siderophore aerobactin presented signals of recombination in 4 genes (supplementary tables S4 to S43, Supplementary Material online): fhuB (ferric hydroxamate ABC transporter, permease component FhuB) showing recombination in the nodes 48 (566 bp) and 50 (397 bp), and in the strains IFB5597 (295 bp), WPP163 (806 bp), WC19161 (62 bp), HC (62 bp), IFB5626 (62 bp), and SS90 (29 bp), fhuC (ferric hydroxamate ABC transporter, ATP-binding protein FhuC) found solely in IPO1955 (147 bp), and iucC (aerobactin biosynthesis protein IucC) found in the nodes 54 (135 bp) and 57 (314 bp) and strain IFB5432 (181 bp), and the iron-chelator utilization protein with recombination regions observed in the nodes 37 (81 bp), 40 (31 bp), and 48 (31 bp), and the isolates WPP163 (211 bp) and NY1584A (178 bp). Importantly, Ling et al. (2013) reported that the aerobactin synthesis gene, iucC, plays a significant role in the pathogenicity of Escherichia coli O2 strain E058. Recombination in the siderophore enterobactin was only detected in the node 56 (288 bp) in the gene encoding a transcriptional regulator of the AraC family, enterobactin-dependent (supplementary table S22, Supplementary Material online). In P. atrosepticum, this siderophore has been hypothesized to confer resistance to different stressors and contribute to virulence (Gorshkov et al. 2021). Additionally, a coding sequence annotated as ferric iron ABC transporter, permease protein, and cataloged in the subsystem “iron acquisition in Streptococcus” displayed recombination events in 6 ancestral nodes (supplementary tables S4 to S42, Supplementary Material online): 37 (183 bp), 42 (643 bp), 48 (62 bp), 49 (123 bp), 53 (91 bp), 56 (123 bp), and 6 isolates: IFB5597 (91 bp), WPP163 (408 bp), HC (148 bp), NY1712A (48 bp), NY1532B (121 bp), and IFB5486 (221 bp). In general, these distinct recombination rates observed in different iron-scavenging genes might reflect an adaptative mechanism adopted by P. parmentieri to successfully colonize the plant tissue and survive in poorly iron environments.
The secretion of CWDE is the most well-recognized and the most studied pathogenicity determinant in pectinolytic bacteria of the genera Pectobacterium and Dickeya; indeed, efficient production of CWDE is directly linked with soft rot and black leg symptoms (Nykyri et al. 2012; Charkowski 2018; Arizala and Arif 2019; Boluk et al. 2021; Arif et al. 2022). In our study, recombining signatures were detected mainly in the pelX gene (supplementary tables S6, S11, S26 to S29, Supplementary Material online), encoding for an exopolygalacturonate lyase, in the strains IFB5597 (267 bp), WC19161 (225 bp) and HC (91 bp) as well as in the ancestral nodes 40 and 45 (91 bp). The pectin methylesterase, pemB, also exhibited recombination in the node 49 (44 bp; supplementary table S15, Supplementary Material online), while in the case of the oligogalacturonides degradation, the kdfF (pectin degradation protein KdgF) locus underwent recombination in the Wisconsin strain WPP163 (81 bp; supplementary table S27, Supplementary Material online) and in the ancestral nodes 42 (81 bp; supplementary table S8, Supplementary Material online) and 45 (91 bp; supplementary table S11, Supplementary Material online). In agreement with our analysis, recombination regions have also been encountered in the CWDE-genes of the pathogen Xylella fastidiosa (Potnis et al. 2019). We also observed recombination in genes encoding for proteases and peptidases. For instance, a recombinant fragment of 98 bp was identified in the gene pqqL (putative zinc protease) in the node 51 (supplementary table S17, Supplementary Material online). On the other hand, 3 loci (citC, dcuC, citF) involved with citrate metabolism, uptake and regulation showed evidence of recombination. The genes citC (citrate [pro-3S]-lyase ligase) and citF (citrate lyase alpha chain) displayed recombination fragments of 78 and 321 bp, only in the strain HC and node 48, respectively (supplementary tables S14 and S29, Supplementary Material online). Conversely, dcuC has undergone recombination in 4 strains (SCC3193, NY1588A, IFB5605, and IFB5432; supplementary tables S34, S37, S41, and S43, Supplementary Material online) and 4 nodes (33, 46, 49, and 57; supplementary tables S2, S12, S15, and S23, Supplementary Material online). The New York strain NY1588A harbored the largest recombined fragment of 1,263 bp (supplementary table S37, Supplementary Material online). The citrate uptake system was reported to be critical for bacterial virulence of P. atrosepticum by reducing the citrate concentration during colonization in potato tubers (Urbany and Neuhaus 2008). Our findings highlight the relevance of homologous recombination in the evolution of the citrate uptake system and its potential association with the adaptation of certain P. parmentieri strains to host environments containing high citrate levels. The 3-hydroxy-2-butanone (3H2B—acetoin, butanediol) pathway has been characterized as another relevant factor for P. carotovorum pathogenesis by promoting media alkalization and, consequently, favoring plant cell maceration (Marquez-Villavicencio et al. 2011). In our analysis, the strains IFB5597 (36 bp), WC19161 (60 bp), and ancestral nodes 40 (147 bp) and 45 (60 bp) experienced recombination events in the gene product acetolactate synthase large subunit (supplementary table S6, S11, S26 to S28, Supplementary Material online).
The Flp/Tad (Fimbrial low-molecular-weight protein/Tight adherence protein) pilus, encoded by the flp/tad genes, was characterized as another novel virulence determinant in Pectobacterium due to its implication with maceration of potato tubers (Nykyri et al. 2013). ClonalFramelML revealed recombination events in 4 genes of the flp/tad biogenesis cluster (supplementary tables S2 to S8 and S16 to S31, Supplementary Material online). The tadA (type II/IV secretion system ATP hydrolase TadA/VirB11/CpaF, TadA subfamily) gene showed recombination in the node 37 (96 bp), 42 (66 bp), 57 (79 bp), and strain WC19161 (102 bp). Recombination in the rcpC (Flp pilus assembly protein RcpC/CpaB) gene was found in the nodes 40 (42 bp), 50 (52 bp), and in the strains HC (85 bp) and IF5626 (52 bp). The tadC (type II/IV secretion system protein TadC, associated with Flp pilus assembly) gene displayed recombination regions in the nodes 33 (21 bp) and 52 (21 bp), whereas recombination within the rcpB (Flp pilus assembly protein RcpB/CpaD) locus was predicted only in the node 57 (32 bp). A comparative genomics study showed that the flp/tad cluster is harbored by all Pectobacterium species (Arizala and Arif 2019). The discovery of recombination signals in some of the flp/tad genes highlights how this system is evolving via genetic exchange in P. parmentieri, presumably linked to the host adaptation of the pathogen to efficiently macerate tissues.
The flagella-encoding cluster has been shown to be pivotal for motility, colonization, and virulent lifestyle in numerous pathogenic bacteria, including Pectobacterium (Toth et al. 2006; Nykyri et al. 2012; Arizala and Arif 2019). Three flagella-genes have undergone recombination, including fliD, fliK, flgI (supplementary tables S14 to S16 and S26 to S29 and S41, Supplementary Material online). While recombinant fragments of 155 bp located in node 50 and 300 bp in strain IFB5605 were found in the loci fliD (flagellar cap protein FliD) and fliK (flagellar hook-length control protein FliK), respectively, the flgI (flagellar P-ring protein FlgI) gene displayed recombination in 4 strains (IFB5597–286 bp, WPP163–286 bp, WC19161–393 bp, and HC—387 bp) and 2 nodes (48 to 279 bp and 50 to 99 bp). In a former study, the flagellar genes were shown to participate in chemotaxis, motility, and were fundamental for pathogenesis of D. dadantii (Jahn et al. 2008). Moreover, a reduced virulence was observed in tobacco plants infected with P. parmentieri SCC3193 mutants lacking motility (Pirhonen et al. 1991). In another report, the flagellum was related t the secretion of colin, a toxin released to kill competitors in the niche, thereby suggesting an antagonist role of the flagellar apparatus in Pectobacterium (Charkowski et al. 2012). Finally, the gene predicted as pgaC, encoding for the biofilm PGA (poly-gamma-glutamic acid) synthesis N-glycosyltransferase PgaC protein, showed a recombination region of 246 bp in the strain WC19161. PgaC was demonstrated to participate in host-bacteria interactions and modulate biofilm formation in the human pathogen Klebsiella pneumoniae (Chen et al. 2014).
Insights into Ancient and Recent Recombination Events in P. parmentieri
To examine the ancestral and recent recombinogenic regions between and within the 7 lineages (inferred by RhierBAPS—Fig. 1b), we used the fastGEAR software (Mostowy et al. 2017), utilizing the core-genome alignment obtained from the Roary pipeline as input data. Ancestral recombinations are defined as recombinant regions present in all strains belonging to the lineage (Mostowy et al. 2017). Since the lineage 7 contained the largest number of strains within the P. parmentieri population, according to the principle of parsimony, this lineage was assigned as the main donor, while the other lineages (1 to 6) served as recipients for the ancestral recombination analysis. Thus, excluding lineage 7, all other lineages have undergone recombination, as depicted by Fig. 3a, where a mosaic-like pattern across the core genome is evident. In total, 486 ancient recombination events were detected in the core genome obtained from the 32 P. parmentieri isolates (Fig. 3b).
Fig. 3.
Ancestral recombination events identified using fastGEAR across the core genome of 32 P. parmentieri strains are illustrated. a) ML phylogeny, using a bootstrap support of 1,000 replicates, is illustrated on the left, whereas a panel showing the recombination sites and the population genetic structure is located on the right. The y-axis on the panel displays the 32 strains present in the core genome alignment, and the x-axis displays the sequence positions. Strains on the y-axis were positioned according to the ML phylogeny. Seven lineages previously defined with RhierBAPS were used in the analysis. The vertical color-coded bar on the right side of the panel shows the division of the 32 strains into the 7 predicted lineages. Each lineage represents a specific clade and is labeled and colored as indicated in the supplementary fig. S2, Supplementary Material online. The black color denotes recombination from outside origin of any lineage in the data set. The presence of different color lines within a lineage in the frame pinpoints ancestral recombination event sites occurred respect to the sequence position. White gaps in the panel show the recent recombination events that were removed before the analysis of ancestral recombination. b) Horizontal bar graphic illustrating the number of ancestral recombinations that were received in each lineage. The bars were colored based on the color of the lineages previously assigned in Fig. 1b.
The overall size of ancestral recombination events ranged from 43 to 119 kb, with a median length of 4.8 kb. Lineage 2 exhibited the highest number of recombinations, totaling 126 events originating from all 6 lineages (lineage 2–16, lineage 3–14, lineage 4–1, lineage 5–23, lineage 6–19, and lineage 7–22). This finding highlights the 3 New York strains (NY1584A, NY1532B, NY1587A) belonging to the lineage 2 as highly ancient recombinants. In contrast, lineage 5 presented 114 ancestral recombination events, with fragments donated by lineages 6 (80 events) and 7 (34 events). Notably, this lineage transferred the highest number of recombinogenic regions (191 events) to the other lineages, potentially resulting from coexistence of this lineage with the other ones in the same ecological niche, promoting DNA recombination. In Xyllela fastidiosa, for instance, homologous recombination was reported in the South American strains sharing the same geographic area (Coletta-Filho et al. 2017). Lineage 4 has experienced the lowest level of ancestral recombination events, with 1 and 5 fragments donated by lineages 5 and 4, respectively. In the case of lineage 6, 41 events were detected, originating from lineage 7.
Regarding recent recombinations, these events refer to recombinant regions present in a subset of strains within a lineage (Mostowy et al. 2017). Similar to the ancestral recombination plot, a mosaic like-pattern indicative of recent recombination signals was observed in the core genome of most of the strains (Fig. 4a). A total of 941 unique events were detected, with some events repeating across the strains, resulting in a total of 1,177 events (Fig. 4b). Upon scanning these recent recombination regions, we identified 412 genes that have undergone recent recombination, which represents 11% of the P. parmentieri core genome. The recombinant fragments displayed an average length of 1.38 kb, ranging from 36 bp to 13.96 kb. The 2-fold number of recent recombination events compared to the ancestral ones highlights that P. parmentieri is continuously evolving and adapting to new environments. This observation aligns with the widespread distribution of this bacterium reported in many countries in recent years (Pitman et al. 2010; Pasanen et al. 2013; Zoledowska et al. 2018b; Arizala et al. 2019). No signs of homologous recombination could be inferred in the strains IFB5408, RNS 08-42-1A, NY1722A, IFB5427, IFB5623, and IFB5441. In contrast, the strains IFB5626, IFB5597, HC, and NY1584A showed high levels of recombinants with 140, 122, 113, and 87 events, respectively (Fig. 4b). Strains CFIA1002, QK-5, and SS90 experienced a relatively low rate of recombination with 9, 10, and 13 events, respectively. Lineage 1 appeared as the major donor with 272 events (Fig. 4c), donating mainly to the strains IPO1955 (24%), IFB5626 (21%) and NY1584A (18%). On the contrary, 5% of the total recent recombination events were categorized as outside origins (Fig. 4c) and identified in 10 strains: HC, IFB5485, IFB5486, IFB5597, IFB5626, NY1588A, PB20, SCC3193, WC19161, and WPP163. These outside origin events are defined as recombined regions originating from outside the population or not corresponding to any of the lineages found in the population (Mostowy et al. 2017). Strains NY1588A and IFB5597 led the highest position in this category, each exhibited 15 and 16 outside origin events, respectively. Previous studies have documented the coexistence of soft rot bacteria from the genera Pectobacterium and Dickeya in the same niche (Degefu 2021; Ge et al. 2021; Kastelein et al. 2021). Considering this, it seems logical that these external origin regions might have been incorporated in the genomes of these 9 P. parmentieri strains through genetic exchange with either other Pectobacterium species or members of the Dickeya genus.
Fig. 4.
Recent recombination events identified using fastGEAR across the core genome of 32 P. parmentieri strains are illustrated. a) The y-axis on the panel displays the 32 strains present in the core genome alignment, and the x-axis displays the sequence positions. Strains on the y-axis were positioned according to the ML phylogeny depicted on the left. Seven lineages previously defined with RhierBAPS were used in the population clustering analysis. The vertical color-coded bar on the right side of the panel shows the division of the 32 strains into the 7 predicted lineages. Each lineage represents a specific clade and is labeled and colored as indicated in the supplementary fig. S1, Supplementary Material online. Black color denotes recombination from outside origin of any lineage in the data set. The presence of different color lines within a lineage in the frame pinpoints recent recombination event sites occurred respect to the sequence position. b) Bar graphic depicting the number of recent recombinations received by each strain. The bars were color-coded based on the color of the lineages assigned in supplementary fig. S2, Supplementary Material online. ML tree based on the core genome alignment of 32 P. parmentieri strains. The positions and names of all nodes assigned by ClonalFrameML are indicated and highlighted with different colors. The tree was visualized using FigTree v1.4.4 and midpoint rooted. c) Pie chart showing the number of recent recombinations percentagewise donated by each lineage and outside of the P. parmentieri population.
Subsequently, we traced and identified the coordinates and functional implications of all gene regions affected by ancestral (supplementary tables S45 to S50, Supplementary Material online) and recent recombination events (supplementary tables S51 to S76, Supplementary Material online). Genes associated with cellular functions, including DNA repair, DNA replication, RNA methylation, RNA processing and degradation, tRNA aminoacylation, cell respiratory pathways, dehydrogenase complexes, purine and pyrimidine conversions, fatty acid biosynthesis, redox cycle, lactose, and galactose utilization, sucrose utilization, glycogen metabolism, and oxidative stress have been affected by both ancestral and recent recombination events (Tables 2 to 3). This underscores the potential significance of homologous recombination in the evolution of P. parmentieri. Recombinant fragments from recent and ancestral events were also observed in genes encoding amino acids and vitamins biosynthesis, phosphate metabolism, copper, and potassium homeostasis, nitrite and nitrate ammonification, carbon starvation, nitrosative stress, and ammonia assimilation (Tables 2 to 3). In X. fastidiosa and Xanthomonas species, recombination in these loci has been associated with an adaptability mechanism for surviving in niches with limited nutrient sources (Chen et al. 2018; Potnis et al. 2019).
Table 2.
Important ancestral recombination events predicted in the core-genome of Pectobacterium parmentieri. The genes where the recombination events were identified are listed per subsystem and their product name is described as annotated in the RASTtk server
| Subsystem/functional category | Gene product name (Coding DNA sequencing) | Recipient lineages (No. recombinant events)a | Donor lineagesb | Largest recomb. fragmentc | |||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | ||||
| Biotin biosynthesis (Vitamin B7) | Adenosylmethionine-8-amino-7-oxononanoate aminotransferase (EC 2.6.1.62) | 1 (1)a, 3 (1), 5(1), 6 (1) | 6 | 7 | 7 | 7 | 2802 | ||
| Biotin synthase (EC 2.8.1.6) | 1 (1), 3 (1), 6 (1) | 6 | 7 | 7 | 1038 | ||||
| Phosphate metabolism | Soluble pyridine nucleotide transhydrogenase (EC 1.6.1.1) | 1 (1), 2 (1), 3 (1), 4(1) | 2 | 3 | 6 | 7 | 3573 | ||
| Nitrate and nitrite ammonification | Nitrate ABC transporter, ATP-binding protein | 2 (1), 5 (1), 6 (1) | 5 | 6 | 7 | 1797 | |||
| Flagellar cluster | Flagellar hook-length control protein FliK | 1 (1), 2 (1), 5(1) | 2 | 6 | 6 | 1830 | |||
| Flagellar basal-body P-ring formation protein FlgA | 1 (1), 6 (1) | 6 | 7 | 669 | |||||
| Flagellar basal-body rod protein FlgB | 1 (1), 6 (1) | 6 | 7 | 417 | |||||
| Flagellar basal-body rod protein FlgC | 1 (1), 6 (1) | 6 | 7 | 405 | |||||
| Flagellar basal-body rod modification protein FlgD | 1 (2) | 6,7c | 7 | 675 | |||||
| Flagellar hook protein FlgE | 1 (1), 6 (1) | 6 | 7 | 1218 | |||||
| Flagellar basal-body rod protein FlgF | 1 (1), 6 (1) | 6 | 7 | 756 | |||||
| Flagellar basal-body rod protein FlgG | 1 (1), 2 (1), 6(1) | 7 | 3 | 7 | 783 | ||||
| Flagellar P-ring protein FlgI | 1 (1), 2 (1) | 7 | 3 | 1791 | |||||
| RNA polymerase sigma factor for flagellar operon | 1 (1) | 7 | 640 | ||||||
| Flagellar cap protein FliD | 1 (1), 3 (1) | 7 | 7 | 1440 | |||||
| Flagellar hook-basal body complex protein FliE | 1 (1), 3 (1) | 7 | 7 | 315 | |||||
| Flagellar M-ring protein FliF | 1 (1), 3 (1) | 7 | 7 | 1704 | |||||
| T1SS | T1SS ATPase/ABC-type protease exporter, ATP-binding component PrtD/AprD | 3 (1) | 4 | 2583 | |||||
| Type I secretion outer membrane protein, outer membrane component PrtF/AprF | 1 (1) | 6 | 1151 | ||||||
| Type I secretion system for aggregation | T1SS secreted agglutinin RTX/RTX (repeats-in-toxin) toxins and related Ca2+-binding proteins | 1 (1) | 3 | 2714 | |||||
| Type I secretion membrane fusion protein, HlyD family/T1SS, membrane fusion protein LapC | 3 (1) | 4 | 1170 | ||||||
| Membrane bound c-di-GMP (cyclic-dimeric guanosine monophosphate) receptor LapD | 2 (1) | 3 | 1953 | ||||||
| Type IV pilus biogenesis | Type IV pilus biogenesis protein PilO | 1 (1) | 3 | 618 | |||||
| Type IV pilus biogenesis protein PilQ | 3 (1) | 4 | 1275 | ||||||
| Type IV pilus biogenesis protein PilN | 1 (1), 5(1) | 3 | 7 | 38 | |||||
| Type IV pilus biogenesis protein PilM | 1 (1) | 5 | 852 | ||||||
| Widespread colonization island (Flp/Tad cluster) | Type II/IV secretion system ATP hydrolase TadA/VirB11/CpaF, TadA subfamily | 3 (1) | 6 | 1404 | |||||
| Type VI secretion system (T6SS) | T6SS component TssA (ImpA) | 1 (1) | 5 | 78 | |||||
| T6SS component TssB (ImpB/VipA) | 3 (1) | 5 | 531 | ||||||
| VgrG protein | 1 (1), 5 (1) | 7 | 7 | 2718 | |||||
| T6SS component TssG (ImpH/VasB) | 3 (1) | 6 | 1024 | ||||||
| T6SS lysozyme-like component TssE | 1 (1) | 2 | 397 | ||||||
| Type VI secretion-related protein VasL | NY1587A (1) | … | 815 | ||||||
| T6SS secretion lipoprotein TssJ (VasD) | 1 (1), 2 (1) | 5 | 6 | 525 | |||||
| T6SS secretion lipoprotein TssJ (VasD) | 1 (1) | 2 | 104 | ||||||
| T6SS AAA + chaperone ClpV (TssH) | 2 (2) | 3 | 2598 | ||||||
| Antibiotic biosynthesis | Phenazine biosynthesis protein PhzF | 3 (1) | 4 | 897 | |||||
| Acetoin, butanediol metabolism | 2,3-butanediol dehydrogenase, S-alcohol forming, (R)-acetoin-specific (EC 1.1.1.4)/Acetoin (diacetyl) reductase (EC 1.1.1.304) | 3 (1) | 6 | 774 | |||||
| Acetolactate synthase, catabolic (EC 2.2.1.6) | 2 (1) | 3 | 897 | ||||||
| Alpha-acetolactate decarboxylase (EC 4.1.1.5) | 2 (1) | 3 | 783 | ||||||
| Metallocarboxypeptidases | D-alanyl-D-alanine carboxypeptidase (EC 3.4.16.4) | 1 (1), 5 (1) | 6 | 6 | 1434 | ||||
| Citrate metabolism, transport, and regulation | C4-dicarboxylate transporter DcuC (TC 2.A.61.1.1) | 2 (1), 5 (1) | 3 | 7 | 2073 | ||||
| [Citrate [pro-3S]-lyase] ligase (EC 6.2.1.22) | 2 (1) | 6 | 1065 | ||||||
| Citrate lyase alpha chain (EC 4.1.3.6) | 1 (1) | 3 | 1207 | ||||||
| Citrate lyase beta chain (EC 4.1.3.6) | 2 (1) | 6 | 826 | ||||||
| Siderophore aerobactin | Ferric hydroxamate ABC transporter (TC 3.A.1.14.3), permease component FhuB | 5 (1), 6 (1) | 6 | 7 | 1841 | ||||
| N(2)-citryl-N(6)-acetyl-N(6)-hydroxylysine synthase (EC 6.3.2.38), aerobactin biosynthesis protein IucA | 3 (1) | 6 | 1725 | ||||||
| N6-hydroxylysine O-acetyltransferase (EC 2.3.1.102), aerobactin biosynthesis protein IucB | 3 (1) | 6 | 537 | ||||||
| Aerobactin siderophore receptor IutA | 2 (1) | 3 | 1312 | ||||||
| Iron siderophore sensor (hasRADEF) | Hemophore HasA outer membrane receptor HasR/Iron siderophore receptor protein | 5 (1) | 6 | 167 | |||||
| Siderophore enterobactin | Ferric enterobactin-binding periplasmic protein FepB (TC 3.A.1.14.2) | 3 (1) | 6 | 1089 | |||||
| Ferric enterobactin transport ATP-binding protein FepC (TC 3.A.1.14.2) | 3 (1) | 6 | 831 | ||||||
| Ferric enterobactin transport system permease protein FepD (TC 3.A.1.14.2) | 3 (1) | 6 | 1053 | ||||||
| Ferric enterobactin transport system permease protein FepG (TC 3.A.1.14.2) | 3 (1) | 6 | 1020 | ||||||
| CRISPRs | CRISPR-associated protein, Cse4 family | 1 (1) | 2 | 1105 | |||||
| CRISPR-associated protein, Csy2 family | 2 (1) | 3 | 1161 | ||||||
| CRISPR-associated protein, Csy3 family | 2 (1) | 3 | 1014 | ||||||
| Polysaccharide biosynthesis | Capsular polysaccharide biosynthesis protein cpsA/wcaJ | 1 (1) | 3 | 1188 | |||||
| Undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphate transferase (EC 2.7.8.33) | 1 (1) | 3 | 1083 | ||||||
| UDP-N-acetyl-D-mannosamine dehydrogenase (EC 1.1.1.336) | 1 (1) | 3 | 2251 | ||||||
| CWDE | Pectate lyase (pelI) | 1 (1) | 2 | 1044 | |||||
| Endopolygalacturonase (EC 3.2.1.15) (pehA) | 5 (1) | 6 | 1209 | ||||||
| Pectate lyase I precursor Pel1 (pelA) | 2 (1), 5 (1) | 6 | 6 | 1125 | |||||
| Pectate lyase II precursor Pel2 (pelB) | 2 (1), 5 (1) | 6 | 6 | 133 | |||||
| Pectate lyase III precursor Pel3 (pelC) | 2 (1), 5 (1) | 6 | 6 | 1125 | |||||
| Pectate lyase L precursor (EC 4.2.2.2) (pelL) | 2 (1) | 6 | 4 | ||||||
| Exo-poly-alpha-D-galacturonosidase (EC 3.2.1.82) (pehX) | 5 (1) | 6 | 1980 | ||||||
| Exopolygalacturonate lyase (EC 4.2.2.9) (pelX) | 6 (1) | 7 | 2509 | ||||||
| Pectinesterase (EC 3.1.1.11) (pemA) | 6 (1) | 7 | 14 | ||||||
Numbers 1 to 7 refer to the lineages detected by the population structural analysis hierBAPS; for instance, 1 means to all strains belonging to Lineage 1.
a Numbers between parenthesis in column 3 indicates the number of ancestral recombination events identified in each of the recipient lineages (1 to 6) for each gene product shown in column 2. For instance, 3 (1) means that Lineage 3 in that specific gene product presented 1 recombination event.
b Presence of numbers in column 4 “Donor Lineages” points which lineage acted as donor (1 to 7) or donated the recombination segment to its respective recipient lineage (1 to 6) displayed in column 3. Subcolumns 1 to 6 (located in panel beneath “Donor Lineages” header) refers to the respective recipient lineages provided in column 3. For instance, a number 6 in the sub-column 1 within the “Donor Lineage” column means that the Lineage 6 donated that recombination event to the Lineage 1.
c Numbers separated by a comma in the subcolumns (1 to 6) within the “Donor Lineages” column indicates that 2 or more lineages acted as donors. For instance, “6,7” within the subcolumn 1 means that Lineages 6 and 7 acted as donors for that recombination event in the Lineage 1.
d Column 5 indicates the largest recombined fragment in base pair (bp) found for that gene product provided in column 2.
Table 3.
Summary of main recent recombination events predicted in the core-genome of Pectobacterium parmentieri. The genes where the recombination events were identified are listed per subsystem and their product name are described as annotated in the RASTtk server
| Subsystem/functional category | Gene product name (Coding DNA sequencing) | Recombined strains (No. recombinant events)a | Donor lineages (donor lineage corresponding to strains of column 3)b | Largest recomb. Fragmentc |
|---|---|---|---|---|
| Lactose and galactose uptake and utilization | Galactokinase (EC 2.7.1.6) | IFB5626 (1), IFB5605 (1), IFB5486 (1), SCC3193 (1), NY1548A (1), NY1585A (1), NY1540A (1), NY1533B (1) | L2, L3, L3, L3, L3, L3, L3, L3 | 937 |
| Phosphate metabolism | Soluble pyridine nucleotide transhydrogenase (EC 1.6.1.1) | HC (2), IFB5597 (2), IPO1955 (2), IFB5432 (1) | L1&L2d, L3&L6, L3&L7, L5 | 2908 |
| Potassium homeostasis | Kup system potassium uptake protein | IFB5626 (1), IPO1955 (1), IFB5619 (1), SCC3193 (2), IFB5485 (1) | L1, L1, L1, L4&L7, L1 | 1365 |
| Copper homeostasis: copper tolerance | Apolipoprotein N-acyltransferase/Copper homeostasis protein CutE | IFB5597 (1), WPP163 (2), SCC3193 (1), NY1588A (1) | L2, L1&L3, L2, L2 | 1530 |
| Methionine biosynthesis | 5-methyltetrahydrofolate–homocysteine methyltransferase (EC 2.1.1.13) | IFB5597 (1), WPP163 (3), WC19161 (2) | L2&L5, L2&L6, L2&L6 | 1249 |
| S-adenosylmethionine synthetase (EC 2.5.1.6) | IFB5597 (4), WPP163 (2), WC19161 (3) | L1-L3, L2, L2&L6 | 1179 | |
| Cysteine biosynthesis | Cys regulon transcriptional activator CysB | IFB5626 (1), NY1584A (1), IPO1955 (1), IFB5619 (1), IFB5485 (1) | L3, L1, L1, L1, L1 | 1968 |
| Sulfite reductase [NADPH] hemoprotein beta-component (EC 1.8.1.2) | IFB5432 (1), NY1548A (1), NY1585A (1), NY1540A (1), NY1533B (1) | L5, L3, L3, L3, L3 | 2193 | |
| Sulfite reductase [NADPH] flavoprotein alpha-component (EC 1.8.1.2) | IFB5432 (1), NY1548A (1), NY1585A (1), NY1540A (1), NY1533B (1) | L5, L3, L3, L3, L3 | 1721 | |
| Threonine and homoserine biosynthesis | Threonine synthase (EC 4.2.3.1) | HC (3), WPP163 (2), WC19161 (1) | L2&L6, L2&L3, L2 | 1290 |
| Homoserine kinase (EC 2.7.1.39) | HC (1), WC19161 (2) | L2, L2&L3 | 1056 | |
| Ribonucleotide reduction | Ribonucleotide reductase of class Ia (aerobic), alpha subunit (EC 1.17.4.1) | HC (4), WPP163 (1), IFB5626 (1), WC19161 (4) | L2&L6&O, L2&L6, L1, L2&L5 | 2804 |
| Purine conversions | 5′-nucleotidase (EC 3.1.3.5); 2′,3'-cyclic-nucleotide 2'-phosphodiesterase (EC 3.1.4.16); putative UDP-sugar hydrolase (EC 3.6.1.45) | NY1584A (1), NY1532B (1), SS90 (2) | L1, L6, L5&L6 | 2291 |
| Pyrimidine conversions | Uridine phosphorylase (EC 2.4.2.3) | IFB5605 (1), IFB5486 (1), PB20 (1), IFB5604 (1), NY1588A (1) | L6, L5, L1, L1, O | 714 |
| Nitrosative stress | Anaerobic nitric oxide reductase flavorubredoxin | HC (1), IFB5597 (4) | L5, L2&L5&O | 1794 |
| Anaerobic nitric oxide reductase transcription regulator NorR | HC (1), IFB5597 (1), WPP163 (1) | L5, O, L3 | 2167 | |
| Denitrifying reductase gene clusters | Respiratory nitrate reductase delta chain (EC 1.7.99.4)/nitrate reductase molybdenum cofactor assembly chaperone | HC (1), IFB5597 (1), IFB5619 (1) | L5, L7, L1 | 720 |
| Respiratory nitrate reductase gamma chain (EC 1.7.99.4)/respiratory nitrate reductase subunit gamma | HC (2), IFB5597 (1), IFB5619 (1) | L1&L5, L7, L1 | 690 | |
| Respiratory nitrate reductase beta chain (EC 1.7.99.4) | HC (3), IFB5597 (2), IFB5619 (1) | L1&O, L7&O, L1 | 1581 | |
| Respiratory nitrate reductase alpha chain (EC 1.7.99.4) | HC (3), IFB5597 (2), NY1584A (1), IFB5619 (1) | L1&L3, L6&O, L1, L1 | 3540 | |
| Nitrate and nitrite ammonification | Cytochrome c-type protein NapC | HC (1), NY1584A (1), IFB5432 (1) | L3, L1, L5 | 603 |
| Ferredoxin-type protein NapF (periplasmic nitrate reductase) | HC (2), IFB5597 (1), NY1584A (1), IFB5432 (1) | L1&L3, L5, L1, L5 | 855 | |
| Nitrate reductase cytochrome c550-type subunit | HC (2), IFB5597 (1), NY1584A (1), IFB5432 (1) | L1&L3, L1, L1, L5 | 438 | |
| Nitrate ABC transporter, permease protein | IFB5626 (1), NY1584A (1), NY1712A (1) | L1, L1, L5 | 853 | |
| Nitrate ABC transporter, ATP-binding protein | IFB5626 (1), NY1584A (1) | L1, L1 | 1248 | |
| Ferredoxin-type protein NapG (periplasmic nitrate reductase) | IFB5597 (1), NY1584A, IFB5432 (1) | L5, L1, L5 | 693 | |
| Polyferredoxin NapH (periplasmic nitrate reductase) | IFB5597 (3), NY1584A (1), IFB5432 (1) | L5&L6&O, L1, L5 | 864 | |
| Ammonia assimilation | Ammonium transporter | IFB5605 (1), IFB5486 (1), NY1588A (1), QK-5 (1) | L3, L3, O, L3 | 438 |
| Citrate metabolism, transport, and regulation | C4-dicarboxylate transporter DcuC (TC 2.A.61.1.1) | IFB5626 (1), IFB5605 (1), IFB5432 (1), SCC3193 (1), PB20 (1), IFB5604 (1), NY1548A (1), NY1585A (1), NY1540A (1), NY1533B (1) | L2, L2, L5, L7, L2, L2, L2, L2, L2, L2 | 1455 |
| [Citrate [pro-3S]-lyase] ligase (EC 6.2.1.22) | IFB5605 (1) | L6 | 1065 | |
| Citrate lyase beta chain (EC 4.1.3.6) | IFB5605 (1) | L6 | 87 | |
| Motility/flagellar component | Flagellar hook-length control protein FliK | HC (1), IFB5605 (1), WPP163 (1) | L6, L6, L3 | 1731 |
| Flagellar biosynthesis protein FliO | NY1584A (1) | L7 | 116 | |
| Aminopeptidases (EC 3.4.11.-) | Membrane alanine aminopeptidase N (EC 3.4.11.2) | HC (6), IFB5597 (4) | L3&L5&L6&L7, L3&L5&L6&L7 | 1851 |
| Metallocarboxypeptidases | D-alanyl-D-alanine carboxypeptidase (EC 3.4.16.4) | IFB5626 (1) | L4 | 1021 |
| T1SS for aggregation | T1SS associated transglutaminase-like cysteine proteinase LapP | NY1532B (1) | L6 | 690 |
| Type IVb Flp/Tad (fimbrial low-molecular-weight protein/tight adherence protein) pilus-encoding gene cluster | Type II/IV secretion system ATP hydrolase TadA/VirB11/CpaF, TadA subfamily | HC (1), IFB5626 (1) | L2, L4 | 1121 |
| Flp pilus assembly protein RcpC/CpaB | IFB5626 (1) | L6 | 238 | |
| Type II/IV secretion system secretin RcpA/CpaC, associated with Flp pilus assembly | NY1584A (1) | L4 | 915 | |
| Flp pilus assembly protein CpaD | IFB5626 (1) | L4 | 275 | |
| Type VI Secretion System (T6SS) | T6SS component TssA (ImpA) | SCC3193 (1), NY1588A (1), NY1587A (1) | O, O, L6 | 982 |
| VgrG protein | SCC3193 (1), NY1588A (1) | L7, L7 | 1487 | |
| T6SS component TssF (ImpG/VasA) | HC (2) | L2&L5 | 784 | |
| T6SS component TssM (IcmF/VasK) | IFB5626 (2), IPO1955 (1), IFB5619 (1), NY1587A (1), IFB5485 (1) | L1&L4, L1, L1, L6, L1 | 3036 | |
| Type VI secretion-related protein VasL | NY1587A (1) | L6 | 815 | |
| T6SS component TssC (ImpC/VipB) | NY1532B (1) | L6 | 278 | |
| T6SS associated component TagF (ImpM) | IFB5626 (1) | L1 | 660 | |
| Acetoin, butanediol metabolism | 2,3-butanediol dehydrogenase, S-alcohol forming, (R)-acetoin-specific (EC 1.1.1.4)/Acetoin (diacetyl) reductase (EC 1.1.1.304) | HC (1), IFB5597 (1) | L3, L6 | 774 |
| Acetolactate synthase, catabolic (EC 2.2.1.6) | HC (2), IFB5597 (2) | L3&L5, L6&O | 396 | |
| Alpha-acetolactate decarboxylase (EC 4.1.1.5) | HC (207), IFB5597 (2) | L2&L5, L2&O | 518 | |
| Transcriptional regulator of alpha-acetolactate operon AlsR | CFIA1002 (1) | L2 | 141 | |
| Iron acquisition | Ferric iron ABC transporter, ATP-binding protein | HC (1), IFB5626 (1), IFB5597 (1) | O, L5, L1 | 610 |
| Ferric iron ABC transporter, permease protein | IFB5486 (1), NY1532B (1) | L5, L6 | 2131 | |
| Siderophore aerobactin | Ferric hydroxamate ABC transporter (TC 3.A.1.14.3), permease component FhuB | IFB5626 (1), IPO1955 (1) | L1, L1 | 1841 |
| Ferric hydroxamate ABC transporter (TC 3.A.1.14.3), ATP-binding protein FhuC | IPO1955 (1) | L1 | 1646 | |
| N(2)-citryl-N(6)-acetyl-N(6)-hydroxylysine synthase (EC 6.3.2.38), aerobactin biosynthesis protein IucA | NY1584 (A), IFB5432 (1) | L4, L5 | 1313 | |
| N6-hydroxylysine O-acetyltransferase (EC 2.3.1.102), aerobactin biosynthesis protein IucB | NY1584 (A), IFB5432 (1) | L4, L5 | 948 | |
| Aerobactin synthase (EC 6.3.2.39), aerobactin biosynthesis protein IucC | IFB5626 (1), NY1584A (1), IFB5432 (1) | L5, L4, L5 | 1734 | |
| L-lysine 6-monooxygenase [NADPH] (EC 1.14.13.59), aerobactin biosynthesis protein IucD | IFB5626 (1), NY1584A (1), IFB5432 (1) | L5, L4, L5 | 1335 | |
| Aerobactin siderophore receptor IutA | IFB5626 (1), IFB5432 (1) | L5, L5 | 2002 | |
| Siderophore enterobactin | Enterobactin exporter EntS | IFB5619 (1) | L1 | 46 |
| Antibiotic biosynthesis | Colicin V production protein | HC (2), WPP163 (4), WC19161 (1), NY1712A (1) | L1&L3, L3&L5&L7, L3, L5 | 1347 |
| CRISPRs | CRISPR-associated helicase Cas3 | NY1584A (1), NY1712A (1) | L4, L5 | 1822 |
a Numbers between parenthesis in column 3 indicates the number of recent recombination events identified in each of the recipient strains for each gene product shown in column 2. For instance, IFB5626 (1) means that strain IFB5626 presented 1 recombination event for that specific gene product.
b L1 to L7 in column 4 “Donor Lineages” points which lineage acted as donor or donated the recombination segment to its respective recipient strain whereas O, refers to recombination events categorized as outside origin or not corresponding to any of the lineages identified in the P. parmentieri population structural analysis. The order of the letter coding (L1 to L7, O) separated by a comma within column 4 in each row for each gene product (column 2) corresponds to the same order of the recipient lineages as listed in each row displayed within column 3. For instance, in row 1 within column 4, “L2, L3” indicates that lineages 2 and 3 acted as donors for recombination events in the strains IFB5626 and IFB5605 (row 1 in column 3), respectively.
c Column 5 displays the largest recombined fragment in base pair (bp) found for that gene product provided in column 2.
d Letter coding (L1 to L7, O) separated by a “&” symbol within the “Donor Lineages” column indicates that 2 or more lineages acted as donors. For instance, “L1&L2” in row 2 within column 4 means that Lineages 1 and 2 acted as donors for the two recombination events detected in strain HC (row 2 in column 3) for the gene product “soluble pyridine nucleotide transhydrogenase”.
Secretion of CWDE is crucial for host colonization and disease development in soft rot bacteria (Toth et al. 2006; Nykyri et al. 2012; Charkowski 2018; Arizala and Arif 2019). In P. parmentieri, 9 genes (pelI, pehA, pelA, pelB, pelC, pelL, pehX, pelX, pemA) were identified as being affected by ancestral recombination events in lineages 1, 2, 5, and 6 (Table 2). Recombination in genes pelA, pelB, and pelC, which encode for pectate lyases, was observed in lineages 2 and 5, while the other genes experienced recombination event in at least one lineage. Previous studies on the activity of CWDE and potato maceration efficacy have revealed significant statistical differences among P. parmentieri strains (Ngadze et al. 2012; Moleleki et al. 2013; Dees et al. 2017; Zoledowska et al. 2018a; Zoledowska et al. 2018b). The evidence of ancestral recombination found in our study in some of the CWDE genes might be related to the reported variation in the ability to macerate potato tissue, as well as the high differences displayed in CWDE activities within the P. parmentieri intraspecies.
Flagellum is another detrimental pathogenicity associated factor specially in soft rot bacteria (Charkowski et al. 2012). Mutations in flagella genes have led to a reduced virulence in Dickeya (Jahn et al. 2008). In P. carotovorum Pcc21, mutants with defective flagella biosynthesis in the genes flgA and fliA produced less than 50% of biofilm in comparison with the wild type (Lee et al. 2013). Our analysis found ancestral recombination occurring in 12 components (FliK, FlgA, FlgB, FlgC, FlgD, FlgE, FlgF, FlgG, FlgI, FliD, FliE, and FliF) of the flagellum biogenesis cluster in the lineage 1, whereas lineages 6, 2, and 3 experienced recombination in less number of genes (6 and 3 genes, respectively) (Table 2). In addition, recent recombination events (Table 3) were also observed within the coding sequences FliK (in strains HC, IFB5605, and WPP163) and FliO (NY1584A). These results indicate that the recombination seen in the flagellum biosynthesis cluster may be connected to the pathogen's ability to adapt to various host conditions, given that the flagellum is a crucial component for ensuring a successful host colonization. In synchronization with the data obtained from the ClonalFrameML analysis, the ancestral and recent recombinations derived from the fastGEAR algorithm also put in evidence that other distinct and highly important pathogenicity determinants, such as genes encoding for components of the T1SS (type I secretion system), T6SS, T1SS for aggregation, type IV pilus biogenesis, the type IVb Flp/Tad cluster, butanediol metabolism, citrate metabolism, siderophore production, and polysaccharide biosynthesis, have undergone to homologous recombination (Tables 2 and 3). These results seem to indicate that homologous recombination has a pivotal implication in the pathogenic lifestyle of P. parmentieri.
Notably, gene regions encoding for antimicrobial products, namely phenazine and colicin, were also affected by ancestral and recent recombination events, respectively (Tables 2 and 3). Phenazines are secondary metabolites produced by bacteria and are associated with multiple roles, namely, biofilm formation, enhanced virulence, host colonization, and as weapon against other competitors like fungi or bacteria (Pierson and Pierson 2010). Colicins, on the other hand, are proteins produced by bacteria to kill other closely related members, giving them an advantage over other bacteria in the race for nutrients (Grinter et al. 2012; Arizala and Arif 2019). P. carotovorum Pcc21, for instance, has been reported to secrete a colicin-like bacteriocin product, Carocin D, which inhibited the growth of other Pectobacterium members (Roh et al. 2010). The fact that we found recombination within genes involved with the synthesis of these 2 antimicrobial compounds highlights the contribution of homologous recombination in the ecological fitness of P. parmentieri.
Moreover, our fastGEAR output showed recombination within 2 CRISPR-Cas adaptative immune systems of P. parmentieri. In the CRISPR-Cas subtype I-F system, ancestral recombination patterns were shown within the genes cse4 in lineage 1 as well as csy2, and csy3 in lineage 2 (Table 2). In the CRISPR-Cas subtype I-E system, recent recombination events were observed within the cas3 gene of the strains NY1584A and NY1712A (Table 3). These findings suggest that homologous recombination seems to have participated in the evolution of the CRISPR-Cas system of P. parmentieri. This is consistent with earlier research that pointed recombination events spanning the genetic exchange of several cas gene regions (Touchon and Rocha 2010; Takeuchi et al. 2012).
Conclusions
In the present study, we assessed the impact of homologous recombination on the core-genome of the newly, but widely spread, described species P. parmentieri. Clear evidence of diverse recombination patterns was found, with data indicating variations in the sizes of DNA recombinant fragments occurring randomly across the core genome of the 32 P. parmentieri isolates. The fastGEAR analysis identified 941 recent recombination events and 486 ancestral recombination events. The ancestral recombination data suggested that strains from lineages 1st to 6th evolved from strains of the 7th lineage. Importantly, several ancient recombinant regions (3,094) were inferred in the reconstructed ClonalFrameML genealogy tree in the main nodes, giving origin to different clades. Genes affected by homologous recombination are associated with vital cell functions such as replication, transcription, metabolism, and homeostasis. Strikingly, recombination was also observed within gene loci categorized as pathogenicity determinants, including type secretion systems, iron acquisition, flagellum biosynthesis, CWDE, citrate and butanediol metabolism, type IV pilus biogenesis, etc. Recombinant fragments were also present in sequences encoding antimicrobial metabolites (phenazine and colicin) and genes of the CRISPR-Cas systems. These findings highlight the significant role of homologous recombination in shaping P. parmentieri genomes, influencing evolution, strain diversity, lifestyle, pathogenicity, adaptative immunity, and ecological fitness of this phytopathogenic bacterium.
Materials and Methods
Isolate Collection
In total, 32 P. parmentieri strains from different geographic origins and isolated in distinct years were used in this study (supplementary table S1, Supplementary Material online). All genomes (17 complete and 15 draft genomes) available in the NCBI GenBank database as of the time of accession (2022 September 14) were retrieved in FASTA (Fast-All) format. The accession numbers used to retrieve the genomic data of the 32 isolates are provided in supplementary table S1, Supplementary Material online.
Core-genome Analysis
All 32 genomes in FASTA format were annotated using the rapid prokaryotic genome annotation tool Prokka (Seemann 2014). The annotated genomes in GFF3 (general feature format) were used as input data to carry out the pan-genome analysis in the Roary pipeline (Page et al. 2015) with a minimum BLASTP percentage identity of 90%. A multi-FASTA alignment of the previously identified core genes was generated using PRANK (Löytynoja 2014). Later, the gaps were removed from the concatenated core genome alignments using trimAl v1.3 (Capella-Gutiérrez et al. 2009) with the “-nogaps” parameter.
Population Genetic Structure, Genealogy Reconstruction, and Inference of Homologous Recombination
The core-genome alignment generated with Roary for the 32 P. parmentieri isolates was used as input file to determine the intraspecies population structure using the hierarchical Bayesian Analysis of Population Structure (hierBAPS; Cheng et al. 2013) implemented in R (RhierBAPS; Tonkin-Hill et al. 2018). Each strain was assigned to its specific lineage genetic cluster (lineage).
Later, the generated core genome served to compute the ML phylogenetic tree. The inferred phylogeny was constructed using the General Time-Reversible GAMMA (GTR-GAMMA) distribution model with 1,000 bootstraps replicates. The analysis was carried out using the RAxML-NG (RAxML Next Generation) tool (Kozlov et al. 2019). The RAxML-NG ML tree was visualized and plotted in FigTree v1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/) and ordered based on increasing order of nodes.
ClonalFrameML (Didelot and Wilson 2015) analysis was deployed to reconstruct the genealogy, infer homologous recombined fragments and polymorphisms sites within the branches and nodes across the core genome of P. parmentieri. The CLonalFrameML phylogeny was inferred using the previous RAxML-NG ML tree as the starting topology tree and the core genome alignment was obtained using ROARY. Recombination parameters in ClonalFrameML were calculated using the hidden Markov model (HMM) based on 100 simulations (emsim = 100) to refine the reliability of the analysis. Moreover, the impact of introduced recombination events relative to single-point mutations (r/m) was calculated for each branch in ClonalFrameML.
Inference of Recent and Ancestral Recombination Events
The origin, number, and location of recent and ancestral recombination events within the P. parmentieri core-genome was predicted using the novel algorithm fastGEAR, which provides information about lineages across microbial alignments and identifies recombination sites between the studied isolates as well as from external origins (Mostowy et al. 2017). A pre-specified partition file containing the number of lineages obtained in RhierBAPS was used in the fastGEAR analysis instead of the BAPS3 clustering algorithm. Once the lineages were identified in the core-alignment, recent recombinations within each lineage were detected by applying a HMM approach to all subsets of strains assigned to that lineage. Ancestral recombinations were analyzed between lineages and using the same HMM approach. Regions inferred as recent recombination were removed prior to the analysis of ancestral recombination. Bayes factor >1 and >10 were used to evaluate the statistical significance of putative recent and ancestral recombination events. Additionally, the order of the 32 P. parmentieri strains shown in the recent and ancestral recombination plots was specified according to the phylogenetic tree generated in RAxML-NG. To extract the coordinates of the recombined regions and determine the annotation of genes that underwent recent recombination, the next 3 steps, modified from Potnis et al. (2019), were followed. First, the recombination sites obtained from fastGEAR were organized into specific BED (Browser Extensible Data) files for each P. parmentieri strain as well as for each node identified by ClonalFrameML (supplementary fig. S1, Supplementary Material online). Second, the recombined fragments were extracted from their respective genomes using the GetFastaBed tool. Finally, the extracted sequences were submitted to a nucleotide Basic Local Alignment Search Tool (BLASTn) analysis against a customized database of its corresponding P. parmentieri annotated core-genome. The core-genomes were annotated using the Rapid Annotation using Subsystem Technology server (RASTtk; Brettin et al. 2015). The custom BLAST analysis was performed in Geneious Prime. The same approach was followed to determine the genes affected by ancestral recombination in each of the lineages as well as to locate the recombinant fragments inferred by ClonalFrameML.
Supplementary Material
Contributor Information
Dario Arizala, Department of Plant and Environmental Protection Sciences, University of Hawaii at Manoa, Honolulu, HI, USA.
Mohammad Arif, Department of Plant and Environmental Protection Sciences, University of Hawaii at Manoa, Honolulu, HI, USA.
Supplementary Material
Supplementary material is available at Genome Biology and Evolution online.
Funding
This work was supported by the U. S. Department of Agriculture-Agricultural Research Service (USDA-ARS) Agreement No. 58-2040-9-011, Systems Approaches to Improve Production and Quality of Specialty Crops Grown in the U.S. Pacific Basin; sub-project: Genome Informed Next Generation Detection Protocols for Pests and Pathogens of Specialty Crops in Hawaii. This work was also supported by U. S. Department of Agriculture-National Institute of Food and Agriculture (USDA-NIFA) Project No. HAW09706-G (Accession No. 1030040). The bioinformatics analyses were supported by the National Institute of General Medical Sciences (NIGMS) of the National Institutes of Health under award number P20GM125508.
Data Availability
All genome data utilized in this study were retrieved from the NCBI GenBank and are publicly available. Details regarding the NCBI GenBank accession numbers and other metadata related to each genome can be found in supplementary table S1, Supplementary Material online.
Literature Cited
- Adeolu M, Alnajar S, Naushad S, Gupta RS. Genome-based phylogeny and taxonomy of the ‘enterobacteriales’: proposal for enterobacterales ord. Nov. divided into the families Enterobacteriaceae, erwiniaceae fam. Nov., pectobacteriaceae fam. Nov., yersiniaceae fam. nov., Hafniaceae fam. nov., Morganellaceae fam. nov., and Budviciaceae fam. nov. Int J Syst Evol Microbiol. 2016:66:5575–5599. 10.1099/ijsem.0.001485. [DOI] [PubMed] [Google Scholar]
- Ansari M, Taghavi SM, Hamzehzarghani H, Valenzuela M, Siri MI, Osdaghi E. Multiple introductions of tomato pathogen Clavibacter michiganensis subsp. michiganensis into Iran as revealed by a global-scale phylogeographic analysis. Appl Environ Microbiol. 2019:85:e02098-19. 10.1128/AEM.02098-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arif M, Czajkowski R, Chapman TA. Editorial: genome-wide analyses of Pectobacterium and Dickeya species. Front Plant Sci. 2022:13:855262. 10.3389/fpls.2022.855262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arizala D, Arif M. Genome-wide analyses revealed remarkable heterogeneity in pathogenicity determinants, antimicrobial compounds, and CRISPR-cas systems of complex phytopathogenic genus Pectobacterium. Pathogens. 2019:8:247. 10.3390/pathogens8040247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arizala D, Dobhal S, Paudel S, Gunarathne S, Boluk G, Arif M. First report of bacterial soft rot and blackleg on potato caused by Pectobacterium parmentieri in Hawaii. Plant Dis. 2019:104:970. 10.1094/PDIS-09-19-1894-PDN. [DOI] [Google Scholar]
- Boluk G, Arizala D, Dobhal S, Zhang J, Hu J, Alvarez AM, Arif M. Genomic and phenotypic biology of novel strains of Dickeya zeae isolated from pineapple and taro in Hawaii: insights into genome plasticity, pathogenicity, and virulence determinants. Front Plant Sci. 2021:12:663851. 10.3389/fpls.2021.663851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brettin T, Davis JJ, Disz T, Edwards RA, Gerdes S, Olsen GJ, Olson R, Overbeek R, Parrello B, Pusch GD, et al. RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep. 2015:5:8365. 10.1038/srep08365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cao YN, Sun QH, Feng ZW, Handique U, Wu J, Li WS, Zhang RF. First report of Pectobacterium parmentieri causing blackleg on potato in Inner Mongolia, China. Plant Dis. 2021:105:2237. 10.1094/PDIS-11-20-2502-PDN. [DOI] [PubMed] [Google Scholar]
- Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. Trimal: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009:25:1972–1973. 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chakraborty S, Snijders AP, Chakravorty R, Ahmed M, Tarek AM, Hossain MA. Comparative network clustering of direct repeats (DRs) and cas genes confirms the possibility of the horizontal transfer of CRISPR locus among bacteria. Mol Phylogenet Evol. 2010:56:878–887. 10.1016/j.ympev.2010.05.020. [DOI] [PubMed] [Google Scholar]
- Charkowski AO. The changing face of bacterial soft-rot diseases. Annu Rev Phytopathol. 2018:56:269–288. 10.1146/annurev-phyto-080417-045906. [DOI] [PubMed] [Google Scholar]
- Charkowski A, Blanco C, Condemine G, Expert D, Franza T, Hayes C, Hugouvieux-Cotte-Pattat N, Solanilla EL, Low D, Moleleki L, et al. The role of secretion systems and small molecules in soft-rot Enterobacteriaceae pathogenicity. Annu Rev Phytopathol. 2012:50:425–449. 10.1146/annurev-phyto-081211-173013. [DOI] [PubMed] [Google Scholar]
- Chen K-M, Chiang M-K, Wang M, Ho H-C, Lu M-C, Lai Y-C. The role of pgaC in Klebsiella pneumoniae virulence and biofilm formation. Microb Pathog. 2014:77:89–99. 10.1016/j.micpath.2014.11.005. [DOI] [PubMed] [Google Scholar]
- Chen NWG, Serres-Giardi L, Ruh M, Briand M, Bonneau S, Darrasse A, Barbe V, Gagnevin L, Koebnik R, Jacques M-A. Horizontal gene transfer plays a major role in the pathological convergence of Xanthomonas lineages on common bean. BMC Genomics. 2018:19:606. 10.1186/s12864-018-4975-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen L, Zou Y, Kronfl AA, Wu Y. Type VI secretion system of Pseudomonas aeruginosa is associated with biofilm formation but not environmental adaptation. MicrobiologyOpen. 2020:9:e991. 10.1002/mbo3.991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng L, Connor TR, Siren J, Aanensen DM, Corander J. Hierarchical and spatially explicit clustering of DNA sequences with BAPS software. Mol Biol Evol. 2013:30:1224–1228. 10.1093/molbev/mst028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coletta-Filho HD, Francisco CS, Lopes JRS, Muller C, Almeida RPP. Homologous recombination and Xylella fastidiosa host–pathogen associations in South America. Phytopathology. 2017:107:305–312. 10.1094/PHYTO-09-16-0321-R. [DOI] [PubMed] [Google Scholar]
- David S, Sánchez-Busó L, Harris SR, Marttinen P, Rusniok C, Buchrieser C, Harrison TG, Parkhill J. Dynamics and impact of homologous recombination on the evolution of Legionella pneumophila. PLoS Genet. 2017:13:e1006855. 10.1371/journal.pgen.1006855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Boer SH, Li X, Ward LJ. Pectobacterium spp. Associated with bacterial stem rot syndrome of potato in Canada. Phytopathology. 2012:102:937–947. 10.1094/PHYTO-04-12-0083-R. [DOI] [PubMed] [Google Scholar]
- Dees MW, Lysøe E, Rossmann S, Perminow J, Brurberg MB. Pectobacterium polaris sp. Nov., isolated from potato (Solanum tuberosum). Int J Syst Evol Microbiol. 2017:67:5222–5229. 10.1099/ijsem.0.002448. [DOI] [PubMed] [Google Scholar]
- Degefu Y. Co-occurrence of latent Dickeya and Pectobacterium species in potato seed tuber samples from northern Finland. Agri Food Sci. 2021:30:1–7. 10.23986/afsci.101446. [DOI] [Google Scholar]
- Didelot X, Walker AS, Peto TE, Crook DW, Wilson DJ. Within-host evolution of bacterial pathogens. Nat Rev Microbiol. 2016:14:150–162. 10.1038/nrmicro.2015.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Didelot X, Wilson DJ. ClonalFrameML: efficient inference of recombination in whole bacterial genomes. PLoS Comput Biol. 2015:11:e1004041. 10.1371/journal.pcbi.1004041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dixit PD, Pang TY, Studier FW, Maslov S. Recombinant transfer in the basic genome of Escherichia coli. Proc. Natl. Acad. Sci. U S A. 2015:112:9070–9075. 10.1073/pnas.1510839112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Everitt RG, Didelot X, Batty EM., Miller R, Knox K, Young BC, Bowden R, Auton A, Votintseva A, Larner-Svensson H, et al. Mobile elements drive recombination hotspots in the core genome of Staphylococcus aureus. Nat Commun. 2014:5:3956. 10.1038/ncomms4956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franza T, Expert D. Role of iron homeostasis in the virulence of phytopathogenic bacteria: an ‘à la carte’ menu. Molecu Plant Pathol. 2013:14:429–438. 10.1111/mpp.12007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ge T, Ekbataniamiri F, Johnson SB, Larkin RP, Hao J. Interaction between Dickeya dianthicola and Pectobacterium parmentieri in potato infection under field conditions. Microorganisms. 2021:9:316. 10.3390/microorganisms9020316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- González-Torres P, Rodríguez-Mateos F, Antón J, Gabaldón T. Impact of homologous recombination on the evolution of prokaryotic core genomes. mBio. 2019:10:e02494-18. 10.1128/mBio.02494-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gorshkov V, Parfirova O, Petrova O, Gogoleva N, Kovtunov E, Vorob’ev V, Gogolev Y. The knockout of enterobactin-related gene in Pectobacterium atrosepticum results in reduced stress resistance and virulence towards the primed plants. IJMS. 2021:22:9594. 10.3390/ijms22179594. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grinter R, Milner J, Walker D. Ferredoxin containing bacteriocins suggest a novel mechanism of iron uptake in Pectobacterium spp. PLoS One. 2012:7:e33033. 10.1371/journal.pone.0033033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jahn CE, Willis DK, Charkowski AO. The flagellar sigma factor FliA is required for Dickeya dadantii virulence. Mol Plant Microbe Interact. 2008:21:1431–1442. 10.1094/MPMI-21-11-1431. [DOI] [PubMed] [Google Scholar]
- Jibrin MO, Potnis N, Timilsina S, Minsavage GV, Vallad GE, Roberts PD, Jones JB, Goss EM. Genomic inference of recombination-mediated evolution in Xanthomonas euvesicatoria and X. perforans. Appl Environ Microbiol. 2018:84:e00136-18. 10.1128/AEM.00136-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kastelein P, Förch MG, Krijger MC, van der Zouwen PS, van den Berg W, van der Wolf JM. Systemic colonization of potato plants resulting from potato haulm inoculation with Dickeya solani or Pectobacterium parmentieri. Can J Plant Pathol. 2021:43:1–15. 10.1080/07060661.2020.1777465. [DOI] [Google Scholar]
- Khayi S, Cigna J, Chong TM, Quêtu-Laurent A, Chan K-G, Hélias V, Faure D. Transfer of the potato plant isolates of Pectobacterium wasabiae to Pectobacterium parmentieri sp. Nov. Int J Syst Evol Microbiol. 2016:66:5379–5383. 10.1099/ijsem.0.001524. [DOI] [PubMed] [Google Scholar]
- Khayi S, Raoul des Essarts Y, Quêtu-Laurent A, Moumni M, Hélias V, Faure D. Genomic overview of the phytopathogen Pectobacterium wasabiae strain RNS 08.42.1A suggests horizontal acquisition of quorum-sensing genes. Genetica. 2015:143:241–252. 10.1007/s10709-014-9793-2. [DOI] [PubMed] [Google Scholar]
- Kim H-S, Ma B, Perna NT, Charkowski AO. Phylogeny and virulence of naturally occurring type III secretion system-deficient Pectobacterium strains. Appl Environ Microbiol. 2009:75:4539–4549. 10.1128/AEM.01336-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirchberger PC, Unterweger D, Provenzano D, Pukatzki S, Boucher Y. Sequential displacement of type VI secretion system effector genes leads to evolution of diverse immunity gene arrays in Vibrio cholerae. Sci Rep. 2017:7:45133. 10.1038/srep45133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koonin EV, Makarova KS. CRISPR-Cas: evolution of an RNA-based adaptive immunity system in prokaryotes. RNA Biol. 2013:10:679–686. 10.4161/rna.24022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozlov AM, Darriba D, Flouri T, Morel B, Stamatakis A. RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics. 2019:35:4453–4455. 10.1093/bioinformatics/btz305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuzminov A. Homologous recombination—experimental systems, analysis, and significance. EcoSal Plus. 2011:4:ecosalplus.7.2.6. 10.1128/ecosalplus.7.2.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee DH, Lim J-A, Lee J, Roh E, Jung K, Choi M, Oh C, Ryu S, Yun J, Heu S. Characterization of genes required for the pathogenicity of Pectobacterium carotovorum subsp. carotovorum Pcc21 in Chinese cabbage. Microbiology. 2013:159:1487–1496. 10.1099/mic.0.067280-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li X, Heyer WD. Homologous recombination in DNA repair and DNA damage tolerance. Cell Res. 2008:18:99–113. 10.1038/cr.2008.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ling J, Pan H, Gao Q, Xiong L, Zhou Y, Zhang D, Gao S, Liu X. Aerobactin synthesis genes iucA and iucC contribute to the pathogenicity of avian pathogenic Escherichia coli O2 strain E058. PLoS One. 2013:8:e57794. 10.1371/journal.pone.0057794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Löytynoja A. Phylogeny-aware alignment with PRANK. In: multiple sequence alignment methods. Russell, DJ, editor. Methods Mol Biol. 2014:1079:155–170. 10.1007/978-1-62703-646-7_10. [DOI] [PubMed] [Google Scholar]
- Makarova KS, Wolf YI, Alkhnbashi OS, Costa F, Shah SA, Saunders SJ, Barrangou R, Brouns SJJ, Charpentier E, Haft DH, et al. An updated evolutionary classification of CRISPR–cas systems. Nat Rev Microbiol. 2015:13:722–736. 10.1038/nrmicro3569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marquez-Villavicencio MP, Weber B, Witherell RA, Willis DK, Charkowski AO. The 3-hydroxy-2-butanone pathway is required for Pectobacterium carotovorum pathogenesis. PLoS One. 2011:6:e22974. 10.1371/journal.pone.0022974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mattinen L, Nissinen R, Riipi T, Kalkkinen N, Pirhonen M. Host-extract induced changes in the secretome of the plant pathogenic bacterium Pectobacterium atrosepticum. Proteomics. 2007:7:3527–3537. 10.1002/pmic.200600759. [DOI] [PubMed] [Google Scholar]
- Moleleki LN, Onkendi EM, Mongae A, Kubheka GC. Characterisation of Pectobacterium wasabiae causing blackleg and soft rot diseases in South Africa. Eur J Plant Pathol. 2013:135:279–288. 10.1007/s10658-012-0084-4. [DOI] [Google Scholar]
- Moleleki LN, Pretorius RG, Tanui CK, Mosina G, Theron J. A quorum sensing-defective mutant of Pectobacterium carotovorum ssp. brasiliense 1692 is attenuated in virulence and unable to occlude xylem tissue of susceptible potato plant stems. Mol Plant Pathol. 2017:18:32–44. 10.1111/mpp.12372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morgado S, Vicente AC. Diversity and distribution of type VI secretion system gene clusters in bacterial plasmids. Sci Rep. 2022:12:8249. 10.1038/s41598-022-12382-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mosbahi K, Wojnowska M, Albalat A, Walker D. Bacterial iron acquisition mediated by outer membrane translocation and cleavage of a host protein. Proc Natl Acad Sci U S A. 2018:115:6840–6845. 10.1073/pnas.1800672115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mostowy R, Croucher NJ, Andam CP, Corander J, Hanage WP, Marttinen P. Efficient inference of recent and ancestral recombination within bacterial populations. Mol Biol Evol. 2017:34:1167–1182. 10.1093/molbev/msx066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ngadze E, Brady CL, Coutinho TA, van der Waals JE. Pectinolytic bacteria associated with potato soft rot and blackleg in South Africa and Zimbabwe. Eur J Plant Pathol. 2012:134:533–549. 10.1007/s10658-012-0036-z. [DOI] [Google Scholar]
- Nykyri J, Mattinen L, Niemi O, Adhikari S, Kõiv V, Somervuo P, Fang X, Auvinen P, Mäe A, Palva ET, et al. Role and regulation of the flp/tad pilus in the virulence of Pectobacterium atrosepticum SCRI1043 and Pectobacterium wasabiae SCC3193. PLoS One. 2013:8:e73718. 10.1371/journal.pone.0073718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nykyri J, Niemi O, Koskinen P, Nokso-Koivisto J, Pasanen M, Broberg M, Plyusnin I, Törönen P, Holm L, Pirhonen M, et al. Revised phylogeny and novel horizontally acquired virulence determinants of the model soft rot phytopathogen Pectobacterium wasabiae SCC3193. PLoS Pathog. 2012:8:e1003013. 10.1371/journal.ppat.1003013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ossowska K, Motyka-Pomagruk A, Kaczyńska N, Kowalczyk A, Sledz W, Lojkowska E, Kaczyński Z. Heterogenicity within the LPS structure in relation to the chosen genomic and physiological features of the plant pathogen Pectobacterium parmentieri. IJMS. 2022:23:2077. 10.3390/ijms23042077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, Fookes M, Falush D, Keane JA, Parkhill J. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015:31:3691–3693. 10.1093/bioinformatics/btv421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Palmer LD, Skaar EP. Transition metals and virulence in bacteria. Annu Rev Genet. 2016:50:67–91. 10.1146/annurev-genet-120215-035146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pasanen M, Laurila J, Brader G, Palva ET, Ahola V, van der Wolf J, Hannukkala A, Pirhonen M. Characterisation of Pectobacterium wasabiae and Pectobacterium carotovorum subsp. carotovorum isolates from diseased potato plants in Finland. Ann Appl Biol. 2013:163:403–419. 10.1111/aab.12076. [DOI] [Google Scholar]
- Pierson LS, Pierson EA. Metabolism and function of phenazines in bacteria: impacts on the behavior of bacteria in the environment and biotechnological processes. Appl Microbiol Biotechnol. 2010:86:1659–1670. 10.1007/s00253-010-2509-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pirhonen M, Saarilahti HT, Karlsson M-B, Palva ET. Identification of pathogenicity determinants of Erwinia carotovora subsp. carotovora by transposon mutagenesis. Mol Plant Microbe Interact. 1991:4(3):276–283. 10.1094/MPMI-4-276. [DOI] [Google Scholar]
- Pitman AR, Harrow SA, Visnovsky SB. Genetic characterisation of Pectobacterium wasabiae causing soft rot disease of potato in New Zealand. Eur J Plant Pathol. 2010:126:423–435. 10.1007/s10658-009-9551-y. [DOI] [Google Scholar]
- Potnis N, Kandel PP, Merfa MV, Retchless AC, Parker JK, Stenger DC, Almeida RPP, Bergsma-Vlami M, Westenberg M, Cobine PA, et al. Patterns of inter- and intrasubspecific homologous recombination inform eco-evolutionary dynamics of Xylella fastidiosa. ISME J. 2019:13:2319–2333. 10.1038/s41396-019-0423-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prokchorchik M, Pandey A, Moon H, Kim W, Jeon H, Jung G, Jayaraman J, Poole S, Segonzac C, Sohn KH, et al. Host adaptation and microbial competition drive Ralstonia solanacearum phylotype I evolution in the Republic of Korea. Microbial Genomics. 2020:6:mgen000461. 10.1099/mgen.0.000461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pukatzki S, Ma AT, Sturtevant D, Krastins B, Sarracino D, Nelson WC, Heidelberg JF, Mekalanos JJ. Identification of a conserved bacterial protein secretion system in Vibrio cholerae using the dictyostelium host model system. Proc Natl Acad Sci U S A. 2006:103:1528–1533. 10.1073/pnas.0510322103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rai AK, Mitchell AM. Enterobacterial common antigen: synthesis and function of an enigmatic molecule. mBio. 2020:11:e01914-20. 10.1128/mBio.01914-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roh E, Park T-H, Kim M, Lee S, Ryu S, Oh CS, Rhee S, Kim D-H, Park B-S, Heu S. Characterization of a new bacteriocin, carocin d, from Pectobacterium carotovorum subsp. carotovorum Pcc21. Appl Environ Microbiol. 2010:76:7541–7549. 10.1128/AEM.03103-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sarris PF, Skandalis N, Kokkinidis M, Panopoulos NJ. In silico analysis reveals multiple putative type VI secretion systems and effector proteins in Pseudomonas syringae pathovars. Mol Plant Pathol. 2010:11:795–804. 10.1111/j.1364-3703.2010.00644.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014:30:2068–2069. 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- Shyntum DY, Nkomo NP, Shingange NL, Gricia AR, Bellieny-Rabelo D, Moleleki LN. The impact of type VI secretion system, bacteriocins and antibiotics on bacterial competition of Pectobacterium carotovorum subsp. brasiliense and the regulation of carbapenem biosynthesis by iron and the ferric-uptake regulator. Front Microbiol. 2019:10:2379. 10.3389/fmicb.2019.02379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suárez MB, Feria FJ, Martín-Robles MJ, del Rey FJ, Palomo JL. Pectobacterium parmentieri causing soft rot on potato tubers in Southern Europe. Plant Disease. 2017:101:1029–1029. 10.1094/PDIS-01-17-0013-PDN. [DOI] [Google Scholar]
- Syeda AH, Hawkins M, McGlynn P. Recombination and replication. Cold Spring Harb Perspect Biol. 2014:6(11):a016550. 10.1101/cshperspect.a016550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takeuchi N, Wolf YI, Makarova KS, Koonin EV. Nature and intensity of selection pressure on CRISPR-associated genes. J Bacteriol. 2012:194:1216–1225. 10.1128/JB.06521-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thapa SP, Pattathil S, Hahn MG, Jacques M-A, Gilbertson RL, Coaker G. Genomic analysis of Clavibacter michiganensis reveals insight into virulence strategies and genetic diversity of a gram-positive bacterial pathogen. Mol Plant Microbe Interact. 2017:30:786–802. 10.1094/MPMI-06-17-0146-R. [DOI] [PubMed] [Google Scholar]
- Tonkin-Hill G, Lees JA, Bentley SD, Frost SDW, Corander J. RhierBAPS: an R implementation of the population clustering algorithm hierBAPS. Wellcome Open Res. 2018:3:93. 10.12688/wellcomeopenres.14694.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Toth IK, Pritchard L, Birch PRJ. Comparative genomics reveals what makes an enterobacterial plant pathogen. Annu Rev Phytopathol. 2006:44:305–336. 10.1146/annurev.phyto.44.070505.143444. [DOI] [PubMed] [Google Scholar]
- Touchon M, Rocha EPC. The small, slow and specialized CRISPR and anti-CRISPR of Escherichia and Salmonella. PLoS One. 2010:5:e11126. 10.1371/journal.pone.0011126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tyson GW, Banfield JF. Rapidly evolving CRISPRs implicated in acquired resistance of microorganisms to viruses. Environ Microbiol. 2008:10:200–207. 10.1111/j.1462-2920.2007.01444.x. [DOI] [PubMed] [Google Scholar]
- Urbany C, Neuhaus HE. Citrate uptake into Pectobacterium atrosepticum is critical for bacterial virulence. Mol Plant Microbe Interact. 2008:21:547–554. 10.1094/MPMI-21-5-0547. [DOI] [PubMed] [Google Scholar]
- Vanhove M, Retchless AC, Sicard A, Rieux A, Coletta-Filho HD, De La Fuente L, Stenger DC, Almeida RPP. Genomic diversity and recombination among Xylella fastidiosa subspecies. Appl Environ Microbiol. 2019:85:e02972-18. 10.1128/AEM.02972-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wright WD, Shah SS, Heyer WD. Homologous recombination and the repair of DNA double-strand breaks. J Biol Chem. 2018:293(27):10524–10535. 10.1074/jbc.TM118.000372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zoledowska S, Motyka-Pomagruk A, Sledz W, Mengoni A, Lojkowska E. High genomic variability in the plant pathogenic bacterium Pectobacterium parmentieri deciphered from de novo assembled complete genomes. BMC Genomics. 2018a:19:751. 10.1186/s12864-018-5140-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zoledowska S, Motyka A, Zukowska D, Sledz W, Lojkowska E. Population structure and biodiversity of Pectobacterium parmentieri isolated from potato fields in temperate climate. Plant Dis. 2018b:102:154–164. 10.1094/PDIS-05-17-0761-RE. [DOI] [PubMed] [Google Scholar]
- Zoledowska S, Presta L, Fondi M, Decorosi F, Giovannetti L, Mengoni A, Lojkowska E. Metabolic modeling of Pectobacterium parmentieri SCC3193 provides insights into metabolic pathways of plant pathogenic bacteria. Microorganisms. 2019:7:101. 10.3390/microorganisms7040101. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All genome data utilized in this study were retrieved from the NCBI GenBank and are publicly available. Details regarding the NCBI GenBank accession numbers and other metadata related to each genome can be found in supplementary table S1, Supplementary Material online.




