Summary
The diversification of lineages within Pseudomonas syringae has involved a number of adaptive shifts from herbaceous hosts onto various species of tree, resulting in the emergence of highly destructive diseases such as bacterial canker of kiwi and bleeding canker of horse chestnut. This diversification has involved a high level of gene gain and loss, and these processes are likely to play major roles in the adaptation of individual lineages onto their host plants. In order to better understand the evolution of P. syringae onto woody plants, we have generated de novo genome sequences for 26 strains from the P. syringae species complex that are pathogenic on a range of woody species, and have looked for statistically significant associations between gene presence and host type (i.e. woody or herbaceous) across a phylogeny of 64 strains. We have found evidence for a common set of genes associated with strains that are able to colonize woody plants, suggesting that divergent lineages have acquired similarities in genome composition that may form the genetic basis of their adaptation to woody hosts. We also describe in detail the gain, loss and rearrangement of specific loci that may be functionally important in facilitating this adaptive shift. Overall, our analyses allow for a greater understanding of how gene gain and loss may contribute to adaptation in P. syringae.
Keywords: adaptation; genome fluctuation, Pseudomonas syringae; woody hosts
Introduction
Lineages from the Pseudomonas syringae species complex are the causal agents of a variety of blight, speck, spot and canker diseases on a range of economically and environmentally important plant species (Hirano and Upper, 1990; Mansfield et al., 2012; O'Brien et al., 2011). The P. syringae species complex is divided into more than 50 pathological variants (pathovars), named for their ability to infect different plant species, which are distributed across at least seven distinct phylogenetic groups (phylogroups, PGs) based on sequence divergence of housekeeping genes (e.g. Berge et al., 2014; Hwang et al., 2005; Sarkar and Guttman, 2004). Recently, a number of pathovars have been responsible for the emergence of highly damaging new diseases of woody species, including European horse chestnut (Webber et al., 2008), kiwifruit (Balestra et al., 2010), olive (Rodríguez‐Moreno et al., 2009) and hazelnut (Scortichini et al., 2002). These epidemics have prompted a number of investigations into the genetic basis of the adaptation of P. syringae onto woody hosts, and the evolutionary processes that have enabled this adaptation (e.g. Green et al., 2010; Marcelletti et al., 2011; O'Brien et al., 2012; Rodríguez‐Palenzuela et al., 2010).
Genome fluctuation, defined as the gain and loss of genes through time, is an extensive evolutionary force in P. syringae, and previous studies have revealed the breadth and depth of the potential gene pool available via horizontal gene transfer (HGT) (e.g. Baltrus et al., 2011; Nowell et al., 2014; O'Brien et al., 2012). Both gene gain and loss have been implicated as important adaptive mechanisms in P. syringae evolution, with much focus on the repertoire dynamics of effector genes of the type III secretion system (T3SS) (e.g. Lindeberg et al., 2006; Ma et al., 2006; Pitman et al., 2005). The magnitude of genome fluctuation is remarkable—individual lineages may be exposed to hundreds, perhaps even thousands, of new genes within the same time frame as 1% divergence accrues among protein sequences of the core genome (Nowell et al., 2014). In addition, it is now known that genetically diverse populations of P. syringae thrive in a multitude of environmental (i.e. non‐plant) habitats, including leaf litter, river headwaters and snow‐pack (Monteil et al., 2012, 2013, 2014; Morris et al., 2009). Given this naturally occurring reservoir of genetic diversity, Monteil et al. (2013) have recently suggested an epidemic population structure for P. syringae, whereby clonal expansions of highly virulent lineages emerge from a frequently recombining and genetically diverse background population. Taken together, these findings suggest that the flexible genomes of phytopathogenic P. syringae lineages are adapted to be selectively advantageous when expressed in a particular niche—that of a compatible host species—and implicate HGT and gene loss as key evolutionary mechanisms that facilitate adaptation.
Here, in the light of the recent disease epidemics produced by canker‐causing pathovars, we test this hypothesis by investigating the genomic basis of P. syringae adaptation into an environment that has been colonized multiple times during its evolutionary history—specifically, the woody organs of a range of host species. We augment the current genomic resource for P. syringae with draft genomes of 26 strains (16 pathovars) that are pathogenic on a range of woody species, and delimit the P. syringae pan‐genome into its constituent core (genes that are shared in all taxa) and flexible (genes that occur variably) genome components. We employ these data to investigate the adaptation of P. syringae onto woody hosts using three different approaches.
First, we look for statistically significant correlations between flexible genes and host type among a total of 64 strains for which high‐quality, whole‐genome sequence data are available, using a method that is able to account for phylogenetic relatedness among strains. Second, we elucidate the distribution of a range of both secreted and non‐secreted virulence factors that are known to be important in P. syringae pathogenesis. Lastly, we reconstruct the evolutionary history of gene gain along the phylogenetic lineage leading to pathovar (pv.) aesculi, the causal agent of horse chestnut bleeding canker in the European horse chestnut (Aesculus hippocastanum), and assess the putative functions of acquired genes in relation to their potential role in pathogenesis.
Results
Genome sequencing and assembly
We selected 26 strains of P. syringae (16 pathovars) that are pathogens of a wide range of woody plants for whole‐genome sequencing using Illumina MiSeq technology (Table 1). The resultant draft assemblies ranged in span from 5.62 to 6.47 Mb, with a median of 6.19 Mb (Table S1, see Supporting Information). Assembly N50, defined as the length of the contig at which 50% of the genome is covered by a contig of equivalent length or longer, ranged from 41.8 to 246.4 kb (median of 66.3 kb), and all genomes were assembled into fewer than 400 contigs. Overall, data retention during assembly was high in all cases, with ≥97% of filtered reads aligning to the final assembly for each strain. Gene repertoire ‘completeness’ was also high, with only one core protein (from a total of 40; Simão et al., 2015) absent from each assembly.
Table 1.
Pathovar | Strain | Identifier* | Host | Year † | Contigs | CDS ‡ | Trait § | Reference |
---|---|---|---|---|---|---|---|---|
actinidiae | MAFF 302091 | actn302091 | Actinidia deliciosa (kiwifruit) | 1984 | 941 | 5169 | W | Baltrus et al. ( 2011) |
actinidiae | NCPPB 3739 | actn3739 | Actinidia deliciosa (kiwifruit) | 1984 | 815 | 5283 | W | Marcelletti et al. ( 2011) |
actinidiae | NCPPB 3871 | actn3871 | Actinidia deliciosa (kiwifruit) | 1992 | 466 | 5267 | W | Marcelletti et al. ( 2011) |
actinidiae | CRAFRU8.43 | actn843 | Actinidia deliciosa (kiwifruit) | 2008 | 585 | 5513 | W | Marcelletti et al. ( 2011) |
aesculi | NRS 2113 | aesc2113 | Aesculus hippocastanum (European horse chestnut) | 2006 | 330 | 5644 | W | This study |
aesculi | NRS 2250 | aesc2250 | Aesculus hippocastanum (European horse chestnut) | 2008 | 776 | 5324 | W | Green et al. ( 2010) |
aesculi | NRS 2279 | aesc2279 | Aesculus hippocastanum (European horse chestnut) | 2002 | 322 | 5688 | W | This study |
aesculi | NRS 2306 | aesc2306 | Aesculus hippocastanum (European horse chestnut) | 2010 | 291 | 5734 | W | This study |
aesculi | NRS 2315 | aesc2315 | Aesculus hippocastanum (European horse chestnut) | 2006 | 289 | 5623 | W | This study |
aesculi | NRS 2329 | aesc2329 | Aesculus hippocastanum (European horse chestnut) | 2011 | 319 | 5797 | W | This study |
aesculi | NRS 2336 | aesc2336 | Aesculus hippocastanum (European horse chestnut) | 2010 | 288 | 5717 | W | This study |
aesculi | NRS 3681 | aesc3681 | Aesculus indica (Indian horse chestnut) | 1979 | 841 | 5293 | W | Green et al. ( 2010) |
alisalensis ¶ | ES4326 | Pcan4326 | Raphanus sativus (radish) | 1965 | 878 | 5475 | H | Baltrus et al. ( 2011) |
aptata | DSM 50252 | apta50252 | Beta vulgaris (sugar beet) | 1948 | 3776 | 5265 | H | Baltrus et al. ( 2011) |
atrofaciens | DSM 50255 | atro50255 | Triticum aestivum (wheat) | 1974 | 669 | 5040 | H | Baltrus et al. ( 2014a) |
atrofaciens | LMG 5095 | atro5095 | Triticum aestivum (wheat) | 1974 | 1007 | 5160 | H | Y.‐H. Noh and J.‐S. Cha (unpublished data) |
avellanae | ISPaVe037 | avel037 | Corylus avellana (hazel) | 1992 | 317 | 5321 | W | O'Brien et al. ( 2012) |
avellanae | ISPaVe013 | avel013 | Corylus avellana (hazel) | 1992 | 191 | 5172 | W | O'Brien et al. ( 2012) |
avellanae | BPIC631 | avel631 | Corylus avellana (hazel) | 1976 | 1602 | 5228 | W | O'Brien et al. ( 2012) |
avellanae | CRAFRUec1 | avelec1 | Corylus avellana (hazel) | 2003 | 547 | 5160 | W | Scortichini et al. ( 2013) |
avii | CFBP 3846 | avii3846 | Prunus avium (cherry) | 1991 | 324 | 5680 | W | This study |
— | BRIP 34876 | BRIP34876 | Hordeum vulgare (barley) | 1971 | 148 | 5119 | H | Gardiner et al. ( 2013) |
— | BRIP 34881 | BRIP34881 | Hordeum vulgare (barley) | 1971 | 157 | 5136 | H | Gardiner et al. ( 2013) |
— | BRIP 39023 | BRIP39023 | Hordeum vulgare (barley) | 1988 | 34 | 5123 | H | Gardiner et al. ( 2013) |
broussonetiae | CFBP 5140 | brou5140 | Broussonetia kazinoki (paper mulberry) | 1980 | 359 | 5784 | W | This study |
castaneae | CFBP 4217 | cast4217 | Castanea crenata (Japanese chestnut) | 1977 | 220 | 5710 | W | This study |
cerasicola | CFBP 6109 | cera6109 | Prunus yedoensis (Yoshino cherry) | 1995 | 353 | 5415 | W | This study |
— | Cit7 | cit7 | Citrus sinensis (navel orange) | 2008 | 2655 | 5321 | H | Baltrus et al. ( 2011) |
daphniphylli | CFBP 4219 | daph4219 | Daphniphyllum teijsmanni | 1981 | 370 | 5697 | W | This study |
dendropanacis | CFBP 3226 | dend3226 | Dendropanax trifidus (ivy tree) | 1979 | 219 | 5334 | W | This study |
eriobotryae | CFBP 2343 | erio2343 | Eriobotrya japonica (loquat tree) | 1970 | 129 | 5733 | W | This study |
fraxini | CFBP 5062 | frax5062 | Fraxinus excelsior (ash tree) | 1978 | 331 | 5723 | W | This study |
glycinea | B076 | glycB076 | Glycine max (soybean) | 2007 | 104 | 5613 | H | Qi et al. ( 2011) |
glycinea | race 4 | glycR4 | Glycine max (soybean) | 1977 | 108 | 5314 | H | Qi et al. ( 2011) |
japonica | MAFF 301072 | japo301072 | Hordeum vulgare (barley) | 1951 | 4,661 | 5562 | H | Baltrus et al. ( 2011) |
lachrymans | MAFF 301315 | lach301315 | Cucumis sativus (cucumber) | 1975 | 791 | 6275 | H | Baltrus et al. ( 2011) |
lachrymans | MAFF 302278 | lach302278 | Cucumis sativus (cucumber) | 1935 | 798 | 5265 | H | Baltrus et al. ( 2011) |
morsprunorum | NRS 2341 | mors2341 | Prunus cerasus (wild cherry) | 1988 | 173 | 5692 | W | This study |
morsprunorum | MAFF 302280 | mors302280 | Prunus domesticus (European plum) | 1977 | 969 | 5338 | H** | Baltrus et al. ( 2011) |
morsprunorum | HRI‐W 5261 | mors5261 | Prunus avium (sweet cherry cv. Roundel) | 1990 | 264 | 5887 | W | This study |
morsprunorum | HRI‐W 5269 | mors5269 | Prunus cerasus (sour cherry) | 1990 | 158 | 5580 | W | This study |
myricae | CFBP 2897 | myri2897 | Myrica rubra (Chinese bayberry) | 1978 | 204 | 5421 | W | This study |
nerii | CFBP 5067 | neri5067 | Nerium oleander (oleander) | 1979 | 242 | 5249 | W | This study |
panici | LMG 2367 | pani2367 | Panicium miliaceum (proso millet) | 1963 | 148 | 5154 | H | Liu et al. ( 2012) |
papulans | CFBP 1754 | papu1754 | Malus sylvestris (crab apple) | 1973 | 174 | 5705 | W | This study |
phasiolicola | 1448A | phas1448A | Phaseolus vulgaris (common bean) | 1985 | 3 | 5172 | H | Joardar et al. ( 2005) |
pisi | PP1 | pisiPP1 | Pisum sativum (pea) | 1978 | 256 | 5157 | H | Baltrus et al. ( 2014b) |
rhaphiolepidis | CFBP 4220 | rhap4220 | Rhaphiolepis umbellata (yeddo hawthorn) | 1980 | 292 | 5159 | W | This study |
savastanoi | NCPPB 3335 | sava3335 | Olea europaea (olive tree) | 1984 | 403 | 5194 | W | Rodríguez‐Palenzuela et al. ( 2010) |
syringae | 1212 | syri1212 | Pisum sativum (pea) | — | 338 | 5324 | H | Baltrus et al. ( 2014b) |
syringae | NRS 2339 | syri2339 | Prunus avium (sweet cherry) | 1984 | 69 | 5246 | W | This study |
syringae | NRS 2340 | syri2340 | Pyrus sp. (pear) | 1985 | 98 | 5354 | W | This study |
syringae | 642 | syri642 | Not stated | 2007 | 296 | 5100 | H | Clarke et al. ( 2010) |
syringae | HRI‐W 7872 | syri7872 | Prunus domestica (plum cv. Opal) | 2000 | 105 | 5058 | W | This study |
syringae | HRI‐W 7924 | syri7924 | Prunus cerasus (sour cherry) | 2000 | 130 | 5478 | W | This study |
syringae | B301D‐R | syriB301 | Pyrus communis (pear flower) | 1969 | 81 | 5168 | H | Dudnik and Dudler ( 2014) |
syringae | B728a | syriB728a | Phaseolus vulgaris (common bean) | 1987 | 1 | 5089 | H | Feil et al. ( 2005) |
tabaci | ATCC 11528 | taba11528 | Nicotiana tabacum (tobacco) | 1905 | 1405 | 5432 | H | Studholme et al. ( 2009) |
tabaci | 6605 | taba6605 | Nicotiana tabacum (tobacco) | 1967 | 284 | 5441 | H | D. J. Studholme et al. (unpublished data) |
theae | ICMP 3923 | thea3923 | Camellia sinensis (tea plant) | 1974 | 378 | 5633 | W | Mazzaglia et al. ( 2012) |
tomato | NCPPB 1108 | toma1108 | Solanum lycopersicum (tomato) | 1961 | 304 | 5467 | H | Cai et al. ( 2011) |
tomato | DC3000 | tomaDC3000 | Solanum lycopersicum (tomato) | 1960 | 3 | 5619 | H | Buell et al. ( 2003) |
tomato | T1 | tomaT1 | Solanum lycopersicum (tomato) | 1986 | 122 | 5583 | H | Almeida et al. ( 2009) |
ulmi | CFBP 1407 | ulmi1407 | Ulmus sp. (elm) | 1958 | 323 | 5933 | W | This study |
*Unique identifier used in this study.
†Year of original isolation (if known).
‡Number of coding sequences (CDS) as annotated by Rapid Annotation using Subsystem Technology (RAST).
§Trait designation based on host type: H, herbaceous host; W, woody host (see Experimental Procedures).
¶Originally identified as P. syringae pv. maculicola, this strain has been reclassified recently as Pseudomonas cannabina pv. alisalensis (Bull et al., 2010).
These data were combined with 38 publicly available genome sequences from across the P. syringae species complex. Reannotation of these 64 strains produced a total of 348 022 protein‐coding genes, the products of which were then clustered into 11 200 initial groups by OrthoMCL. After applying the correction procedures outlined in Nowell et al. (2014), the size of the core genome was estimated at 2677 genes, or ∼48% of the total number of genes in an average P. syringae genome. The pan‐genome was estimated at 13 010 genes (Fig. S1, see Supporting Information).
Phylogenetics
The core genome phylogeny was reconstructed from the 1.15 Mb concatenated nucleotide alignment of 2086 one‐to‐one orthologous genes using maximum likelihood (Fig. 1). This shows the well‐supported partitioning of these strains into three clusters, corresponding to PGs 1, 2 and 3, as defined by Sarkar and Guttman (2004). Strains inferred to be pathogens of woody hosts, indicated in green on the phylogeny, fall within each of the three main PGs and are not monophyletic within any PG. The majority of woody host strains (∼75%) cluster within two clades. The largest is in PG3, and contains all of the PG3 woody host strains with the exception of pv. broussonetiae; this is designated as the ‘aesculi’ clade. The other is found in PG1 and is designated as the ‘actinidiae’ clade.
Correlated evolution between gene presence and woody hosts
We used the program BayesTraits (Pagel, 1994) to look for statistically significant correlations between gene presence and the ability to colonize the woody parts of a host plant (the ‘woody niche’) by way of a likelihood ratio (LR) test. The shape of the observed LR distribution suggests an excess of genes with an LR value greater than the threshold indicated by the null (Fig. S2, see Supporting Information). The numbers of genes exceeding each threshold are shown in Table 2, together with the expected number of Type I (false‐positive) errors under the null model. Of the 3883 tested sites of the flexible genome, 899 have an LR value that exceeds the P ≤ 0.05 threshold. The expected number of false positives is 194, implying that there are about 700 genes (i.e. ∼18% of tested genes or ∼7% of all flexible genes) showing a significant association with strains that colonize the woody parts of their host.
Table 2.
P value | LR value | Number of genes | Proportion (%) | ||
---|---|---|---|---|---|
Expected* | Observed | Tested † | Flexible ‡ | ||
0.05 | 6.78 | 194 | 899 | 18.15 | 6.82 |
0.01 | 9.50 | 39 | 296 | 6.62 | 2.49 |
0.001 | 13.02 | 4 | 59 | 1.42 | 0.53 |
0.0001 | 16.50 | <1 | 20 | 0.51 | 0.19 |
0.00001 | 20.89 | ≪1 | 3 | 0.08 | 0.03 |
*Expected number of Type I (false‐positive) errors under the null model.
†Proportion of the 3883 tested genes.
‡Proportion of the total flexible genome (10 333 genes).
To gain a better understanding of the nature of this association, we plotted the patterns of occurrence of the 59 genes associated with the woody niche at P ≤ 0.001 (Fig. 2). Most of these genes (47 of 59) are not found exclusively in woody host strains, but are present in multiple transitions from herbaceous to woody hosts in the phylogeny. On average, woody host strains possess 33 of the 59 genes (56%), compared with about 18 (30%) in non‐woody strains.
The putative functions of these genes were ascertained using evidence from gene orthology. Twenty genes (∼34%), including five of the top 10, were either annotated as hypothetical proteins or returned no matches. A further 10 genes (∼17%) were described as having functions related to either transposition or conjugal transfer. The putative functions for the remaining 29 genes are shown in Table S2 (see Supporting Information). Two proteins show sequence identity to known type III secretion effector proteins (HopAY1 and HopAO1), whereas six proteins are involved in the uptake, transport or utilization of urea. In addition, 4‐oxalocrotonate tautomerase (gene #23) and muconate cycloisomerase (gene #26) both have roles in the degradation of a number of aromatic compounds, including benzene, toluene and xylene, which are constituents of extracts from wood, such as pine tar.
Physical linkage among these 59 genes was also assessed, using the myri2897 genome as a reference, as this strain encoded the most ‘woody niche’ genes. Of the 56 genes present in myri2897, 32 (∼57%) hit to different contigs, and the only operon of note included five of the six genes involved in urea metabolism. Querying these genes against a database of putatively plasmid‐derived contigs (Table S3, see Supporting Information) suggests that at least 22 genes (37%) are likely to be encoded on contigs with identity to known plasmids.
Distribution of T3SS effectors (T3SEs) and virulence genes across the P. syringae complex
We also elucidated the distribution of specific genes with known functions in P. syringae pathogenicity, including T3SEs and other virulence factors. The occurrence profile for 88 T3SE subfamilies is given in Fig. 3. Overall, T3SE occurrence is highly variable and does not correspond to the phylogeny of these strains. It should be noted that strain syri642 is known to lack the canonical T3SS apparatus (Clarke et al., 2010).
Discounting syri642, repertoire size ranged from 10 (atro5095, japo301072 and pani2367) to 41 (tomaDC3000). In agreement with previous analyses (e.g. Baltrus et al., 2011; Bartoli et al., 2015), strains within PG2 have many fewer T3SEs than the other two PGs (13 on average, compared with 35 and 29 for PG1 and PG3, respectively). A total of seven T3SEs (AvrPto3, HopBE1, HopBI1, HopBH1, HopH3, HopZ5 and PthG) was encoded exclusively by woody host strains in this analysis, although both HopBH1 and HopBI1 are found in the more diverged (PG4) rice pathogen pv. oryzae str. 1_6 (Mucyn et al., 2014). The average number of effectors encoded by woody host strains is 29, compared with 20 encoded by non‐woody host strains, although the phylogenetic non‐independence of these data makes the significance of this difference difficult to ascertain.
A 488‐residue protein with 92% amino acid identity to an effector encoded by the gall‐forming plant pathogen Pantoea agglomerans pv. gypsophilae, denoted PthG (Ezra et al., 2004), was found exclusively in the PG2 strains syri2339, syri2340, syri7924 and papu1754, and has no identity to any T3SEs already described for P. syringae. It should be noted that the ability of this putative novel effector to be translocated (i.e. injected into a host cell via the T3SS) is not known.
We also characterized the pattern of occurrence for a number of other virulence factors (Fig. 4). In agreement with previous studies (e.g. Baltrus et al., 2011; Hwang et al., 2005), patterns of occurrence are simpler than those shown by T3SEs and largely correspond to phylogeny. The β‐ketoadipate and protocatechuate‐4,5‐deoxygenase operons have been suggested previously to be potentially important adaptations of P. syringae to the woody niche (e.g. Bartoli et al., 2015; Green et al., 2010); thus, we focus on the distribution of these genes here. In agreement with Bartoli et al. (2015), the β‐ketoadipate operon is restricted to strains within PG1 and PG3. Expanding on their result, we show that this operon is present in the monophyletic ‘aesculi’ clade in PG3, and delimits host type (woody versus non‐woody) within PG3, with the exception of pv. broussonetiae. The operon is also present in pathovars actinidiae, theae and morsprunorum within the PG1 ‘actinidiae’ clade, but is not found in the closely related hazelnut pathogens from the pathovar avellanae (strains avel631 and avelec1). In contrast, the protocatechuate‐4,5‐deoxygenase pathway was found to be unique to pv. aesculi.
Genomic adaptations to the woody niche along the aesculi lineage
In order to gain a clearer understanding of the evolution of P. syringae into the woody niche, we investigated the history of gene gain along the phylogenetic lineage leading to pv. aesculi (Fig. 5; see also Dataset S1 in Supporting Information). This reveals a number of potentially important adaptations to the woody niche, outlined below.
Our reconstruction shows the gain of a gene encoding a 278‐amino‐acid protein annotated as a putative xylose isomerase, involved in the utilization of the wood‐derived sugar d‐xylose, at the root of all PG3 pathovars. Mapping of this gene to the aesc2336 assembly showed it to be independent of the alternative xylose degradation operon (xylRAFGH) which is ubiquitous across the P. syringae species complex. This operon also contains a xylose isomerase gene, that we denote xylA 1, but these two genes are not similar—the PG3 xylose isomerase (denoted xylA 2) is 160 codons shorter than xylA 1, and alignment of the two reveals very low amino acid identity (∼15%). The xylA 2 gene is present in all PG3 strains, but also in the relatively distantly related pathovars actinidiae and theae in PG1.
Phylogenetic analysis of xylA 2 revealed that, although PG1 and PG3 homologues were clearly partitioned, the level of divergence across all sites (p distance) was much reduced relative to that of xylA 1 (0.07 versus 0.25). Further investigation revealed this difference to be primarily driven by divergence at synonymous sites (K s), with values of 0.63 and 0.12 for xylA 1 and xylA 2, respectively (Table S4, see Supporting Information). In addition, two further genes with putative functions in the transport of d‐xylose across the cell membrane were inferred to have been acquired at the root of the ‘aesculi’ clade in PG3. As was the case for xylA 2, these two genes are independent of the xylRAFGH locus and are not similar to any component of this operon. Neither of the two genes was found outside the ‘aesculi’ clade, and they also occurred variably within this group.
In both PG1 and PG3 strains, the xylA 2 gene occurs immediately downstream of three genes with putative functions in the degradation of rhizopines, compounds which are synthesized by nitrogen‐fixing bacteria within the root nodules of leguminous plants (Bahar et al., 1998; Murphy et al., 1995; Saint et al., 1993). This cluster of genes, denoted mocDEF, was also inferred to have been acquired at the root of PG3, and is similarly exclusive to PG3 strains and to pathovars theae and actinidiae in PG1.
We also found evidence for the gain of at least six T3SEs along the lineage leading to pv. aesculi. Of particular interest is the effector gene hopV1, gained along the branch ancestral to PG3. blast analysis revealed that this gene was ubiquitous among PG3 strains, but it was also found in pathovars theae and tomato str. DC3000 in PG1. Phylogenetic analysis of hopV1 showed that the pv. theae homologue clustered within the PG3 clade, suggesting the recent transfer of this gene from a PG3 lineage into the pv. theae genome (Fig. S3, see Supporting Information). Alignment of hopV1 to the aesc2336 assembly showed that it was inserted immediately downstream of the xylRAFGH operon discussed above. Furthermore, we detected a topological discordance relative to the core genome phylogeny at the nearby xylH locus, such that PG1 and PG3 homologues cluster monophyletically, with PG2 basal to this group (Fig. S4, see Supporting Information), suggesting that the transfer of hopV1 between PGs may have involved homologous recombination of the xylH locus.
Our reconstruction showed that the β‐ketoadipate operon had been gained at the root of the ‘aesculi’ clade in PG3. Phylogenetic analysis of the ∼7.5‐kb concatenated alignment of the 10 genes of this operon showed the well‐supported partitioning of these homologues into clusters that correspond to PGs 1 and 3 of the core genome phylogeny (Fig. 6). The observed level of divergence between PG1 and PG3 homologues, however, was approximately half that of genes of the core genome (average p distance of 0.127 versus 0.215). Partitioning this divergence into its constituent synonymous and non‐synonymous components showed an average K s of 0.097 and an average K a of 0.007, both of which are at least an order of magnitude lower than those observed for core genes (Table S4). Phylogenies of genes immediately upstream [tree (iv)] and downstream [trees (vi) and (vii)] of the operon show the clustering of PG1 with PG3, whereas phylogenies for loci further away [trees (i), (ii), (iii) and (viii)] resemble the core genome phylogeny (Fig. 6b).
Discussion
Our analyses demonstrate a novel approach for the detection of genes that may be important in the expression of certain phenotypes by bacterial lineages. We used Pagel's (1994) method of detecting correlated evolution of discrete traits along a phylogeny, defining one trait as gene occurrence (presence or absence) and the other as the ability (or otherwise) to cause disease in the woody parts of a host plant. Below, we discuss the wider implications of our results in the context of recent literature regarding P. syringae population genomics and evolution, and highlight a number of genes and pathways that merit further investigation with regard to the genetic basis of P. syringae pathogenesis in the woody parts of host plants.
The distribution of flexible genes contains an ecological signal that is dependent on niche type
We have found that a substantial proportion of the P. syringae flexible genome (∼7%, or about 700 genes) is significantly associated with the ability to colonize the woody parts of a plant host. This suggests that, for a certain fraction of the flexible genome at least, patterns of gain and loss are neither random nor strictly inherited (i.e. congruent with phylogeny); rather, they follow associations based on the ecological characteristics of these lineages—namely, the ability or otherwise to exist in the woody niche. This implies, perhaps unsurprisingly, that strains inhabiting a given ecological niche require the same, or similar, sets of functions that are encoded by the same, or similar, sets of genes, in order to proliferate. Given the extent of HGT‐mediated genome fluctuation in P. syringae genomes, this suggests a convergent ‘tailoring’ of the flexible genome that is determined within the ecological context of the environment in which it resides.
Our observations fit well with models regarding the role of HGT in bacterial niche adaptation (Ochman et al., 2005; Polz et al., 2013), and lend support to recent suggestions of an epidemic population structure for P. syringae, whereby clonal expansions of plant‐pathogenic lineages emerge from a highly diverse and recombinogenic background population that lives primarily in environmental habitats (Monteil et al., 2013; Vinatzer and Monteil, 2014). Although the majority of strains included in this study are plant pathogens, the results presented here suggest that HGT‐mediated genome fluctuation may also facilitate the transition of a P. syringae lineage from an epiphyte/environmental bacterium to a pathogen.
It follows that genes that are significantly associated with the woody niche are likely to confer a selective advantage when expressed in that environment. We note that a number of genes involved in the utilization of urea are among the set most significantly associated with the woody niche. Although these genes were not exclusive to woody host strains, we speculate that the ability to breakdown urea may be an important trait of strains that have invaded the nitrogen‐limited woody parts of host plants (Eriksson et al., 2012; Higuchi, 2012), although further work is needed to confirm this hypothesis. In addition, two enzymes (muconate cycloisomerase and 4‐oxalocrotonate tautomerase) have roles in the degradation of wood‐derived compounds, such as xylene and toluene. We also found that two T3SEs, HopAY1 and HopAO1, are significantly associated with the woody niche, whereas a further two (HopH3 and HopZ5) have been independently acquired by multiple woody host lineages, and are found only in strains that are pathogens of woody hosts.
It is interesting to note the large number of proteins that we infer to be either hypothetical proteins or involved in transposition among the most significantly associated genes. This may be a result of the HGT process itself, which is likely to involve mobile elements, such as plasmids and pathogenicity islands, which are rich in both insertion sequences and coding sequences of unknown function. Nonetheless, we observe a clear signal of association from these data at the genome‐wide level: when these strains are defined by the fairly broad ecological distinction of woody versus non‐woody host type, the occurrence profile of specific genes is dictated not by phylogeny, but by ecology. Thus, we suggest that these genes and pathways merit further investigation with regard to the genetic basis of P. syringae adaptation onto woody hosts.
Gain, loss and rearrangement within the d‐xylose operon
Our results implicate the utilization of d‐xylose as a potentially important adaptation in woody host‐infecting pathovars in PG1 and PG3. d‐Xylose is an environmentally abundant pentose sugar, and is the primary constituent of hemicellulose xylan, itself a major component of both hard‐ and softwoods (Jeffries, 1983). We infer the gain of a number of genes involved in both the transport and isomerization of d‐xylose along lineages within both PG1 and PG3. For example, the reduced level of divergence observed for an alternative xylose isomerase gene (xylA 2), involved in the incorporation of d‐xylose into the pentose phosphate pathway (Bettiga et al., 2008; Stephens et al., 2007), suggests that the time to coalescence for PG1 and PG3 xylA 2 homologues is much shorter than the genome‐wide average. This reduction in divergence is unlikely to be caused by selectional constraints, as the K a/K s ratio, which is an indicator of the strength and type of selection that may be acting on a gene (Li, 1993; Sharp, 1997), implies that the xylA 2 gene is not experiencing a stronger level of purifying selection relative to the genome‐wide average. Importantly, these imported xyl genes are not part of the d‐xylose degradation operon (xylRAFGH), which is present in all lineages regardless of host type. The additional xyl genes are highly diverged from their xylRAFGH homologues and are therefore unlikely to have arisen via duplication. Thus, we infer that these genes have been imported via HGT from outside the P. syringae species complex and, although the specific function of these imported xyl genes is yet to be determined, we hypothesize that their presence may allow for an increase in either the rate or efficiency of d‐xylose utilization in the woody environment.
The proximity and orientation of the T3SE gene hopV1 to the xylRAFGH operon suggest that hopV1 may be co‐expressed with the inducement of the xylose operon—i.e. in the presence of d‐xylose. This mechanism may be selectively advantageous if HopV1 contributes to pathogenicity in xylose‐rich environments, such as the woody tissues of an infected woody host plant.
The alternative xylose isomerase gene (xylA 2) is located next to three genes (mocDEF) with putative functions in the degradation of opine compounds. The mocDEF genes encoded by rhizobial species have been well characterized in their capacity to utilize rhizopines (Bahar et al., 1998), but the action of these genes is also thought to be similar to the initial stages of the degradation of aromatic hydrocarbons, such as toluene, benzene and xylene (Bahar et al., 2000; Suzuki et al., 1991). The production of opine compounds is a common feature of gall‐inducing bacterial species from the genus Agrobacterium (Kim and Farrand, 1996); however, the mocDEF genes encoded by P. syringae are not similar to genes in the Agrobacterium pathway, and there is no evidence of the remainder of this operon (mocCABR) in any P. syringae lineage. Thus, although the putative function of the mocDEF genes in P. syringae remains unclear, their presence may allow for the utilization of opine‐like molecules that are produced by other bacteria on woody plants, or as a part of an alternative and uncharacterized pathway involved in the degradation of aromatic compounds, such as toluene and xylene.
Acquisition of the β‐ketoadipate pathway coincides with expansion into the woody niche across PGs
A number of studies have indicated the potential importance of the β‐ketoadipate operon in the ability of pathovars, such as aesculi, savastanoi and actinidiae, to cause disease in their respective host plants (Green et al., 2010; Marcelletti et al., 2011; Rodríguez‐Palenzuela et al., 2010). More recently, Bartoli et al. (2015) have shown a correlation between the presence of this locus and the ability of strains to grow endophytically in the stems of kiwifruit, highlighting the importance of these genes in the adaptation of P. syringae to that woody niche. In our extended analysis (and in agreement with the results of Bartoli et al., 2015), we find this operon to be present in the major expansions of P. syringae onto woody hosts in both PG1 and PG3. We infer this pathway to have been gained at the root of the large monophyletic cluster of woody host strains in PG3, and we hypothesize that the gain of these genes may have been the underlying factor that facilitated the remarkable diversification of this group of PG3 lineages onto a range of woody host species.
Bartoli et al. (2015) have suggested that the presence of the β‐ketoadipate operon in PG1 and PG3 strains is most probably the result of a single gain in the ancestor to the P. syringae species complex. However, our results show a reduced level of divergence between PG1 and PG3 homologues at this locus that would indicate a more recent common ancestor for these genes, relative to the genome‐wide average, and evidence for phylogenetic discordance at genes flanking the β‐ketoadipate cluster, indicative of recombination in these regions. The reduced divergence is again unlikely to be a result of selection, as the K a/K s ratio does not indicate that these genes are experiencing unusually strong purifying selection, relative to the genome‐wide average. Thus, we suggest that the β‐ketoadipate operon was probably gained subsequent to the differentiation of PGs 1, 2 and 3 from a source most likely outside the P. syringae species complex, and that a recombination event between an ancestral PG1 lineage and an ancestral PG3 lineage resulted in the presence of these genes in both PGs. Given that K s within the ‘actinidiae’ clade is about twice that of the ‘aesculi’ clade, the most likely scenario is that the operon was first acquired by a PG1 lineage, and was transferred into PG3 soon after. A number of other factors, such as the reduced divergence between the PG1 and PG3 xylA 2 homologues and the phylogenetic placement of the hopV1 gene, also point to a history of recombination between woody host lineages in PGs 1 and 3.
Although the β‐ketoadipate pathway is likely to be important for pathogenesis in pathovars such as aesculi and actinidiae, it is clearly not required for all pathogens of woody hosts. It is interesting to note the absence of this pathway from the PG1 pv. avellanae strains (avel631 and avelec1), the causal agents of hazelnut decline. These strains are close relatives of pathovars actinidiae, theae and morsprunorum, and cluster as a sister clade to these pathovars. Thus, PG1 pv. avellanae strains, together with all PG2 pathogens of woody hosts (primarily species of fruit tree, such as cherry and apple), must use alternative metabolic pathways that are yet to be elucidated. Furthermore, it is intriguing to note the presence of these genes in the PG1 pv. morsprunorum str. 302280PT (mors302280), despite the apparent non‐pathogenicity of this strain on its plum host (Gardan et al., 1999; Ménard et al., 2003). Although further testing may be required to confirm the non‐pathogenicity of mors302280, we hypothesize that this strain may have lost some other component that is required for pathogenesis, either during passage or in the wild, highlighting the potential rapidity at which the transition between a pathogen and an epiphyte can occur.
A novel approach for the detection of candidate genes from whole‐genome data
The search for associations between genotype and phenotype has been used as an analytical approach in many areas of research, particularly in relation to humans and disease (e.g. Hirschhorn and Daly, 2005). The application of the same principles to bacterial populations, however, has only recently gained traction, primarily because of the problems associated with accounting for the underlying structure of bacterial populations (e.g. Falush and Bowden, 2006). Consequently, the number of available methods for addressing these questions remains limited (but see Sheppard et al., 2013 for a notable alternative method). Here, we describe a novel approach for the detection of candidate genes that may be functionally involved in the expression of a given phenotype by a bacterial lineage. Our method combines phylogenetics and whole‐genome data within a statistical framework, and highlights a number of genes and associated pathways that may be involved in the adaptation of P. syringae to woody hosts. Further work is now required to confirm these findings, and to elucidate the potential roles of these genes in pathogenesis. Given the increasing availability of genomic data in other genera, including a number of other plant‐pathogenic microbial systems, such as Xanthomonas and Phytophthora, we suggest that our method may be useful as a first step for the rapid identification of candidate genes from whole‐genome sequence data.
Experimental Procedures
Strain information
We selected 26 strains of 16 different pathovars for whole‐genome shotgun sequencing. All strains have been reported to infect the woody parts of their respective host species, and to cause a range of diseases with symptoms including cankers, galls, knots and tissue necrosis. Information regarding the source, host, disease symptoms and reference is provided in Table 1 for all strains used in this study.
Freeze‐dried samples were revived by streaking onto King's B agar and incubated for 24 h at room temperature. For each strain, a single colony was selected and grown overnight in 3 mL of King's B broth for 12 h with shaking at room temperature. Laboratory passage of strains was minimized to avoid the loss of non‐essential genes, although the total length of passage since the original isolation is not known. For each isolate, cells were harvested by centrifugation of 1.5 mL of overnight culture at 1400 g for 5 min, discarding the supernatant and storing at −80°C. Genomic DNA was extracted using the DNeasy Plant Mini Kit (Qiagen, Hilden, Germany), following the standard protocol.
Whole‐genome sequencing, assembly and annotation
For each strain, a single library with an estimated average insert of ∼270 bases was prepared by ARK Genomics (now Edinburgh Genomics, Edinburgh, UK) using Illumina Nextera reagents. Libraries were multiplexed and run on a single lane of an Illumina MiSeq benchtop sequencer by ARK Genomics, to generate datasets of 250 base paired‐end reads.
Reads containing adapter contamination were identified and trimmed using a combination of CutAdapt v1.2.1 (Martin, 2011) and TagDust v1.12 (Lassmann et al., 2009). Low‐quality base pairs (quality score threshold < 25) were trimmed using ConDeTri v2.2 (Smeds and Künstner, 2011). The final assembly for all strains was performed using a modified version of the SPAdes assembler v2.4.0 (Bankevich et al., 2012) that allowed for an increased final k‐mer of 229. Assembly ‘completeness’ was assessed by mapping the adapter‐ and quality‐trimmed reads to its assembly using the Bowtie2 aligner v2.2.6 (Langmead and Salzberg, 2012) and counting the proportion of data that aligned. Gene repertoire completeness was also assessed by querying a set of 40 ‘core’ bacterial proteins, recently defined by Simão et al. (2015), against each assembly using tblastn (E‐value ≤ 1e–5). All genomes were annotated with the Rapid Annotation using Subsystem Technology (RAST) online server (Aziz et al., 2008; Overbeek et al., 2013). This Whole Genome Shotgun project, including raw data, has been deposited at DDBJ/EMBL/GenBank under the BioProject accession number PRJNA287460.
Sequence data and orthology
Genome data for an additional 38 strains were downloaded from the National Center for Biotechnology Information (NCBI) GenBank, giving a total of 64 strains of 33 pathovars. The genome sequences for certain strains, e.g. pv. oryzae str. 1_6, were explicitly excluded because of a high level of fragmentation, which is known to cause errors in the inference of orthology among proteins. To account for potential variation in gene content as a result of differences in annotation methodologies, all strains were re‐annotated using RAST, with the exception of the extensively curated genomes of pv. tomato str. DC3000, pv. phaseolicola str. 1448A and pv. syringae str. B728a.
Proteins were clustered into orthologous groups (OGs) using OrthoMCL v2.0.9 (Li et al., 2003; Van Dongen, 2000). The OrthoMCL pipeline first performs an all‐versus‐all blast (E‐value ≤ 1e–5), followed by Markov clustering (MCL), to determine clusters of orthologous proteins. MCL was performed across a range of inflation indices from 1.2 to 4.8, choosing the final value, 1.5, which maximizes the number of single‐copy OGs in all 64 strains (Swingley et al., 2008). The resultant list of putative OGs was subjected to a number of quality control procedures as per Nowell et al. (2014) to improve the inference of orthologous relationships among proteins.
Phylogenetics and reconstruction of gene gain and loss
The evolutionary history of the core genome was estimated from the concatenated alignment of 2086 one‐to‐one (single‐copy) orthologous genes. Nucleotide alignments were generated using T‐Coffee (Notredame et al., 2000) and concatenated using Geneious. Gap columns were removed, giving a final alignment of 1.15 Mb in length. A maximum likelihood phylogeny was constructed in RAxML v7.2.8 (Stamatakis, 2006), using a GTR + Γ model of evolution, and 100 bootstrap resamples.
The list of OGs was converted into a binary matrix of gene occurrence and mapped onto the core genome phylogeny using GLOOME software (Cohen and Pupko, 2011; Cohen et al., 2008, 2010). Briefly, this method uses stochastic mapping to infer both the total number of gene gains and losses per branch and the associated probability of gain for all OGs across all branches of the phylogeny, allowing for the identification of genes with a high probability of gain (≥0.8) along specific branches of the phylogeny.
Where applicable, gained genes were functionally annotated using blast and/or blast2go (Conesa et al., 2005); nucleotide data for individual genes were aligned using Geneious v5.4 (Biomatters Ltd., Auckland, New Zealand) and phylogenies were constructed using PhyML v3.0 (Guindon and Gascuel, 2003; Guindon et al., 2010), employing the general time reversible model of evolution with four gamma‐distributed rate categories (GTR + Γ), and 100 bootstrap replicates to assess topological support.
Distribution of T3SEs and virulence factors
Sequence data for T3SEs were downloaded from www.pseudomonas-syringae.org (16 August 2013) and combined with a multi‐species T3SE database compiled by Wang et al. (2012) to give a database of 1729 sequences. These were queried against the genomes using tblastn (E‐value ≤ 1e–5), defining presence by similarity if a hit showed a minimum of 80% identity over at least 80% query length. Putative truncation was recorded if a hit showed ≥80% identity over <80% query length. It should be noted that the ability of each putative effector to be translocated was not tested. The same schema was used for screening for a range of other virulence factors.
Statistical modelling of correlated evolution
We modelled correlated evolution between two traits, host type and gene occurrence, using the ‘Discrete’ module of the program BayesTraits v2 (Pagel, 1994; Pagel and Meade, 2006). This method fits continuous‐time Markov models to discrete binary data, and calculates the likelihood of two hierarchically nested evolutionary models, one in which two traits are allowed to evolve independently along a phylogenetic tree and another in which the two traits evolve in a correlated (dependent) manner (Barker and Pagel, 2005; Pagel, 1994; Pagel and Meade, 2006). We define host type as a discrete binary trait designated ‘woody’ (W) or ‘herbaceous’ (H), dependent on the natural ability of an individual strain to proliferate within the woody organs of its host. Pathogenic capabilities were not tested explicitly; trait designation for host type was inferred on the basis of careful analysis of the literature for each strain. Gene occurrence was defined as a discrete binary trait, designated either ‘1’ for gene presence or ‘0’ for gene absence. Our model therefore makes two important assumptions: (i) that the host‐type trait is in fact discrete, binary and mutually exclusive—strains that may have the ability to colonize both woody and non‐woody hosts are not accounted for; and (ii) that no genes have been lost in the time between the description of each strain's pathogenicity and genome sequencing.
Hypothesis testing and null model
The goodness of fit of the dependent versus the independent model was compared using an LR test: where is the likelihood of the independent model and is the likelihood of the dependent model (Pagel, 1994). A custom Perl script (available from https://github.com/reubwn/bayestraits-wrapper) was written that ran both models and calculated the LR statistic for all genes that occurred in either greater than five or fewer than 59 strains (i.e. excluding genes that were present at either a very low or very high frequency), resulting in a total of 3883 LRs.
To account for the problem of multiple testing, we constructed a null distribution of LRs that describes the random association between host preference and gene presence (Barker and Pagel, 2005). The construction of an empirically estimated null distribution negates the need for corrections, such as Bonferroni adjustment, as the null model should provide the expected distribution of LRs under the hypothesis of no association between the two traits, given a large number of individual tests. The null LR distribution was constructed by randomly permuting the gene occurrence data for each of the 3883 tested genes a total of ten times, in each case calculating a new LR statistic. The phylogeny, the H/W trait designations for each taxon and the overall proportion of gene presence relative to absence at each gene were held constant; only the occurrence profile was permuted; P value thresholds were then derived directly from the null distribution. An alternative null model, in which only the host‐type trait designation (H or W) was permuted, was also calculated for comparison.
Supporting information
Acknowledgements
We would like to thank Darren J. Obbard for assistance with genome assembly, and three anonymous reviewers for helpful and informative comments. RWN was supported by a Biotechnology and Biological Sciences Research Council (BBSRC) Doctoral Training Grant (BB/F017030/1) and a Cooperative Awards in Science & Technology (CASE) Studentship awarded by Forest Research.
References
- Almeida, N.F. , Yan, S. , Lindeberg, M. , Studholme, D.J. , Schneider, D.J. , Condon, B. , Liu, H. , Viana, C.J. , Warren, A. , Evans, C. , Kemen, E. , Maclean, D. , Angot, A. , Martin, G.B. , Jones, J.D. , Collmer, A. , Setubal, J.C. and Vinatzer, B.A. (2009) A draft genome sequence of Pseudomonas syringae pv. tomato T1 reveals a type III effector repertoire significantly divergent from that of Pseudomonas syringae pv. tomato DC3000. Mol. Plant–Microbe Interact. 22, 52–62. [DOI] [PubMed] [Google Scholar]
- Aziz, R.K. , Bartels, D. , Best, A.A. , DeJongh, M. , Disz, T. , Edwards, R.A. , Formsma, K. , Gerdes, S. , Glass, E.M. , Kubal, M. , Meyer, F. , Olsen, G.J. , Olson, R. , Osterman, A.L. , Overbeek, R.A. , McNeil, L.K. , Paarmann, D. , Paczian, T. , Parrello, B. , Pusch, G.D. , Reich, C. , Stevens, R. , Vassieva, O. , Vonstein, V. , Wilke, A. and Zagnitko, O. (2008) The RAST Server: rapid annotations using subsystems technology. BMC Genomics, 9, 75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bahar, M. , de Majnik, J. , Wexler, M. , Fry, J. , Poole, P.S. and Murphy, P.J. (1998) A model for the catabolism of rhizopine in Rhizobium leguminosarum involves a ferredoxin oxygenase complex and the inositol degradative pathway. Mol. Plant–Microbe Interact. 11, 1057–1068. [DOI] [PubMed] [Google Scholar]
- Bahar, M. , de Majnik, J. , Saint, C.P. and Murphy, P.J. (2000) Conservation of a pseudomonad‐like hydrocarbon degradative ferredoxin oxygenase complex involved in rhizopine catabolism in Sinorhizobium meliloti and Rhizobium leguminosarum bv. viciae . J. Mol. Microbiol. Biotechnol. 2, 257–259. [PubMed] [Google Scholar]
- Balestra, G.M. , Renzi, M. and Mazzaglia, A. (2010) First report of bacterial canker of Actinidia deliciosa caused by Pseudomonas syringae pv. actinidiae in Portugal. New Dis. Rep. 22, 10. [Google Scholar]
- Baltrus, D.A. , Nishimura, M.T. , Romanchuk, A. , Chang, J.H. , Mukhtar, M.S. , Cherkis, K. , Roach, J. , Grant, S.R. , Jones, C.D. and Dangl, J.L. (2011) Dynamic evolution of pathogenicity revealed by sequencing and comparative genomics of 19 Pseudomonas syringae isolates. PLoS Pathog. 7, e1002132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baltrus, D.A. , Yourstone, S. , Lind, A. , Guilbaud, C. , Sands, D.C. , Jones, C.D. , Morris, C.E. and Dangl, J.L. (2014a) Draft genome sequences of a phylogenetically diverse suite of Pseudomonas syringae strains from multiple source populations. Genome Announc. 2, e01195–e01113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baltrus, D.A. , Dougherty, K. , Beckstrom‐Sternberg, S.M. , Beckstrom‐Sternberg, J.S. and Foster, J.T. (2014b) Incongruence between multi‐locus sequence analysis (MLSA) and whole‐genome‐based phylogenies: Pseudomonas syringae pathovar pisi as a cautionary tale. Mol. Plant Pathol. 15, 461–465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bankevich, A. , Nurk, S. , Antipov, D. , Gurevich, A.A. , Dvorkin, M. , Kulikov, A.S. , Lesin, V.M. , Nikolenko, S.I. , Pham, S. , Prjibelski, A.D. , Pyshkin, A.V. , Sirotkin, A.V. , Vyahhi, N. , Tesler, G. , Alekseyev, M.A. and Pevzner, P.A. (2012) SPAdes: a new genome assembly algorithm and its applications to single‐cell sequencing. J. Comput. Biol. 19, 455–477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker, D. and Pagel, M. (2005) Predicting functional gene links from phylogenetic‐statistical analyses of whole genomes. PLoS Comput. Biol. 1, e3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartoli, C. , Lamichhane, J.R. , Berge, O. , Guilbaud, C. , Varvaro, L. , Balestra, G.M. , Vinatzer, B.A. and Morris, C.E. (2015) A framework to gauge the epidemic potential of plant pathogens in environmental reservoirs: the example of kiwifruit canker. Mol. Plant Pathol. 16, 137–149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berge, O. , Monteil, C.L. , Bartoli, C. , Chandeysson, C. , Guilbaud, C. , Sands, D.C. and Morris, C.E. (2014) A user's guide to a data base of the diversity of Pseudomonas syringae and its application to classifying strains in this phylogenetic complex. PLoS One, 9, e105547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bettiga, M. , Hahn‐Hägerdal, B. and Gorwa‐Grauslund, M.F. (2008) Comparing the xylose reductase/xylitol dehydrogenase and xylose isomerase pathways in arabinose and xylose fermenting Saccharomyces cerevisiae strains. Biotechnol. Biofuels, 1, 16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buell, C.R. , Joardar, V. , Lindeberg, M. , Selengut, J. , Paulsen, I.T. , Gwinn, M.L. , Dodson, R.J. , DeBoy, R.T. , Durkin, A.S. , Kolonay, J.F. , Madupu, R. , Daugherty, S. , Brinkac, L. , Beanan, M.J. , Haft, D.H. , Nelson, W.C. , Davidsen, T. , Zafar, N. , Zhou, L.W. , Liu, J. , Yuan, Q.P. , Khouri, H. , Fedorova, N. , Tran, B. , Russell, D. , Berry, K. , Utterback, T. , Van Aken, S.E. , Feldblyum, T.V. , D'ascenzo, M. , Deng, W.L. , Ramos, A.R. , Alfano, J.R. , Cartinhour, S. , Chatterjee, A.K. , Delaney, T.P. , Lazarowitz, S.G. , Martin, G.B. , Schneider, D.J. , Tang, X.Y. , Bender, C.L. , White, O. , Fraser, C.M. and Collmer, A. (2003) The complete genome sequence of the Arabidopsis and tomato pathogen Pseudomonas syringae pv. tomato DC3000. Proc. Nat. Acad. Sci. USA, 100, 10 181–10 186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bull, C.T. , Manceau, C. , Lydon, J. , Kong, H. , Vinatzer, B.A. and Saux, M.F.L. (2010) Pseudomonas cannabina pv. cannabina pv. nov., and Pseudomonas cannabina pv. alisalensis (Cintas Koike and Bull, 2000) comb. nov., are members of the emended species Pseudomonas cannabina (ex Sutic & Dowson 1959) Gardan, Shafik, Belouin, Brosch, Grimont & Grimont 1999. Syst. Appl. Microbiol. 33, 105–115. [DOI] [PubMed] [Google Scholar]
- Cai, R. , Lewis, J. , Yan, S. , Liu, H. , Clarke, C.R. , Campanile, F. , Almeida, N.F. , Studholme, D.J. , Lindeberg, M. , Schneider, D. , Zaccardelli, M. , Setubal, J.C. , Morales‐Lizcano, N.P. , Bernal, A. , Coaker, G. , Baker, C. , Bender, C.L. , Leman, S. and Vinatzer, B.A. (2011) The plant pathogen Pseudomonas syringae pv. tomato is genetically monomorphic and under strong selection to evade tomato immunity. PLoS Pathog. 7, e1002130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clarke, C.R. , Cai, R. , Studholme, D.J. , Guttman, D.S. and Vinatzer, B.A. (2010) Pseudomonas syringae strains naturally lacking the classical P. syringae hrp/hrc locus are common leaf colonizers equipped with an atypical type III secretion system. Mol. Plant–Microbe Interact. 23, 198–210. [DOI] [PubMed] [Google Scholar]
- Cohen, O. and Pupko, T. (2011) Inference of gain and loss events from phyletic patterns using stochastic mapping and maximum parsimony—a simulation study. Genome Biol. Evol. 3, 1265–1275. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen, O. , Rubinstein, N.D. , Stern, A. , Gophna, U. and Pupko, T. (2008) A likelihood framework to analyse phyletic patterns. Philos. Trans. R. Soc. B, 363, 3903–3911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen, O. , Ashkenazy, H. , Belinky, F. , Huchon, D. and Pupko, T. (2010) GLOOME: gain loss mapping engine. Bioinformatics, 26, 2914–2915. [DOI] [PubMed] [Google Scholar]
- Conesa, A. , Götz, S. , García‐Gómez, J.M. , Terol, J. , Talón, M. and Robles, M. (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics, 21, 3674–3676. [DOI] [PubMed] [Google Scholar]
- Dudnik, A. and Dudler, R. (2014) Genome and transcriptome sequences of Pseudomonas syringae pv. syringae B301D‐R. Genome Announc. 2, e00306–e00314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eriksson, K.E.L. , Blanchette, R. and Ander, P. (2012) Microbial and Enzymatic Degradation of Wood and Wood Components. Berlin: Springer. [Google Scholar]
- Ezra, D. , Barash, I. , Weinthal, D.M. , Gaba, V. and Manulis, S. (2004) pthG from Pantoea agglomerans pv. gypsophilae encodes an avirulence effector that determines incompatibility in multiple beet species. Mol. Plant Pathol. 5, 105–113. [DOI] [PubMed] [Google Scholar]
- Falush, D. and Bowden, R. (2006) Genome‐wide association mapping in bacteria? Trends Microbiol. 14, 353–355. [DOI] [PubMed] [Google Scholar]
- Feil, H. , Feil, W.S. , Chain, P. , Larimer, F. , DiBartolo, G. , Copeland, A. , Lykidis, A. , Trong, S. , Nolan, M. , Goltsman, E. , Thiel, J. , Malfatti, S. , Loper, J.E. , Lapidus, A. , Detter, J.C. , Land, M. , Richardson, P.M. , Kyrpides, N.C. , Ivanova, N. and Lindow, S.E. (2005) Comparison of the complete genome sequences of Pseudomonas syringae pv. syringae B728a and pv. tomato DC3000. Proc. Nat. Acad. Sci. USA, 102, 11 064–11 069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gardan, L. , Shafik, H. , Belouin, S. , Broch, R. , Grimont, F. and Grimont, P.A. (1999) DNA relatedness among the pathovars of Pseudomonas syringae and description of Pseudomonas tremae sp. nov. and Pseudomonas cannabina sp. nov. (ex Sutic and Dowson 1959). Int. J. Syst. Bacteriol. 49 Pt 2, 469–478. [DOI] [PubMed] [Google Scholar]
- Gardiner, D.M. , Stiller, J. , Covarelli, L. , Lindeberg, M. , Shivas, R.G. and Manners, J.M. (2013) Genome sequences of Pseudomonas spp. isolated from cereal crops. Genome Announc. 1, e00209–e00213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Green, S. , Studholme, D.J. , Laue, B.E. , Dorati, F. , Lovell, H. , Arnold, D. , Cottrell, J.E. , Bridgett, S. , Blaxter, M. , Huitema, E. , Thwaites, R. , Sharp, P.M. , Jackson, R.W. and Kamoun, S. (2010) Comparative genome analysis provides insights into the evolution and adaptation of Pseudomonas syringae pv. aesculi on Aesculus hippocastanum . PLoS One, 5, e10224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guindon, S. and Gascuel, O. (2003) A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52, 696–704. [DOI] [PubMed] [Google Scholar]
- Guindon, S. , Dufayard, J.F. , Lefort, V. , Anisimova, M. , Hordijk, W. and Gascuel, O. (2010) New algorithms and methods to estimate maximum‐likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321. [DOI] [PubMed] [Google Scholar]
- Higuchi, T. (2012) Biosynthesis and Biodegradation of Wood Components. London: Elsevier. [Google Scholar]
- Hirano, S.S. and Upper, C.D. (1990) Population biology and epidemiology of Pseudomonas syringae . Annu. Rev. Phytopathol. 28, 155–177. [Google Scholar]
- Hirschhorn, J.N. and Daly, M.J. (2005) Genome‐wide association studies for common diseases and complex traits. Nat. Rev. Genet. 6, 95–108. [DOI] [PubMed] [Google Scholar]
- Hwang, M.S.H. , Morgan, R.L. , Sarkar, S.F. , Wang, P.W. and Guttman, D.S. (2005) Phylogenetic characterization of virulence and resistance phenotypes of Pseudomonas syringae . Appl. Environ. Microbiol. 71, 5182–5191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeffries, T.W. (1983) Utilization of xylose by bacteria, yeasts, and fungi. Adv. Biochem. Eng. Biotechnol. 27, 1–32. [DOI] [PubMed] [Google Scholar]
- Joardar, V. , Lindeberg, M. , Jackson, R.W. , Selengut, J. , Dodson, R. , Brinkac, L.M. , Daugherty, S.C. , DeBoy, R. , Durkin, A.S. , Giglio, M.G. , Madupu, R. , Nelson, W.C. , Rosovitz, M.J. , Sullivan, S. , Crabtree, J. , Creasy, T. , Davidsen, T. , Haft, D.H. , Zafar, N. , Zhou, L.W. , Halpin, R. , Holley, T. , Khouri, H. , Feldblyum, T. , White, O. , Fraser, C.M. , Chatterjee, A.K. , Cartinhour, S. , Schneider, D.J. , Mansfield, J. , Collmer, A. and Buell, C.R. (2005) Whole‐genome sequence analysis of Pseudomonas syringae pv. phaseolicola 1448A reveals divergence among pathovars in genes involved in virulence and transposition. J. Bacteriol. 187, 6488–6498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim, K.S. and Farrand, S.K. (1996) Ti plasmid‐encoded genes responsible for catabolism of the crown gall opine mannopine by Agrobacterium tumefaciens are homologs of the T‐region genes responsible for synthesis of this opine by the plant tumor. J. Bacteriol. 178, 3275–3284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langmead, B. and Salzberg, S.L. (2012) Fast gapped‐read alignment with Bowtie 2. Nat. Methods, 9, 357–359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lassmann, T. , Hayashizaki, Y. and Daub, C.O. (2009) TagDust – a program to eliminate artifacts from next generation sequencing data. Bioinformatics, 25, 2839–2840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, L. , Stoeckert, C.J. and Roos, D.S. (2003) OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 13, 2178–2189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, W.H. (1993) Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J. Mol. Evol. 36, 96–99. [DOI] [PubMed] [Google Scholar]
- Lindeberg, M. , Cartinhour, S. , Myers, C.R. , Schechter, L.M. , Schneider, D.J. and Collmer, A. (2006) Closing the circle on the discovery of genes encoding Hrp regulon members and type III secretion system effectors in the genomes of three model Pseudomonas syringae strains. Mol. Plant–Microbe Interact. 19, 1151–1158. [DOI] [PubMed] [Google Scholar]
- Liu, H. , Qiu, H. , Zhao, W. , Cui, Z. , Ibrahim, M. , Jin, G. , Li, B. , Zhu, B. and Xie, G.L. (2012) Genome sequence of the plant pathogen Pseudomonas syringae pv. panici LMG 2367. J. Bacteriol. 194, 5693–5694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ma, W. , Dong, F.F.T. , Stavrinides, J. and Guttman, D.S. (2006) Type III effector diversification via both pathoadaptation and horizontal transfer in response to a coevolutionary arms race. PLoS Genet. 2, e209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mansfield, J. , Genin, S. , Magori, S. , Citovsky, V. , Sriariyanum, M. , Ronald, P. , Dow, M. , Verdier, V. , Beer, S.V. , Machado, M.A. , Toth, I. , Salmond, G. and Foster, G.D. (2012) Top 10 plant pathogenic bacteria in molecular plant pathology. Mol. Plant Pathol. 13, 614–629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marcelletti, S. , Ferrante, P. , Petriccione, M. , Firrao, G. and Scortichini, M. (2011) Pseudomonas syringae pv. actinidiae draft genomes comparison reveals strain‐specific features involved in adaptation and virulence to Actinidia species. PLoS One, 6, e27297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin, M. (2011) Cutadapt removes adapter sequences from high‐throughput sequencing reads. EMBnet J. 17, 10–12. [Google Scholar]
- Mazzaglia, A. , Studholme, D.J. , Taratufolo, M.C. , Cai, R. , Almeida, N.F. , Goodman, T. , Guttman, D.S. , Vinatzer, B.A. and Balestra, G.M. (2012) Pseudomonas syringae pv. actinidiae (PSA) isolates from recent bacterial canker of kiwifruit outbreaks belong to the same genetic lineage. PLoS One, 7, e36518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ménard, M. , Sutra, L. , Luisetti, J. , Prunier, J.P. and Gardan, L. (2003) Pseudomonas syringae pv. avii (pv. nov.), the causal agent of bacterial canker of wild cherries (Prunus avium) in France. Eur. J. Plant Pathol. 109, 565–576. [Google Scholar]
- Monteil, C.L. , Guilbaud, C. , Glaux, C. , Lafolie, F. , Soubeyrand, S. and Morris, C.E. (2012) Emigration of the plant pathogen Pseudomonas syringae from leaf litter contributes to its population dynamics in alpine snowpack. Environ. Microbiol. 14, 2099–2112. [DOI] [PubMed] [Google Scholar]
- Monteil, C.L. , Cai, R. , Liu, H. , Llontop, M.E.M. , Leman, S. , Studholme, D.J. , Morris, C.E. and Vinatzer, B.A. (2013) Nonagricultural reservoirs contribute to emergence and evolution of Pseudomonas syringae crop pathogens. New Phytol. 199, 800–811. [DOI] [PubMed] [Google Scholar]
- Monteil, C.L. , Lafolie, F. , Laurent, J. , Clement, J.C. , Simler, R. , Travi, Y. and Morris, C.E. (2014) Soil water flow is a source of the plant pathogen Pseudomonas syringae in subalpine headwaters. Environ. Microbiol. 16, 2038–2052. [DOI] [PubMed] [Google Scholar]
- Morris, C.E. , Bardin, M. , Kinkel, L.L. , Moury, B. , Nicot, P.C. and Sands, D.C. (2009) Expanding the paradigms of plant pathogen life history and evolution of parasitic fitness beyond agricultural boundaries. PLoS Pathog. 5, e1000693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mucyn, T.S. , Yourstone, S. , Lind, A.L. , Biswas, S. , Nishimura, M.T. , Baltrus, D.A. , Cumbie, J.S. , Chang, J.H. , Jones, C.D. , Dangl, J.L. and Grant, S.R. (2014) Variable suites of non‐effector genes are co‐regulated in the type III secretion virulence regulon across the Pseudomonas syringae phylogeny. PLoS Pathog. 10, e1003807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murphy, P.J. , Wexler, W. , Grzemski, W. , Rao, J.P. and Gordon, D. (1995) Rhizopines – their role in symbiosis and competition. Soil Biol. Biochem. 27, 525–529. [Google Scholar]
- Notredame, C. , Higgins, D.G. and Heringa, J. (2000) T‐Coffee: a novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302, 205–217. [DOI] [PubMed] [Google Scholar]
- Nowell, R.W. , Green, S. , Laue, B.E. and Sharp, P.M. (2014) The extent of genome flux and its role in the differentiation of bacterial lineages. Genome Biol. Evol. 6, 1514–1529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ochman, H. , Lerat, E. and Daubin, V. (2005) Examining bacterial species under the specter of gene transfer and exchange. Proc. Natl. Acad. Sci. USA, 102 Suppl 1, 6595–6599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Overbeek, R. , Olson, R. , Pusch, G.D. , Olsen, G.J. , Davis, J.J. , Disz, T. , Edwards, R.A. , Gerdes, S. , Parrello, B. , Shukla, M. , Vonstein, V. , Wattam, A.R. , Xia, F. and Stevens, R. (2013) The SEED and the rapid annotation of microbial genomes using subsystems technology (RAST). Nucleic Acids Res. 42, D206–D214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O'Brien, H.E. , Desveaux, D. and Guttman, D.S. (2011) Next‐generation genomics of Pseudomonas syringae . Curr. Opin. Microbiol. 14, 24–30. [DOI] [PubMed] [Google Scholar]
- O'Brien, H.E. , Thakur, S. , Gong, Y. , Fung, P. , Zhang, J. , Yuan, L. , Wang, P.W. , Yong, C. , Scortichini, M. and Guttman, D.S. (2012) Extensive remodeling of the Pseudomonas syringae pv. avellanae type III secretome associated with two independent host shifts onto hazelnut. BMC Microbiol. 12, 141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pagel, M. (1994) Detecting correlated evolution on phylogenies: a general method for the comparative analysis of discrete characters. Proc. R. Soc. London, Ser. B, 255, 37–45. [Google Scholar]
- Pagel, M. and Meade, A. (2006) Bayesian analysis of correlated evolution of discrete characters by reversible‐jump Markov chain Monte Carlo. Am. Nat. 167, 808–825. [DOI] [PubMed] [Google Scholar]
- Pitman, A.R. , Jackson, R.W. , Mansfield, J.W. , Kaitell, V. , Thwaites, R. and Arnold, D.L. (2005) Exposure to host resistance mechanisms drives evolution of bacterial virulence in plants. Curr. Biol. 15, 2230–2235. [DOI] [PubMed] [Google Scholar]
- Polz, M.F. , Alm, E.J. and Hanage, W.P. (2013) Horizontal gene transfer and the evolution of bacterial and archaeal population structure. Trends Genet. 29, 170–175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qi, M. , Wang, D. , Bradley, C.A. and Zhao, Y. (2011) Genome sequence analyses of Pseudomonas savastanoi pv. glycinea and subtractive hybridization‐based comparative genomics with nine Pseudomonads. PLoS One, 6, e16451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rodríguez‐Moreno, L. , Jiménez, A.J. and Ramos, C. (2009) Endopathogenic lifestyle of Pseudomonas savastanoi pv. savastanoi in olive knots. Microb. Biotechnol. 2, 476–488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rodríguez‐Palenzuela, P. , Matas, I.M. , López‐Solanilla, E. , Bardaji, L. , Pérez‐Martínez, I. , Rodríguez‐Moskera, M.E. , Penyalver, R. , López, M.M. , Quesada, J.M. , Biehl, B.S. , Perna, N.T. , Glasner, J.D. , Cabot, E.L. , Neeno‐Eckwall, E. and Ramos, C. (2010) Annotation and overview of the Pseudomonas savastanoi pv. savastanoi NCPPB 3335 draft genome reveals the virulence gene complement of a tumour‐inducing pathogen of woody hosts. Environ. Microbiol. 12, 1604–1620. [DOI] [PubMed] [Google Scholar]
- Saint, C.P. , Wexler, M. , Murphy, P.J. , Tempé, J. and Tate, M.E. (1993) Characterization of genes for synthesis and catabolism of a new rhizopine induced in nodules by Rhizobium meliloti Rm220‐3: extension of the rhizopine concept. J. Bacteriol. 175, 5205–5215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sarkar, S.F. and Guttman, D.S. (2004) Evolution of the core genome of Pseudomonas syringae, a highly clonal, endemic plant pathogen. Appl. Environ. Microbiol. 70, 1999–2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scortichini, M. , Marchesi, U. , Rossi, M.P. and Di Prospero, P. (2002) Bacteria associated with hazelnut (Corylus avellana L.) decline are of two groups: Pseudomonas avellanae and strains resembling P. syringae pv. syringae . Appl. Environ. Microbiol. 68, 476–484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scortichini, M. , Marcelletti, S. , Ferrante, P. and Firrao, G. (2013) A genomic redefinition of Pseudomonas avellanae species. PLoS One, 8, e75794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sharp, P.M. (1997) In search of molecular darwinism. Nature, 385, 111–112. [DOI] [PubMed] [Google Scholar]
- Sheppard, S.K. , Didelot, X. , Meric, G. , Torralbo, A. , Jolley, K.A. , Kelly, D.J. , Bentley, S.D. , Maiden, M.C.J. , Parkhill, J. and Falush, D. (2013) Genome‐wide association study identifies vitamin B5 biosynthesis as a host specificity factor in Campylobacter. Proc. Natl. Acad. Sci. USA, 110, 11 923–11 927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simão, F.A. , Waterhouse, R.M. , Ioannidis, P. , Kriventseva, E.V. and Zdobnov, E.M. (2015) BUSCO: assessing genome assembly and annotation completeness with single‐copy orthologs. Bioinformatics, 31, 3210–3212. [DOI] [PubMed] [Google Scholar]
- Smeds, L. and Künstner, A. (2011) ConDeTri – a content dependent read trimmer for Illumina data. PLoS One, 6, e26314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stamatakis, A. (2006) RAxML‐VI‐HPC: maximum likelihood‐based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics, 22, 2688–2690. [DOI] [PubMed] [Google Scholar]
- Stephens, C. , Christen, B. , Fuchs, T. , Sundaram, V. , Watanabe, K. and Jenal, U. (2007) Genetic analysis of a novel pathway for D‐xylose metabolism in Caulobacter crescentus . J. Bacteriol. 189, 2181–2185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Studholme, D.J. , Ibanez, S. , MacLean, D. , Dangl, J.L. , Chang, J.H. and Rathjen, J.P. (2009) A draft genome sequence and functional screen reveals the repertoire of type III secreted proteins of Pseudomonas syringae pathovar tabaci 11528. BMC Genomics, 10, 395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suzuki, M. , Hayakawa, T. , Shaw, J.P. , Rekik, M. and Harayama, S. (1991) Primary structure of xylene monooxygenase: similarities to and differences from the alkane hydroxylation system. J. Bacteriol. 173, 1690–1695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Swingley, W.D. , Blankenship, R.E. and Raymond, J. (2008) Integrating Markov clustering and molecular phylogenetics to reconstruct the cyanobacterial species tree from conserved protein families. Mol . Biol. Evol. 25, 643–654. [DOI] [PubMed] [Google Scholar]
- Van Dongen, S.M. (2000) Graph clustering by flow simulation. PhD Thesis, University of Utrecht.
- Vinatzer, B.A. and Monteil, C.L. (2014) Pseudomonas syringae genomics: from comparative genomics of individual crop pathogen strains toward population genomics In: Genomics of Plant‐Associated Bacteria (Gross D.C., Lichens‐Park A., and Kole C., eds.), pp. 79–98. Berlin: Springer. [Google Scholar]
- Wang, Y. , Huang, H. , Sun, M. , Zhang, Q. and Guo, D. (2012) T3DB: an integrated database for bacterial type III secretion system. BMC Bioinformatics, 13, 66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Webber, J.F. , Parkinson, N.M. , Rose, J. , Stanford, H. , Cook, R.T.A. and Elphinstone, J.G. (2008) Isolation and identification of Pseudomonas syringae pv. aesculi causing bleeding canker of horse chestnut in the UK. Plant Pathol. 57, 368. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.