Abstract
The Liverpool epidemic strain (LES) is an important transmissible clonal lineage of Pseudomonas aeruginosa that chronically infects the lungs of people with cystic fibrosis (CF). Previous studies have focused on the genomics of the LES in a limited number of isolates, mostly from one CF centre in the UK, and from studies highlighting identification of the LES in Canada. Here we significantly extend the current LES genome database by genome sequencing 91 isolates from multiple CF centres across the UK, and we describe the comparative genomics of this large collection of LES isolates from the UK and Canada. Phylogenetic analysis revealed that the 145 LES genomes analysed formed a distinct clonal lineage when compared with the wider P. aeruginosa population. Notably, the isolates formed two clades: one associated with isolates from Canada, and the other associated with UK isolates. Further analysis of the UK LES isolates revealed clustering by clinic geography. Where isolates clustered closely together, the association was often supported by clinical data linking isolates or patients. When compared with the earliest known isolate, LESB58 (from 1988), many UK LES isolates shared common loss-of-function mutations, such as in genes gltR and fleR. Other loss-of-function mutations identified in previous studies as common adaptations during CF chronic lung infections were also identified in multiple LES isolates. Analysis of the LES accessory genome (including genomic islands and prophages) revealed variations in the carriage of large genomic regions, with some evidence for shared genomic island/prophage complement according to clinic location. Our study reveals divergence and adaptation during the spread of the LES, within the UK and between continents.
Keywords: cystic fibrosis, genomics, prophage, Pseudomonas aeruginosa
Data Summary
The GenBank accession numbers of the genomes used in this study that have previously been deposited in the National Center for Biotechnology Information (NCBI) database, along with the published studies they were analysed for, have been provided in Table S1 (available in the online version of this article). Genomes sequenced for this study have been deposited in the NCBI Genome database under the BioProject ID PRJNA615626 and their accession numbers are provided in Table S1.
Impact Statement.
Chronic lung infections caused by Pseudomonas aeruginosa are the major cause of the morbidity and mortality associated with cystic fibrosis. In the UK and Canada, a particular lineage of P. aeruginosa (the Liverpool epidemic strain, or LES) possesses unusual characteristics: it can transmit between patients and is associated with greater morbidity. In the UK, the LES lineage is the most common lineage found amongst CF isolates. Using whole-genome sequencing and comparative genomics analyses, we have analysed 145 LES isolates from across the UK and Canada, providing the most comprehensive analysis to date of the LES lineage. We demonstrate geographical clustering of LES isolates, and describe how the lineage has adapted and diverged. In particular, we show divergence between isolates in Canada compared to the UK, and variations between isolates from different UK clinics. Our analysis has identified adaptations that might contribute to the success of this lineage as a pathogen in cystic fibrosis.
Introduction
Chronic lung infections with the versatile pathogen Pseudomonas aeruginosa remain the major cause of morbidity and mortality associated with cystic fibrosis (CF). During the infection process in the CF lung, the bacterium undergoes characteristic adaptive changes [1], making it difficult or impossible to eradicate once established. Because P. aeruginosa can prosper in a wide variety of environmental and clinical niches, exposure to it is inevitable, and it is still a wide assumption that the majority of CF patients acquire their infections from an environmental source. However, it has become clear that specialist clones of P. aeruginosa capable of patient-to-patient transmission exist [2, 3], leading to over-representation amongst isolates from CF. The occurrence of such strains presents additional challenges in terms of patient care, but their identification does enable steps to be taken to minimize transmission amongst patients [4]. One of the best-studied transmissible clones is the Liverpool epidemic strain (LES), first discovered in a children’s CF unit in Liverpool [5], but since identified as the most common single clone amongst CF isolates from patients in the UK [6]. The LES lineage has also been identified in Canada [7, 8]. The LES has a number of important characteristics, including the ability to superinfect [9], increased production of some virulence factors [10, 11], an association with increased morbidity [12], increased resistance to antimicrobials [13] and the ability to cause infections beyond the CF host, including infections of the non-CF parents of a CF patient [14] and a patient’s pet cat [15]. The strain has also been reported as causing pleural empyema in a CF patient [16] and has been isolated from patients with primary ciliary dyskinesia and non-CF bronchiectasis (unpublished data).
Since the first P. aeruginosa genome was published in 2000 [17], a number of other complete P. aeruginosa genomes have been made available [18–24]. Whole-genome sequence data have also been used for analyses of within-clone variation for the two most abundant P. aeruginosa clones globally (PA14 and clone C) [25]. The complete genome of the earliest known LES isolate (LESB58 from 1988) was published in 2009 [26], revealing a number of genomic features, including five GIs GIs) and six prophages, some of which have been implicated in the competitiveness of the strain in vivo [26, 27], and five of which can be detected as free phages in CF patient sputa [28]. Since then, 3 further genomes from LES isolates from Liverpool [29] and 12 from Ontario, Canada [8] have been reported. In addition, further LES isolate genomes have been sequenced from the adult CF unit in Liverpool in studies, suggesting the co-existence within patients of distinct LES sub-lineages [30–32].
The aim of this study was to determine the genome sequences of a larger collection of LES isolates from multiple centres throughout the UK and analyse the resulting extended dataset in order to characterize the diversity of a major CF transmissible strain in the context of geographical and temporal dissemination. Our data support the sub-division of LES isolates into distinct UK and Canadian clades, with some additional clustering by centre within the UK, and reveal mutational signatures of on-going adaptation to the CF lung environment.
Methods
Bacterial strains
Details of all the LES isolates included in the genomic analysis are given in Table S1. In total, 145 LES genomes were analysed, including the genomes of 91 isolates sequenced for this study. The majority of the newly sequenced isolates were obtained from the collection at Public Health England (PHE), Colindale or the archived collection in Liverpool. Additional UK genomes were obtained from CF centres in the West Midlands and East Midlands. Amongst the published genomes included were representatives of different sub-lineages of the LES identified in a study of isolates in the Liverpool Adult CF Unit [30].
Genome sequencing of bacterial isolates
DNA was extracted from 91 LES isolates using the Promega Wizard genomic DNA extraction kit. Whole genomes were sequenced either by the Centre of Genomic Research (CGR), Liverpool, using Nextera library preparation and an Illumina HiSeq sequencer or at Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Canada on the Illumina MiSeq platform. An additional 54 previously published LES genomes were included in the analysis [8, 29, 30]. These included a resequenced strain LESB58, labelled as genome LESB58_2. Three hundred and ninety non-LES P. aeruginosa genomes were obtained from a published P. aeruginosa genomics study [23] to represent a wider P. aeruginosa population set.
Genome assemblies
Short reads were adapter-trimmed and quality-filtered using cutadapt-1.9.2 [33] and assembled to the contig level using the A5 assembly pipeline [34]. The de novo-assembled contigs were scaffolded to the genome of LESB58 [26] using ABACAS [35]. Contigs containing sequence accessory to the LESB58 genome were appended to the end of each draft genome. All genomes were annotated with Prokka-1.12, including CRISPR prediction [36]. The pseudomonas.com database [37] was used to facilitate further assessment of gene function and the impact of mutations. Multilocus sequence types (MLSTs) were determined using mlst (https://github.com/tseemann/mlst) and the pubmlst database [38].
Core genome phylogeny
In order to assess the placement of the LES lineage in the wider P. aeruginosa population, a relaxed core P. aeruginosa tree was inferred from the LES genomes and the 390 P . aeruginosa genomes from a previous genomics study [23]. LESB58 was designated as the initial reference genome for Panseq [39], followed by random successive reference selection for genomic regions not present in previous reference genomes. Successive reference genomes were split into 500 bp fragments and pairwise aligned to all other genomes. Fragments with at least 90 % blast percentage identity to the reference in 95 % of taxa (508/535) were designated as belonging to the relaxed core, aligned with clustalw [40] and concatenated with SCaFoS [41]. The resulting 5 517 752 bp supermatrix contained 429 463 informative sites. The tree was inferred by maximum likelihood (ML) with the HKY85 substitution model and 1000 ultrafast bootstrap (UFBoot) [42] replicates for branch support using IQ-TREE-1.6.2 [43]. To assess the root position of the LES lineage, this was repeated with the 145 LES genomes and P. aeruginosa reference genomes and known outgroups: PAO1 and PA14 [17, 44]. Fragments were defined as core if they had 90 % blast percentage identity with the reference and were present in 95 % of taxa (141/147). This resulted in a 5 840 172 bp relaxed core supermatrix including 14 730 informative sites. The TIM+R5 substitution model was determined as the best fit for the data by IQ-TREE [43] model test [45] and the phylogenetic tree was inferred by ML with 1000 UFBoot replicates, where at least 95 % confidence is recommended as well supported. The tree was rooted between outgroups PAO1 and PA14 and the rest of the tree. Finally, snippy (https://github.com/tseemann/snippy) was used with default parameters to generate a substitutions-only LES core single-nucleotide polymorphism (SNP) alignment from all LES assemblies and the tree was approximated using runListCompare (https://github.com/davideyre/runListCompare), which uses IQ-TREE [43] for model selection and phylogeny inference, and ClonalFrameML [46] for the identification of recombinant sites to be removed from the alignment. LESB58 GI and prophage regions [26] were also masked. The GTR+F+G4 substitution model was selected by IQ-TREE and there were 2947 informative sites. Phylogenetic trees were visualized with either the interactive tree of life software (iTol) [47] or GrapeTree [48]. SNP distances were plotted using the Python packages seaborn (https://seaborn.pydata.org/), matplotlib [49], pandas [50] and numpy [51] with the Jupyter Notebook environment [52].
Core and accessory genes
The core and accessory gene content of the LES genomes was determined by Roary [53] using Prokka-annotated gff files as input and without splitting paralogues. This was repeated for the LES plus the wider P. aeruginosa population genomes, but with paralogue splitting. Genes determined to be ‘strict core’ in the pangenome analysis by Roary were present in >=99 % of the total set and relaxed core >=95 %. Association tests of accessory gene content and Benjamimi–Hochberg (BH) multiple testing corrections were performed by SCOARY [54] using the Roary-generated gene presence/absence matrix output as input. Abricate (https://github.com/tseeman/abricate) was used to investigate the potential presence of antibiotic resistance-conferring genes.
Structural analysis
The assembled genomes were also compared to LESB58 for large, structural differences using the blast Ring Image Generator (BRIG) [55] to identify large (10 kb or greater) deletions. The presence and absence of virulence genes in the assembled genomes was investigated using Blastable v0.4, a Python script (https://github.com/bawee/blastable) that performs a nucleotide blast+ search. To further investigate the presence and absence of LESB58 prophages and GIs, a database of LES genomes was generated using blast+ [56] makeblastdb, against which each LESB58 prophage and GI was aligned using blast+ blastn (window size 28). Percentage identity was corrected for the contribution of duplicate regions present elsewhere in the draft genome by reciprocally aligning known duplicate regions with blastn [28].
Identification of SNPs and microindels in UK LES relative to the genome of LESB58
High-quality SNPs and microindels in the UK LES genomes for which short reads were available were called against the completed LESB58 reference genome, which is the outgroup to the UK LES and the oldest dated LES isolate [26, 30]. Quality-filtered short reads were mapped to reference genome LESB58 with bwa0.7.5a mem [57]. The LESB58 genome (.fasta) was indexed with bwa-0.7.5a index [57] and samtools-0.1.18-r580 [58] faidx. A sequence dictionary was created with picard-tools-1.135 [59] –CreateSequenceDictionary. The resulting sequence alignment map (.sam) file from read mapping with bwa was converted to a binary alignment map (.bam) file using picard-tools SortSam. Duplicates were marked using picard-tools MarkDuplicates and a bam file index created with picard-tools BuildBamIndex. The Genome Analysis Toolkit-3.4 (GATK) [60] Realignor Target Creator was used to designate targets for indel realignment and indels were realigned with GATK IndelRealigner. Variants were called using GATK HaplotypeCaller (-ploidy 1-emitRefConfidence GVCF) to produce a variant call (.vcf) file prepared for genotyping. The vcf file was genotyped using GATK GenotypeGVCFs and filtered using vcffilter [61] basic filtering (DP >9 and QUAL >10). Variant annotation was performed by snpEff-4.1l [62] with the default parameters for GATK output (-eff -gatk) to the LESB58 reference genome annotation database, uid59275. Additionally, we determined whether genes had larger deletions not detectable due to lack of sequencing reads and absence of genomic context in vcf files. First bam files were indexed with samtools index and the reads aligned to a regions determined by snpEff database gene regions using samtools depth. Genes with any deletions larger than 20 bp were not considered for parsimonious assessment of convergent adaptation by loss of function.
Results
Genome assemblies
A geographically and historically diverse sample of LES isolates was selected for this study (n=91) and was combined with a set of previously sequenced LES genomes (n=54). The newly sequenced genomes were primarily isolated from CF patients, but also included some isolates from primary ciliary dyskinesia (PCD) patients, a pet cat (OSIRIS) [63], non-CF bronchiectasis (nCFBr) patients, a blood isolate (PHELES19) and the non-CF parents of a CF patient [14]. All LES genomes analysed in this study, and their geographical source (according to PHE regions) and date of isolation, if known, are presented in Table S1. The mean number of CDS per genome was 6004, with a range of 5854 (PHELES12) to 6506 (JD324). Nearly all LES genomes, with the exception of those from isolates prefixed with ‘JD’, and ‘MdLES’, were predicted to carry type 1 CRISPR sequences.
The LES genomes form a distinct lineage amongst the wider P. aeruginosa population
A relaxed core genome was extracted from a combination of all the LES genomes plus the wider P. aeruginosa population set. The LES was resolved as a monophyletic, clonal lineage (Fig. 1). The lineage is composed of two MLST sequence types, ST-146 (primarily) and ST-683, that differ in the ppsA locus. Of the 145 LES genomes, 128 were ST-146, 2 were ST-683 and 15 did not correspond to known STs, differing from ST-146 by 1 locus in each case. The wider tree largely resolved into two major P. aeruginosa phylogenetic groups, but with a few lineages being placed in neither of the major groups, consistent with previous analysis [64]. One genome from the wider population set was placed within the LES clade, namely isolate AZPAE13757_2337, associated with a respiratory tract infection (RTI) from Canada. This isolate also carries the LES serotype (06) and MLST (ST-146) and was therefore coded as LES in the gene association analysis [23].
A tree of the LES genomes included in this study, with additional genomes from the reference strains PAO1 and PA14, was reconstructed based on a relaxed core genome fragments supermatrix. The LES genomes clustered into two major phylogenetic groups, with clade I containing mainly Canadian isolates and clade II containing entirely UK-sourced isolates. The root position was determined with 100% UFBoot support to be within clade I (Fig. S1).
Phylogenetic analysis reveals clustering by clinic geography
The LES reference genomes representing the oldest known LES isolate, LESB58 (whole genome) and LESB58_2 (resequenced with Illumina data, 2.93 SNPs different) were placed between the two clades, and thus represented an outgroup to the rest of the UK genomes (Fig. 2). There were two main exceptions to the general rule of separation according to country of origin. Clade I contained two LES isolate genomes sampled from the UK: LiP6 and PHELES19. Further investigation revealed that the patient infected with LiP6 had lived in Canada, but no further information was available for the patient infected with PHELES19, other than the fact that this was a blood isolate. Within clade II there was phylogenetic evidence for further clustering by source of isolation within the UK and between patients known to be in contact with one another. Where genomes from one region or clinic were placed within a UK sub-clade dominated by genomes from a different clinic or region, further information was sought about possible previous patient links to the region. PHELES10 and PHELES16 were isolated from patients in the East of England in 2014, but fall within a majority East Midlands clade. Both patients were recorded as attending and presenting samples prior to 2014 in the East Midlands CF unit. PHELES1 was isolated from a patient in Scotland, but falls within a majority East Midlands clade; this patient had previously attended the East Midlands CF clinic. Although many sub-clades of the LES had a clear geographical association (Fig. 2), there were some examples of isolates from different origins clustering. For example, isolates PHELES20, 4, 15, 12 and 23 cluster together despite being from different geographical locations. However, we do not have sufficient information about patient histories to explain this.
The LES tree was corrected for recombination after likely recombinant regions were detected in 30 inner nodes of the tree and 109 tip-branches (Fig. 2). Within clade I the median recombination corrected pairwise branch length (193.92 SNPs) was longer than within clade II (119.72 SNPs), including LESB58 (Figs S2 and S3a). The longer clade I branch lengths could either be due to the lineages being older, sparse sampling or that the clade II genomes by comparison have transmitted faster. Overall, the median branch lengths within the LES tips were 131.11 SNPs (of 10 440 exclusive pairs).
Phylogenetic analysis reveals clustering supported by clinical history
Individual CF patients (identified as isolates that are designated with names commencing with ‘CF’ followed by a common number, for example CF5S120, CF5S138 and CF5S134, all from patient CF5) typically clustered together. In a previous study by Williams et al. [30] with extensive within-patient sampling within a single CF clinic it was possible to determine that there were coexisting distinct lineages of the LES within individual patients. The coexistence of divergent lineages within patient CF3 were further confirmed here with two separate clusters (one containing CF3S124, CF3S235, CF3S224 and CF3S108; the other containing CF3S25, CF3S219 and CF3S225). The branch length between randomly selected genomes from each cluster, CF3S235 and CF3S219 was 108 SNPs.
The shortest branch lengths were most often between within-patient samples, though some branch lengths for isolates from different patients were shorter than between within-patient isolates, suggesting recent transmission or conversely distinct coexisting lineages within patients (Table S2). The shortest branch lengths from within-patient samples were SED11/SED20 SED1/SED11, SED1/SED20 [32, 65], with 0, 3.9 and 3.9 SNPs respectively and CF9S124/CF9S111, with 0.97 SNPs. Conversely, between-patient isolate genomes also had smaller branch lengths than some within-patient lineage branch lengths. These included 49492/30567S with 21.41 SNPs, 72G/B35 with 18.48 SNPs and PHELES27/PHELES29 with 10.69 SNPs.
There were a number of examples where isolates clustering together were linked based on clinical data. For example, isolates 139124 and 139125 (103.2 SNPs) from a single PCD patient, and isolate 119579 from their sibling (81.78 SNPs to 139 125), also a PCD patient, were adjacent on the tree. Isolates PHELES27 and PHELES29 (10.7 SNPs), and isolates 89204 and 89212 (82.7 SNPs), were isolated from siblings. Isolate CFS120 was from a CF patient and clustered with 417GLY (71.1 SNPs), isolated from their non-CF parent [63]. The isolate OSIRIS, isolated from a pet cat, clustered with the human isolate 89568 (28.21 SNPs), isolated from the owner. Isolates 79426 and 79427(166.6 SNPs), from the same empyema patient, clustered together.
Although many sub-clades of the LES had a clear geographical association (Fig. 2), there were some examples of isolates from different origins clustering. For example, isolates PHELES20, 4, 15, 12 and 23 cluster together, despite being from different geographical locations. However, we do not have sufficient information about patient histories to explain this.
The LES accessory genome
The gene content for all LES genomes in this study (the LES pangenome) was compared with the pangenome of the wider P. aeruginosa population set. To assess significant associations between genes and designated groups of either the LES or the wider population set, Fisher’s exact test (<0.05) was used and BH correction for multiple testing [54]. A pangenome with 535 genomes, including the 145 LES genomes, was constructed. There were 3251 strict core genes (present in >=99 % genomes) of a pangenome of 33 033 different genes (where paralogous groups within the same genome were classified as the same gene). Compared with the wider population set, there were 2922 genes significantly associated with the LES by absence and 1397 genes significantly associated with the LES by presence. There were 152 genes present in >=95 % of the LES genomes but <=10 % of the wider population. Of these 108 are located within LESB58 GIs or prophages, particularly LES GI-3 (48 genes), LES prophage 3 (25 genes), LES prophage 4 (17 genes) and LES GI-1 (15 genes) (Table S3). There were 22 genes that were present in >=95 % of the wider population and <=10 % of the LES (Table S4).
We conducted further analyses to identify large regions that were absent from some LES isolate genomes. We interrogated the LES genomes using abricate in order to seek evidence for horizontal acquisition of antibiotic resistance genes. In addition to identifying resistance-related genes common to all isolates, there was evidence for additional beta-lactamase genes in several isolates. In particular, isolates from three different CF patients (CF3S108, CF4S132 and CF7S115) harboured class A beta-lactamase TEM-181 genes sharing 100 % identity with a gene in Escherichia coli (accession number NG_050218.1). This gene was not carried by other LES isolates from the same patient sputum samples.
Deletions of more than 10 kb were present in 24 of the LES sequenced genomes (Fig. S4, Table S5). Twenty eight deletions were identified, with seven of the deletions being greater than 100 kb, and the largest being 249 kb. CF9S111 and CF9S124 contained the same deletion, consistent with their high sequence similarity and adjoining positions on the phylogenetic tree (Fig. 2); the same is true of JD304 and LESlike1. Most strains only contained one deletion, although four strains contained two. The deletions were non-randomly distributed in the genome, with 15 of the 24 strains having deletions in the region between 3.10 and 3.50 Mb (Fig. S4). This region of the LESB58 genome contains, amongst others, genes encoding an extracellular polysaccharide (Psl), a siderophore (pyoverdine) and a type VI secretion system (Table S6).
There were also variations in the carriage of previously reported [26] LES GIs and prophage regions (Fig. 2). LES prophage 5 was more commonly absent from the LES isolate genomes. Carriage of prophage 2 and GI5 was also highly variable, with some evidence for shared GI/prophage complement according to geographical regions (Fig. 2). For example, the absence of GI5 was a feature of isolates from the East Midlands region. Likewise, there is a cluster of isolates from the Liverpool region that lack LES prophages 2 and 5. All of the isolates from Canada lacked LES prophage 5, but there were variations with respect to both carriage and divergence for other GI and prophage regions (Fig. 2).
Mutations in LES genomes
Genomes of clade II (entirely UK-sourced) isolates for which short reads were available for high-quality read mapping and variant calling (n=124) were assessed for mutations compared with the oldest known LES isolate LESB58 (from 1988), as reference and outgroup to the clade II LES. There were 19 679 variants counted across all UK LES compared with LESB58, including 10 036 different mutations in 3742 different genes. The majority of variants by mutational impact category were amino acid substitutions (54.94 %), followed by synonymous nucleotide substitutions (20.24 %), frame-shifting mutations (9.96 %), mutations upstream of coding regions (9.56 %) and premature stop codons gained (2.60 %). The remainder (2.69 %) comprised codon insertions, codon deletions, codon changes plus insertions, mutations downstream of coding regions, stop codon loss, start codon loss, synonymous stop codon substitution and non-synonymous start codon substitutions. The majority of different mutations (7507/10 036) were only present in one genome. One hundred and six of the different mutations were present in 10 or more genomes, including 34 carried in each of 100 or more genomes. Mutations in or upstream of 20 different genes were found in all 124 UK LES genomes relative to LESB58 (Table S7). Two of these were synonymous mutations (in phuR and wbpY), and two were located upstream of genes. The remaining 16 were amino acid substitutions in genes, including in the nosR gene, encoding a regulatory protein. It is possible that in these cases, LESB58 is the isolate that has acquired a mutation. However, the genome of LESB58 is identical to strain PAO1 in the region of the nosR mutation, and the mutation identified is not present in any of the Canadian LES, suggesting that it is a genuine mutation relative to LESB58 in all the UK LES isolates.
The reported 14 bp deletion in gltR, previously believed to be fixed in UK LES genomes other than LESB58 [29], was present in 100/124 UK LES genomes. Of the 24 UK isolates not harbouring this deletion, all but one (CF54NCTC13415) were associated with a majority East Midlands cluster (from isolate 10333 to isolate SED1 on the tree in Fig. 2). This mutation is not present in any of the Canadian LES. A nonsense mutation in fleR, also not found amongst the Canadian isolates, was present in 122/124 UK LES genomes (Table S7). An additional isolate (8835) carried a 28 bp deletion in fleR spanning the same locus of the nonsense mutation. LiP1a was the only isolate genome analysed without a nonsense mutation in fleR. However, the closely related LiP1c, isolated from the same patient as LiP1a, did harbour the nonsense mutation in fleR (Fig. 2, Table S7).
There were more non-synonymous mutations in some genes than others, suggesting ongoing adaptation as the strain spread around the UK. The majority of different genes carrying a nonsense mutation represented a single acquisition (615/742), including those present in only one isolate genome. The remaining 127 different genes were inferred to have independently acquired nonsense mutations in the population between 2 and 32 times. Those with four or more independent acquisitions in the population are displayed in Table 1, and include genes associated with functions such as antibiotic resistance, quorum sensing, mucoidy, iron acquisition, amino acid metabolism, chemotaxis, cell wall/LPS/capsule synthesis and central metabolism (Table 1). The porin protein-encoding gene, oprD, had independently acquired nonsense mutations more than any other gene in the population, acquiring nonsense mutations a minimum of 32 times and carried by 51/124 genomes. A further six genomes contained larger deletions ranging from 141 to 647 bp (10.63–48.79 %) within the 1326 bp oprD gene, bringing the total number of UK LES genomes with strong genetic evidence of loss of function in OprD to 57. MexA and MexB multidrug efflux pump proteins were both inferred to be adaptive by loss of function (Table 1) and are both encoded by the mexAB-oprM operon. Taken together, the genes encoding the MexAB-OprM efflux pump independently acquired loss-of-function mutations at least 26 times, in 50 isolates.
Table 1.
No. of independent acquisitions |
Gene name |
No. of genomes |
Gene description |
---|---|---|---|
32 |
oprD |
51 |
Imipenem outer-membrane porin |
18 |
lasR |
26 |
Transcriptional regulator of quorum sensing |
21 |
mexB |
43 |
Multidrug efflux pump transporter, part of the mexAB-oprM operon |
20 |
PLES_59241 |
32 |
Hypothetical membrane protein |
13 |
mucA |
29 |
Anti-sigma factor for alginate biosynthesis |
10 |
glpT |
15 |
Sn-glycerol-3-phosphate transporter |
9 |
mltD |
55 |
Murein transglycosylase D precursor |
8 |
pncA |
10 |
Nicotinamidase |
7 |
PLES_34011 |
13 |
TonB-dependent receptor |
6 |
PLES_48481 |
16 |
Hypothetical protein |
6 |
pmrB |
6 |
Two-component regulator system |
5 |
PLES_46251 |
12 |
Putative glycosyl transferase |
5 |
opmQ |
7 |
Putative outer-membrane protein precursor |
5 |
PLES_16021 |
5 |
Transcriptional regulator (marR orthologue, involved in repression of MDR mar operon) |
5 |
mpl |
11 |
UDP-N-acetylmuramate:L-alanyl-gamma-d-glutamyl-meso-diaminopimelate ligase |
5 |
chpA |
5 |
Chemosensory system component |
5 |
PLES_51581 |
5 |
Putative S-adenosylmethionine decaboxylase proenzyme |
5 |
mdoH |
13 |
Glucosyltransferase involved in biofilm formation |
5 |
ampD |
111 |
N-acetyl-anhydromuranmyl-l-alanine amidase |
4 |
PLES_46401 |
101 |
Hypothetical protein with filamentous haemagglutinin outer-membrane domain) |
4 |
PLES_03151 |
9 |
Transcriptional regulator |
4 |
PLES_47771 |
8 |
Two-component sensor |
4 |
pchE |
7 |
Dihydroaeruginoic acid synthase |
4 |
mexA |
7 |
Multidrug efflux pump transporter, part of the mexAB-oprM operon |
4 |
PLES_07571 |
5 |
Putative oxidoreductase |
4 |
PLES_29121 |
7 |
Putative ATP-binding/permease fusion ABC transporter involved in sulphate metabolism |
Deficiency in DNA mismatch repair as a result of disruption to genes mutS, mutL, mutM and uvrD can result in the hypermutable phenotype, another adaptation that has been observed previously amongst P. aeruginosa CF isolates [66]. In the LES dataset, 10/124 genomes investigated were identified as probable hypermutators due to DNA mismatch repair deficiency. There was also evidence of cross-infection of hypermutable lineages. Isolates LiP10a/b and LiP13, from two different CF patients attending the same centre, are adjacent on the core SNP tree (Fig. 2) and carry a 4 bp deletion in mutM at the same position. In addition, isolates PHELES3 and PHELES6, also from to different patients attending the same CF centre, are adjacent on the core SNP tree (Fig. 2) and share 57 bp of a 58 bp deletion in mutM.
Discussion
Phylogenetic analysis of the LES
Previously, a limited number of LES genomes were available for comparative genomics [8, 29] and studies focused on the within-host diversity and adaptation of LES lineages during chronic lung infections [30, 65]. As well as greatly increasing the sample size for comparative genomics, the panel of LES isolates for this study significantly extends the geographical diversity of samples, and the diversity of associated pathologies. Using this extended dataset, we show that the LES forms a distinct clonal lineage, and that UK and Canadian LES isolates form distinct clades (Figs 1 and 2). There were two exceptions in the 145 isolate dataset, both of which were sourced in the UK but clustered with the isolates from Canada. For one (LiP6), further clinical data linked the patient with Canada. We were unable to find sufficient information to explain the location of the second (PHELES19). Nevertheless, the divergence between the isolates from Canada and UK, with the earliest known LES isolate (LESB58) lying between the groups, suggests that the LES may have established itself in both countries some decades ago and then undergone within-country spread.
There were seven genes with mutations leading to amino acid substitutions in all of the clade II genomes (UK) but none of the clade I genomes (Canada), and a further gene PLES_53461 with a mutation upstream in all clade II genomes. These included genes encoding the regulatory protein NosR, a predicted two-component regulatory protein (PLES_28161), transport proteins (PLES_13941 and PLES_44581), a ring cleavage dioxygenase (PLES_44361), a putative hydroxylase (PLES_48951) and DNA gyrase (gyrB). loss-of-function mutations in genes encoding two other regulatory proteins (FleR and GltR) were also found in the majority of clade II isolates. It is possible that some of these mutations have contributed to the adaptations allowing this strain to be transmissible. However, if this is the case, then the spread of the clade I isolates in Canada must have been facilitated by other adaptations.
There was evidence for clustering by geography within the UK. Again, there were exceptions, some of which could be explained by follow-up epidemiological investigations. Our sample was dominated by isolates from the North West (Liverpool) region. Notably, not all isolates from this region clustered together. Rather, there were several separate clusters, some indicating closer relationships with isolates from other regions, consistent with some sub-lineages seeding geographical spread, followed by divergence. It is not uncommon for CF patients to move from one area to another within the UK, and we cannot construct an entirely accurate natural history of the strain based on the limited clinical and epidemiological data available.
Variations in the accessory genome content
The genome of the earliest isolate of the LES (LESB58) contains large GIs and prophages, some of which harbour genes shown to contribute to competitiveness in an animal model of chronic infections [26, 27, 67]. It has also been shown that LES prophages are active as free phage in the sputum of CF patients [28]. However, it is clear that LES isolates can vary in their complement of GIs and prophages, and also that some GIs and prophages were more often absent or divergent than others. For example, carriage of LES prophage 5 was restricted to sub-sets of isolates from the same CF region (Liverpool). This might support the notion that the strain first evolved in Liverpool, given that prophage 5 was present in the earliest known isolate, and that loss of the unstable prophage occurred before spread, but we cannot rule out acquisition restricted to some isolates. Interestingly, absence of the most variable GI (GI-5) was also mostly associated with sub-sets of isolates from this same region. In contrast, absence of GI-2 or GI-3 was rare, as was complete absence of the other LES prophages, implying that they contribute to the success of their host during chronic CF infections. In particular, genes associated with prophages 3 and 4 and GI GI-3 featured prominently amongst those genes associated with the LES in comparison to other P. aeruginosa strains, including in prophage 3, genes putatively encoding a HicA/HicB toxin–antitoxin system, implicated in the stringent response and persistence [68]. Previous studies support the notion that LES prophages contribute to the establishment of chronic lung infections [27], and our data support the idea that these accessory regions are making an important contribution to the success of the LES. It is worth noting that the isolate PHELES19, from an unusual source (blood), was also unusual in that it lacked GI-3 and most of the LES prophages. Generally, isolates that clustered closely according to core genome SNP phylogeny tended to share the same complement of GIs and prophages, suggesting that instability has occurred during transmission and spread of the LES lineage, but not so rapidly that closely related isolates do not broadly share the same accessory genome content.
LES adaptations
We identified a number of loss-of-function mutations previously associated with the LES. For example, fleR mutations, associated with loss of motility, a characteristic of many P. aeruginosa isolates from CF infections [69], and a defining characteristic of the LES [29], were identified in all but one of the UK isolates but not in the isolates from Canada.
Although adaptations to the CF lung occurring by mutation have been widely reported [70], we also found evidence for adaptation via the loss of large genomic regions. We identified a large region, including genes for extracellular polysaccharide (Psl), a siderophore (pyoverdine) and a type VI secretion system, that was deleted from the genomes of 15 of the LES isolates (Fig. S4). The boundaries of these deletions differed between isolates, and additional, different deletions were present in some isolates, indicating the occurrence of independent deletion events and a tendency towards genome reduction during chronic infection. Biofilm formation is considered to be an important strategy for P. aeruginosa during chronic lung infections, and the bacterium can produce three different exopolysaccharides that contribute to this process, namely Psl, Pel and alginate [71]. Alginate has been strongly implicated in P. aeruginosa CF infections because of the observation that many isolates express a mucoid phenotype due to over-production of alginate because of mutations, especially in mucA [72, 73]. Although the contributions of Pel and Psl during CF infections are less well understood, it has been shown that a LES isolate favours expression of pel genes over psl genes in a murine model of chronic lung infection [74]. Mutations in the pyoverdine biosynthesis genes are a common adaptation in CF, and P. aeruginosa adapts to favour iron acquisition from haemoglobin rather than via siderophores [75, 76]. Hence the loss of genes encoding Psl or pyoverdine would be consistent with previous analysis of P. aeruginosa adaptions. Since the type VI secretion system is a weapon employed by bacteria to attack competitors, loss of this system in the highly competitive multi-species environment of the CF lung is harder to understand. However, it may be that in a biofilm lifestyle within the structurally heterogeneous environment of the CF lung, over time P. aeruginosa can establish itself in niches where there is no requirement for these weapons.
We also observed many independent mutations in genes previously implicated in adaptive responses during P. aeruginosa chronic lung infections [70]. Some of these were loss-of-function mutations. Mutations leading to mucoidy (mucA) or impairment of quorum sensing (lasR) are amongst the most common reported adaptations in P. aeruginosa chronic lung infections [70, 77]. Some of these were loss-of-function mutations. We also observed common antimicrobial resistance-associated mutations, including pmrB mutations, which have been associated with adaptation to the lung in a murine chronic infection model [78].
Inactivation of oprD in P. aeruginosa results in carbapenemase resistance, a growing problem with P. aeruginosa [79]. Decreased expression of oprD in LES400 and LES431 compared with PAO1 has been shown to correlate with an increase in minimum inhibitory concentrations (MICs) for meropenem [80]. Disruptive mutations in the PAO1 mltD homologue (PA1812) have been associated with increased beta-lactam antibiotic MICs [81] and in P. aeruginosa PA14 disruptive mutations in the glpT gene result in fosfomycin resistance [82]. Disruption of mpl homologues in P. aeruginosa have additionally been associated with piperacillin and ceftazidime resistance due to an accumulation of immature cell wall components and up-regulation of AmpC expression [83].
The loss of the MexAB-OprM efflux pump, due to severe mutations in any of the genes in the mexAB-oprM operon, has also been observed in non-CF bronchiectasis chronic lung infections [77]. This is harder to explain in terms of an adaptation leading to antibiotic resistance. The efflux pump has also been implicated in virulence, and it may be this alternative role that is more important in the CF lung adaptation [84].
Although we have identified mutations via analysis of the genomic data, we do not here present phenotypic data to demonstrate linkage between the genetic variations and specific phenotypic adaptations.
Conclusions
Most CF patients acquire their P. aeruginosa from environmental sources; transmissible (epidemic) strains capable of spreading by transmission between patients are therefore unusual. Here we have demonstrated that isolates identified as the LES belong to a distinct lineage when compared to other P. aeruginosa strains and separate by core genome SNP phylogeny on the basis of country of isolation (UK or Canada). In addition to transmissibility, in both nations, the LES has been associated with greater patient morbidity [12, 85]. In CF, P. aeruginosa causes long-term chronic infections, and during the course of such infections P. aeruginosa populations not only acquire adaptive mutations, but also evolve to be heterogeneous [30, 70]. For our isolate collection, we cannot know how far down this adaptive process the P. aer uginosa population sampled has travelled at the point the isolate was obtained. Hence, attempts to reconstruct a global LES evolutionary history based upon substitution rates would not yield meaningful insights. However, we observed that the oldest known isolates (LESB58, from 1988) falls between the two major clades (associated with the UK or Canada), and that there are variations between the clades in terms of signature mutations and the carriage of accessory genes. These findings suggest (a) that the LES lineage established itself separately in the UK and Canada some considerable time ago and (b) that the special characteristics of the lineage in terms of greater competitiveness, the ability to spread and associations with increased morbidity, cannot easily be assigned to shared mutations or accessory gene content.
Supplementary Data
Funding information
We acknowledge funding from the UK Cystic Fibrosis Trust (grant RS34) to support the studentship of M. P. M. R. C. L. was supported by Cystic Fibrosis Canada. I. L. would like to acknowledge funding support from Cystic Fibrosis New Zealand, the New Zealand Lotteries Board (Health) and the NHMRC (Australia).
Author contributions
The roles of the authors were as follows: C. W.: conceptualization, formal analysis, funding acquisition, supervision, writing the original draft, reviewing and editing. M. P. M.: data curation, formal analysis, investigation, writing the original draft, reviewing and editing. I. L. L.: formal analysis, writing the original draft, reviewing and editing. D. W.: formal analysis, reviewing and editing. S. P.: formal analysis. I. K.: data curation, formal analysis, investigation, methodology. N. P. T.: formal analysis and funding acquisition: formal analysis and methodology. D. T. D. K. and J. F. T.: formal analysis and methodology. J. J., L. F.: data curation and formal analysis. B. W.: software. N. L.: conceptualization and resources. S. H., S. M. and P. H.: resources. K. W. S. and M. J. W.: funding acquisition and resources. R. C. L.: funding acquisition, supervision, data curation, reviewing and editing. J. L. F.: conceptualization, funding acquisition, supervision, reviewing and editing.
Conflicts of interest
The authors declare that there are no conflicts of interest.
Footnotes
Abbreviations: BRIG, blast ring image generator; CDS, coding sequences; CF, cystic fibrosis; CRISPR, clustered regularly interspaced short palindromic repeats; GIs, genomic islands; LES, liverpool epidemic strain; MLST, multi-locus sequence typing; NCBI, National Center for Biotechnology Information; PCD, primary ciliary dyskinesia; SNP, single nucleotide polymorphism.
All supporting data, code and protocols have been provided within the article or through supplementary data files. Four supplementary figures and seven supplementary tables are available with the online version of this article.
References
- 1.Rossi E, La Rosa R, Bartell JA, Marvig RL, Haagensen JAJ, et al. Pseudomonas aeruginosa adaptation and evolution in patients with cystic fibrosis. Nat Rev Microbiol. 2020 doi: 10.1038/s41579-020-00477-5. [DOI] [PubMed] [Google Scholar]
- 2.Fothergill JL, Walshaw MJ, Winstanley C. Transmissible strains of Pseudomonas aeruginosa in cystic fibrosis lung infections. Eur Respir J. 2012;40:227–238. doi: 10.1183/09031936.00204411. [DOI] [PubMed] [Google Scholar]
- 3.Parkins MD, Somayaji R, Waters VJ. Epidemiology, Biology, and impact of clonal Pseudomonas aeruginosa infections in cystic fibrosis. Clin Microbiol Rev. 2018;31 doi: 10.1128/CMR.00019-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ashish A, Shaw M, Winstanley C, Humphreys L, Walshaw MJ. Halting the spread of epidemic Pseudomonas aeruginosa in an adult cystic fibrosis centre: a prospective cohort study. JRSM Short Rep. 2013;4:1. doi: 10.1258/shorts.2012.012018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cheng K, Smyth RL, Govan JRW, Doherty C, Winstanley C, et al. Spread of ?-lactam-resistant Pseudomonas aeruginosa in a cystic fibrosis clinic. The Lancet. 1996;348:639–642. doi: 10.1016/S0140-6736(96)05169-0. [DOI] [PubMed] [Google Scholar]
- 6.Martin K, Baddal B, Mustafa N, Perry C, Underwood A, et al. Clusters of genetically similar isolates of Pseudomonas aeruginosa from multiple hospitals in the UK. J Med Microbiol. 2013;62:988–1000. doi: 10.1099/jmm.0.054841-0. [DOI] [PubMed] [Google Scholar]
- 7.Aaron SD, Vandemheen KL, Ramotar K, Giesbrecht-Lewis T, Tullis E, et al. Infection with transmissible strains of Pseudomonas aeruginosa and clinical outcomes in adults with cystic fibrosis. JAMA J Am Med Assoc. 2010;304:2145–2153. doi: 10.1001/jama.2010.1665. [DOI] [PubMed] [Google Scholar]
- 8.Dettman JR, Rodrigue N, Aaron SD, Kassen R. Evolutionary genomics of epidemic and nonepidemic strains of Pseudomonas aeruginosa. Proc Natl Acad Sci U S A. 2013;110:21065–21070. doi: 10.1073/pnas.1307862110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.McCallum SJ, Corkill J, Gallagher M, Ledson MJ, Hart CA, et al. Superinfection with a transmissible strain of Pseudomonas aeruginosa in adults with cystic fibrosis chronically colonised by P aeruginosa . Lancet. 2001;358:558–560. doi: 10.1016/S0140-6736(01)05715-4. [DOI] [PubMed] [Google Scholar]
- 10.Salunkhe P, Smart CHM, Alun JW, Panagea S, Walshaw MJ, et al. A cystic fibrosis epidemic strain of Pseudomonas aeruginosa displays enhanced virulence and antimicrobial resistance a cystic fibrosis epidemic strain of Pseudomonas aeruginosa displays enhanced virulence and antimicrobial resistance. 2005. [DOI] [PMC free article] [PubMed]
- 11.Fothergill JL, Panagea S, Hart CA, Walshaw MJ, Pitt TL, et al. Widespread pyocyanin over-production among isolates of a cystic fibrosis epidemic strain. BMC Microbiol. 2007;7:45. doi: 10.1186/1471-2180-7-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Al-Aloul M. Increased morbidity associated with chronic infection by an epidemic Pseudomonas aeruginosa strain in CF patients. Thorax. 2004 Apr 1;59:334–336. doi: 10.1136/thx.2003.014258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ashish A, Shaw M, Winstanley C, Ledson MJ, Walshaw MJ. Increasing resistance of the Liverpool Epidemic Strain (LES) of Pseudomonas aeruginosa (Psa) to antibiotics in cystic fibrosis (CF)-A cause for concern? J Cyst Fibros. 2012;11:173–179. doi: 10.1016/j.jcf.2011.11.004. [DOI] [PubMed] [Google Scholar]
- 14.McCallum S, Gallagher M, Corkill J. Spread of an epidemic Pseudomonas aeruginosa strain from a patient with cystic fibrosis (CF) to non-CF relatives. Thorax. 2000;2002:559–561. doi: 10.1136/thorax.57.6.559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Mohan K, Fothergill JL, Storrar J, Ledson MJ, Winstanley C, et al. Transmission of Pseudomonas aeruginosa epidemic strain from a patient with cystic fibrosis to a PET cat. Thorax. 2008;63:839–840. doi: 10.1136/thx.2007.092486. [DOI] [PubMed] [Google Scholar]
- 16.Mohan K, Lakshman V, Fothergill JL, Ledson MJ, Winstanley C, et al. Empyema due to a highly transmissible Pseudomonas aeruginosa strain in an adult cystic fibrosis patient. J Med Microbiol. 2010;59:614–616. doi: 10.1099/jmm.0.014696-0. [DOI] [PubMed] [Google Scholar]
- 17.Stover CK, Pham XQ, Erwin AL, Mizoguchi SD, Warrener P, et al. Complete genome sequence of Pseudomonas aeruginosa PAO1, an opportunistic pathogen. Nature. 2000;406:959–964. doi: 10.1038/35023079. [DOI] [PubMed] [Google Scholar]
- 18.Lee DG, Urbach JM, Wu G, Feinbaum RL. Genomic analysis reveals that Pseudomonas aeruginosa virulence is combinatorial. Genome Biol. 2006;7:R90. doi: 10.1186/gb-2006-7-10-r90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mathee K, Narasimhan G, Valdes C, Qiu X, Matewish JM, et al. Dynamics of Pseudomonas aeruginosa genome evolution. Proc Natl Acad Sci U S A. 2008 Feb 26;105:3100–3105. doi: 10.1073/pnas.0711982105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Roy PH, Tetu SG, Larouche A, Elbourne L, Tremblay S, et al. Complete genome sequence of the multiresistant taxonomic outlier Pseudomonas aeruginosa PA7. PLoS One. 2010;5:e8842. doi: 10.1371/journal.pone.0008842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Stewart L, Ford A, Sangal V, Jeukens J, Boyle B, et al. Draft genomes of 12 host-adapted and environmental isolates of Pseudomonas aeruginosa and their positions in the core genome phylogeny. Pathog Dis. 2014;71:20–25. doi: 10.1111/2049-632X.12107. [DOI] [PubMed] [Google Scholar]
- 22.Freschi L, Jeukens J, Kukavica-Ibrulj I, Boyle B, Dupont MJ, et al. Clinical utilization of genomics data produced by the International Pseudomonas aeruginosa Consortium. Front Microbiol. 2015;6:1–8. doi: 10.3389/fmicb.2015.01036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kos VN, Déraspe M, McLaughlin RE, Whiteaker JD, Roy PH, et al. The resistome of Pseudomonas aeruginosa in relationship to phenotypic susceptibility. Antimicrob Agents Chemother. 2015;59:427–436. doi: 10.1128/AAC.03954-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Jeukens J, Emond-Rheault J-G, Freschi L, Kukavica-Ibrulj I, Levesque RC. Major release of 161 whole-genome sequences from the International Pseudomonas Consortium database. Microbiol Resour Announc. 2019;8:e00013–00019. doi: 10.1128/MRA.00013-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fischer S, Klockgether J, Morán Losada P, Chouvarine P, Cramer N, et al. Intraclonal genome diversity of the major Pseudomonas aeruginosa clones C and PA14. Environ Microbiol Rep. 2016;8:227–234. doi: 10.1111/1758-2229.12372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Winstanley C, Langille MGI, Fothergill JL, Kukavica-Ibrulj I, Paradis-Bleau C, et al. Newly introduced genomic prophage islands are critical determinants of in vivo competitiveness in the Liverpool epidemic strain of Pseudomonas aeruginosa . Genome. 2009;1:12–23. doi: 10.1101/gr.086082.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Lemieux AA, Jeukens J, Kukavica-Ibrulj I, Fothergill JL, Boyle B, et al. Genes required for free phage production are essential for Pseudomonas aeruginosa chronic lung infections. J Infect Dis. 2016;213:395–402. doi: 10.1093/infdis/jiv415. [DOI] [PubMed] [Google Scholar]
- 28.James CE, Davies E, V, Fothergill JL, Walshaw MJ, Beale CM, et al. Lytic activity by temperate phages of Pseudomonas aeruginosa in long-term cystic fibrosis chronic lung infections. Isme J. 2015;9:1391–1398. doi: 10.1038/ismej.2014.223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jeukens J, Boyle B, Kukavica-Ibrulj I, Ouellet MM, Aaron SD, et al. Comparative genomics of isolates of a Pseudomonas aeruginosa epidemic strain associated with chronic lung infections of cystic fibrosis patients. PLoS One. 2014;9:e87611. doi: 10.1371/journal.pone.0087611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Williams D, Evans B, Haldenby S, Walshaw MJ, Brockhurst MA, et al. Divergent, coexisting Pseudomonas aeruginosa lineages in chronic cystic fibrosis lung infections. Am J Respir Crit Care Med. 2015 Jan 15;191:775–785. doi: 10.1164/rccm.201409-1646OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Williams D, Fothergill JL, Evans B, Caples J, Haldenby S, et al. Transmission and lineage displacement drive rapid population genomic flux in cystic fibrosis airway infections of a Pseudomonas aeruginosa epidemic strain. Microb Genomics. 2018 doi: 10.1099/mgen.0.000167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Williams D, Paterson S, Brockhurst MA, Winstanley C. Refined analyses suggest that recombination is a minor source of genomic diversity in Pseudomonas aeruginosa chronic cystic fibrosis infections. Microb Genomics [Internet] 2016;2 doi: 10.1099/mgen.0.000051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
- 34.Coil D, Jospin G, Darling AE. A5-miseq: an updated pipeline to assemble microbial genomes from illumina MiSeq data. Bioinformatics. 2015 Feb 15;31:587–589. doi: 10.1093/bioinformatics/btu661. [DOI] [PubMed] [Google Scholar]
- 35.Assefa S, Keane TM, Otto TD, Newbold C, Berriman M. ABACAS: algorithm-based automatic contiguation of assembled sequences. Bioinformatics. 2009;25:1968–1969. doi: 10.1093/bioinformatics/btp347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- 37.Winsor GL, DKW L, Fleming L, Lo R, Whiteside MD, et al. Pseudomonas genome database: improved comparative analysis and population genomics capability for Pseudomonas genomes . Nucleic Acids Res. 2011;39 doi: 10.1093/nar/gkq869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Jolley KA, Maiden MCJ. BIGSdb: scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics. 2010;11:595. doi: 10.1186/1471-2105-11-595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Laing C, Buchanan C, Taboada EN, Zhang Y, Kropinski A, et al. Pan-genome sequence analysis using Panseq: an online tool for the rapid analysis of core and accessory genomic regions. BMC Bioinformatics. 2010;11:461. doi: 10.1186/1471-2105-11-461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994 Nov 11;22:4673–4680. doi: 10.1093/nar/22.22.4673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Roure B, Rodriguez-Ezpeleta N, Philippe H. SCaFoS: a tool for selection, concatenation and fusion of sequences for phylogenomics. BMC Evol Biol. 2007;7 Suppl 1:S2. doi: 10.1186/1471-2148-7-S1-S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hoang DT, Chernomor O, Von Haeseler A, Minh BQ, Vinh LS. UFBoot2: improving the ultrafast bootstrap approximation. Mol Biol Evol. 2018 doi: 10.1093/molbev/msx281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Nguyen LT, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Rahme L, Stevens E, Wolfort S. Common virulence factors for bacterial pathogenicity in plants and animals. Science (80-) 1995 Jun 30;268:1899–1902. doi: 10.1126/science.7604262. [DOI] [PubMed] [Google Scholar]
- 45.Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017 doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Didelot X, Wilson DJ. ClonalFrameML: efficient inference of recombination in whole bacterial genomes. PLoS Comput Biol. 2015 doi: 10.1371/journal.pcbi.1004041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Letunic I, Bork P. Interactive tree of life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics. 2007 Jan 1;23:127–128. doi: 10.1093/bioinformatics/btl529. [DOI] [PubMed] [Google Scholar]
- 48.Zhou Z, Alikhan N-F, Sergeant MJ, Luhmann N, Vaz C, et al. GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens. Genome Res. 2018:gr.232397.117. doi: 10.1101/gr.232397.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Hunter JD. Matplotlib: a 2D graphics environment. Comput Sci Eng. 2007 [Google Scholar]
- 50.McKinney W, Team PD. Pandas - Powerful python data analysis toolkit. 2015.
- 51.Klein B, Klein B. NumPy. Einführung in Python. 2014;3 [Google Scholar]
- 52.Kluyver T, Ragan-kelley B, Pérez F, Granger B, Bussonnier M. Jupyter Notebooks-A Publishing Format for Reproducible Computational Workflows. Positioning and Power in Academic Publishing: Players, Agents and Agendas; 2016. [Google Scholar]
- 53.Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31:3691–3693. doi: 10.1093/bioinformatics/btv421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Brynildsrud O, Bohlin J, Scheffer L, Eldholm V. Rapid scoring of genes in microbial pan-genome-wide association studies with Scoary. Genome Biol. 2016;17 doi: 10.1186/s13059-016-1108-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA. blast ring image generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011;12 doi: 10.1186/1471-2164-12-402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, et al. blast+: architecture and applications. BMC Bioinformatics. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Broad Institute Picard tools. 2016 https://broadinstitute.github.io/picard/%5Cnhttp://broadinstitute.github.io/picard/
- 60.McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Danecek P, Auton A, Abecasis G, Albers CA, Banks E, et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–2158. doi: 10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Mohan K, Fothergill JL, Storrar J, Ledson MJ, Winstanley C, et al. Transmission of Pseudomonas aeruginosa epidemic strain from a patient with cystic fibrosis to a PET cat. Thorax. 2008;63:839–840. doi: 10.1136/thx.2007.092486. [DOI] [PubMed] [Google Scholar]
- 64.Freschi L, Vincent AT, Jeukens J, Emond-Rheault JG, Kukavica-Ibrulj I, et al. The Pseudomonas aeruginosa pan-genome provides new insights on its population structure, horizontal gene transfer, and pathogenicity. Genome Biol Evol. 2019;11:109–120. doi: 10.1093/gbe/evy259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Darch SE, McNally A, Harrison F, Corander J, Barr HL, et al. Recombination is a key driver of genomic and phenotypic diversity in a Pseudomonas aeruginosa population during cystic fibrosis infection. Sci Rep. 2015;5:7649. doi: 10.1038/srep07649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Montanari S, Oliver A, Salerno P, Mena A, Bertoni G, et al. Biological cost of hypermutation in Pseudomonas aeruginosa strains from patients with cystic fibrosis. 2007;153:1445–1454. doi: 10.1099/mic.0.2006/003400-0. [DOI] [PubMed] [Google Scholar]
- 67.Davies E, V, James CE, Kukavica-Ibrulj I, Levesque RC, Brockhurst MA, et al. Temperate phages enhance pathogen fitness in chronic lung infection. Isme J. 2016 doi: 10.1038/ismej.2016.51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Moradali MF, Ghods S, Rehm BHA. Pseudomonas aeruginosa lifestyle: a paradigm for adaptation, survival, and persistence. Front Cell Infect Microbiol. 2017;7:39. doi: 10.3389/fcimb.2017.00039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Worlitzsch D, Tarran R, Ulrich M, Schwab U, Cekici A, et al. Effects of reduced mucus oxygen concentration in airway Pseudomonas infections of cystic fibrosis patients. J Clin Invest. 2002;109:317–325. doi: 10.1172/JCI13870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Winstanley C, O’Brien S, Brockhurst MA. Pseudomonas aeruginosa evolutionary adaptation and diversification in cystic fibrosis chronic lung infections. Trends Microbiol. 2016;24:327–337. doi: 10.1016/j.tim.2016.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Maunders E, Welch M. Matrix exopolysaccharides; the sticky side of biofilm formation. FEMS Microbiology Letters. 2017 doi: 10.1093/femsle/fnx120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Anthony M, Rose B, Pegler MB, Elkins M, Service H, et al. Genetic analysis of Pseudomonas aeruginosa isolates from the sputa of Australian adult cystic fibrosis patients. J Clin Microbiol. 2002;40:2772–2778. doi: 10.1128/JCM.40.8.2772-2778.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Mathee K, Ciofu O, Sternberg C, Lindum PW, Campbell JIA, et al. Mucoid conversion of Pseudomonas aeruginosa by hydrogen peroxide: a mechanism for virulence activation in the cystic fibrosis lung. Microbiology. 1999;145:1349–1357. doi: 10.1099/13500872-145-6-1349. [DOI] [PubMed] [Google Scholar]
- 74.Fothergill JL, Neill DR, Loman N, Winstanley C, Kadioglu A. Pseudomonas aeruginosa adaptation in the nasopharyngeal reservoir leads to migration and persistence in the lungs. Nat Commun. 2014 doi: 10.1038/ncomms5780. [DOI] [PubMed] [Google Scholar]
- 75.Marvig RL, Damkiær S, Hossein Khademi SM, Markussen TM, Molin S, et al. Within-host evolution of Pseudomonas aeruginosa reveals adaptation toward iron acquisition from hemoglobin. mBio. 2014;5 doi: 10.1128/mBio.00966-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Konings AF, Martin LW, Sharples KJ, Roddam LF, Latham R, et al. Pseudomonas aeruginosa uses multiple pathways to acquire iron during chronic infection in cystic fibrosis lungs. Infect Immun. 2013;81:2697–2704. doi: 10.1128/IAI.00418-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Hilliam Y, Moore MP, Lamont IL, Bilton D, Haworth CS, et al. Pseudomonas aeruginosa adaptation and diversification in the non-cystic fibrosis bronchiectasis lung. Eur Respir J. 2017;49 doi: 10.1183/13993003.02108-2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Bricio-Moreno L, Sheridan VH, Goodhead I, Armstrong S, Wong JKL, et al. Evolutionary trade-offs associated with loss of PmrB function in host-adapted Pseudomonas aeruginosa . Nat Commun. 2018 doi: 10.1038/s41467-018-04996-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Li H, Luo Y-F, Williams BJ, Blackwell TS, Xie C-M. Structure and function of oprD protein in Pseudomonas aeruginosa: from antibiotic resistance to novel therapies. Int J Med Microbiol. 2012;302:63–68. doi: 10.1016/j.ijmm.2011.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Salunkhe P, Smart CHM, Morgan JAW, Panagea S, Walshaw MJ, et al. A cystic fibrosis epidemic strain of Pseudomonas aeruginosa displays enhanced virulence and antimicrobial resistance. J Bacteriol. 2005;187:4908–4920. doi: 10.1128/JB.187.14.4908-4920.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Lamers RP, Nguyen UT, Nguyen Y, Buensuceso RNC, Burrows LL. Loss of membrane‐bound lytic transglycosylases increases outer membrane permeability and β ‐lactam sensitivity in Pseudomonas aeruginosa . Microbiologyopen. 2015;4:879–895. doi: 10.1002/mbo3.286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Castañeda-García A, Rodríguez-Rojas A, Guelfo JR, Blázquez J. The glycerol-3-phosphate permease GlpT is the only fosfomycin transporter in Pseudomonas aeruginosa. J Bacteriol [Internet] 2009 Nov 4;191:6968–6974. doi: 10.1128/JB.00748-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Tsutsumi Y, Tomita H, Tanimoto K. Identification of novel genes responsible for overexpression of ampC in Pseudomonas aeruginosa PAO1. Antimicrob Agents Chemother. 2013;57:5987–5993. doi: 10.1128/AAC.01291-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Hirakata Y, Srikumar R, Poole K, Gotoh N, Suematsu T, et al. Multidrug efflux systems play an important role in the invasiveness of Pseudomonas aeruginosa. J Exp Med. 2002 doi: 10.1084/jem.20020005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Aaron SD, Vandemheen KL, Ramotar K, Giesbrecht-Lewis T, Tullis E, et al. Infection with transmissible strains of Pseudomonas aeruginosa and clinical outcomes in adults with cystic fibrosis. JAMA - J Am Med Assoc. 2010;304:2145–2153. doi: 10.1001/jama.2010.1665. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.