Abstract
The major human bacterial pathogen, Pseudomonas aeruginosa, causes multidrug-resistant infections in people with underlying immunodeficiencies or structural lung diseases, such as Cystic Fibrosis (CF). We show that a few environmental isolates, driven by horizontal gene acquisition, have become dominant epidemic clones that have sequentially emerged and spread through global transmission networks over the past 200 years. These clones demonstrate varying intrinsic propensities for infecting CF or non-CF individuals (linked to specific transcriptional changes enabling survival within macrophages), have undergone multiple rounds of convergent, host-specific adaptation, and have eventually lost their ability to transmit between different patient groups. Our findings thus explain the pathogenic evolution of P. aeruginosa and highlight the importance of global surveillance and cross-infection prevention in averting the emergence of future epidemic clones.
P. aeruginosa is found widely in natural and man-made environments (1–5) and has become an increasingly important opportunistic human pathogen, causing acute nosocomial lung, soft tissue, and systemic infections (6), as well as chronic pulmonary infections in individuals with underlying inflammatory lung diseases, such as Chronic Obstructive Pulmonary Disease (COPD) (7), CF (8), and non-CF bronchiectasis (9), where it causes increased morbidity and mortality (8, 10). Antimicrobial resistance (AMR) in P. aeruginosa is increasing globally (recognised by its inclusion in the WHO ESKAPE pathogen list (11)) and is responsible for over 300,000 deaths annually (12).
Although individuals frequently acquire P. aeruginosa independently from the environment, hospital-based person-to-person transmission is well recognised in people with CF, leading to strict cohort and individual segregation within clinics (8), but has not been considered a risk in other patient cohorts (13). Nevertheless, epidemic clones of P. aeruginosa have been identified in both CF and non-CF infection studies (14–17), suggesting the presence of wide-spread transmission networks. In CF, these epidemic clones (such as the Liverpool Epidemic Strain; LES) are associated with accelerated inflammatory lung damage and worse clinical outcomes, demonstrating that epidemic clones can emerge with increased virulence in particular patient groups.
We therefore sought to understand the pathogenic evolution of P. aeruginosa, by defining when and how epidemic clones emerged and spread, exploring how they have adapted to specific hosts, and understanding how within-host evolution has influenced ongoing transmission patterns.
Results
Phylogenetic analysis reveals global epidemic clones
We first analysed a globally-distributed collection of 9,829 human, animal, and environmental isolates of P. aeruginosa, including 9,573 human clinical samples from 2,765 patients (14, 18–27) (Figure 1A; Table S1), which we grouped into 596 genetically-related clones (based on pairwise single nucleotide polymorphism (SNP) distances) and then stratified by the number of patients infected by each clone (Figure 1B; Figure S1; Supplementary Methods). We identified 21 major clones each containing isolates from at least 30 individuals, which we term ‘epidemic’ and refer to by their majority multi-locus sequence type (28).
Figure 1. The emergence of epidemic clones of Pseudomonas aeruginosa.
(A) Geographical location of the whole genome sequenced P. aeruginosa isolates obtained from patients, animals, and environment analysed in this study (n = 9,573). Number of samples from each location indicated by the size of blue dot. (B) Cumulative number of isolates across P. aeruginosa clones (defined by clustering genomes using the unweighted pair group method with arithmetic means; see Supplementary Methods), arranged by ascending number of genomes per clone and stratified into epidemic (n ≥30 isolates/clone; red), non-unique (1 < n < 30 isolates/clone; light brown), and unique (n = 1 isolate/clone; blue) groups. (C) Left: Maximum likelihood phylogenetic tree generated from genomes of all study isolates (major epidemic clones labelled in red). Right Bar plot representing the number of cities where each epidemic clone was found, coloured by continent. (D) Estimated date of first population expansion of 21 epidemic clones (predicted by Bayesian inference using BEAST (29)) with graph showing median and interquartile range (IQR; boxplots), 1.5 times IQR range (whiskers), and data points outside this range (black points). (E) Pangenome graph analysis of ancestral representatives of epidemic clones (n = 21) and sporadic clones (n = 80), constructed using Panaroo (39), where nodes represent clusters of orthologous genes and two nodes are connected by an edge if they are adjacent on a contig in any sample from the population, define gene gain events associated with the emergence of epidemic clones (described in detail in Figure S5) with genes highlighted that are involved in transcription (blue), defense mechanisms (purple), and inorganic ion transport and metabolism (yellow).For illustration purposes, the graph has been ordered against the genome of P. aeruginosa PAO1. Inset: magnified section of the pangenome graph is shown to illustrate node and edge structure.
We found that these epidemic clones were more likely to be detected in infected humans than in the non-built environment (Fisher exact test p = 7.80 x 10-12), caused 51 % of all clinical P. aeruginosa infections worldwide, were widely distributed across the phylogenetic tree, and had all spread globally (Figure 1C; Figure S1; Table S1).
Inferring population expansion and geography of epidemic clones over time
Since our sample collections spanned over 100 years (from 1900 to 2018), we wondered whether we could explore the historical origins of epidemic clones using Bayesian temporal reconstruction (29). We estimate that epidemic clones emerged non-synchronously between the late 17th and late 20th centuries (Figure S2) and (through Skyline demographic modelling (30); Figure S3) have each undergone at least one major population expansion between 1850 and 2000 (Figure 1D), suggesting (considering only extant clones) that P. aeruginosa has undergone relatively recent changes in host-pathogen dynamics, potentially driven by changes in human population density, migration patterns (31, 32), and/or increased susceptibility to infections (caused, for example, by increased air pollution during industrialisation (33–35)).
For epidemic clones with geographically clustered clades (association test p value <0.05), we implemented Bayesian phylogeographic methods (36) to infer the ancestral location of clones (accepting the limitations of our opportunistic sample collection). In some instances, for example ST235, we were able to find statistical support for the direction of intercontinental spread (from South America to North America and Europe, and then subsequently Asia and Africa; Figure S4), whereas, for other clones such as ST17 and ST27, we could identify only that transmission was restricted to between Europe and North America and peaked in the second half of the 20th century (Figure S4). We conclude that epidemic clones have likely arisen from ancestral locations distributed around the world.
Horizontal gene transfer may drive emergence of epidemic clones
We next asked why some P. aeruginosa clones had become epidemic and investigated whether gene acquisition, through horizontal transfer, might have driven large jumps in human infectivity through saltatory evolution (37) (meaning abrupt changes in evolutionary fitness caused by sudden large genetic changes), as previously observed in Mycobacterium abscessus (38). To accurately analyse the P. aeruginosa accessory genome, we first inferred a pan-genome graph (using Panaroo (39)), with nodes as clusters of orthologous genes and two nodes linked by an edge if they were found adjacent in any contig (Figure 1E). We then compared the accessory genomes of representatives of ancestral epidemic clones with those of sporadic isolates and found that epidemic clones had significant enrichment of genes involved in transcriptional regulation, inorganic ion transport, lipid metabolism, and protein turnover, with significant depletion of genes involved in bacterial defence and secretion (Fisher exact test, FDR = 0.1; Figure 1E; Figure S5, Table S2); suggesting that fundamental changes in bacterial physiology might have driven the development of epidemic clones.
Epidemic clones vary in their intrinsic host preference
We next examined the types of human infection caused by the epidemic clones and found that ST146, also known as the Liverpool Epidemic Strain, caused infection almost exclusively in people with CF while other clones (e.g., ST175 and ST309) caused infection only in non-CF individuals, with a range of CF and non-CF host distributions for other clones (Figure 2A). While our collection was opportunistically sampled, our findings were replicated when re-analysing existing systematic UK surveillance data (40) for the frequency of CF and non-CF lung infections caused by each epidemic clone (Figure S6). We hypothesised that this apparent clone-specific host preference might relate to intrinsic differences in bacterial behaviour between clones. We found no evidence, on pangenome analysis, for an association of host affinity with acquisition of genes with a particular function (Figure S7) and therefore we reasoned that changes in gene expression could explain differences in host preference. We therefore analysed a previous transcriptomic study of clinical P. aeruginosa isolates (nearly all from non-CF patients; (25)) that were represented in our sample collection, and found that clinical isolates clustered in transcriptional space based on the host-preference of epidemic clones (p < 0.001; Figure 2B; Supplementary Methods).
Figure 2. Variable intrinsic host preference of epidemic P. aeruginosa clones.
(A) Proportion of infections caused by epidemic clones (labelled by their majority multi-locus sequence type, ST) in cystic fibrosis (CF; red) and non-CF (blue) patients. (B) UMAP projection of transcriptomes from representative isolates of epidemic clones (25), colour-coded by the CF affinity of each clone. Expression data were pseudo-aligned to strain-specific gene indices to produce estimates of gene transcript abundance. (C) Transcriptome-wide association of gene expression with CF affinity. Transcript abundances were modelled as a response to the proportion of CF infections caused by each epidemic clone using a negative binomial generalised linear model. Volcano plot visualization of the Log2-fold expression change with CF proportion for every gene in the 99% core genome of Pseudomonas aeruginosa (center). Genes with an adjusted p-value of less than 0.05 and a log2 fold change less than -0.5 were coloured in green, genes with a log2 fold change greater than 0.5 were coloured in red. The coefficients for gene models were assessed using the Wald test (FDR = 0.05). Normalized expression counts vs CF proportions per epidemic clone with a trendline for the two genes with the lowest and highest log2 fold change, respectively, are shown above (top left/top right). Bulk RNA seq data was analysed from 241 clinical isolates of epidemic clones (25) included in our strain collection. (D) Survival of epidemic clones within wildtype (WT) or isogenic F508del knock-in THP1 macrophages at 2 and 4h post infection, expressed as fold change from 1 hour post infection showing median and interquartile range (IQR; boxplots), 1.5 times IQR range (whiskers. Experiments (carried out at least in duplicate) were performed by exposing THP1 macrophages to pooled isolates of 51 clinical isolates at a multiplicity of infection (MOI) of less than 1. Viable bacteria were isolated from macrophages at time points indicated and grown on solid media. Isolate abundance was quantified using sequence-based deconvolution. Strains with less than 1% abundance at the 1h time point were excluded from the analysis. A difference in the abundance of ST27 strains vs ST111 and ST235 strains at the 4h timepoint was assessed using a two-tailed t-test. * p-value < 0.05, ** p-value < 0.01.
We next explored whether there were any patterns in gene expression associated with clone predisposition for specific human hosts and identified, using a negative binomial generalised linear model, a clear expression signature of 624 genes positively associated, and 514 genes inversely associated, with affinity for causing CF infection (Wald test, FDR = 0.05; Figure 2C; Table S3).
Increased survival within macrophages of high CF affinity clones
To identify a potential mechanism by which differential gene expression could alter the host-preference of epidemic clones, we conducted a multi-dimensional phenotypic characterisation of 49 representative isolates (almost all from non-CF patients to minimise the contribution of secondary host-specific adaptation) from epidemic clones with low (ST235, ST111), intermediate (ST253), and high levels of CF affinity (ST17, ST27, ST146) obtained from the International Pseudomonas Consortium Database (21). We initially examined isolate behaviour in established assays of Pseudomonas virulence (biofilm formation, siderophore production, swim and twitch motility, and production of caseinase and gelatinase) but could find no correlation with clonal host affinity (Figure S8).
We therefore decided to test the ability of representative isolates of different epidemic clones to withstand intracellular killing by macrophages that, together with neutrophils, are thought to be the first line of defence against bacterial lung infection (41, 42). We found significantly increased intracellular survival and replication of isolates from the high CF affinity clone (ST27) compared to isolates from the low/intermediate clones ST111 and ST235 in both wildtype and CF (F508del homozygous) isogenic macrophage cell lines (Figure 2D; Table S4), suggesting that enhanced host innate immune evasion might explain the intrinsic success of particular epidemic clones in infecting CF patients.
Host preference of epidemic strains is mediated by DksA1 expression
To further explore the bacterial mechanisms contributing to enhanced intracellular survival of bacterial isolates from high CF affinity clones, we interrogated the differentially-expressed gene set (Figure 2C) and discovered that both the expression of the stringent response modulator DksA1 and the activation of its regulon were associated with CF affinity (Figure 3A). Since DksA1 had previously been implicated in enhancing P. aeruginosa survival within mouse macrophages and increasing tolerance to H2O2 (43), we examined the impact of deleting DksA1 on bacterial survival by using in vitro and in vivo models of non-CF and CF infection employing P. aeruginosa PAO1 wildtype and isogenic DksA1 and DksA2 double knockout (ΔDksA1,2), and complemented (ΔDksA1,2::DksA1) strains (ΔDksA1,2 double knockout mutants were used due to gene redundancy (44).
Figure 3. Activation of the DksA1 regulon contributes to Cystic Fibrosis host preference of P. aeruginosa clones.
(A) Volcano plot visualisation of the Log2-fold expression change with CF proportion for genes positively controlled (red) and negatively controlled (green) within the DskA1 regulon as defined by Fortuna et al. (44). Bulk RNA seq data was analysed from 241 clinical isolates of epidemic clones (24) included in our strain collection. (B) DksA1 promotes survival of P. aeruginosa within CF macrophages. Viable intracellular P. aeruginosa (quantified through enumeration of cell-associated colony forming units; CFU) were measured at 1h and 4h post infection of differentiated wildtype (WT) and isogenic F508del homozygous knockin (CF-F508del) THP1 cells with wildtype (blue), isogenic DskA1-DskA2 double knockout (ΔDksA1,2; pink), and knockout complemented with DksA1 ((ΔDksA1,2::DksA1; yellow) P. aeruginosa PAO1. Data (mean ± SEM) are representative of at least three independent experiments performed in at least triplicate. *** p < 0.001; ns not significant (two-tailed Student’s t-test). (C) (B) Top: Cartoon of zebrafish (created with BioRender.com) illustrating injection site for GFP-labelled fluorescent P. aeruginosa. Bottom: Representative fluorescence and DIC images of whole infected zebrafish larvae at 1 day post-infection (Scale bar: 150 μm; the labelled yolk sac is autofluorescent). (D) Survival analysis of control (top) and cftr morphant (cftr MO; bottom) zebrafish larvae infected intravenously (250-350 CFU) with P. aeruginosa PAO1 wildtype (blue), ΔDksA1,2 knockout (pink), and ΔDksA1,2::DksA1 complemented (yellow) fluorescent strains plotted as the percentage of surviving animals over 6 days (average of 2 independent experiments; n = 30-38 fish for each condition); *** p < 0.001 (Mantel-Cox Log-rank test). (E) Viable P. aeruginosa in zebrafish larvae at Day 1 post infection with P. aeruginosa PAO1 wildtype (blue), ΔDksA1,2 knockout (pink), and ΔDksA1,2::DksA1 complemented (yellow) fluorescent strains (plotted as mean ± IQR colony forming units (CFU) per fish of at least 3 independent experiments; n = 15-20 larvae per condition. *** p < 0.001; ns not significant (two-way ANOVA with Tukey’s post-test). (F,G) Control and cftr morphant zebrafish larvae with mCherry-labelled macrophages (Tg(mpeg1:mcherry-F)ump2 (45)) were intramuscularly infected with 250-350 GFP-labelled P. aeruginosa PAO1 wildtype, ΔDksA1,2 or ΔDksA1,2::DksA1 strains) and the infection tracked using real-time intravital confocal microscopy. (F) Representative 3D reconstruction of confocal imaging showing macrophages (red) and automatic classification of extracellular (grey) and intracellular (green) P. aeruginosa (Scale bar 10 μm). (G) Quantification of the number of infected macrophages at the site of injection (left) and the level of intracellular bacterial load (calculated by the volume of bacteria-associated fluorescence observed within each macrophage) at 6 hours post infection with P. aeruginosa PAO1 wildtype (blue), ΔDksA1,2 knockout (pink), and ΔDksA1,2::DksA1 complemented (yellow) fluorescent strains. Mean ± IQR of at least 54 cells per condition (from n = 4-6 larvae) recorded from 2 independent experiments. ** p < 0.01; *** p < 0.001; ns not significant (two-way ANOVA with Tukey’s post-test).
We first tested bacterial survival in macrophages and found that, while all three strains were effectively killed in wildtype THP1 macrophages, only the ΔDksA1,2 strain could be killed by isogenic CF (F508del knock-in) cells while wildtype and complemented bacteria were able to resist macrophage killing and replicate intracellularly (Figure 3B). Our findings reveal a number of important features of host-Pseudomonas interactions: firstly, that there are intrinsic defects in CF macrophages that facilitate intracellular survival of P. aeruginosa (observations that are supported by previous in vitro reports (45) and by our in vivo experiments in zebrafish, where deletion (46) or morpholino knockdown of the Cystic Fibrosis Transmembrane Regulator (cftr) compromises survival after intravenous infection (Figure S9)); secondly, that intracellular survival in CF macrophages is mediated by DskA1, raising intriguing mechanistic questions about the role of the stringent response in surviving the phagosomal environment; and finally, that differences in DksA1 expression across epidemic clones may explain their observed different abilities to survive within macrophages and, potentially as a consequence, their varying intrinsic host preferences.
To explore the potential role of DksA1 further, we examined the behaviour of fluorescently labeled wildtype, ΔDksA1,2 mutant and complemented P. aeruginosa PAO1 during in vivo infection in zebrafish larvae (Figure 3C-G). We observed increased survival of both control and cftr morpholino-treated fish after infection with ΔDksA1,2 bacteria compared to wildtype and complemented strains (Figure 3D); findings which correlated with an observed decreased survival of ΔDksA1,2 bacteria in vivo (Figure 3E).
We then exploited the optical transparency of zebrafish larvae to track the interaction of macrophages with P. aeruginosa following intramuscular infection using intravital confocal microscopy. Utilizing a fluorescent macrophage reporter fish line (Tg(mpeg1:mcherry-F)ump2; (47)), we could clearly identify macrophages and distinguish extracellular from intracellular fluorescent bacteria (Figure 3F). We found no difference in the mobilization of macrophage to the site of infection, or the proportion of infected macrophages, in control and cftr morpholino fish infected with wildtype, ΔDksA1,2 mutant or complemented P. aeruginosa PAO1 bacteria (Figure S9; Figure 3G). We did however observe a clear reduction in macrophage bacterial burden (suggesting reduced intracellular replication) following ΔDksA1,2 infection of both control and cftr morpholino fish lines (Figure 3G), confirming the critical role of DksA1 for intracellular survival during in vivo P. aeruginosa infection.
Taken together, our data indicate that intrinsic elevations in DksA1 expression levels in some epidemic clones may have enabled them to exploit potential innate immune defects in CF and adopt the specific evolutionary strategy of replicating within macrophages.
Convergent host-specific adaptation of P. aeruginosa
We next examined how, once selected from the environment, epidemic clones of P. aeruginosa have adapted to the human host through multiple rounds of within-patient evolution by analysing the recent mutation history of individual clones.
By reconstructing mutations that had likely occurred since the emergence of each clone, we found strong evidence for convergent molecular evolution, identifying 224 out of 5641 genes that had a higher total mutational burden than expected by chance (Poisson test, FDR = 0.05; Figure 4A; Table S5), which we term ‘pathoadaptive’. Mutations in these pathoadaptive genes were more likely to be non-synonymous and deleterious (by variant effect annotation (48)) than those found in other genes (Fisher exact test p < 1.0 x 10-16), and were predicted to be more likely to cause protein dysfunction, as estimated by both sequence conservation methods (SIFT (49); Wilcoxon rank-sum test, p = 9.04 x 10-15) and structural modelling approaches (FoldX (50); Wilcoxon rank-sum test, p = 1.34 x 10-6) (Figure S10), suggesting that pathoadaptation is largely driven by loss-of-function mutations. We explored the functional impact of pathoadaptive mutations experimentally by using existing RNAseq datasets (25) to examine the effect of transcription factor variants on expression of their previously characterised regulons (51–54) and found that clinical isolates with pathoadaptive variants in several transcription factor had statistically significant shifts in regulon expression levels compared to controls (two tailed t-test with adjusted p-value < 0.00014; Figure S10), supporting the concept of a general loss-of-function evolutionary process driving P. aeruginosa pathoadaptation.
Figure 4. Host-specific pathoadaptation of P. aeruginosa.
(A) Manhattan plot showing nominal p values (plotted as -Log10) from genome-wide mutational burden test across all genes in P. aeruginosa PAO1. Significance was assessed using a Poisson test comparing the expected and observed number of mutations in each gene accounting for the proportion of genomes that gene was found in the pan-genome (FDR = 0.1; genes with a significant mutational burden, termed pathoadaptive, shown in black, others in grey). (B) UMAP projections of host adaptation of isolates (based on acquired mutations in pathoadaptive genes) colour-coded by (left) number of pathoadaptive mutations and (right) type of infection (centroids denoted by larger dots). Isolates without any pathoadaptive mutations were removed from the analysis. (C) Protein-protein interaction network for the pathoadaptive genes extracted from the STRING database (only main connected component shown, full graphs shown in Fig. S12; (56)). Genes are shown as nodes which are connected by an edge if they had an interaction reported in STRING (confidence > 0.7). Top: To estimate host-specific pathoadaption, the number of cystic fibrosis (CF) vs non-CF mutations (determined by stratifying mutations in pathoadaptive genes on terminal branches by the infection type of isolates) were compared using a Fisher exact test (FDR = 0.1) and expressed as an odds ratio for each gene. Bottom: Gene nodes were colour-coded by class of functional annotation (based on overrepresented pathways using Gene Ontology (89) biological process enrichment analysis with TopGO (57) among CF: transmembrane transport and fatty acid biosynthesis, and non-CF: transcriptional regulation and chemotaxis).
We were able to functionally annotate the majority of these pathoadaptive genes using prior published information (55), identifying many of them as having established roles in recognised pathogenic processes including biofilm formation, antibiotic resistance and LPS modification (Figure S11). The number of genes with an established function was much higher among pathoadaptive genes than in other genes (Fisher exact test p < 1.0 x 10-16; Figure S11), potentially reflecting their central role in P. aeruginosa pathobiology. We also characterised the function of 41 pathoadaptive genes experimentally by de novo screening relevant transposon mutants in a series of functional assays to quantify virulence traits (Supplementary Methods; Figure S11).
We next examined the nature of host adaptation achieved by individual P. aeruginosa isolates by using the profile of their pathoadaptive gene mutations to map them in evolutionary space (defined by the presence or absence of mutations in the 224 pathoadaptive genes). We found that CF isolates clustered separately from others and had accumulated more mutations, suggesting that the CF lung is a distinct niche with different selective pressures compared to other lung or non-lung environments (Figure 4B).
We found that the products of these pathoadaptive genes were tightly interconnected, with more protein-protein interactions than expected by chance (STRING database (56); p < 1 x 10-16; Figure 4C; Figure S12), indicating their likely coordinated functional roles. We observed that 70 genes were more frequently mutated in CF isolates while 55 genes were more commonly mutated in non-CF isolates (Fisher exact test, FDR = 0.1). Among genes that were more commonly mutated in CF or non-CF, we found several overrepresented pathways (using Gene Ontology biological pathway enrichment analysis with TopGO (57)), suggesting that distinct functional programmes were being modified as part of host-specific adaptation (Figure 4C). For example, CF isolates were more likely to have mutated AlgU, a key regulator of mucoidy (58) (with mutations occurring predominantly at the interface between this sigma-H factor and its negative regulator protein, MucA), and PcnA, a putative nicotinamidase (with mutations found within the protein core or at sites of protein-protein interaction; Figure S13). In contrast, non-CF isolates were more likely to have mutated LadS, a calcium-responsive histidine kinase (59) (with mutations concentrated in the N-terminal (sensor) and transmembrane domains), and in the putative choline transporter BetT2 (60), with helix-breaking mutations found within the transmembrane domain (Figure S13).
Distinct evolutionary trajectories lead to host specialisation
We then used ancestral state reconstruction to determine the order of acquisition of each pathoadaptive mutation and thereby recreate the evolutionary trajectory of each isolate. We found that, on average, CF isolates had longer trajectories than non-CF isolates (with 20.5 compared to 11.2 steps, Wilcoxon signed-rank test p < 1.0 x 10-16, Figure S14). By looking at the frequency of mutations in pathoadaptive genes at each evolutionary time-step, we were able to cluster genes into 5 groups with distinct temporal signatures (Figure 5A; Figure S15), suggesting that mutations in specific genes may be important at different stages of evolution (as noted previously, specifically for AMR evolution (61)).
Figure 5. Evolutionary trajectories of P. aeruginosa during pathoadaptation.
(A) Normalised frequency of mutations over evolutionary time in specific pathoadaptive genes. The trajectories of the 50 most commonly mutated genes were manually assigned to one of 5 classes (Figure S15), based on the shape of their mutation frequency curves (relative size of each class and representative examples (with trendlines from locally-weighted smoothing) shown). (B) The relative transmissibility and host-specific adaptation of pathoadaptive genes was calculated. To estimate host-specific pathoadaptation, the number of cystic fibrosis (CF) vs non-CF mutations (determined by stratifying mutations in pathoadaptive genes on terminal branches by the infection type of isolates) were compared using a Fisher exact test (FDR = 0.1) and expressed as an odds ratio. To assess the transmissibility of pathoadaptive changes, the number of mutations that had been observed in at least two isolates were compared with mutations that had only been observed once using a Fisher exact test (FDR = 0.1). Genes were colour-coded if showing significant host-specific adaptation (blue), changes in transmissibility (purple), or both (pink). Genes with zero or infinite odds ratio not shown. (C) Functional annotation of pathoadaptive genes associated in (top) host-specific adaptation and (bottom) changes in transmissibility. (D) The number (top) and proportion (bottom) of transmission links across a range of pairwise SNP thresholds occurring between CF to CF (red), CF to non-CF (yellow), and non-CF to non-CF (blue) individuals (data were down-sized to contain equal numbers of CF and non-CF infections). (E) Transmission clusters involving patients with CF (red), non-CF (blue), or unknown status (white). Nodes representing isolates were connected by edges if pairwise SNP distances were 26 SNPs or less. This cut-off represents the 95th percentile of the within-host genetic diversity analysed in 81 patients.
We wondered therefore whether specific sets of pathoadaptive genes might be driving distinct evolutionary processes such as host-specific adaptation, person-to-person transmission, or both. To examine this, we inferred the impact of each pathoadaptive gene on host-specific pathoadaptation (by examining the relative frequency of gene mutations occurring in CF compared to non-CF lung isolates), and on bacterial transmissibility (based on the frequency of specific gene mutations being found in isolates from at least two patients) to create a map of the contribution of each pathoadaptive gene to each of these evolutionary processes (Figure 5B), annotating each gene by previously known or experimentally derived function, or the type of temporal mutation signature observed (Figure 5C; Figure S16).
We found that, while some pathoadaptive genes were associated with changes in either transmissibility or host-specific pathoadaptation, many were implicated in both processes (Figure S16). For example, mutations in several genes (such as mvfR and morA) occurred early in the evolutionary trajectories of isolates and were associated with both adaptation to the non-CF host and increased transmissibility, while mutations in other genes (such as in fusA1 and algU) occurred late in evolutionary journeys and were associated with adaptation to the CF host and decreased transmissibility (Figure S16).
Since we observed a likely deleterious impact of several pathoadaptive mutations on transmissibility, we examined whether pathoadaptation might lead to host specialisation and result in reduced transmission of isolates between CF and non-CF individuals. To explore this possibility, we used the genomic relatedness of isolates to plot the number (and proportion) of transmission links over a range of SNP pairwise thresholds (representing transmission chains of various lengths) and found strong evidence for CF-to-CF patient transmission and non-CF to non-CF patient cross-infection but very little CF to non-CF transmission (Figure 5D). Additionally, we reconstructed transmission clusters at a specific SNP threshold (26 SNPs), based on the measured genetic diversity within individual patients (Supplementary Methods), and found transmission clusters of variable sizes but very few containing both CF and non-CF patients (Figure 5E). It seems likely therefore that host-specific pathoadaptation of epidemic P. aeruginosa clades limits transmission between different hosts.
Discussion
Our findings describe the key sequential steps involved in the evolution of P. aeruginosa from an environmental organism to a major human pathogen. We identify horizontal gene acquisition as a likely driver for the emergence of epidemic clones from the environment through saltational evolution (as previously described for M. abscessus (38)) and infer their spatio-temporal spread which suggests an increasing rate of new epidemic clone expansions over time (accepting that only extant clones are considered). We identify an intrinsic and variable host-specific affinity across epidemic clones with CF preference potentially causally associated with improved intracellular survival in macrophages. We then describe how deleterious mutations in a discrete set of functionally interrelated genes likely mediate further host specialisation (through multiple rounds of within-patient adaptation) and onward transmission, thereby plausibly explain the observed lack of person-to-person transmission between CF and non-CF patients.
Our work highlights the importance of preventing pathogenic evolution by minimising cross-infection, not just within CF cohorts (where infection control measures are well established) but also between non-CF patients, and emphasises how global surveillance and targeted monitoring of high-risk patient groups will be needed to detect expansion, pathoadaptation, and transmission of new and extant epidemic P. aeruginosa clones.
Methods summary
Genomic datasets and clone assignment
We collated Pseudomonas aeruginosa genome datasets from studies of antibiotic resistance (18, 23–25), from individuals with cystic fibrosis (14, 26) and non-CF bronchiectasis (22); from the International Pseudomonas Consortium (21); and from studies targeting high-risk clones (19, 20, 27). Newly sequenced genomes from the TeleCF study, which involved adults with CF (n = 15) who underwent home monitoring for six months and were chronically infected with Pseudomonas, and from bacteaemia infections (n = 365) as part of the UK BSAC bacteraemia resistance programme (62) and from patients attending hospitals in Cambridgeshire, UK were included. DNA was extracted using QIAxtractor (QIAgen), and samples were sequenced on the Illumina HiSeq 2000 and 2500 and X10 platforms.
Variants were called by mapping reads against the P. aeruginosa PAO1 reference genome (accession number AE004091.2) using the multiple_mappings_to_bam 1.6 pipeline with default parameters (https://github.com/sanger-pathogens/bact-gen-scripts) employing BWA (63) for mapping followed by stringent QC filtering and removing samples with an excess number of minority variants. Ariba 2.14.6 (64) was used for multi-locus sequencing typing (28). FastTree (2.1.10) was used to infer a global phylogenetic tree (65).
Clone were assigned by first grouping samples based on pairwise SNP distances using the ultra-metric pairwise group method with arithmetic means (UPGMA) and then applying a cut-off of 7000 SNPs. SNP-sites was used to infer a clone-specific alignment of variable sites (66). Gubbins version 2.4.1 (67) was used to remove recombination for individual clones with at least four available genomes.
Dating and phylogeography
Molecular dating was performed for all 21 epidemic clone separately. Potential hypermutators (distorting the temporal signal) were removed by identifying samples with an unusual ratio of transition and transversion mutations. The temporal signal was assessed with TempEst (68) by comparing collection dates with root-to-tip distances using non-dated phylogenetic tress inferred with RAxML 8.2.12 (69). The significance of the signal was assessed using a permutation test using a custom script (https://github.com/chrisruis/tree_scripts/blob/main/bootstrap_TempEst_rttd_date.R). Clones with a significant temporal signal in this test (P < 0.05) were taken forward for molecular dating with BEAST 2.6.6 (29). We modelled the population history using the coalescent Bayesian skyline population prior. Convergence was assessed with Tracer 1.7.1 (70) with 10% burn-in. For clones that didn’t pass the bootstrap randomisation test (N = 9), a uniform prior for the substitution rate was set informed by the above clones.
For clones that passed the initial test, we ran a more thorough date randomisation test as described previously (71). The estimated median substitution rates and most recent common ancestor dates for randomised BEAST runs (n=10) did not overlap with those of the runs using real collection dates, indicating a significant temporal signal. To test whether each epidemic clone has undergone a historical population expansion, we analysed Bayesian skyline plot estimates of relative genetic diversity across the posterior distribution.
The association index was computed to find evidence of geographic clustering within clone phylogenies (72). We identified clones for further spatiotemporal analysis where less than 5% of randomisations had a higher association index than the non-permuted dataset. Asymmetric phylogeographic discrete trait reconstructions of the isolate continents were then performed using the BEAST classic 1.9.0 package of BEAST 2.6.6 (29). Subsampling to account for overrepresentation of certain continents was repeated five times and results compared between subsamples. Spread 0.9.7.1 (73) was used to identify candidate migration routes between continents (Bayes factor >= 3).
Pan-genome analysis
Genomes were assembled from short-read data and Panaroo 1.2.8 (39) was used to cluster the gene sequences from all samples into gene families and to infer a graphical pan-genome, which was reduced, ordered against the P. aeruginosa PAO1 genome, pruned of long-range connections, and then visualised (see Supplementary Methods for details). Parsimony ancestral character state reconstruction was then used to infer gene gains and losses on the branches of the rooted tree leading to the ancestral epidemic and sporadic clones. Gene functions were then annotated using EggNog-mapper 2.1.6 (74), with the number of genes gained and annotated within a specific COG functional category compared using a Fisher exact test (adjusted p-value < 0.05).
Macrophage infection experiments
Isogenic F508del homozygous THP1 cells were created from wild type THP1 cells (obtained from ATCC) using CRISPR-Cas9 editing and confirmed by Sanger sequencing (see Supplementary Methods). Wild type (WT) and F508del THP-1 monocytes were cultured, seeded at 200,000 cells/mL, and differentiated into macrophages (as previously described (75)) before being exposed to pooled clinical isolates of P. aeruginosa at a multiplicity of infection (MOI) of 1:1 and then incubated at 37 °C for 1 hour before the supernatant was removed and cells were lysed at 1h time point or incubated in fresh media for further time points (2h or 4h) before supernatant removal, cell lysis, and DNA extraction and sequencing. Strain abundance was quantified using the mSWEEP 1.4.0 sequence-based deconvolution method (76). Strains with less than 1% abundance at the 1h time point were excluded from the analysis.
Transcriptomic analysis
Gene expression data for clinical P. aeruginosa strains (and the UCBPP- PA14 wildtype control strain) was obtained (25), and pseudoaligned to strain-specific gene indices to produce abundance estimates using Kallisto (77). Length-scaled abundance estimates were size-factor normalised by the median ratio method and modelled as a response to CF proportion per genomic cluster (as defined by the number of CF vs non-CF patients and environmental samples) using a negative binomial generalised linear model (GLM) with DESeq2 (78). The coefficients for gene models were assessed using the Wald test (adjusted p-value < 0.05). To assess the distribution and clustering of transcriptional diversity of strains with respect to CF proportion, we used k-means clustering (k=20) on the principal components (PCs) of the gene expression data, and then then calculated the mean standard deviation (σ) of the CF proportion by cluster (mean σ = 0.135). A permutation test was used to assess significance (see Supplementary Methods for details)
Zebrafish infection models
The following zebrafish lines were used (see Supplementary Methods for details): wild type AB line; the knockout cftr sh540 mutant (46); Tg(mpeg1:mcherry-F)ump2 line (47). The morpholino for cftr knockdown (5’-GACACATTTTGGACACTCACACCAA-3’) were prepared and injected into one-cell-stage as previously described (79). Systemic infections were achieved by microinjection of GFP-expressing P. aeruginosa strains into the caudal vein of 30 hours post-fertilization (hpf) zebrafish embryos as previously described (80), with survival post infection assessed daily and viable in vivo P. aeruginosa quantified by colony forming units (CFU) at 1 day post infection (dpi). Macrophage responses were examined by intramuscular injection of anesthetized Tg(mpeg1:mcherry-F)ump2 larvae at 3 days post fertilisation (dpf) with GFP-expressing fluorescent P. aeruginosa as previously described (79, 81). Macrophage chemotaxis, phagocytosis, and intracellular P. aeruginosa burden were quantified by confocal microscopy (see Supplementary Methods for details).
Mutational burden analysis
Treetime 0.8.1 (82) was used to reconstruct ancestral character states of every nucleotide position in every clone. We then implemented a pipeline (83) to identify single nucleotide changes and annotate variant effect in their phylogenetic context using the gene annotation from Pseudomonas.com (PAO1 107) and the ancestral character state reconstructions (55). Parsimony ancestral character state reconstruction was used to infer ancestral insertions and deletions, using SNPeff (48) for variant effect annotation.
We assessed the mutational burden of every gene based on the number of non-synonymous variants across all clones (using a Poisson test, adjusted p-value < 0.05). The 224 genes passing the adjusted p-value threshold were used to query the STRING 11.5 database (56) of protein-protein interaction. Pathoadaptive genes were assigned to 17 functional categories based on the gene products description on Pseudomonas.com (55) (Figure S1). A Fisher exact test was used to compare the number of assigned with the number of unassigned genes among pathoadaptive genes and non-hits.
Impact of amino acid changes on protein stability and structural analysis
All amino acid changes were analysed with SIFT 4G 6.2.1 (49) and FoldX 5 (50) (see Supplementary Methods for details). A two-tailed t-test was used to compare the averaged scores per gene/protein scores between mutational burden test hits and non-hits. Mutational frequencies were mapped on the structural models of the identified hotspot genes in P. aeruginosa using the Chimera molecular modelling package (84). Models were downloaded from the Protein Data Bank and UniProt (85).
Phenotyping of pathoadaptive gene mutations
PAO1 mutants with transposon insertions in 154 pathoadaptive genes (selected from the Manoil library (86) were arrayed in 96 well plates and imaged using the Phenobooth Imager (Singer Instruments) to quantify the following phenotypic traits: swimming motility, twitching motility, siderophore production, caseinase activity, gelatinase activity, and rhamnolipid production (see Supplementary Methods for details).
To assess the association between genetic variants and the expression of transcription factor (TF) regulons, gene expression data from (25) were pseudoaligned to strain-specific gene sets and the normalised expression levels of TF regulons were compared between strains with and without genetic variants using Welch’s two-sample t-tests (adjusted p-value < 0.05) (see Supplementary Methods for details).
Transmission and host selectivity of pathoadaptive mutations
To assess the transmissibility of pathoadaptive changes, the number of mutations that had been observed in at least two isolates (from different patients) was compared with hitherto untransmitted mutations using a Fisher exact test (adjusted p-value < 0.1). TopGO 2.4.6 was used for functional enrichment analysis of the host-specific Gene Ontology biological pathway annotation compared to background (57), using annotations from Pseudomonas.com (55) (p-value < 0.05).
Mutations in pathoadaptative genes were stratified by the (ancestral) infection type (CF or non-CF) of every branch based on outgroup-rooted rooted clone trees. To assess host-specific pathoadapdation, the number of CF vs non-CF mutations were compared using a Fisher exact test (adjusted p-value < 0.1). Mutations on branches with non-concordant ancestral infection types were discarded.
Trajectories were inferred as the sequence of mutations in pathoadaptive genes since the emergence of the clone ancestor as implied by the PAO1-rooted tree stratified by cystic fibrosis (CF) and non-CF infection types. Mutation frequencies were position normalised and the frequency plots of the 40 genes with the lowest p-value from the mutational-burden test were manually assigned into five groups of genes with similar frequency curve shapes. Trendlines were generated by locally-weighted smoothing.
We established a relatedness cut-off to define potential transmission links using pairwise SNP differences between pairs of isolate genomes from the same patient (n = 81 patients). We then identified potential transmission events as isolates from the same clone sampled from different patients that differed by 26 SNPs or fewer, visualised using Cytoscape.
Supplementary Material
Acknowledgements
We would like to thank Sanger Pathogen Informatics for providing informatics support, Humphrey Gardner and Robert E McLaughlin for sharing their unpublished short-read sequencing data of their previously published genomes (18), and David Aanensen and Susanne Häußler for sharing their genomic and transcriptomic data in the public domain (23, 25). We would like to thank BSAC, and the laboratory staff at the clinical microbiology lab at Cambridge Unversity Hospitals for their support in isolating the hospital strains. We are grateful to Dr Anne-Béatrice Blanc-Potard for VLM support and the gift of pMF230. We thank Dr Mai Nguyen-Chi for support with access to animal experimentation, and Pr Stephen Renshaw for support and helpful discussions. We acknowledge the ZEFIX-LPHI (ZEbraFIsh and Xenopus platform, Lphi, University of Montpellier) and the Bateson Center (University of Sheffield, Sheffield) Aquarium teams for zebrafish maintenance and care. We wish to thank the imaging facility MRI (Montpellier Ressources Imagerie), and Elodie Jublanc and Vicki Diakou for their assistance. We also thank the University of Montpellier and the University of Sheffield for support.
Funding
This work was supported by: The Wellcome Trust (107032AIA (RAF, AW, CR, AD), 226602/Z/22/Z (RAF, AW); 10224/Z/15/Z JMB, 098051(JP), 220472/Z/20/Z SK); The Botnar Foundation (6063; RAF, AW, TLB, CR, JP); The UK Cystic Fibrosis Trust (Innovation Hub grant 001 (RAF, TLB, JP, SB, AW, CR, LE), NIHR Cambridge Biomedical Research Centre (RAF, KPB), Health Enterprise East (SBRI; KPB, JR, RAF), Royal Papworth Hospital Research Innovation Fund (RAF), the Canadian Institute of Health Research (168006), Cystic Fibrosis Canada (2610), Genome Quebec and Genome Canada (RCL), the Italian Ministry of University and Research (art. 1, commi 314-337 Legge 232/2016; GR, LL), the Italian Cystic Fibrosis Research Foundation (FFC 10/2023; GR), Horizon 2020 Research (H2020-MSCA-IF-2016; AB), the Marie Skłodowska-Curie Innovation Framework Program (CFZEBRA (751977); AB), the FC3R through the ZFishforCFCare project (22FC3R-017; AB) and the French cystic fibrosis foundation Vaincre La Mucoviscidose (VLM, RF20220503060; SP).
Footnotes
Author contributions
AW, JP and RAF conceived the project. AW, RCL, JP and RAF designed the experiments and wrote the manuscript. AW performed the bioinformatic analyses. AD and AW performed the differential expression analysis. AW and CR performed the Bayesian phylodynamic analysis. APP and TLB performed the computational structural modelling. LE and MW performed the virulence phenotyping assays. LS and PTH made the THP1 F508 knockin cell line. KB, JMB, SK, and AW performed the THP1 in vitro screen, sequencing and analysis. EU, KB, JR, JMB and RAF undertook the TeleCF study, samples collection, DNA extraction, and analysis. BB, KJ, RB and SJP conducted the bacteraemia sample collection and processing. GR and LL generated knockout bacterial lines and contributed to experimental design. AB and SP conceived and designed experiments and analysed data for the in vivo testing in zebrafish models. SP generated fluorescent P. aeruginosa strains. NRT provided feedback on the project outline and access to the Sanger computing facilities. JG, JF and RL provided clinical isolates and contributed to analysis. RAF and JP provided supervisory support. Competing interests: none. Ethical approvals: National Ethical approval for the TeleCF study (ClinicalTrials.gov number NCT01877707). was granted from the National Research Ethics Service (NRES) Committee of Hertfordshire, UK (REC 12/EE/0462). The study design was peer reviewed by the Cambridge Centre for Lung Infection (CCLI) and the Cambridge Institute for Medical Research (CIMR). All zebrafish (Danio rerio) procedures described in the present study were performed by authorized staff and conducted by following the 3Rs -Replacement, Reduction and Refinement- principles in compliance with the European Union guidelines for handling of laboratory animals to local standards set approved by the UK Home Office under Animal Welfare and Ethical Review Body, and by the Direction Sanitaire et Vétérinaire de l’Hérault et Comité d’Ethique pour l’Expérimentation Animale de la région Languedoc Roussillon and the French Ministry of Agriculture (authorization number: APAFIS #36309-2022040114222432). Breeding and maintenance of adult zebrafish were performed at the ZEFIX (Lphi, UMR5294, University of Montpellier, France) and the Bateson Centre (University of Sheffield, UK) Home Office-approved aquaria, according to the local animal welfare standards (license numbers CEEA-LR-B3417237 and P1A4A7A5E). The number of animals used for each procedure was guided by pilot experiments.
Data and Materials availability
Short-read DNA and RNA sequence data for the clinical isolates was downloaded from the European Nucleotide Archive (ENA). THP1 pooled infection assay DNA sequencing data was uploaded to the ENA under PRJEB20836 (Supplementary Table S4). Short-read DNA sequencing data for newly sequenced isolates genomes (or where only assemblies were previously available) was uploaded to the ENA (TeleCF: ERP022089, IPC: PRJNA325248, AZ: PRJEB66158, LES: PRJEB69223, Bronchiectasis UK PRJEB69219). ENA run accessions can be found in Supplementary Table S1. The analysis codes and the PhyloEffects software were made available on GitHub and snapshots provided on Zenodo (83, 87). Intermediate and additional data were made available in a Zenodo repository (88).
References
- 1.Whitby JL, Rampling A. Pseudomonas Æruginosa contamination in domestic and hospital environments. Lancet. 1972;299:15–17. doi: 10.1016/s0140-6736(72)90006-2. [DOI] [PubMed] [Google Scholar]
- 2.Green SK, Schroth MN, Cho JJ, Kominos SK, Vitanza-jack VB. Agricultural plants and soil as a reservoir for Pseudomonas aeruginosa. Appl Microbiol. 1974;28:987–991. doi: 10.1128/am.28.6.987-991.1974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Pirnay J-P, Matthijs S, Colak H, Chablain P, Bilocq F, Van Eldere J, De Vos D, Zizi M, Triest L, Cornelis P. Global Pseudomonas aeruginosa biodiversity as reflected in a Belgian river. Environ Microbiol. 2005;7:969–980. doi: 10.1111/j.1462-2920.2005.00776.x. [DOI] [PubMed] [Google Scholar]
- 4.Ringen LM, Drake CH. A study of the incidence of Pseudomonas aeruginosa from various natural sources. J Bacteriol. 1952;64:841–845. doi: 10.1128/jb.64.6.841-845.1952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Crone S, Vives-Flórez M, Kvich L, Saunders AM, Malone M, Nicolaisen MH, Martínez-García E, Rojas-Acosta C, Catalina Gomez-Puerto M, Calum H, Whiteley M, et al. The environmental occurrence of Pseudomonas aeruginosa. APMIS. 2020;128:220–231. doi: 10.1111/apm.13010. [DOI] [PubMed] [Google Scholar]
- 6.ECDC. Healthcare-associated infections acquired in intensive care units Annual epidemiological report for 2017. European Centre for Disease Prevention and Control; 2019. [Google Scholar]
- 7.Murphy TF, Brauer AL, Eschberger K, Lobbins P, Grove L, Cai X, Sethi S. Pseudomonas aeruginosa in chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2008;177:853–860. doi: 10.1164/rccm.200709-1413OC. [DOI] [PubMed] [Google Scholar]
- 8.Rajan S, Saiman L. Pulmonary infections in patients with cystic fibrosis. Semin Respir Infect. 2002;17:47–56. doi: 10.1053/srin.2002.31690. [DOI] [PubMed] [Google Scholar]
- 9.Finch S, McDonnell MJ, Abo-Leyah H, Aliberti S, Chalmers JD. A Comprehensive Analysis of the Impact of Pseudomonas aeruginosa Colonization on Prognosis in Adult Bronchiectasis. Ann Am Thorac Soc. 2015;12:1602–1611. doi: 10.1513/AnnalsATS.201506-333OC. [DOI] [PubMed] [Google Scholar]
- 10.Kosorok MR, Zeng L, West SE, Rock MJ, Splaingard ML, Laxova A, Green CG, Collins J, Farrell PM. Acceleration of lung disease in children with cystic fibrosis after Pseudomonas aeruginosa acquisition. Pediatr Pulmonol. 2001;32:277–287. doi: 10.1002/ppul.2009.abs. [DOI] [PubMed] [Google Scholar]
- 11.Tacconelli E, Carrara E, Savoldi A, Harbarth S, Mendelson M, Monnet DL, Pulcini C, Kahlmeter G, Kluytmans J, Carmeli Y, Ouellette M, WHO Pathogens Priority List Working Group et al. Discovery, research, and development of new antibiotics: the WHO priority list of antibiotic-resistant bacteria and tuberculosis. Lancet Infect Dis. 2018;18:318–327. doi: 10.1016/S1473-3099(17)30753-3. [DOI] [PubMed] [Google Scholar]
- 12.Ikuta KS, et al. Global mortality associated with 33 bacterial pathogens in 2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet. 2022;400:2221–2248. doi: 10.1016/S0140-6736(22)02185-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mitchelmore PJ, Randall J, Bull MJ, Moore KA, O’Neill PA, Paszkiewicz K, Mahenthiralingam E, Scotton CJ, Sheldon CD, Withers NJ, Brown AR. Molecular epidemiology of Pseudomonas aeruginosa in an unsegregated bronchiectasis cohort sharing hospital facilities with a cystic fibrosis cohort. Thorax. 2017 doi: 10.1136/thoraxjnl-2016-209889. [DOI] [PubMed] [Google Scholar]
- 14.Stapleton PJ, Izydorcyzk C, Clark S, Blanchard A, Wang PW, Yau Y, Waters V, Guttman DS. Pseudomonas aeruginosa strain sharing in early infection among children with cystic fibrosis. Clin Infect Dis. 2020 doi: 10.1093/cid/ciaa788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Tramper-Stranders GA, van der Ent CK, Wolfs TFW, Kimpen JLL, Fleer A, Johansen U, Johansen HK, Høiby N. Pseudomonas aeruginosa diversity in distinct paediatric patient groups. Clin Microbiol Infect. 2008;14:935–941. doi: 10.1111/j.1469-0691.2008.02054.x. [DOI] [PubMed] [Google Scholar]
- 16.Woo TE, Lim R, Surette MG, Waddell B, Bowron JC, Somayaji R, Duong J, Mody CH, Rabin HR, Storey DG, Parkins MD. Epidemiology and natural history of Pseudomonas aeruginosa airway infections in non-cystic fibrosis bronchiectasis. ERJ Open Res. 2018;4 doi: 10.1183/23120541.00162-2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.AbdulWahab A, Taj-Aldeen SJ, Ibrahim E, Abdulla SH, Muhammed R, Ahmed I, Abdeen Y, Sadek O, Abu-Madi M. Genetic relatedness and host specificity of Pseudomonas aeruginosa isolates from cystic fibrosis and non-cystic fibrosis patients. Infect Drug Resist. 2014;7:309–316. doi: 10.2147/IDR.S72112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kos VN, Déraspe M, McLaughlin RE, Whiteaker JD, Roy PH, Alm RA, Corbeil J, Gardner H. The resistome of Pseudomonas aeruginosa in relationship to phenotypic susceptibility. Antimicrob Agents Chemother. 2015;59:427–436. doi: 10.1128/AAC.03954-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Moore MP, Lamont IL, Williams D, Paterson S, Kukavica-Ibrulj I, Tucker NP, Kenna DTD, Turton JF, Jeukens J, Freschi L, Wee BA, et al. Transmission, adaptation and geographical spread of the Pseudomonas aeruginosa Liverpool epidemic strain. Microb Genom. 2021;7 doi: 10.1099/mgen.0.000511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.López-Causapé C, Sommer LM, Cabot G, Rubio R, Ocampo-Sosa AA, Johansen HK, Figuerola J, Cantón R, Kidd TJ, Molin S, Oliver A. Evolution of the Pseudomonas aeruginosa mutational resistome in an international Cystic Fibrosis clone. Sci Rep. 2017;7:5555. doi: 10.1038/s41598-017-05621-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Freschi L, Vincent AT, Jeukens J, Emond-Rheault J-G, Kukavica-Ibrulj I, Dupont M-J, Charette SJ, Boyle B, Levesque RC. The Pseudomonas aeruginosa pan-genome provides new insights on its population structure, horizontal gene transfer and pathogenicity. Genome Biol Evol. 2018 doi: 10.1093/gbe/evy259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hilliam Y, Moore MP, Lamont IL, Bilton D, Haworth CS, Foweraker J, Walshaw MJ, Williams D, Fothergill JL, De Soyza A, Winstanley C. Pseudomonas aeruginosa adaptation and diversification in the non-cystic fibrosis bronchiectasis lung. Eur Respir J. 2017;49 doi: 10.1183/13993003.02108-2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Chilam J, Argimón S, Limas MT, Masim ML, Gayeta JM, Lagrada ML, Olorosa AM, Cohen V, Hernandez LT, Jeffrey B, Abudahab K, et al. Philippines Antimicrobial Resistance Surveillance Program, Genomic surveillance of Pseudomonas aeruginosa in the Philippines, 2013-2014. Western Pac Surveill Response J. 2021;12:4–18. doi: 10.5365/wpsar.2020.11.1.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Del Barrio-Tofiño E, López-Causapé C, Cabot G, Rivera A, Benito N, Segura C, Montero MM, Sorlí L, Tubau F, Gómez-Zorrilla S, Tormo N, et al. Genomics and Susceptibility Profiles of Extensively Drug-Resistant Pseudomonas aeruginosa Isolates from Spain. Antimicrob Agents Chemother. 2017;61:AAC.01589-17. doi: 10.1128/AAC.01589-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Khaledi A, Weimann A, Schniederjans M, Asgari E, Kuo T-H, Oliver A, Cabot G, Kola A, Gastmeier P, Hogardt M, Jonas D, et al. Predicting antimicrobial resistance in Pseudomonas aeruginosa with machine learning-enabled molecular diagnostics. EMBO Mol Med. 2020;12:e10264. doi: 10.15252/emmm.201910264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Marvig RL, Sommer LM, Molin S, Johansen HK. Convergent evolution and adaptation of Pseudomonas aeruginosa within patients with cystic fibrosis. Nat Genet. 2015;47:57–64. doi: 10.1038/ng.3148. [DOI] [PubMed] [Google Scholar]
- 27.Turton JF, Wright L, Underwood A, Witney AA, Chan Y-T, Al-Shahib A, Arnold C, Doumith M, Patel B, Planche TD, Green J, et al. High-Resolution Analysis by Whole-Genome Sequencing of an International Lineage (Sequence Type 111) of Pseudomonas aeruginosa Associated with Metallo-Carbapenemases in the United Kingdom. J Clin Microbiol. 2015;53:2622–2631. doi: 10.1128/JCM.00505-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Curran B, Jonas D, Grundmann H, Pitt T, Dowson CG. Development of a multilocus sequence typing scheme for the opportunistic pathogen Pseudomonas aeruginosa. J Clin Microbiol. 2004;42:5644–5649. doi: 10.1128/JCM.42.12.5644-5649.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu C-H, Xie D, Suchard MA, Rambaut A, Drummond AJ. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Comput Biol. 2014;10:e1003537. doi: 10.1371/journal.pcbi.1003537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Drummond AJ, Rambaut A, Shapiro B, Pybus OG. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol. 2005;22:1185–1192. doi: 10.1093/molbev/msi103. [DOI] [PubMed] [Google Scholar]
- 31.McNeill WH. Human Migration in Historical Perspective. Popul Dev Rev. 1984;10:1–18. [Google Scholar]
- 32.Dobson AP, Carper ER. Infectious Diseases and Human Population History. Bioscience. 1996;46:115–126. [Google Scholar]
- 33.Kelly FJ, Fussell JC. Air pollution and airway disease. Clin Exp Allergy. 2011;41:1059–1071. doi: 10.1111/j.1365-2222.2011.03776.x. [DOI] [PubMed] [Google Scholar]
- 34.Chauhan AJ, Johnston SL. Air pollution and infection in respiratory illness. Br Med Bull. 2003;68:95–112. doi: 10.1093/bmb/ldg022. [DOI] [PubMed] [Google Scholar]
- 35.Kampa M, Castanas E. Human health effects of air pollution. Environ Pollut. 2008;151:362–367. doi: 10.1016/j.envpol.2007.06.012. [DOI] [PubMed] [Google Scholar]
- 36.Lemey P, Rambaut A, Drummond AJ, Suchard MA. Bayesian phylogeography finds its roots. PLoS Comput Biol. 2009;5:e1000520. doi: 10.1371/journal.pcbi.1000520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Katsnelson MI, Wolf YI, Koonin EV. On the feasibility of saltational evolution. Proc Natl Acad Sci USA. 2019;116:21068–21075. doi: 10.1073/pnas.1909031116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Bryant JM, Brown KP, Burbaud S, Everall I, Belardinelli JM, Rodriguez-Rincon D, Grogono DM, Peterson CM, Verma D, Evans IE, Ruis C, et al. Stepwise pathogenic evolution of Mycobacterium abscessus. Science. 2021;372 doi: 10.1126/science.abb8699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Tonkin-Hill G, MacAlasdair N, Ruis C, Weimann A, Horesh G, Lees JA, Gladstone RA, Lo S, Beaudoin C, Floto RA, Frost SDW, et al. Producing polished prokaryotic pangenomes with the Panaroo pipeline. Genome Biol. 2020;21:180. doi: 10.1186/s13059-020-02090-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Martin K, Baddal B, Mustafa N, Perry C, Underwood A, Constantidou C, Loman N, Kenna DT, Turton JF. Clusters of genetically similar isolates of Pseudomonas aeruginosa from multiple hospitals in the UK. J Med Microbiol. 2013;62:988–1000. doi: 10.1099/jmm.0.054841-0. [DOI] [PubMed] [Google Scholar]
- 41.Day J, Friedman A, Schlesinger LS. Modeling the immune rheostat of macrophages in the lung in response to infection. Proc Natl Acad Sci USA. 2009;106:11246–11251. doi: 10.1073/pnas.0904846106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Craig A, Mai J, Cai S, Jeyaseelan S. Neutrophil recruitment to the lungs during bacterial pneumonia. Infect Immun. 2009;77:568–575. doi: 10.1128/IAI.00832-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Fortuna A, Collalto D, Schiaffi V, Pastore V, Visca P, Ascenzioni F, Rampioni G, Leoni L. The Pseudomonas aeruginosa DksA1 protein is involved in H2O2 tolerance and within-macrophages survival and can be replaced by DksA2. Sci Rep. 2022;12:10404. doi: 10.1038/s41598-022-14635-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Fortuna A, Bähre H, Visca P, Rampioni G, Leoni L. The two Pseudomonas aeruginosa DksA stringent response proteins are largely interchangeable at the whole transcriptome level and in the control of virulence-related traits. Environ Microbiol. 2021;23:5487–5504. doi: 10.1111/1462-2920.15693. [DOI] [PubMed] [Google Scholar]
- 45.Turton KB, Ingram RJ, Valvano MA. Macrophage dysfunction in cystic fibrosis: Nature or nurture? J Leukoc Biol. 2021;109:573–582. doi: 10.1002/JLB.4RU0620-245R. [DOI] [PubMed] [Google Scholar]
- 46.Bernut A, Loynes CA, Floto RA, Renshaw SA. Deletion of cftr Leads to an Excessive Neutrophilic Response and Defective Tissue Repair in a Zebrafish Model of Sterile Inflammation. Front Immunol. 2020;11 doi: 10.3389/fimmu.2020.01733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Bernut A, Herrmann J-L, Kissa K, Dubremetz J-F, Gaillard J-L, Lutfalla G, Kremer L. Mycobacterium abscessus cording prevents phagocytosis and promotes abscess formation. Proc Natl Acad Sci USA. 2014;111:E943–52. doi: 10.1073/pnas.1321390111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Vaser R, Adusumalli S, Leng SN, Sikic M, Ng PC. SIFT missense predictions for genomes. Nat Protoc. 2016;11:1–9. doi: 10.1038/nprot.2015.123. [DOI] [PubMed] [Google Scholar]
- 50.Delgado J, Radusky LG, Cianferoni D, Serrano L. FoldX 5.0: working with RNA, small molecules and a new graphical interface. Bioinformatics. 2019;35:4168–4169. doi: 10.1093/bioinformatics/btz184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Westbrock-Wadman S, Sherman DR, Hickey MJ, Coulter SN, Zhu YQ, Warrener P, Nguyen LY, Shawar RM, Folger KR, Stover CK. Characterization of a Pseudomonas aeruginosa efflux pump contributing to aminoglycoside impermeability. Antimicrob Agents Chemother. 1999;43:2975–2983. doi: 10.1128/aac.43.12.2975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Poole K, Tetro K, Zhao Q, Neshat S, Heinrichs DE, Bianco N. Expression of the multidrug resistance operon mexA-mexB-oprM in Pseudomonas aeruginosa: mexR encodes a regulator of operon expression. Antimicrob Agents Chemother. 1996;40:2021–2028. doi: 10.1128/aac.40.9.2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Rust L, Pesci EC, Iglewski BH. Analysis of the Pseudomonas aeruginosa elastase (lasB) regulatory region. J Bacteriol. 1996;178:1134–1140. doi: 10.1128/jb.178.4.1134-1140.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Yahr TL, Frank DW. Transcriptional organization of the trans-regulatory locus which controls exoenzyme S synthesis in Pseudomonas aeruginosa. J Bacteriol. 1994;176:3832–3838. doi: 10.1128/jb.176.13.3832-3838.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Winsor GL, Griffiths EJ, Lo R, Dhillon BK, Shay JA, Brinkman FSL. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database. Nucleic Acids Res. 2016;44:D646–53. doi: 10.1093/nar/gkv1227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, Jensen LJ, et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2018;47:D607–D613. doi: 10.1093/nar/gky1131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Alexa A, Rahnenführer J, Lengauer T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics. 2006;22:1600–1607. doi: 10.1093/bioinformatics/btl140. [DOI] [PubMed] [Google Scholar]
- 58.Martin DW, Schurr MJ, Yu H, Deretic V. Analysis of promoters controlled by the putative sigma factor AlgU regulating conversion to mucoidy in Pseudomonas aeruginosa: relationship to sigma E and stress response. J Bacteriol. 1994;176:6688–6696. doi: 10.1128/jb.176.21.6688-6696.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Broder UN, Jaeger T, Jenal U. LadS is a calcium-responsive kinase that induces acute-to-chronic virulence switch in Pseudomonas aeruginosa. Nat Microbiol. 2016;2:16184. doi: 10.1038/nmicrobiol.2016.184. [DOI] [PubMed] [Google Scholar]
- 60.Chen C, Beattie GA. Pseudomonas syringae BetT is a low-affinity choline transporter that is responsible for superior osmoprotection by choline over glycine betaine. J Bacteriol. 2008;190:2717–2725. doi: 10.1128/JB.01585-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Santi I, Manfredi P, Maffei E, Egli A, Jenal U. Evolution of Antibiotic Tolerance Shapes Resistance Development in Chronic Pseudomonas aeruginosa Infections. MBio. 2021;12 doi: 10.1128/mBio.03482-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Reynolds R, Hope R, Williams L. BSAC Working Parties on Resistance Surveillance, Survey, laboratory and statistical methods for the BSAC Resistance Surveillance Programmes. J Antimicrob Chemother. 2008;62(Suppl 2):ii15–28. doi: 10.1093/jac/dkn349. [DOI] [PubMed] [Google Scholar]
- 63.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv [q-bioGN] 2013 http://arxiv.org/abs/1303.3997 . [Google Scholar]
- 64.Hunt M, Mather AE, Sánchez-Busó L, Page AJ, Parkhill J, Keane JA, Harris SR. ARIBA: rapid antimicrobial resistance genotyping directly from sequencing reads. Microb Genom. 2017;3:e000131. doi: 10.1099/mgen.0.000131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Price MN, Dehal PS, Arkin AP. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5:e9490. doi: 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Page AJ, Taylor B, Delaney AJ, Soares J, Seemann T, Keane JA, Harris SR. SNP-sites: rapid efficient extraction of SNPs from multi-FASTA alignments. Microb Genom. 2016;2:e000056. doi: 10.1099/mgen.0.000056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Croucher NJ, Page AJ, Connor TR, Delaney AJ, Keane JA, Bentley SD, Parkhill J, Harris SR. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res. 2015;43:e15. doi: 10.1093/nar/gku1196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Rambaut A, Lam TT, Max Carvalho L, Pybus OG. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen) Virus Evol. 2016;2:vew007. doi: 10.1093/ve/vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7. Syst Biol. 2018;67:901–904. doi: 10.1093/sysbio/syy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Menardo F, Duchêne S, Brites D, Gagneux S. The molecular clock of Mycobacterium tuberculosis. PLoS Pathog. 2019;15:e1008067. doi: 10.1371/journal.ppat.1008067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Parker J, Rambaut A, Pybus OG. Correlating viral phenotypes with phylogeny: accounting for phylogenetic uncertainty. Infect Genet Evol. 2008;8:239–246. doi: 10.1016/j.meegid.2007.08.001. [DOI] [PubMed] [Google Scholar]
- 73.Bielejec F, Rambaut A, Suchard MA, Lemey P. SPREAD: spatial phylogenetic reconstruction of evolutionary dynamics. Bioinformatics. 2011;27:2910–2912. doi: 10.1093/bioinformatics/btr481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, von Mering C, Bork P. Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper. Mol Biol Evol. 2017;34:2115–2122. doi: 10.1093/molbev/msx148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Bryant JM, Grogono DM, Rodriguez-Rincon D, Everall I, Brown KP, Moreno P, Verma D, Hill E, Drijkoningen J, Gilligan P, Esther CR, et al. Emergence and spread of a human-transmissible multidrug-resistant nontuberculous mycobacterium. Science. 2016;354:751–757. doi: 10.1126/science.aaf8156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Mäklin T, Kallonen T, David S, Boinett CJ, Pascoe B, Méric G, Aanensen DM, Feil EJ, Baker S, Parkhill J, Sheppard SK, et al. High-resolution sweep metagenomics using fast probabilistic inference. Wellcome Open Res. 2020;5:14. doi: 10.12688/wellcomeopenres.15639.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34:525–527. doi: 10.1038/nbt.3519. [DOI] [PubMed] [Google Scholar]
- 78.Love M, Anders S, Huber W. Differential analysis of count data--the DESeq2 package. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Bernut A, Dupont C, Ogryzko NV, Neyret A, Herrmann J-L, Floto RA, Renshaw SA, Kremer L. CFTR Protects against Mycobacterium abscessus Infection by Fine-Tuning Host Oxidative Defenses. Cell Rep. 2019;26:1828–1840.:e4. doi: 10.1016/j.celrep.2019.01.071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Belon C, Soscia C, Bernut A, Laubier A, Bleves S, Blanc-Potard A-B. A Macrophage Subversion Factor Is Shared by Intracellular and Extracellular Pathogens. PLoS Pathog. 2015;11:e1004969. doi: 10.1371/journal.ppat.1004969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Le Moigne V, Roux A-L, Mahoudo H, Christien G, Ferroni A, Dumitrescu O, Lina G, Bouchara J-P, Plésiat P, Gaillard J-L, Canaan S, et al. Serological biomarkers for the diagnosis of Mycobacterium abscessus infections in cystic fibrosis patients. J Cyst Fibros. 2022;21:353–360. doi: 10.1016/j.jcf.2021.08.019. [DOI] [PubMed] [Google Scholar]
- 82.Sagulenko P, Puller V, Neher RA. TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol. 2018;4:vex042. doi: 10.1093/ve/vex042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Weimann A, Ruis C. PhyloEffects. Zenodo. 2024 doi: 10.5281/zenodo.10606175. [DOI] [Google Scholar]
- 84.Pettersen EF, Goddard TD, Huang CC. UCSF Chimera—a visualization system for exploratory research and analysis. Journal of. 2004 doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 85.Suzek BE, Huang H, McGarvey P, Mazumder R, Wu CH. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics. 2007;23:1282–1288. doi: 10.1093/bioinformatics/btm098. [DOI] [PubMed] [Google Scholar]
- 86.Held K, Ramage E, Jacobs M, Gallagher L, Manoil C. Sequence-verified two-allele transposon mutant library for Pseudomonas aeruginosa PAO1. J Bacteriol. 2012;194:6387–6389. doi: 10.1128/JB.01479-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Weimann A, Dinan A. Computer codes for Weimann et al - Evolution and host-specific pathoadaptation of Pseudomonas aeruginosa. Zenodo. 2024 doi: 10.5281/ZENODO.10625500. [DOI] [Google Scholar]
- 88.Weimann A. Data for Weimann et al - Evolution and host-specific pathoadaptation of Pseudomonas aeruginosa. Zenodo. 2024 doi: 10.5281/ZENODO.10600286. [DOI] [Google Scholar]
- 89.Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27:2987–2993. doi: 10.1093/bioinformatics/btr509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10 doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Jolley KA, Bray JE, Maiden MCJ. Open-access bacterial population genomics: BIGSdb software, the Pubmlst.org website and their applications. Wellcome Open Res. 2018;3:124. doi: 10.12688/wellcomeopenres.14826.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Yu G, Smith DK, Zhu H, Guan Y, Lam TT-Y. Ggtree : An r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017;8:28–36. [Google Scholar]
- 95.Page AJ, De Silva N, Hunt M, Quail MA, Parkhill J, Harris SR, Otto TD, Keane JA. Robust high-throughput prokaryote de novo assembly and improvement pipeline for Illumina data. Microb Genom. 2016;2:e000083. doi: 10.1099/mgen.0.000083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol. 2021 doi: 10.1093/molbev/msab293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Valley HC, Bukis KM, Bell A, Cheng Y, Wong E, Jordan NJ, Allaire NE, Sivachenko A, Liang F, Bihler H, Thomas PJ, et al. Isogenic cell models of cystic fibrosis-causing variants in natively expressing pulmonary epithelial cells. J Cyst Fibros. 2019;18:476–483. doi: 10.1016/j.jcf.2018.12.001. [DOI] [PubMed] [Google Scholar]
- 101.Figurski DH, Helinski DR. Replication of an origin-containing derivative of plasmid RK2 dependent on a plasmid function provided in trans. Proc Natl Acad Sci USA. 1979;76:1648–1652. doi: 10.1073/pnas.76.4.1648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Nivens DE, Ohman DE, Williams J, Franklin MJ. Role of alginate and its O acetylation in formation of Pseudomonas aeruginosa microcolonies and biofilms. J Bacteriol. 2001;183:1047–1057. doi: 10.1128/JB.183.3.1047-1057.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J R Stat Soc Series B Stat Methodol. 1995;57:289–300. [Google Scholar]
- 104.Louden BC, Haarmann D, Lynne AM. Use of Blue Agar CAS Assay for Siderophore Detection. J Microbiol Biol Educ. 2011;12:51–53. doi: 10.1128/jmbe.v12i1.249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, Yuan D, Stroe O, Wood G, Laydon A, Žídek A, et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022;50:D439–D444. doi: 10.1093/nar/gkab1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Short-read DNA and RNA sequence data for the clinical isolates was downloaded from the European Nucleotide Archive (ENA). THP1 pooled infection assay DNA sequencing data was uploaded to the ENA under PRJEB20836 (Supplementary Table S4). Short-read DNA sequencing data for newly sequenced isolates genomes (or where only assemblies were previously available) was uploaded to the ENA (TeleCF: ERP022089, IPC: PRJNA325248, AZ: PRJEB66158, LES: PRJEB69223, Bronchiectasis UK PRJEB69219). ENA run accessions can be found in Supplementary Table S1. The analysis codes and the PhyloEffects software were made available on GitHub and snapshots provided on Zenodo (83, 87). Intermediate and additional data were made available in a Zenodo repository (88).





