Skip to main content
PLOS Pathogens logoLink to PLOS Pathogens
. 2021 Mar 15;17(3):e1009418. doi: 10.1371/journal.ppat.1009418

Genomics of an endemic cystic fibrosis Burkholderia multivorans strain reveals low within-patient evolution but high between-patient diversity

Cédric Lood 1,2,, Charlotte Peeters 3,, Quentin Lamy-Besnier 1, Jeroen Wagemans 1, Daniel De Vos 4, Marijke Proesmans 5, Jean-Paul Pirnay 4, Fedoua Echahidi 6, Denis Piérard 6, Matthieu Thimmesch 7, Anca Boeras 8, Katrien Lagrou 9,10, Evelien De Canck 3, Elke De Wachter 11, Vera van Noort 2,12, Rob Lavigne 1,*, Peter Vandamme 3,*
Editor: William Navarre13
PMCID: PMC7993779  PMID: 33720991

Abstract

Burkholderia multivorans is a member of the Burkholderia cepacia complex (Bcc), notorious for its pathogenicity in persons with cystic fibrosis. Epidemiological surveillance suggests that patients predominantly acquire B. multivorans from environmental sources, with rare cases of patient-to-patient transmission. Here we report on the genomic analysis of thirteen isolates from an endemic B. multivorans strain infecting four cystic fibrosis patients treated in different pediatric cystic fibrosis centers in Belgium, with no evidence of cross-infection. All isolates share an identical sequence type (ST-742) but whole genome analysis shows that they exhibit peculiar patterns of genomic diversity between patients. By combining short and long reads sequencing technologies, we highlight key differences in terms of small nucleotide polymorphisms indicative of low rates of adaptive evolution within patient, and well-defined, hundred kbps-long segments of high enrichment in mutations between patients. In addition, we observed large structural genomic variations amongst the isolates which revealed different plasmid contents, active roles for transposase IS3 and IS5 in the deactivation of genes, and mobile prophage elements. Our study shows limited within-patient B. multivorans evolution and high between-patient strain diversity, indicating that an environmental microdiverse reservoir must be present for this endemic strain, in which active diversification is taking place. Furthermore, our analysis also reveals a set of 30 parallel adaptations across multiple patients, indicating that the specific genomic background of a given strain may dictate the route of adaptation within the cystic fibrosis lung.

Author summary

In many countries, Burkholderia multivorans is the most prevalent species within the Burkholderia cepacia complex (Bcc) found infecting the lungs of patients with cystic fibrosis (CF). Its positive identification is of immediate concern to the health of the patient as it is notoriously hard to eradicate using antibiotics and can cause necrosis of the lung tissues (cepacia syndrome). Infection control measures reduced the prevalence of B. cenocepacia in CF wards, but patients continue to acquire infections by B. multivorans from environmental sources. In most reported cases, the infecting strains are unique except in rare cases in which cross-infection is observed between patients. We report here an endemic strain of B. multivorans with sequence type ST-742 that has been infecting multiple patients, without evidence for cross-infection. We investigated the epidemiology and genomics of this ST-742 strain and show that it is microdiverse, as isolates between-patients exhibit numerous genomic differences, at scales that have not been observed previously when looking at evolutionary trajectories within-patients. Additionally, we found that the specific genomic background of a given strain may dictate the strategy of adaptation within the CF lung.

Introduction

Burkholderia cepacia complex (Bcc) bacteria are relatively rare but notorious opportunistic pathogens in cystic fibrosis (CF) patients. They are associated with higher rates of morbidity and mortality, as well as lower post-lung transplant survival [14]. Bcc infections in CF are characterized by highly variable clinical outcomes, but commonly result in a progressive decline of lung function. In extreme cases, Bcc infection can result in "cepacia syndrome", a necrotizing pneumonia and septicemia that engages a lethal prognosis for the patient [5]. Bcc infections are difficult to eradicate because the infecting strains have an innate resistance to multiple antibiotics [3,4,6].

Infection control measures and patient segregation were globally implemented in the 1990s to reduce patient-to-patient transmission and thereby the prevalence of Burkholderia cenocepacia [710]. Subsequently, Burkholderia multivorans emerged as the most prevalent Bcc representative in many countries [1,2,4,1113]. B. multivorans is considered less virulent than B. cenocepacia [6,8] and infections are often characterized as chronic with episodes of exacerbations [3,4]. Nevertheless, several cases of “cepacia syndrome” and B. multivorans epidemic outbreaks have been reported [1417].

The ability to differentiate Bcc strains has been key in understanding their epidemiology and improving infection control guidelines for the CF community. Multilocus sequence typing (MLST) is a well-established molecular technique to study the epidemiology and population structure of Bcc organisms [1820]. The Bcc MLST scheme is based on the allelic variations of seven housekeeping genes (atpD, gltB, gyrB, recA, lepA, phaC and trpB) and each strain is defined by its unique allelic profile and sequence type (ST) [18,21]. While several B. multivorans STs were shown to be globally distributed [22,23], only a limited number of genetically distinct B. multivorans outbreak strains have been described [22]. The small number of B. multivorans outbreaks and the fact that B. multivorans isolates from CF patients mostly represent unique strains strongly suggest that person-to-person transmissions are limited and that B. multivorans strains are usually acquired from environmental sources [10,12,22,24,25].

B. multivorans has been the most prevalent Bcc species in Belgian CF patients for more than three decades [1,26,27]. B. multivorans isolates from Belgian CF patients typically represent unique strains, but rather unexpectedly, several B. multivorans isolates with the same sequence type (ST-742) were found in multiple Belgian CF patients treated in different CF centers [26,28]. A study was performed to investigate the epidemiology and risk factors that play a role in the spread of these clones, but no evidence to support cross-infection was detected [26], suggesting these patients acquired this B. multivorans strain from the natural environment.

The aim of the current study was to investigate the genomic epidemiology and evolution of B. multivorans ST-742 that were isolated from respiratory samples of four Belgian CF patients over a period of at least ten years. We combined sequencing technologies to enable an optimal resolution in the de novo reconstruction of the genomes and elucidation of structural variations [29,30], with clinical and antibiotic resistance data to discriminate between ST-742 isolates. We show that ST-742 isolates of different patients have distinct genomic profiles and propose that each patient has been colonized independently from a natural reservoir of ST-742 where recombination events diversify the population of this microdiverse endemic B. multivorans strain.

Results

Isolates from four different patients share the same sequence type

Between 2009 and 2019, thirteen B. multivorans isolates with ST-742 were obtained from respiratory samples of four unrelated patients attending three different pediatric CF centers in Belgium (Fig 1, Table 1, Table A in S1 Tables) [26].

Fig 1. Timeline of B. multivorans ST-742 isolations from sputum samples in Patients 1 to 4.

Fig 1

Each dot represents an isolate (exact isolation dates in Table 1). The 2011 and 2012 isolates of patient 1 and 2012 isolates of patient 2 each originated from the same sputum sample and were studied in parallel as they displayed distinct colony morphologies. The line connecting the isolates indicates the infection timeline, with the infections of patients 2 and 4 still currently ongoing.

Table 1. List of isolates included in our study.

Isolates marked with an asterisk were obtained from the same sputum sample but displayed distinct colony morphologies.

Culture date Size (bp) CDS NCBI Accession
Patient 1 (°1999)
P1Bm2009 29/01/2009 6,569,085 5,917 CP048460-63
P1Bm2011a 31/03/2011* 6,498,339 5,830 CP048557-59
P1Bm2011b 31/03/2011* 6,531,926 5,829 CP048454-56
P1Bm2012a 16/02/2012* 6,545,859 5,828 JAAEEF000000000
P1Bm2012b 16/02/2012* 6,495,911 5,828 CP048451-53
P1Bm2013 25/07/2013 6,496,014 5,830 JAAEEE000000000
P1Bm2014 21/01/2014 6,497,010 5,831 CP048448-50
P1Bm2015 12/06/2015 6,499,180 5,833 CP048445-47
Patient 2 (°2004)
P2Bm2011a 11/07/2011* 6,705,051 5,982 CP048441-44
P2Bm2011b 11/07/2011* 6,707,647 5,983 JAAEED000000000
P2Bm2018 20/08/2018 6,661,721 5,988 JACKVR000000000
Patient 3 (°2006)
P3Bm2015 09/11/2015 6,725,592 6,045 JAAEEC000000000
Patient 4 (°1992)
P4Bm2019 23/12/2019 6,538,969 5,799 JACKVS000000000

Variability of antibiotic resistance profiles in the ST-742 isolates

We tested the susceptibility of our isolates to 20 different antibiotics and interpreted as resistant, low level of resistance, or susceptible based on EUCAST interpretation of the Minimum Inhibitory Concentrations (MIC) value when available, or the species independent Pharmacokinetics/Pharmacodynamics (PK/PD) interpretation otherwise (Fig 2). The complete table with the MIC values is given in Table B in S1 Tables.

Fig 2. Antibiotic resistance profiles of the thirteen isolates.

Fig 2

MIC values (Table B in S1 Tables) were interpreted using the EUCAST or PK/PD when available, and the results are summarized visually: Red = resistant, orange = low level of resistance, and green = sensitive.

Isolates of the first positive cultures (i.e., P1Bm2009 and P3Bm2015) were susceptible to multiple antibiotics (Fig 2). Interestingly, P2Bm2011b, and P4Bm2019 also display similar profiles despite not being the first B. multivorans isolate for the respective patients. For patient 1, pairs of isolates from the same sample with distinct colony morphologies (P1Bm2011a-b, P1Bm2012a-b) showed highly similar or identical profiles; the last isolate (P1Bm2015) was resistant to all antibiotics tested, except trimethoprim/sulfamethoxazole. For patient 2, the two isolates from the same sample with distinct colony morphologies (P2Bm2011a-b) showed different profiles: one isolate (P2Bm2011b) had a resistance profile similar to that of other initial isolates (i.e. P1Bm2009 and P3Bm2015), while the other isolate (P2Bm2011a) had a resistance profile that resembled that of later isolates such as P1Bm2012b. The resistance profile of the isolate P2Bm2018 was most similar to that of P2Bm2011a and P1Bm2012b (Fig 2).

The replicons are collinear and broadly share nucleotide sequence identity across isolates

The hybrid sequencing data, combining Nanopore and Illumina sequencing, enabled highly contiguous de novo reconstructions of the genomes for all thirteen isolates. The genomes ranged between 6.5 and 6.7 Mb in size with a G+C content of approximately 67%. The functional annotation revealed 5,837 to 6,045 coding DNA sequences (CDS) per genome (Table 1). The annotation of antibiotic resistance genes revealed a single class A beta-lactamase encoding gene that was present in all isolates and was a homolog to penA, previously identified in B. cepacia ATCC 25416 (NG_048030.1). Each of the replicons contained multiple genomic islands that included prophage elements and secondary metabolites (Table C in S1 Tables). No clustered regularly interspaced short palindromic repeats (CRISPRs) were found.

The assembly graphs (S1 Fig) confirmed the typical highly conserved genome structure of B. multivorans with three large replicons [6,23] (from here on referred to as C1, C2 and C3). The genomic sequences of each of the three chromosomes across all isolates were collinear (S2 Fig) and broadly shared nucleotide sequence identity (>99.8%).

The B. multivorans population structure reveals a unique, clonal ST-742 cluster

There are no other reports of ST-742 isolates in the Bcc PubMLST database [31]. Furthermore, MLST analysis of all publicly available B. multivorans genomes (Table D in S1 Tables) did not reveal other ST-742 isolates either. To examine the position of ST-742 isolates in the general population structure of B. multivorans a core-genome based phylogenomic analysis was performed of the genomes of our thirteen ST-742 isolates together with 97 publicly available B. multivorans genomes (Table D in S1 Tables). The core genome consisted of 995 genes that were used to build a multiple sequence alignment and construct a core-genome phylogeny (Fig 3). This analysis demonstrated that ST-742 isolates represent a unique line of descent within B. multivorans with FDAARGOS_719, a clinical isolate from the US [32], as nearest neighbor.

Fig 3. Population structure of Burkholderia multivorans.

Fig 3

We established the population structure via the isolation of core genes across all publicly available B. multivorans genomes in order to position our cluster of isolates in the population. We collapsed the subtree (highlighted in red) consisting of our thirteen ST-742 isolates as they showed to be clonal within the population. B. cenocepacia J2315 was used as outgroup. The strains highlighted in green are from the study by Silva et al. (2016) and Caballero et al. (2018), D2095 with D2214 and Bmul_CF170.1a respectively.

To further investigate the evolutionary relationships within the ST-742 cluster, a SNP analysis was performed to detect small insertions or deletions (indels) and single nucleotide polymorphisms (SNPs) in the ST-742 genomes and the FDAARGOS_719 genome. The resulting core SNP alignment was used for a second, more detailed phylogenetic analysis (Fig 4). While all patient 1 isolates clustered closely together, this was not the case for the isolates from patient 2. Indeed, the core SNP analysis revealed that the two initial patient 2 isolates (P2Bm2011a-b) were more similar to each other and to the single patient 4 isolate (P4Bm2019), than to the later isolate of patient 2 (P2Bm2018) (Fig 4).

Fig 4. Core-SNP phylogeny of the ST-742 isolates.

Fig 4

We used the information contained in the 61,845 core SNPs found between our cluster of ST-742 isolates and the strain FDAARGOS_719 to further zoom in on the evolutionary relations between the ST-742 isolates.

Distinct plasmid content and large structural variations between patients

Plasmids could be detected in at least one isolate of each of the four patients but appears to be lost during chronic infection (patient 1 and 2). Pairwise comparisons of these four plasmid sequences using BLASTn revealed that the plasmid of patient 1 isolate P1Bm2009 had only 4% coverage with high identity (97%) to the group of plasmids found in patient 2, 3, and 4 (P2Bm2011a-b, P3Bm2015, and P4Bm2019). When compared against the NCBI nr/nt database [33], we found a best BLASTn hit (83% identity over 20% sequence coverage) to the pINT23 plasmid of Burkholderia pseudomultivorans SUB-INT23-BP2 (CP013379.1). On the other hand, the group of plasmids from the first isolates of patients 2 to 4 shared a pairwise sequence identity and coverage of over 99% and had a best database hit (99% identity and 66% sequence coverage) to the pTGL1 plasmid of Burkholderia multivorans ATCC 17616 (AP009388.1). The functional content of the plasmids found in P1Bm2009 and P2Bm2011a (taken as representative of the group of plasmids found in P2Bm2011a-b, P3Bm2015, and P4Bm2019) is described in Tables E and F of S1 Tables. The plasmids differ extensively in size (72 kbp for P1Bm2009 and 162 kbp for P2Bm2011a) and in functional content. Indeed, over 24% of the large plasmid genes (P2Bm2011a) are related to DNA maintenance and repair compared to 9% in the small plasmid (P1Bm2009) (Table F of S1 Tables). Both plasmids contain proteins linked to Type IV secretion systems, with the smaller plasmid (P1Bm2009) specifically carrying a VirB4 component. Importantly, we did not identify any antibiotics resistance associated genes on those plasmids.

The long-read sequencing data were further used to survey larger structural variations between isolates (Table 2, Tables G and H of S1 Tables). This analysis revealed an active role for transposons from families IS3 and IS5 in the deactivation of genes (Table 2), particularly in the isolates from patient 2. Additionally, three complete prophages were present in all isolates except those of patient 2, in which they appear to be actively mobile. Indeed, the prophage located on C3 was absent from all isolates of patient 2, but the prophage on C2 was absent in P2Bm2011b, while the prophage on C1 was absent in P2Bm2011a and P2Bm2018.

Table 2. Large structural variations in the isolates of Patient 2–4.

By setting the isolate P1Bm2009 as a reference, we mapped the long Nanopore reads of the isolates from patients 2 to 4 to identify large structural variations that escape SNPs analysis. We omitted from this list indels found in intergenic regions (11) and hypothetical proteins (8). The full list of structural variations can be found in Tables G and H in S1 Tables.

Strains Chr Position Function
P2Bm2011b 1 17536 Insertion of IS3 into type IV secretion system tip protein VgrG
P2Bm2011b, P2Bm2018 1 59006 Insertion of IS5 into intergenic promoter of type II secretion system secretin GspD
P2Bm2018 1 482440 Deletion in long-chain fatty acid—CoA ligase
P4Bm2019 1 762540 Deletion EamA family transporter, hypothetical prot, RES family NAD+ phosphorylase
P4Bm2019 1 825939 Deletion glutamine—fructose-6-phosphate transaminase (isomerizing)
P2Bm2011b, P2Bm2018 1 860357 Large deletion including regulator proteins
P2Bm2011b, P2Bm2018 1 1084705 Deletion IS3 family transposase
P2Bm2011b 1 1126373 Insertion of IS3 into GlxA family transcriptional regulator
P2Bm2011b, P2Bm2018, P4Bm2019 1 1250973 Deletion IS256 family transposase
P2Bm2011b 1 1383320 Insertion of IS5 into ornithine acetyltransferase
P2Bm2011a, P2Bm2018 1 2112493 Prophage deletion
P2Bm2011a, P2Bm2018 1 2882469 Insertion of IS5 into PTS sugar transporter/HPr kinase/phosphorylase
P2Bm2011a,b, P2Bm2018 2 199631 Deletion in polyketide cyclase
P2Bm2011b, P2Bm2018 2 234973 Insertion of IS5 into peroxidase-related enzyme
P2Bm2011b 2 297796 Insertion of IS5 into transposase
all 2 386097 Deletion IS3 family transposase
P2Bm2011b 2 851128 Insertion of IS5 into type IV secretion system baseplate subunit TssF
P2Bm2011b 2 865794 Insertion of IS3 into DUF3304 domain-containing protein
P2Bm2018 2 906456 Insertion of IS5 into DEAD/DEAH box helicase
P2Bm2011b 2 921221 Insertion of IS5 into multidrug transporter subunit MdtN
P2Bm2011b 2 1052623 Insertion of IS5 into twin-arginine translocation pathway signal
P2Bm2011b, P2Bm2018 2 1770117 Insertion of IS5 into Tat pathway signal protein
P2Bm2011b 2 1892870 Prophage deletion
P2Bm2011b 2 2294168 Insertion of IS5 into aldo/keto reductase
P2Bm2011a,b, P2Bm2018 3 128274 Prophage deletion
P2Bm2011a,b 3 599926 Insertion of IS5 into PLP-dependent aminotransferase family protein

ST-742 isolates have regions with unusually high SNP densities in C1 and C2

To assess in which regions of the chromosomes SNPs occurred, all identical SNPs found on the different genomes of isolates from patient 1 were collapsed into a single collection of SNPs (85 in total). The same analysis was performed on the isolates P2bm2011a and P2Bm2011b from patient 2 (4,694 in total). However, the later isolate of patient 2 (P2Bm2018) was kept separated as it displayed a large number of unique variants (1,990 SNPs when compared to P2Bm2011a-b, S3 Fig).

Figs 5, 6 and 7 visually summarize the SNP densities for each chromosome and sets of isolates. The full list of SNPs is provided in Table I in S1 Tables. S3 Fig shows the number of shared SNPs among the isolates.

Fig 5. Features of chromosome 1 and SNP densities comparisons.

Fig 5

In concentric circles, from the center and outward are depicted: 1) the GC skew, 2) the GC content, 3) the genomic islands detected by the software IslandViewer 4 (in purple), the prophages detected with Phaster (green) and the secondary metabolites identified by antiSMASH (in blue), 4) a total of five SNP densities, we separated the isolates from 2011 and the isolate from 2018 in patient 2 as we regard them as two separate acquisition events, 5) and 6) the annotated features found on the negative and positive strand respectively.

Fig 6. Features of chromosome 2 and SNP densities comparisons.

Fig 6

In concentric circles, from the center and outward are depicted: 1) the GC skew, 2) the GC content, 3) the genomic islands detected by the software IslandViewer 4 (in purple), the prophages detected with Phaster (green) and the secondary metabolites identified by antiSMASH (in blue), 4) a total of five SNP densities, we separated the isolates from 2011 and the isolate from 2018 in patient 2 as we regard them as two separate acquisition events, 5) and 6) the annotated features found on the negative and positive strand respectively.

Fig 7. Feature of chromosome 3 and SNP densities comparisons.

Fig 7

In concentric circles, from the center and outward are depicted: 1) the GC skew, 2) the GC content, 3) the genomic islands detected by the software IslandViewer 4 (in purple), the prophages detected with Phaster (green) and the secondary metabolites identified by antiSMASH (in blue), 4) a total of five SNP densities, we separated the isolates from 2011 and the isolate from 2018 in patient 2 as we regard them as two separate acquisition events, 5) and 6) the annotated features found on the negative and positive strand respectively.

In general, SNPs were randomly distributed across the chromosomes, with 53, 20 and 12 SNPs on C1, C2 and C3 of isolates from patient 1, respectively. However, an enrichment of SNPs was detected in three regions for the isolates from the other patients: a first centered around position 900 kbp for C1 (Fig 5), and a second and third in C2 around 170 kbp and 310 kbp, respectively (Fig 6). No such pattern was observed in the smaller C3 (Fig 7). On C1, this region of P2Bm2011a-b, P2Bm2018, P3Bm2015 and P4Bm2019 all ended at the exact same position (991,509) but started at different positions (702,401–794,204) (Table J in S1 Tables). On C2, the enrichment of SNPs was observed in all isolates except in isolate P3Bm2015 (Fig 6). These regions all start at the same position (150,841 and 318,125) but ended at different positions (193,566–204,452 and 331,399–331,929). The analysis of the functional content found in these regions did not reveal specific enrichments for certain COG categories when compared to the whole genomic background of the strain (Table M in S1 Tables).

Genome evolution of patient 1 isolates over the course of the infection and parallel evolution in different patients

To reveal within-patient genomic variations at the level of SNPs and larger structural variations, the genomes of all sequential isolates of patient 1 were compared using isolate P1Bm2009 as a reference. When collapsing all SNPs as described above, a total of 85 unique SNPs were detected on the three chromosomes. Many of these mutations were fixed over time once they appeared (Fig 8, Table I in S1 Tables). Some genes had mutations at multiple independent positions (Fig 8, Table 3). Both pairs of isolates P1Bm2011a-b and P1Bm2012a-b, which originated from the same sputum samples but displayed a different colony morphology, also showed a distinct SNP profile. This suggests an evolution of different population lineages within this patient. These 85 SNPs occurred over six years, yielding an average mutation rate of 15 SNPs/year (S4 Fig).

Fig 8. Longitudinal evolution of isolates from patient 1.

Fig 8

Single nucleotide polymorphisms (SNPs) are shown for each of the three chromosomes of B. multivorans. When available, we additionally indicate the annotated function that is associated with the locus. The mutations that occur at multiple independent positions in the same locus, which are indicative of selective pressure, are marked with number between parentheses (Table 4).

Table 3. Loci with multiple independent mutations in patients 1 and 2.

Genes with multiple mutations in the corresponding CDSs of the longitudinal isolates from patient 1 (P1bm2009 –P1Bm2015) and patient 2 (P2Bm2011a, b), see Tables I and K in S1 Tables respectively. These adaptive mutations have not been reported in the B. multivorans literature (Table L in S1 Tables).

Gene Product (locus) Patient Function Previously identified
PurB (0592) 1 Adenylosuccinate lyase no
Hypothetical (1166) 1 Fusaric acid resistance protein family no
Glycosyltransferase (2375) 1 LPS modification no
AlgP (2966) 1 Alginate production no
LysR (3936) 1 Regulation of transcription no
CheA (4025) 1 Chemotaxis no
FimV (4911) 1 Type IV motility, pilus assembly no
TetR (7680) 2 TetR family transcriptional regulator no
Amidase (20235) 2 Amidase protein no

Additionally, we searched for evidence of parallel evolution of loci across multiple patients. In total, 30 CDS (and ten non-coding regions) were detected with mutations in at least three patients, and only two loci (129 and 6144) had mutations in all patients (Table I in S1 Tables and Table 4).

Table 4. Loci undergoing parallel evolution in patients 1–4.

List of 30 locus with mutations (SNPs) in at least three patients (see also Table I in S1 Tables). We separated the isolates from 2011 and the isolate from 2018 in patient 2 as we regard them as two separate acquisition events.

Mutated locus present?
P1 P2_2011 P2_2018 P3 P4 Locus Gene Function
0129 MFS transporter
1045 plsX phosphate acyltransferase PlsX
1134 astD succinylglutamate-semialdehyde dehydrogenase
1149 nodulation protein
1201 DUF2242 domain-containing protein
1600 TetR family transcriptional regulator
1764 protease modulator HflC
1788 DNA polymerase III subunit gamma/tau
2211 DNA alkylation response protein
2519 ATPase
2580 amino acid permease
2840 potassium transporter
2966 alginate biosynthesis protein AlgP
3147 hypothetical protein
3210 rsmB 16S rRNA (cytosine(967)-C(5))-methyltransferase
3483 MFS transporter
3702 GNAT family N-acetyltransferase
3720 hemagglutinin
4338 molybdopterin-dependent oxidoreductase
4442 CBS domain-containing protein
4911 pilus assembly protein FimV
5135 hypothetical protein
5336 sensor histidine kinase
5391 helix-turn-helix transcriptional regulator
5441 type II toxin-antitoxin system RelE/ParE
5799 ABC transporter substrate-binding protein
6056 ctaD cytochrome c oxidase subunit I
6124 BON domain-containing protein
6144 DUF1269 domain-containing protein
6156 LysR family transcriptional regulator

Larger structural variations that were not detected using the SNP analysis included the loss of a plasmid that was only detected in the first isolate P1Bm2009 (see above), as well as several deletions and insertions, some of which could again be linked to transposase activity (Table 5, Table H in S1 Tables).

Table 5. List of large structural variations in the isolates of Patient 1.

Using isolate P1Bm2009 as a reference, we mapped the long Nanopore reads of the subsequent isolates from Patient 1 to reveal large structural variations that escape SNP analysis. All structural variations are found in C1 and are fixed after their appearance in agreement with the SNP analysis (Fig 8). Isolates P1Bm2011a and P1Bm2012b, which originated from the same sputum sample as P1Bm2011b and P1Bm2012a, respectively, appear to come from separate lineages.

Strains Chr Type Function
P1Bm2013-2015 1 Insertion Intergenic, IS3 in the rRNA cluster (5S, 16S, 23S)
P1Bm2011b-2015 1 Breakend Glycosyltransferase
P1Bm2011b-2015 1 Breakend Intergenic, next to IS256
P1Bm2011a-2015 1 Deletion Phosphoribosyltransferase
P1Bm2012b 1 Deletion Glycosyltransferase
P1Bm2015 1 Deletion SDR family NAD-p-dependent oxidoreductase

Discussion

B. multivorans has been the most prevalent Bcc species in Belgian CF patients for more than three decades [1,26,27] and patients generally carry distinct sequence types, suggesting the presence of many different B. multivorans STs in environmental reservoirs [22]. In contrast, between 2009 and 2019, ST-742 isolates were found in respiratory samples of four patients attending three different CF centers [26,28]. To investigate the epidemiology and evolution of these ST-742 isolates, we examined their genome sequences and their susceptibility to antimicrobial agents commonly used in CF therapy.

The high level of sequence identity, the collinearity of the replicons (S2 Fig), and the comparison with other publicly available B. multivorans genomes (Fig 4) demonstrated that all ST-742 isolates belonged to a single genomic lineage and that the shared ST was not simply a result of homoplasy [34,35]. This also corroborated an earlier study that showed that the ST predicted both phylogeny and gene content of B. multivorans isolates, supporting the use of MLST for epidemiological surveillance of Bcc bacteria [23].

The lack of evidence to support cross-infection [26] suggested that all patients acquired B. multivorans ST-742 from an environmental reservoir and that this strain is endemic in Belgium. A recent genomic study of B. multivorans reported that the same genomic lineages were isolated from CF and environmental samples and on different continents many years apart, demonstrating the evolutionary persistence and ubiquity of these strains in different niches and on different continents [23]. Intriguingly, although several studies point towards the acquisition of B. multivorans from non-human sources such as the natural environment, its preferred natural habitat remains elusive [4,36].

Different elements suggest that this environmental reservoir holds a microdiverse B. multivorans ST-742 population [37] that is susceptible to several antimicrobial agents and from which different genetic variants colonized different patients. First, there is a considerable SNP diversity between genomes of isolates of different patients compared to the more limited diversity observed among the genomes of patient 1 isolates (Fig 4). Second, we observed different start positions of the high-density SNP regions in isolates from different patients (Figs 5, 6 and 7, Table 2). Third, we observed an apparent mobility of prophage elements which are differentially present in some isolates. Lastly, there was a shared plasmid present in the isolates of patient 2, 3 and 4 and a unique plasmid in the first isolate of patient 1 (Table A in S1 Tables). Indeed, this microdiverse B. multivorans ST-742 population appears to comprise different plasmids which can be dispensed once in a human host.

De Boeck et al. [1] reported on the re-appearance of Bcc strains up to ten years after first colonization and hypothesized that reacquisition from the environment occurred. In patient 2, we hypothesize that isolate P2Bm2018 is derived from a re-infection from the same environmental reservoir and is not an immediate evolutionary descendant of the previously reported isolates of this patient (P2Bm2011a-b). This hypothesis is supported by the position of isolate P2Bm2018 in the core SNP tree (Fig 4), its different SNP profile (Figs 5, 6 and 7) with 1,990 unique SNPs when compared to P2Bm2011a-b, the presence of different regions with high SNP content in C1 and C2 (Table J in S1 Tables) and a differing pattern of prophage elements between early isolates (P2Bm2011a-b) and the late isolate (P2Bm2018). However, no plasmid was detected in P2Bm2018 and the antibiotic resistance profile of P2Bm2018 resembled that of P2Bm2011a (Fig 2), potentially indicating that the lineage of that isolate was already established in the lungs of the patient before 2018.

Comparative genomic analyses of Bcc isolates sampled from single patients in the course of chronic infections have shown that diversifying lineages can co-exist in the CF lungs for many years, with limited rates of evolution (as proxied by single nucleotide polymorphisms), i.e. around 2.1 to 2.4 SNPs/year [3843]. This evolution within the lungs of CF patients can result in the loss of plasmids, increased antimicrobial resistance and a subset of genes that undergo positive selection [38,40,4244].

In the present study, the presence of co-existing clades was supported in samples of patients 1 and 2, in which the same sample yielded isolates with distinct colony morphologies that showed distinct antibiotic resistance profiles (Fig 2, P2Bm2011a-b) and SNP profiles (Fig 8, P1Bm2011a-b and P1Bm2012a-b). Among the eight isolates of patient 1, no gradual increase in antimicrobial resistance was observed, as the isolates collected in 2011 were already fully resistant to virtually all compounds tested (Fig 2). We found an average mutation rate of 15 SNPs/year (S4 Fig) which was much higher than earlier reported values for B. multivorans (2.2–2.4 SNPs/year), B. cenocepacia (1.7–2.1 SNPs/year) or B. dolosa (2.1 SNPs/year) [39,40,42,43].

As expected from the evolution within the lungs of CF patients, we found a subset of genes with mutations at multiple independent positions (Table 3) and 30 loci that had a mutation in at least three patients (Table 4). As this latter set of genes underwent parallel adaptation in different patients, these genes may be involved in the adaptation of the ST-742 strain to the CF lung environment. Importantly, we sought to correlate the loci we discovered to those found in the B. multivorans literature. However, we found little commonalities despite almost all loci being present on the ST-742 strain (Table 3, Table L in S1 Tables). Indeed, only three mutations (i.e., ABC transporter permease, DNA-binding protein, and an acyltransferase) were found in common with the fixed mutations found in the study by Silva et al. (2016). Another three (hypothetical protein, LysR, and an amidophosphoribosyltransferase) mutations matched to the list of 61 pathoadapted proteins reported by Caballero et al. (2018), and a single ABC transporter in common with the recent study by Hassan et al. (2020). Only one gene (plsX, locus 1045) was previously reported by Silva et al. (2016) as part of adaptation within the lungs (Table 4, Table L in S1 Tables), while all other genes that underwent parallel evolution appear to be unique to this ST-742 lineage. This highlights the different strategies for adaptation that can be used by different strains and the crucial need for large-scale investigations similar to those conducted in B. dolosa, B. cenocepacia and P. aeruginosa [41,42,45]. This also corroborates the work of Hassan et al. (2020) who discussed that different strains appeared to have different routes of adaptation within the patient.

Using the short reads for SNP analysis, we observed that some regions of C1 and C2 had a remarkably high SNP density extending over 250 kb and 70 kb, respectively (Figs 5 and 6, Table J in S1 Tables). A previous study by Silva et al. (2016) hinted that potential hypermutator phenotypes existed in B. multivorans due to mutations in the DNA mismatch-repair mechanisms, specifically in MutL. However, we could not corroborate this hypothesis in our study. Although disruptive mutations are reported in the MutL gene of isolates from Patient 2 and 4 (SNP analysis against P1Bm2009, Table I in S1 Tables), the gene is found correctly annotated de novo in those isolates. Additionally, these regions were not identified as genomic islands, nor were they flanked by mobile elements. Functional analysis of their coding content did not reveal enrichments for specific COG categories when compared to the complete genome, which may indicate that the recombination of this region is tolerated by ST-742 (Table M in S1 Tables). The potential presence of these regions in other B. multivorans genomes (including both 97 public and 59 non-public sequences from the National Reference Centre database) was analyzed using BLASTn but no evidence was found that they originated from recombination between any of those strains. This suggested that they may represent events of recombination with donor strains for which no genome sequences are currently available. Additionally, as the start positions of the high-density SNP regions differ in isolates from different patients and share significant numbers of SNPs (Figs 5, 6 and 7 and S3, Table J in S1 Tables), these recombination events likely took place prior to infection, not during co-infection of the patients airways, and contribute to the microdiversity of the B. multivorans ST-742 population in its environmental reservoir [37,46].

Finally, from a technology perspective, we observed that while short read sequencing technologies such as Illumina were well suited to examine SNP diversity, long-read sequencing technologies readily allowed us to observe differences in plasmid and prophage content. In patient 2, we observed that prophages were mobile (Table 2), with an uncharacterized prophage element that was present on C1 of P2Bm2011b but absent from P2Bm2011a and P2Bm2018, and a second prophage, Burkholderia virus KS5 [47], that was present on C2 of P2Bm2011a and P2Bm2018, yet absent from P2Bm2011b. This could be relevant as previous studies in P. aeruginosa highlighted the importance of mobile prophage elements in driving within-host adaptation of the infecting strains [48]. Interestingly, we also noted two variants (deletions) in phosphoribosyltransferase and glycosyltransferase proteins that only appeared in the large structural variation analysis (Table 5) and corresponded to loci reported by Caballero et al. [38]. These findings illustrate the limitations of Illumina-based SNP analysis in providing a comprehensive list of genomic changes among isolates and emphasize the strength of combining sequencing technologies.

Material and methods

Ethics statement

The strains used in this study were collected by the participating clinical laboratories in the frame of routine diagnostic without additional testing and sent to the National Reference Center for Burkholderia cepacia complex where they were stored in the frame of activities mentioned in the Royal Decree 9 February 2011 setting the conditions for funding for reference centers in human microbiology and precised on the website of Sciensano: https://nrchm.wiv-isp.be/nl/oproep2019/legaal/default.aspx. In the frame of the specific terms of reference for the NRC for Burkholderia cepacia complex, a collection of representative strains was managed as outline in https://nrchm.wiv-isp.be/nl/oproep2019/lastenboek/Rapporten/STR06-Burkholderia%20cepacia_2019.pdf. Epidemiological data were collected retrospectively from patient charts and anonymously stored in a database. As no additional sampling or information was asked from patients, no formal approval from an ethical committee or informed consents are needed.

Bacterial isolates and patient data

The thirteen clinical isolates presented in this study (Table 1) were isolated from sputum samples of four patients diagnosed with cystic fibrosis (Fig 1) and were collected at the Belgian National Reference Centre for Burkholderia cepacia complex (NRC Bcc). Isolates were identified as Burkholderia multivorans using MALDI-TOF mass spectrometry and recA gene sequence analysis [19,49]. Isolates were further molecularly characterized by multi-locus sequence typing (MLST) using the Bcc MLST scheme and the pubMLST database [19,21]. Isolates were grown aerobically in lysogeny broth (LB) medium at 37°C and cultures were preserved in 25% glycerol at -80°C.

For patient 1, B. multivorans was first detected in a sputum sample at age nine (P1Bm2009, January 2009). The initial treatment was intravenous (IV) ceftazidime, piperacillin-tazobactam and ciprofloxacin for 14 days, followed by inhaled temocillin and oral ciprofloxacin for 3 months. Cultures were negative until August 2009, date after which they remained positive until 2015, when the lung function declined drastically, and the patient received a double lung transplant. In 2011, two isolates from the same sputum sample were examined as they displayed a different colony morphology (P1Bm2011a and P1Bm2011b), and similarly in 2012 with isolates P1Bm2012a and P1Bm2012b (Fig 1).

For patient 2, B. multivorans was first detected in a sputum sample, at age five (December 2009, isolate not available for the NRC Bcc) and the infection is still ongoing at the time of the present study. Treatment has included IV administration of ceftazidime-amikacin, tobramycin, piperacillin, tazobactam, ciprofloxacin, and meropenem. Two isolates from the same sputum sample were sent to the NRC in 2012 as they displayed distinct colony morphologies (P2Bm2012a and P2Bm2012b), and an additional sample was received in 2018 (P2Bm2018) (Fig 1). Intermediate isolates (period 2012–2018) and later isolates (period 2019–2020) are not available for the NRC Bcc.

For patient 3, B. multivorans was first detected in a sputum sample at age nine (P3Bm2015, November 2015). The infection was successfully eradicated with an IV course of ceftazidime and tobramycin for 14 days, followed by 3 months of inhaled ceftazidime. No further samples were collected since.

For patient 4, B. multivorans was first detected in a sputum sample at age 25 (June 2017, not available for the NRC Bcc) and an isolate was sent to the NRC in December 2019 (P4bm2019) (Fig 1). The infection was treated for 14 days using intravenous meropenem, and no further therapy was administered due to other health related complications.

Antibiotic sensitivity testing

We tested the in vitro susceptibility of all isolates against twenty different antibiotics (Table B in S1 Tables) as described previously [50]. The MIC values were determined by microdilutions in microtiter plates which were read on a Sensititre Vizion System device (ThermoScientific). To determine the in vitro susceptibility, EUCAST pharmacokinetic/pharmacodynamics (PK/PD) breakpoints were used, as no EUCAST species-specific breakpoints were available (tables available at https://www.eucast.org/clinical_breakpoints/ PK PD breakpoints).

Genomic DNA isolation and hybrid sequencing

The isolates were inoculated in LB medium and grown overnight at 37°C to stationary phase (OD600 over 1.0). The genomic DNA was extracted using the QIAGEN DNeasy ultraclean microbial kit. The quality and quantity of the extracted DNA was assessed using a Qubit 4.0 fluorometer, ThermoFisher Scientific Nanodrop spectrophotometer (OD280/260 and OD230/260), and agarose gel electrophoresis (1% w/v).

The genomic DNA was sequenced combining both short-read and long-read technologies. The first set of reads were obtained on an Illumina HiSeq4000 or NovaSeq6000 platform using a paired-end 2*150 bp approach at the Oxford Genomics Centre. The second set of reads were obtained on a MinION nanopore sequencer (Oxford Nanopore Technology) equipped with a flowcell of type R9.4.1 and a library prepared either with the 1D ligation approach or the Rapid library preparation kit (Table A in S1 Tables).

Hybrid genome assembly

Quality of the Illumina reads was assessed using FastQC v0.11.9 [51] and Trimmomatic v0.38 [52] was used for adapter clipping, quality trimming (LEADING:3 TRAILING:3 SLIDINGWINDOW:4.15), and filtering on length (>50 bp). Quality of the Nanopore reads was assessed using Nanoplot v1.28.2 [53], while Porechop v0.2.3 [54] was used for barcode clipping and NanoFilt v2.6.0 [53] for filtering on quality (Q>8) and length (>500 bp).

The genomes of the isolates were assembled de novo using both the short-read SPAdes assembler v3.14.0 [55], and the hybrid assembler Unicycler v0.4.7 [56] with default options. The quality of the resulting assemblies was assessed using QUAST v5.0.2 [57] (S1 Fig) and the assembly graphs were visualized with Bandage v0.8.1 [58]. Pairwise average nucleotide identities between the ST-742 genomes were calculated using PYANI (v0.2.10) with the ANIb method.

Functional annotation

All genomes were functionally annotated via the NCBI Prokaryotic Genome Annotation Pipeline [59]. The proteome was also annotated with COG categories and KEGG pathways using the eggNOG mapper [60]. This CDS-centric annotation was subsequently complemented by annotating genetic systems and genomic islands with 1) the prophage detection system PHASTER [61], 2) CRISPRCasFinder software [62], 3) genomic IslandViewer v4 [63], 4) ABRicate for the detection of acquired antimicrobial resistance genes [64], and 5) the biosynthetic gene cluster annotation tool ANTISmash v5 [65].

Population structure and MLST

To examine the position of the clinical isolates within the general population structure of B. multivorans, all B. multivorans genomes were retrieved from the NCBI RefSeq database (release 98) as GenBank files (Table D in S1 Tables), together with B. cenocepacia J2315 [66] as an outgroup. Genomes with GC contents below 66% (one isolate), and more than 1,000 contigs (seven isolates) were excluded. Remaining GenBank files were converted into gff3 files using the perl script “bp_genbank2gff3” available in the BioPerl software library [67] and given as input to roary v3.1 3 [68] to delineate the pangenome and perform a core gene multiple sequence alignment. The core gene alignment was processed using FastTree v2.0 [69] to reconstruct the population structure. The phylogenetic tree was visualized and annotated using FigTree v1.4.4 [70]. MLST analysis on all downloaded genomes was performed using mlst (https://github.com/tseemann/mlst) and the Bcc pubMLST database [21].

SNP analysis

SNP calling was done using snippy v.4.4.5 [71] with the quality-controlled Illumina reads of each isolate and the annotation of isolate P1Bm2009 or P2Bm2011a as reference. The SNP density was calculated by tabulating the number of unique SNPs per 1000 nucleotides windows along the genome (Figs 5, 6 and 7 and Table I in S1 Tables). The visualization of shared SNPs (S3 Fig) was created with the software UpsetR [72] and the Venn web application (http://bioinformatics.psb.ugent.be/webtools/Venn/). The core SNP alignment from snippy was used to construct a focused phylogenetic tree of our ST-742 isolates and the strain FDAARGOS_719 using FastTree v2.0 [69] and visualized using FigTree v1.4.4 [70]. Comparisons with previously published mutation data was conducted using BLASTp between the proteome of P1Bm2009 and the proteins reported in the studies listed in the Table L in S1 Tables.

Large genomic structural variations

Overall sequence identity as well as collinearity of the genomes was examined by creating dot plot figures (S2 Fig) using Gepard v1.40 [73] with default options and word length adjusted to 100. The large structural variations were called using a combination of the long-read mapper ngmlr v0.2.7 [74] and the structural variation caller sniffles v1.0.11 [74] with the quality-controlled Nanopore reads and P1Bm2009 was as reference. The results were visually inspected using Ribbon v1.1 [75] and all the VCF files were consolidated into a spreadsheet, with the following filters applied: 1) variations related to the linearization of the replicons or smaller than 100 nt were removed, 2) imprecise variations were removed except when they matched a precise variations in another isolate, and 3) remaining variations in P1Bm2009 (three in total) were used to filter out false positives.

Supporting information

S1 Fig. Sequencing data, assembly metrics and assembly graphs.

The assembly graphs for the short-reads assemblies and hybrid assemblies are reported, enabling a quick visual inspection of the fragmentation of the assemblies as well as the presence of multiple replicons.

(PNG)

S2 Fig. Pairwise comparisons of the isolates chromosomes.

Chromosomes 1, 2, and 3 were compared in pairs using dot plots, highlighting the collinearity and overall sequence identity of the genomes across all isolates. Note that some chromosomes were not fully closed and subsequently rotated during the assembly resulting in visual artefacts including large inversions or re-arrangements.

(PNG)

S3 Fig. Shared and unique SNPs across the isolates from different patients.

This visualization supplements Figs 5, 6 and 7 by delineating the sharing of SNPs found in the different isolates. For each chromosome, we provide two visualizations of the data using UpSetR and a standard Venn diagram. We collapsed the 85 SNPs found in the isolates from P1Bm2009 to P1Bm2015 and kept separate the P2Bm2011a,b and P2Bm2018 isolates from patient 2, as they differed by 1,990 unique (not shared) SNPs.

(PNG)

S4 Fig. Cumulative SNPs Patient 1.

The cumulative SNPs are plotted over the time period of infection in patient 1. A linear model was used to estimate the rate of appearance of SNPs (15 SNPs/year).

(TIF)

S1 Tables. Table A: isolates description.

The thirteen ST-742 isolates are listed with a summary description of their sequencing and the related genomics data available the NCBI GenBank database. Table B: antibiotic resistance profile. The table contains the MIC values for all strains organized by type of antibiotics. The EUCAST or PK/PD interpretations are given by color (RED = resistant, ORANGE = intermediate, GREEN = sensitive). Table C: location of genomic islands and genetic clusters. Results from the three different software tools to predict the genomic locations of mobile elements and secondary metabolites clusters. Table D: list of isolates from NCBI used for the population structure. The genome sequences of the isolates were downloaded from the RefSeq database (release 98). Table E: features of plasmid pP1Bm2009. List of annotated features of plasmid 1 in patient 1 annotated using hmmer and eggNOG with a summary count of genes per COG categories. Table F: features of plasmid pP2Bm2011a. List of annotated features of plasmid 1 in patient 2 annotated using hmmer and eggNOG with a summary count of genes per COG categories and a comparison of COG categories between pP1Bm2009 and pP2Bm2011a. Table G: master list of large structural variations. List of all structural variations identified using the long reads (Nanopore data). Table H: filtered list of large structural variations. The filtering steps are described in the Materials and Methods section. Table I: master list SNPs. List of all SNPs identified when comparing the 13 isolates against P1Bm2009. Table J: regions of high SNPs density. Detailed start and stop positions of the regions displayed on the Figs 5, 6 and 7. Table K: SNPs isolates patient 2. List of SNPs identified when comparing the P2Bm2011b against P2Bm2011a. Table L: List of loci under selection reported in previous studies. This list contains the loci that were highlighted in previous longitudinal studies of B. multivorans infections of patients with cystic fibrosis. For each study, we mention the locus number (as reported), the NCBI protein accession number, whether or not the locus was present in the ST-742 strains, and whether there was a SNP found in the locus for the longitudinal isolates from patient 1. Table M: COG annotation of the genes in high-density SNP regions. Here we list the proteins found in the regions enriched in SNPs (Tables I and J in S1 Tables), functionally annotated with their COG categories. The graphic at the top compares the specific presence of the COG categories in those high-density SNP regions versus the complete genome.

(XLSX)

Acknowledgments

We thank the referring laboratories of the following hospitals: Universitair Ziekenhuis Leuven (UZ Leuven; Leuven, Belgium), Clinique CHC MontLégia (Centre Hospitalier Chrétien, Liège, Belgium), and Universitair Ziekenhuis Brussel (UZ Brussel, Brussel, Belgium). We also thank the Oxford Genomics Centre at the Welcome Centre for Human Genetics for the generation and initial processing of the Illumina sequencing data.

Data Availability

All genomics data was deposited in the NCBI GenBank database. The initial Illumina and Nanopore sets of reads are available in the SRA database via the accession numbers of Table A in S1 Tables. The genomes and their annotations can be found via the BioProject accession number PRJNA601875.

Funding Statement

CL is supported by a PhD fellowship from FWO Vlaanderen (1S64720N). The labs of VvN and RL are supported by the ID-N grant PHAGEFORCE from the KU Leuven (IDN/20/024). Part of this work was performed in the framework of the Belgian National Reference Centre for Burkholderia (CP, PV, DP), supported by the Ministry of Social Affairs through a fund within the National Health Insurance System. The Oxford Genomics Centre at the Wellcome Centre for Human Genetics is funded by Wellcome Trust grant reference 203141/Z/16/Z. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.De Boeck K, Malfroot A, Van Schil L, Lebecque P, Knoop C, Govan JRW, et al. Epidemiology of Burkholderia cepacia complex colonisation in cystic fibrosis patients. Eur Respir J. 2004;23(6):851–6. 10.1183/09031936.04.00118804 [DOI] [PubMed] [Google Scholar]
  • 2.Govan JRW, Brown AR, Jones AM. Evolving epidemiology of Pseudomonas aeruginosa and the Burkholderia cepacia complex in cystic fibrosis lung infection. Future Microbiol. 2007;2(2):153–64. 10.2217/17460913.2.2.153 [DOI] [PubMed] [Google Scholar]
  • 3.Hauser AR, Jain M, Bar-Meir M, McColley SA. Clinical Significance of Microbial Infection and Adaptation in Cystic Fibrosis. Clin Microbiol Rev. 2011;24(1):29–70. 10.1128/CMR.00036-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.LiPuma JJ. The changing microbial epidemiology in cystic fibrosis. Clin Microbiol Rev. 2010;23(2):299–323. 10.1128/CMR.00068-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Isles A, Maclusky I, Corey M, Gold R, Prober C, Fleming P, et al. Pseudomonas cepacia infection in cystic fibrosis: An emerging problem. J Pediatr. 1984. February 1;104(2):206–10. 10.1016/s0022-3476(84)80993-2 [DOI] [PubMed] [Google Scholar]
  • 6.Mahenthiralingam E, Urban TA, Goldberg JB. The multifarious, multireplicon Burkholderia cepacia complex. Nat Rev Microbiol. 2005;3(2):144–56. 10.1038/nrmicro1085 [DOI] [PubMed] [Google Scholar]
  • 7.France MW, Dodd ME, Govan JR, Doherty CJ, Webb AK, Jones AM. The changing epidemiology of Burkholderia species infection at an adult cystic fibrosis centre. J Cyst Fibros. 2008;7(5):368–72. 10.1016/j.jcf.2008.01.002 [DOI] [PubMed] [Google Scholar]
  • 8.Mahenthiralingam E, Vandamme P, Campbell ME, Henry DA, Gravelle AM, Wong LTK, et al. Infection with Burkholderia cepacia complex genomovars in patients with cystic fibrosis: virulent transmissible strains of genomovar III can replace Burkholderia multivorans. Clin Infect Dis. 2001;33(9):1469–75. 10.1086/322684 [DOI] [PubMed] [Google Scholar]
  • 9.Speert DP, Henry D, Vandamme P, Corey M, Mahenthiralingam E. Epidemiology of Burkholderia cepacia complex in patients with cystic fibrosis, Canada. Emerg Infect Dis. 2002;8(2):181–7. 10.3201/eid0802.010163 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Turton JF, Kaufmann ME, Mustafa N, Kawa S, Clode FE, Pitt TL. Molecular comparison of isolates of Burkholderia multivorans from patients with cystic fibrosis in the United Kingdom. J Clin Microbiol. 2003;41(12):5750–4. 10.1128/jcm.41.12.5750-5754.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Brisse S, Cordevant C, Vandamme P, Bidet P, Loukil C, Chabanon G, et al. Species distribution and ribotype diversity of Burkholderia cepacia complex isolates from French patients with cystic fibrosis. J Clin Microbiol. 2004;42(10):4824–7. 10.1128/JCM.42.10.4824-4827.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Norskov-Lauritsen N, Johansen HK, Fenger MG, Nielsen XC, Pressler T, Olesen HV, et al. Unusual distribution of Burkholderia cepacia complex species in Danish cystic fibrosis clinics may stem from restricted transmission between patients. J Clin Microbiol. 2010;48(8):2981–3. 10.1128/JCM.00383-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Pope CE, Short P, Carter PE. Species distribution of Burkholderia cepacia complex isolates in cystic fibrosis and non-cystic fibrosis patients in New Zealand. J Cyst Fibros. 2010;9(6):442–6. 10.1016/j.jcf.2010.08.011 [DOI] [PubMed] [Google Scholar]
  • 14.Biddick R, Spilker T, Martin A, LiPuma JJ. Evidence of transmission of Burkholderia cepacia, Burkholderia multivorans and Burkholderia dolosa among persons with cystic fibrosis. FEMS Microbiol Lett. 2003;228(1):57–62. 10.1016/S0378-1097(03)00724-9 [DOI] [PubMed] [Google Scholar]
  • 15.Millar-Jones L, Ryley HC, Paull A, Goodchild MC. Transmission and prevalence of Burkholderia cepacia in Welsh cystic fibrosis patients. Respir Med. 1998;92(2):178–83. 10.1016/s0954-6111(98)90092-0 [DOI] [PubMed] [Google Scholar]
  • 16.Segonds C, Heulin T, Marty N, Chabanon G. Differentiation of Burkholderia species by PCR-restriction fragment length polymorphism analysis of the 16S rRNA gene and application to cystic fibrosis isolates. J Clin Microbiol. 1999;37(7):2201–8. 10.1128/JCM.37.7.2201-2208.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Whiteford ML, Wilkinson JD, McColl JH, Conlon FM, Michie JR, Evans TJ, et al. Outcome of Burkholderia (Pseudomonas) cepacia colonisation in children with cystic fibrosis following a hospital outbreak. Thorax. 1995;50(11):1194–8. 10.1136/thx.50.11.1194 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Baldwin A, Mahenthiralingam E, Thickett KM, Honeybourne D, Maiden MCJ, Govan JR, et al. Multilocus sequence typing scheme that provides both species and strain differentiation for the Burkholderia cepacia complex. J Clin Microbiol. 2005;43(9):4665–73. 10.1128/JCM.43.9.4665-4673.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Spilker T, Baldwin A, Bumford A, Dowson CG, Mahenthiralingam E, LiPuma JJ. Expanded multilocus sequence typing for Burkholderia species. J Clin Microbiol. 2009;47(8):2607–10. 10.1128/JCM.00770-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Waine DJ, Henry DA, Baldwin A, Speert DP, Honeybourne D, Mahenthiralingam E, et al. Reliability of multilocus sequence typing of the Burkholderia cepacia complex in cystic fibrosis. J Cyst Fibros. 2007;6(3):215–9. 10.1016/j.jcf.2006.10.003 [DOI] [PubMed] [Google Scholar]
  • 21.Jolley KA, Maiden MC. BIGSdb: Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics. 2010;11(595). 10.1186/1471-2105-11-595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Baldwin A, Mahenthiralingam E, Drevinek P, Pope C, Waine DJ, Henry DA, et al. Elucidating global epidemiology of Burkholderia multivorans in cases of cystic fibrosis by multilocus sequence typing. J Clin Microbiol. 2008;46(1):290–5. 10.1128/JCM.01818-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Peeters C, Cooper VS, Hatcher PJ, Verheyde B, Carlier A, Vandamme P. Comparative genomics of Burkholderia multivorans, a ubiquitous pathogen with a highly conserved genomic structure. Cloeckaert A, editor. PLoS ONE. 2017;12(4):e0176191. 10.1371/journal.pone.0176191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Baldwin A, Mahenthiralingam E, Drevinek P, Vandamme P, Govan JR, Waine DJ, et al. Environmental Burkholderia cepacia complex isolates in human infections. Emerg Infect Dis. 2007;13(3):458–61. 10.3201/eid1303.060403 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ramsay KA, Butler CA, Paynter S, Ware RS, Kidd TJ, Wainwright CE, et al. Factors influencing acquisition of Burkholderia cepacia complex organisms in patients with cystic fibrosis. J Clin Microbiol. 2013;51(12):3975–80. 10.1128/JCM.01360-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.NRC Bcc: data 2012–2017 [Internet]. Available from: https://nrchm.wiv-isp.be/nl/ref_centra_labo/burkholderia_cepacia_complex/Rapporten/Burkholderia%202012-2017.pdf
  • 27.Revets H, Vandamme P, Zeebroeck AV, Boeck KD, Struelens MJ, Verhaegen J, et al. Burkholderia (Pseudomonas) Cepacia and Cystic Fibrosis: the Epidemiology in Belgium. Acta Clin Belg. 1996. January 1;51(4):222–30. 10.1080/22953337.1996.11718514 [DOI] [PubMed] [Google Scholar]
  • 28.Peeters C. Burkholderia multivorans and beyond: epidemiology, genomics and taxonomy of opportunistic pathogens in cystic fibrosis patients. Ghent University; 2016. [Google Scholar]
  • 29.Botelho J, Lood C, Partridge SR, Noort V van, Lavigne R, Grosso F, et al. Combining sequencing approaches to fully resolve a carbapenemase-encoding megaplasmid in a Pseudomonas shirazica clinical strain. Emerg Microbes Infect. 2019. January 1;8(1):1186–94. 10.1080/22221751.2019.1648182 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lechuga A, Lood C, Salas M, Noort V van, Lavigne R, Redrejo-Rodríguez M. Completed Genomic Sequence of Bacillus thuringiensis HER1410 Reveals a Cry-Containing Chromosome, Two Megaplasmids, and an Integrative Plasmidial Prophage. G3 Genes Genomes Genet [Internet]. 2020. September 20 [cited 2020 Jul 29]; Available from: https://www.g3journal.org/content/early/2020/07/20/g3.120.401361 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Bcc pubMLST database [Internet]. Available from: https://pubmlst.org/bcc/
  • 32.Sichtig H, Minogue T, Yan Y, Stefan C, Hall A, Tallon L, et al. FDA-ARGOS is a database with public quality-controlled reference genomes for diagnostic use and regulatory science. Nat Commun. 2019. July 25;10(1):1–13. 10.1038/s41467-018-07882-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Coordinators NR. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2014. January 1;42(D1):D7–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Aziz A, Sarovich DS, Harris TM, Kaestli M, McRobb E, Mayo M, et al. Suspected cases of intracontinental Burkholderia pseudomallei sequence type homoplasy resolved using whole-genome sequencing. Microb Genomics. 2017;3(11):e000139. 10.1099/mgen.0.000139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.De Smet B, Sarovich DS, Price EP, Mayo M, Theobald V, Kham C, et al. Whole-genome sequencing confirms that Burkholderia pseudomallei multilocus sequence types common to both Cambodia and Australia are due to homoplasy. J Clin Microbiol. 2015;53(1):323–6. 10.1128/JCM.02574-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Peeters C, Depoorter E, Praet J, Vandamme P. Extensive cultivation of soil and water samples yields various pathogens in patients with cystic fibrosis but not Burkholderia multivorans. J Cyst Fibros. 2016;15(6):769–75. 10.1016/j.jcf.2016.02.014 [DOI] [PubMed] [Google Scholar]
  • 37.Larkin AA, Martiny AC. Microdiversity shapes the traits, niche space, and biogeography of microbial taxa. Environ Microbiol Rep. 2017;9(2):55–70. 10.1111/1758-2229.12523 [DOI] [PubMed] [Google Scholar]
  • 38.Caballero JD, Clark ST, Wang PW, Donaldson SL, Coburn B, Tullis DE, et al. A genome-wide association analysis reveals a potential role for recombination in the evolution of antimicrobial resistance in Burkholderia multivorans. PLOS Pathog. 2018. December 7;14(12):e1007453. 10.1371/journal.ppat.1007453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Hassan AA, dos Santos SC, Cooper VS, Sá-Correia I. Comparative Evolutionary Patterns of Burkholderia cenocepacia and B. multivorans During Chronic Co-infection of a Cystic Fibrosis Patient Lung. Front Microbiol. 2020. September 25;11:574626. 10.3389/fmicb.2020.574626 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Lieberman TD, Michel J-B, Aingaran M, Potter-Bynoe G, Roux D, Davis MR, et al. Parallel bacterial evolution within multiple patients identifies candidate pathogenicity genes. Nat Genet. 2011;43(12):1275–80. 10.1038/ng.997 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Lieberman TD, Flett KB, Yelin I, Martin TR, McAdam AJ, Priebe GP, et al. Genetic variation of a bacterial pathogen within individuals with cystic fibrosis provides a record of selective pressures. Nat Genet. 2013. December 8;(December):1–7. 10.1038/ng.2848 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Nunvar J, Capek V, Fiser K, Fila L, Drevinek P. What matters in chronic Burkholderia cenocepacia infection in cystic fibrosis: Insights from comparative genomics. PLOS Pathog. 2017. December 11;13(12):e1006762. 10.1371/journal.ppat.1006762 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Silva IN, Santos PM, Santos MR, Zlosnik JEA, Speert DP, Buskirk SW, et al. Long-Term Evolution of Burkholderia multivorans during a Chronic Cystic Fibrosis Infection Reveals Shifting Forces of Selection. mSystems [Internet]. 2016. June 28 [cited 2020 Apr 29];1(3). Available from: https://msystems.asm.org/content/1/3/e00029-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Seth-Smith HMB, Casanova C, Sommerstein R, Meinel DM, Abdelbary MMH, Blanc DS, et al. Phenotypic and Genomic Analyses of Burkholderia stabilis Clinical Contamination, Switzerland—Volume 25, Number 6—June 2019—Emerging Infectious Diseases journal—CDC. [cited 2020 Aug 24]; Available from: https://wwwnc.cdc.gov/eid/article/25/6/17-2119_article [DOI] [PMC free article] [PubMed]
  • 45.Marvig RL, Sommer LM, Molin S, Johansen HK. Convergent evolution and adaptation of Pseudomonas aeruginosa within patients with cystic fibrosis. Nat Genet. 2015. January;47(1):57–64. 10.1038/ng.3148 [DOI] [PubMed] [Google Scholar]
  • 46.Berg G, Eberl L, Hartmann A. The rhizosphere as a reservoir for opportunistic human pathogenic bacteria. Environ Microbiol. 2005;7(11):1673–85. 10.1111/j.1462-2920.2005.00891.x [DOI] [PubMed] [Google Scholar]
  • 47.Lynch KH, Stothard P, Dennis JJ. Genomic analysis and relatedness of P2-like phages of the Burkholderia cepacia complex. BMC Genomics. 2010. October 25;11:599. 10.1186/1471-2164-11-599 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Davies EV, James CE, Williams D, O’Brien S, Fothergill JL, Haldenby S, et al. Temperate phages both mediate and drive adaptive evolution in pathogen biofilms. Proc Natl Acad Sci. 2016. July 19;113(29):8266–71. 10.1073/pnas.1520056113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Peeters C, Zlosnik JEA, Spilker T, Hird TJ, LiPuma JJ, Vandamme P. Burkholderia pseudomultivorans sp. nov., a novel Burkholderia cepacia complex species from human respiratory samples and the rhizosphere. Syst Appl Microbiol. 2013;36(7):483–9. 10.1016/j.syapm.2013.06.003 [DOI] [PubMed] [Google Scholar]
  • 50.Van Dalem A, Herpol M, Echahidi F, Peeters C, Wybo I, De Wachter E, et al. In vitro susceptibility of Burkholderia cepacia complex isolated from cystic fibrosis patients to ceftazidime-avibactam and ceftolozane-tazobactam. Antimicrob Agents Chemother. 2018;62(9):AAC.00590-18. 10.1128/AAC.00590-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Andrews S. FastQC: a quality control tool for high throughput sequence data. Babraham Bioinformatics, Babraham Institute, Cambridge, United Kingdom; 2010. [Google Scholar]
  • 52.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014. August 1;30(15):2114–20. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.De Coster W, D’Hert S, Schultz DT, Cruts M, Van Broeckhoven C. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics. 2018. August 1;34(15):2666–9. 10.1093/bioinformatics/bty149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wick RR. Porechop. Github [Internet]. 2017; Available from: https://github.com/rrwick/Porechop
  • 55.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. J Comput Biol. 2012. May;19(5):455–77. 10.1089/cmb.2012.0021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Wick RR, Judd LM, Gorrie CL, Holt KE. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLOS Comput Biol. 2017. June 8;13(6):e1005595. 10.1371/journal.pcbi.1005595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013. April 15;29(8):1072–5. 10.1093/bioinformatics/btt086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Wick RR, Schultz MB, Zobel J, Holt KE. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015. October 15;31(20):3350–2. 10.1093/bioinformatics/btv383 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, et al. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 2016. August 19;44(14):6614–24. 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, von Mering C, et al. Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper. Mol Biol Evol. 2017. August 1;34(8):2115–22. 10.1093/molbev/msx148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016. July 8;44(W1):W16–21. 10.1093/nar/gkw387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Couvin D, Bernheim A, Toffano-Nioche C, Touchon M, Michalik J, Néron B, et al. CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res. 2018. July 2;46(W1):W246–51. 10.1093/nar/gky425 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Bertelli C, Laird MR, Williams KP, Lau BY, Hoad G, Winsor GL, et al. IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets. Nucleic Acids Res. 2017. July 3;45(W1):W30–5. 10.1093/nar/gkx343 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Seemann T. tseemann/abricate [Internet]. 2020 [cited 2020 Aug 24]. Available from: https://github.com/tseemann/abricate
  • 65.Blin K, Shaw S, Steinke K, Villebro R, Ziemert N, Lee SY, et al. antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res. 2019. July 2;47(W1):W81–7. 10.1093/nar/gkz310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Holden MTG, Seth-Smith HMB, Crossman LC, Sebaihia M, Bentley SD, Cerdeño-Tárraga AM, et al. The Genome of Burkholderia cenocepacia J2315, an Epidemic Pathogen of Cystic Fibrosis Patients. J Bacteriol. 2009. January 1;191(1):261–77. 10.1128/JB.01230-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Stajich JE, Block D, Boulez K, Brenner SE, Chervitz SA, Dagdigian C, et al. The Bioperl Toolkit: Perl Modules for the Life Sciences. Genome Res. 2002. October 1;12(10):1611–8. 10.1101/gr.361602 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015. November 15;31(22):3691–3. 10.1093/bioinformatics/btv421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Price MN, Dehal PS, Arkin AP. FastTree 2 –Approximately Maximum-Likelihood Trees for Large Alignments. PLOS ONE. 2010. March 10;5(3):e9490. 10.1371/journal.pone.0009490 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Andrew Rambaut. FigTree. 2018 [cited 2020 Jul 29]; Available from: https://github.com/rambaut/figtree/
  • 71.Seemann T. Snippy: rapid haploid variant calling and core SNP phylogeny. Github [Internet]. 2015; Available from: https://github.com/tseemann/snippy [Google Scholar]
  • 72.Conway JR, Lex A, Gehlenborg N. UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics. 2017. September 15;33(18):2938–40. 10.1093/bioinformatics/btx364 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Krumsiek J, Arnold R, Rattei T. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics. 2007. April 15;23(8):1026–8. 10.1093/bioinformatics/btm039 [DOI] [PubMed] [Google Scholar]
  • 74.Sedlazeck FJ, Rescheneder P, Smolka M, Fang H, Nattestad M, von Haeseler A, et al. Accurate detection of complex structural variations using single-molecule sequencing. Nat Methods. 2018. June;15(6):461–8. 10.1038/s41592-018-0001-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Nattestad M, Chin C-S, Schatz MC. Ribbon: Visualizing complex genome alignments and structural variation. bioRxiv. 2016. October 20;082123. [Google Scholar]

Decision Letter 0

Denise M Monack, William Navarre

11 Jan 2021

Dear Mr. Lood,

Thank you very much for submitting your manuscript "Genomics of an endemic cystic fibrosis Burkholderia multivorans strain reveals low within-patient evolution but high between-patient diversity" for consideration at PLOS Pathogens. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

You will see that the three reviewers were generally positive. Two reviewers indicate they would like more in-depth and contextual interpretation of the data as presented. They have outlined their specific concerns/requests below. We agree with their assessment - and therefore we ask that you go through their comments and concerns as it will strengthen the impact of your work. However, given that most of the points are asking for more interpretation of your current analysis, as opposed to further experimentation or strong issues with the methods you employed, we believe you should be able to comply with these requests in a rather short amount of time.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. 

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. 

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

William Navarre

Associate Editor

PLOS Pathogens

Denise Monack

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************

You will see that the three reviewers were generally positive. Two reviewers indicate they would like more in-depth and contextual interpretation of the data as presented. They have outlined their specific concerns/requests below. We agree with their assessment and ask that you go through their comments and concerns as it will strengthen the impact of this work considerably. However, given that most of the points are asking for more interpretation of your current analysis, as opposed to further experimentation or strong issues with the methods you employed, we believe you should be able to comply with these requests in a rather short amount of time.

Reviewer Comments (if any, and for reference):

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #1: The manuscript by Lood et al describes a genomic analysis of Burkholderia multivorans isolates recovered from persons with cystic fibrosis (CF). The authors conclude that the findings demonstrate limited genomic variation among isolates serially recovered from the same person, but high diversity among isolates recovered from different persons.

In general, this is a well written, nicely presented manuscript that clearly describes comprehensive genomic analyses of an important opportunistic human pathogen. The work is of interest on a few levels: (i) the analysis of serial isolates from two individuals furthers our understanding of bacterial adaptation/diversification during chronic human infection; (ii) the study nicely illustrates the strength of integrated long- and sort-read WGS approaches to assess genomic differences between related bacteria; and (iii) the work provides convincing evidence of independent acquisition of the same strain of B multivorans by multiple persons with CF, which adds to our understanding of the epidemiology of Burkholderia and is an important consideration in infection control in CF. The analyses, including antibiotic susceptibility phenotype, phylogenetic context of this strain, and detailed SNP analyses support the authors conclusions.

Reviewer #2: Strengths

The manuscript "Genomics of an endemic cystic fibrosis Burkholderia multivorans strain reveals low within-patient evolution but high between-patient diversity" by Lood, Peeters, et al., is well-written, the data well-analyzed, and the conclusions provided are sound.

The combination of two different sequencing technologies is important, giving a broader picture of what is happening genetically, than either technique can produce alone.

It is of concern that the number of SNPs detected per year are significantly higher than that previously reported for Burkholderia cepacia complex strains. This suggests that even without being hypermutator strains, the strains analysed were capable of much higher rates of mutation than previously known.

The report highlights the importance of tracking the genetics of sequential patient isolates. Significantly, although the antibiotic resistances from strain to strain did not vary appreciably, that is not to say that there were not mutations occurring and being selected for. For each infecting strain, perhaps it is only a matter of time before the correct combination of random mutations coalesce to produce a better adapted variant, whether that be SNPs, plasmid loss, prophage acquisition, or some other genetic event.

Weaknesses

Although, as with any genomics study aimed at inferring past adaptations, this retrospective study cannot truly know that the mutations observed were "adaptive". Given that similar recent studies showed an accumulation of almost completely different SNPs, it suggests not only as the authors' state ... different strategies for adaptation to the human lung ..., but also possibly that the random mutations (or that all the mutations) are not significantly adaptive.

Reviewer #3: This is an exceptional comparative genomic analysis of a cluster of B. multivorans isolates belonging to a common lineage endemic to Belgium and capable of establishing chronic infections in persons with CF. The genomic methods are first-rate, attentive to structural variation and mobile elements, and show that this lineage is distinct from various others reported in this small field by undergoing patterns of adaptive evolution in vivo with limited overlap with prior studies. Another advantage is that four independent infections from this lineage are studied, enabling assessment of convergent evolution. Overall, this is a valuable contribution to the microbial genomic literature of Bcc, which remains relatively scarce.

Yet, the primary conclusion appears to be, as the abstract states, that these infections accumulate “peculiar patterns of genomic diversity between patients.” This leaves me with more questions than answers from this largely descriptive study, outlined below.

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be absolutely required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #1: I have no major concerns with any of the analyses and believe the conclusions are well-justified.

Reviewer #2: None.

Reviewer #3: 1. If they derived from a common, unknown environment, do the changes in vivo indicate shared adaptations that distinguish infected isolates from the natural source, in effect enabling “triangulation” of the source environment?

2. Is the natural environment a source for certain mobile elements that might be adaptive or maladaptive?

3. What are the phenotypic effects of any parallel genetic adaptations? How exactly (genetically or phenotypically) is this novel ST different from other infectious isolates in CF or environmental isolates from your prior work?

4. What can we learn about these dynamics about the epidemiology and evolution of opportunistic bacterial pathogens more generally? The manuscript offers little discussion outside our field of Bcc clinical microbiology, a missed opportunity.

5. Given that a major discovery is of clusters of mutations in certain regions, which as the authors acknowledge suggest recombination events, what is the novelty? That recombination may happen perhaps due to coinfection in the airway or possibly in the reservoir prior to reinfection?

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #1: Lines 39 and 291: SNP is typically used to describe single nucleotide polymorphisms (not small nucleotide…)

Line 53: necrosis, not necrotisis

Line 70: “…higher RATES OF morbidity and mortality…”

Line 105: omit “in”

Line 168: define ‘indels’ with first use

Line 338: elements, not element

Line 348: omit “different”

Methods: check spelling of antibiotics throughout: be consistent, use English spellings

Line 385: can a reference be provided for EUCAST PK/PD breakpoints?

Line 697: each originated from, instead of “were originating from”

Figures 3, 5-7: can these be uploaded in higher quality; very difficult to read even with magnification

Reviewer #2: Minor issues

Line 50: In the author summary, the authors (surprisingly) indicate that " multivorans is the most prevalent species within the Burkholderia cepacia complex (Bcc) found infecting the lungs of patients with cystic fibrosis (CF)." Later, in lines 96 and 244, they go on to clarify that this fact is true for Belgium. In fact, although changes have occurred in some clinics and some countries recently, there are clinics and countries where B. cenocepacia is still the most prevalent Bcc species. Therefore, the authors should change the wording on line 50 to say "In some countires, B. multivorans is the most prevalent species ....".

Line 53: the word "necrotisis is rarely used. The more highly preferred English word is "necrosis".

Line 151: The authors refer to the "typical" conserved genome structure of Bm being three large replicons. If this structure is typical, then they are referring to previously determined data, and therefore should supply references for this work.

Line 316: Or perhaps elsewhere. The authors indicate the locations of high SNP density in some of the replicons in some of the strains. The authors should comment on these stretches further. Why is this happening, What genes in common are being affected if any, and Why here and not everywhere. (Do these modified regions suggest improvements in adaptation, or merely tolerable unimportant mutations?)

Line 416: It is apparent that PHAST can identify additional prophage element components not detected by PHASTER. Did the authors attempt to use this, or any of the other recent prophage software packages (e.g. ProphET, Prophage Hunter, PhageWeb, Phage_Finder, PhiSpy, LysoPhD, ) to analyze their data?

Reviewer #3: (No Response)

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: John J. LiPuma

Reviewer #2: No

Reviewer #3: Yes: Vaughn Cooper

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/plospathogens/s/submission-guidelines#loc-materials-and-methods

Decision Letter 1

Denise M Monack, William Navarre

22 Feb 2021

Dear Mr. Lood,

We are pleased to inform you that your manuscript 'Genomics of an endemic cystic fibrosis Burkholderia multivorans strain reveals low within-patient evolution but high between-patient diversity' has been provisionally accepted for publication in PLOS Pathogens.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

William Navarre

Associate Editor

PLOS Pathogens

Denise Monack

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************************************************

We note that all three reviewers were generally supportive of publication during the last round of review.  The associate editor has reviewed the recent changes to the manuscript and believes the concerns by the reviewers have been adequately addressed.

Acceptance letter

Denise M Monack, William Navarre

9 Mar 2021

Dear Dr. Lavigne,

We are delighted to inform you that your manuscript, "Genomics of an endemic cystic fibrosis Burkholderia multivorans strain reveals low within-patient evolution but high between-patient diversity," has been formally accepted for publication in PLOS Pathogens.

We have now passed your article onto the PLOS Production Department who will complete the rest of the pre-publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Pearls, Reviews, Opinions, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript, if you opted to have an early version of your article, will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Sequencing data, assembly metrics and assembly graphs.

    The assembly graphs for the short-reads assemblies and hybrid assemblies are reported, enabling a quick visual inspection of the fragmentation of the assemblies as well as the presence of multiple replicons.

    (PNG)

    S2 Fig. Pairwise comparisons of the isolates chromosomes.

    Chromosomes 1, 2, and 3 were compared in pairs using dot plots, highlighting the collinearity and overall sequence identity of the genomes across all isolates. Note that some chromosomes were not fully closed and subsequently rotated during the assembly resulting in visual artefacts including large inversions or re-arrangements.

    (PNG)

    S3 Fig. Shared and unique SNPs across the isolates from different patients.

    This visualization supplements Figs 5, 6 and 7 by delineating the sharing of SNPs found in the different isolates. For each chromosome, we provide two visualizations of the data using UpSetR and a standard Venn diagram. We collapsed the 85 SNPs found in the isolates from P1Bm2009 to P1Bm2015 and kept separate the P2Bm2011a,b and P2Bm2018 isolates from patient 2, as they differed by 1,990 unique (not shared) SNPs.

    (PNG)

    S4 Fig. Cumulative SNPs Patient 1.

    The cumulative SNPs are plotted over the time period of infection in patient 1. A linear model was used to estimate the rate of appearance of SNPs (15 SNPs/year).

    (TIF)

    S1 Tables. Table A: isolates description.

    The thirteen ST-742 isolates are listed with a summary description of their sequencing and the related genomics data available the NCBI GenBank database. Table B: antibiotic resistance profile. The table contains the MIC values for all strains organized by type of antibiotics. The EUCAST or PK/PD interpretations are given by color (RED = resistant, ORANGE = intermediate, GREEN = sensitive). Table C: location of genomic islands and genetic clusters. Results from the three different software tools to predict the genomic locations of mobile elements and secondary metabolites clusters. Table D: list of isolates from NCBI used for the population structure. The genome sequences of the isolates were downloaded from the RefSeq database (release 98). Table E: features of plasmid pP1Bm2009. List of annotated features of plasmid 1 in patient 1 annotated using hmmer and eggNOG with a summary count of genes per COG categories. Table F: features of plasmid pP2Bm2011a. List of annotated features of plasmid 1 in patient 2 annotated using hmmer and eggNOG with a summary count of genes per COG categories and a comparison of COG categories between pP1Bm2009 and pP2Bm2011a. Table G: master list of large structural variations. List of all structural variations identified using the long reads (Nanopore data). Table H: filtered list of large structural variations. The filtering steps are described in the Materials and Methods section. Table I: master list SNPs. List of all SNPs identified when comparing the 13 isolates against P1Bm2009. Table J: regions of high SNPs density. Detailed start and stop positions of the regions displayed on the Figs 5, 6 and 7. Table K: SNPs isolates patient 2. List of SNPs identified when comparing the P2Bm2011b against P2Bm2011a. Table L: List of loci under selection reported in previous studies. This list contains the loci that were highlighted in previous longitudinal studies of B. multivorans infections of patients with cystic fibrosis. For each study, we mention the locus number (as reported), the NCBI protein accession number, whether or not the locus was present in the ST-742 strains, and whether there was a SNP found in the locus for the longitudinal isolates from patient 1. Table M: COG annotation of the genes in high-density SNP regions. Here we list the proteins found in the regions enriched in SNPs (Tables I and J in S1 Tables), functionally annotated with their COG categories. The graphic at the top compares the specific presence of the COG categories in those high-density SNP regions versus the complete genome.

    (XLSX)

    Attachment

    Submitted filename: ResponseToTheReviewers_latest.docx

    Data Availability Statement

    All genomics data was deposited in the NCBI GenBank database. The initial Illumina and Nanopore sets of reads are available in the SRA database via the accession numbers of Table A in S1 Tables. The genomes and their annotations can be found via the BioProject accession number PRJNA601875.


    Articles from PLoS Pathogens are provided here courtesy of PLOS

    RESOURCES