Public health data show that a significant fraction of the nation’s waterborne disease outbreaks are attributable to premise plumbing. We report the draft genome sequences of seven Legionella pneumophila serogroup 1 isolates from hot water lines of a large building.
ABSTRACT
Public health data show that a significant fraction of the nation’s waterborne disease outbreaks are attributable to premise plumbing. We report the draft genome sequences of seven Legionella pneumophila serogroup 1 isolates from hot water lines of a large building. Genomic analysis identified the isolates as belonging to sequence type 1.
ANNOUNCEMENT
Legionella pneumophila is a Gram-negative bacterium and is the major causative agent of Legionnaires’ disease (1); serogroup 1 has been implicated in most cases associated with built environments (2). Legionella spp. colonize and persist in building water systems despite disinfection strategies that aim to mitigate their presence (3), due to the complexity of the pipe network (4). Regardless of the public health relevance of Legionella spp., very little information is available about their occurrence in large buildings (5).
Strains were isolated from drinking water obtained from branch lines serving a hot water system in a large occupational building. A 100-ml aliquot of each sample was concentrated by membrane filtration (0.2-μm pore size) and resuspended in 5 ml of Butterfield’s phosphate buffer, and 100 μl was cultured on buffered charcoal-yeast extract (BCYE) selective agar (catalog number R110107; Remel, Lenexa, KS) for 5 days at 35°C. The agar is used for the selective recovery of L. pneumophila from potable water samples and contained vancomycin and anisomycin to suppress contaminating flora. A single colony was transferred to Remel BCYE agar (catalog number R01334), incubated for 2 days at 35°C, and screened with the Legionella pneumophila latex agglutination kit (Oxoid Ltd., Basingstoke, UK). Latex agglutination confirmed the identification of the seven isolates as L. pneumophila serogroup 1. DNA was extracted using the UltraClean DNA microbial isolation kit, following the manufacturer’s instructions (Mo Bio Laboratories, Solana Beach, CA). Genomic libraries were prepared using the Nextera XT index kit v2 set A and sequenced on the HiSeq 4000 platform (Illumina, Inc., San Diego, CA) with a HiSeq 3000/4000 paired-end (PE) cluster kit (2 × 150 bp).
A total of 197,184,931 reads were generated. Prior to assembly, libraries were cleaned from adapters and phiX artifacts, error corrected, normalized (≤100×), and filtered to a minimum length of 100 nucleotides (nt) using the software package BBMap v37.90 (ktrim=r k = 23 mink = 11 hdist = 1 tbo tpe maxns = 0 trimq = 10 qtrim=r maq = 12 minlength = 100 ecco=t eccc=t ecct=t target = 100) (6). A reference-assisted de novo assembly approach were used to assemble the processed reads using Unicycler v0.4.4 (7). Average nucleotide identity (ANI) was calculated using the ANI calculator (8). ANI, an index of similarity between two genomes (9), was used to verify the taxonomic affiliation of the isolates (10). The in silico multilocus sequence type (MLST) based on seven alleles (11, 12) was obtained using MentaLiST v0.2.3 (13). Default parameters were used for all software unless otherwise specified. Genome quality and statistics were estimated with BBMap and annotated with Prokka v1.13.1 (14) (Table 1).
TABLE 1.
Summary statistics of whole-genome assemblies
| Strain | Coverage (×) | Genetic element | No. of contigs | Assembly size (bp) | Contig N50 (bp) | G+C content (%) | Gene annotation data (no.)a |
GenBank accession no. | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Genes | CDS | rRNAs | tRNAs | ||||||||
| L01C.1 | 193 | Chromosome | 10 | 3,518,411 | 2,171,451 | 38.33 | 3,361 | 3,308 | 9 | 43 | QWDR00000000 |
| Plasmid 1 | 1 | 129,883 | 129,883 | 37.46 | |||||||
| Plasmid 2 | 1 | 51,330 | 51,330 | 39.53 | |||||||
| L02C.1 | 193 | Chromosome | 4 | 3,527,736 | 2,108,061 | 38.32 | 3,171 | 3,117 | 9 | 44 | QWDS00000000 |
| L03B.1 | 190 | Chromosome | 10 | 3,523,457 | 2,176,402 | 38.34 | 3,362 | 3,309 | 9 | 43 | QWDT00000000 |
| Plasmid 1 | 1 | 129,883 | 129,883 | 37.46 | |||||||
| Plasmid 2 | 1 | 51,330 | 51,330 | 39.53 | |||||||
| L04A.1 | 190 | Chromosome | 9 | 3,501,953 | 2,172,681 | 38.32 | 3,150 | 3,097 | 9 | 43 | QWDU00000000 |
| L05B.1 | 192 | Chromosome | 10 | 3,527,329 | 2,180,468 | 38.36 | 3,310 | 3,257 | 9 | 43 | QWDV00000000 |
| Plasmid 1 | 1 | 129,883 | 129,883 | 37.46 | |||||||
| L08A.1 | 194 | Chromosome | 9 | 3,512,166 | 2,171,040 | 38.32 | 3,303 | 3,250 | 9 | 43 | QWDW00000000 |
| Plasmid 1 | 1 | 129,883 | 129,883 | 37.46 | |||||||
| L09A.1 | 197 | Chromosome | 9 | 3,510,538 | 2,181,930 | 38.36 | 3,294 | 3,241 | 9 | 43 | QWDX00000000 |
| Plasmid 1 | 1 | 129,883 | 129,883 | 37.46 | |||||||
CDS, coding sequences.
ANI calculations between isolates revealed an average genome similarity of 99.96%. The proposed cutoff for Legionella subspecies is 96% (15). The isolates share an average of 99.97% ANI with L. pneumophila subsp. pneumophila strain OLDA (16) and 91.87% to 90.55% ANI with L. pneumophila subsp. fraseri strain Lansing 3 and L. pneumophila subsp. pascullei strain U8W, respectively (15). In silico MLST identified the isolates as sequence type 1 (ST1) (11, 12), and gene annotation confirmed the presence of the lpeAB genes encoding a macrolide efflux pump, which confers reduced sensitivity to azithromycin (17, 18).
Data availability.
The whole-genome shotgun project has been deposited in DDBJ/ENA/GenBank under the accession numbers listed in Table 1. The raw sequence reads have been submitted to the NCBI SRA under the accession numbers SRR8523645, SRR8523642, SRR8523643, SRR8523644, SRR8523646, SRR8523647, and SRR8523648. The versions described in this paper are the first versions.
ACKNOWLEDGMENTS
We thank Chris Bennett for valuable assistance on this project.
The opinions expressed are those of the authors and do not necessarily reflect the official positions and policies of the U.S. EPA. Any mention of product or trade names does not constitute recommendation for use by the U.S. EPA.
REFERENCES
- 1.Phin N, Parry-Ford F, Harrison T, Stagg HR, Zhang N, Kumar K, Lortholary O, Zumla A, Abubakar I. 2014. Epidemiology and clinical management of Legionnaires' disease. Lancet Infect Dis 14:1011–1021. doi: 10.1016/S1473-3099(14)70713-3. [DOI] [PubMed] [Google Scholar]
- 2.Cunha BA, Burillo A, Bouza E. 2016. Legionnaires' disease. Lancet 387:376–385. doi: 10.1016/S0140-6736(15)60078-2. [DOI] [PubMed] [Google Scholar]
- 3.Berjeaud JM, Chevalier S, Schlusselhuber M, Portier E, Loiseau C, Aucher W, Lesouhaitier O, Verdon J. 2016. Legionella pneumophila: the paradox of a highly sensitive opportunistic waterborne pathogen able to persist in the environment. Front Microbiol 7:486. doi: 10.3389/fmicb.2016.00486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gilbert JA, Stephens B. 2018. Microbiology of the built environment. Nat Rev Microbiol 16:661–670. doi: 10.1038/s41579-018-0065-5. [DOI] [PubMed] [Google Scholar]
- 5.David S, Mentasti M, Lai S, Vaghji L, Ready D, Chalker VJ, Parkhill J. 2018. Spatial structuring of a Legionella pneumophila population within the water system of a large occupational building. Microb Genom 4. doi: 10.1099/mgen.0.000226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Bushnell B. 2016. BBMap short-read aligner, and other bioinformatics tools. http://sourceforge.net/projects/bbmap/.
- 7.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing read. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Yoon SH, Ha SM, Lim JM, Kwon SJ, Chun J. 2017. A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie Van Leeuwenhoek 110:1281–1286. doi: 10.1007/s10482-017-0844-4. [DOI] [PubMed] [Google Scholar]
- 9.Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM. 2007. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol 57:81–91. doi: 10.1099/ijs.0.64483-0. [DOI] [PubMed] [Google Scholar]
- 10.Figueras MJ, Beaz-Hidalgo R, Hossain MJ, Liles MR. 2014. Taxonomic affiliation of new genomes should be verified using average nucleotide identity and multilocus phylogenetic analysis. Genome Announc 2:e00927-14. doi: 10.1128/genomeA.00927-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Gaia V, Fry NK, Afshar B, Luck PC, Meugnier H, Etienne J, Peduzzi R, Harrison TG. 2005. Consensus sequence-based scheme for epidemiological typing of clinical and environmental isolates of Legionella pneumophila. J Clin Microbiol 43:2047–2052. doi: 10.1128/JCM.43.5.2047-2052.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ratzow S, Gaia V, Helbig JH, Fry NK, Luck PC. 2007. Addition of neuA, the gene encoding N-acylneuraminate cytidylyl transferase, increases the discriminatory ability of the consensus sequence-based scheme for typing Legionella pneumophila serogroup 1 strains. J Clin Microbiol 45:1965–1968. doi: 10.1128/JCM.00261-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Feijao P, Yao HT, Fornika D, Gardy J, Hsiao W, Chauve C, Chindelevitch L. 2018. MentaLiST—a fast MLST caller for large MLST schemes. Microb Genom 4. doi: 10.1099/mgen.0.000146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- 15.Kozak-Muiznieks NA, Morrison SS, Mercante JW, Ishaq MK, Johnson T, Caravas J, Lucas CE, Brown E, Raphael BH, Winchell JM. 2018. Comparative genome analysis reveals a complex population structure of Legionella pneumophila subspecies. Infect Genet Evol 59:172–185. doi: 10.1016/j.meegid.2018.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mercante JW, Morrison SS, Raphael BH, Winchell JM. 2016. Complete genome sequences of the historical Legionella pneumophila strains OLDA and Pontiac. Genome Announc 4:e00866-16. doi: 10.1128/genomeA.00866-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Vandewalle-Capo M, Massip C, Descours G, Charavit J, Chastang J, Billy PA, Boisset S, Lina G, Gilbert C, Maurin M, Jarraud S, Ginevra C. 2017. Minimum inhibitory concentration (MIC) distribution among wild-type strains of Legionella pneumophila identifies a subpopulation with reduced susceptibility to macrolides owing to efflux pump genes. Int J Antimicrob Agents 50:684–689. doi: 10.1016/j.ijantimicag.2017.08.001. [DOI] [PubMed] [Google Scholar]
- 18.Massip C, Descours G, Ginevra C, Doublet P, Jarraud S, Gilbert C. 2017. Macrolide resistance in Legionella pneumophila: the role of LpeAB efflux pump. J Antimicrob Chemother 72:1327–1333. doi: 10.1093/jac/dkw594. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The whole-genome shotgun project has been deposited in DDBJ/ENA/GenBank under the accession numbers listed in Table 1. The raw sequence reads have been submitted to the NCBI SRA under the accession numbers SRR8523645, SRR8523642, SRR8523643, SRR8523644, SRR8523646, SRR8523647, and SRR8523648. The versions described in this paper are the first versions.
