Skip to main content
Microbial Genomics logoLink to Microbial Genomics
. 2016 Aug 25;2(8):e000068. doi: 10.1099/mgen.0.000068

Massive dispersal of Coxiella burnetii among cattle across the United States

Sonora Olivas 1, Heidie Hornstra 1, Rachael A Priestley 2, Emily Kaufman 1, Crystal Hepp 1,3, Derek L Sonderegger 4, Karthik Handady 1, Robert F Massung 2, Paul Keim 1,5, Gilbert J Kersh 2, Talima Pearson 1,
PMCID: PMC5320587  PMID: 28348863

Abstract

Q-fever is an underreported disease caused by the bacterium Coxiella burnetii, which is highly infectious and has the ability to disperse great distances. It is a completely clonal pathogen with low genetic diversity and requires whole-genome analysis to identify discriminating features among closely related isolates. C. burnetii, and in particular one genotype (ST20), is commonly found in cow’s milk across the entire dairy industry of the USA. This single genotype dominance is suggestive of host-specific adaptation, rapid dispersal and persistence within cattle. We used a comparative genomic approach to identify SNPs for high-resolution and high-throughput genotyping assays to better describe the dispersal of ST20 across the USA. We genotyped 507 ST20 cow milk samples and discovered three subgenotypes, all of which were present across the entire country and over the complete time period studied. Only one of these sub-genotypes was observed in a single dairy herd. The temporal and geographic distribution of these sub-genotypes is consistent with a model of large-scale, rapid, frequent and continuous dissemination on a continental scale. The distribution of subgenotypes is not consistent with wind-based dispersal alone, and it is likely that animal husbandry and transportation practices, including pooling of milk from multiple herds, have also shaped the patterns. On the scale of an entire country, there appear to be few barriers to rapid, frequent and large-scale dissemination of the ST20 subgenotypes.

Keywords: Coxiella burnetii; Q fever, genotyping; dairy cattle; Phylogeography; SNP typing; canSNP

Data Summary

  1. Genomes CMSC1, CMCA1c1, and ESFL1 have been deposited in GenBank; accession numbers: SRR3347458, SRR3347491 and SRR3347474: http://www.ncbi.nlm.nih.gov/Traces/sra/?run=SRR3347458;

    http://www.ncbi.nlm.nih.gov/Traces/sra/?run=SRR3347491;

    http://www.ncbi.nlm.nih.gov/Traces/sra/?run=SRR3347474;

Impact Statement

Pathogen dissemination is typically limited by environmental survival, infectious dose, dispersal mechanisms and pathogen–host interactions. Understanding these characteristics is key for prevention and management of disease. Coxiella burnetii, the causative agent of Q fever in humans, survives well in the environment, is highly infectious, and readily dispersed via wind. C. burnetii ST20 is highly prevalent in cow’s milk, and even though it does not often cause human disease, it serves as a pragmatic model for understanding the dispersal of a pathogen with seemingly few, if any, dispersal barriers. We developed genotyping assays to subdivide this common genotype in an evolutionarily meaningful context. We then genotyped C. burnetii DNA extracted from cow milk collected through intensive longitudinal sampling and from processing facilities across the USA. Our results show that ST20 has persisted long enough for mutations to accumulate and that these variants are distributed across space and time. This is consistent with large-scale, rapid, frequent and continuous dissemination on a continental scale. Animal husbandry and transportation practices probably influence this distribution, which is not consistent with wind-based dispersal alone. At least for this clone of C. burnetii, there appear to be few dispersal barriers.

Introduction

Coxiella burnetii is the etiological agent of the zoonotic disease Q fever. It causes an acute febrile illness that can result in pneumonia, hepatitis, and endocarditis in humans (Maurin & Raoult, 1999). It is endemic in ruminants yet difficult to detect as it often causes asymptomatic or subclinical disease; however, visible symptoms include abortion and premature delivery (Agerholm, 2013). C. burnetii is highly infectious and as few as one to ten viable cells can cause disease in humans (Benenson & Tigertt, 1956). The environment can become contaminated with C. burnetii when shed in feces, urine, milk and placental material from infected animals. Once in the environment, the bacteria persist in a spore-like form for extensive lengths of time, even under non-ideal conditions (Kershet al., 2013; Scott & Williams, 1990). C. burnetii is easily aerosolized and can be transferred by wind over long distances. In 1998, 147 cases of Q fever were reported in Birmingham, UK and tied to outdoor lambing at a single farm and dispersal by unusually strong winds (Hawkeret al., 1998). While attack rates decreased with distance from the source farm, the main cluster of cases stretched as far away as 18 km (Hawkeret al., 1998). Examples of aerosol-based transmission and dispersal of C. burnetii are common (Nusinoviciet al., 2015; Tissot-Dupontet al., 2004). High infectivity, environmental persistence, and ease of aerosolization all have important implications for dispersal of this pathogen and pose the question of whether there are any barriers that may limit or slow its dissemination.

C. burnetii is distributed across the world (with the possible exception of New Zealand), however geographic associations are limited to a few individual genotypes that seem to coexist with widely distributed genotypes (D'Amatoet al., 2016). The widespread distribution of most genotypes may be due to an almost complete lack of barriers to dissemination that can result in frequent, widespread and rapid dispersion that prevents such geographic associations. Unfortunately, studies exploring mechanisms and patterns of geographic spread are limited. Genotyping studies involving isolates collected around the world show that many genotypes have a widespread geographic distribution (Glazunova et al., 2005; Hornstra et al., 2011). Many regional genotyping studies have employed multiple locus VNTR analysis (MLVA) (Arricau-Bouvery et al., 2006; Svraka et al., 2006), however this method is not high-throughput, inter-lab comparisons using MLVA data can be problematic and the lack of a standardized MLVA scheme further complicates comparisons of results (Sidi-Boumedine et al., 2015). Using PCR-based assays (Hornstra et al., 2011) to detect single-nucleotide polymorphisms (SNPs) that are diagnostic for clades defined by multispacer sequence typing (MST) (Glazunova et al., 2005). Pearson et al. (2014) and Bauer et al. (2015) conducted high-throughput surveys of genotypes associated with commercial milk across the USA and within the state of Indiana, USA, respectively. These studies showed that ST20 and ST8 were common across the USA and more likely to be associated with cows and goats, respectively (Bauer et al., 2015; Pearson et al., 2014). Together, these results indicate that if geographic associations exist, higher resolution genotyping coupled with intensive sampling is needed to detect it. To test this hypotheses on C. burnetii dispersal, we revisit the ST20 samples characterized by Pearson et al. (2014), include additional ST20 samples and, importantly, use higher resolution genotyping assays to search for geographic and temporal associations. The higher resolution is possible due to additional genomic sequences from ST20 isolates.

Methods

Multi-locus variable number tandem repeat analysis (MLVA) has shown higher levels of resolution than multispacer sequence typing (MST) (Glazunovaet al., 2005) and even variation within ST20 (Astobizaet al., 2012). However, MLVA is not well-suited for genotyping the low levels of C. burnetii DNA contained in complex backgrounds that are recovered from environmental and animal sampling. Most importantly, it is not clear how MLVA genotypes are related to each other, further limiting the phylogenetic utility of MLVA in understanding dispersal patterns. Single nucleotide polymorphisms (SNPs) are ideal for accurately depicting evolutionary relationships among clonal bacteria (Pearsonet al., 2004, 2009), including C. burnetii (Pearsonet al., 2013). Furthermore, assays based on SNP signatures are ideal for low-level genotyping of C. burnetii (Pearsonet al., 2014). However, comparisons of closely-related whole genome sequences are required to find the rare SNP signatures capable of high-resolution discrimination (Eppingeret al., 2014; Scholzet al., 2014), and phylogenetic discovery bias (Pearsonet al., 2004) must be taken into account when SNP loci are characterized in samples whose genomes are not sequenced. We therefore sequenced and compared three C. burnetii isolates from the USA and belonging to ST20 to find SNPs, developed genotyping assays to target those SNPs, and tested the assays against the 507 ST20 samples from our previous study (Pearsonet al., 2014). In addition, we included an extension of the time series presented there.

Culturing and DNA isolation for whole-genome sequencing.

Three C. burnetii isolates (CMSC1, CMCA1c1 and ESFL1) were grown axenically in Acidified Citrate Cysteine Medium 2 (ACCM2) following the methods of Omslandet al. (2011). DNA was extracted and purified using the QIAamp DNA mini kit tissue protocol (Qiagen) according to manufacturer directions.

Two of these isolates were derived from unpasteurized cow’s milk purchased in South Carolina (CMSC1) and California (CMCA1c1) in 2007. The third isolate, ESFL1, was from an environmental sample collected from soil at a cow dairy in June 2008 in Florida.

Whole-genome sequencing and SNP detection.

DNA libraries were prepared for multiplexed, paired-end sequencing on GAIIx sequencing platforms (Illumina). Sequence data generated have been submitted to the Sequence Read Archive (CMSC1, SRR3347458; CMCA1c1, SRR3347491; and ESFL1, SRR3347474). SNP detection among the three whole-genome sequences presented here and the three whole-genome sequences of ST20 from NCBI (Table S1, available in the online Supplementry Material) was performed using the Northern Arizona SNP Pipeline (NASP) (http://tgennorth.github.io/NASP/) and the In Silico Genotyper pipeline (Sahlet al., 2015) with RSA 493 (NC_002971.3) as the reference. All SNPs (excluding those that were autapomorphic for the reference) are given in Table S2 available in the online supplementary material.

Phylogenetics of ST20 isolates.

We used parsimony criteria and a heuristic search with default options using PAUP 4.0b10 (Swofford, 2003). As a measure of accuracy, we report homoplasies using the consistency index (Archie, 1996); bootstrapping is a poor method for measuring accuracy for trees with little homoplasy (Felsenstein, 1985) such are recovered when using SNPs from clonal organisms like C. burnetii (Pearsonet al., 2013). The phylogeny was rooted using RSA493 which was previously established as an outgroup to ST20 (Pearsonet al., 2013). Phylogenetic branches were named according to precedent (Hornstraet al., 2011), assigned to SNPs (Table S2), and designated on trees (Fig. S1, available in the online Supplementary Material). In short, each branch contains a prefix ‘Br.III’ to indicate the genomic group, followed by an assigned number based upon the order of branch discovery within the clade. For SNPs that are specific to a single genome, the genome name, rather than a branch number is used. This method maintains the branch name from previous publications and allows for the identification of novel branches. These SNPs (Table S2) can therefore serve as a resource for others looking to differentiate among isolates in this clade. Novel genome sequences will cause existing branches to be split, requiring additional branch names and updating the branch designation of these SNPs. For evolutionarily stable characters such as SNPs found in clonal organisms like C. burnetii, a single locus can define a branch and thus serve as a ‘canonical SNP’ (Hornstraet al., 2011). As such, the character states of only a small number of SNP loci need to be interrogated in order to place an unknown strain into the established phylogenetic order.

Assay development for increased resolution of ST20.

TaqMan® minor groove binding (MGB) non-fluorescent quencher (NFQ) dual-probe genotyping assays (Supplementary Appendix A) were designed for SNPs on the branch connecting two of the three genomes from the isolates from the USA. This branch was targeted as the other genomes were not available at the beginning of this study. These assays allowed for the potential division of U.S. ST20 samples into three sub-ST20 genotypes (Fig. 1b). Both TaqMan® assays were designed using Primer Express® Software v3.0.1 and optimized through rigorous testing of the limits of each assay (Supplementary Appendix A).

Fig. 1.

Fig. 1.

Phylogeography of samples. (a) Map shows the location of the sampled bovine herd from Michigan (star), and milk processing plants (squares) that produced samples with ST20 genotypes. Large squares indicate longitudinally sampled processing plants. Shapes include a pie chart to indicate the distribution of ST20 subgenotypes. White portions of pie charts indicate ST20, but also that a subgenotype of ST20 could not be established. (b) Phylogenetic tree depicting known MST genotypes and the location of ST20 (inset). An expanded phylogeny of ST20 shows the location of samples with available genome sequences, the number of SNPs on each branch (Table S2), the location of assays (red arrows), and a color designation for each subgenotype. This maximum-parsimony tree was reconstructed using 82 SNPs from whole-genome comparisons of six ST20 genomes and RSA493 (used to root the tree).

Subgenotyping of ST20 derived from bulk tank milk

DNA extracted from bulk tank milk samples (n = 507) from processing plants across the USA, previously identified as positive for C. burnetii DNA, and genotyped as ST20 (Pearsonet al., 2014) were screened across both the Cox1910969 and Cox1336818 assays developed here (Supplementary Appendix A). DNA was tested in duplicate to ensure consistency of results. Cox1910969 assay conditions were as follows: 1 µl of DNA was added to form a total reaction volume of 10 µl that contained 5 µl of 2× TaqMan® Universal PCR Master Mix (Life Technologies), 0.20 µl of each 20 µM primer, 0.10 µl of the 20 µM FAM MGB-NFQ probe, 0.15 µl of the 20 µM VIC MGB-NFQ probe, and 3.35 µl of sterile, molecular-grade water. Cox1336818 reaction conditions were identical to the above but with the following exceptions: 0.06 µl of each 20 µM MGB-NFQ probe and 3.48 µl of sterile, molecular-grade water were added. No-template-controls were tested on each PCR plate to allow the detection of cross-contamination during DNA template addition. Positive template controls of whole-genome amplified DNA samples that were known to be of the ancestral allele state and the derived allele state for both assays were also tested on all PCR plates to detect assay or instrument errors. Thermal cycling conditions for both assays were as follows: 50 °C for 2 min., 95 °C for 10 min., followed by 45 cycles of 95 °C for 15 sec. and 60 °C for 1 min. PCR was performed on a 7900HT Fast real-time PCR system (Applied Biosystems) with SDS v2.4 software. Allele calls were made based upon amplification of competitive probes, with a manual threshold of 0.1 and an automatic baseline. The derived allele state for Cox1336818 (T), is indicative of the presence of genotype 20.3. The ancestral allele for Cox1910969 (C), is indicative of genotype 20.1. Samples that show the ancestral allele for Cox1336818 (C) and the derived allele for Cox1910969 (T) contain genotype 20.2 (Fig. 1b). For either assay, the amplification of both probes in a given sample is typically evidence of a mixture (see Supplementary Appendix A). The amplification of both probes in both assays is indicative of the presence of all three genotypes in a single sample. The amplification of both probes for one assay and a single probe for the other assay is indicative of two genotypes. Given the low amounts of C. burnetii DNA in some samples, we expected some assays to fail. When both genotyping assays failed, we were not able to determine which subgenotype(s) within ST20 was present. The failure of one assay did not preclude our ability to determine subgenotype(s) as a derived call for Cox1336818 is indicative of genotype 20.3. Similarly, an ancestral call for Cox1910969 is indicative of genotype 20.1. Both assays needed to amplify to identify the presence of genotype 20.2. The presence of an undetermined genotype(s) was noted. In some cases, the amplification of both probes for one assay and a failure of the other assay is indicative of one genotype as well as an undetermined genotype.

Geographic and temporal distribution of ST20 subgenotypes in bovine milk.

All subgenotype results were compiled and represented on a map of the USA (Fig. 1a) to depict the geographic distribution of ST20 subgenotypes. In addition, we graphed all subgenotype results for the five milk processing plants that were sampled at least monthly from 2009 to 2014 as a function of time in order to determine whether there was a temporal distribution of ST20 subgenotypes among processing plants.

For all samples, we tested for any geographic structuring among four quadrants of the country (northeast, southeast, southwest and northwest) using a Chi-squared test as implemented in the R base package (chisq.test), and post-hoc tests using the fdr method as implemented in the R package fifer (chisq.post.hoc). We also performed statistical tests for the correlation between presence of subgenotypes and of geographic and temporal structuring of the five longitudinally sampled facilities. We tested the correlation of the presence of the subgenotype in two ways: we performed a log-linear analysis (Agresti & Kateri, 2011) and, to verify the result, we also performed a non-parametric permutation test on the determinant correlation matrix (a determinant near one represents uncorrelated variables).

To explore the temporal structure, we analyzed each subgenotype separately and fit generalized additive models (Wood, 2006) using a cyclic cubic regression spline with a within-year smoothed term for each milk processing plant. Statistical significance was assessed via likelihood ratio test statistics compared with the asymptotic χ2 distribution. To account for analyzing each subgenotype separately, we adjust the P-values using a Bonferroni adjustment factor of three.

Results

Phylogenetics of current ST20 isolates

The three European ST20 isolates from NCBI were all isolated from bovine placental material or a bovine vaginal swabs and group separately from genomes from the USA (Fig. 1b). Excluding the SNPs that were autapomorphic for the reference strain, there were a total of 52 SNPs. Maximum-parsimony analyses yielded a single phylogenetic tree with a consistency index of 1.000 as no SNPs were homoplastic.

ST20 subgenotypes are ubiquitious in bulk milk samples throughout the USA

The locations indicated by the expanded squares in Fig. 1(a) were sampled at least monthly from May 2010 to July 2014 by purchasing milk at local stores that had been processed in these locations as indicated by the plant code on each bottle. The smaller squares on the map indicate milk processing plants sampled to assess geographic distribution of C. burnetii genotypes (Fig. 1a) (Pearsonet al., 2014). The expanded star indicates the location of a single farm in Michigan from which the bulk milk tank and individual animals were sampled according to the methods of Pearsonet al. (2014). The assays used here on ST20 samples allow for the detection of three distinct ST20 subgenotypes. Of the 507 samples, 95 samples contained genotype 1 (GT_20.1), 141 samples contained genotype 2 (GT_20.2), 103 samples contained genotype 3 (GT_20.3), 55 samples contained more than one genotype, and 236 samples contained an undetermined genotype(s), meaning that they were ST20 but the presence of an exact subgenotype could not be determined if one or both of our assays failed. It is important to note that in many instances, the subgenotype could be determined even if one of the assays failed. For example, the derived allele state for Cox1336818 (T), is indicative of the presence of genotype 20.3, independent of the results, or lack thereof for Cox1910969. These failures were likely to be due to low levels of target DNA. All three subgenotypes were found across the USA (Fig. 1a) and some processing plants contained more than one genotype. There were statistically significant differences in the composition of subgenotypes across different regions of the USA [χ2 (6, n=674)=46.77, P.adj <0.001]. The genotype composition sampled from the northwest was significantly different from the other three regions (P.adj <0.001), driven by the high proportion of genotype 20.2 from the single processing plant A. This is consistent with comparisons of only the longitudinally sampled facilities which also suggest that milk from different processing plants differs in genotype compositions (see below).

ST20 subgenotypes are present through time in bulk tank milk in the USA

The presence or absence of each genotype over time across five longitudinally sampled processing plants show a strong difference in percentage presence between plants but little temporal structure (Fig. 2). These data also show differences by processing facility for each subgenotype: genotype 20.1 [χ2 (4, n=182)=17.69, P.adj <0.01], genotype 20.2 [χ2 (4, n=286)=23.46, P.adj <0.01], genotype 20.3 [χ2 (4, n=206)= 24.33, P.adj <0.01). Although the prevalence of each subgenotype differed by processing plant, there was little readily observable geographic pattern. For example, the genotype proportions from the two plants in Arizona differed despite close geographic proximity. The presence or absence of one genotype does not provide any information about the presence or absence of another genotype. We found no significant correlation between subgenotype presence and absence among any of the five processing plants sampled between 2009 and 2014 (permutation test: det(R)=0.99, P=0.48; log-linear test: [χ2 (3, n=398)=4.66, P.adj =0.20]. Results of the temporal analysis show no significant yearly trends with the single exception of an August peak for processing facility ‘A’ for genotype 20.2 [χ2 (8, n=398)=8.383, P.adj =0.0205]. The statistically significant differences among processing facilities and temporal trends explained a maximum of 17 % deviance and thus are likely to have little biological significance at this level of phylogenetic and geographic resolution.

Fig. 2.

Fig. 2.

Longitudinal presence and absence of subgenotypes across processing plants.

Only a single ST20 subgenotype was found on a single farm at one time point

In a single herd in Michigan, genotype 20.1 was detected in a sample from a single cow and in two samples from the bulk milk tank.

Discussion

Coxiella burnetii is prevalent around the world. It is endemic in ruminants and contaminates the environment through shedding in urine, milk, fecal and placental material. It has the ability to persist in the environment without degradation for long periods of time, is easily aerosolized, and can be transmitted over a few kilometers. ST20 is highly prevalent in cow’s milk across the USA (Pearsonet al., 2014), and even though it is rarely associated with human disease, its high prevalence makes it a pragmatic model for better understanding of how this species is dispersed, persists and perhaps could be controlled.

The whole genome SNP phylogeny of three isolates from the USA and three European isolates of ST20 shows that members of this clade have persisted long enough for SNP mutations to arise. The basal position of CbB1 (France) and the derived position of the ESFL1/CMCA1 clade suggest that ST20 may have arisen in Europe and subsequently spread to North America. The short branch (1 SNP) that separates the CbB1 lineage from the other genomes and the subsequent polytomy leading to a European clade (CbB18/EVCbBK10) and two groups in the USA is indicative of a radiation that may indicate a rapid dissemination from Europe to the USA where multiple lineages became ecologically established. While these hypotheses are consistent with the data generated from the SNP-based assays presented here, more SNP loci and more genomes and samples collected from Europe and other regions need to be tested to confirm or refute this global model of ST20 evolution.

To best illustrate dissemination patterns and phylogeographic structuring, high-throughput whole-genome sequencing of these DNA from milk samples would provide the best estimate of circulating diversity and relationships among samples. Unfortunately, current sequencing technologies are not capable of providing adequate genome coverage for environmental samples containing low levels of target DNA in complex mixtures. However, sequencing a small number of isolates and comparing their genomes can lead to the discovery of SNPs that can be converted into high-throughput and highly sensitive assays that are amenable to environmental samples. This approach can provide additional resolution and phylogenetic insights into the behavior of a microbial population that may otherwise be difficult to study (Pearsonet al., 2004, 2009). The two new ST20 assays presented here (Cox1910969 and Cox1336818) provide increased phylogenetic resolution of the ubiquitous ST20 genotype of C. burnetii that dominates dairy bovine populations in the USA and appears to be common throughout the world. These two assays, in conjunction with the ST20 assay (Cox56bp10) allow for the segregation of three ST20 genotypes. The future development of additional assays based upon other SNPs within this phylogeny (Table S2) will provide further insights into the diversity of C. burnetii in commercial bovine milk in the USA and the presence or absence of geographic segregation within any of these ST20 genotypes. Although we could not identify SNPs capable of differentiating the European and American isolates that were sequenced, it is possible that European ST20 samples are distinct from samples from the USA, however, SNPs that provide resolution within GT_20.1 (Table S2) would need to be assayed against samples from the USA and Europe, the latter of which were not available for the current study. If such phylogeographic structuring exists and GT_20.1 is dominant in Europe, the assays presented here would not provide additional resolution among European isolates. Conversely, the lack of a European-specific ST20 subclade would also be indicative of massive dissemination and the lack of dispersal obstacles on a global scale. If this were the case, genotypes 20.2 and 20.3 would be expected among European samples.

Pearsonet al. (2014) proposed a model of rapid spread and subsequent persistence of ST20 in the USA. While dispersal of small-cell variants in a quiescent phase can occur over long periods of time with no evolutionary change, the widespread distribution of ST20 would have required massive amplification by animal hosts. Mutation rate estimates are not currently available for C. burnetii, limiting our ability to quantify dispersal speed, however as ST20 was not found in the USA prior to 2007 and previous samples from cattle in 1947 and 1958 were a different ST (Hornstraet al., 2011), this dispersal must have occurred in less than approximately 50 years (Pearsonet al., 2014). Our characterization here of three subgenotypes of ST20 dispersed across the USA and across time is consistent with this model but adds elements of dispersal frequency and pathogen population size. The coexistence of multiple subgenotypes of ST20 across the country demonstrates a general lack of geographic structuring at the country scale, although genotype compositional differences do exist (Fig. 1a). That we found only one subgenotype within a single herd may be suggestive of spatial structure at the herd level, however. The genotype proportions among the longitudinally sampled facilities do show a correlation with facility, however the extent to which this pattern is geographically related (rather than facility-related) appears to be small. Similarly, the temporal coexistence of ST20 subgenotypes suggests the lack of temporal patterns. Frequent convergent and independent evolution of genotypes provides an alternative mechanism by which these three subgenotypes could be widespread with structured populations. However, convergent evolution is rare in C. burnetii (Hornstraet al., 2011; Pearsonet al., 2013) and is non-existent here, given the complete lack of homoplasy among the ST20 genomes studied. While these three subgenotypes show little, if any, geographic or temporal structure across the USA, our results do not preclude the likelihood that such structure may exist within a subgenotype or within a herd as our results from the single dairy farm in Michigan suggest.

In any case, our results provide evidence of rapid, frequent and continuous dissemination on almost a continental scale. The distribution patterns of genotypes, coupled with the high prevalence of ST20 throughout the country, is consistent with a lack of genetic bottlenecks and the likely involvement of a large portion of the pathogen population in dispersal. The coexistence of multiple subgenotypes and the persistence of this coexistence through time is also consistent with this model of continuous, population-wide dispersal and not due to ongoing adaptation within ST20 subtypes. In addition, coexistence of subgenotypes demonstrates the ability for novel types to become established and persist with other subgenotypes. The speed, frequency and extent of population-wide involvement in dispersal would be even better defined by further phylogenetic resolution and quantification of mutation rate to define the speed of dispersal.

Geographic or temporal patterns may exist at the level of an individual herd. Such structure would be largely undetectable with these samples as they contain mixtures of milk from an unknown number of separate farms. Our detection of only one genotype in a single bovine herd sampled from Michigan (Fig. 1a) suggests the possibility of population structuring at the herd level. Nonetheless, our results suggest that continual, frequent and widespread movement of variants has resulted in massive dispersal of ST20 on a continental scale.

The mechanisms driving these regional and potentially global patterns of dissemination and persistence are not known and difficult to imagine. However, they are likely to involve animal husbandry and transportation practices coupled with wind dispersal and long-term environmental viability of C. burnetii. The degree to which ST20 in other regions and other sequence types share these dispersal characteristics is unknown, however some genotypes have only been reported in certain countries (D'Amatoet al., 2016). Certain genotypes of C. burnetii (ST20 and ST8) are thought to be evolutionarily adapted to cows and goats, respectively (Pearsonet al., 2014). The extent to which other genotypes are host-adapted and the propensity for host movement may play a large role in defining the dispersal of each genotype as well as the ability for different genotypes to coexist. Efficient and effective control efforts for this pathogen must be based upon a thorough understanding of the mechanisms of local, regional and global dispersal and persistence of different genotypes.

Acknowledgements

This work was supported by a grant from the Department of Homeland Security (HSHQDC-10-C-00139). We thank Nicolette Janke and Adina Doyle for DNA sequencing.

Supplementary Data

Supplementary File 1

Abbreviations:

VNTR

variable-number tandem repeat

MLVA

multiple-locus, VNTR analysis

ST20

sequence type 20 according to multispacer sequence typing

NASP

Northern Arizona SNP Pipeline

ISG

In Silico Genotyper

GT_20.X

subgenotypes of ST20

References

  1. Agerholm J. S.(2013). Coxiella burnetii associated reproductive disorders in domestic animals – a critical review. Acta Vet Scand 5513. 10.1186/1751-0147-55-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Agresti A., Kateri M.(2011). Categorical Data Analysis. Berlin, Heidelberg: Springer. [Google Scholar]
  3. Archie J. W.(1996). Measures of homoplasy. Homoplasy: The Recurrence of Similarity in Evolution 153–188. Edited by Sanderson M. J., Hufford L.San Diego: Academic Press. [Google Scholar]
  4. Arricau-Bouvery N., Hauck Y., Bejaoui A., Frangoulidis D., Bodier C. C., Souriau A., Meyer H., Neubauer H., Rodolakis A., et al. (2006). Molecular characterization of Coxiella burnetii isolates by infrequent restriction site-PCR and MLVA typing. BMC Microbiol 638. 10.1186/1471-2180-6-38 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Astobiza I., Tilburg J. J., Piñero A., Hurtado A., García-Pérez A. L., Nabuurs-Franssen M. H., Klaassen C. H.(2012). Genotyping of Coxiella burnetii from domestic ruminants in northern Spain. BMC Vet Res 8241. 10.1186/1746-6148-8-241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bauer A. E., Olivas S., Cooper M., Hornstra H., Keim P., Pearson T., Johnson A. J.(2015). Estimated herd prevalence and sequence types of Coxiella burnetii in bulk tank milk samples from commercial dairies in Indiana. BMC Vet Res 11186. 10.1186/s12917-015-0517-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Benenson A. S., Tigertt W. D.(1956). Studies on Q fever in man. Trans Assoc Am Physicians 6998–104. [PubMed] [Google Scholar]
  8. D'Amato F., Eldin C., Raoult D.(2016). The contribution of genomics to the study of Q fever. Future Microbiol 11253–272. 10.2217/fmb.15.137 [DOI] [PubMed] [Google Scholar]
  9. Eppinger M., Pearson T., Koenig S. S., Pearson O., Hicks N., Agrawal S., Sanjar F., Galens K., Daugherty S., et al. (2014). Genomic epidemiology of the Haitian cholera outbreak: a single introduction followed by rapid, extensive, and continued spread characterized the onset of the epidemic. MBio 5e01721. 10.1128/mBio.01721-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Felsenstein J.(1985). Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39783 791. 10.2307/2408678 [DOI] [PubMed] [Google Scholar]
  11. Glazunova O., Roux V., Freylikman O., Sekeyova Z., Fournous G., Tyczka J., Tokarevich N., Kovacava E., Marrie T. J., et al. (2005). Coxiella burnetii genotyping. Emerg Infect Dis 111211–1217. 10.3201/eid1108.041354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Hawker J. I., Ayres J. G., Blair I., Evans M. R., Smith D. L., Smith E. G., Burge P. S., Carpenter M. J., Caul E. O., et al. (1998). A large outbreak of Q fever in the West Midlands: windborne spread into a metropolitan area? Commun Dis Public Health 1180–187. [PubMed] [Google Scholar]
  13. Hornstra H. M., Priestley R. A., Georgia S. M., Kachur S., Birdsell D. N., Hilsabeck R., Gates L. T., Samuel J. E., Heinzen R. A., et al. (2011). Rapid typing of Coxiella burnetii. PLoS One 6e26201. 10.1371/journal.pone.0026201 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Kersh G. J., Fitzpatrick K. A., Self J. S., Priestley R. A., Kelly A. J., Lash R. R., Marsden-Haug N., Nett R. J., Bjork A., et al. (2013). Presence and persistence of Coxiella burnetii in the environments of goat farms associated with a Q fever outbreak. Appl Environ Microbiol 791697–1703. 10.1128/AEM.03472-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Maurin M., Raoult D.(1999). Q fever. Clin Microbiol Rev 12518–553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Nusinovici S., Frössling J., Widgren S., Beaudeau F., Lindberg A.(2015). Q fever infection in dairy cattle herds: increased risk with high wind speed and low precipitation. Epidemiol Infect 1433316–3326. 10.1017/S0950268814003926 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Omsland A., Beare P. A., Hill J., Cockrell D. C., Howe D., Hansen B., Samuel J. E., Heinzen R. A.(2011). Isolation from animal tissue and genetic transformation of Coxiella burnetii are facilitated by an improved axenic growth medium. Appl Environ Microbiol 773720–3725. 10.1128/AEM.02826-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Pearson T., Busch J. D., Ravel J., Read T. D., Rhoton S. D., U'Ren J. M., Simonson T. S., Kachur S. M., Leadem R. R., et al. (2004). Phylogenetic discovery bias in Bacillus anthracis using single-nucleotide polymorphisms from whole-genome sequencing. Proc Natl Acad Sci U S A 10113536–13541. 10.1073/pnas.0403844101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Pearson T., Okinaka R. T., Foster J. T., Keim P.(2009). Phylogenetic understanding of clonal populations in an era of whole genome sequencing. Infect Genet Evol 91010–1019. 10.1016/j.meegid.2009.05.014 [DOI] [PubMed] [Google Scholar]
  20. Pearson T., Hornstra H. M., Sahl J. W., Schaack S., Schupp J. M., Beckstrom-Sternberg S. M., O'Neill M. W., Priestley R. A., Champion M. D., et al. (2013). When outgroups fail; phylogenomics of rooting the emerging pathogen, Coxiella burnetii. Syst Biol 62752–762. 10.1093/sysbio/syt038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Pearson T., Hornstra H. M., Hilsabeck R., Gates L. T., Olivas S. M., Birdsell D. M., Hall C. M., German S., Cook J. M., et al. (2014). High prevalence and two dominant host-specific genotypes of Coxiella burnetii in U.S. milk. BMC Microbiol 1441. 10.1186/1471-2180-14-41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Sahl J. W., Beckstrom-Sternberg S. M., Babic-Sternberg J., Gillece J. D., Hepp C. M., Auerbach R. K., Tembe W., Wagner D. M., Keim P. S., Pearson T.(2015). The In Silico Genotyper (ISG): an open-source pipeline to rapidly identify and annotate nucleotide variants for comparative genomics applications. bioRxiv. [Google Scholar]
  23. Scholz H. C., Pearson T., Hornstra H., Projahn M., Terzioglu R., Wernery R., Georgi E., Riehm J. M., Wagner D. M., et al. (2014). Genotyping of Burkholderia mallei from an outbreak of glanders in Bahrain suggests multiple introduction events. PLoS Negl Trop Dis 8e3195. 10.1371/journal.pntd.0003195 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Scott G. H., Williams J. C.(1990). Susceptibility of Coxiella burnetii to chemical disinfectants. Ann N Y Acad Sci 590291–296. 10.1111/j.1749-6632.1990.tb42235.x [DOI] [PubMed] [Google Scholar]
  25. Sidi-Boumedine K., Duquesne V., Prigent M., Yang E., Joulié A., Thiéry R., Rousset E.(2015). Impact of IS1111 insertion on the MLVA genotyping of Coxiella burnetii. Microbes Infect 17789–794. 10.1016/j.micinf.2015.08.009 [DOI] [PubMed] [Google Scholar]
  26. Svraka S., Toman R., Skultety L., Slaba K., Homan W. L.(2006). Establishment of a genotyping scheme for Coxiella burnetii. FEMS Microbiol Lett 254268–274. 10.1111/j.1574-6968.2005.00036.x [DOI] [PubMed] [Google Scholar]
  27. Swofford, D. L. PAUP*: phylogenetic analysis using parsimony (and other methods). version 4. Sunderland, MA: Sinauer Associates. 2002.
  28. Tissot-Dupont H., Amadei M.-A., Nezri M., Raoult D.(2004). Wind in November, Q fever in December. Emerg Infect Dis 101264–1269. 10.3201/eid1007.030724 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Wood S.(2006). Generalized Additive Models: An Introduction with R. Boca Raton, FL: Chapman and Hall/CRC press. [Google Scholar]

Data Bibliography

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File 1


Articles from Microbial Genomics are provided here courtesy of Microbiology Society

RESOURCES