Summary
Objectives
Integrating pathogen genomic surveillance with bioinformatics can enhance public health responses by identifying risk and guiding interventions. This study focusses on the two predominant Campylobacter species, which are commonly found in the gut of birds and mammals and often infect humans via contaminated food. Rising incidence and antimicrobial resistance (AMR) are a global concern, and there is an urgent need to quantify the main routes to human infection.
Methods
During routine US national surveillance (2009–2019), 8856 Campylobacter genomes from human infections and 16,703 from possible sources were sequenced. Using machine learning and probabilistic models, we target genetic variation associated with host adaptation to attribute the source of human infections and estimate the importance of different disease reservoirs.
Results
Poultry was identified as the primary source of human infections, responsible for an estimated 68% of cases, followed by cattle (28%), and only a small contribution from wild birds (3%) and pork sources (1%). There was also evidence of an increase in multidrug resistance, particularly among isolates attributed to chickens.
Conclusions
National surveillance and source attribution can guide policy, and our study suggests that interventions targeting poultry will yield the greatest reductions in campylobacteriosis and spread of AMR in the US.
Data availability
All sequence reads were uploaded and shared on NCBI’s Sequence Read Archive (SRA) associated with BioProjects; PRJNA239251 (CDC / PulseNet surveillance), PRJNA287430 (FSIS surveillance), PRJNA292668 & PRJNA292664 (NARMS) and PRJNA258022 (FDA surveillance). Publicly available genomes, including reference genomes and isolates sampled worldwide from wild birds are associated with BioProject accessions: PRJNA176480, PRJNA177352, PRJNA342755, PRJNA345429, PRJNA312235, PRJNA415188, PRJNA524300, PRJNA528879, PRJNA529798, PRJNA575343, PRJNA524315 and PRJNA689604. Contiguous assemblies of all genome sequences compared are available at Mendeley data (assembled C. coli genomes doi: 10.17632/gxswjvxyh3.1; assembled C. jejuni genomes doi: 10.17632/6ngsz3dtbd.1) and individual project and accession numbers can be found in Supplementary tables S1 and S2, which also includes pubMLST identifiers for assembled genomes. Figshare (10.6084/m9.figshare.20279928). Interactive phylogenies are hosted on microreact separately for C. jejuni (https://microreact.org/project/pascoe-us-cjejuni) and C. coli (https://microreact.org/project/pascoe-us-ccoli).
Keywords: Campylobacteriosis, Gastroenteritis, Source attribution, Machine learning, Chicken consumption
Introduction
Campylobacter is the most common cause of bacterial gastro-intestinal infections in high-income countries including the United States. Currently, the US Centers for Disease Control and Prevention (CDC) estimates approximately 17 cases per 100,000 people, equating to 1.5 million illnesses each year with many more un-diagnosed or unreported cases.1,2 There are differences in disease reporting, based on bacterial culturing and culture-independent methods,3 and many more cases go unreported as symptoms, including nausea, diarrhoea, and vomiting, typically subside within five days. However, over 13,000 cases annually require hospitalisation, of which around 1% are fatal.4 Occasional persistent and debilitating sequelae, such as reactive arthritis, Guillian-Barré and irritable bowel syndromes intensify the public health impact.5,6 In serious cases, infections may require treatment with antibiotics. However, as for some other gastrointestinal pathogens, antimicrobial resistance (AMR) is increasing,7–10 limiting treatment options. Control of multidrug resistant (MDR) C. jejuni is now considered a high priority by the World Health Organization (WHO),11 and there is an urgent need to identify the relative contribution to human infection of different sources of this pervasive foodborne pathogen.
Understanding the origin of human campylobacteriosis is challenging. The ubiquity of C. jejuni and C. coli in animal intestines, and its shedding into the environment, leads to numerous infection routes. Recent confirmed outbreaks have been traced to contaminated dairy products,12 contact with pets,13–15 chicken livers,16–19 raw peas (wild birds),20 and shellfish,21 but these represent only a small number of cases.22 Most human infections are sporadic, likely representing a continuous stream of contamination from diffuse sources.22–26 Indirect evidence of infection sources has come from risk assessments and case-control studies,27–29 but DNA-sequence-based methods have provided opportunities for promising novel approaches. Campylobacter has adapted to colonise multiple host species, leaving signatures in the genome. Thus, the origin of isolates that infect humans can be determined by analysing the clinical isolate genomes. Source attribution studies have been employed to identify poultry as the primary source of infections in several European countries.24,26,30–32 However, the importance of different disease sources varies among countries and regions,23,33–35 and there are currently no reliable, routine source attribution estimates for the US.
Public health organizations in the US have been among the fastest to incorporate genome sequencing technologies into routine national surveillance.36 Cross-agency data sharing, including Campylobacter genomes from clinical infections and host sources, is ideal for quantitative source attribution modelling to determine the relative contribution of different infection sources from genotype data. In this study, we analysed Campylobacter genomes from US national surveillance programs using machine learning and probabilistic source attribution to estimate the sources of human C. jejuni and C. coli isolates since 2009. Campylobacter infection can be considered through a “one health” perspective, where rapid intensification in agricultural practices has contributed to the evolution of the pathogen37 and widespread use of antibiotics has been selected for increased rates of antibiotic resistance.38,39 Increasing numbers of human infections can be attributed to MDR isolates from poultry sources. This represents a significant public health risk and provides important information for interventions and infection control.
Methods
Sampling and microbiology
Human clinical campylobacteriosis cases in the US have been nationally notifiable since 2015, and the CDC has supported collection and sequencing of ~9000 Campylobacter isolates since 2009 (Supplementary tables S1 & S2). Our dataset included C. jejuni and C. coli isolates collected from confirmed campylobacteriosis cases, and historical cases reported nationally by state and local public health laboratories. These were grouped within 10 US health and human services (HHS) regions to de-identify the source state (Supplementary Fig. S1). Following the removal of potential duplicate samples, the collection comprised 8856 genomes. No ethical approval was required as all isolates were collected as part of routine diagnosis and surveillance.
Context isolates were collected from livestock animals contemporaneously by three US public health agencies. These were supplemented with a previously published global collection of genomes from wild birds40 and environmental isolates.41 Genomic signatures of host source are robust to spatial and temporal sample variation in Campylobacter,42 therefore isolate genomes collected at different times from multiple countries can be used in source attribution models. This strong host signal can be observed among black birds, gulls, sandpipers/dunlins, and ducks/geese from Europe and Australia that are found together in discrete phylogenetic clusters, separate from livestock,37 despite the considerable distances between samples.43
The Food Safety and Inspection Service (FSIS) is part of the national system to monitor the effectiveness of controls to reduce Campylobacter and other foodborne pathogens in raw meat and poultry products. As part of its role in routine surveillance, FSIS has employed whole genome sequencing to identify pathogens of concern in poultry products since 2013 (specifically, Campylobacter since 2019) (Supplementary table S2). Sampling was conducted across the US, including from animal faeces and food. FSIS USDA sampled chicken, turkey, cattle, and swine and animal food products at slaughter and processing facilities. Animal ceca were sampled immediately after slaughter. Turkey carcasses were swabbed; chicken carcasses and parts were rinsed, while portions of ground chicken and turkey and retail meat were collected for testing. Campylobacter was isolated using direct plating or enrichment using methods as described in the USDA’s Microbiology Laboratory Guidebook Chapter 41.
Retail poultry isolates were included in the study from the NARMS (National Antimicrobial Resistance Monitoring System) retail meat surveillance program at the Food and Drug Administration’s (FDA) Center for Veterinary Medicine (CVM). FDA collaborates with state public health and academic partners to collect and test retail poultry products. Isolation of Campylobacter from poultry samples was performed at participating laboratories using methods outlined in the NARMS Manual of Laboratory Methods, Retail Meat Isolation Protocol. Whole genome sequencing of isolates was conducted, and sequence data were uploaded to NCBI. Species confirmation and antimicrobial susceptibility testing were then performed at FDA laboratories.
Genome sequencing, assembly, and quality control
Sequencing was performed locally by laboratories with standard CDC operating procedures for Campylobacter, using Illumina sequencers (San Diego, CA).44 Sequence reads were downloaded and assembled using FLASH (version 1.2.11)45 and SPAdes (version 3.12.0)46 (accession numbers in Supplementary tables S1 & S2). Assemblies were annotated using prokka (version 1.13).47 Quality control statistics were generated using QUAST (version 4.6.3),48 fastqc (version 0.11.7)49 and snippy (version 3.2).50 Non-C. jejuni or C. coli genomes were removed, as were assemblies comprising > 500 contigs, or an assembly length < 1.5 Mbp or > 2 Mbp. Datasets also included C. jejuni (NCTC 11168) and C. coli (ASM202418v1) reference strains.35,51 Datasets were augmented with additional C. jejuni and C. coli isolates from previous studies,37,52 subject to the same selection criteria. The final dataset comprised 18,306 C. jejuni isolates from humans, chickens, cows, pigs, and turkeys; and 7253 C. coli isolates from humans, cows, chickens, and wild birds (Supplementary Fig. S2;). Phylogenies were constructed using Mashtree (version 1.0.4),53 visualised and shared on microreact.54 Each isolate was assigned sequence types (STs) using the seven-locus MLST scheme incorporated in pubMLST55,56 (Supplementary tables S1 & S2; Supplementary Fig. S3). Isolates that shared five or more MLST alleles were grouped into defined clonal complexes (CCs)57,58 or C. coli clades and clustered on the phylogeny with previously typed isolates.59
Random forest identification of pangenome host segregating markers between reservoir sources
Isolates were grouped into food categories according to the Interagency Food Safety Analytics Collaboration (IFSAC) categories designed for harmonisation among public health agencies.60,61 C. jejuni isolates from chicken, cattle, and wild bird sources, and C. coli isolates from chicken, cattle, pork, and turkey sources were included in separate, species-specific random forest analyses (Supplementary Fig. S2).
Unitigs were generated from assemblies using unitig counter (version 1.1.0),62 and unitigs found in the same subset of isolates were grouped into a single presence-absence pattern.63 Pattern distribution was scored using mutual information (MI) in each reservoir source population.64 Statistically, MI measures the marginal association between the two variables. Biologically, it gives a measure of how effectively a pattern (and ultimately unitigs) can distinguish the source in question from every other source, where greater values represent a greater separation between the source and every other source. Patterns with the greatest source-associated MI score (100,000 per source) were chosen for machine learning (ML) attribution analysis. Previous ensemble ML studies using Campylobacter genomes have identified random forest analyses, with XGBoost on a balanced cgMLST dataset, as an optimal approach for source attribution.65 A weighted random forest using equal numbers of isolates from each source population was generated using the R package ranger (version 0.12.1)66 and repeated 10 times. Each model was trained to predict source populations using patterns and 10,000 bootstrap trees were trained, and out-of-bag predictions calculated. The robustness of each model was tested via recalculation using 10 different seeds.
A pangenome was constructed from a subset of isolates. To ensure that these represented diversity across C. jejuni and C. coli, unitig covariation patterns were determined for all isolates and 2000 isolates were chosen per source. These included covariation patterns with the highest MI between sources, such that all selected patterns were present in at least 20 isolates. Reference strains for each species were also added and a pangenome was constructed using PIRATE (version 1.0.4)67 with the option to generate pangenome alignments. Unitigs were mapped to the pangenome alignment using bwa fastmap (version 0.7.17),68 with the maximum interval size set to a number greater than the number of isolates and the position of every match was returned. Full length matches at unique positions that did not border different genes were analysed and the number of hits and highest MI score was recorded for each matched unitig in each source population. Five genes with the greatest number of matches and five genes with the greatest MI scores per source population were carried forward as host-segregating markers.
Probabilistic attribution of the source of human infection isolates
Source attribution models were employed to predict the most likely source host for each human campylobacteriosis case. Allele numbers were assigned to isolate genomes for each of the host segregating markers (Supplementary table S3) and compared with the current best performing core genome multi-locus sequence typing (MLST) method for source attribution. Missing alleles, novel alleles, and unique combinations of multicopy alleles were each given a typing number starting at 10,000. C. coli isolates from turkey and chicken sources were merged into a single source (poultry) for attribution analyses with iSource, as there was a significant overlap between markers that could accurately segregate the sources.24,32 The iSource algorithm was run with a thinning interval of 1 for 100,000 iterations, reporting every 50 iterations. 10 runs of self-attribution were performed for each species, masking the source of one-third of isolates from non-clinical populations. Human clinical isolates were then attributed separately for C. jenuni and C. coli and compared across health regions and putative sources.
Comparison with core genome MLST method for source attribution
To further verify our findings, the putative source of human infections was also attributed using cgMLST profiles and implementing machine learning algorithms in aiSource.65 Profiles were exported from PubMLST using their cgMLST loci (C. jejuni-C.coli cgMLST v1.0).55,69 Incomplete or missing alleles were replaced by the flag “NA” and treated as a distinct allele using the catboost algorithm, which can incorporate categorical data. No under-sampling or hyperparameter optimisation was used in training the model. A verification dataset was constructed of human C. coli (n = 696), human C. jejuni (n = 8160), non-human C. coli (n = 6538) and non-human C. jejuni (n = 10,187) isolates. Non-human C. coli isolates sampled from environmental or wild bird sources were grouped as “other”, resulting in a dataset comprising chicken (n = 3755), cattle (n = 1181), pig (n = 1112), turkey (n = 398) and other (n = 91) isolates. C. jejuni isolates sampled from environmental, turkey or pig sources were grouped as “other”, resulting in a dataset comprising chicken (n = 5660), cattle (n = 3842), wild birds (n = 328) and other (n = 356). All scripts are available on GitHub (https://github.com/narning1992/aiSource).
Identification of antimicrobial resistance elements
Nucleotide sequences for all genomes were screened using StarAMR (version 0.4.0)70 for the presence of known AMR determinants against ResFinder71 and point mutations with PointFinder (databases updated April 2020).72 A positive hit was defined when a gene had > 75% nucleotide identity over > 50% of the sequence length. A matrix of gene presence for every antimicrobial resistance gene was generated for every genome. Comparisons were made between species, sampled sources, and assigned sources following attribution.
Results
In total, 8856 isolates were collected and sequenced from confirmed human campylobacteriosis between 2009–2019, with most identified as C. jejuni (n = 8160; 92%; Supplementary table S1). The remaining isolates were predominantly C. coli (n = 696; 8%), consistent with the relative prevalence of species in similar surveillance studies from other high-income countries.73 Contemporaneous isolates were also collected and sequenced by the CDC, FSIS, and the FDA during this period from possible Campylobacter infection sources (n = 16,703; Supplementary table S2). These source data comprise C. jejuni (n = 10,146) and C. coli (n = 6557) isolates from chicken (n = 9395), cattle (n = 5023), pork (n = 1175), turkey (n = 497), wild bird (n = 365) and environmental (n = 248) sources (Fig. 1A).
Fig. 1. Isolate genomes used in this study.
(A) Overview of the number of isolates from each source, including chicken (yellow; n = 9395), cattle (green; n = 5023), swine (pink; n = 1175), clinical cases (grey; n = 8856), environmental sources (blue; n = 248), turkey (orange; n = 497) and a global collection from birds (purple; n = 365). Samples were collected from all over the US, with states that contributed higher numbers of samples coloured more deeply on the maps. (B) Phylogeny of 18,306 C. jejuni genomes constructed using mashtree, coloured by source. The two large host generalist clonal complexes (ST-21 CC and ST-45 CC) are labelled along with the chicken-associated clonal complex.74 (C) Mash phylogeny of 7253 C. coli coloured by source. The large host generalist clonal complex, ST-828 CC and three ancestral clades (Clades 1, 2 and 3) are labelled along with the highly introgressed ST-1150 clonal complex.41,58,75
Population structure reveals host-associated C. jejuni and C. coli lineages
Clusters of isolates from the same host were clearly visible on C. jejuni and C. coli phylogenies (Fig. 1B, 1C), however there were differences between species. Common C. jejuni clonal complexes (CCs) included source-associated sub-lineages (Fig. 1B). CCs containing 10 or more human clinical isolates were analysed (Supplementary Fig. S3). There was evidence of cattle- and chicken-associated sub-lineage clustering within the ST-21 CC, one of the most common lineages found in human infections worldwide. The major chicken-associated clade contained many lineages where non-human samples were only collected from chickens, including the ST-464, ST-353, ST-574, ST-607, ST-52, ST-460, ST-443 and ST-354 CCs, although the ST-257 CC also included isolates from cattle. Cattle-associated lineages included the ST-22, ST-42, ST-403 and ST-508 CCs. Wild bird isolates also displayed strong clustering within the ST-692, ST-1287, ST-179, ST-177, ST-1034 and ST-1332 CCs. Clonal complexes containing multiple turkey isolates, such as the ST-179 and ST-692 CCs, also contained wild bird isolates. Environmental isolates also clustered with wild bird isolates in the ST-177, ST-1034 and ST-1332 CCs although the ST-574 CC contained both environmental and chicken isolates. In contrast to C. jejuni, most C. coli isolates belonged to a single clonal complex, the ST-828 CC. Within this clonal complex, isolates clearly segregated by host source (Fig. 1C).
Identification of host segregating markers in C. jejuni and C. coli pangenomes
Contigs from assembled C. jejuni and C. coli genomes were partitioned into 5,653,690 unitigs capturing variation in the core and accessory genomes. These were collapsed to groups of unitigs with shared presence-absence patterns (n = 3,580,741). Random forests were generated for C. jejuni (Fig. 2), and C. coli (Fig. 3) isolates using 298,330 and 350,575 source-associated patterns and scored for accuracy using mutual information (MI). There was much clearer partitioning between patterns that identified C. jejuni isolates from cattle and chicken, than those that identified wild bird isolates (Fig. 2A, B). Chicken and cattle patterns also demonstrated the highest discriminatory power (MI score) to identify potential C. coli sources (Fig. 3A, B). A pangenome was constructed which included the 2000 best performing patterns per potential source population, to which unitigs were mapped and the loci (genes) within identified. To improve computational runtime and develop a schema for public health use, the 30 most accurate unitigs were selected for each species, which encompassed 24 C. jejuni and 22 C. coli genes (Figs. 2C & 3C; Supplementary table S3).
Fig. 2. Overview of C. jejuni attribution study.
(A) Random Forest analyses were performed on training data from chicken, cattle, and wild bird sources to score patterns of unitigs according to their ability to predict isolate source. Dots within each box show how common each pattern is in the two intersecting hosts. Patterns within the small, dotted boxes are common in the host on the horizontal axis, but rare in the host labelled on the vertical axis. (B) Patterns of unitigs with highly discriminatory mutual information (MI) scores were used to select markers for different hosts. (C) Genes from which unitig markers were selected and assigned allele numbers for attribution. (D) Markers were tested on a subset of the data for accuracy (overall accuracy > 90%); and (E) used to predict the source of 8160 C. jejuni infection cases.
Fig. 3. Overview of C. coli attribution study.
(A) Random Forest analyses were performed on training data from chicken, cattle, turkey, and pig sources to score patterns of unitigs according to their ability to predict isolate source. Dots within each box show how common each pattern is in the two intersecting hosts. Patterns within the small, dotted boxes are common in the host on the horizontal axis, but rare in the host labelled on the vertical axis. Patterns of unitigs with highly discriminatory mutual information (MI) scores were used to select markers for different hosts. (C) Genes from which unitig markers were selected and assigned allele numbers for attribution. (D) Markers were tested on a subset of the data for accuracy (overall accuracy > 90%). (E) Chicken and turkey markers were combined to predict the source of 696 C. coli infection cases from poultry, cattle, and pigs.
Chicken is the dominant source of US human infections
Isolates of each species from our clinical and attribution datasets were typed and assigned allele values for each of the selected host segregating markers (unitigs). Accuracy in identifying the correct source of an isolate was tested by masking the source of a random third of isolates from the training dataset (isolates from known sources) (Figs. 2D & 3D). Using an asymmetric island model (iSource32), 10 self-attribution tests were performed and averaged. The overall attribution accuracy was 90% for both C. jejuni and C. coli. Poultry, cattle, and pork (C. coli) isolates were correctly attributed more often than wild bird (C. jejuni) isolates (Supplementary table S4). Nearly three-quarters of all human C. jejuni cases (n = 8160) were assigned to chicken sources (67.9%; n = 5538), more than a quarter to cattle sources (29.2%; n = 2384), and a small percentage to wild bird sources (2.9%; n = 238) (Fig. 2E, Table 1). Similarly, C. coli cases (n = 696) were also pre-dominantly assigned to poultry sources (70.1%; n = 491). However, nearly a fifth of cases were assigned to pork sources (17.1%; n = 119) and only a small amount to cattle (12.4%; n = 86) (Fig. 3E, Table 1). Overall, the majority of campylobacteriosis cases were attributed to poultry (68.1%; n = 6029) and cattle (27.9%; n = 2470), with only a small contribution from wild bird (2.7%; n = 238) and pork sources (1.3%; n = 119; Supplementary table S5). Similar results were obtained using the current best performing core genome MLST (cgMLST) source attribution model (aiSource), which when applied to human cases, also predicted chicken as the primary source of C. jejuni infections (Supplementary methods; Supplementary table S6).
Table 1. Putative source of infection for 8856 campyobacteriosis cases in the US between 2009 and 2019.
| Poultry | Cattle | Pork | Wild bird | |
|---|---|---|---|---|
| C. jejuni (n = 8160) | 5538 (67.9%) | 2384 (29.2%) | – | 238 (2.9%) |
| C. coli (n = 696) | 491 (70.1%) | 86 (12.4%) | 119 (17.1%) | – |
| Total (n = 8856) | 6029 (68.1%) | 2470 (27.9%) | 119 (1.3%) | 238 (2.7%) |
The relative contribution of each infection source reservoir varies between states
Poultry (chicken in C. jejuni; chicken and turkey combined in C. coli) was identified as the primary source of human campylo-bacteriosis in all 10 HHS regions (Supplementary Fig. S4). The proportion of cases (C. jejuni and C. coli, combined) attributed to poultry sources ranged from 49% to 87% (Supplementary table S5). In 7/10 HHS regions more than 70% of campylobacteriosis cases could be attributed to poultry sources. This is consistent with other studies that have identified chicken as the primary source of campylo-bacteriosis in the UK, France, Germany, and other countries.23–26,32,33,34,73 The lowest proportion of cases attributed to poultry sources was 49% of cases from HHS region 8 (which in-corporates Colorado, Montana, North and South Dakota, Utah, and Wyoming). In this region, a significant proportion of cases were attributed to cattle (49% of C. jejuni cases) and pork (26% of C. coli cases) sources (Supplementary table S5). According to National Agricultural Statistics Service (NASS) data, this HHS region was not heavily involved in the poultry industry (https://quickstats.nass.usda.gov/; Supplementary table S7).
Chicken is an increasing source of multidrug resistance Campylobacter in human infections
The general trend over recent years has been for an increasing number of campylobacteriosis cases to be attributed to poultry sources. When combined with incidence rates from the CDC FoodNet Fast tool, the number of infections attributed to poultry sources rose from 22.8% in 2009 to 71.0% in 2018 (Fig. 4A; Supplementary table S5; https://wwwn.cdc.gov/foodnetfast/). The poultry-associated clonal complexes, ST-353, ST-443, ST-179, ST-607, ST-508, ST-464, ST-354, ST-460, ST-283 and ST-257 alone accounted for nearly 25% more infections in 2018 than in 2009 (Supplementary table S8). Alongside a documented increase in the incidence of cases, the proportion of multidrug resistant (MDR) isolates (resistance to three or more anti-microbial classes) has also increased since 2009, from 12.3% to 22.3% in 2018 (Fig. 4B; Supplementary table S9); this increase was specifically driven by an increase in fluroquinolone resistance in isolates that infect humans (Fig. 4B; Supplementary table S10).
Fig. 4. Rise in chicken-associated campylobacteriosis and multi-drug resistance in the US.
(A) Heat map of the rise in multidrug resistant Campylobacter across the US since 2009. Darker red colouring indicates increasing rates of MDR isolates in 2018 compared to 2009. Yellow fractions of pie charts reflect the proportion of isolates from that HSS health region that was predicted to be from chicken sources in our attribution study. Pie chart radius reflects incidence per 100,000 people in 2018 (FoodNet Fast tool: https://wwwn.cdc.gov/foodnetfast/). (B) Plot of rising campylobacteriosis incidence (left axis; black line), driven by an increase in the proportion of fluoroquinolone resistant isolates (right axis; purple line).
Discussion
The global poultry industry rapidly expanded following the Second World War and continues to grow. Chicken is now the primary source of low-cost protein and a staple of the American diet.76 Competition in the 1950s transformed the chicken industry, centralising breeding stocks with a limited number of companies.77 As a result, the market today is dominated by very few companies that supply breeding stock for the global poultry market, and a single breeder supplies primary stock chickens to more than 130 countries and represents 44% of the global poultry market share.78 This rapid expansion in poultry production, alongside a massive decline in chicken diversity, has brought significant challenges in food safety and animal welfare, including infection control.
Addressing these challenges, multiple public health agencies across the US conduct routine surveillance of food-borne pathogens, including Campylobacter from clinical infections and potential source populations. While specific clusters of isolates have been linked to outbreak events traced to pets,13,14 raw milk,79 and chicken livers,19 most infections are considered sporadic with no clearly definable point source. Consistent with this, isolates from known outbreaks represented only 0.5% of the cases submitted to FoodNet between 2004 and 2011.80 In these circumstances it may be necessary to consider alternative methods for describing disease spread.
Here, by combining genomic data from clinical infections and potential sources, collected by multiple US public health agencies, we were able to assess the relative importance of common food sources for human infection. Most cases in the US were attributed to poultry sources across all HHS regions. This is consistent with previous surveillance efforts of individual states81 and European public health agencies.23,24,30,32,73 Differences in the relative proportion of cases attributed to poultry have been observed between countries, states, and populations and these have been explained by local demographic (urban v rural)4 or dietary31,33 differences. However, to date, all national scale source attribution studies have identified poultry as a major reservoir for human infection.
Antimicrobial resistance (AMR) among Campylobacter is a concern, not only because this reduces treatment options for serious and persistent infections but also because AMR genes are mobile between strains and species. Widespread fluoroquinolone and macrolide resistance has been reported worldwide82,83 and our approach to identifying putative AMR, based on inferred gene function, significantly augments these findings. In particular, isolates belonging to poultry-associated lineages (e.g., ST-353/ST-354 clonal complexes) were found to harbour the most AMR elements.84 As clinical isolates reflect populations in source hosts, increasing cases from poultry could increase AMR in human infections. Consistent with this, enteric pathogens from US travellers returning from Asia and South America,8 where antibiotic use in livestock is sometimes less regulated,6 have high levels of quinolone resistance. The association of AMR with C. jejuni and C. coli isolates from poultry suggests that even in high-income countries, where AMR is thought to be stabilising,85 reducing infection from poultry will have a major impact.
Probabilistic source attribution studies have been successful for assigning Campylobacter disease cases to putative source reservoirs.23,26,30,33,65,86–88 The fundamental principal is that host adaptation leaves signatures in the Campylobacter genome that transcend geographic variation.42,89,90 Therefore, when human disease isolates are sequenced, their source can be inferred through genome comparisons. The efficacy of source attribution is largely dependent upon the representativeness of the isolate collection and the specificity of the attribution model. Addressing the need for representative data, our multi-agency cooperative study, combined public health, agricultural, and food regulatory sectors, to create very large-scale surveillance datasets. While this is undoubtedly the largest genomic attribution study to date, it is important to note that US surveillance is heavily weighted towards poultry, beef, and pork (and their source animals) due to perceived risk. Other sources may cause human disease, but this will not impact the primary finding that chicken is the major source. For example, companion animals such as cats and dogs have been implicated as a risk factor for Campylobacter infection.91 However, their contribution appears to be small, and they may be infected from similar sources to humans.13,14
The biology of the bacteria imposes a limit on the specificity of the attribution model. To be informative, genomes used in the model must contain host segregating markers. This requires that the respective isolates have undergone a period of physically isolated (allopatric) evolution in a specific host, long enough to accumulate host-associated genomic signatures. Regular host switching erodes these signatures74,92 and it is impossible to attribute an isolate from human infection if an identical strain inhabits two reservoir hosts.73 To maximise the identification of host signals, we used machine learning to guide marker selection.33,65 In most cases, there was sufficient host segregation to identify host markers. By employing ML methods for marker selection and subsequent attribution, these can be updated in future studies of larger genome collections, with a broader suite of potential infection sources.
Effective public health management is dependent upon evidence to guide interventions and policy. US Campylobacter management has included National Advisory Committee (NACMCF) discussions (200793), with minimum standards announced for chickens and turkeys in 2011 and 2016.94 Building on these data, our attribution study provides evidence for an increase in the number of cases attributed to poultry sources (since 2011). While there is some difference in attribution between states, this clearly suggests that poultry should be a primary target for reducing the spread of Campylobacter and disease incidence with AMR strains. Implementing routine national-scale pathogen genomic surveillance and machine learning source attribution has great utility for monitoring infection trends and supporting interventions to reduce infections by Campylobacter, and potentially other pathogens.
Supplementary Material
Appendix A. Supporting information
Supplementary data associated with this article can be found in the online version at doi:10.1016/j.jinf.2024.106265.
Acknowledgements
We would like to thank Peter S. Evans, from the USDA Food Safety and Inspection Service, who instigated this project and contributed throughout with helpful and considered feedback. We express our gratitude for the countless efforts and contributions of the Centers for Disease Control and Prevention (CDC), the state health department, and CDC staff who collected, transmitted, and verified the data included in this analysis. Additionally, we extend appreciation to US federal public health partners (CDC, FSIS, FDA, IFSAC, NARMS, FoodNet). The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention. Use of trade names and commercial sources is for identification only and does not imply endorsement by the Centers for Disease Control and Prevention, the Public Health Service, or the U.S. Department of Health and Human Services.
Funding
UK research was funded by UKRI and Wellcome Trust grants awarded to SKS: MR/V001213/1; MR/S009264/1; MR/L015080/1; MR/T030062/1; 088786/C/09/Z. The funders had no role in the design and conduct of the study, collection, management, analysis, and interpretation of the data, preparation, review, or approval of the manuscript. Researchers from US public health agencies provided the data, and analysis was conducted by Oxford University. The findings of the study were not influenced by US public health agencies. All high-performance computing was performed on MRC CLIMB. This publication used the PubMLST website (http://pubmlst.org/) developed by Keith Jolley and Martin Maiden and hosted at the University of Oxford. DJW was funded by the Robertson Foundation, the Wellcome Trust and the Royal Society (101237/Z/13/B).
Footnotes
Author contributions
BP, BBB, and SKS conceptualised the study. BP, GF, JP, SCB, EM, JKC and NA conducted analyses and MDH, KAJ, MCJM, LAJ, CGL, TG, CTP, and KKC were involved in data curation. BP, GF, JKC and SKS visualised the data. DJW, JC, EBR and KH provided feedback and guidance supporting and interpreting analysis. BP, GF, LAJ, BBB and SKS wrote the original draft and all authors contributed, edited, and approved the final manuscript. All authors had direct access to the underlying data reported in the manuscript, which has been verified by BP, GF, SCB, LAJ and SKS.
Declaration of Competing Interest
All authors declare no conflict of interest.
References
- 1.Collier SA, Deng L, Adam EA, Benedict KM, Beshearse EM, Blackstock AJ, et al. Estimate of burden and direct healthcare cost of infectious waterborne disease in the United States. Emerg Infect Dis. 2021;27:140–9. doi: 10.3201/eid2701.190676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Collins JP, Shah HJ, Weller DL, Ray LC, Smith K, McGuire S, et al. Preliminary incidence and trends of infections caused by pathogens transmitted commonly through food – Foodborne Diseases Active Surveillance Network, 10 U.S. Sites, 2016–2021. MMWR Morb Mortal Wkly Rep. 2022;71:1260–4. doi: 10.15585/mmwr.mm7140a2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gu W, Dutta V, Patrick M, Bruce BB, Geissler A, Huang J, et al. Statistical adjustment of culture-independent diagnostic tests for trend analysis in the Foodborne Diseases Active Surveillance Network (FoodNet), USA. Int J Epidemiol. 2018;47:1613–22. doi: 10.1093/ije/dyy041. [DOI] [PubMed] [Google Scholar]
- 4.Geissler AL, Bustos Carrillo F, Swanson K, Patrick ME, Fullerton KE, Bennett C, et al. Increasing Campylobacter infections, outbreaks, and antimicrobial resistance in the United States, 2004–2012. Clin Infect Dis. 2017;65:1624–31. doi: 10.1093/cid/cix624. [DOI] [PubMed] [Google Scholar]
- 5.Peters S, Pascoe B, Wu Z, Bayliss SC, Zeng X, Edwinson A, et al. Campylobacter jejuni genotypes are associated with post-infection irritable bowel syndrome in humans. Commun Biol. 2021;4:1015. doi: 10.1038/s42003-021-02554-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Berumen A, Lennon R, Breen-Lyles M, Griffith J, Patel R, Boxrud D, et al. Characteristics and risk factors of post-infection irritable bowel syndrome after Campylobacter enteritis. Clin Gastroenterol Hepatol. 2021;19:1855–1863.:e1. doi: 10.1016/j.cgh.2020.07.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Luangtongkum T, Jeon B, Han J, Plummer P, Logue CM, Zhang Q. Antibiotic resistance in Campylobacter: emergence, transmission and persistence. Future Microbiol. 2009;4:189. doi: 10.2217/17460913.4.2.189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Grass JE, Kim S, Huang JY, Morrison SM, McCullough AE, Bennett C, et al. Quinolone nonsusceptibility among enteric pathogens isolated from international travelers – Foodborne Diseases Active Surveillance Network (FoodNet) and National Antimicrobial Monitoring System (NARMS), 10 United States sites, 2004–2014. PLoS One. 2019;14:e0225800. doi: 10.1371/journal.pone.0225800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wallace RL, Cribb DM, Bulach DM, Ingle DJ, Joensen KG, Nielsen EM, et al. Campylobacter jejuni ST50, a pathogen of global importance: a comparative genomic analysis of isolates from Australia, Europe and North America. Zoonoses Public Health. 2021;68:638–49. doi: 10.1111/zph.12853. [DOI] [PubMed] [Google Scholar]
- 10.Sproston EL, Wimalarathna HML, Sheppard SK. Trends in fluoroquinolone resistance in Campylobacter. Micro Genom. 2018;4:e000198. doi: 10.1099/mgen.0.000198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Tacconelli E, Carrara E, Savoldi A, Harbarth S, Mendelson M, Monnet DL, et al. Discovery, research, and development of new antibiotics: the WHO priority list of antibiotic-resistant bacteria and tuberculosis. Lancet Infect Dis. 2018;18:318–27. doi: 10.1016/S1473-3099(17)30753-3. [DOI] [PubMed] [Google Scholar]
- 12.Taylor EV, Herman KM, Ailes EC, Fitzgerald C, Yoder JS, Mahon BE, et al. Common source outbreaks of Campylobacter infection in the USA, 1997-2008. Epidemiol Infect. 2013;141:987–96. doi: 10.1017/S0950268812001744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Joseph LA, Francois Watkins LK, Chen J, Tagg KA, Bennett C, Caidi H, et al. Comparison of molecular subtyping and antimicrobial resistance detection methods used in a large multistate outbreak of extensively drug-resistant Campylobacter jejuni infections linked to pet store puppies. J Clin Microbiol. 2020;58:e00771-20. doi: 10.1128/JCM.00771-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Francois Watkins LK, Laughlin ME, Joseph LA, Chen JC, Nichols M, Basler C, et al. Ongoing outbreak of extensively drug-resistant Campylobacter jejuni infections associated with US pet store puppies, 2016-2020. JAMA Netw Open. 2021;4:e2125203. doi: 10.1001/jamanetworkopen.2021.25203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Montgomery MP, Robertson S, Koski L, Salehi E, Stevenson LM, Silver R, et al. Multidrug-resistant Campylobacter jejuni outbreak linked to puppy exposure—United States, 2016–2018. MMWR Morb Mortal Wkly Rep. 2018;67:1032–5. doi: 10.15585/mmwr.mm6737a3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Yeh H-Y, Cox NA, Hinton A, Jr, Berrang ME, Plumblee Lawrence JR, Thompson TM. Prevalence and characterization of quinolone resistance in Campylobacter spp. isolates in chicken livers from retail stores in Georgia, USA. J Food Prot. 2021;85:406–13. doi: 10.4315/JFP-21-357. [DOI] [PubMed] [Google Scholar]
- 17.Wensley A, Padfield S, Hughes GJ. An outbreak of campylobacteriosis at a hotel in England: the ongoing risk due to consumption of chicken liver dishes. Epidemiol Infect. 2020;148:e32. doi: 10.1017/S095026882000028X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Petrovska L, Tang Y, Jansen van Rensburg MJ, Cawthraw S, Nunez J, Sheppard SK, et al. Genome reduction for niche association in Campylobacter hepaticus, a cause of spotty liver disease in poultry. Front Cell Infect Microbiol. 2017;7:354. doi: 10.3389/fcimb.2017.00354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lanier WA, Hale KR, Geissler AL, Dewey-Mattia D. Chicken liver-associated outbreaks of campylobacteriosis and salmonellosis, United States, 2000-2016: identifying opportunities for prevention. Foodborne Pathog Dis. 2018;15:726–33. doi: 10.1089/fpd.2018.2489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kwan PSL, Xavier C, Santovenia M, Pruckler J, Stroika S, Joyce K, et al. Multilocus sequence typing confirms wild birds as the source of a Campylobacter outbreak associated with the consumption of raw peas. Appl Environ Microbiol. 2014;80:4540–6. doi: 10.1128/AEM.00537-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.CDC TC for DC. Multistate Outbreak of Gastrointestinal Illnesses Linked to Oysters Imported from Mexico | Multistate Outbreak of Gastrointestinal Illnesses Linked to Oysters Imported from Mexico | Vibrio Illness (Vibriosis) | CDC. 2019. [accessed Feb 8, 2022]. ⟨ https://www.cdc.gov/vibrio/investigations/rawoysters-05-19/index.html⟩.
- 22.Dunn SJ, Pascoe B, Turton J, Fleming V, Diggle M, et al. Genomic epidemiology of clinical Campylobacter spp. at a single health trust site. Micro Genom. 2018;4:e000227. doi: 10.1099/mgen.0.000227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Strachan NJC, Gormley FJ, Rotariu O, Ogden ID, Miller G, Dunn GM, et al. Attribution of Campylobacter infections in northeast Scotland to specific sources by use of multilocus sequence typing. J Infect Dis. 2009;199:1205–8. doi: 10.1086/597417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sheppard SK, Dallas JF, Strachan NJC, MacRae M, McCarthy ND, Wilson DJ, et al. Campylobacter genotyping to determine the source of human infection. Clin Infect Dis. 2009;48:1072–8. doi: 10.1086/597402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mughini Gras L, Smid JH, Wagenaar JA, de Boer AG, Havelaar AH, Friesema IHM, et al. Risk factors for campylobacteriosis of chicken, ruminant, and environmental origin: a combined case-control and source attribution analysis. PLoS One. 2012;7:e42599. doi: 10.1371/journal.pone.0042599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rosner BM, Schielke A, Didelot X, Kops F, Breidenbach J, Willrich N, et al. A combined case-control and molecular source attribution study of human Campylobacter infections in Germany, 2011–2014. Sci Rep. 2017;7:1–12. doi: 10.1038/s41598-017-05227-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Nichols GL, Richardson JF, Sheppard SK, Lane C, Sarran C. Campylobacter epidemiology: a descriptive study reviewing 1 million cases in England and Wales between 1989 and 2011. BMJ Open. 2012;2:e001179. doi: 10.1136/bmjopen-2012-001179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Frost JA, Gillespie IA, O’Brien SJ. Public health implications of campylobacter out-breaks in England and Wales, 1995-9: epidemiological and microbiological investigations. Epidemiol Infect. 2002;128:111–8. doi: 10.1017/s0950268802006799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Cui Z, Marder EP, Click ES, Hoekstra RM, Bruce BB. Nearest-neighbors matching for case-control study analyses: better risk factor identification from a study of sporadic campylobacteriosis in the United States. Epidemiology. 2022;33:633–41. doi: 10.1097/EDE.0000000000001504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Mossong J, Mughini-Gras L, Penny C, Devaux A, Olinger C, Losch S, et al. Human campylobacteriosis in Luxembourg, 2010–2013: a case-control study combined with multilocus sequence typing for source attribution and risk factor analysis. Sci Rep. 2016;6:20939. doi: 10.1038/srep20939. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Berthenet E, Thépault A, Chemaly M, Rivoal K, Docournau A, Buissonniere A, et al. Source attribution of Campylobacter jejuni shows variable importance of chicken and ruminants reservoirs in non-invasive and invasive French clinical isolates. Sci Rep. 2019;9:8098. doi: 10.1038/s41598-019-44454-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wilson DJ, Gabriel E, Leatherbarrow AJH, Cheesbrough J, Gee S, Bolton E, et al. Tracing the source of campylobacteriosis. PLoS Genet. 2008;4:e1000203. doi: 10.1371/journal.pgen.1000203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Thépault A, Méric G, Rivoal K, Pascoe B, Mageiros L, Touzain F, et al. Genome-wide identification of host-segregating epidemiological markers for source attribution in Campylobacter jejuni. Appl Environ Microbiol. 2017;83:e03085. doi: 10.1128/AEM.03085-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Pascoe B, Schiaffino F, Murray S, Meric G, Bayliss SC, Hitchings MD, et al. Genomic epidemiology of Campylobacter jejuni associated with asymptomatic pediatric infection in the peruvian amazon. PLoS Negl Trop Dis. 2020;14:e0008533. doi: 10.1371/journal.pntd.0008533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Pascoe B, Williams LK, Calland JK, Meric G, Hitchings MD, Dyer M, et al. Domestication of Campylobacter jejuni NCTC 11168. Micro Genom. 2019;5:e000279. doi: 10.1099/mgen.0.000279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lienau EK, Strain E, Wang C, Zheng J, Ottesen AR, Keys CE, et al. Identification of a salmonellosis outbreak by means of molecular sequencing. N Engl J Med. 2011;364:981–2. doi: 10.1056/NEJMc1100443. [DOI] [PubMed] [Google Scholar]
- 37.Mourkas E, Taylor AJ, Méric G, Bayliss SC, Pascoe B, Mageiros L, et al. Agricultural intensification and the evolution of host specialism in the enteric pathogen Campylobacter jejuni. Proc Natl Acad Sci USA. 2020;117:11018–28. doi: 10.1073/pnas.1917168117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.VanderWaal K, Deen J. Global trends in infectious diseases of swine. Proc Natl Acad Sci USA. 2018;115:11495–500. doi: 10.1073/pnas.1806068115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kittiwan N, Calland JK, Mourkas E, Hitchings MD, Murray S, Tadee P, et al. Genetic diversity and variation in antimicrobial-resistance determinants of non-serotype 2 Streptococcus suis isolates from healthy pigs. Micro Genom. 2022;8:000882. doi: 10.1099/mgen.0.000882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Mourkas E, Valdebenito JO, Marsh H, Hitchings MD, Cooper KK, Parker CT, et al. Proximity to humans is associated with antimicrobial-resistant enteric pathogens in wild bird microbiomes. Curr Biol. 2024;34:3955–65. doi: 10.1016/j.cub.2024.07.059. [DOI] [PubMed] [Google Scholar]
- 41.Sheppard SK, Didelot X, Jolley KA, Darling AE, Pascoe B, Meric G, et al. Progressive genome-wide introgression in agricultural Campylobacter coli. Mol Ecol. 2013;22:1051–64. doi: 10.1111/mec.12162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Sheppard SK, Colles F, Richardson J, Cody AJ, Elson R, Lawson A, et al. Host association of Campylobacter genotypes transcends geographic variation. Appl Environ Microbiol. 2010;76:5269–77. doi: 10.1128/AEM.00124-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Griekspoor P, Colles FM, McCarthy ND, Hansbro PM, Ashhurst-Smith C, Olsen B, et al. Marked host specificity and lack of phylogeographic population structure of Campylobacter jejuni in wild birds. Mol Ecol. 2013;22:1463. doi: 10.1111/mec.12144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Joseph LA, Francois Watkins LK, Chen J, Tagg KA, Bennett C, Caidi H, et al. Comparison of molecular subtyping and antimicrobial resistance detection methods used in a large multistate outbreak of extensively drug-resistant Campylobacter jejuni infections linked to pet store puppies. J Clin Microbiol. 2020;58:e00771. doi: 10.1128/JCM.00771-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Magoč T, Salzberg SL. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27:2957–63. doi: 10.1093/bioinformatics/btr507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Seemann T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–9. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- 48.Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072–5. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Andrews S. Babraham Bioinformatics – FastQC A Quality Control tool for High Throughput Sequence Data. 2010. [accessed Feb 8, 2022]. ⟨ https://www.bioinformatics.babraham.ac.uk/projects/fastqc/⟩.
- 50.Seemann T. SNIPPY: Fast bacterial variant calling from NGS reads. 2015. ⟨ https://github.com/tseemann/snippy⟩.
- 51.O’Kane PM, Connerton IF. Characterisation of aerotolerant forms of a robust chicken colonizing Campylobacter coli. Front Microbiol. 2017;8:513. doi: 10.3389/fmicb.2017.00513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Mourkas E, Yahara K, Bayliss SC, Calland JK, Johansson H, Mageiros L, et al. Host ecology regulates interspecies recombination in bacteria of the genus Campylobacter. Elife. 2022;11:73552. doi: 10.7554/eLife.73552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Katz LS, Griswold T, Morrison SS, Caravas JA, Zhang S, den Bakker HC, et al. Mashtree: a rapid comparison of whole genome sequence files. J Open Source Softw. 2019;4:1762. doi: 10.21105/joss.01762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Argimón S, Abudahab K, Goater RJE, Fedosejev A, Bhai J, Glasner C, et al. Microreact: visualizing and sharing data for genomic epidemiology and phylogeography. Micro Genom. 2016;2:e000093. doi: 10.1099/mgen.0.000093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Jolley KA, Bray JE, Maiden MCJ. Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications. Wellcome Open Res. 2018;3:124. doi: 10.12688/wellcomeopenres.14826.1. [version 1; referees: 2 approved] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Dingle KE, Colles FM, Wareing DRA, Ure R, Fox AJ, Boulton FE, et al. Multilocus sequence typing system for Campylobacter jejuni. J Clin Microbiol. 2001;39:14–23. doi: 10.1128/JCM.39.1.14-23.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Dingle KE, Colles FM, Falush D, Maiden MCJ. Sequence typing and comparison of population biology of Campylobacter coli and Campylobacter jejuni. J Clin Microbiol. 2005;43:340–7. doi: 10.1128/JCM.43.1.340-347.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Sheppard SK, Dallas JF, Wilson DJ, Strachan NJC, McCarthy ND, Jolley KA, et al. Evolution of an agriculture-associated disease causing Campylobacter coli clade: evidence from national surveillance data in Scotland. PLoS One. 2010;5:e15708. doi: 10.1371/journal.pone.0015708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Sheppard SK, Maiden MCJ. The evolution of Campylobacter jejuni and Campylobacter coli. Cold Spring Harb Perspect Biol. 2015;7:a018119. doi: 10.1101/cshperspect.a018119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Balouiri M, Sadiki M, Ibnsouda SK. Methods for in vitro evaluating antimicrobial activity: a review. J Pharm Anal. 2016;6:71–9. doi: 10.1016/j.jpha.2015.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Richardson LC, Bazaco MC, Parker CC, Dewey-Mattia D, Golden N, Jones K, et al. An updated scheme for categorizing foods implicated in foodborne disease outbreaks: a tri-agency collaboration. Foodborne Pathog Dis. 2017;14:701–10. doi: 10.1089/fpd.2017.2324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jaillard M, Lima L, Tournoud M, Mahe P, van Belkum A, Lacroix V, et al. A fast and agnostic method for bacterial genome-wide association studies: bridging the gap between k-mers and genetic events. PLoS Genet. 2018;14:e1007758. doi: 10.1371/journal.pgen.1007758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Lees JA, Galardini M, Bentley SD, Weiser JN, Corander J. pyseer: a comprehensive tool for microbial pangenome-wide association studies. Bioinformatics. 2018;34:4310–2. doi: 10.1093/bioinformatics/bty539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Cover TM, Thomas JA. Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) Hardcover; [Google Scholar]
- 65.Arning N, Sheppard SK, Bayliss S, Clifton DA, Wilson DJ. Machine learning to predict the source of campylobacteriosis using whole genome data. PLoS Genet. 2021;17:e1009436. doi: 10.1371/journal.pgen.1009436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Wright MN, Ziegler A. ranger: A fast implementation of random forests for high dimensional data in C++ and R. J Stat Softw. 2017;77:1–17. [Google Scholar]
- 67.Bayliss SC, Thorpe HA, Coyle NM, Sheppard SK, Feil EJ. PIRATE: A fast and scalable pangenomics toolbox for clustering diverged orthologues in bacteria. Gigascience. 2019;8:giz119. doi: 10.1093/gigascience/giz119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. 2013 doi: 10.48550/arxiv.1303.3997. published online March 16. [DOI] [Google Scholar]
- 69.Jolley KA, Maiden MCJ. BIGSdb: Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics. 2010;11:595. doi: 10.1186/1471-2105-11-595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Bharat A, Petkau A, Avery BP, Chen JC, Folster JP, Carson CA, et al. Correlation between phenotypic and in silico detection of antimicrobial resistance in Salmonella enterica in Canada using Staramr. Microorganisms. 2022;10:292. doi: 10.3390/microorganisms10020292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O, et al. Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother. 2012;67:2640–4. doi: 10.1093/jac/dks261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Zankari E, Allesøe R, Joensen KG, Cavaco LM, Lund O, Aarestrup FM. PointFinder: a novel web tool for WGS-based detection of antimicrobial resistance associated with chromosomal point mutations in bacterial pathogens. J Antimicrob Chemother. 2017;72:2764–8. doi: 10.1093/jac/dkx217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Cody AJ, Maiden MCJ, Strachan NJC, McCarthy ND. A systematic review of source attribution of human campylobacteriosis using multilocus sequence typing. Eurosurveillance. 2019;24:1800696. doi: 10.2807/1560-7917.ES.2019.24.43.1800696. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Sheppard SK, Cheng L, Méric G, de Haan CPA, Llarena A-K, Marttinen P, et al. Cryptic ecology among host generalist Campylobacter jejuni in domestic animals. Mol Ecol. 2014;23:2442–51. doi: 10.1111/mec.12742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Sheppard SK, McCarthy ND, Falush D, Maiden MCJ. Convergence of Campylobacter species: implications for bacterial evolution. Science. 2008;320:237–9. doi: 10.1126/science.1155532. [DOI] [PubMed] [Google Scholar]
- 76.Chan I, Franks B, Hayek MN. The ‘sustainability gap’ of US broiler chicken production: trade-offs between welfare, land use and consumption. R Soc Open Sci. 2022;9:210478. doi: 10.1098/rsos.210478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.McKenna M. Big Chicken: The Story of How Antibiotics Transformed Modern Farming and Changed the Way the World Eats: The Incredible Story of How Antibiotics … and Changed the Way the World Eats. 2017 [Google Scholar]
- 78.US Poultry Industry Manual – Broilers: breeding flocks | The Poultry Site. [accessed Feb 9, 2023]. ⟨ https://www.thepoultrysite.com/articles/fad-broilers-breeding-flocks⟩.
- 79.Davis KR, Dunn AC, Burnett C, McCullough L, Dimond M, Wagner J, et al. Campylobacter jejuni infections associated with raw milk consumption — Utah, 2014. MMWR Morb Mortal Wkly Rep. 2019;65:301–5. doi: 10.15585/mmwr.mm6512a1. [DOI] [PubMed] [Google Scholar]
- 80.Ebel ED, Williams MS, Cole D, Travis CC, Klontz KC, Golden NJ, et al. Comparing characteristics of sporadic and outbreak-associated foodborne illnesses, United States, 2004–2011. Emerg Infect Dis. 2016;22:1193. doi: 10.3201/eid2207.150833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Hudson LK, Andershock WE, Yan R, Golwalkar M, M’ikanatha NM, Nachamkin I, et al. Phylogenetic analysis reveals source attribution patterns for Campylobacter spp. in Tennessee and Pennsylvania. Microorganisms. 2021;9:2300. doi: 10.3390/microorganisms9112300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Schiaffino F, Parker CT, Paredes-Olortegui M, Pascoe B, Manzanares-Villanueva K, Garcia-Bardales PF, et al. Genomic resistant determinants of multidrug-resistant Campylobacter spp. isolates in Peru. J Glob Antiicrob Resist. 2024;36:309–18. doi: 10.1016/j.jgar.2024.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Cobo-Díaz JF, González del Río P, Álvarez-Ordóñez A. Whole resistome analysis in Campylobacter jejuni and C. coli genomes available in public repositories. Front Microbiol. 2021;12:662144. doi: 10.3389/fmicb.2021.662144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Veltcheva D, Colles FM, Varga M, Maiden MCJ, Bonsall MB. Emerging patterns of fluoroquinolone resistance in Campylobacter jejuni in the UK [1998–2018] Micro Genom. 2022;8:000875. doi: 10.1099/mgen.0.000875. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Van Vliet AHM, Thakur S, Prada JM, Mehat JW, La Ragione RM. Genomic screening of antimicrobial resistance markers in UK and US Campylobacter isolates highlights stability of resistance over an 18-year period. Antimicrob Agents Chemother. 2022;66:e0168721. doi: 10.1128/aac.01687-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Baily JL, Méric G, Bayliss S, Foster G, Moss SE, Watson E, et al. Evidence of land-sea transfer of the zoonotic pathogen Campylobacter to a wildlife marine sentinel species. Mol Ecol. 2015;24:208–21. doi: 10.1111/mec.13001. [DOI] [PubMed] [Google Scholar]
- 87.Jehanne Q, Pascoe B, Bénéjat L, Ducournau A, Buissonnière A, Mourkas E, et al. Genome-wide identification of host-segregating single-nucleotide polymorphisms for source attribution of clinical Campylobacter coli isolates. Appl Environ Microbiol. 2020;86:01787–20. doi: 10.1128/AEM.01787-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Pérez-Reche FJ, Rotariu O, Lopes BS, Forbes KJ, Strachan NJC. Mining whole genome sequence data to efficiently attribute individuals to source populations. Sci Rep 2020. 2020;10:1–16. doi: 10.1038/s41598-020-68740-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Pascoe B, Méric G, Yahara K, Wimalarathna H, Murray S, Hitchings MD, et al. Local genes for local bacteria: evidence of allopatry in the genomes of transatlantic Campylobacter populations. Mol Ecol. 2017;26:4497–508. doi: 10.1111/mec.14176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Mouftah SF, Pascoe B, Calland JK, Mourkas E, Tonkin N, Lefevre C, et al. Local accessory gene sharing among Egyptian Campylobacter potentially promotes the spread of antimicrobial resistance. Micro Genom. 2022;8:000834. doi: 10.1099/mgen.0.000834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Thépault A, Rose V, Queguiner M, Chemaly M, Rivoal K. Dogs and cats: reservoirs for highly diverse Campylobacter jejuni and a potential source of human exposure. Animals. 2020;10:838. doi: 10.3390/ani10050838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Dearlove BL, Cody AJ, Pascoe B, Méric G, Wilson DJ, Sheppard SK. Rapid host switching in generalist Campylobacter strains erodes the signal for tracing human infections. ISME J. 2016;10:721–9. doi: 10.1038/ismej.2015.149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.National Advisory Committee on Microbiological Criteria for Foods. Analytical utility of Campylobacter methodologies. J Food Prot. 2007;70:241–50. doi: 10.4315/0362-028x-70.1.241. [DOI] [PubMed] [Google Scholar]
- 94.Food Safety and Inspection Service. New performance standards for Salmonella and Campylobacter in not-ready-to-eat comminuted chicken and turkey products and raw chicken parts and changes to related agency verification procedures: response to comments and announcement of implementation schedule U.S. Department of Agriculture. 2016:7285–7300. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All sequence reads were uploaded and shared on NCBI’s Sequence Read Archive (SRA) associated with BioProjects; PRJNA239251 (CDC / PulseNet surveillance), PRJNA287430 (FSIS surveillance), PRJNA292668 & PRJNA292664 (NARMS) and PRJNA258022 (FDA surveillance). Publicly available genomes, including reference genomes and isolates sampled worldwide from wild birds are associated with BioProject accessions: PRJNA176480, PRJNA177352, PRJNA342755, PRJNA345429, PRJNA312235, PRJNA415188, PRJNA524300, PRJNA528879, PRJNA529798, PRJNA575343, PRJNA524315 and PRJNA689604. Contiguous assemblies of all genome sequences compared are available at Mendeley data (assembled C. coli genomes doi: 10.17632/gxswjvxyh3.1; assembled C. jejuni genomes doi: 10.17632/6ngsz3dtbd.1) and individual project and accession numbers can be found in Supplementary tables S1 and S2, which also includes pubMLST identifiers for assembled genomes. Figshare (10.6084/m9.figshare.20279928). Interactive phylogenies are hosted on microreact separately for C. jejuni (https://microreact.org/project/pascoe-us-cjejuni) and C. coli (https://microreact.org/project/pascoe-us-ccoli).




