Skip to main content
Microbial Genomics logoLink to Microbial Genomics
. 2019 Feb 19;5(2):e000251. doi: 10.1099/mgen.0.000251

Genomic correlates of extraintestinal infection are linked with changes in cell morphology in Campylobacter jejuni

Nicole E Wheeler 1,2,3,*, Timothy Blackmore 4, Angela D Reynolds 5, Anne C Midwinter 5, Jonathan Marshall 5, Nigel P French 5,6, Matthew S Savoian 7, Paul P Gardner 2,3,8, Patrick J Biggs 5,9,10
PMCID: PMC6421344  PMID: 30777818

Abstract

Campylobacter jejuni is the most common cause of bacterial diarrheal disease in the world. Clinical outcomes of infection can range from asymptomatic infection to life-threatening extraintestinal infections. This variability in outcomes for infected patients has raised questions as to whether genetic differences between C. jejuni isolates contribute to their likelihood of causing severe disease. In this study, we compare the genomes of ten C. jejuni isolates that were implicated in extraintestinal infections with reference gastrointestinal isolates, in order to identify unusual patterns of sequence variation associated with infection outcome. We identified a collection of genes that display a higher burden of uncommon mutations in invasive isolates compared with gastrointestinal close relatives, including some that have been previously linked to virulence and invasiveness in C. jejuni. Among the top genes identified were mreB and pgp1, which are both involved in determining cell shape. Electron microscopy confirmed morphological differences in isolates carrying unusual sequence variants of these genes, indicating a possible relationship between extraintestinal infection and changes in cell morphology.

Keywords: Campylobacter jejuni, random forest, bacteremia

Data Summary

  1. Sequence data has been deposited in the Sequence Read Archive; accession number PRJNA475221 (URL - https://www.ncbi.nlm.nih.gov/bioproject/PRJNA475221).

  2. Lists of orthologous genes, corresponding bitscore data and code used for this analysis have been deposited in GitHub (URL - https://github.com/Gardner-BinfLab/invasive_campylobacter).

Impact Statement.

Campylobacter is the largest contributor to bacterial diarrhea in the world. Most infections are self-limiting, however, a small subset result in extraintestinal infection, which is associated with an increased mortality rate. While risk factors influencing host susceptibility have been identified, it is still unclear whether genomic factors associated with the bacteria also contribute to infection outcome. In this study, we have identified genes that have accumulated unusual mutations in isolates captured from bacteremic patients. Among them were two genes involved in determining cell shape, highlighting a possible link between cell morphology and invasive potential. Our analysis showcases a high-throughput approach to the detection of functionally important sequence variation in genes which lead to measurable differences in phenotype. However, whether these mutations contributed to, or resulted from this shift in niche remains unclear.

Introduction

Campylobacter jejuni is the most common bacterial cause of diarrhea in the world [1]. Variability in the clinical presentation of infection with members of the genus Campylobacter has been a subject of interest for some time now [2], as clinical presentation can range from asymptomatic infection [3], to diarrhea [4], to invasive infections, such as meningitis and bacteremia [4–6]. Campylobacter bacteremia appears to originate from acute colitis, indicating a progression from diarrheal disease to extraintestinal infection in a subset of cases [6]. While extraintestinal infections with members of the genus Campylobacter disproportionately affect the very young, the very old and the immunocompromised, the wide range of disease presentations cannot be explained purely by host factors, indicating that differences in the bacterial pathogen may also contribute [2, 6, 7]. Over decades of study, cytotoxin production, motility and the ability to adhere to epithelial cells have been implicated as genetic factors in members of the genus Campylobacter that are associated with disease presentation [8–10]. In addition, early studies identified associations between specific genetic loci and virulence [2, 11–14], however, follow-up studies that attempted to replicate these findings have returned largely negative results [15–17]. As a result, there is still uncertainty as to which genetic factors make some isolates of members of the genus Campylobacter intrinsically more likely to cause more severe disease.

Members of the genus Campylobacter are a major cause of bacterial infections in New Zealand. New Zealand has relatively high rates of campylobacteriosis notifications compared with other countries [18], despite a major decline in human cases of campylobacteriosis in 2007/8 associated with interventions in the poultry industry [19]. In 2016 the rate of campylobacteriosis in New Zealand was 158.9 per 100, 000 people, the highest number of cases for any notifiable disease [20]. As a comparison, in the UK the rate of campylobacteriosis in 2016 was 90.8 per 100,000 [21]. Although Campylobacter bacteremia rates are believed to be under-detected and under-reported [10], of a total 59, 801 cases of Campylobacter infection in New Zealand from 2010 to the present, 167 cases (0.28 %) involved Campylobacter isolated from blood (P. Cressey, Institute for Environmental Science and Research Ltd, personal communication, 23 October, 2018). While precise recent rates of Campylobacter bloodstream infections are not available for the UK, 1,665 out of 994,791 (0.17%) of Campylobacter patients in England and Wales from 1989 to 2011 had infections involving bloodstream infection [22], and these rates have not changed significantly since then (G. Godbole, Public Health England, personal communication, 23 October 2018). These figures indicate a significant difference in both the rates of campylobacteriosis (P=2.0×10−5, odds ratio=1.7) and the incidence of subsequent bloodstream infection (P=3.6×10−9, odds ratio=1.7) between the two countries. This makes species of the genus Campylobacter infectious agents of strong relevance to public health, and indicates that there may be lifestyle and/or genetic factors of both host and pathogen that put New Zealanders at greater risk. In order to assess whether there may be genetic factors associated with extraintestinal infection in New Zealand isolates of Campylobacter jejuni, ten isolates of C. jejuni were collected at Wellington Hospital from the blood (n=9) or joint aspirate (n=1) of patients showing symptoms of Campylobacter bacteremia. These were compared with closely related reference strains from the Oxfordshire surveillance project (OXC), a genomic surveillance program investigating seasonal and temporal trends in members of the genus Campylobacter that cause gastroenteritis in Oxfordshire, United Kingdom [23].

We performed a comparative analysis to determine whether there were any genomic changes that occurred preferentially and independently in these extraintestinal isolates from different clonal complexes. We found no significant differences in gene content between extraintestinal and gastrointestinal strains, indicating that any genomic differences relating to potential to cause extraintestinal infection would either be unique to individual strains or reflected in different allelic variants of genes shared by these isolates. We next searched the genome for genes which carry an excess of rare mutations in extraintestinal isolates. Using this screening strategy, we have identified in extraintestinal isolates from our study a small number of candidate genes which accumulated mutations that are rarely observed in nature, some of which have been associated with virulence and invasiveness in the past. Our results indicate that there may be a shift in the selective pressures on genes involved in virulence, lipopolysaccharide biosynthesis, stress tolerance, iron acquisition and cell shape when C. jejuni transitions from gastrointestinal to invasive infection.

Methods

In this investigation we compared ten invasive clinical isolates of Campylobacter jejuni with closely related gastrointestinal strains from the Oxfordshire surveillance project in order to identify genetic differences associated with extraintestinal infection. We employed a profile hidden Markov model (HMM)-based approach [24] to weight sequence variation in genes based on its predicted functional significance. We then used a random forest model to identify genes whose burden of rare mutations best separated out extraintestinal and gastrointestinal strains.

Isolate collection

Isolates were collected from hospitals in the Wellington region from 2010 to 2012. The age of patients the isolates were collected from ranged from 19 to 89 years. Six presented with diarrhea, others presented with headache, prosthetic hip infection and exacerbation of chronic pulmonary disease. Nine isolates were taken from blood and one from joint aspirate. Six out of ten patients were subsequently treated with ciprofloxacin, one patient died, and one had had a previous invasive infection four years earlier. Only one person in the study showed sepsis syndrome. As comparators for each isolate, sequence data for two randomly selected isolates from the same clonal complex were retrieved from the Oxfordshire surveillance project collection [23] using BIGSdb [25], based on a recommended sampling strategy for identifying phenotypically relevant sequence variation in structured populations [26]. Two additional New Zealand gastrointestinal isolates were included to supplement a clonal complex not found in the Oxfordshire surveillance project (Table S1 , available in the online version of this article). For electron microscopy, contemporaneous human gastrointestinal isolates of the same sequence type (ST) isolated as part of the Manawatu sentinel surveillance project [27] were used for comparison of morphological features.

DNA extraction and sequencing

Isolates were grown on Columbia horse blood agar plates (Fort Richard Laboratories) in a microaerobic atmosphere (85 % N2, 10 % CO2, 5 % O2) at 42 °C for 24 h in a VA500 variable atmosphere incubator (Don Whitley Scientific). DNA was extracted using a QIAamp DNA Mini kit (Qiagen). The resultant DNA was then sent to the Massey Genome Service (Massey University, Palmerston North, New Zealand) for quality control checking, library preparation and sequencing. The isolates were sequenced on an Illumina MiSeq (250 bp paired end run) to approximately 140×–200× estimated coverage based on the size of the Campylobacter jejuni genome.

Read curation and assembly

Sequence reads were analysed with the following quality controls: read quality analysis and visualization (SolexaQA ++and FastQC) [28, 29], PhiX removal with Bowtie2 [30] and adapter removal through the ‘fastq-mcf’ program from the ea-utils suite of tools [31]. In addition, the reads were analysed with FastQScreen (http://www.bioinformatics.babraham.ac.uk/projects/fastq_screen) as a further check for any potential Illumina adapters and cloning vector contamination. For each sample, sequence reads were used to create multiple assemblies using VelvetOptimiser v2.2.5 [32] and Velvet v1.2 [33]. Assemblies were performed for kmer sizes varying between 60 and 90 % of the read length. An assembly improvement step was applied to the assembly with the best N50, in which contigs were scaffolded using SSPACE [34] and sequence gaps filled using GapFiller [35]. Genome Sequence data for the New Zealand isolates was deposited in the Sequence Read Archive (BioProject PRJNA475221) [Data Citation 1]. Accession numbers for all strains are listed in Table S1.

Core genome generation

Contigs were annotated using Prokka v1.5 [36] with default parameters and a genus-specific database from RefSeq [37]. Protein-coding regions were extracted and translated, and gene clustering was performed using Roary [38]. A core gene alignment produced by Roary was then visualized in SplitsTree [39]. A full list of orthologous genes can be found in the Github repository accompanying this paper [Data Citation 2]. SplitsTree visualization indicated that the representatives that were chosen from ST50 were too distantly related to the New Zealand strains, so all samples from ST50 from the Oxfordshire Surveillance project with publicly available sequencing data were collected and assembled using Velvet. Relatedness of strains was assessed using mash [40]. Four better representatives were chosen for this ST and replaced the previously selected strains and gene clustering was repeated.

Identification of functionally significant variation in orthologous proteins

To screen for differences in the burden of rare mutations on a gene across infection outcomes, genes were scored against HMM profile models for Epsilonproteobacterial protein sequences retrieved from the EggNOG database (v. 4.5.1) [41], using the hmmsearch function from the HMMER3.0 package (http://hmmer.org). In order to resolve many-to-many matches between EggNOG profiles and protein sequences in our dataset the EggNOG profile with the best score summed over all genes in an orthologous group was chosen as the gene model for that group. Bitscore data for all orthologous groups can be found in the file ‘bitscores’ [Data Citation 2]. Start codons for some genes were annotated inconsistently. This difference in length led to differences in bitscore that affected the analysis and led to spurious associations, so we aligned the orthologous gene families and trimmed off any gap columns at the beginning of the alignment before scoring with HMMER, and if the length trimmed exceeded 20 amino acids, the gene was discarded from this part of the analysis. We also excluded any genes that had paralogs identified by Roary, because we noted that some of these families showed arbitrary splitting of gene families into two, which also produced spurious results. Finally, we removed genes that showed little to no variation in bitscore using the nearZeroVar function in the ‘randomForest’ R package [42]. This approach resulted in 1,161 orthologous genes families with EggNOG model hits for analysis. DeltaBS, a weighting of the functional effects of mutations on protein coding genes [24], was calculated by subtracting individual bitscores from the median bitscore for an orthologous group.

Identification of informative genes using a random forest

A random forest algorithm [43] was used to build classification trees from bitscore data for orthologous groups, using infection outcome (extraintestinal or gastrointestinal) as classes for prediction. Random forests can capture interactions between genes, have been found to be a particularly effective way of dealing with sparse data [44], and their utility in linking genotype to phenotype is being explored for these reasons [45–47]. They consist of an ensemble of recursive partitioning trees, each built using a random sub-sampling of samples and variables (genes) [43, 48]. Although random forests are primarily designed for the classification of new samples, they produce variable importance measures which indicate how informative each variable has been in separating cases according to category. These variable importance measures can be used to assess which genes may be important in identifying extraintestinal C. jejuni isolates.

Because the dataset was likely to be sparse, meaning that most of the genes in the training dataset offer little to no predictive value in determining the phenotype of interest, we selected a subset of genes before training the random forest model. To further enrich for potentially informative genes, we selected only genes that showed differences in median bitscore between the two groups that are in the highest 80 % [|median(BSGI)-median(BSEI)| ≥0.15], and separation of bitscore distributions, as measured by a Kolmogorov–Smirnov (KS) test that fell in the highest 20 % of all KS statistics computed for orthologous groups (KS ≥0.1). This reduced the set of genes used to train the random forest from 1161 to 103.

The R package ‘randomForest’ was used to build a forest of decision trees in an approach based on that of Wheeler et al. [47]. A random forest was built using 10  000 trees, sampling five genes for each node (mtry=5). The same model building process was repeated using a range of values of mtry and model accuracy (as measured by F1 score), did not change. Each tree was reconstructed using a bootstrap sampling of the strains, taking seven extraintestinal and seven gastrointestinal strains for each tree. The gene whose bitscore values best separated the two classes was selected as the first node in the decision tree, and assigned an optimal scoring cutoff for partitioning samples into daughter nodes. For each of the two groups resulting from the split, if another gene could further separate the two classes, this was included in the tree as well. For each node in the tree, a new sample of five genes was taken. Trees are reconstructed until no more information from the sampling of genes can improve discrimination between the two classes. Key indicator genes were selected based on their feature importance scores, which were calculated based on the decrease in accuracy of predictions caused by permuting the variable of interest in the test data [43]. To assess the significance of the variable importance measures, the random forest model was re-trained 100 times with the classes permuted each time, in an approach similar to that taken by Huynh-Thu et al. [49]. Permutations that resulted in the same phenotype allocation as the true dataset were re-run. Empirical P-values were computed by identifying the percentage of feature importance values generated using the permuted data that were higher than each true feature importance value. Genes of interest were identified by running the model building process with the true classes 100 times and identifying the genes that had a variable importance with an empirical P-value <0.1 in 75 or more models.

Electron microscopy

Electron microscopy was performed by the Manawatu Microscopy and Imaging Centre (Massey University, Palmerston North, New Zealand). Bacteria were pelleted and fixed in Modified Karnovsky’s fixative in 0.1M phosphate buffer (pH 7.2) at 4 °C overnight. Following buffer washing, samples were passed through a syringe filter holder containing a nylon membrane to collect the cells (0.4 um pore size; Isopore, Merck Millipore). Bacteria-exposed filters were dehydrated with a graded ethanol series before critical point drying. Samples were mounted and gold sputter coated according to standard procedures. Images were captured on a Quanta 200 scanning electron microscope (FEI) using an accelerating voltage of 20 kV.

Results

In this investigation we examined whether there were any significant genomic differences between C. jejuni isolated from extraintestinal and gastrointestinal environments. We found no significant difference in the overall gene content of the isolates tested, however, we identified differences in the burden of rare mutations in key genes that have been implicated in Campylobacter virulence. We found that the patterns of sequence variation we identified in our study are unlikely to have strong predictive value in identifying other strains of invasive C. jejuni, however, they are indicative of a shift in selection pressures that occurs during extraintestinal infection.

No significant difference in gene content between extraintestinal and gastrointestinal isolates

A NeighborNet diagram of relatedness of the strains used in our study indicated that the randomly selected ST50 strains we included in our analysis were distantly related to the New Zealand invasive isolates, and further analysis of the ST50 clade indicated that our strains were members of a rare subgroup (Fig. S1). After inclusion of gastrointestinal representatives from this smaller clade we were able to confirm that each invasive strain was closely related to the selected reference strains, and invasive strains from the same clonal complex were also closely related to each other, but that the clonal complexes (CCs) and STs included in our study were genetically distinct from each other (Fig. 1). The overall genome size of the extraintestinal isolates was similar to that of the gastrointestinal isolates examined (1.655 Mb±13 kb vs 1.659 Mb±28 kb), as was the number of predicted genes (1729±22 vs 1745±42). GC content was also indistinguishable (30.34±0.05 % vs 30.36±0.06 %). We found no clear evidence of gene presence/absence patterns associated with invasive infection (Table S2).

Fig. 1.

Fig. 1.

NeighborNet diagram of the genetic similarity of strains included in the study. The scale bar indicates 0.01 substitutions per site in a core gene alignment. Invasive strains are highlighted in red type. Each invasive strain was paired with two gastrointestinal strains, shown in black type, from the Oxfordshire surveillance project from the same clonal complex. Clonal complexes are highlighted in bold blue type.

A subset of genes show an increased burden of rare mutations in invasive strains

Given that there were no differences in overall genome content between gastrointestinal and extraintestinal strains, we scanned the genomes for differences in mutational burden across genes using the DeltaBS approach [24] and a random forest classifier [47]. The DeltaBS approach uses profile hidden Markov models [50] to take a diverse collection of sequences of the same gene from different organisms and capture patterns of sequence variation that occur commonly in nature (in this case, within the Epsilonproteobacteria). These models are then used to score the protein coding genes of these strains to produce bitscores, which indicate how well each protein conforms to modelled sequence constraints on that gene. The distribution of these bitscores can then be used by the random forest to identify genes which show a difference in adherence to sequence constraints between strains from the two infection niches. A difference in bitscore distribution is likely to be caused by a change in selective pressures on a gene in one niche, leading to increased accumulation of mutations not commonly observed in nature (called rare or uncommon mutations in this manuscript, for brevity). This accumulation of rare mutations is likely to result in the degradation of the gene, however, these mutations could also result in a change to the function or structural stability of the protein. For a given protein family, the difference in bitscore for one strain and the median bitscore for the strain collection (DeltaBS) can be used as an indication of the likelihood that this accumulation of mutations is deleterious [24].

To assess the value of the genes we identified, we compared the performance of the random forest classifier trained to discriminate extraintestinal and gastrointestinal strains with those of classifiers built using permuted classes (randomized assignments of extraintestinal and gastrointestinal labels in the same ratios as the true classes). Out-of-bag (OOB) estimates of error can be used to assess the accuracy of a random forest predictor by testing the prediction accuracy of each tree in the forest on previously unseen data. The OOB estimate of error for the true classes was 26.7 . Of the models built using permuted labels 18 % had error rates this low (Fig. 2a). This 26.7 % overall error rate included 70 % error in identifying extraintestinal strains (Table S3). These results indicate that the patterns associated with invasiveness identified in our study do not generalize well across the different isolates, owing to weak convergence in the patterns of mutational burden observed. While the model did not have high predictive accuracy, examining the most informative genes identified by the model can give us insight into the genomic features that best separate the two classes. A set of 21 genes were identified that met the selection criterion (Fig. 2b and Table 1). Sequences were manually inspected, and one gene (pseA) was excluded as a false positive due to incorrect ortholog calling by Roary, and two others (purE and group_904) were excluded for only showing unusual patterns of sequence variation in ST51. Re-building the model with a training dataset that excluded these genes did not change the variable importances of the remaining candidates.

Fig. 2.

Fig. 2.

A random forest model identifies a subset of genes that show increased sequence divergence in invasive isolates (a): Performance of the random forest model compared with that of 100 models built using permuted labels. Accuracy of the classifier built using true classes is indicated with a red line, and accuracies for the permuted models are shown as a histogram. (b): Feature importance for each gene in a representative random forest model. Grey lines correspond to empirical P-values of 0.05 (dashed line) and 0.1 (dotted line), calculated using feature importance values derived from permuted data.

Table 1. Genes with the greatest value in discriminating between invasive and gastrointestinal Campylobacter.

Reference locus ID refers to the locus ID for the homologous sequence in Campylobacter jejuni ATCC 700819. VI: variable importance, as measured by mean decrease in classification accuracy upon permuting a variable. % top: percentage of models that listed this gene among its top predictors.

Gene Ref. locus ID Description Mean VI VI P-value % top
mreB Cj0276 Rod shape-determining protein MreB 0.007 0.011 100
msbA Cj0803 Lipid export ABC transport protein 0.003 0.026 100
nuoL Cj1568c NADH dehydrogenase subunit L 0.003 0.027 100
htpX Cj0723c Peptidase, M48 family protein 0.003 0.036 100
group_728 Cj1236 COG1565: Uncharacterized conserved protein 0.002 0.038 100
nrfA Cj0584 Cytochrome c552 precursor 0.002 0.040 100
group_922 Cj1357c DNA polymerase III subunit delta' 0.002 0.040 100
gltD Cj1345c Glutamate synthase [NADPH] small chain 0.002 0.046 100
pgp1 Cj1706c Purine nucleoside phosphorylase 0.002 0.048 100
rplD Cj0612c LSU ribosomal protein L4p (L1e) 0.002 0.050 100
cft Cj1476c Non-heme iron-containing ferritin 0.002 0.054 100
porA_3 Cj1572c Pyruvate-flavodoxin oxidoreductase 0.002 0.057 100
nuoH Cj1131c NADH-ubiquinone oxidoreductase chain H 0.002 0.062 99
group_3265 Cj0041 Xaa-Pro aminopeptidase 0.002 0.066 97
galE Cj0653c UDP-glucose 4-epimerase 0.002 0.067 99
fliK Cj0002 Flagellar hook-length control protein FliK 0.002 0.067 95
rplL Cj0477 LSU ribosomal protein L7/L12 (P1/P2) 0.002 0.069 96
gltX Cj1288c Glutamyl-tRNA synthetase 0.001 0.069 94
dnaN Cj0002 DNA polymerase III subunit beta 0.001 0.073 94
selU Cj0500 tRNA 2-selenouridine synthase 0.001 0.082 86
group_537 Cj0530 Periplasmic protein 0.001 0.081 84

There were five major, interrelated themes that emerged in the functions of the genes identified. We identified a number of genes which have previously been linked with virulence, LPS biosynthesis, cell shape, stress response and iron acquisition. htpX is not well documented in Campylobacter jejuni, but is a known virulence gene whose inactivation affects adhesion, cell morphology and surface antigens in Streptococcus gordonii [51]. The gene is upregulated in C. jejuni in a piglet stomach model [52]. Three unique mutations were observed in H1664 and several mutations were shared by H1606, H1607 and one of the Oxfordshire strains from the same sequence type.

Both msbA and galE are involved in lipopolysaccharide biosynthesis. msbA carried the same mutation in both invasive members of ST50 and galE carried the same mutation in both invasive members of ST51, indicating that these are likely to be lineage effects, and therefore don’t show evidence of convergent evolution. msbA functions as a flippase of the lipid-A core oligosaccharide, transporting it from the inner to the outer cell membrane [53]. Mutation of the gene has led to an elongated rod morphology in Eschericiha coli [54], and a mutant in Neisseria gonorrhoeae showed a growth defect with less LPS and more phospholipids in the membrane [55]. The galE gene provides Gal and GalNAc for the three major cell-surface carbohydrates: lipopolysaccharides, capsule and glycoprotein N-linked heptasaccharide [56]. galE also has a role in virulence [57], and a C. jejuni mutant of the gene has been found to be impaired in its ability to adhere to and invade intestinal cells, but still colonized at wild-type levels [57]. The enzyme is thought to be required for synthesis of the polysaccharide capsule of C. jejuni, which modulates the host immune response [56, 58].

Members of the genus Campylobacter prefer a microaerobic environment [59], and are relatively intolerant of environmental perturbations compared with other pathogens [60]. Among the top genes affected by rare mutations were stress-response genes that coded for iron-containing proteins. nrfA is a cytochrome c552 precursor, with a role in nitrosative stress survival [61]. Human patients with C. jejuni-induced diarrhea produce elevated levels of nitric oxide, so the ability to tolerate nitrosative stress is thought to be a key feature of pathogenesis for this microbe [62]. The gene is up-regulated in C. jejuni in the pig stomach [52], and a nrfA mutant of C. jejuni has been shown to be defective for colonization in a chick model [63]. We observed unique mutations in isolates H1666, H1668 and H1670, all clustered in the same small area of the gene, a conserved haem binding region [64] (Fig. 3). Three out of ten invasive isolates carrying mutations not seen in any other strain in the study vs none out of 20 gastrointestinal isolates is significant according to a Fisher's exact test (P=0.03). However, the significance of these results does not come purely from the number of mutations, but also from their predicted functional effects – if most mutations don't have a dramatic impact on function, the bitscore distribution for each gene sequence is tightly clustered around the median value. Outliers from this bitscore distribution indicate a mutation that the profile HMM for that gene family has penalised heavily, because an amino acid with those chemical and structural features is rarely observed in that position in nature, usually because it is selected against. Two of these strains have incurred two nonsynonymous mutations in the same site, for a total of five mutations within nine amino acids in a conserved region of the protein, which is highly unusual. The DeltaBS values for these three mutations are 6.2, 9.3 and 11.3, which are extremely high values for proteins that haven't accumulated any insertions or deletions (see [24], Fig. 1C, for an indication of typical DeltaBS values for inactivated proteins). The DeltaBS values for the rest of the strains range between −0.3 and 2.5, which are consistent with what we expect for functional protein sequence variants. gltD is a glutamate synthase gene which contains an iron–sulfur complex. Knockout of this gene causes reduced growth under high osmolar conditions [65]. gltD carried unique mutations in isolates H1606, H1608 and H1666, as well as some gastrointestinal strains, however the unique mutations in the invasive strains were predicted to have a greater functional effect. Another protein containing an iron–sulfur complex in our analysis was porA_3, a pyruvate–flavodoxin reductase. This gene is involved in oxidative stress tolerance and redox [66]. The gene had unique mutations in isolates H1604, H1605, H1608 and H1664. cft is a non-heme iron-containing ferritin. The gene provides protection against oxidative stress, and its mutation increases susceptibility to killing by peroxide [67]. The gene is generally well-conserved across our strains, except for unique mutations in isolates H1608, H1664 and H1670. mreB and pgp1 mutations have previously been linked with changes in cell morphology. MreB forms helical tracts down the length of the cell and positions peptidoglycan synthesis machinery [68]. Reduced expression of mreB results in a spherical phenotype in C. jejuni [69], and alteration of the protein results in morphological variation in other species [70–72]. The protein has a role in adherence to host cells in E. coli [73], and is upregulated in C. jejuni in response to mucins secreted by the human gut [74]. Variation in the gene has also been associated with both adaptation to new conditions [75, 76], and localization of virulence proteins in the cell [77–79]. The mreB gene has acquired unique mutations in four invasive strains, H1662, H1666, H1668 and H1670. Phase variation in pgp1 has previously been shown to generate a rod-shaped morphology in C. jejuni [80, 81], and knockout of the gene results in a loss of pathogenicity and loss of ability to effectively burrow through intestinal mucus in a mouse model [82]. Phase variation in pgp1 was observed in one gastrointestinal ST50 isolate, but pgp1 also carried unique nonsynonymous substitutions in H1604 and H1664 and a truncation in H1664. Similar to pgp1, pgp2 is also involved in mediating cell morphology and deletion has been shown to cause a switch to rod-shaped morphology and defects in motility and biofilm formation, as well as impairment in a chick colonization model [83]. Because we identified mutations in both mreB and pgp1, we examined the sequences for this gene as well. One invasive strain carried a nonsynonymous mutation deemed by deltaBS to have a potentially functional effect. This strain did not carry unusual sequence variants in mreB or pgp1. Overall, eight out of ten invasive isolates showed rare mutations in either mreB, pgp1 or pgp2, compared with four out of 20 gastrointestinal isolates (see Figs, S2–S4 for sequence alignments).

Fig. 3.

Fig. 3.

Partial sequence alignment highlighting unique mutations in nrfA in invasive strains, compared with lineage-specific mutations observed in other regions. The sequence alignment was visualised using Jalview (http://www.jalview.org) and coloured by BLOSUM62 score. The maximum-likelihood tree was reconstructed using RAxML (v.8.2.8) using the GTR-GAMMA model and 100 bootstraps. The PDB structure 1FS7 was visualised using Jmol (www.jmol.org). The highlighted residues are the only unique mutations in sequence alignment and occur in a conserved haem binding site. Haem molecules are highlighted in red.

In addition to fitting themes consistent with plausible adaptive mechanisms for invasiveness in Campylobacter, some of our genes have been identified as relevant to pathogenesis by other studies. Garénaux et al. [66] assayed the response of NCTC 11168 to oxidative stress caused by paraquat, and found that porA_3, cft and mreB were overexpressed under these conditions. Hyytiäinen et al. [84] also showed that porA_3 and nuoM are both downregulated in response to ciprofloxacin. In a study of the abortifacient sheep clone IA 3902 [85] flgK and nuo genes were expressed at a higher level than in NCTC 11168 and iron acquisition genes were expressed at a lower level. Further study also identified upregulation of nrfA in this strain [86]. A study of the genes involved in response to acidic pH and stomach transit [52] identified changes in the expression of htpX, nrfA and cft as important for stomach transit and mreC, which is also involved in cell shape, as influenced by acid shock.

Changes in cell shape in MreB and Pgp1 mutants

To test whether the unusual sequence variants of mreB, pgp1 and pgp2 resulted in morphological changes, we performed scanning electron microscopy on all invasive isolates, as well as other gastrointestinal isolates from the same sequence type, collected at a similar time in New Zealand (see Methods). We found morphological changes in four out of ten invasive isolates in our collection. Strain H1664 had a rod-shaped morphology, which is likely to be attributable to a truncation of pgp1, while strains H1607, H1662 and H1668 had an elongated helical phenotype. These cell shape changes were not observed in the extraintestinal isolates in our study that carried typical sequence variants of mreB, pgp1 and pgp2 (Fig. 4), or in the gastrointestinal isolates.

Fig. 4.

Fig. 4.

Sequence changes in cell shape genes relate to changes in cell morphology. Bitscore values are plotted for extraintestinal and gastrointestinal isolates for two cell-shape-determining proteins identified as associated with invasiveness which appeared in the top scoring genes. High bitscores correspond to strong agreement between individual protein sequences and modelled sequence constraints for that gene. Large differences in bitscore within an ST indicate the accumulation of a mutation/mutations that are not frequently observed in that gene within the Epsilonproteobacteria. The position of the points is offset slightly (up to 0.3 bits) to allow discrimination of strains with identical scores for each gene. Electron micrographs for representative invasive isolates from this study are shown to illustrate the cell morphologies described. Size scale is the same for each image. Note: one ST50 sample has been excluded from the plot because its score for pgp1 was much lower and distorted the figure.

Discussion

Campylobacter is the most common cause of enteric infection in the world, but only rarely causes extraintestinal infection. It is still unclear whether there are any specific genetic factors that make strains more likely to cause systemic infection. Genetic drivers of invasiveness have been identified in Neisseria meningitidis [87], Streptococcus pneumoniae [88] and Group A streptococci [89]. Other comparative analyses have successfully identified genetic differences leading to differences in survival through the food processing chain [90] and differences in host tropism [91] in Campylobacter. This study aimed to address whether there was a common genetic signature associated with adaptation to an invasive lifestyle in a sampling of Campylobacter jejuni isolated from extraintestinal infections in New Zealand.

If there is a single evolutionary path to extraintestinal infection in Campylobacter involving strong selection on key genes, we would expect this to be reflected in convergence on the same gene loss/gain or mutational events, however, if multiple independent routes to achieving extraintestinal infection exist, or if selective pressures on specific genes are weak, this signal is likely to be difficult to identify in such a small sampling of isolates. Other confounding factors, such as host susceptibility and microbiota, are also likely to play a role in infection outcomes, further complicating the link between genotype and infection outcome. The random forest built in our study achieved better accuracy in predicting infection outcome with real rather than permuted labels, but failed to identify consistent signatures of sequence variation present in all extraintestinal isolates in the study, indicating that the genomic signatures identified in our study do not occur consistently enough to be used to confidently identify other invasive isolates of C. jejuni.

Some of the genes that have been identified as top predictors of invasive infection have been linked with infection mechanisms in the existing literature. We discovered that mutations in two genes that control cell shape and adhesion to host cells when considered collectively were effective at separating six invasive strains from the two major sequence spaces occupied by the remaining strain collection, and that three of these six showed an observable loss of their tight helical morphology. These morphological changes could be adaptation to a new infection mechanism, or a response to increased stress caused by the shift of a gastrointestinal pathogen to a new environment. Cell shape has been tied to pathogenicity in the past, with the helical shape of cells of members of the genus Campylobacter specifically implicated in colonization ability [79, 80, 92, 93]. The helical shape of members of the genus Campylobacter is thought to aid in burrowing through the mucus layer of the gut [82, 94], however, rod-shaped mutants are frequently observed during passage in the laboratory [95] or in a chicken host [96], partly owing to phase variation causing phenotypic plasticity within a strain, which is thought to improve the long-term survival potential of the population [80]. While cell shape switching has been observed in the past due to phase variation in pgp1 and pgp2 [81], the mutations identified in our study are amino acid substitutions in the case of the elongated helical variants and a premature stop codon in the case of the rod-shaped variant, indicating that they would be less prone to reversal. An in vivo selection experiment performed by Field et al. [97] yielded rod-shaped strains with increased ability to survive in an in vivo chick embryo model, indicating that this phenotype may emerge and be adaptive in the bloodstream. Stahl et al. [82] showed that later stages of infection were not impaired in a rod-shaped pgp1 knockout, but initial infection could not be achieved, indicating that this phenotype isn’t likely to be selected for by serial invasive infections. In this study, variants that had a curved-rod morphology could still burrow through mucus and achieve infection [82], indicating that our elongated helical variants could still be effective colonizers. pgp1 mutants show no defect in adhesion to and invasion of human epithelial and macrophage cells in vitro, and in fact survive slightly longer than wild type strains [80, 93]. In addition, changes in cell shape and an alteration in peptidoglycan structure resulting from pgp1 and pgp2 mutations also altered levels of activation of human host cell receptors in response to infection [80, 93], indicating a possible advantage of the change in cell shape. An isolate showing a similar elongated helical morphology to that observed in our study, arising from mutation in a different gene (Cj1564), showed decreased ability to migrate through agar and decreased ability to adhere to and invade Caco-2 cells, however, it showed no impairment in chick colonization [83]. These findings collectively indicate that cell shape variation could be adaptive in the systemic niche, though ability to colonize a new host could potentially be compromised.

While some of the key indicator genes identified in this study are promising, we are cautious about interpreting the results of our study, as we have a small sampling of invasive isolates from which to identify trends. While the random forest model performed well in permutation testing, the likelihood that the model would be predictive in a separate set of invasive isolates is low, due to the small number of informative genes identified and the limited classification accuracy for the invasive class. A limitation of our study is that we have compared invasive isolates of Campylobacter from New Zealand with gastrointestinal isolates from the United Kingdom, which could introduce some confounding in patterns of sequence variation due to country of isolation. The Oxford Surveillance project provided greater coverage of isolates that were closely related to our invasive cases, while isolates and whole-genome sequences for representatives from human cases of gastroenteritis in New Zealand were only available for a small number of matching sequence types. The isolates from Oxford were therefore chosen to maximize our power to detect any associations.

Concluding statements

Our findings add to a growing body of literature linking cell shape and virulence in members of the genus Campylobacter [80, 82, 83, 93]. To the best of our knowledge, this is the first reported finding of this cell shape change occurring in a human host, however similar changes have been noted in members of the genus Campylobacter under stress conditions [98] and during passage through a chicken host [96]. The helical cell shape of the members of the genus Campylobacter is thought to be important for the passage of cells through the mucus layer of the gut [82], so these findings raise the question of whether this ability is not required for systemic infection, or whether this cell shape change is a response to the stress of a new environment and would prevent transmission to a new host. Given support in the literature for an association between invasiveness and lipopolysaccharide biosynthesis, stress response and cell shape, the candidates identified in this study are worth investigating further for their role in invasive infection. The workflow used to perform this analysis is straightforward and requires little manual intervention to perform, therefore similar investigations could be repeated with other collections of invasive pathogens, allowing for cross-validation of results. The code for performing this analysis is freely available at https://github.com/Gardner-BinfLab/invasive_campylobacter.

Data bibliography

  1. Blackmore T., Reynolds A. D., Midwinter, A. C., Marshall, J., French, N. P., Biggs, P. J. Sequence Read Archive BioProject PRJNA475221 (2018).

  2. Wheeler, N. E., Marshall, J. GitHub https://github.com/Gardner-BinfLab/invasive_campylobacter (2018).

Funding information

This work was supported by funding from mEpiLab, Massey University. N. E. W. was supported by a PhD scholarship from the University of Canterbury. N. E. W. and P. P. G. were supported by a Rutherford Discovery Fellowship administered by the Royal Society of New Zealand. N. P. F. was supported by the New Zealand Food Safety Science and Research Centre.

Acknowledgements

We would like to thank Julian Parkhill for his helpful comments on the manuscript and Simon Harris and the Pathogen Informatics team at the Wellcome Sanger Institute for developing the assembly and annotation pipelines used in this work.

Conflicts of interest

The author(s) declare that there are no conflicts of interest.

Ethical statement

Ethical approval from the Multi-region Ethics Committee of the Ministry of Health, New Zealand is in place for our work on Campylobacter (application number MEC/10/16/EXP).

Footnotes

Abbreviations: HMM, hidden Markov model; ST, sequence type; KS, Kolmogorov–Smirnov; OOB, out-of-bag.

Accession numbers for all strains used in this study are available in Supplementary Table 1. Newly sequenced strains have the accession numbers: SRR7280458, SRR7280459, SRR7280460, SRR7280461, SRR7280454, SRR7280455, SRR7280456, SRR7280457, SRR7280462 and SRR7280463.

All supporting data, code and protocols have been provided within the article or through supplementary data files. Four supplementary figures and three supplementary tables are available with the online version of this article.

Supplementary Data

Supplementary data
Supplementary data

References

  • 1.Kirk MD, Pires SM, Black RE, Caipo M, Crump JA, et al. World Health Organization estimates of the global and regional disease burden of 22 foodborne bacterial, protozoal, and viral diseases, 2010: a data synthesis. PLoS Med. 2015;12:e1001921. doi: 10.1371/journal.pmed.1001921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Carvalho AC, Ruiz-Palacios GM, Ramos-Cervantes P, Cervantes LE, Jiang X, et al. Molecular characterization of invasive and noninvasive Campylobacter jejuni and Campylobacter coli isolates. J Clin Microbiol. 2001;39:1353–1359. doi: 10.1128/JCM.39.4.1353-1359.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Lee G, Pan W, Peñataro Yori P, Paredes Olortegui M, Tilley D, et al. Symptomatic and asymptomatic Campylobacter infections associated with reduced growth in Peruvian children. PLoS Negl Trop Dis. 2013;7:e2036. doi: 10.1371/journal.pntd.0002036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Calva JJ, Ruiz-Palacios GM, Lopez-Vidal AB, Ramos A, Bojalil R. Cohort study of intestinal infection with campylobacter in Mexican children. Lancet. 1988;1:503–506. doi: 10.1016/S0140-6736(88)91297-4. [DOI] [PubMed] [Google Scholar]
  • 5.Goossens H, Henocque G, Kremp L, Rocque J, Boury R, et al. Nosocomial outbreak of Campylobacter jejuni meningitis in newborn infants. Lancet. 1986;2:146–149. doi: 10.1016/S0140-6736(86)91956-2. [DOI] [PubMed] [Google Scholar]
  • 6.Skirrow MB, Jones DM, Sutcliffe E, Benjamin J. Campylobacter bacteraemia in England and Wales, 1981-91. Epidemiol Infect. 1993;110:567–573. doi: 10.1017/S0950268800050986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Nielsen H, Hansen KK, Gradel KO, Kristensen B, Ejlertsen T, et al. Bacteraemia as a result of Campylobacter species: a population-based study of epidemiology and clinical risk factors. Clin Microbiol Infect. 2010;16:57–61. doi: 10.1111/j.1469-0691.2009.02900.x. [DOI] [PubMed] [Google Scholar]
  • 8.Ruiz-Palacios GM, Torres J, Torres NI, Escamilla E, Ruiz-Palacios BR, et al. Cholera-like enterotoxin produced by Campylobacter jejuni. Characterisation and clinical significance. Lancet. 1983;2:250–253. doi: 10.1016/s0140-6736(83)90234-9. [DOI] [PubMed] [Google Scholar]
  • 9.Fauchere JL, Rosenau A, Veron M, Moyen EN, Richard S, et al. Association with HeLa cells of Campylobacter jejuni and Campylobacter coli isolated from human feces. Infect Immun. 1986;54:283–287. doi: 10.1128/iai.54.2.283-287.1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Louwen R, van Baarlen P, van Vliet AH, van Belkum A, Hays JP, et al. Campylobacter bacteremia: a rare and under-reported event? Eur J Microbiol Immunol. 2012;2:76–87. doi: 10.1556/EuJMI.2.2012.1.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wassenaar TM, Bleumink-Pluym NM, van der Zeijst BA. Inactivation of Campylobacter jejuni flagellin genes by homologous recombination demonstrates that flaA but not flaB is required for invasion. Embo J. 1991;10:2055–2061. doi: 10.1002/j.1460-2075.1991.tb07736.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Konkel ME, Garvis SG, Tipton SL, Anderson DE, Cieplak W. Identification and molecular cloning of a gene encoding a fibronectin-binding protein (CadF) from Campylobacter jejuni. Mol Microbiol. 1997;24:953–963. doi: 10.1046/j.1365-2958.1997.4031771.x. [DOI] [PubMed] [Google Scholar]
  • 13.Konkel ME, Kim BJ, Rivera-Amill V, Garvis SG. Bacterial secreted proteins are required for the internaliztion of Campylobacter jejuni into cultured mammalian cells. Mol Microbiol. 1999;32:691–701. doi: 10.1046/j.1365-2958.1999.01376.x. [DOI] [PubMed] [Google Scholar]
  • 14.Bacon DJ, Alm RA, Burr DH, Hu L, Kopecko DJ, et al. Involvement of a plasmid in virulence of Campylobacter jejuni 81-176. Infect Immun. 2000;68:4384–4390. doi: 10.1128/IAI.68.8.4384-4390.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Müller J, Schulze F, Müller W, Hänel I. PCR detection of virulence-associated genes in Campylobacter jejuni strains with differential ability to invade Caco-2 cells and to colonize the chick gut. Vet Microbiol. 2006;113:123–129. doi: 10.1016/j.vetmic.2005.10.029. [DOI] [PubMed] [Google Scholar]
  • 16.Fearnley C, Manning G, Bagnall M, Javed MA, Wassenaar TM, et al. Identification of hyperinvasive Campylobacter jejuni strains isolated from poultry and human clinical sources. J Med Microbiol. 2008;57:570–580. doi: 10.1099/jmm.0.47803-0. [DOI] [PubMed] [Google Scholar]
  • 17.Nielsen H, Persson S, Olsen KE, Ejlertsen T, Kristensen B, et al. Bacteraemia with Campylobacter jejuni: no association with the virulence genes iam, cdtB, capA or virB. Eur J Clin Microbiol Infect Dis. 2010;29:357–358. doi: 10.1007/s10096-009-0863-9. [DOI] [PubMed] [Google Scholar]
  • 18.Kaakoush NO, Castaño-Rodríguez N, Mitchell HM, Man SM. Global Epidemiology of Campylobacter Infection. Clin Microbiol Rev. 2015;28:687–720. doi: 10.1128/CMR.00006-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sears A, Baker MG, Wilson N, Marshall J, Muellner P, et al. Marked campylobacteriosis decline after interventions aimed at poultry, New Zealand. Emerg Infect Dis. 2011;17:1007–1015. doi: 10.3201/eid/1706.101272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Institute of Environmental Science and Research Ltd New Zealand Public Health Surveillance Report. 2018 https://surv.esr.cri.nz/PDF_surveillance/NZPHSR/2018/NZPHSRMarch2018.pdf
  • 21.Public Health England Zoonoses Overview Report, UK. 2016 https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/664448/UK_Zoonoses_report_2016.pdf December 2017.
  • 22.Nichols GL, Richardson JF, Sheppard SK, Lane C, Sarran C. Campylobacter epidemiology: a descriptive study reviewing 1 million cases in England and Wales between 1989 and 2011. BMJ Open. 2012;2:e001179. doi: 10.1136/bmjopen-2012-001179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Cody AJ, Mccarthy NM, Wimalarathna HL, Colles FM, Clark L, et al. A longitudinal 6-year study of the molecular epidemiology of clinical Campylobacter isolates in Oxfordshire, United kingdom. J Clin Microbiol. 2012;50:3193–3201. doi: 10.1128/JCM.01086-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wheeler NE, Barquist L, Kingsley RA, Gardner PP. A profile-based method for identifying functional divergence of orthologous genes in bacterial genomes. Bioinformatics. 2016;32:btw518–3574. doi: 10.1093/bioinformatics/btw518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Jolley KA, Maiden MC. BIGSdb: Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics. 2010;11:595. doi: 10.1186/1471-2105-11-595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Farhat MR, Shapiro BJ, Sheppard SK, Colijn C, Murray M. A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens. Genome Med. 2014;6:101. doi: 10.1186/s13073-014-0101-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bolwell CF, Gilpin BJ, Campbell D, French NP. Evaluation of the representativeness of a sentinel surveillance site for campylobacteriosis. Epidemiol Infect. 2015;143:1990–2002. doi: 10.1017/S0950268814003173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Cox MP, Peterson DA, Biggs PJ. SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics. 2010;11:485. doi: 10.1186/1471-2105-11-485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Andrews S. FastQC: a quality control tool for high throughput sequence data. 2010 http://www.bioinformatics.babraham.ac.uk/projects/fastqc
  • 30.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Aronesty E. ea-utils: Command-line tools for processing biological sequencing data. Expression Analysis, Durham, NC [Google Scholar]
  • 32.Zerbino DR. Using the Velvet de novo assembler for short-read sequencing technologies. Curr Protoc Bioinformatics. 2010;Chapter 11:11–15. doi: 10.1002/0471250953.bi1105s31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011;27:578–579. doi: 10.1093/bioinformatics/btq683. [DOI] [PubMed] [Google Scholar]
  • 35.Boetzer M, Pirovano W. Toward almost closed genomes with GapFiller. Genome Biol. 2012;13:R56. doi: 10.1186/gb-2012-13-6-r56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  • 37.Pruitt KD, Tatusova T, Brown GR, Maglott DR. NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy. Nucleic Acids Res. 2012;40:D130–D135. doi: 10.1093/nar/gkr1079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31:3691–3693. doi: 10.1093/bioinformatics/btv421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006;23:254–267. doi: 10.1093/molbev/msj030. [DOI] [PubMed] [Google Scholar]
  • 40.Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 2016;17:132. doi: 10.1186/s13059-016-0997-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Huerta-Cepas J, Szklarczyk D, Forslund K, Cook H, Heller D, et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 2016;44:D286–D293. doi: 10.1093/nar/gkv1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Liaw A, Wiener M. Classification and regression by randomForest. R news. 2002;2:18–22. [Google Scholar]
  • 43.Breiman L. Random forests. Mach Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
  • 44.Svetnik V, Liaw A, Tong C, Culberson JC, Sheridan RP, et al. Random forest: a classification and regression tool for compound classification and QSAR modeling. J Chem Inf Comput Sci. 2003;43:1947–1958. doi: 10.1021/ci034160g. [DOI] [PubMed] [Google Scholar]
  • 45.Goldstein BA, Hubbard AE, Cutler A, Barcellos LF. An application of random forests to a genome-wide association dataset: methodological considerations & new findings. BMC Genet. 2010;11:49. doi: 10.1186/1471-2156-11-49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Touw WG, Bayjanov JR, Overmars L, Backus L, Boekhorst J, et al. Data mining in the life sciences with random forest: a walk in the park or lost in the jungle? Brief Bioinform. 2013;14:315–326. doi: 10.1093/bib/bbs034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wheeler NE, Gardner PP, Barquist L. Machine learning identifies signatures of host adaptation in the bacterial pathogen Salmonella enterica. PLoS Genet. 2018;14:e1007333. doi: 10.1371/journal.pgen.1007333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Breiman L, Friedman J, Stone CJ, Olshen RA. Classification and Regression Trees. Chapman and Hall/CRC; 1984. [Google Scholar]
  • 49.Huynh-Thu VA, Wehenkel L, Geurts P. Exploiting tree-based variable importances to selectively identify relevant variables. FSDM. 2008:60–73. [Google Scholar]
  • 50.Eddy SR. Profile hidden Markov models. Bioinformatics. 1998;14:755–763. doi: 10.1093/bioinformatics/14.9.755. [DOI] [PubMed] [Google Scholar]
  • 51.Vickerman MM, Mather NM, Minick PE, Edwards CA. Initial characterization of the Streptococcus gordonii htpX gene. Oral Microbiol Immunol. 2002;17:22–31. doi: 10.1046/j.0902-0055.2001.00000.x. [DOI] [PubMed] [Google Scholar]
  • 52.Reid AN, Pandey R, Palyada K, Naikare H, Stintzi A. Identification of Campylobacter jejuni genes involved in the response to acidic pH and stomach transit. Appl Environ Microbiol. 2008;74:1583–1597. doi: 10.1128/AEM.01507-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Doerrler WT, Gibbons HS, Raetz CR. MsbA-dependent translocation of lipids across the inner membrane of Escherichia coli. J Biol Chem. 2004;279:45102–45109. doi: 10.1074/jbc.M408106200. [DOI] [PubMed] [Google Scholar]
  • 54.Morris FC. The role of outer membrane homeostasis in the virulence of Gram-negative bacteria. D_ph; University of Birmingham. 2014 http://etheses.bham.ac.uk/4966/ Accessed 14 August 2018.
  • 55.Tefsen B, Bos MP, Beckers F, Tommassen J, de Cock H. MsbA is not required for phospholipid transport in Neisseria meningitidis. J Biol Chem. 2005;280:35961–35966. doi: 10.1074/jbc.M509026200. [DOI] [PubMed] [Google Scholar]
  • 56.Bernatchez S, Szymanski CM, Ishiyama N, Li J, Jarrell HC, et al. A single bifunctional UDP-GlcNAc/Glc 4-epimerase supports the synthesis of three cell surface glycoconjugates in Campylobacter jejuni. J Biol Chem. 2005;280:4792–4802. doi: 10.1074/jbc.M407767200. [DOI] [PubMed] [Google Scholar]
  • 57.Fry BN, Feng S, Chen YY, Newell DG, Coloe PJ, et al. The galE gene of Campylobacter jejuni is involved in lipopolysaccharide synthesis and virulence. Infect Immun. 2000;68:2594–2601. doi: 10.1128/IAI.68.5.2594-2601.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Maue AC, Mohawk KL, Giles DK, Poly F, Ewing CP, et al. The polysaccharide capsule of Campylobacter jejuni modulates the host immune response. Infect Immun. 2013;81:665–672. doi: 10.1128/IAI.01008-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Woodall CA, Jones MA, Barrow PA, Hinds J, Marsden GL, et al. Campylobacter jejuni gene expression in the chick cecum: evidence for adaptation to a low-oxygen environment. Infect Immun. 2005;73:5278–5285. doi: 10.1128/IAI.73.8.5278-5285.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Cameron A, Frirdich E, Huynh S, Parker CT, Gaynor EC. Hyperosmotic stress response of Campylobacter jejuni. J Bacteriol. 2012;194:6116–6130. doi: 10.1128/JB.01409-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Pittman MS, Elvers KT, Lee L, Jones MA, Poole RK, et al. Growth of Campylobacter jejuni on nitrate and nitrite: electron transport to NapA and NrfA via NrfH and distinct roles for NrfA and the globin Cgb in protection against nitrosative stress. Mol Microbiol. 2007;63:575–590. doi: 10.1111/j.1365-2958.2006.05532.x. [DOI] [PubMed] [Google Scholar]
  • 62.Enocksson A, Lundberg J, Weitzberg E, Norrby-Teglund A, Svenungsson B. Rectal nitric oxide gas and stool cytokine levels during the course of infectious gastroenteritis. Clin Diagn Lab Immunol. 2004;11:250–254. doi: 10.1128/CDLI.11.2.250-254.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Weingarten RA, Grimes JL, Olson JW. Role of Campylobacter jejuni respiratory oxidases and reductases in host colonization. Appl Environ Microbiol. 2008;74:1367–1375. doi: 10.1128/AEM.02261-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Lockwood CW, Clarke TA, Butt JN, Hemmings AM, Richardson DJ. Characterization of the active site and calcium binding in cytochrome c nitrite reductases. Biochem Soc Trans. 2011;39:1871–1875. doi: 10.1042/BST20110731. [DOI] [PubMed] [Google Scholar]
  • 65.Stahl M, Butcher J, Stintzi A. Nutrient acquisition and metabolism by Campylobacter jejuni. Front Cell Infect Microbiol. 2012;2:5. doi: 10.3389/fcimb.2012.00005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Garénaux A, Guillou S, Ermel G, Wren B, Federighi M, et al. Role of the Cj1371 periplasmic protein and the Cj0355c two-component regulator in the Campylobacter jejuni NCTC 11168 response to oxidative stress caused by paraquat. Res Microbiol. 2008;159:718–726. doi: 10.1016/j.resmic.2008.08.001. [DOI] [PubMed] [Google Scholar]
  • 67.Wai SN, Nakayama K, Umene K, Moriya T, Amako K. Construction of a ferritin-deficient mutant of Campylobacter jejuni: contribution of ferritin to iron storage and protection against oxidative stress. Mol Microbiol. 1996;20:1127–1134. doi: 10.1111/j.1365-2958.1996.tb02633.x. [DOI] [PubMed] [Google Scholar]
  • 68.Errington J. Bacterial morphogenesis and the enigmatic MreB helix. Nat Rev Microbiol. 2015;13:241–248. doi: 10.1038/nrmicro3398. [DOI] [PubMed] [Google Scholar]
  • 69.Rodrigues RC, Pocheron AL, Hernould M, Haddad N, Tresse O, et al. Description of Campylobacter jejuni Bf, an atypical aero-tolerant strain. Gut Pathog. 2015;7:30. doi: 10.1186/s13099-015-0077-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Schirner K, Errington J. Influence of heterologous MreB proteins on cell morphology of Bacillus subtilis. Microbiology. 2009;155:3611–3621. doi: 10.1099/mic.0.030692-0. [DOI] [PubMed] [Google Scholar]
  • 71.Dye NA, Pincus Z, Fisher IC, Shapiro L, Theriot JA. Mutations in the nucleotide binding pocket of MreB can alter cell curvature and polar morphology in Caulobacter. Mol Microbiol. 2011;81:368–394. doi: 10.1111/j.1365-2958.2011.07698.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Reimold C, Defeu Soufo HJ, Dempwolff F, Graumann PL. Motion of variable-length MreB filaments at the bacterial cell membrane influences cell morphology. Mol Biol Cell. 2013;24:2340–2349. doi: 10.1091/mbc.e12-10-0728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Bendezú FO, de Boer PA. Conditional lethality, division defects, membrane involution, and endocytosis in mre and mrd shape mutants of Escherichia coli. J Bacteriol. 2008;190:1792–1811. doi: 10.1128/JB.01322-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Tu QV, Mcguckin MA, Mendz GL. Campylobacter jejuni response to human mucin MUC2: modulation of colonization and pathogenicity determinants. J Med Microbiol. 2008;57:795–802. doi: 10.1099/jmm.0.47752-0. [DOI] [PubMed] [Google Scholar]
  • 75.Tenaillon O, Rodríguez-Verdugo A, Gaut RL, Mcdonald P, Bennett AF, et al. The molecular diversity of adaptive convergence. Science. 2012;335:457–461. doi: 10.1126/science.1212986. [DOI] [PubMed] [Google Scholar]
  • 76.Deatherage DE, Traverse CC, Wolf LN, Barrick JE. Detecting rare structural variation in evolving microbial populations from new sequence junctions using breseq. Front Genet. 2014;5:468. doi: 10.3389/fgene.2014.00468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Cowles KN, Gitai Z. Surface association and the MreB cytoskeleton regulate pilus production, localization and function in Pseudomonas aeruginosa. Mol Microbiol. 2010;76:1411–1426. doi: 10.1111/j.1365-2958.2010.07132.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Mauriello EM, Mouhamar F, Nan B, Ducret A, Dai D, et al. Bacterial motility complexes require the actin-like protein, MreB and the Ras homologue, MglA. Embo J. 2010;29:315–326. doi: 10.1038/emboj.2009.356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Doble AC, Bulmer DM, Kharraz L, Karavolos MH, Khan CMA. The function of the bacterial cytoskeleton in Salmonella pathogenesis. Virulence. 2012;3:446–449. doi: 10.4161/viru.20993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Frirdich E, Biboy J, Adams C, Lee J, Ellermeier J, et al. Peptidoglycan-modifying enzyme Pgp1 is required for helical cell shape and pathogenicity traits in Campylobacter jejuni. PLoS Pathog. 2012;8:e1002602. doi: 10.1371/journal.ppat.1002602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Esson D, Mather AE, Scanlan E, Gupta S, de Vries SP, et al. Genomic variations leading to alterations in cell morphology of Campylobacter spp. Sci Rep. 2016;6:38303. doi: 10.1038/srep38303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Stahl M, Frirdich E, Vermeulen J, Badayeva Y, Li X, et al. The helical shape of Campylobacter jejuni promotes in vivo pathogenesis by aiding transit through intestinal mucus and colonization of crypts. Infect Immun. 2016;84:3399–3407. doi: 10.1128/IAI.00751-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Esson D, Gupta S, Bailey D, Wigley P, Wedley A, et al. Identification and initial characterisation of a protein involved in Campylobacter jejuni cell shape. Microb Pathog. 2017;104:202–211. doi: 10.1016/j.micpath.2017.01.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Hyytiäinen H, Juntunen P, Scott T, Kytömäki L, Venho R, et al. Effect of ciprofloxacin exposure on DNA repair mechanisms in Campylobacter jejuni. Microbiology. 2013;159:2513–2523. doi: 10.1099/mic.0.069203-0. [DOI] [PubMed] [Google Scholar]
  • 85.Wu Z, Sahin O, Shen Z, Liu P, Miller WG, et al. Multi-omics approaches to deciphering a hypervirulent strain of Campylobacter jejuni. Genome Biol Evol. 2013;5:2217–2230. doi: 10.1093/gbe/evt172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Kreuder AJ. Investigation of the gallbladder host environment and small RNAs in the pathobiology of Campylobacter jejuni sheep abortion clone IA 3902 Iowa State University. 2016 https://lib.dr.iastate.edu/etd/14982/ Accessed 14 August 2018.
  • 87.Collins C, Didelot X. A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. PLoS Comput Biol. 2018;14:e1005958. doi: 10.1371/journal.pcbi.1005958. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Donkor ES, Stabler RA, Hinds J, Adegbola RA, Antonio M, et al. Comparative phylogenomics of Streptococcus pneumoniae isolated from invasive disease and nasopharyngeal carriage from West Africans. BMC Genomics. 2012;13:569. doi: 10.1186/1471-2164-13-569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Shea PR, Beres SB, Flores AR, Ewbank AL, Gonzalez-Lugo JH, et al. Distinct signatures of diversifying selection revealed by genome analysis of respiratory tract and invasive bacterial populations. Proc Natl Acad Sci USA. 2011;108:5039–5044. doi: 10.1073/pnas.1016282108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Yahara K, Méric G, Taylor AJ, de Vries SP, Murray S, et al. Genome-wide association of functional traits linked with Campylobacter jejuni survival from farm to fork. Environ Microbiol. 2017;19:361–380. doi: 10.1111/1462-2920.13628. [DOI] [PubMed] [Google Scholar]
  • 91.Méric G, Mcnally A, Pessia A, Mourkas E, Pascoe B, et al. Convergent amino acid signatures in polyphyletic Campylobacter jejuni subpopulations suggest human niche tropism. Genome Biol Evol. 2018;10:763–774. doi: 10.1093/gbe/evy026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Ferrero RL, Lee A. Motility of Campylobacter jejuni in a viscous environment: comparison with conventional rod-shaped bacteria. Microbiology. 1988;134:53–59. doi: 10.1099/00221287-134-1-53. [DOI] [PubMed] [Google Scholar]
  • 93.Frirdich E, Vermeulen J, Biboy J, Soares F, Taveirne ME, et al. Peptidoglycan ld-carboxypeptidase Pgp2 influences Campylobacter jejuni helical cell shape and pathogenic properties and provides the substrate for the dl-carboxypeptidase Pgp1. J Biol Chem. 2014;289:8007–8018. doi: 10.1074/jbc.M113.491829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Lertsethtakarn P, Ottemann KM, Hendrixson DR. Motility and chemotaxis in Campylobacter and Helicobacter. Annu Rev Microbiol. 2011;65:389–410. doi: 10.1146/annurev-micro-090110-102908. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Ziprin RL, Hume ME, Andrews K, Droleskey RE, Harvey RB, et al. News and notes: an atypical Campylobacter coli exhibiting unusual morphology. Curr Microbiol. 2005;51:161–163. doi: 10.1007/s00284-005-4527-4. [DOI] [PubMed] [Google Scholar]
  • 96.Hänel I, Borrmann E, Müller J, Müller W, Pauly B, et al. Genomic and phenotypic changes of Campylobacter jejuni strains after passage of the chicken gut. Vet Microbiol. 2009;136:121–129. doi: 10.1016/j.vetmic.2008.10.018. [DOI] [PubMed] [Google Scholar]
  • 97.Field LH, Underwood JL, Payne SM, Berry LJ. Characteristics of an avirulent Campylobacter jejuni strain and its virulence-enhanced variants. J Med Microbiol. 1993;38:293–300. doi: 10.1099/00222615-38-4-293. [DOI] [PubMed] [Google Scholar]
  • 98.Tangwatcharin P, Chanthachum S, Khopaibool P, Griffiths MW. Morphological and physiological responses of Campylobacter jejuni to stress. J Food Prot. 2006;69:2747–2753. doi: 10.4315/0362-028X-69.11.2747. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data
Supplementary data

Articles from Microbial Genomics are provided here courtesy of Microbiology Society

RESOURCES