Abstract
Extinct lineages of Yersinia pestis, the causative agent of the plague, have been identified in several individuals from Eurasia between 5000 and 2500 years before present (BP). One of these, termed the ‘LNBA lineage’ (Late Neolithic and Bronze Age), has been suggested to have spread into Europe with human groups expanding from the Eurasian steppe. Here, we show that the LNBA plague was spread to Europe’s northwestern periphery by sequencing three Yersinia pestis genomes from Britain, all dating to ~4000 cal BP. Two individuals were from an unusual mass burial context in Charterhouse Warren, Somerset, and one individual was from a single burial under a ring cairn monument in Levens, Cumbria. To our knowledge, this represents the earliest evidence of LNBA plague in Britain documented to date. All three British Yersinia pestis genomes belong to a sublineage previously observed in Bronze Age individuals from Central Europe that had lost the putative virulence factor yapC. This sublineage is later found in Eastern Asia ~3200 cal BP. While the severity of the disease is currently unclear, the wide geographic distribution within a few centuries suggests substantial transmissibility.
Subject terms: Archaeology, Bacterial infection
An extinct prehistoric plague lineage of Yersinia pestis has been documented from Central Europe to Asia during the Late Neolithic and Bronze Age. Here, Swali et al. show that this lineage spread to Europe’s northwestern periphery by sequencing three ~4000 year-old Yersinia pestis genomes from Britain.
Introduction
Yersinia pestis is a zoonotic bacterium that can be transmitted via the bite of an infected flea vector, causing either bubonic or septicaemic plague, or via respiratory droplets through human-to-human contact causing pneumonic plague. Ancient DNA analysis has identified Yersinia pestis as the causative agent of not only historic epidemics such as the Justinianic plague1–3 and the Black Death4,5, but also identified previously unknown evidence of Yersinia pestis circulating in prehistory; Late Neolithic and Early Bronze Age (LNBA) plague has been found across Eurasia in the period between ~4700–2800 BP (years Before Present)6–12. The most frequently observed LNBA lineage lacks the ymt virulence factor (LNBA ymt-)11,13. The first known evidence of lineages with the ymt gene (LNBA ymt+) was found in a ~3800-BP-year-old individual (RT5) from Samara, Russia9 and a ~3300-BP-year-old individual (I2470) from Álava, Spain11.
The LNBA lineages were likely brought to Central and Western Europe ~4800 BP by human expansions originating in the Eurasian steppe6,14,15, and it has been hypothesised that these lineages contributed to the decline of certain Late Neolithic European societies6,7. However, it has been unclear how far LNBA plague spread throughout Europe, with the most westward finding of the LNBA ymt- lineage previously identified in present-day southern Germany ~3400 BP11. Human movements associated with the expansion of Bell Beaker cultures introduced steppe-derived ancestries to Britain and intensified links with continental Europe from ~4400 BP16, opening the possibility that Bronze Age groups from north-western Europe were also affected by LNBA lineages of Yersinia pestis, although this has not been observed directly until now.
Here, we show evidence of LNBA Yersinia pestis infections from two sites from different regions in Britain, suggesting that this lineage could have been widespread across Britain after the westward expansion of populations tracing ancestry to the Eurasian steppe. The LNBA Yersinia pestis lineage found in Britain belongs to a clade, which has undergone large losses of their genome, including the putative virulence factor yapC.
Results
Detection of Yersinia pestis in Bronze Age Britain
We sampled 34 individuals from two sites from the British Early Bronze Age—Charterhouse Warren and Levens Park—to screen for the presence of LNBA Yersinia pestis in Britain. We sampled teeth in ancient DNA cleanroom conditions and prepared double-indexed single-stranded DNA libraries, which we screened with 1.8 to 7.3 million paired-end reads each (Methods). We performed a metagenomic analysis in Kraken 217 and identified individuals with a substantial excess of k-mers matching Yersinia pestis compared to the closely related species Yersinia pseudotuberculosis (Methods). This analysis identified two individuals (C10091/1233b and C10098/6265) out of 30 screened for pathogen DNA from Charterhouse Warren Farm in Somerset, United Kingdom (see Methods for further contextual details). The site is a mass burial assemblage of disarticulated human remains that were likely deposited together in a single event in a natural shaft. The two sampled teeth were derived from children aged 12 ± 3 yr and 10 ± 3 yr, respectively. The mandible associated with the C10098 tooth (6265/10-year-old child) has been directly radiocarbon dated to 4145–3910 cal BP (95.4% confidence; OxA-37840: 3685 ± 30 BP, calibrated in OxCal v4.418 using IntCal2019), consistent with two previously published radiocarbon dates on human bones from this assemblage20. The other individual can be assumed to be of similar date.
In addition, our analysis identified Yersinia pestis in one of four individuals recovered from the Levens Park ring cairn monument in Levens, Cumbria, United Kingdom21,22 (see Methods for detailed archaeological context). One complete and three disturbed skeletons (Burials 1, 2, 3 and 4) were recovered from the monument and all have been directly dated to ca. 4300–3700 cal BP. We detected Yersinia pestis in a tooth from Burial 2 (C10928), a 35–45-year-old female recovered in a crouched position from a plank-lined grave (possibly a wooden coffin) and accompanied by Beaker pottery sherds. The skeleton has been directly dated to 4065–3780 cal BP (95.4% confidence, GU-51281: 3602 ± 33 BP22). Comparison of this radiocarbon date to the one obtained from Charterhouse C10098 suggests that we cannot reject the possibility that these two individuals were approximate contemporaries (χ2-test, df = 1, T = 2.746 (5% 3.841)).
These results from multiple sites demonstrate that Yersinia pestis had spread to Britain during the Bronze Age (Fig. 1A), but we note that the frequency of occurrence of Yersinia pestis remains unknown, and could be lower than the observed frequency in the two sites. The false-negative rate of Yersinia pestis detection in archaeological remains is unknown but likely high23, so we cannot exclude that other individuals at the sites were infected.
Genome sequencing
We used the Illumina NovaSeq platform to generate direct shotgun sequencing data of ~768 million and ~322 million read-pairs from individuals C10098 and C10091 from Charterhouse Warren, respectively (Table 1), after size selection to enrich for molecules at least 35 bp in length24 (Methods). The individual from Levens Park (C10928) was not amenable to shotgun sequencing but libraries from this individual, as well as from the two other individuals, underwent two rounds of hybridisation capture with an in-solution target enrichment approach using Yersinia pestis RNA baits (Daicel Arbor Biosciences). Merging data from the direct shotgun sequencing and the targeted enrichment experiments resulted in an average 8.4X, 6.1X and 3.3X coverage of the Yersinia pestis genome for C10091, C10098 and C10928, respectively when filtered with mapping quality of MQ1 (Table 1). All three genomes showed evidence of authenticity:25 postmortem damage, a decreasing number of observed sequences as a function of edit distance to the reference genome, a unimodal fragment length distribution, and a relatively even breadth of coverage across the genome (Supplementary Fig. 1 and Fig. 2A).
Table 1.
Total sequences, length > 35bp | Proportion Yersinia pestis (%) | Sequences aligned to Yersinia pestis after duplicate removal | Clonality (%) | Mean X-fold coverage (MQ1) | Breadth of coverage >1x (%) (MQ1) | Breadth of coverage >2x (%) (MQ1) | X fold coverage pCD1 (MQ1) | X fold coverage pMT1 (MQ1) | X fold coverage pPCP1 (MQ1) | |
---|---|---|---|---|---|---|---|---|---|---|
C10091 Capture | 2,73,38,232 | 19.6 | 4,81,157 | 91.0 | 5.6 | 78.4 | 62.6 | 12.5 | 6.0 | 32.6 |
C10091 Shotgun | 32,17,93,774 | 1.6 | 3,03,420 | 94.2 | 3.9 | 86.3 | 65.5 | 7.9 | 4.6 | 18.7 |
C10091 final coverage | 34,91,32,006 | 3.0 | 6,90,105 | 93.5 | 8.4 | 93.5 | 82.9 | 17.8 | 9.1 | 43.6 |
C10098 Capture | 1,56,97,965 | 36.7 | 3,36,045 | 94.2 | 4.6 | 70.2 | 53.0 | 12.3 | 5.1 | 30.3 |
C10098 Shotgun | 76,77,47,954 | 0.02 | 87,634 | 49.1 | 1.7 | 56.2 | 22.1 | 2.9 | 1.6 | 7.2 |
C10098 Final coverage | 78,34,45,919 | 0.8 | 4,12,351 | 93.1 | 6.1 | 85.0 | 67.1 | 14.9 | 6.6 | 36.3 |
C10928 Capture | 19,49,92,184 | 4.3 | 2,08,793 | 97.5 | 3.3 | 61.3 | 41.7 | 5.5 | 3.2 | 23.7 |
C10928 Shotgun | 46,76,025 | 0.002 | 88 | 3.3 | 0.001 | 0.0 | 0.0 | 0.0 | 0.0 | 0.01 |
C10928 final coverage | 19,96,68,209 | 4.2 | 2,08,881 | 97.5 | 3.3 | 61.3 | 41.7 | 5.5 | 3.2 | 23.7 |
Phylogenetic reconstruction
We aligned genomes from this study and previously published Yersinia pestis Neolithic and LNBA genomes6–12,26 (Supplementary Table 1) to the modern-day Yersinia pestis genome 1.ORI CO92 (NC_003143.1)27, and aligned the Yersinia pseudotuberculosis strain IP3288128 in the same way (Methods). We retained a subset of the genomes, which had a high degree of called sites given a per-site minimum sequence depth of three on all published genomes and a minimum depth of five for the in-house single-stranded library sequenced genomes (Methods). We performed phylogenetic inference with IQ-TREE29, assessed uncertainty with 1000 bootstrap replicates, and rooted the phylogeny with Y. pseudotuberculosis in FigTree. Our final phylogeny consisted of 27 ancient published genomes and both the Charterhouse Warren individuals (C10091 and C10098) from this study. We found that in the reconstructed phylogeny, the Charterhouse Warren Yersinia pestis genomes fall within the broader LNBA ymt- clade. In the asymmetrical clade of LNBA ymt- genomes, grouping broadly according to chronology11, we found that the British genomes are derived with respect to genomes from the Czech Republic (HOP001 and HOP004), dating to ~4300 cal BP and ~4250 cal BP, but basal to a genome from Russia, which dates to ~4200 cal BP (KLZ001)11. Indeed, the topology of the previously published genomes matches previously reconstructed phylogenies. We also placed the lower-coverage genome from Levens Park (C10928) in a reduced phylogeny that included the Charterhouse Warren genomes and high-coverage genomes from adjacent clades (Methods) (Fig. 1B), and found that it was likely closely related to those from Charterhouse Warren.
The two Charterhouse Warren genomes show 2 transversion mismatches out of 939,710 sites with a minimum sequence depth of 5 (a mismatch rate of 0.0002%). Both of these are due to apparent changes in the C10091 genome, and they are located adjacent to each other at positions 2227793 and 2227794 when aligned to the CO92 (NC_003143.1) reference genome. While BlastN searches of the NCBI nucleotide archive found that all sequences had Yersinia pseudotuberculosis and Yersinia pestis as the top match, the observed transversions were adjacent to an insertion (one sequence read further contained a deletion), and it therefore seems possible that they represent spurious alignment from a source uncharacterised in the database (Supplementary Fig. 2, Supplementary Table 230).
Deletions and functional elements
We searched for major deletions by assessing the sequence depth across the higher-coverage Charterhouse Warren genomes (Methods). We identified no new deletions larger than 1000 bp, but the previously identified ‘event 1’, the oldest deletion in LNBA strains, which resulted in the loss of a 36 kb region and was observed in multiple previous sequenced individuals8,11, is also observed in both the Charterhouse Warren and Levens Park genomes. The loss of this region resulted in the loss of the yapC virulence gene, which has been shown in vitro to mediate adhesion in cultured cells and could thus have enhanced the colonisation ability of Yersinia pestis during infection31.
We observe the presence and absence of other functional elements that are all congruent with the phylogenetic placement of the British Yersinia pestis genomes. We confirm the absence of the filamentous prophage, absent in all the published Neolithic and LNBA genomes, which is today mostly identified in 1.ORI strains32. The absence and presence of previously reported virulence factors on the British Yersinia pestis plasmids are all congruent with previously published LNBA ymt- plasmids (Fig. 2B). UreD, associated with increased flea toxicity33, pde-2 which is involved in the downregulating mechanism for biofilm formation34 and flhD, the inactivation of which is associated with immune invasion35, all show variation consistent with no loss of function in the Charterhouse Warren genomes (Methods)9.
Discussion
Our results reveal that the LNBA ymt- Yersinia pestis lineage was not confined to Bronze Age continental Europe, but had spread westward to Britain. We show that Yersinia pestis genomes in Britain had close affinities with Bronze Age genomes in present-day Germany, which may have been introduced by human expansions tracing back to the Eurasian steppe. Our results, alongside previously published genomes, thus show that in the period ~4700-2800 cal BP the LNBA ymt- strain of Yersinia pestis was widespread across Eurasia, from Britain to Eastern Asia. The distance between the Levens Park and Charterhouse Warren sites, in the northwest and southwest of England, respectively, and their statistically indistinguishable radiocarbon dates could suggest the widespread distribution of LNBA ymt- Yersinia pestis strains also across Britain.
Individuals from this study all lack the ymt virulence gene, which is known to have played an important role in flea-mediated transmission. The earliest known Yersinia pestis strain carrying the ymt gene has been found in the Samara region in the western part of the Eurasian steppe and dates to 3800 BP9. This is close in time to our two Yersinia pestis genomes from Britain, providing further evidence for differential contemporary frequencies of the ymt gene across an even wider geographic distance. The ymt gene, alongside the hms gene, enables survival and colonisation of the bacteria in the midgut and proventriculus of fleas, allowing for flea-mediated transmission36–38. However, research has also suggested an early-phase transmission model, which does not require the full extrinsic incubation period for flea-mediated transmission39. It has also been suggested that the ymt gene plays less of an important role in flea-mediated transmission for brown rat blood as opposed to blood from mice, humans, or black rats40.
In terms of the scale of deposition and evidence for extensive trauma, Charterhouse Warren is a rare site in the context of Early Bronze Age Europe41. It is clear that the individuals represented in the Charterhouse mass burial were subject to a non-normative form of funerary treatment, and it could be natural to suggest that this unusual treatment may have been a specific response to a local plague outbreak, much like the mass graves that appear across mediaeval Europe in response to the Black Death epidemic42. However, evidence of fatal trauma amongst the Charterhouse Warren assemblage (R. J. S., T. F-C., J. O., Christophe Snoeck, Fiona Brock, L. L., T. A., Don Walker, in preparation) makes it unlikely that this mass burial was due to a deadly outbreak of plague. Nevertheless, we cannot completely rule out a higher prevalence of plague amongst the remaining 28 individuals from this site as false negatives are possible, due to factors such as poor preservation of endogenous microbial DNA and variable pathogen load, making detection of ancient pathogens more difficult23. This raises the question–which we cannot answer at this point–of whether there may have been any connection between the disease and violent treatment of these individuals. At least one of the two child mandibles shows signs of perimortem trauma, although the disarticulated state of the remains means that potential trauma to the rest of the skeleton cannot be assessed. The presence of Yersinia pestis DNA in the two children suggests that they were either diseased when any trauma was inflicted on them, or alternatively that any trauma was inflicted after their death from the plague. The latter scenario cannot be dismissed altogether, although it is improbable given the rate of perimortem trauma observed in the assemblage as a whole, which suggests most or all individuals were killed during a specific violent event. By contrast, Burial 2 from Levens Park ring cairn represents a much more normative funerary rite for the period22. Further high-resolution sampling will be needed to answer these questions and better understand the transmission dynamics and functional evolution of Yersinia pestis in Britain and beyond.
Methods
All radiocarbon dates were calibrated in OxCal 4.4 using the IntCal20 calibration curve18,19. There is no stable carbon and nitrogen isotopic evidence for any detectable input of marine or freshwater foods that would require a correction for reservoir effects.
Charterhouse Warren: Archaeological context
Charterhouse Warren is a natural shaft in the limestone of the Mendip Hills, Somerset. Initial excavations in the 1970s were instigated in order to determine whether the shaft led to a cave system, as no archaeological remains from the site were known at that time. Excavators noted the presence of human and animal bones in a disarticulated state, some of which showed cutmarks and rodent gnawing43. The assemblage of human remains sampled for aDNA was encountered in a distinct layer at ca. 15 m depth, together with faunal remains and a small number of artefacts, including sherds of a Beaker vessel. At least 40 men, women and children are represented by fragmentary and disarticulated remains. Culturally and chronologically, the assemblage dates to the late Beaker period/Early Bronze Age, ca. 4150–3950 cal BP. This is a non-normative burial context for this period, which is dominated by single articulated burials associated with funerary monuments and cemeteries. Modifications found on a significant proportion of bones suggest that many, if not all, of these individuals had been subject to fatal perimortem trauma and subsequent processing before disarticulated bones were deposited together in the shaft in what is likely to have been a single event, though analysis of the assemblage and modelling of the radiocarbon dates is ongoing. While disarticulated remains are sometimes encountered in Bronze Age Britain, the scale of deposition at Charterhouse Warren is unique in a British context, suggesting that a very unusual event is represented. The evidence for blunt force perimortem trauma to a number of crania together with evidence for dismemberment suggests that this may relate to an episode of extreme violence.
Levens Park: Archaeological context
The Levens Park ring cairn, consisting of a low kerbed mound below which were further rings of boulders or wall-like structures, in Levens, Cumbria, UK, was excavated between 1968 and 1971 by David Sturdy21. Examination of existing archival material by members of Levens Local History Group has produced a recent reassessment of the excavation, resulting in a more detailed understanding of the monument and its purpose22. We provide a general narrative of the site here but see Clare et al.22 for details on associated uncertainties. Four skeletons (Burials 1, 2, 3 and 4) were recovered from the monument and represent two separate phases of burial activity. Burial 4 comprised an unaccompanied crouched burial of a 25–45-year-old male covered with a large boulder but no associated artefacts. The skeleton has been radiocarbon dated to 4229-3976 cal BP (95% confidence, 3731 ± 34 BP, GU-51283), significantly earlier than dates from the other three skeletons (R_Combine χ2 Test in OxCal 4.4: df = 3, T = 11.2 (5%, 11.8))18,19. Burials 1, 2 and 3 were recovered from a larger adjacent structure within the monument. Their radiocarbon dates were statistically indistinguishable, suggesting they could have died around the same time or within a few decades of one another (R_Combine χ2 Test in OxCal 4.4: df = 2, T = 0.7 (5%, 6.0));18,19 Burial 2, which is thought to be the primary interment in this phase, owing to its position in the centre of the new structure, was a 35–45-year-old female recovered in a crouched position from a plank-lined grave (possibly a wooden coffin) and accompanied by sherds of Beaker pottery. The skeleton has been directly dated to 4065–3780 cal BP (95% confidence, 3602 ± 33 BP, GU-51281). Burial 1, a 17–35-year-old female dating to 4080–3840 cal BP, 95% confidence, 3626 ± 33 BP, GU-51280), and Burial 3, a probable adult female dating to 3982–3879 cal BP (95% confidence, 3587 ± 33 BP, GU-51282), were recovered from the cairn matrix and their original character and associations are unclear. Strontium and oxygen stable isotope analyses of Burials 1, 3 and 4 are consistent with them having grown up locally, although the results would also be consistent with an origin in other parts of Britain, likewise Burial 2.
Sampling, DNA extraction and library preparation
Samples were processed at The Francis Crick Institute in a dedicated cleanroom facility. We used an EV410-230 EMAX Evolution Dentistry drill to clean the surface of the teeth and sampled both the cementum and multiple fractions of the dentine, resulting in 7–25 mg of powder from the dentine. Dentine powders were then lysed with 300 µl (<10 mg of powder), 600 µl (10–25 mg) or 1000 µl (>25 mg of powder) of lysis buffer (0.5 EDTA pH 8.0, 0.05% Tween-20, 0.25 mg/ml Proteinase K44) and incubated overnight at 37 °C. Lysates were centrifuged for 2 min at maximum speed of 16,400 × g (13,200 rpm) in a table centrifuge and 140 µl of the lysate was transferred into FluidX tubes for automated extraction on an Agilent Bravo Workstation45. Extracts were turned into single-stranded DNA libraries24, then double-indexed46 and underwent paired-end sequencing on an Illumina HiSeq4000 or NextSeq500 platform resulting in 1.8 to 7.3 million read-pairs per sample for initial screening. All samples were processed alongside negative lysate and extraction controls as well as positive and negative library controls.
Targeted enrichment
Following initial pathogen detection, libraries were taken forward for target enrichment using Yersinia pestis baits predesigned by myBaits Arbor Biosciences, following myBaits custom RNA seq v5.1 (March 2021) High Sensitivity protocol47. We used 7 μl of the initial library for two rounds of hybridisation at 55 °C with 23-h overnight incubation and 20 PCR cycles, followed by a heteroduplex removal using a one-cycle PCR and MinElute Purification (Qiagen) cleanup. We sequenced each library with a 2 × 100 paired-end read configuration on the Illumina MiSeq and NovaSeq platforms (Table 1).
Size selection for shotgun sequencing
Before larger-scale direct shotgun sequencing, fragments shorter than 35 bp and longer than 150 bp were removed from the libraries, as in Gansauge et al.24. Specifically, 100 ng of the initial library was biotinylated and streptavidin beads were used to isolate the non-biotinylated strand and obtain a single-stranded library. These samples (pooled with 3 others) were then loaded on a denaturing polyacrylamide gel along with 35 bp and 150 bp insert markers, and fragments within the desired sequence length were physically excised and eluted from the gel, after overnight incubation. The resulting size-selected libraries were further amplified and sequenced on the Illumina NovaSeq (Table 1).
Bioinformatic processing
Samples were processed via the nf-core/eager v2 pipeline48. First, adapters were removed, paired-end reads were merged and bases with a quality below 20 were trimmed using AdapterRemoval v249 with –trimns –trimqualities –collapse –minadapteroverlap 1 and –preserve5p. Merged reads with a minimum length of 35 bp were mapped to the hs37d5 human reference genome with Burrows-Wheeler Aligner (BWA-0.7.17 aln)50 using the following parameters “-l 16500 -n 0.01”51,52.
Metagenomic screening and authentication
We analysed sequences that did not align successfully to the human genome using Kraken 217 and identified individuals as putatively positive for Yersinia pestis by assessing the number of observed k-mers (sequence matches) to Yersinia pestis compared to Yersinia pseudotuberculosis. These libraries were subsequently aligned to the CO92 Yersinia pestis reference genome (NC_003143.1) and the CO92 plasmids, pMT1 (NC_003134.1), pCD1 (NC_003131.1) and pPCP1 (NC_003132.1) using the BWA aln50 parameters “-l 16500 -n 0.01 -o 2”. Duplicates were removed by keeping only the first sequence out of any set of sequences with the same start position and length (https://github.com/pontussk/samremovedup). We assessed the authenticity of the final set of sequences using the following criteria:25 the observation of postmortem damage, with the number of sequences being negatively correlated with edit distance from the reference genome, and a unimodal fragment length distribution via DamageProfiler53. Additionally, we used SAMTools v1.3.1 depth54 to confirm an even breadth of coverage across the reference genome. Screening libraries that passed these authentication criteria, were taken forward for further shotgun sequencing and target enrichment. For the final BAM files, we merged shotgun and target enriched BAM files using samtools merge resulting in a final coverage of 8.4x, 6.1x and 3.3x coverage for C10091, C10098 and C10928, respectively when filtered on a minimum mapping quality of q1 (MQ1) (Table 1, Fig. 2A, Supplementary Fig. 1).
Analysis of functional elements and virulence factors
We used previously identified virulence genes11 on the Yersinia pestis chromosome, pCD1, pMT1 and pPCP1 plasmids, and analysed the coverage of these genes with BEDTools v2.29.255 (Fig. 2A). The ggplot256 package in R was then used to make a coloured heatmap of the percentage of positions in each virulence gene covered by at least one sequence. Additionally, we used BEDTools v2.29.255 genomecov to assess the missingness across the genome and identify regions that were larger than 1 kb in the two Charterhouse individuals. Some differences in coverage across the Yersinia pestis chromosome between the pre-designed Arbor RNA capture baits compared to shotgun data can be observed in direct comparisons, most likely due to preferential capture of some parts of the chromosome over others with the bait set (Supplementary Fig. 3). However, we see no evidence that this could lead to conclusions about major deletions or loss of function. We manually inspected the ureD, pde-2 and flhD genes to identify the presence of indels and SNPs specific to these genes, to assess whether they were functional or not in the Charterhouse Warren genomes9. However, the coverage for C10928 was too low to assess these indels.
Mismatches between the two Charterhouse genomes
We compared the two Charterhouse Warren genomes to identify transversion SNPs at differing base-fold coverages (Supplementary Fig. 2, Supplementary Table 2) after filtering for a mapping and base quality of 30 using samtools calmd54 and removing heterozygous sites using an in-house script (https://github.com/pontussk/mpileup_mismatch_pathogen.py).
Phylogenetic reconstruction
Previously published data from Bronze age and Neolithic plague genomes and the IP32881 strain of Yersinia pseudotuberculosis were downloaded from the European Nucleotide Archive (Supplementary Table 1) and processed using the same parameters as data generated in this study, described above (Bioinformatic processing) with the exception of duplicate removal, which removed duplicates with the same start position regardless of length to adjust for single-end sequencing in published data.
After duplicate read removal, we used an in-house script (https://github.com/pontussk/mpileup2consensus.py) to keep only homozygous sites with a base quality score of Q30, thus excluding any heterozygous sites and small (shorter than a sequence read length) insertions or deletions and a maximum edit distance of 90% using samtools calmd54. We then filtered all sites to retain only sites with a minimum base coverage of 3-fold to obtain a consensus sequence for all published genomes and a 5-fold coverage for the Charterhouse Warren genomes.
Consensus sequences were then filtered to keep only sites that were polymorphic within the set of sequences used for input in the phylogeny, removing all transitions to remove effects of cytosine deamination in the ancient DNA sequences. For the main phylogeny (Fig. 1A) we excluded individuals with more than 70% missingness across the genome and removed sites that were missing in more than 20% of the remaining individuals, resulting in 2306 transversion SNPs that were taken forward to IQ-TREE v.1.6.1229. We implemented model testing using the ModelFinder in IQ-TREE57, which suggested a TVMe+ASC as the best-fit model according to the Akaike information criterion for both phylogenies in Fig. 1A and Fig. 1B. We implemented 1000 bootstrap replicates for the phylogeny in Fig. 1A and rooted the maximum likelihood phylogeny in FigTree58, with the Yersinia pseudotuberculosis outgroup (IP32881) (the outgroup is not shown in the figure to increase the visibility of the LBNA clades in Fig. 1A).
Due to its low coverage, a separate phylogenetic inference was used to position the Levens Park genome, C10928. Here, we took the filtered consensus sequences of the LNBA strains, which were topologically close to the monophyletic Charterhouse Warren (C10091 and C10098) group, and filtered them to a minimum base coverage of 3, keeping transversions only and sites present in at least 70% of the genomes (regardless of missingness across the genomes). This resulted in 104 SNPs taken forward to IQ-TREE, using a TVMe+ASC model with 100 bootstrap replicates. The maximum likelihood phylogeny was rooted in FigTree58 based on Fig. 1A and transformed into the cladogram in Fig. 1B.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank the Advanced Sequencing Facility and Chris Barrington from the Bioinformatics and Biostatistics Science Technology Platform at The Francis Crick Institute for technical support. P. Sk. was supported by the Vallee Foundation, the European Research Council (grant no. 852558), the European Molecular Biology Organisation, the Wellcome Trust (217223/Z/19/Z), and Francis Crick Institute core funding (FC001595) from Cancer Research UK, the UK Medical Research Council, and the Wellcome Trust. M.H. was supported by Marie Skłodowska Curie Actions (grant no. 844014). Osteological analyses of Charterhouse Warren human skeletal assemblage were supported by the British Academy (SG163375), with funding for radiocarbon dating supplied through NERC’s NEIF programme (NF/2018/1/3) to R.S. We thank Allan Steward and the Levens Local History Group for their collaboration on this project. Archaeological work in Levens ring cairn was supported by a CWAAS grant to Levens Local History Group (to T.C., G.C., I.H., M. Simpson, S.R.). This research was funded in whole, or in part, by the Wellcome Trust (FC001595 and 217223/Z/19/Z). We would also like to thank Gunnar Neumann and the SPAAM community for help and advice. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Author contributions
Conceptualisation: P. Swali, P. Skoglund. Data curation: P. Swali, A.G. Formal analysis: P. Swali. Funding acquisition: R.S., P. Skoglund. Investigation: P. Swali, R.S., A.G., M.K., M.W., J.M., K.A., I.G., T.D., M. Silva, M.H., A.B., T. B., P. Skoglund. Project administration: P. Swali, R.S., T.B., P. Skoglund. Resources: R.S., T.A., L.L., T.F.C., J.O., D.W., T.C., G.C., I.H., M. Simpson, S.R. Supervision: P. Skoglund. Visualisation: P. Swali, A.G., K.A. Writing—original draft: P. Swali, P. Skoglund. Writing—review & editing: P. Swali, R.S., M.K., K.A., T.F.C, J.O., M.H., A.B., T.B., P. Skoglund.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.
Funding
Open Access funding provided by The Francis Crick Institute.
Data availability
Sequencing data generated in this study have been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB61230.
Code availability
Code used in this study is made available at https://github.com/pontussk/.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Deceased: Stephen Read.
Contributor Information
Pooja Swali, Email: swalipooja@gmail.com.
Thomas Booth, Email: thomas.booth@crick.ac.uk.
Pontus Skoglund, Email: pontus.skoglund@crick.ac.uk.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-38393-w.
References
- 1.Harbeck M, et al. Yersinia pestis DNA from skeletal remains from the 6(th) century AD reveals insights into Justinianic Plague. PLoS Pathog. 2013;9:e1003349. doi: 10.1371/journal.ppat.1003349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Keller M, et al. Ancient Yersinia pestis genomes from across Western Europe reveal early diversification during the First Pandemic (541-750) Proc. Natl Acad. Sci. USA. 2019;116:12363–12372. doi: 10.1073/pnas.1820447116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Feldman M, et al. A high-coverage Yersinia pestis genome from a sixth-century Justinianic Plague victim. Mol. Biol. Evol. 2016;33:2911–2923. doi: 10.1093/molbev/msw170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bos KI, et al. A draft genome of Yersinia pestis from victims of the Black Death. Nature. 2011;478:506. doi: 10.1038/nature10549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Spyrou MA, et al. Historical Y. pestis genomes reveal the european black death as the source of ancient and modern plague pandemics. Cell Host Microbe. 2016;19:874–881. doi: 10.1016/j.chom.2016.05.012. [DOI] [PubMed] [Google Scholar]
- 6.Rasmussen S, et al. Early divergent strains of Yersinia pestis in Eurasia 5000 years ago. Cell. 2015;163:571–582. doi: 10.1016/j.cell.2015.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rascovan N, et al. Emergence and spread of basal lineages of Yersinia pestis during the neolithic decline. Cell. 2019;176:295–305.e10. doi: 10.1016/j.cell.2018.11.005. [DOI] [PubMed] [Google Scholar]
- 8.Andrades Valtueña A, et al. The stone age plague and its persistence in Eurasia. Curr. Biol. 2017;27:3683–3691.e8. doi: 10.1016/j.cub.2017.10.025. [DOI] [PubMed] [Google Scholar]
- 9.Spyrou MA, et al. Analysis of 3800-year-old Yersinia pestis genomes suggests Bronze Age origin for bubonic plague. Nat. Commun. 2018;9:2234. doi: 10.1038/s41467-018-04550-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Susat J, et al. A 5000-year-old hunter-gatherer already plagued by Yersinia pestis. Cell Rep. 2021;35:109278. doi: 10.1016/j.celrep.2021.109278. [DOI] [PubMed] [Google Scholar]
- 11.Andrades Valtueña A, et al. Stone age Yersinia pestis genomes shed light on the early evolution, diversity, and ecology of plague. Proc. Natl Acad. Sci. USA. 2022;119:e2116722119. doi: 10.1073/pnas.2116722119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Neumann GU, et al. Ancient Yersinia pestis and Salmonella enterica genomes from Bronze Age Crete. Curr. Biol. 2022;32:3641–3649.e8. doi: 10.1016/j.cub.2022.06.094. [DOI] [PubMed] [Google Scholar]
- 13.Hinnebusch J, et al. Murine toxin of Yersinia pestis shows phospholipase D activity but is not required for virulence in mice. Int. J. Med. Microbiol. 2000;290:483–487. doi: 10.1016/S1438-4221(00)80070-3. [DOI] [PubMed] [Google Scholar]
- 14.Haak W, et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature. 2015;522:207. doi: 10.1038/nature14317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Allentoft ME, et al. Population genomics of Bronze Age Eurasia. Nature. 2015;522:167–172. doi: 10.1038/nature14507. [DOI] [PubMed] [Google Scholar]
- 16.Olalde I, et al. The Beaker phenomenon and the genomic transformation of northwest Europe. Nature. 2018;555:190–196. doi: 10.1038/nature25738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20:257. doi: 10.1186/s13059-019-1891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ramsey CB. Bayesian analysis of radiocarbon dates. Radiocarbon. 2009;51:337–360. doi: 10.1017/S0033822200033865. [DOI] [Google Scholar]
- 19.Reimer PJ, et al. The IntCal20 northern hemisphere radiocarbon age calibration curve (0–55 cal kBP) Radiocarbon. 2020;62:725–757. doi: 10.1017/RDC.2020.41. [DOI] [Google Scholar]
- 20.Levitan BM, Smart PL. Charterhouse Warren farm swallet, mendip, radiocarbon dating evidence. Proc. Univ. Bristol Spelaeological Soc. 1989;18:390–394. [Google Scholar]
- 21.Sturdy, D. A ring-cairn in Levens Park, Westmorland. Scottish Archaeology Forum.
- 22.Clare T, Cook G, Hodkinson ID, Simpson M, Read S. A re-interpretation of the Levens Park ring cairn, Cumbria, based on the original excavation archives. Archaeological J. 2021;178:298–329. doi: 10.1080/00665983.2021.1898259. [DOI] [Google Scholar]
- 23.Spyrou MA, Bos KI, Herbig A, Krause J. Ancient pathogen genomics as an emerging tool for infectious disease research. Nat. Rev. Genet. 2019;20:323–340. doi: 10.1038/s41576-019-0119-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Gansauge M-T, Aximu-Petri A, Nagel S, Meyer M. Manual and automated preparation of single-stranded DNA libraries for the sequencing of DNA from ancient biological remains and other sources of highly degraded DNA. Nat. Protoc. 2020;15:2279–2300. doi: 10.1038/s41596-020-0338-0. [DOI] [PubMed] [Google Scholar]
- 25.Key FM, Posth C, Krause J, Herbig A, Bos KI. Mining metagenomic data sets for ancient DNA: recommended protocols for authentication. Trends Genet. 2017;33:508–520. doi: 10.1016/j.tig.2017.05.005. [DOI] [PubMed] [Google Scholar]
- 26.Yu H, et al. Paleolithic to Bronze Age Siberians reveal connections with first Americans and across Eurasia. Cell. 2020;181:1232–1245.e20. doi: 10.1016/j.cell.2020.04.037. [DOI] [PubMed] [Google Scholar]
- 27.Parkhill J, et al. Genome sequence of Yersinia pestis, the causative agent of plague. Nature. 2001;413:523–527. doi: 10.1038/35097083. [DOI] [PubMed] [Google Scholar]
- 28.Reuter S, et al. Parallel independent evolution of pathogenicity within the genus Yersinia. Proc. Natl Acad. Sci. USA. 2014;111:6768–6773. doi: 10.1073/pnas.1317161111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Robinson JT, et al. Integrative genomics viewer. Nat. Biotechnol. 2011;29:24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Felek S, Lawrenz MB, Krukonis ES. The Yersinia pestis autotransporter YapC mediates host cell binding, autoaggregation and biofilm formation. Microbiology. 2008;154:1802–1812. doi: 10.1099/mic.0.2007/010918-0. [DOI] [PubMed] [Google Scholar]
- 32.Derbise A, Carniel E. YpfΦ: a filamentous phage acquired by Yersinia pestis. Front. Microbiol. 2014;5:701. doi: 10.3389/fmicb.2014.00701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Chouikha I, Hinnebusch BJ. Silencing urease: a key evolutionary step that facilitated the adaptation of Yersinia pestis to the flea-borne transmission route. Proc. Natl Acad. Sci. USA. 2014;111:18709–18714. doi: 10.1073/pnas.1413209111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sun Y-C, Jarrett CO, Bosio CF, Hinnebusch BJ. Retracing the evolutionary path that led to flea-borne transmission of Yersinia pestis. Cell Host Microbe. 2014;15:578–586. doi: 10.1016/j.chom.2014.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Minnich SA, Rohde HN. A rationale for repression and/or loss of motility by pathogenic Yersinia in the mammalian host. Adv. Exp. Med. Biol. 2007;603:298–310. doi: 10.1007/978-0-387-72124-8_27. [DOI] [PubMed] [Google Scholar]
- 36.Jarrett CO, et al. Transmission of Yersinia pestis from an infectious biofilm in the flea vector. J. Infect. Dis. 2004;190:783–792. doi: 10.1086/422695. [DOI] [PubMed] [Google Scholar]
- 37.Eisen RJ, et al. Early-phase transmission of Yersinia pestis by unblocked fleas as a mechanism explaining rapidly spreading plague epizootics. Proc. Natl Acad. Sci. USA. 2006;103:15380–15385. doi: 10.1073/pnas.0606831103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hinnebusch BJ, Erickson DL. Yersinia pestis biofilm in the flea vector and its role in the transmission of plague. Curr. Top. Microbiol. Immunol. 2008;322:229–248. doi: 10.1007/978-3-540-75418-3_11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Eisen RJ, Dennis DT, Gage KL. The role of early-phase transmission in the spread of Yersinia pestis. J. Med. Entomol. 2015;52:1183–1192. doi: 10.1093/jme/tjv128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bland DM, Miarinjara A, Bosio CF, Calarco J, Hinnebusch BJ. Acquisition of yersinia murine toxin enabled Yersinia pestis to expand the range of mammalian hosts that sustain flea-borne plague. PLoS Pathog. 2021;17:e1009995. doi: 10.1371/journal.ppat.1009995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Johnston, R. Bronze Age Worlds: A Social Prehistory of Britain and Ireland (Routledge, 2020).
- 42.Gottfried, R. S. Black Death (Simon and Schuster, 2010).
- 43.Levitan BM, et al. Charterhouse Warren Farm Swallet, Mendip, Somerset: exploration, geomorphology, taphonomy and archaeology. Proc. Univ. Bristol Spelaeol. Soc. 1988;18:P171–P239. [Google Scholar]
- 44.Dabney J, et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl Acad. Sci. USA. 2013;110:15758–15763. doi: 10.1073/pnas.1314445110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Rohland N, Glocke I, Aximu-Petri A, Meyer M. Extraction of highly degraded DNA from ancient bones, teeth and sediments for high-throughput sequencing. Nat. Protoc. 2018;13:2447–2461. doi: 10.1038/s41596-018-0050-5. [DOI] [PubMed] [Google Scholar]
- 46.Kircher, M., Sawyer, S. & Meyer, M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, gkr771 (2011). [DOI] [PMC free article] [PubMed]
- 47.Furtwängler A, et al. Comparison of target enrichment strategies for ancient pathogen DNA. Biotechniques. 2020;69:455–459. doi: 10.2144/btn-2020-0100. [DOI] [PubMed] [Google Scholar]
- 48.Fellows Yates, J. A. et al. Reproducible, portable, and efficient ancient genome reconstruction with nf-core/eager. Peer J. 9, e10947 (2021). [DOI] [PMC free article] [PubMed]
- 49.Schubert M, Lindgreen S, Orlando L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res. Notes. 2016;9:88. doi: 10.1186/s13104-016-1900-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Meyer M, et al. A high-coverage genome sequence from an Archaic denisovan individual. Science. 2012;338:222–226. doi: 10.1126/science.1224344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Poullet M, Orlando L. Assessing DNA sequence alignment methods for characterizing ancient genomes and methylomes. Front. Ecol. Evolution. 2020;8:105. doi: 10.3389/fevo.2020.00105. [DOI] [Google Scholar]
- 53.Neukamm J, Peltzer A, Nieselt K. DamageProfiler: fast damage pattern calculation for ancient DNA. Bioinformatics. 2021;37:3652–3653. doi: 10.1093/bioinformatics/btab190. [DOI] [PubMed] [Google Scholar]
- 54.Li H, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wickham, H. ggplot2: Elegant Graphics for Data Analysis. (Springer, New York, NY, 2009).
- 57.Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Rambaut, A. FigTree v1.3.1: Tree Figure Drawing Tool (Available from http://tree.bio.ed.ac.uk/software/figtree/) Accessed November 2, 2011 (2009).
- 59.Schuenemann VJ, et al. Targeted enrichment of ancient pathogens yielding the pPCP1 plasmid of Yersinia pestis from victims of the Black Death. Proc. Natl Acad. Sci. USA. 2011;108:E746–E752. doi: 10.1073/pnas.1105107108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing data generated in this study have been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB61230.
Code used in this study is made available at https://github.com/pontussk/.