Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2022 Oct 24;32(20):4350–4359.e6. doi: 10.1016/j.cub.2022.08.036

Genomes from a medieval mass burial show Ashkenazi-associated hereditary diseases pre-date the 12th century

Selina Brace 1,11, Yoan Diekmann 2,3,11, Thomas Booth 1,4,8,9,11, Ruairidh Macleod 1,2,5,11, Adrian Timpson 2, Will Stephen 1, Giles Emery 6, Sophie Cabot 7, Mark G Thomas 2,10,, Ian Barnes 1,12,∗∗
PMCID: PMC10499757  PMID: 36044903

Summary

We report genome sequence data from six individuals excavated from the base of a medieval well at a site in Norwich, UK. A revised radiocarbon analysis of the assemblage is consistent with these individuals being part of a historically attested episode of antisemitic violence on 6 February 1190 CE. We find that four of these individuals were closely related and all six have strong genetic affinities with modern Ashkenazi Jews. We identify four alleles associated with genetic disease in Ashkenazi Jewish populations and infer variation in pigmentation traits, including the presence of red hair. Simulations indicate that Ashkenazi-associated genetic disease alleles were already at appreciable frequencies, centuries earlier than previously hypothesized. These findings provide new insights into a significant historical crime, into Ashkenazi population history, and into the origins of genetic diseases associated with modern Jewish populations.

Keywords: ancient DNA, Britain, Medieval, Jewish, Ashkenazi, genetic disease

Highlights

  • Medieval English human remains are source for earliest Jewish genomes

  • Population continuity between 12th century and modern Ashkenazi Jews

  • Four alleles associated with genetic disorders in Ashkenazi Jews observed

  • Founder event elevating Jewish genetic disorder frequencies predates 12th century


Brace et al. find that human remains recovered from a medieval well in Norwich, UK, have strong genetic affinities with modern-day Ashkenazi Jews and were likely victims of recorded antisemitic violence. The presence of Ashkenazi-associated genetic disease alleles indicates a pre-12th century population founder event.

Introduction

In 2004 construction workers excavating land in central Norwich, UK, as part of the Chapelfield shopping center development recovered human skeletal elements from their spoil.1 Subsequent archaeological investigations led to the discovery and excavation of a probable well containing the commingled remains of at least seventeen people. The stratigraphic position of the remains, their completeness, and state of articulation suggested that they had all been deposited in a single event shortly after their death. The overrepresentation of subadults and the unusual location of the burial outside of consecrated ground suggested that they may have been victims of a mass fatality event such as famine, disease, or mass murder.

Pottery sherds from the well were dated typologically to 12th–14th centuries CE, and two initial radiocarbon determinations on the skeletal remains placed these in the 11th–12th centuries.1 The most prominent historically attested mass death in Norwich within this date range was in 1190 CE when members of the Jewish community were killed during antisemitic riots precipitated by the beginning of the Third Crusade,2 although the number of individuals killed is unclear.3 Norwich had been the setting for a previous notable event in the history of medieval antisemitism when, in 1144 CE, the family of William of Norwich claimed that local Jews were responsible for his murder, an argument taken up by Thomas of Monmouth through the first documented invocation of the blood libel myth. This represents the beginnings of an antisemitic conspiracy theory that persists up to the present day.4 The possibility that the remains found at the Chapelfield well site were those of the victims of antisemitic violence is given further support by the site’s location just to the south of the medieval Jewish quarter of the city.1,5 However, no additional archaeological evidence linked the human remains to a specific historical event or group of people. During the High Medieval period (ca. 1000–1300 CE), Norwich witnessed a number of outbreaks of large-scale violence,6 and additional data were therefore required to test the hypothesis that these individuals were of Ashkenazi Jewish descent.

Judaism is a shared religious and cultural identity, with endogamous marriage practices and distinctive diasporic histories of communities worldwide, particularly a Levantine origin and complex history of migrations over the last ∼2.5 millennia. Present-day Ashkenazim are descendants of medieval Jewish populations with histories primarily in northern and eastern Europe. As a result, they carry distinctive ancestries,7,8,9,10 and Jewish and non-Jewish medieval individuals living in the same regions would likely show characteristic patterns of genetic variation.11

Hereditary disorders in Ashkenazi Jewish populations have been the focus of considerable medical research,12,13,14,15,16 with genetic screening now commonplace to mitigate risks.17 Their prevalence is generally attributed to strong genetic drift during Ashkenazi population bottlenecks,10,18,19,20,21,22,23 coupled with high endogamy,7,11,24 although other processes such as heterozygote advantage have been proposed.25,26 Candidate population bottlenecks include the phase of dispersion following the destruction of the Second Temple in 70 CE, the formation of Ashkenazi communities in northern Europe during the medieval period, antisemitic persecution arising from the Crusades, unfounded reprisals for the Black Death, and the movement from western and central Europe to eastern Europe that preceded rapid population growth from the 15th to 18th centuries.19,21,27,28

No genomes from known Jewish individuals are currently available from the medieval period or earlier, largely because exhumation and scientific testing of Jewish remains are prohibited. Such data could inform on the migration and admixture histories of Jewish populations. Furthermore, the presence of any pathogenic variants would provide valuable clues to the origins and spread of Ashkenazim-associated genetic disorders. Here, we examine results from radiocarbon dating and genetic analyses of the Chapelfield individuals to better establish who they were, when they died, and the nature of their death and burial, and identify potential broader implications for Ashkenazim population history and genetics.

Results

Radiocarbon dating

The two previously published radiocarbon dates1 were supplemented by three further radiocarbon dates obtained by directly sampling the human remains (Method details: Radiocarbon dating). All five radiocarbon dates were consistent with each other (Acomb = 75.8), so they were calibrated and modeled as a single event using the OxCal function Combine.29,30 This indicated that the bodies were deposited 1161–1216 calCE (calibrated radiocarbon years in the CE; 95% confidence) or 1165–1207 calCE (68% confidence). This date range is consistent with the only historically attested antisemitic massacre in Norwich in 1190 CE (Figure 1). However, this range also encompasses the so-called Great Revolt of 1174 CE when many people were killed during the sack of Norwich by Hugh Bigod.6

Figure 1.

Figure 1

Details of the archaeological and radiocarbon information from the Chapelfield well deposit

(A) Location of the site in Norwich, UK.

(B) West-facing vertical section drawing of the Chapelfield well shaft.1

(C) Probability distribution for the date of deposition of the human remains based on 5 radiocarbon dates modeled as a single event using the Combine function in OxCal 4.4 and the IntCal20 curve. 1165–1179 CE (31.4% probability), 1190–1207 (36.8% probability), and 1161–1216 CE (95.4% probability). χ2 test, df = 4, T = 4.882 (5% 9.488); agreement, n = 5; Acomb = 75.8% (An = 31.6%),29,30 from isotopic data in Table S1.

See also Method details: Radiocarbon dating.

Ancient DNA

NGS libraries from 25 skeletal elements were screened, and nine libraries from six individuals were selected for higher coverage sequencing on the basis of endogenous DNA content > 4.5% (Data S1A). This resulted in autosomal average coverage (average read depth) per individual ranging from 0.16× to 13.81× and a mean read length of 68 bp from the six individuals (Data S1B). In addition to established methods for authenticating aDNA,31,32 we studied DNA molecule degradation using the lambda parameter to estimate true fragment length33 and compared results with a depositionally varied panel of ancient genomes (Method details: DNA fragmentation). We found significant variation in DNA fragmentation among the sequenced Chapelfield samples, indicating this cannot be predicted by depositional history.

Familial relationships

We inferred familial relationships among the Chapelfield group on the basis of pairwise relatedness coefficients and summary statistics (Method details: Inferring familial relationships and inbreeding). From this, it was inferred that three individuals were full-sibling sisters: SB606 (from Deposit Sk 75), a 10- to 15-year-old; SB671 (Deposit Sk 78), a young adult; and SB605 (Deposit Sk 69), a 5- to 10-year-old (Data S1G). These sisters were found to share the mitochondrial haplotype H5c2. In addition, SB696 appears more distantly related to this group, and SB676 is in turn distantly related to SB696.

Individual SB604 had multiple long runs of homozygosity (RoHs) comprising large portions of many chromosomes (Figures 2 and S3), and an inbreeding coefficient (0.21) close to that expected for the offspring of a first-order union. The very long RoHs (up to approximately 40 cM) identified in this individual indicate a very recent inbreeding event. Additionally, the proportions of long RoHs in SB676 and SB605 are consistent with their parents having been second-degree relatives (Figure S3). We exclude the possibility that the observed RoHs are entirely explained by low effective population size, as we would expect the distribution of RoH lengths to show an excess of short RoHs, which we do not observe (Figure 2).

Figure 2.

Figure 2

Runs of homozygosity inferred among the Chapelfield individuals using hapROH

The stacked plot on the right shows total lengths of RoHs for different length categories in the six sequenced Chapelfield individuals. Plots on the right show expected sum of RoH lengths for close kin (“Recent Loops”) and under small population size scenarios. Detailed plots of RoH length distributions and autosome locations for SB604, SB605, and SB696 are provided in Figure S3, with comparisons indicated for certain recent inbreeding scenarios.

Genetic ancestry

Present-day Ashkenazim are primarily descended from diasporas who settled in various parts of northwestern and, later, central and eastern Europe through the medieval period.34 As such, they represent the present-day population that we would expect to be genetically most similar to Jewish individuals in medieval England. In addition, historical sources indicate that the Norwich Jewish community were descendants of Ashkenazi Jews from Rouen, Normandy, who were invited to England by William the Conqueror after 1066 CE.35 We therefore explored the genetic affinities of the six Chapelfield individuals using principal component analysis (Figure 3) and tested whether modern Ashkenazi ancestry is better explained by the ancestry of the Chapelfield assemblage or a mixture of modern populations acting as proxies for ancient admixture components (e.g., Middle Eastern and Southern and Eastern European). We inferred ancestry proportions for modern Ashkenazi with qpAdm,36 with Chapelfield, Turkish Jews, Sicilian, French, and Polish as potential sources, and found the best model to be one of 100% Chapelfield (p = 0.65; Data S1I; by convention values below 0.01 indicate a poor fit). We also modeled Chapelfield ancestry as a mixture of modern populations, which we use as proxies for hypothesized ancestry components: Turkish Jews, Sicilian, French, and Polish. We estimate a mixture of ∼33%, ∼67%, ∼0%, and ∼0%, respectively (p = 0.88). These results are consistent with a previous demographic model,37 which places the introgression of Eastern European ancestry after the date of these individuals.

Figure 3.

Figure 3

Principal component analysis (PCA) exploring the genetic affinities of the six Chapelfield individuals

We projected the six Chapelfield genomes on a PCA defined by variation among modern western Eurasian population samples, including modern Jewish individuals. All six Chapelfield individuals project well away from present-day British samples, as well as northern Europeans more generally. Instead, they partially overlap with Southern Europeans, close to Cypriots, modern Ashkenazi, Turkish, and North African Jews. These results are consistent with the Chapelfield individuals having Jewish ancestry (cf. Kopelman et al.38).

Uniparental haplogroup calls for all Chapelfield individuals also support genetic affinities to the Near East and in some cases to Ashkenazi Jewish populations (Method details: Sex and uniparental haplogroups). Specifically, Ashkenazi Jews form the majority of modern carriers for mitochondrial haplogroup H5c2,39,40,41 in particular the back mutation at 16304, observed in the three sisters SB605, SB606, and SB671 (Table S3). The Y chromosome haplotype of SB676 (E1b1b1b2a1b1a) is within haplogroup E-M34, which is common in semitic language speakers and has a frequency of 11.7% among Ashkenazim for the parent haplogroup E-M12332. Similarly, the parent subclades of SB604 (J1a2a1a2d2b2) and SB696 (T1a1a) are particularly associated with Levantine ancestry.42,43

In order to temporally resolve genomic affinities, we estimated coalescence rates using Colate44 based on allele ages inferred from a genealogy of a diverse set of modern populations.45 In panmictic population models, intra-group coalescence rates are expected to be inversely related to effective population size. Between groups, they can be interpreted as a function of the intensity of gene flow prior to the coalescence event. They are therefore informative on population size history and demographic processes like admixture.

In the epoque roughly corresponding to 119 BCE to 1140 CE (Figure 4A), we find that Europeans are separated from modern Ashkenazi Jews and Middle Eastern individuals on the first axis of variation, with Chapelfield individuals in between but closer to Europeans. The second dimension sets modern Ashkenazi Jews apart, and to a lesser extent the Chapelfield individuals. Figure 4B summarizes the same pairwise coalescence rates by showing inter- and intra-group average pairwise coalescence rates (apCRs). Inter-group apCRs mirror the patterns of the first dimension of the multidimensional scaling (MDS) plot, while intra-group apCRs are highest in modern Ashkenazi Jews, followed by Chapelfield individuals, and can be seen as reflecting the variation shown in the second dimension of the MDS plot. The Chapelfield individuals have the highest inter-group apCR with modern Ashkenazi Jews. The relative ranking of apCRs for the older epoque roughly from 3278 BCE to 119 BCE shown in Figure 4C are similar to those in the later epoque, with one difference being approximately equal intra-group apCRs in Ashkenazi Jews and Chapelfield individuals.

Figure 4.

Figure 4

Pairwise coalescence rates between European, Middle Eastern, modern Ashkenazi, and Chapelfield individuals

(A) Multidimensional scaling (MDS) of pairwise coalescence rates. Rates between the chromosomes of a single individual are not included.

(B and C) Heatmaps of average pairwise coalescence rates (apCRs) between and within groups of individuals. Color code for inter-group comparisons goes from light yellow (low apCR) to dark red (high apCR); for intra-goup apCR from light gray (low apCR) to black (high apCR). years ago, ya; Middle Eastern, ME; European, Eur; modern Ashkenazi Jews, AJ; Chapelfield, Ch.

We interpret these patterns as showing (1) some degree of population continuity between Chapelfield individuals and modern Ashkenazi Jews, consistent with the qpAdm results; (2) modern Ashkenazi Jews being a mixture of Middle Eastern and European populations, as, for example, found by Xue et al.37 and consistent with the qpAdm results; (3) both modern Ashkenazi Jews and Chapelfield individuals having smaller effective population sizes than pan-Middle Eastern and -European populations, which is not sufficient to infer but consistent with a potential population bottleneck in Ashkenazim before 1140 CE; and (4) modern Ashkenazim having experienced additional bottlenecks or increased endogamy after 1140 CE.

Phenotypes and genetic disorders

Three individuals have sufficient genotyped SNPs to pass the threshold for HirisPlex46 pigmentation phenotype inference (Method details: Inferring pigmentation phenotypes; Table S4). Two individuals were inferred to have had brown eyes, one with “dark” and one with “light” hair (SB605 and SB676, respectively), while the 0- to 3-year-old boy (SB604) was inferred to have had blue eyes and red hair, the latter of which is associated with historical stereotypes of European Jews.47

We examined the six Chapelfield genomes for variants associated with hereditary diseases in Ashkenazi Jews10,16,48,49,50 at 159 loci. Previous studies have attributed the high frequencies of certain genetic disease alleles in Ashkenazi Jews principally to high rates of drift during population bottlenecks, variously hypothesized to have occurred around 1,100–1,400 CE,21 ca. 900 CE,19 ∼1,300 CE,24 33 generations ago,51 or 30 generations ago.16 Because the Chapelfield individuals lived prior to, or at the start of these hypothesized bottlenecks, we would not expect the comparatively high frequencies of modern Ashkenazim-associated disease alleles to have been reached in the population to which they belonged. Rather, we expect the population at this time to have disease allele frequencies that are more typical of modern European populations.

Across the 159 Ashkenazi-associated genetic disorder loci considered, we amassed a total of 4,755 reads for the six Chapelfield individuals. Of these, 45 reads from 35 loci were genetic disorder alleles (one locus had 4 reads, two loci had 3 reads each, three loci had 2 reads each, and 29 loci had 1 read each). However, it is important to note that some of these reads will be type 1 errors, falsely indicating the presence of a disease allele. Therefore, we simulated datasets assuming both the modern European (non-Finnish) and modern Ashkenazi population allele frequencies in the gnomAD database,52 to explore how many disease allele reads should be expected in our sample at different read error rates. We sampled A,C,G,T nucleotides (at each locus, for each individual) from a multinomial distribution, using the observed total read depth as the number of trials. To account for read errors, we introduce a read error parameter α (assumed constant across all loci) to adjust the gnomAD allele frequencies, which were then used as the multinomial probabilities. As a simple test statistic to compare our observed data with simulations, we use the total number of disease alleles with one or more reads. Figure 5 illustrates that across a range of plausible error rates (0% to 1.5%), our test statistic for the Chapelfield data is typically expected given modern Ashkenazi Jewish population allele frequencies (one-tail test for greater or equal to 35: p = 0.8143), but unlikely given modern European (non-Finnish) frequencies (one-tail test for greater or equal to 35, p = 0.0048).

Figure 5.

Figure 5

Heatmap of simulation results for genetic disorders

Comparison of test statistic (number of disease alleles with a read depth > 0) of observed Chapelfield data with 22,500 simulations under the assumption of modern Ashkenazi Jewish population allele frequencies, and modern European (non-Finnish) population allele frequencies, across a range of plausible read error rates. Methods underlying this analysis are described in Method details: Analysis of Ashkenazi-associated Mendelian disorders, and observed read depth per loci in the Chapelfield samples are shown in Figure S4.

To explore this further, we formulated a likelihood function to calculate the exact probability of the six individuals’ observed allele reads at the 159 disease loci, given the allele frequencies of any proposed population. The likelihood function utilizes the read error parameter, summing the probability of the observed data for all 106 permutations of the ten possible genotypes at a single locus. We assume loci are independent; therefore, the overall probability is simply the product of each locus-specific probability. This allows us to calculate the likelihood of either the modern European (non-Finnish) or modern Ashkenazi populations, given the data. To further account for uncertainty in the allele frequencies of these modern populations, we use the gnomAD52 counts of alleles as parameters in a Dirichlet distribution (with a uniform prior) to sample across possible population allele frequencies. Figure 6 illustrates these likelihoods under 5,000 random samples, each with a different read error rate sampled from a uniform distribution between 0% and 1.5%. The maximum likelihood read error rate estimates are notably similar (0.87% and 0.94%, respectively), and crucially these results show that the data are 4,615 times more probable under a model that these individuals were sampled from the modern Ashkenazi population than they were sampled from the modern non-Finnish European population. This approach assumes the six individuals are randomly sampled from either population. Further assessment of the effect of this assumption given that three individuals are siblings suggests that in the case of these data our assumption has a conservative effect on the likelihood ratio (Method details: Analysis of Ashkenazi-associated Mendelian disorders).

Figure 6.

Figure 6

Determining the probability of observed genetic disorder reads in the Chapelfield sample, given different population origins

Log likelihoods compare the probability that the observed data were generated under a model of either the modern Ashkenazi population or the modern European (non-Finnish) population, using respective population allele frequencies in the gnomAD database. Each population model has its own parameter α, which determines the allele read error probability and is assumed constant across all loci. The maximum likelihood ratio indicates the data are 4,615 times more probable under the Ashkenazi model than the European model.

The likelihood ratios (LRs) at the vast majority of loci (155 of 159) have little influence on this (mean = 0.967, SD = 0.054, range 0.777 to 1.125), since loci with no reads are equally probable (certain) under either population model, and those with only non-pathogenic allele reads are marginally more probable under the European population in which the pathogenic alleles are at a lower frequency. Instead, the substantial overall difference in likelihoods is driven by variant NC_000021.9:g.32602299G>C (rs202094637, 21:33974609G>C) linked to primary ciliary dyskinesia50 where SB676 has allele reads of A = 0, C = 3, G = 6, T = 0 (LR=113.4); variant NC_000007.14:g.83961537G>A (rs138952094, 7:83590853G>A) reported as linked to delayed puberty16 where SB605 has allele reads of A = 3, C = 0, G = 6, T = 0 (LR=67.8); variant NC_000005.10:g.112839514T>A (rs1801155, 5:112175211T>A) linked to cancer predisposition10 where SB676 has allele reads of A = 4, C = 0, G = 0, T = 6 (LR=48.6); and variant NC_000022.11:g.50528591C>T (rs188802138, 22:50967020C>T) linked to mitochondrial DNA depletion syndrome 116 where SB696 has allele reads of A = 0, C = 0, G = 0, T = 1 (LR = 2.7). We note that the interpretation of NC_000007.14:g.83961537G>A as pathogenic has been recently changed to uncertain on ClinVar,53 but this does not affect the overall conclusion from our analyses that disease-associated alleles rose to frequencies similar to that in modern Ashkenazi populations by the 12th century.

Discussion

We identify the presence of multiple related individuals at the Chapelfield site, with ancestry similar to modern Ashkenazi Jews, and a combined radiocarbon date of 1161–1216 calCE (95% confidence). These findings are consistent with accounts of the 1190 CE antisemitic attacks,2 involving the targeting of households. It is therefore highly probable that the Chapelfield remains were those of victims of the 1190 CE riots, despite the challenges of associating archaeological sites with specific historical events. The red hair pigmentation prediction for SB604 is notable as medieval antisemitic tropes often incorporated an association between Jews and red hair.47 Our results also indicate that Ashkenazim-associated disease alleles rose to near-modern frequencies prior to the 12th century CE. Since the majority of these alleles are at comparatively low frequencies in Sephardi Jews, the population bottleneck most likely to have resulted in elevated frequencies is one associated with the formation of Ashkenazim communities in northern Europe during the early medieval period.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
Biological samples

Human skeletal remains This study SB604
Human skeletal remains This study SB605
Human skeletal remains This study SB606
Human skeletal remains This study SB671
Human skeletal remains This study SB676
Human skeletal remains This study SB696

Chemicals, peptides, and recombinant proteins

EDTA Sigma Aldrich Cat#03690-100ML
Tris-HCL Fisher Scientific Cat#10336763
Roche large volume viral kit, nucleic acid kit Scientific Labs Cat#5114403001
MinElute PCR Purification Kit Qiagen Cat#28006
UDG User Enzyme New England Biolabs Cat#M5505L
UGI New England Biolabs Cat#M0281S
10% Tween-20 Sigma Aldrich Cat#P1379-25ML
AmpliTaq Gold Fisher Scientific Cat#N8080241

Deposited data

The Genome Aggregation Database (gnomAD) v2.1.1 Karczewski et al.52 https://gnomad.broadinstitute.org/
The Ashkenazi Genomes Consortium (TAGC) dataset Carmi et al.10 https://ega-archive.org/dacs/EGAC00001000151
Allen Ancient DNA Resource Reich54 https://reich.hms.harvard.edu/allen-ancient-dna-resource-aadr-downloadable-genotypes-present-day-and-ancient-dna-data

Software and algorithms

OxCal 4.4 Ramsey29 https://c14.arch.ox.ac.uk/oxcalhelp/hlp_contents.html
IntCal20 calibration curve Reimer30 https://c14.arch.ox.ac.uk/oxcalhelp/hlp_curves.html
AdapterRemoval v2 Schubert et al.55 https://github.com/MikkelSchubert/adapterremoval
Sambamba Tarasov et al.56 https://github.com/biod/sambamba
Picard Tools v2.23.3 Broad Institute57 https://broadinstitute.github.io/picard/
GATK v3.8 McKenna et al.58 https://github.com/broadinstitute/gatk
ATLAS Link et al.32 https://bitbucket.org/wegmannlab/atlas/src/master/
ContamMix Fu et al.31 https://github.com/DReichLab/adna-workflow
SAMtools v.1.9 Li59 https://github.com/samtools/samtools
Haplogrep Weissensteiner et al.60 https://github.com/seppinho/haplogrep-cmd
Yleaf Ralf et al.61 https://github.com/genid/Yleaf
LASER v.2.04 Wang et al.62 https://github.com/statgen/LASER
ADMIXTOOLS Patterson et al.63 https://github.com/DReichLab/AdmixTools
Colate Speidel et al.44 https://github.com/leospeidel/Colate
HIrisPlex-S Chaitanya et al.46 https://hirisplex.erasmusmc.nl/
NgsRelate v.2 Hanghøj et al.64 https://github.com/ANGSD/NgsRelate
PLINK v1.09 Chang et al.65 https://www.cog-genomics.org/plink/
HapROH Ringbauer et al.66 https://github.com/hringbauer/hapROH
ROHan Renaud et al.67 https://github.com/grenaud/ROHan

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Ian Barnes (i.barnes@nhm.ac.uk).

Materials availability

This study did not generate new unique reagents.

Experimental model and subject details

Description of archaeological and historical background to human remains

In 2004, workers constructing the Chapelfield shopping center (52.626N, 1.292E) in Norwich, Norfolk, UK, recovered human remains. Subsequent archaeological investigations by NAU Archaeology led to the discovery and excavation of a circular shaft, interpreted to have been originally used as a well. Further commingled human remains were found in partial articulation in a single mass burial deposit at the base of the well. The deposit was less than 0.5 m deep and 1 m in diameter, but contained the highly compacted remains of at least seventeen people. The well shaft had been truncated by the machine digger, which means that this number is probably an underestimate of the number of people originally deposited. Sediment surrounding the remains comprised soils from outside the well, suggesting bodies had been rapidly buried by soil after deposition. The well stratigraphy suggested that the well was disused and dry at the time the human remains were interred, and probably being used as a midden. Osteological analysis1 identified at least six adults and eleven sub-adults. The subadult remains comprised at least one adolescent, two 10-15-year-olds, three 5-10-year-olds, three 3-5-year-olds and two children aged 0-3 years. The adult remains included both males and females. This overrepresentation of subadults suggests a catastrophic profile, where people of all ages had a similar risk of death, and the compaction of the remains suggested that they had been deposited in a single event. Patterns of skeletal articulation and completeness indicated that the skeletons were originally interred as complete, intact bodies, with little to no delay between their death and deposition. The Chapelfield burials thus appear to represent a catastrophic mortality event: a famine, epidemic or mass killing. In some cases the bones of the legs were higher up in the sediment than the skulls from the same skeletons, suggesting bodies had been deposited in the well head-first. There were no signs of trauma indicative of any mass killing event, beyond broken ribs that could have been produced when the bodies hit the bottom of the well, although it is possible they had been intentionally killed by a method which left no signs of trauma on the bones. There were no signs of skeletal trauma of a type commonly found in individuals who attempted to break a fall, providing some indication that the people had died before their bodies were deposited in the well. Some bones showed extremely localized brown-black or gray-blue discolorations and longitudinal splitting which can indicate exposure to high temperatures, although these changes can also be produced by diagenesis and mineral staining.

Background sampling information

In 2011, eight skeletal elements were sampled as part of the BBC series ‘History Cold Case’ (Series 2, Episode 3 ‘The Bodies in The Well’). DNA was extracted and the sections of the mitochondrial genome were targeted through PCR amplification. These methods and results are not described here, as they have been superseded by newer methods and data. In 2016 we accessed five additional skeletal elements, and these were sampled along with resampling of three of the most promising skeletal elements (based on the 2011 PCR results). DNA was extracted and NGS libraries constructed as described in the methods section below. In 2019 we were given permission to sample a further seventeen skeletal elements. DNA extraction and NGS library construction were also carried out as described below, with two additional libraries constructed for two of the most promising extracts from the 2016 sampling effort.

Ethical considerations

Rabbinical law prohibits exhumation of Jewish remans for most purposes. However, these remains were not thought to be those of Jews when excavated and initially analysed. Once the possibility that these individuals might have been Jewish was established, subsequent analyses were conducted with the cooperation and support of the Norwich Hebrew Congregation, and with the approval of the Office of the Chief Rabbi. The remains were reburied in 2013 at the Jewish cemetery in Norwich. There was a multi-faith reburial ceremony to accommodate the uncertainties surrounding the identity of the burials at that time. A commemorative plaque was also placed near the site of the well.

Method details

Radiocarbon dating

Two radiocarbon dates had previously been obtained at the University of Waikato (New Zealand) by Norvic Archaeology and reported in Emery et al.1 Three previously unreported radiocarbon dates were commissioned by SHINE TV as part of the History Cold Case TV Series. The new dates were generated by Accelerator Mass Spectrometry (AMS) at the SUERC AMS facility, from extracted skeletal collagen, pretreated and reduced to graphite. All five radiocarbon dates were modelled in OxCal 4.4 using the IntCal20 calibration curve29,30 The commingled skeletons were recovered from a single stratigraphic unit and were in partial articulation, suggesting that they had been buried in a single event soon after death.1 The skeletons therefore meet assumptions required for their radiocarbon dates to be modelled with Bayesian inference as representing a single event using the Combine function in OxCal 4.4 to produce a refined posterior density estimate. The Combine model produces good agreement indices (Acomb = 75.8) and produces posterior density estimates for the deposition of the bodies of 1161-1216 calCE (95% confidence) or 1165-1207 calCE (68% confidence). Accelerator Mass Spectrometry (AMS) data generated is included in Table S1, and modelled likelihood distributions for these dating results are presented in Figure 1C.

Ancient DNA extraction and sequencing

Bone powder (27-67 mg) was removed using a Dremel drill at slow speed. DNA was extracted following a modified standard protocol after Dabney et al.68 The protocol was modified by replacing Zymo-Spin V columns with Roche High Pure Viral Nucleic Acid Large Volume spin columns, and two final elution steps of 50 μl (total 100 μl) TET buffer (10 mM Tris·HCl, 1 mM EDTA, 0.05% Tween-20, pH 8.0). Dual indexed libraries were constructed based on standard double-stranded DNA protocols.49,50 Prior to blunt end repair, the DNA extracts were partially UDG treated, 25 μl DNA extract, 3.6 μl USER enzyme (New England Biolabs), incubated at 37°C for 30 mins, followed immediately by adding 3.6 μl UGI enzyme (New England Biolabs) and incubation at 37°C for 30 mins. Reaction purification steps were carried out using minelute purification kits (Qiagen). Indexing PCR step used AmpliTaq Gold (Fisher Scientific) DNA polymerase. All pre PCR steps were carried out in the dedicated aDNA laboratory at the Natural History Museum, London (NHM). All 27 libraries (Data S1A) were screened on a NextSeq sequencing platform at the NHM using mid output 75 PE (150 cycles) kits. Nine libraries from six individuals were selected as the most likely to generate unique DNA reads (based on endogenous content and complexity), and deep sequenced on a NovaSeq 6000 S4 flow cell with v. 1 chemistry, for 200 cycles (Data S1B).

Quantification and statistical analysis

DNA sequence data processing and alignment

Sequencing data was analyzed with bioinformatics methods accounting for the properties of aDNA. Residual adapters were removed from both read pairs prior to merging using AdapterRemoval,55 discarding reads shorter than 30 base pairs, trimming and collapsing forward and reverse reads with default parameters. Collapsed reads were aligned against the thousand genomes reference genome (hs37d5) with bwa mem and filtered for a minimal mapping quality of 30 with samtools.59,69 BAM files were sorted with sambamba,56 read groups set with Picard,57 files merged and PCR duplicates marked with sambamba. GATK v3.870 was used for realignment around known InDels listed in the Broad ‘1000G phase1’ and ‘Mills and 1000G gold standard’ resource files (provided as part of GATK). Additionally, we used a custom file with InDels causing diseases frequent in Jewish populations (Data S1D), from Carmi et al.10

Ancient DNA authentication and genotype calling

We inferred empirical post-mortem damage (PMD) patterns and recalibrated base quality scores with ATLAS.33 PMD patterns at the first and last 50 base pairs of reads are shown in Figure S1 and confirm ancient DNA authenticity. Contamination estimates based upon mitochondrial genomes on a per fragment basis using ContamMix31 (Figure S2) indicate no contamination. After computing PMD and recalibration patterns, genotypes were called with ATLAS, generating pseudo-haploid majority-allele calls (ATLAS options ‘task=call method=majorityBase’) for the sites covered by the 1240k capture array,70 and diploid Bayesian maximum a posteriori calls (ATLAS options ‘task=call method=Bayesian’) for all sites with theta prior fixed at 0.001 (ATLAS options ‘prior=theta fixedTheta=0.001’) and equal base frequencies (ATLAS option ‘equalBaseFreq’). In both cases two bases were trimmed from the ends of the reads. SAMtools-mpileup59 was also used for studying observed disease alleles probabilistically.

Sex and uniparental haplogroups

Chromosomal sex was inferred from X and Y chromosomal read ratios following Skoglund et al.71 (Figure S2). We note that individual SB696 did not yield a clear assignment using this method, as the Ry statistic minimally but confidently lies below the threshold for an XY call. We therefore computed the Rx statistic presented in Mittnik et al.,72 which confidently classifies SB696 as male (Rx 95% CI [0.49, 0.55], Pearson's r 0.99).

Mitochondrial and Y-chromosomal haplogroups were inferred with Haplogrep60 and Yleaf,61 respectively (see Table S2). Table S3 provides further details of mitochondrial mutations observed among the chapelfield individuals (assigned through Haplogrep); this supports the association of observed uniparental haplotypes with Ashkenazi ancestry, discussed above.

DNA fragmentation

To further assess DNA degradation/fragmentation across our six similarly deposited individuals, we compared these individuals to a dataset of 95 archaeological individuals with varied depositional histories (age range ca. 40 to 10579 BP). We used the lambda parameter to estimate true fragment length33 and control for differences in sequencing platforms (NextSeq and NovaSeq). We randomly selected six subsamples 1000 times from our pool of 95 archaeological individuals and plotted the standard deviation of lambda for each random subsample onto a density distribution. 95% confidence intervals were calculated and the standard deviation from our six samples added to the plot.

To assess whether the highly similar depositional histories of the Chapelfield individuals would lead to similar DNA degradation/fragmentation patterns across individuals we used the lambda parameter to estimate true fragment length.33 When the standard deviation of six randomly sampled individuals (from a pool of 95 individuals with varied depositional histories (dated from approximately 40 to 10579 years BP) were plotted onto a density distribution, the standard deviation from our six Chapelfield individuals plotted outside the 95% confidence intervals of the distribution (Figure S1B). The result highlights that there is significantly more variation in lambda (fragment length) within the six Chapelfield individuals than there is between a set of randomly chosen samples over the last ∼10,000 years. Deposition history does not therefore predict fragmentation of DNA on this timescale.

Inferring population ancestry

Explorative genetic ancestry analysis was performed by PCA using LASER v.2.0473 following Hofmanová et al.,73 projecting Chapelfield BAMs onto a reference space of modern Eurasian populations.74 Populations used: Southern European (Italian North/South, Sicilian, Spanish/- North, Canary Islander, Maltese, Greek), Basque, Sardinian, Cypriot, Central European (Albanian, Bulgarian, Romanian, Hungarian, Croatian, Czech, German, French), Eastern European (Russian, Ukrainian, Belarussian, Polish, Sorb), Mordovian, Baltic and Finnish (Estonian, Lithuanian, Finnish), British Isles (English, Orcadian, Scottish, Irish/-Ulster, Shetlander), Scandinavian (Icelandic, Norwegian), Caucasian (Georgian, North Ossetian, Abkhasian, Chechen, Adygei, Lezgin, Kumyk, Balkar), West Asian (Turkish, Armenian), Iranian/-Bandari, Near Eastern (Palestinian, Druze, Jordanian), Ashkenazi-, Turkish-, and North African Jews (Libyan, Moroccan, Tunisian). This used pileup files (SAMtools v.1.956) using filter criteria of minimum mapping quality 30 and minimum base quality 20. f4 admixture proportions were computed with qpAdm from ADMIXTOOLS,63 using default parameters and on pseudohaploid calls at the positions overlapping with the human origin/Illumina capture, on a non-related subset of Chapelfield individuals (SB604, SB605, SB676). Outgroups for qpAdm were individuals with labels Han, Karitiana, Mbuti, and Papuan74 and ancient genomes Russia_MA1_HG.SG, Ethiopia_4500BP_published.SG, Belgium_UP_GoyetQ116_1_published, Russia_Ust_Ishim.DG, Spain_ElMiron, retrieved from the Allen Ancient DNA Resource.54 Pairwise coalescence rates using published genomes (Data S1C) were estimated with Colate.44 Colate input was precomputed from recalibrated BAM files (ATLAS option ‘task=recal’) of the Chapelfield individuals, using the provided SGDP45 ‘half_ne_fixed’ mutation ages (Colate mode ‘make_tmp’). Similarly, we precomputed Colate input from VCF files for the SGDP samples used, additionally specifying the 1000 Genomes accessible regions75 as the target mask. Lastly, we estimated pairwise coalescence rates between each pair of samples (Colate mode ‘mut’), with accessible regions mask for Chapelfield individuals, 20 bootstraps, and epoques determined by bins ‘2.92,4.52,0.4’.

Inferring pigmentation phenotypes

Pigmentation phenotypes were predicted based on the HIrisPlex-S method46 on the three individuals with sufficient genomic coverage (SB604, SB605 and SB676, see Table S4). In case of missing genotypes in the VCF, we identify the corresponding positions in the recalibrated BAM files, and count one allele if the allele required by HIrisPlex-S is observed in at least one read (See Data S1H). We solely report the category with the highest probability if it is above 75%, and both the highest and second highest if the former is above 50% and the latter above 25%. See Table S2 in Chaitanya et al.46 for the author’s preliminary guide on how to interpret the skin pigmentation probability profiles. We obtained results only for the three individuals with sufficient genomic coverage.

Inferring familial relationships and inbreeding

To determine biological familial relationships among the six sequenced Chapelfield individuals we used the KING-robust method76 implemented in NgsRelate v.264 to estimate R0, R1 and KING- robust kinship statistics. This approach is intended for sequencing data of such low coverage that accurate genotypes cannot necessarily be called.76

To study parental relatedness among the sequenced Chapelfield individuals we first studied runs of homozygosity using pseudohaploid data on the basis of a modern phased haplotype reference panel.66 Inbreeding coefficients were additionally calculated on the basis of the fraction of the genome estimated to be within homozygous-by-descent segments using the sliding window approach implemented in PLINK v1.09.65 Initially a MAF filter of 0.05 was applied, then data was pruned for linkage disequilibrium (command: PLINK --indep 50 2 2), before a sliding window of 50 SNPs was applied (command: PLINK --homozyg --homozyg-window-het 0 --homozyg-snp 50 --homozyg-kb 1 --homozyg- density 5000 --homozyg-gap 5000). Detected ROH lengths (in cM) greater than the threshold above were then used to calculate inbreeding coefficients (Table S5). However, only SB604 had sufficient coverage for the inbreeding coefficient to be confidently calculated using this approach. ROHs were further inferred using a panel of reference haplotypes using hapROH,66 and ROHan67 for the single individual with sufficient coverage (SB604).

Analysis of Ashkenazi-associated mendelian disorders

To investigate Ashkenazi-associated Mendelian Disorders among the Chapelfield individuals we first collated a dataset of 178 SNVs interpreted as associated with disorders observed in Ashkenazi populations (Data S1E, with sources detailed) based on published data.10,16,50 InDel variants were detected by realignment (see above). We considered allele frequencies for these variants in gnomAD52 and retained 159 loci where the population allele frequency for modern Ashkenazi Jewish (ASJ) was greater than for modern non-Finnish European (NFE) for disease-associated variants (Data S1F). We considered genotypes for these loci probabilistically, introducing a read error parameter α, defining the probability that a single allele is incorrectly read as one of the other three nucleotides. To determine the expected number of observed disease alleles at different rates of read error, datasets assuming ASJ and NFE population allele frequencies were simulated by sampling A,C,G,T nucleotides at each locus, for each individual from a multinomial distribution, using the observed total read depth as the number of trials. To calculate the exact probability of the observed allele reads, we applied a likelihood function utilizing α, summing the probability of the observed data for all 106 permutations of the ten possible genotypes at each locus.

Probabilistic inference of genotypes

In order to assess the frequency of disease alleles in the Chapelfield individuals, we needed to address two key problems associated with ancient DNA data. Firstly, read errors were likely to be present, such as observing nucleotide T at position 11:71146886 for individual SB604, which is not present in any modern population in the gnomAD database. Secondly, read-depths were low, varying from 16.1 reads per locus for SB604 to only 0.17 reads per locus for SB606, with zero reads at 39.6% of loci when considering all 6 individuals separately (see Figure S4). We addressed these problems by considering genotypes probabilistically (rather than making categorical calls) and introducing a read error parameter α, defining the probability that a single allele is incorrectly read as one of the other three nucleotides. This value is used globally (same value for each individual, and at each locus), and we assume symmetry between nucleotides, such that the probability of A incorrectly read as C is the same for all other nine pairwise errors. For example, we assume the probability of a true T being read as G is α/3.

Simulating allele reads

A single simulated dataset was generated in a three stage process. Firstly, the gnomAD allele counts of A,C,G,T (from a proposed population at a specific locus) were used as shape parameters in the Dirichlet distribution (plus one additional count for each nucleotide, as a uniform prior), to generate a single set of four allele frequencies. Secondly, these proposed allele frequencies were modified by the proposed read error rate α, according to the formulas D1, where: freqs = a vector of the proposed frequencies of A,C,G,T at a locus (summing to 1); error = the proposed read error rate α. Thirdly, allele counts were randomly sampled from the multinomial distribution, where the total observed counts (across all four nucleotides) were used as the ‘number of trials’ parameter, and the allele frequencies (modified by α) were used as the multinomial probabilities.

Formulas D1 (in R code):

A <- freqs[1](1-error) + sum(freqs[c(2,3,4)])error/3

C <- freqs[2](1-error) + sum(freqs[c(1,3,4)])error/3

G <- freqs[3](1-error) + sum(freqs[c(1,2,4)])error/3

T <- freqs[4](1-error) + sum(freqs[c(1,2,3)])error/3

Likelihoods and likelihood ratios

Likelihoods were calculated using a four stage process that utilized the observed allele read counts, proposed population allele frequencies, and the read-error rate α. Firstly, for a single locus, we generated all 1,000,000 permutations of the six individuals’ ten possible genotypes (AA, AC, AG, AT, CC, CG, CT, GG, GT, TT), and calculated the frequency of each genotype permutation, given the gnomAD population allele frequencies and assuming Hardy-Weinberg equilibrium. Where gnomAD data provided counts for exomes and genomes we used the combination (sum of counts) of both. Secondly, we calculated the likelihood of each individual’s ten genotypes (again at a single locus), using a proposed read-error rate α and the observed allele counts in the multinomial distribution as specified in formulas M1. Thirdly, we summed all 1,000,000 permutations of these likelihoods, weighted by the frequency of each genotype permutation (since each permutation is a possible explanation of the observed data). Fourthly, we repeated for each of the 159 loci, with the α parameter fixed across all loci, and the overall product (under the assumption that loci are independent) provided the exact probability of the observed data, under a model of thegnomAD allele frequencies and a single α parameter. This approach deliberately avoids making any categorical genotype calls, and instead maintains probabilistic genotypes for downstream calculations. This is of particular value when analyzing aDNA where allele read depths are typically low and read errors high. In comparison, data with high read coverage and low read error rates can be assigned genotypes with such high confidence that the computational cost of this permutational method is not justified. Note, for computational efficiency, where two of the four possible nucleotides have a zero count, these can be aggregated into a single ‘other’ category requiring only 46,656 permutations of six genotypes (V1/V1, V1/V2, V1/V3, V2/V2, V2/V3, V3/V3), see formulas M2, and similarly where three nucleotides have zero counts, only 729 permutations of three genotypes need calculating (V1/V1, V1/V2 and V2/V2), see formulas M3. Where all four nucleotides have zero counts there is no information, and the likelihood equals 1.

Formulas (in R code):

p1 <- 1-error

p2 <- error/3

p3 <- 0.5 - p2

Formulas M1 if 4 nucleotides have counts, all 10 genotypes need to be considered. Likewise if only 3 nucleotides have counts, the fourth remains a possibility if produced by a read error.

V1.V1 <- dmultinom(counts, prob=c(p1,p2,p2,p2))

V1.V2 <- dmultinom(counts, prob=c(p3,p3,p2,p2))

V1.V3 <- dmultinom(counts, prob=c(p3,p2,p3,p2))

V1.V4 <- dmultinom(counts, prob=c(p3,p2,p2,p3))

V2.V2 <- dmultinom(counts, prob=c(p2,p1,p2,p2))

V2.V3 <- dmultinom(counts, prob=c(p2,p3,p3,p2))

V2.V4 <- dmultinom(counts, prob=c(p2,p3,p2,p3))

V3.V3 <- dmultinom(counts, prob=c(p2,p2,p1,p2))

V3.V4 <- dmultinom(counts, prob=c(p2,p2,p3,p3))

V4.V4 <- dmultinom(counts, prob=c(p2,p2,p2,p1))

Formulas M2 if only 2 nucleotides have counts, the remaining two can be combined into a single ‘other’, so that 6 genotypes need to be considered:

V1.V1 <- dmultinom(counts, prob=c(p1,p2,p2+p2))

V1.V2 <- dmultinom(counts, prob=c(p3,p3,p2+p2))

V1.V3 <- dmultinom(counts, prob=c(p3,p2,p3+p2))

V2.V2 <- dmultinom(counts, prob=c(p2,p1,p2+p2))

V2.V3 <- dmultinom(counts, prob=c(p2,p3,p3+p2))

V3.V3 <- dmultinom(counts, prob=c(p2,p2,p1+p2))

Formulas M3 if only 1 nucleotide has counts, the remaining 3 can be combined into a single ‘other’, so that only 3 genotypes need to be considered:

V1.V1 <- dmultinom(counts, prob=c(p1,p2+p2+p2))

V1.V2 <- dmultinom(counts, prob=c(p3,p3+p2+p2))

V2.V2 <- dmultinom(counts, prob=c(p2,p1+p2+p2))

Our method calculates likelihoods under the assumption that the six individuals are randomly sampled from a proposed population, and therefore does not take into account relatedness. In the case of these particular data, this assumption has a conservative influence on the likelihood ratio for the following reason. The overwhelming majority of the likelihood ratio is driven by variants that are private to a single individual (SB676 21-33974609-G-C LR=113.4; SB605 7-83590853-G-A LR=67.8; SB676 5-112175211-T-A LR=48.6; SB696 22-50967020-C-T LR=2.7), which removes any influence of relatedness on the likelihood ratios. Four further non-private disease alleles were observed in SB604 and SB676 14-94770808-C-T LR=0.954; 14-97342370-C-T LR=1.079; 21-43808633-C-A LR=0.863; 21-45713715-C-T LR=0.870), but since the likelihood ratios at these loci overall slightly favor the european population (less than 1), adjusting for relatedness would have the effect of slightly increasing the likelihood ratio. In any case, our familial relationship analysis did not find a close relationship between SB604 and SB676 that would justify such an adjustment. In contrast, the closest relationships identified were between siblings SB605, SB606 and SB671 who had no disease alleles in common.

Acknowledgments

I.B., M.G.T., S.B., Y.D., and T.B. were supported by a Wellcome Trust Investigator Award (project no. 100713/Z/12/Z). Y.D. and M.G.T. received funding from the ERC Horizon 2020 research and innovation programme (grant agreement no. 788616 YMPACT), and A.T. and M.G.T. received funding from the ERC Horizon 2020 research and innovation programme (grant agreement no. 951385 COREX). R.M. is supported by an SSHRC doctoral studentship grant (G101449: Individual Life Histories in Long-Term Cultural Change). S.B. was supported by Illumina, who provided reagents for NovaSeq sequencing. The authors would like to thank Alan West and Tim Pestell at Norwich Castle Museum for their help with sampling and contextualizing the site, David Dobson at NPS Archaeology for Figure 1A, and Dr. Dean Irwin and Dr. Adam Rutherford for their comments on earlier versions of the manuscript. We wish to thank The Ashkenazi Genomes Consortium (TAGC) for providing us access to the most recent TAGC dataset and are grateful to Doron Behar and Shai Carmi for advice on access. The authors acknowledge the role of the Norwich Hebrew Congregation and the Office of the Chief Rabbi in supporting this project.

Author contributions

Conceptualization, I.B. and S.B.; methodology, I.B., S.B., T.B., Y.D., R.M., and M.G.T.; investigation, S.B. and T.B.; formal analysis, Y.D., T.B., M.G.T., A.T., W.S., and R.M.; writing – original draft, T.B., I.B., S.B., R.M., A.T., Y.D., and M.G.T.; writing – review & editing, T.B., I.B., S.B., R.M., A.T., Y.D., M.G.T., G.E., and S.C.; visualization, Y.D., T.B., A.T., W.S., and R.M.; supervision, I.B. and M.G.T.; funding acquisition, I.B., S.B., and M.G.T.

Declaration of interests

The authors declare no competing interests.

Inclusion and diversity

One or more of the authors of this paper self-identifies as a member of the LGBTQ+ community.

Published: August 30, 2022

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.cub.2022.08.036.

Contributor Information

Mark G. Thomas, Email: m.thomas@ucl.ac.uk.

Ian Barnes, Email: i.barnes@nhm.ac.uk.

Supplemental information

Document S1. Figures S1–S4 and Tables S1–S5
mmc1.pdf (588.9KB, pdf)
Data S1. Summary data for samples and data used in analyses, related to STAR Methods and Figures 2, 4, and 6
mmc2.xlsx (167.3KB, xlsx)
Document S2. Article plus supplemental information
mmc3.pdf (3.3MB, pdf)

Data and code availability

References

  • 1.Emery G., Dobson D., Hoggett R., Whitmore D. NAU Archaeology; 2010. A Medieval Mass Grave on the site of the Chapelfield Shopping Centre. [Google Scholar]
  • 2.Lipman V.D. Jewish Historical Society of England; 1967. The Jews of Medieval Norwich. [Google Scholar]
  • 3.Hillaby J. In: Christians and Jews in Angevin England: The York Massacre of 1190, Narratives and Contexts. Jones S.R., Watson S., editors. Melton: Boydell & Brewer; 2013. Prelude and Postscript to the York Massacre: Attacks in East Anglia and Lincolnshire, 1190; pp. 43–56. [Google Scholar]
  • 4.Thomas of Monmouth . Penguin Classics; 2014. The Life and Passion of William of Norwich. Translated by M. Rubin. [Google Scholar]
  • 5.Green B., Young R.M.R. Norfolk Museums Service; 1972. Norwich: the Growth of a City: A Brief History of the City. [Google Scholar]
  • 6.de Diceto R. In: Stubbs W., editor. Longmans; 1876. Historical works of Master Ralph de Diceto, Dean of London. [Google Scholar]
  • 7.Ostrer H. A genetic profile of contemporary Jewish populations. Nat. Rev. Genet. 2001;2:891–898. doi: 10.1038/35098506. [DOI] [PubMed] [Google Scholar]
  • 8.Behar D.M., Metspalu E., Kivisild T., Achilli A., Hadid Y., Tzur S., Pereira L., Amorim A., Quintana-Murci L., Majamaa K., et al. The matrilineal ancestry of Ashkenazi Jewry: portrait of a recent founder event. Am. J. Hum. Genet. 2006;78:487–497. doi: 10.1086/500307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Atzmon G., Hao L., Pe’er I., Velez C., Pearlman A., Palamara P.F., Morrow B., Friedman E., Oddoux C., Burns E., Ostrer H. Abraham’s children in the genome era: major Jewish diaspora populations comprise distinct genetic clusters with shared Middle Eastern Ancestry. Am. J. Hum. Genet. 2010;86:850–859. doi: 10.1016/j.ajhg.2010.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Carmi S., Hui K.Y., Kochav E., Liu X., Xue J., Grady F., Guha S., Upadhyay K., Ben-Avraham D., Mukherjee S., et al. Sequencing an Ashkenazi reference panel supports population-targeted personal genomics and illuminates Jewish and European origins. Nat. Commun. 2014;5:4835. doi: 10.1038/ncomms5835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Behar D.M., Garrigan D., Kaplan M.E., Mobasher Z., Rosengarten D., Karafet T.M., Quintana-Murci L., Ostrer H., Skorecki K., Hammer M.F. Contrasting patterns of Y chromosome variation in Ashkenazi Jewish and host non-Jewish European populations. Hum. Genet. 2004;114:354–365. doi: 10.1007/s00439-003-1073-7. [DOI] [PubMed] [Google Scholar]
  • 12.Myerowitz R., Costigan F.C. The major defect in Ashkenazi Jews with Tay-Sachs disease is an insertion in the gene for the alpha-chain of beta-hexosaminidase. J. Biol. Chem. 1988;263:18587–18589. [PubMed] [Google Scholar]
  • 13.Beutler E., Nguyen N.J., Henneberger M.W., Smolec J.M., McPherson R.A., West C., Gelbart T. Gaucher disease: gene frequencies in the Ashkenazi Jewish population. Am. J. Hum. Genet. 1993;52:85–88. [PMC free article] [PubMed] [Google Scholar]
  • 14.Roa B.B., Boyd A.A., Volcik K., Richards C.S. Ashkenazi Jewish population frequencies for common mutations in BRCA1 and BRCA2. Nat. Genet. 1996;14:185–187. doi: 10.1038/ng1096-185. [DOI] [PubMed] [Google Scholar]
  • 15.Struewing J.P., Hartge P., Wacholder S., Baker S.M., Berlin M., McAdams M., Timmerman M.M., Brody L.C., Tucker M.A. The risk of cancer associated with specific mutations of BRCA1 and BRCA2 among Ashkenazi Jews. N. Engl. J. Med. 1997;336:1401–1408. doi: 10.1056/NEJM199705153362001. [DOI] [PubMed] [Google Scholar]
  • 16.Rivas M.A., Avila B.E., Koskela J., Huang H., Stevens C., Pirinen M., Haritunians T., Neale B.M., Kurki M., Ganna A., et al. Insights into the genetic epidemiology of Crohn’s and rare diseases in the Ashkenazi Jewish population. PLoS Genet. 2018;14:e1007329. doi: 10.1371/journal.pgen.1007329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Klugman S., Gross S.J. Ashkenazi Jewish screening in the twenty-first century. Obstet. Gynecol. Clin. North Am. 2010;37:37–46. doi: 10.1016/j.ogc.2010.01.001. [DOI] [PubMed] [Google Scholar]
  • 18.Risch N., de Leon D., Ozelius L., Kramer P., Almasy L., Singer B., Fahn S., Breakefield X., Bressman S. Genetic analysis of idiopathic torsion dystonia in Ashkenazi Jews and their recent descent from a small founder population. Nat. Genet. 1995;9:152–159. doi: 10.1038/ng0295-152. [DOI] [PubMed] [Google Scholar]
  • 19.Risch N., Tang H., Katzenstein H., Ekstein J. Geographic distribution of disease mutations in the Ashkenazi Jewish population supports genetic drift over selection. Am. J. Hum. Genet. 2003;72:812–822. doi: 10.1086/373882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kedar-Barnes I., Rozen P. The Jewish people: their ethnic history, genetic disorders and specific cancer susceptibility. Fam. Cancer. 2004;3:193–199. doi: 10.1007/s10689-004-9544-0. [DOI] [PubMed] [Google Scholar]
  • 21.Slatkin M. A population-genetic test of founder effects and implications for Ashkenazi Jewish diseases. Am. J. Hum. Genet. 2004;75:282–293. doi: 10.1086/423146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Behar D.M., Yunusbayev B., Metspalu M., Metspalu E., Rosset S., Parik J., Rootsi S., Chaubey G., Kutuev I., Yudkovsky G., et al. The genome-wide structure of the Jewish people. Nature. 2010;466:238–242. doi: 10.1038/nature09103. [DOI] [PubMed] [Google Scholar]
  • 23.Bray S.M., Mulle J.G., Dodd A.F., Pulver A.E., Wooding S., Warren S.T. Signatures of founder effects, admixture, and selection in the Ashkenazi Jewish population. Proc. Natl. Acad. Sci. USA. 2010;107:16222–16227. doi: 10.1073/pnas.1004381107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Mozersky J. Routledge; 2012. Risky genes: genetics, breast cancer and Jewish identity. [Google Scholar]
  • 25.Rotter J.I., Diamond J.M. What maintains the frequencies of human genetic diseases? Nature. 1987;329:289–290. doi: 10.1038/329289a0. [DOI] [PubMed] [Google Scholar]
  • 26.Motulsky A.G. Jewish diseases and origins. Nat. Genet. 1995;9:99–101. doi: 10.1038/ng0295-99. [DOI] [PubMed] [Google Scholar]
  • 27.Weinryb B.D. Jewish Publication Society of America; 1973. The Jews of Poland: a social and economic history of the Jewish community in Poland from 1100 to 1800. [Google Scholar]
  • 28.DellaPergola S. In: Papers in Jewish Demography. DellaPergola S., Even J., editors. Hebrew University; Jerusalem: 1997. Some fundamentals of Jewish demographic history. [Google Scholar]
  • 29.Bronk Ramsey C. Bayesian Analysis of Radiocarbon Dates. Radiocarbon. 2009;51:337–360. [Google Scholar]
  • 30.Reimer P.J., Austin W.E.N., Bard E., Bayliss A., Blackwell P.G., Bronk Ramsey C., Butzin M., Cheng H., Edwards R.L., Friedrich M., et al. The IntCal20 Northern Hemisphere Radiocarbon Age Calibration Curve (0–55 cal kBP) Radiocarbon. 2020;62:725–757. [Google Scholar]
  • 31.Fu Q., Mittnik A., Johnson P.L.F., Bos K., Lari M., Bollongino R., Sun C., Giemsch L., Schmitz R., Burger J., et al. A revised timescale for human evolution based on ancient mitochondrial genomes. Curr. Biol. 2013;23:553–559. doi: 10.1016/j.cub.2013.02.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Link V., Kousathanas A., Veeramah K., Sell C., Scheu A., Wegmann D. ATLAS: Analysis Tools for Low-depth and Ancient Samples. bioRxiv. 2017 doi: 10.1101/105346. [DOI] [Google Scholar]
  • 33.Kistler L., Ware R., Smith O., Collins M., Allaby R.G. A new model for ancient DNA decay based on paleogenomic meta-analysis. Nucleic Acids Res. 2017;45:6310–6320. doi: 10.1093/nar/gkx361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Heller E.G. In: Jewish Religious Architecture: From Biblical Israel to Modern Judaism. Fine S., editor. Brill; 2019. Western Ashkenazi Synagogues in Medieval and Early Modern Europe; pp. 169–183. [Google Scholar]
  • 35.Mundill R.R. Bloomsbury Publishing; 2010. The King’s Jews: Money, Massacre and Exodus in Medieval England. [Google Scholar]
  • 36.Haak W., Lazaridis I., Patterson N., Rohland N., Mallick S., Llamas B., Brandt G., Nordenfelt S., Harney E., Stewardson K., et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature. 2015;522:207–211. doi: 10.1038/nature14317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Xue J., Lencz T., Darvasi A., Pe’er I., Carmi S. The time and place of European admixture in Ashkenazi Jewish history. PLoS Genet. 2017;13:e1006644. doi: 10.1371/journal.pgen.1006644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kopelman N.M., Stone L., Hernandez D.G., Gefel D., Singleton A.B., Heyer E., Feldman M.W., Hillel J., Rosenberg N.A. High-resolution inference of genetic relationships among Jewish populations. Eur. J. Hum. Genet. 2020;28:804–814. doi: 10.1038/s41431-019-0542-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Pereira L., Richards M., Goios A., Alonso A., Albarrán C., Garcia O., Behar D.M., Gölge M., Hatina J., Al-Gazali L., et al. Evaluating the forensic informativeness of mtDNA haplogroup H sub-typing on a Eurasian scale. Forensic Sci. Int. 2006;159:43–50. doi: 10.1016/j.forsciint.2005.06.008. [DOI] [PubMed] [Google Scholar]
  • 40.Brandstätter A., Zimmermann B., Wagner J., Göbel T., Röck A.W., Salas A., Carracedo A., Parson W. Timing and deciphering mitochondrial DNA macro-haplogroup R0 variability in Central Europe and Middle East. BMC Evol. Biol. 2008;8:191. doi: 10.1186/1471-2148-8-191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Behar D.M., van Oven M., Rosset S., Metspalu M., Loogväli E.-L., Silva N.M., et al. A “Copernican” reassessment of the human mitochondrial DNA tree from its root. Am. J. Hum. Genet. 2012;90:675–684. doi: 10.1016/j.ajhg.2012.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Agranat-Tamir L., Waldman S., Martin M.A.S., Gokhman D., Mishol N., Eshel T., Cheronet O., Rohland N., Mallick S., Adamski N., et al. The genomic history of the bronze age southern levant. Cell. 2020;181:1146–1157.e11. doi: 10.1016/j.cell.2020.04.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Skourtanioti E., Erdal Y.S., Frangipane M., Balossi Restelli F., Yener K.A., Pinnock F., Matthiae P., Özbal R., Schoop U.-D., Guliyev F., et al. Genomic History of Neolithic to Bronze Age Anatolia, Northern Levant, and Southern Caucasus. Cell. 2020;181:1158–1175.e28. doi: 10.1016/j.cell.2020.04.044. [DOI] [PubMed] [Google Scholar]
  • 44.Speidel L., Cassidy L., Davies R.W., Hellenthal G., Skoglund P., Myers S.R. Inferring Population Histories for Ancient Genomes Using Genome-Wide Genealogies. Mol. Biol. Evol. 2021;38:3497–3511. doi: 10.1093/molbev/msab174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Mallick S., Li H., Lipson M., Mathieson I., Gymrek M., Racimo F., Zhao M., Chennagiri N., Nordenfelt S., Tandon A., et al. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations. Nature. 2016;538:201–206. doi: 10.1038/nature18964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Chaitanya L., Breslin K., Zuñiga S., Wirken L., Pośpiech E., Kukla-Bartoszek M., Sijen T., Knijff P.d., Branicki W., Liu F., et al. The HIrisPlex-S system for eye, hair and skin colour prediction from DNA: Introduction and forensic developmental validation. Forensic Sci. Int. Genet. 2018;35:123–135. doi: 10.1016/j.fsigen.2018.04.004. [DOI] [PubMed] [Google Scholar]
  • 47.Livak L. Stanford University Press; 2010. The Jewish Persona in the European Imagination. [Google Scholar]
  • 48.Zlotogora J. Mendelian Disorders Among Jews (Tel Aviv: Department of Community Genetics, Public Health Services, Ministry of Health, Israel) 2015. https://www.health.gov.il/Subjects/Genetics/Documents/book_jews.pdf
  • 49.Shi L., Webb B.D., Birch A.H., Elkhoury L., McCarthy J., Cai X., Oishi K., Mehta L., Diaz G.A., Edelmann L., Kornreich R. Comprehensive population screening in the Ashkenazi Jewish population for recurrent disease-causing variants. Clin. Genet. 2017;91:599–604. doi: 10.1111/cge.12834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wallace S.E., Bean L.J.H. GeneReviews®[Internet] University of Washington, Seattle; 2018. Resources for Genetics Professionals—Genetic Disorders Associated with Founder Variants Common in the Ashkenazi Jewish Population. [Google Scholar]
  • 51.Palamara P.F., Lencz T., Darvasi A., Pe’er I. Length distributions of identity by descent reveal fine-scale demographic history. Am. J. Hum. Genet. 2012;91:809–822. doi: 10.1016/j.ajhg.2012.08.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Karczewski K.J., Francioli L.C., Tiao G., Cummings B.B., Alföldi J., Wang Q., Collins R.L., Laricchia K.M., Ganna A., Birnbaum D.P., et al. The mutational constraint spectrum quantified from variation in 141, 456 humans. Nature. 2020;581:434–443. doi: 10.1038/s41586-020-2308-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Landrum M.J., Lee J.M., Benson M., Brown G.R., Chao C., Chitipiralla S., Gu B., Hart J., Hoffman D., Jang W., et al. ClinVar: improving access to variant interpretations and supporting evidence. Nucleic Acids Res. 2018;46:D1062–D1067. doi: 10.1093/nar/gkx1153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Reich D. Allen Ancient DNA Resource version 44.3. 2021. https://reich.hms.harvard.edu/allen-ancient-dna-resource-aadr-downloadable-genotypes-present-day-and-ancient-dna-data
  • 55.Schubert M., Lindgreen S., Orlando L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res. Notes. 2016;9:88. doi: 10.1186/s13104-016-1900-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Tarasov A., Vilella A.J., Cuppen E., Nijman I.J., Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015;31:2032–2034. doi: 10.1093/bioinformatics/btv098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Broad Institute Picard Tools version 2.23.3. 2020. http://broadinstitute.github.io/picard/
  • 58.McKenna A., Hanna M., Banks E., Sivachenko A., Cibulskis K., Kernytsky A., Garimella K., Altshuler D., Gabriel S., Daly M., DePristo M.A. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27:2987–2993. doi: 10.1093/bioinformatics/btr509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Weissensteiner H., Pacher D., Kloss-Brandstätter A., Forer L., Specht G., Bandelt H.-J., Kronenberg F., Salas A., Schönherr S. HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic Acids Res. 2016;44 doi: 10.1093/nar/gkw233. W58–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Ralf A., González D.M., Zhong K., Kayser M. Yleaf: software for human Y-chromosomal haplogroup inference from next-generation sequencing data. Mol. Biol. Evol. 2018;35:1820. doi: 10.1093/molbev/msy080. [DOI] [PubMed] [Google Scholar]
  • 62.Wang C., Zhan X., Bragg-Gresham J., Kang H.M., Stambolian D., Chew E.Y., Branham K.E., Heckenlively J., FUSION Study. Fulton R., et al. Ancestry estimation and control of population stratification for sequence-based association studies. Nat. Genet. 2014;46:409–415. doi: 10.1038/ng.2924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Patterson N., Moorjani P., Luo Y., Mallick S., Rohland N., Zhan Y., Genschoreck T., Webster T., Reich D. Ancient Admixture in Human History. Genetics. 2012;192:1065–1093. doi: 10.1534/genetics.112.145037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Hanghøj K., Moltke I., Andersen P.A., Manica A., Korneliussen T.S. Fast and accurate relatedness estimation from high-throughput sequencing data in the presence of inbreeding. GigaScience. 2019;8:giz034. doi: 10.1093/gigascience/giz034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Chang C.C., Chow C.C., Tellier L.C., Vattikuti S., Purcell S.M., Lee J.J. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Ringbauer H., Novembre J., Steinrücken M. Parental relatedness through time revealed by runs of homozygosity in ancient DNA. Nat. Commun. 2021;12:5425. doi: 10.1038/s41467-021-25289-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Renaud G., Hanghøj K., Korneliussen T.S., Willerslev E., Orlando L. Joint estimates of heterozygosity and runs of homozygosity for modern and ancient samples. Genetics. 2019;212:587–614. doi: 10.1534/genetics.119.302057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Dabney J., Knapp M., Glocke I., Gansauge M.-T., Weihmann A., Nickel B., Valdiosera C., García N., Pääbo S., Arsuaga J.-L., Meyer M. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl. Acad. Sci. USA. 2013;110:15758–15763. doi: 10.1073/pnas.1314445110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Li H., Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Mathieson I., Lazaridis I., Rohland N., Mallick S., Patterson N., Roodenberg S.A., Harney E., Stewardson K., Fernandes D., Novak M., et al. Genome-wide patterns of selection in 230 ancient Eurasians. Nature. 2015;528:499–503. doi: 10.1038/nature16152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Skoglund P., Storå J., Götherström A., Jakobsson M. Accurate sex identification of ancient human remains using DNA shotgun sequencing. J. Archaeol. Sci. 2013;40:4477–4482. [Google Scholar]
  • 72.Mittnik A., Wang C.-C., Svoboda J., Krause J. A molecular approach to the sexing of the triple burial at the upper paleolithic site of dolní věstonice. PLoS One. 2016;11:e0163019. doi: 10.1371/journal.pone.0163019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Hofmanová Z., Kreutzer S., Hellenthal G., Sell C., Diekmann Y., Díez-Del-Molino D., van Dorp L., López S., Kousathanas A., Link V., et al. Early farmers from across Europe directly descended from Neolithic Aegeans. Proc. Natl. Acad. Sci. USA. 2016;113:6886–6891. doi: 10.1073/pnas.1523951113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Lazaridis I., Nadel D., Rollefson G., Merrett D.C., Rohland N., Mallick S., Fernandes D., Novak M., Gamarra B., Sirak K., et al. Genomic insights into the origin of farming in the ancient Near East. Nature. 2016;536:419–424. doi: 10.1038/nature19310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.1000 Genomes Project Consortium. Auton A., Brooks L.D., Durbin R.M., Garrison E.P., Kang H.M., Korbel J.O., Marchini J.L., McCarthy S., McVean G.A., et al. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Waples R.K., Albrechtsen A., Moltke I. Allele frequency-free inference of close familial relationships from genotypes or low-depth sequencing data. Mol. Ecol. 2019;28:35–48. doi: 10.1111/mec.14954. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S4 and Tables S1–S5
mmc1.pdf (588.9KB, pdf)
Data S1. Summary data for samples and data used in analyses, related to STAR Methods and Figures 2, 4, and 6
mmc2.xlsx (167.3KB, xlsx)
Document S2. Article plus supplemental information
mmc3.pdf (3.3MB, pdf)

Data Availability Statement

RESOURCES