Abstract
Ancient DNA is a valuable tool for investigating genetic and evolutionary history that can also provide detailed profiles of the lives of ancient individuals. In this study, we develop a generalised computational approach to detect aneuploidies (atypical autosomal and sex chromosome karyotypes) in the ancient genetic record and distinguish such karyotypes from contamination. We confirm that aneuploidies can be detected even in low-coverage genomes ( ~ 0.0001-fold), common in ancient DNA. We apply this method to ancient skeletal remains from Britain to document the first instance of mosaic Turner syndrome (45,X0/46,XX) in the ancient genetic record in an Iron Age individual sequenced to average 9-fold coverage, the earliest known incidence of an individual with a 47,XYY karyotype from the Early Medieval period, as well as individuals with Klinefelter (47,XXY) and Down syndrome (47,XY, + 21). Overall, our approach provides an accessible and automated framework allowing for the detection of individuals with aneuploidies, which extends previous binary approaches. This tool can facilitate the interpretation of burial context and living conditions, as well as elucidate past perceptions of biological sex and people with diverse biological traits.
Subject terms: Population genetics, Evolutionary genetics
A sex chromosome identification method for ancient DNA revealed six individuals who lived with chromosomal aneuploidies in Britain from ~ 2,500 to 250 years ago. Genomic and osteological information was combined to shed light on their lives.
Introduction
The study of ancient genomes has revolutionised our ability to examine human biology over thousands of years, providing insight into phenotypic variation, social stratification and their impact on health throughout history1,2. In particular, ancient DNA can contribute to our understanding of chromosomal sex in the past, its relationship with gender, and allow for the identification of non-binary chromosomal sex via investigation of the number of sex chromosomes detected in an individual’s karyotype. Deploying genomic data for sex identification can surpass limitations in the sex estimation methods that rely on osteological features, as those can be less accurate or inapplicable when human skeletons are only partially complete3–5. Moreover, the subtlety of prepubescent sexual dimorphism means that established osteological sex estimates are rarely applicable to nonadult remains6,7. Several methods have been used to identify chromosomal sex from low-coverage genomes by comparing the proportion of sequences aligning to chromosome Y with sequences aligning to chromosome X or to autosomes8–10. However, publicly available tools do not aim to detect other, less frequent, combinations of sex chromosomes or aneuploidies.
Embryonic aneuploidies vary in their severity from pregnancy loss to milder conditions, like Klinefelter syndrome11. Klinefelter syndrome is characterised by infertility and a slightly increased risk for disorders like type 2 diabetes. The majority of individuals with Klinefelter syndrome carry a 47,XXY karyotype, accounting for 1 in 500–650 newborn males, which makes it the most common chromosomal disorder in males12. Although other karyotypes with supernumerary X chromosomes have been described (e.g. 48,XXXY or 49,XXXXY), these occur at much lower frequencies13. Additional copies of chromosome Y (47,XYY), observed in about 1 in 1000 male births, are associated with taller than average stature and an increased risk for asthma, seizures and learning disabilities14,15. On the other hand, individuals with Turner syndrome, a condition that affects 1 in 2500–3000 live female births16, carry only a single chromosome X with karyotype 45,X0 or in mosaic form (e.g. 45,X0/46,XX or 45X0/46,XY). Somatic presentation of Turner syndrome depends on the presence or degree of mosaicism and is associated with shorter stature and a higher incidence of infertility, cardiovascular, renal and endocrine disease17. Autosomal aneuploidies that survive to birth are trisomies 13, 18 and 21 and they are often associated with developmental and heart issues, with trisomy 21 (Down syndrome) being the most common one by far18.
Klinefelter is the only aneuploidy observed multiple times in the ancient genomic record to date, with reports of individuals with a 47,XXY karyotype in Copper Age Iberia19, Early Medieval Finland20, Viking Age Orkney21 and Medieval Portugal22 and Iceland23. Other aneuploidies identified with ancient DNA include a neonate with a 47,XXX karyotype (trisomy X, or triple X syndrome) from Iberia from the second millennium BCE19 and a neonate from Neolithic Ireland with trisomy 2124. The ability to detect these atypical karyotypes in the ancient genomic record has the dual advantage of providing a more detailed insight into the life conditions and societal surroundings of those individuals, while also adding a historical perspective for medical practitioners and patients today.
In this paper, we develop and validate a method that distinguishes between all sex chromosome karyotypes, as well as detecting autosomal aneuploidies such as trisomy 21. We apply this method to new data from ancient Britain and present the oldest known instance of mosaic Turner syndrome dating to the Early Iron Age, three individuals with Klinefelter syndrome, spanning from the Iron Age to the Post-Medieval Period from England, an individual with 47,XYY syndrome in Early Medieval England and an Iron Age infant with Down syndrome.
Results
A computational method to identify aneuploidies using ancient DNA
Our computational approach is based on independently quantifying the number of observed sequences aligning to any chromosome compared to an ‘autosomal baseline’, Na, which we defined as the sum of sequences aligning to chromosomes 1 through 22, excluding chromosomes 13, 18 and 21 which are the only autosomal aneuploidies that survive to birth in appreciable numbers25. We then defined the following metrics. The Rx estimate was calculated by dividing the number of sequences aligning to chromosome X with Na, and the Ry estimate was calculated by dividing the number of sequences aligning to chromosome Y with Na. We estimated the theoretical values for Na, Rx, and Ry expected for chromosome sizes in human genome build 37 and applied our method to published shotgun-sequenced genomes to investigate their overlap with the theoretical values. Leveraging the observed distribution of the empirical data, we optimised the assignment thresholds used to estimate the number of copies of chromosomes X and Y in the karyotype. Additionally, we defined an alternative set of assignment thresholds, optimised for use with target enriched libraries on the widely used “1240k” SNP positions26,27 (see Supplementary Note 4 and Supplementary Figure 3). We obtained standard errors for the assignments using binomial approximations, and used these to test if individuals are significantly deviating from theoretical expectations.
The method described here was applied to 570 published ancient shotgun sequenced genomes from Viking Age Northern Europe21 and ancient Rome28, representing two independent studies from different laboratories to corroborate the absence of batch effects. Individuals with a 46,XX karyotype clustered close to the mean expected Rx value of 0.056 for two copies of chromosome X and their Ry values were close to 0. Meanwhile, individuals with a 46,XY karyotype had Rx values within the typical range expected for a single copy of chromosome X (0.028), while having Ry values close to the mean value of 0.0026, expected of individuals carrying one chromosome Y (Fig. 1a).
Detection of individuals with 47,XXY and 47,XYY karyotypes
Three individuals, published in this study, clustered in the top right corner in Fig. 1a, having an Rx value within the typical range observed in XX individuals and an Ry value within the typical range for XY individuals. This indicated the presence of two copies of chromosome X and one copy of chromosome Y and suggested that these three individuals (C10427, C11119, C11569) had Klinefelter syndrome (47,XXY). Details about their age and site of origin can be found in Table 1. Contamination estimates based on heterozygosity on chromosome X, commonly used to measure contamination in males, were high (>20%) for C11119 (Sk4205 from the medieval cemetery under Longwall Quad, Magdalen College) and C11569 (Sk30828 from Trinity Burial Ground), consistent with the presence of more than one X chromosome29 (Supplementary Data 1). However, C10427 (WS224 from Wetwang Slack) had a contamination estimate in the typical range of XY individuals (i.e., carrying only one X chromosome). Lack of heterozygosity is a strong indication that the supernumerary chromosome resulted from a copy of the maternal X occurring due to a mitotic error at an early postzygotic stage30. Individual C13582 had the highest observed Ry value out of all studied individuals (Ry = 0.0048) suggesting the presence of an additional copy of Y chromosome in their karyotype, which was also evident using the coverage per chromosome results (Fig. 2).
Table 1.
Sample ID | Skeleton ID | Human skeletal element | Archaeological site | Absolute/relative dates | Karyotype | Age at death |
---|---|---|---|---|---|---|
C10090 | CH163 | Right mandibular second molar | Charterhouse Warren | Iron Age (2703–2365 cal BP, cal AD 754–416) | 45,X0/46,XX | 18–22 y |
C10427 | WS224 (Gen Lab 199) | Left stapes | Wetwang Slack | Iron Age | 47,XXY | 18–19 y |
C10462 | WS267 (Gen Lab 162) | Right temporal | Wetwang Slack | Iron Age | 47,XY, + 21 | Neonate |
C11119 | Sk4205 | Left incus | Medieval cemetery under Longwall Quad, Magdalen College, Oxford | Medieval Period (cal AD 1050–1290) | 47,XXY | 36–45 y / 45+ y |
C11569 | Sk30828 | Right temporal | Trinity Burial Ground, Kingston-Upon-Hull | Post-medieval Period (early 19th c.) | 47,XXY | 16–19 y |
C13582 | 16586 | Right malleus | Lincoln Eastern Bypass | Early Medieval Period (8th c.) | 47,XYY | 46+ y |
Detection of an Iron Age individual with mosaic Turner syndrome (45,X0/46,XX)
Individual C10090 (CH163 from Charterhouse Warren) had an Rx value (0.033), indicating the presence of one copy of chromosome X and an Ry value (2.32 × 10−5) suggesting the absence of any copies of chromosome Y, thus revealing a 45,X0 karyotype, characteristic of Turner syndrome. Deeper sequencing (final average coverage 9.8x), confirmed the 45,X0 karyotype (Fig. 2).
However, upon closer investigation of the genome of C10090, two observations were suggestive of mosaicism. First, the estimated Rx value was higher than the Rx values of 375 tested individuals identified as XY (5.1 SD from the mean Rx for XY) (Fig. 1a), suggesting the presence of additional chromosome X sequences. Second, heterozygosity estimates29 on polymorphic sites of chromosome X were closer to those of individuals with two copies of chromosome X (16.37%), even when restricting the analysis to authentic ancient DNA sequences to exclude contamination31 (Supplementary Data 1). This does not seem related to structural variation, like the presence of an isochromosome, duplications or deletions of whole regions, that could have explained the observed additional X chromosome sequences, since there was even coverage and consistent levels of heterozygosity across the short and long arms of the chromosome (Supplementary Note 2 and Supplementary Fig. 1).
Hence, an interpretation accommodating a 45,X0 karyotype with a small but substantial excess of chromosome X sequences was mosaicism (45,X0/46,XX) i.e., a smaller proportion of the cells of C10090 carrying a typical 46,XX and the majority of cells carrying a 45,X0 karyotype. Indeed, the majority of adults with Turner syndrome exhibit some degree of mosaicism32,33. Mosaic presentations have also been detected in large-scale genomic datasets like the UK Biobank, where the 45,X0/46,XX karyotype was 6 times more prevalent than the non-mosaic 45,X0 and it was associated with more average stature and fewer reproductive and cardiovascular complications34,35.
Statistical power to detect aneuploidy at low genomic sequencing coverages
In order to confirm that the detected monosomy could not have been an artefact of poor DNA preservation, the high-coverage genome was downsampled to six different coverage levels 100 times to test the consistency and thresholds of the karyotypic assignment. As shown in Fig. 3b and c, the assignment remained consistent with a 45,X0 karyotype across coverage levels, even with as low as 0.0001x genomic coverage. The Rx and Ry values exhibited higher variability in lower coverage levels, as would be expected by the increasingly limited number of reads, but they remained within the established borders for one copy of X (Fig. 3b) and no copies of Y (Fig. 3c) chromosomes respectively.
Detection of an Iron Age individual with Trisomy 21 (47,XY, + 21)
In addition to estimating the number of sex chromosomes in a karyotype, our approach can be applied to quantify the coverage of any chromosome by estimating the proportion of sequences aligning to that chromosome over the ‘autosomal baseline’ or Na and thus also identifying autosomal aneuploidies. Indeed, a neonate with trisomy 21 was identified with this method (C10462, WS267 from Wetwang Slack) from Iron Age Yorkshire with a 47,XY, + 21 karyotype (Fig. 2, Supplementary Note 3 and Supplementary Fig. 2).
Discussion
The identification of individuals with autosomal and sex chromosomal aneuploidies in the ancient genomic record contributes to a more detailed reconstruction of past societies and highlights the limitations of approaches focusing on binary classification of karyotypic sex. Examining the genomic information from these individuals in conjunction with osteological and archaeological data is crucial in providing a comprehensive perspective of aneuploidy across time.
Certain aneuploidies affecting the sex chromosomes can lead to disorders of sex development (DSDs) which result in mixed or ambiguous sex-related physical characteristics36. Many people with mild disorders experience few effects and are often unaware that they have a DSD. However, severe cases can produce stronger physical effects, including factors, which may be relevant to their own and society’s perception of their sex or gender20. There is evidence that people with DSDs exhibit elevated rates of gender dysphoria37, highlighting the potentially complex relationship between the physical manifestations of DSDs and a person’s sex and social gender as understood by themselves and their wider community38–40. It is difficult to know an ancient individual’s conception of their own gender identity and gender norms in the past may not align with those of the present day. However, while most people with sex chromosomal aneuploidies in the past may have lived their lives within conventional gender norms, it is possible that an elevated proportion would have been seen to transgress gender boundaries. Therefore, consideration of past people with sex chromosomal aneuploidies alongside accompanying gendered grave goods may provide a way of exploring past gender variability and how this may or may not relate to the physical effects of DSDs20. This idea was explored by Moilanen and colleagues20 in relation to an individual with possible Klinefelter Syndrome from early medieval Finland who was buried with both typically masculine- and feminine-gendered grave goods.
The three skeletons with Klinefelter syndrome from this study were buried in different contexts and time periods, but with certain similarities, the most important one being that their burials did not reveal any differences in how they were perceived by their contemporaries. The individual with Klinefelter syndrome from Wetwang Slack41 (C10427, Skeleton ID: WS224) was identified osteologically as male, around 18–19 years at death without associated grave goods. The individual found under Longwall Quad, Magdalen College (C11119, Skeleton ID: Sk4205) possessed sexually dimorphic features, which were consistently male; nearly all of them strongly male and two, probably male42. With an estimated stature of 176 cm (+/−3.27 cm), the individual was tall compared with the rest of the burial assemblage. The individual from Trinity Burial Ground (C11569, Skeleton ID: Sk30828) was an adolescent, found without additional artefacts and occupying a plain wooden coffin lacking metal fittings, in a part of the burial ground where many of the burials may have represented poorer members of society. By employing observations of epiphyseal fusion43 and dental development and eruption44, C11569 was estimated to have been between 16 and 19 years of age. Generally, many of the epiphyses were unfused and this suggested delayed skeletal development, or a prolonged growth period. Had the individual lived to attain skeletal maturity, this may have resulted in a taller-than-average stature. Similarly, the individual from Lincoln Eastern Bypass (C13582, Skeleton ID: 16586) with a 47,XYY karyotype had a stature of 184.2 cm, taller than average but within the male height range for the cemetery and was osteologically characterised as male42 (more details on the archaeological contexts and osteological assessment for all individuals in Supplementary Note 1).
Regarding the individual with mosaic Turner syndrome from Charterhouse Warren (C10090, Skeleton ID: CH163), it was possible to confidently match the cranium and mandible (the latter yielding the tooth analysed in this study), but no postcranial elements could be associated. Relatively few formal burials are known for the Early Iron Age of southern Britain, so it is difficult to assess how unusual Charterhouse Warren was for its period. Indeed, considerable variability seems to be the norm45 and disarticulated Iron Age human remains have been found in other caves in southwest England46,47. The main skeletal manifestation of Turner syndrome is short stature48, which could not be assessed given this lack of association. The clinical literature also records some changes in skull morphology49; while these are difficult to address given the skull’s incomplete and reconstructed state (see Supplementary Note 1), the cranium does appear to have a high-arched palate which is one of the skeletal changes seen in Turner syndrome (Fig. 3a). Morphological characteristics of the cranium including slight brow ridges, sharp orbital margins, rounded orbits, high frontal, and smooth nuchal area support a female biological sex estimation42 (Fig. 3a). The Charterhouse Warren individual died as a young adult, ca. 18–22 years old (see Supplementary Note 1). Interestingly, there is a reported relationship between spheno-occipital synchondrosis closure and the onset of puberty50, and in the Charterhouse Warren individual the spheno-occipital synchondrosis is unfused, implying that they would not have experienced menarche, despite their estimated age. A delay in, or absence of, menarche is a feature of Turner syndrome, caused by the absence or low levels of oestrogen16,17.
Identification of aneuploidies such as Down syndrome, which can result in neurodevelopmental problems51 can provide insights into care within ancient societies, as well as how people with these conditions, which have characteristic physical manifestations, were perceived by their peers. The individual with Down syndrome from Wetwang Slack41 (C10462, Skeleton ID: WS267) was a male neonate, buried in the ditch of a square barrow containing a primary adult female burial. However, the burial of neonates within the ditches of pre-existing barrows appears to have been relatively common at Wetwang Slack and is not likely to relate directly to the genetic condition of this individual.
Overall, individuals with aneuploidy from this study were largely buried in accordance with the customs that prevailed in their lifetimes, with the exception of CH163 from Charterhouse Warren (C10090), suggesting that they were most likely considered ordinary members of their community. Inferences regarding their view of themselves and their gender are limited, not least due to the lack of associated gendered grave goods or records. Of course, our understanding of attitudes on identity and gender in Medieval England, surpasses that of the Iron Age, primarily due to the abundance of historical records available for the former. Thus, while patriarchal stereotypes and strictly defined gender roles prevailed in the Medieval period52, female individuals from the Iron Age have intriguingly often been found in elite burials with “objects of power” typically associated with male occupants53,54. An additional aspect to consider, beyond gender identity, is the fact that people with sex chromosomal aneuploidies are substantially more likely to experience later onset of puberty and childlessness55 and the influence these would exert on their daily lives. Indeed, there were osteological indications of delayed growth in one of the three individuals with Klinefelter (C11569), as well as in the individual with mosaic Turner syndrome (C10090) and none of the six individuals in this study were buried in proximity to progeny, although this is not a rare occurrence. Lastly, we would advise caution when interpreting the frequency of aneuploidies in the past, as certain aspects of ancient DNA studies could introduce bias and influence our observations; namely the preferential targeting of crania (often treated differently to postcranial elements in mortuary contexts) due to better DNA preservation56, the scarcity of skeletal material covering certain periods and regions (for example due to widespread cremation) or variable infant representation in skeletal assemblages57.
The method presented here provides a straightforward approach that can identify chromosomal sex as well as autosomal and sex chromosomal aneuploidies (XXY, X0, XYY, XXX and +21) on genomic data of varying quality, while also flagging up likely contamination. In principle, our method can be expected to detect around half of contamination scenarios, i.e. those cases when the contamination originates from an individual with a different chromosomal sex to the individual under study. One individual from a published study (VK12521, who was excluded from downstream analysis in the original paper due to low coverage) had intermediate values both for Rx and Ry (Fig. 1a), indicating contamination with modern-day DNA from someone with different chromosomal sex to the individual under study, which could be relatively common in low-yield ancient DNA samples that have been handled extensively58–60. The presence of different-sex contamination tends to inflate the observed Rx values while decreasing Ry values for XY individuals and inflate the Ry values while decreasing Rx values for XX individuals, due to the presence of additional sets of autosomes in the denominator. Importantly, these values are different from those observed in any type of sex chromosomal aneuploidy and thus they can be distinguished with this method, unlike previously8. We provide the classification of putatively contaminated libraries in our implementation of the methods in this paper. A potential limitation of the method is its lack of sensitivity to structural variation, such as micro-deletions in the Y chromosome61. Rare cases of aneuploidy that can be attributed to chromosomal rearrangements18 can also not be excluded, as they cannot be identified by this method currently.
As indicated from our results, the ever-increasing number of ancient genomes will only multiply the opportunities to observe and contextualise genomic diversity and more importantly, methods like the one presented here will provide another layer of information that can contribute to a more detailed reconstruction of the human past.
Methods
Ancient DNA extraction and sequencing
For the genomic data generated in this study, DNA was extracted from auditory ossicles, dentine or the petrosal part of the temporal bone62 and double-indexed single-stranded libraries were prepared and amplified using Agilent Bravo Workstations63,64 before sequencing approximately 2.5–5 million read-pairs per library on the Illumina NextSeq 500 platform (with the exception of C13582 that was sequenced directly on the Illumina NovaSeq 6000 to approximately 70 million reads). The same libraries, except C13582, were then sequenced for a second round on the Illumina NovaSeq 6000 to increase the sequencing depth per library, obtaining a final average coverage of ~0.85x, except in the case of C10090 which was sequenced to 9.8x coverage.
Bioinformatic processing
Sequenced libraries were preprocessed and analysed using the nf-core/eager pipeline65 version 2.3.3 and the version-matched Singularity image. The hs37d5 genome and version 50.0 of AADR (v50.0_1240K)66 were used as references. Read trimming and adapter removal was applied within the pipeline with the following options specified:
“--complexity_filter_poly_g”,
“--clip_forward_adaptor AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC”,
“--clip_reverse_adaptor GGAAGAGCGTCGTGTAGGGAAAGAGTGT”,
“--preserve5p” and “--clip_readlength 35”.
Paired reads were collapsed (“--mergedonly”) and aligned using bwa aln67 with maximum edit distance 0.01 (“--bwaalnn 0.01”) and subsequently filtered (“--run_bam_filtering”). PCR duplicates were identified and removed with DeDup68 (“--dedupper dedup”). Contamination estimates based on heterozygosity on the X chromosome were calculated using ANGSD29 and contamination estimates based on mitochondrial DNA were calculated using schmutzi version 1.5.669.
Y-chromosome haplogroups were inferred with Yleaf version 3.170, using only reads with mapping quality over 30. To determine mitochondrial haplogroups, the deduplicated BAM files output by nf-core/eager were first filtered for mitochondrial reads and realigned using bwa aln (“-n 0.01”, “-l 1024”, and ”-k 2”) to rCRS (rCRS_NC_012920). Variants in mitochondrial reads were identified with BCFtools version 1.10.271,72: BAM files were filtered with “bcftools view” for minimum depth 1 (“--include ‘FMT/DP > = 1”) and variants called with “bcftools call” specifying “--multiallelic-caller”. Mitochondrial haplogroups were then assessed using “haplogrep classify” version 2.2.873 with the best 3 hits calculated (“--hits 3”).
Published data
Published data21,28 for 570 individuals were downloaded from the European Nucleotide Archive.
Sex identification
Sex identification was done using the method reported in this study. The karyotype assignment thresholds were defined separately for shotgun sequenced and for target-enriched libraries and details on their determination can be found in Supplementary Note 4. BAM files for both published and newly generated genomes were filtered for mapping quality (-q30) using SAMtools version 1.1374 and the sex identification script (karyo_RxRy_script.py, from https://github.com/kyriaki-anast/karyo_RxRy) was applied using this command:
“samtools view -q30 bam_file.bam | python2 karyo_RxRy_script.py > output_file”.
Chromosomal coverage
Chromosomal coverage for all newly published individuals (Fig. 2) was calculated by counting the number of sequences mapping to the human reference genome hs37d5, filtering only for regions included in the 1000 Genomes Strict Mask75. The counts of sequences mapping to each chromosome were then normalised between chromosomes and across individuals to account for differences in sequencing effort per library.
Downsampling
The high coverage genome (9.8x) of individual C10090 was randomly downsampled to six different coverage levels, in 100 independent repetitions for each level. Final genomic coverage levels were expressed as proportions of initial coverage. The sex identification method was applied to all downsampled BAM files as described above (applying a mapping quality filter (-q) of 30). Raw output is in Supplementary Data 4 and data is plotted in Fig. 3b and c.
Statistics and reproducibility
Sample sizes were not determined in advance. Methods were applied to published data generated from different groups to control for batch effects. Raw data, analysis output, newly developed scripts and software versions have been made available.
Ethics declaration – sample provenance
Permissions to study the archaeological samples presented in this study were obtained as follows. CH163 - Permissions acquired from David Walker, archaeological curator at Wells Museum. WS 224 (Gen Lab 199) and WS 267 (Gen Lab 162) - Permissions acquired from Jo Buckberry, Reader in Biological Anthropology at the University of Bradford. Sk 4205 - Permissions acquired from Louise Loe, head human osteologist at Oxford Archaeology South and David Radford, archaeologist at Oxford City Council. Sk 30828 - Permissions acquired from Louise Loe, head human osteologist at Oxford Archaeology South and Stephen Rowland, project manager at Oxford Archaeology. Sk 16586 – Permissions acquired from Malin Holst, Managing Director at York Osteoarchaeology. Minimally destructive sampling for aDNA analysis followed guidelines issued by the Department for Culture, Media and Sport (DCMS) and the Advisory Panel on the Archaeology of Burials in England (APABE) (apabe.archaeologyuk.org).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This study was supported by the Wellcome Trust (217223/Z/19/Z) and Francis Crick Institute core funding (FC001595) from Cancer Research UK, the UK Medical Research Council, and the Wellcome Trust. P.Sk. is supported by the European Molecular Biology Organization, the Vallee Foundation, and the European Research Council (grant no. 852558). L.S. acknowledges support provided by a Sir Henry Wellcome fellowship [220457/Z/20/Z]. Analysis of the Charterhouse Warren assemblage was supported by the British Academy (SG163375). Analysis of the samples from Wetwang Slack forms part of the COMMIOS project which has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 834087). Access to the Wetwang Slack human remains collections was provided by the Biological Anthropology Research Centre (BARC), University of Bradford. The excavation and analysis of the medieval cemetery under Longwall Quad, Magdalen College was commissioned by Magdalen College and excavation and analysis of the Trinity Burial Ground Hull assemblage was commissioned by Balfour Beatty for National Highways. The excavation and analysis of the Lincoln Eastern Bypass assemblage was commissioned by the Lincolnshire County Council and conducted by Network Archaeology and York Osteoarchaeology. Image credits for Fig. 3a: Rick Schulting, Ian Cartwright. The authors thank the staff of the Advanced Sequencing Facility and the Bioinformatics and Biostatistics STP at The Francis Crick Institute for their contributions to this study. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Author contributions
Conceptualization: K.A., P.Sk. Data curation: K.A., C.B. Formal Analysis: K.A. Methodology: K.A., P.Sk. Software: K.A., P.Sk. Validation: K.A. Funding acquisition: L. L., I. A., R. S., P. Sk. Investigation: K.A., M.S., T.B., L.S., M.K., M.W., F.T., J.M., P.Sw., I.G., J.B., M.H., M.M., D.F., P.P., J.R.V.,T.A., B.F., M.G., A.G., L.M., S.R., D.W., H.W., A.W., P.Sk. Supervision: L.L., I.A., R.S., P.Sk. Visualisation: K.A., R.S. Writing – original draft & revisions: K.A., M.S., T.B., C.B., R.S., A.W., L.M., H.W.,K.K., P.P., J.R.V., J.B., M.H., S.R., L.L., I.A., P.Sk.
Peer review
Peer review information
Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Zuzana Hofmanova and George Inglis.
Funding
Open Access funding provided by The Francis Crick Institute.
Data availability
Sequencing data (BAM files) are available on the European Nucleotide Archive (accession code: PRJEB65239). Public datasets21,28 were accessed through the European Nucleotide Archive (accession codes: PRJEB37976 and PRJEB32566). Source data used to plot Fig. 1a are in Supplementary Data 2. Source data used to plot Fig. 2b, c are in Supplementary Data 4. Source data used to plot Fig. 3 are in Supplementary Data 3.
Code availability
The sex identification method76 described and used in this paper can be accessed at https://github.com/kyriaki-anast/karyo_RxRy. This includes an option for target-enriched libraries and autosomal aneuploidy detection.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Kyriaki Anastasiadou, Email: kyriaki.anastasiadou@crick.ac.uk.
Pontus Skoglund, Email: pontus.skoglund@crick.ac.uk.
Supplementary information
The online version contains supplementary material available at 10.1038/s42003-023-05642-z.
References
- 1.Skoglund P, Mathieson I. Ancient Genomics of Modern Humans: The First Decade. Annu. Rev. Geno. Hum. Genet. 2018;19:381–404. doi: 10.1146/annurev-genom-083117-021749. [DOI] [PubMed] [Google Scholar]
- 2.Racimo F, Sikora M, Vander Linden M, Schroeder H, Lalueza-Fox C. Beyond broad strokes: sociocultural insights from the study of ancient genomes. Nat. Rev. Genet. 2020;21:355–366. doi: 10.1038/s41576-020-0218-z. [DOI] [PubMed] [Google Scholar]
- 3.Kjellström A. Evaluations of sex assessment using weighted traits on incomplete skeletal remains. Int. J. Osteoarchaeol. 2004;14:360–373. doi: 10.1002/oa.720. [DOI] [Google Scholar]
- 4.Meindl RS, Lovejoy CO, Mensforth RP, Don Carlos L. Accuracy and direction of error in the sexing of the skeleton: implications for paleodemography. Am. J. Phys. Anthropol. 1985;68:79–85. doi: 10.1002/ajpa.1330680108. [DOI] [PubMed] [Google Scholar]
- 5.Armit, I. et al. Kinship practices in Early Iron Age South-east Europe: genetic and isotopic analysis of burials from the Dolge njive barrow cemetery, Dolenjska, Slovenia. Antiquity.97, 403–418 (2023).
- 6.Ubelaker DH. Estimating age at death from immature human skeletons: an overview. J. Forensic Sci. 1987;32:1254–1263. doi: 10.1520/JFS11176J. [DOI] [PubMed] [Google Scholar]
- 7.Stull, K. E., Cirillo, L. E., Cole, S. J. & Hulse, C. N. Subadult sex estimation and KidStats. in Sex Estimation of the Human Skeleton 219–242 (Elsevier, 2020).
- 8.Skoglund P, Storå J, Götherström A, Jakobsson M. Accurate sex identification of ancient human remains using DNA shotgun sequencing. J. Archaeol. Sci. 2013;40:4477–4482. doi: 10.1016/j.jas.2013.07.004. [DOI] [Google Scholar]
- 9.Mittnik A, Wang C-C, Svoboda J, Krause J. A Molecular Approach to the Sexing of the Triple Burial at the Upper Paleolithic Site of Dolní Věstonice. PLoS One. 2016;11:e0163019. doi: 10.1371/journal.pone.0163019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lamnidis TC, et al. Ancient Fennoscandian genomes reveal origin and spread of Siberian ancestry in Europe. Nat. Commun. 2018;9:5018. doi: 10.1038/s41467-018-07483-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Gu C, et al. Chromosomal Aneuploidy Associated With Clinical Characteristics of Pregnancy Loss. Front. Genet. 2021;12:667697. doi: 10.3389/fgene.2021.667697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kanakis GA, Nieschlag E. Klinefelter syndrome: more than hypogonadism. Metabolism. 2018;86:135–144. doi: 10.1016/j.metabol.2017.09.017. [DOI] [PubMed] [Google Scholar]
- 13.Visootsak J, Graham JM., Jr. Klinefelter syndrome and other sex chromosomal aneuploidies. Orphanet. J. Rare Dis. 2006;1:42. doi: 10.1186/1750-1172-1-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bardsley MZ, et al. 47,XYY syndrome: clinical phenotype and timing of ascertainment. J. Pediatr. 2013;163:1085–1094. doi: 10.1016/j.jpeds.2013.05.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ross JL, Zeger MPD, Kushner H, Zinn AR, Roeltgen DP. An extra X or Y chromosome: contrasting the cognitive and motor phenotypes in childhood in boys with 47,XYY syndrome or 47,XXY Klinefelter syndrome. Dev. Disabil. Res. Rev. 2009;15:309–317. doi: 10.1002/ddrr.85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ranke MB, Saenger P. Turner’s syndrome. Lancet. 2001;358:309–314. doi: 10.1016/S0140-6736(01)05487-3. [DOI] [PubMed] [Google Scholar]
- 17.Gravholt CH, Viuff MH, Brun S, Stochholm K, Andersen NH. Turner syndrome: mechanisms and management. Nat. Rev. Endocrinol. 2019;15:601–614. doi: 10.1038/s41574-019-0224-4. [DOI] [PubMed] [Google Scholar]
- 18.Cuckle H, Benn P. Review of epidemiological factors (other than maternal age) that determine the prevalence of common autosomal trisomies. Prenat. Diagn. 2021;41:536–544. doi: 10.1002/pd.5822. [DOI] [PubMed] [Google Scholar]
- 19.Villalba-Mouco V, et al. Genomic transformation and social organization during the Copper Age-Bronze Age transition in southern Iberia. Sci. Adv. 2021;7:eabi7038. doi: 10.1126/sciadv.abi7038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Moilanen U, et al. A Woman with a Sword? – Weapon Grave at Suontaka Vesitorninmäki, Finland. Eur. J. Archaeol. 2022;25:42–60. doi: 10.1017/eaa.2021.30. [DOI] [Google Scholar]
- 21.Margaryan A, et al. Population genomics of the Viking world. Nature. 2020;585:390–396. doi: 10.1038/s41586-020-2688-8. [DOI] [PubMed] [Google Scholar]
- 22.Roca-Rada X, et al. A 1000-year-old case of Klinefelter’s syndrome diagnosed by integrating morphology, osteology, and genetics. Lancet. 2022;400:691–692. doi: 10.1016/S0140-6736(22)01476-3. [DOI] [PubMed] [Google Scholar]
- 23.Ebenesersdóttir SS, et al. Ancient genomes from Iceland reveal the making of a human population. Science. 2018;360:1028–1032. doi: 10.1126/science.aar2625. [DOI] [PubMed] [Google Scholar]
- 24.Cassidy LM, et al. A dynastic elite in monumental Neolithic society. Nature. 2020;582:384–388. doi: 10.1038/s41586-020-2378-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hassold T, Hunt P. To err (meiotically) is human: the genesis of human aneuploidy. Nat. Rev. Genet. 2001;2:280–291. doi: 10.1038/35066065. [DOI] [PubMed] [Google Scholar]
- 26.Fu Q, et al. DNA analysis of an early modern human from Tianyuan Cave, China. Proc. Natl Acad. Sci. USA. 2013;110:2223–2227. doi: 10.1073/pnas.1221359110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Rohland N, et al. Three assays for in-solution enrichment of ancient human DNA at more than a million SNPs. Genome Res. 2022;32:2068–2078. doi: 10.1101/gr.276728.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Antonio ML, et al. Ancient Rome: A genetic crossroads of Europe and the Mediterranean. Science. 2019;366:708–714. doi: 10.1126/science.aay6826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinforma. 2014;15:356. doi: 10.1186/s12859-014-0356-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tüttelmann F, Gromoll J. Novel genetic aspects of Klinefelter’s syndrome. Mol. Hum. Reprod. 2010;16:386–395. doi: 10.1093/molehr/gaq019. [DOI] [PubMed] [Google Scholar]
- 31.Skoglund P, et al. Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal. Proc. Natl Acad. Sci. USA. 2014;111:2229–2234. doi: 10.1073/pnas.1318934111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hook EB, Warburton D. The distribution of chromosomal genotypes associated with Turner’s syndrome: livebirth prevalence rates and evidence for diminished fetal mortality and severity in genotypes associated with structural X abnormalities or mosaicism. Hum. Genet. 1983;64:24–27. doi: 10.1007/BF00289473. [DOI] [PubMed] [Google Scholar]
- 33.Zhong Q, Layman LC. Genetic considerations in the patient with Turner syndrome-45,X with or without mosaicism. Fertil. Steril. 2012;98:775–779. doi: 10.1016/j.fertnstert.2012.08.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bycroft C, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–209. doi: 10.1038/s41586-018-0579-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Tuke MA, et al. Mosaic Turner syndrome shows reduced penetrance in an adult population study. Genet. Med. 2019;21:877–886. doi: 10.1038/s41436-018-0271-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hughes IA. Disorders of sex development: a new definition and classification. Best. Pract. Res. Clin. Endocrinol. Metab. 2008;22:119–134. doi: 10.1016/j.beem.2007.11.001. [DOI] [PubMed] [Google Scholar]
- 37.Furtado PS, et al. Gender dysphoria associated with disorders of sex development. Nat. Rev. Urol. 2012;9:620–627. doi: 10.1038/nrurol.2012.182. [DOI] [PubMed] [Google Scholar]
- 38.Fisher AD, et al. Gender identity, gender assignment and reassignment in individuals with disorders of sex development: a major of dilemma. J. Endocrinol. Invest. 2016;39:1207–1224. doi: 10.1007/s40618-016-0482-0. [DOI] [PubMed] [Google Scholar]
- 39.Lee PA, et al. Global Disorders of Sex Development Update since 2006: Perceptions, Approach and Care. Horm. Res. Paediatr. 2016;85:158–180. doi: 10.1159/000442975. [DOI] [PubMed] [Google Scholar]
- 40.Garofalo, E. M. & Garvin, H. M. The confusion between biological sex and gender and potential implications of misinterpretations. in Sex Estimation of the Human Skeleton 35–52 (Elsevier, 2020).
- 41.Armit, I. The Wetwang/Garton Slack Project (WGSP). York:Archaeology Data Service.10.5284/1030285 (2021).
- 42.Ubelaker, D. H. & Buikstra, J. E. Standards for Data Collection from Human Skeletal Remains: Proceedings of a Seminar at the Field Museum of natural history. (Arkansas Archeological Survey, 1994).
- 43.Scheuer, L. & Black, S. Developmental Juvenile Osteology. (Elsevier Academic Press, 2000).
- 44.AlQahtani, S. J., Hector, M. P. & Liversidge H. M. Brief communication: The London atlas of human tooth development and eruption. Am. J. Phys. Anthropol.142, 481–490 (2010). [DOI] [PubMed]
- 45.Harding, D. W. Death and burial in Iron Age Britain / D.W. Hardin. (Oxford University Press, 2016).
- 46.Cox M, Loe L. The Human skeletal remains from Fishmonger’s Swallet, Alveston, Gloucestershire: Evidence for anthropogenic modification. Proc. Univ. Bristol Spelaeol. Soc. 2022;29:33–66. [Google Scholar]
- 47.Langford, F. Third report on Read’s Cavern (Keltic Cavern). Proceedings of the University of Bristol Spelaeological Society1, 135–143 (1922).
- 48.Binder G. Short stature due to SHOX deficiency: genotype, phenotype, and therapy. Horm. Res. Paediatr. 2011;75:81–89. doi: 10.1159/000324105. [DOI] [PubMed] [Google Scholar]
- 49.Doswell BH, Visootsak J, Brady AN, Graham JM., Jr. Turner syndrome: an update and review for the primary pediatrician. Clin. Pediatr. 2006;45:301–313. doi: 10.1177/000992280604500402. [DOI] [PubMed] [Google Scholar]
- 50.Alhazmi A, et al. Timing and rate of spheno-occipital synchondrosis closure and its relationship to puberty. PLoS One. 2017;12:e0183305. doi: 10.1371/journal.pone.0183305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Bull MJ. Down Syndrome. N. Engl. J. Med. 2020;382:2344–2352. doi: 10.1056/NEJMra1706537. [DOI] [PubMed] [Google Scholar]
- 52.Hill, B. Women, Work And Sexual Politics In Eighteenth-Century England. (Routledge, 1993).
- 53.Arnold B. “Honorary Males” or Women of Substance? Gender, Status, and Power in Iron-Age Europe. J. Eur. Archaeol. 1995;3:153–168. doi: 10.1179/096576695800703757. [DOI] [Google Scholar]
- 54.Mays, S. et al. Sex identification of a Late Iron Age sword and mirror cist burial from Hillside Farm, Bryher, Isles of Scilly, England. J. Archaeol. Sci. Rep.52, 104099 (2023).
- 55.Zhao Y, et al. Detection and characterization of male sex chromosome abnormalities in the UK Biobank study. Genet. Med. 2022;24:1909–1919. doi: 10.1016/j.gim.2022.05.011. [DOI] [PubMed] [Google Scholar]
- 56.Charlton S, Booth T, Barnes I. The problem with petrous? A consideration of the potential biases in the utilization of pars petrosa for ancient DNA analysis. World Archaeol. 2019;51:574–585. doi: 10.1080/00438243.2019.1694062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.McFadden C, Muir B, Oxenham MF. Determinants of infant mortality and representation in bioarchaeological samples: A review. Am. J. Biol. Anthropol. 2022;177:196–206. doi: 10.1002/ajpa.24406. [DOI] [Google Scholar]
- 58.Green RE, et al. Analysis of one million base pairs of Neanderthal DNA. Nature. 2006;444:330–336. doi: 10.1038/nature05336. [DOI] [PubMed] [Google Scholar]
- 59.Malmström H, Storå J, Dalén L, Holmlund G, Götherström A. Extensive human DNA contamination in extracts from ancient dog bones and teeth. Mol. Biol. Evol. 2005;22:2040–2047. doi: 10.1093/molbev/msi195. [DOI] [PubMed] [Google Scholar]
- 60.Krause J, et al. A Complete mtDNA Genome of an Early Modern Human from Kostenki, Russia. Curr. Biol. 2010;20:231–236. doi: 10.1016/j.cub.2009.11.068. [DOI] [PubMed] [Google Scholar]
- 61.Kohn TP, Kohn JR, Owen RC, Coward RM. The Prevalence of Y-chromosome Microdeletions in Oligozoospermic Men: A Systematic Review and Meta-analysis of European and North American Studies. Eur. Urol. 2019;76:626–636. doi: 10.1016/j.eururo.2019.07.033. [DOI] [PubMed] [Google Scholar]
- 62.Rohland N, Glocke I, Aximu-Petri A, Meyer M. Extraction of highly degraded DNA from ancient bones, teeth and sediments for high-throughput sequencing. Nat. Protoc. 2018;13:2447–2461. doi: 10.1038/s41596-018-0050-5. [DOI] [PubMed] [Google Scholar]
- 63.Gansauge M-T, Aximu-Petri A, Nagel S, Meyer M. Manual and automated preparation of single-stranded DNA libraries for the sequencing of DNA from ancient biological remains and other sources of highly degraded DNA. Nat. Protoc. 2020;15:2279–2300. doi: 10.1038/s41596-020-0338-0. [DOI] [PubMed] [Google Scholar]
- 64.Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 2010, 10.1101/pdb.prot5448 (2010). [DOI] [PubMed]
- 65.Fellows Yates JA, et al. Reproducible, portable, and efficient ancient genome reconstruction with nf-core/eager. PeerJ. 2021;9:e10947. doi: 10.7717/peerj.10947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Mallick, S. et al. The Allen Ancient DNA Resource (AADR): A curated compendium of ancient human genomes. bioRxiv 2023.04.06.535797. 10.1101/2023.04.06.535797 (2023). [DOI] [PMC free article] [PubMed]
- 67.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Peltzer A, et al. EAGER: efficient ancient genome reconstruction. Genome Biol. 2016;17:60. doi: 10.1186/s13059-016-0918-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Renaud G, Slon V, Duggan AT, Kelso J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biol. 2015;16:224. doi: 10.1186/s13059-015-0776-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Ralf A, Montiel González D, Zhong K, Kayser M. Yleaf: Software for Human Y-Chromosomal Haplogroup Inference from Next-Generation Sequencing Data. Mol. Biol. Evol. 2018;35:1291–1294. doi: 10.1093/molbev/msy032. [DOI] [PubMed] [Google Scholar]
- 71.Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27:2987–2993. doi: 10.1093/bioinformatics/btr509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience10, (2021). [DOI] [PMC free article] [PubMed]
- 73.Weissensteiner H, et al. HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic Acids Res. 2016;44:W58–W63. doi: 10.1093/nar/gkw233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.1000 Genomes Project Consortium. et al. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Anastasiadou, K. karyo_RxRy.v1.0.1. 10.5281/zenodo.10186749 (2023).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing data (BAM files) are available on the European Nucleotide Archive (accession code: PRJEB65239). Public datasets21,28 were accessed through the European Nucleotide Archive (accession codes: PRJEB37976 and PRJEB32566). Source data used to plot Fig. 1a are in Supplementary Data 2. Source data used to plot Fig. 2b, c are in Supplementary Data 4. Source data used to plot Fig. 3 are in Supplementary Data 3.
The sex identification method76 described and used in this paper can be accessed at https://github.com/kyriaki-anast/karyo_RxRy. This includes an option for target-enriched libraries and autosomal aneuploidy detection.