Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2022 Jun 13;38(15):3768–3777. doi: 10.1093/bioinformatics/btac390

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© The Author(s) 2022. Published by Oxford University Press.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

PMC Copyright notice

Fig. 5. — Assessing performance on empirical aDNA Data. (a, b) We performed downsampling experiments on 1240k data of two Sardinian samples, SUA001 and SUA002, both from Marcus et al. (2020). The original BAM files were down-sampled to various coverages with 100 independent replicates for each coverage. (a) Comparison between our method and ANGSD on SUA001, estimated to be 10.45% (95% CI: 9.56–11.34%) contaminated by ANGSD (on full data, visualized by the horizontal red line). (b) Comparison between our method and ANGSD on SUA002, estimated to be 0.38% (95% CI: 0.072–0.69%) contaminated by ANGSD (on full data, visualized by the horizontal red line). (c) We down-sampled WGS data of DA43, XiongNu, Mongolia from de Barros Damgaard et al. (2018). The original BAM file for DA43 was down-sampled to various coverages 0.01–0.5×, with 100 independent replicates for each target coverage. We only visualized ANGSD’s results on 0.05×, 0.1×, 0.5× because its estimates at coverage lower than 0.05× were highly variable. DA43 is estimated to be 2.83 % (95% CI: 2.35–3.31%) contaminated by ANGSD (on full data, visualized by the horizontal red line). (d) We compared our new method and ANGSD on 1240k aDNA data of 89 samples from the Iberian Peninsula and of 66 Eurasian hunter-gatherers. The true contamination rate is unknown. No down-sampling was performed and all individuals (dots) are color coded by the average coverage on 1240k SNPs on chromosome X. The inlet visualizes a zoom-in into $[0, 0.05] \times [0, 0.05]$ . A similar figure that only shows the Eurasian hunter-gatherers is available in Supplementary Figure S13