Abstract
Poor DNA preservation is the most limiting factor in ancient genomic research. In the majority of ancient bones and teeth, endogenous DNA molecules represent a minor fraction of the whole DNA extract, rendering shot-gun sequencing inefficient for obtaining genomic data. Based on ancient human bone samples from temperate and tropical environments, we show that an EDTA-based enzymatic ‘pre-digestion’ of powdered bone increases the proportion of endogenous DNA several fold. By performing the pre-digestion step between 30 min and 6 hours on five bones, we observe an asymptotic increase in endogenous DNA content, with a 2.7-fold average increase reached at 1 hour. We repeat the experiment using a brief pre-digestion (15 or 30 mins) on 21 ancient bones and teeth from a variety of archaeological contexts and observe an improvement in 16 of these. We here advocate the implementation of a brief pre-digestion step as a standard procedure in ancient DNA extractions. Finally, we demonstrate on 14 ancient teeth that by targeting the outer layer of the roots we obtain up to 14 times more endogenous DNA than when using the inner dentine. Our presented methods are likely to increase the proportion of ancient samples that are suitable for genome-scale characterization.
With the introduction of next-generation sequencing (NGS) technology, ancient DNA (aDNA) research has advanced over the last decade from retrieving short segments of mitochondrial DNA (mtDNA) to the characterization of complete genomes (reviewed in1). Among many highlights in such paleogenomic research is the documentation of a Late Pleistocene admixture between Neanderthals and anatomically modern humans2,3, the discovery of a previously unknown hominin, the Denisovan4,5,6, the description of the early human colonisation of the Americas and the Arctic7,8,9,10 and the clarification of equine evolution by sequencing the complete genome of a 700,000 years old horse11.
With some exceptions12,13, the ancient hominin samples subjected to genomic sequencing have displayed an exceptional biomolecule preservation with 28–70% of the DNA in the extracts identified as authentic3,5,7,9,14,15,16,17,18. However, this proportion of authentic DNA molecules, referred to as the ‘endogenous’ DNA content, represents less than 1% in the vast majority of DNA extracts from ancient remains (e.g.,19,20). Instead, the bulk of the DNA content in aDNA extracts is normally microbial21,22. Obtaining whole genomes, or genome-wide information is preferentially achieved using Next Generation ‘shotgun’ Sequencing, but when the target molecules represent such a minute fraction this is either not feasible or, at best, extremely expensive. Although genomic capture methods can be used to enrich for the target DNA19, most ancient samples will remain unsuitable for genome-scale characterization. It is therefore common to extract a large number of samples in an initial screening phase and then shotgun sequence them at low depth, in order to identify a few good candidate samples that have high endogenous contents. This approach is time-consuming and expensive, and requires the destruction of many samples that will not be amenable to genome analyses. Thus, the field of paleogenomics will benefit greatly from methodological advances that result in an increase of the endogenous DNA fraction during the DNA extraction. In this study we present two such advances.
Although the biochemical processes driving DNA preservation in bone are not fully understood, it has been shown that aDNA is preserved both in association with the bone minerals, hydroxy-apatite aggregates, and within the organic collagen fibrils23,24. During decomposition the bone structure degrades, which increases the porosity and total surface area of the bone23,25. Although the colonization of microorganisms is likely heterogenous26, we expect that during the digestion of bone material in the first step of an aDNA extraction, surface contaminants will be released into solution first, regardless of their exact location of deposition. This is in contrast to the endogenous DNA which is likely located more protected within the bone’s microniches27. We therefore hypothesized that treating the grinded bone material with a digestion buffer for a short period of time (a “pre-digestion”) would remove a fraction of the exogenous non-target DNA, and thereby enrich the DNA extract for endogenous DNA. Similarly, we hypothesized that modern human DNA contamination, deposited on the bone surface during recent handling, would also be preferentially removed with such pre-digestions. Higher endogenous DNA fractions were recently observed on second extractions undertaken on remaining bone pellets that had not been fully dissolved after 24 hours of incubation in a digestion buffer22,27,28. These promising observations provided an impetus for a more systematic assessment of the phenomenon in order to validate the potential for implementing a pre-digestion step into standardized aDNA extraction protocols, and identify the optimal duration of a pre-digestion.
We therfore used Next Generation shotgun sequencing to monitor the changes in endogenous DNA content and sequence complexity in DNA extracts from bone material treated with varying pre-digestion times (30 mins to 6 hours). Four bones from Easter Island (post-AD 1200), and one bone from Copenhagen, Denmark (18th century) were included in this initial experiment. Additionally, 21 ancient bones and teeth from Easter Island (post-AD 1200), Hungary (Bronze-Age 2000-1500 BC), and Guadeloupe (400-1400 AD) were used in a follow-up experiment to confirm the efficiency of the method and test the significance of the improvement when applying brief pre-digestions (i.e. 15–30 minutes).
Teeth roots have been demonstrated as an excellent resource for aDNA29,30,31 but a systematic comparison of the endogenous DNA proportions in various parts of a root is lacking. It has been shown that the nuclear DNA concentrations decline drastically in the inner dentine layer throughout the life of an individual32, whereas levels of nucleated cells in the apical cementum layer are unaffected by age31 . Moreover, a quantitative PCR approach determined that the concentration of human mtDNA in ancient teeth is generally elevated in the cementum as compared to the dentine29. However, because the cementum layer is exposed at the root surface, it could potentially be more affected by microbial colonization than dentine, and would thus still display a lower endogenous DNA proportion. To investigate this we extracted DNA and used shotgun sequencing to estimate the endogenous DNA proportions in crushed root surface (likely to be enriched for the outermost cementum layer), and the deeper parts of the root (containing mainly dentine) from 14 ancient teeth (Table 1), from Denmark (18th century and Iron Age c. 100 AD), Easter Island (post-AD 1200), and Greenland (c. 1100 AD).
Table 1. Sample information.
Experiment type | Sample name | Origin | App. age | Sample type |
---|---|---|---|---|
Pre-digestion time | EI9 | Easter Island | Post-1200 AD | Long bone |
EI19 | Easter Island | Post-1200 AD | Long bone | |
EI22 | Easter Island | Post-1200 AD | Long bone | |
EI8 | Easter Island | Post-1200 AD | Long bone | |
Trinitatis | Denmark | 18th century | Long bone | |
Brief pre-digestion | ANR 4704 | Easter Island | Post-1200 AD | Cranial bone |
ANR 4705 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4706 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4707 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4708 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4709 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4710 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4712 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4714 | Easter Island | Post-1200 AD | Cranial bone | |
ANR 4715 | Easter Island | Post-1200 AD | Cranial bone | |
Rise 479 | Hungary | Bronze Age, 2000-1500 BC | Long bone | |
Rise 483 | Hungary | Bronze Age, 2000-1500 BC | Long bone | |
AAG1_1 | Guadeloupe | 400-1400 AD | Long bone | |
AAG2_1 | Guadeloupe | 400-1400 AD | Long bone | |
AAG3_1 | Guadeloupe | 400-1400 AD | Long bone | |
AAG4_1 | Guadeloupe | 400-1400 AD | Long bone | |
AAG5_1 | Guadeloupe | 400-1400 AD | Long bone | |
AAG6_1 | Guadeloupe | 400-1400 AD | Tooth | |
AAG7_1 | Guadeloupe | 400-1400 AD | Tooth | |
AAG8_1 | Guadeloupe | 400-1400 AD | Tooth | |
AAG9_1 | Guadeloupe | 400-1400 AD | Tooth | |
Cementum vs dentine | Trinitatis 1 | Denmark | 18th century | Tooth |
Trinitatis 2 | Denmark | 18th century | Tooth | |
ANR 4709 | Easter Island | Post-1200 AD | Tooth | |
ANR 4711 | Easter Island | Post-1200 AD | Tooth | |
ANR 4713 | Easter Island | Post-1200 AD | Tooth | |
VHM00500 X7 | Denmark | Iron Age, c.100 AD | Tooth | |
VHM00500 X22 | Denmark | Iron Age, c.100 AD | Tooth | |
VHM00500 X73 | Denmark | Iron Age, c.100 AD | Tooth | |
VHM00500 X77 | Denmark | Iron Age, c.100 AD | Tooth | |
VHM00500 X81 | Denmark | Iron Age, c.100 AD | Tooth | |
ID-530 | Greenland | c.1100 AD | Tooth | |
ID-532 | Greenland | c.1100 AD | Tooth | |
ID-677 | Greenland | c.1100 AD | Tooth | |
ID-678 | Greenland | c.1100 AD | Tooth |
Materials and methods
All the laboratory work was performed in the dedicated clean laboratory facilities at the Centre for GeoGenetics, Natural History Museum, University of Copenhagen, according to strict aDNA standards33,34.
Sample information
A total of 26 ancient human bones and teeth from various archaeological contexts spanning tropical and temperate environments were included in the pre-digestion experiments. Fourteen ancient teeth were used in the comparison between DNA extracted from the root core (dentine) and root surface (cementum-enriched). All relevant information regarding the samples are provided in Table 1.
DNA extractions with varying pre-digestion time
The bone surface at the sampling area was removed using a scalpel or a sterile drill bit. Cortical bone mass was drilled and homogenized, and 400 mg of bone powder was transferred to each of six 15 mL Falcon tubes (labelled A–F) (Fig. 1). To counteract the effect of granular convection by which the smallest bone particles end up in the first tubes, we homogenized the powder between each transfer.
The six sub-samples were subjected to a digestion buffer containing 4.7 mL 0.5 M EDTA, 50 μL recombinant Proteinase K, and 250 μL 10% N-Laurylsarcosyl and incubated simultaneosuly at 50 °C. After 30 minutes, extraction A was centrifuged and the supernatant (the pre-digest) was removed. An identical digestion buffer was then transferred to the undigested and sedimented bone powder, and the sample was vortexed and returned to incubation for a full 24 hour digestion (Fig. 1). Similarly, pre-digest supernatants were removed for extractions B-E at time points: 1 hour, 2 hours, 3 hours, 6 hours respectively and new digestion buffer added. No pre-digest supernatant was removed for extraction F. At 24 hours all digestions were stopped (Fig. 1). The samples were then centrifuged and the supernatants transferred to new tubes for DNA extraction. Five bones (Table 1) were selected for this initial experiment aiming to optimize pre-digestion time, yielding a total of 30 DNA extracts.
A guanidinum thiocyanate-based binding buffer11 was used when extracting the DNA from the supernatant. The buffer was prepared by mixing 118.2 g Guanidinium Thiocyanate with 10 mL Tris 1 M, 1 mL NaCl 5 M, 8 mL EDTA 0.5 M, 1 g N-Lauryl-Sarcosyl and water to a total volume of 200 mL. 20 mL of the binding buffer was transferred to each sample and left rotating for 3 hours with 100 ul silica powder in solution to bind the DNA. After DNA-binding, the silica was centrifuged and washed twice with 1 ml 80% cold ethanol, and the DNA eluted in 80 μl EB Buffer (Qiagen). The DNA concentration in all final extracts was measured using Qubit® Fluorometric Quantitation (Life Technologies, Grand Island, NY).
Following this initial experiment, we tested the consistency of the improvement with short pre-digestion times on 17 ancient and historical bone samples from Easter Island, Guadeloupe and Hungary (Table 1) and 4 ancient teeth from Guadeloupe. Each drilled bone sample was homogenized and split into two equal amounts, one that was extracted with 15 or 30 minutes of pre-digestion treatment, and one that was not pre-digested. The roots of the four teeth were drilled into powder post-removal of the inner dentine and likewise split into two fractions. The extraction procedure was identical to the one described above.
In an additional experiment we used 14 ancient teeth (Table 1) to compare the endogenous DNA content of root surface against the inner dentine core. First, the outermost surface of the teeth was removed with a drill-bit or a scalpel, as is standard procedure to exclude the most obvious surface DNA contamination. The effect of the cleaning was observed as a change in surface color, but it was done gently and did not result in a visible effect on the thickness of the root. Next, each tooth was split with a cutting disk into two pieces (the crown and the root) on the transverse plane (Fig. 2). The dentine was then drilled out of the root from the pulp cavity and transferred to a sterile tube, leaving a hollow ‘root cap’. This root cap, likely to be enriched for cementum, was then crushed with a mortar or cut into smaller pieces before being transferred to a sterile tube. The two fractions from each tooth root (crushed root surface and drilled dentine core) were then extracted separately as above, but without pre-digestion.
Library preparation and sequencing
Blunt-end Illumina sequencing libraries were built following guidelines previously outlined11, using the NEBNext® DNA Library Prep Master Mix Set E6070 (New England Biolabs Inc., Manual Version 2.135. The libraries were amplified using a two-round PCR setup9,36 with primers containing a 6 bp known index sequence. Details on the library preparation and PCR amplification conditions can be found in Supplementary Information Methods S1.
The amplified libraries were quantified on an Agilent 2200 TapeStation (Agilent Technologies, Palo Alto, CA, USA) or an Agilent Bioanalyzer 2100. The library pools were sequenced (100 bp, single read) at the Danish National High-throughput DNA Sequencing Centre. Basecalling and sequence sorting by sample-specific indexes was performed by the Sequencing Centre using CASAVA v.1.8.2.
Data analyses
All reads were trimmed for adapter sequences using AdapterRemoval 1.5.237, and only reads with a minimal length of 30 bp were retained. The trimmed sequences were mapped against the human reference genome Hg19, HS Build37.1, using bwa38 with the samse function using standard parameters except that seeding was disabled, following published recommendations39. We used all the sequences that mapped uniquely to one position in the human reference genome and then removed duplicate sequences from the output bam file using the rmdup function in samtools40. The relevant summary statistics (Supplementary Tables S1-S3) used to estimate the endogenous DNA fraction (fraction of uniquely mapped human sequences divided by the total number of sequences passing trimming) and sequence clonality (proportion of duplicate human sequences), were extracted with a custom Perl script. The clonality of each library will increase with increased sequencing depth, implying that the overall sequencing efficiency (fraction of non-duplicated endogenous DNA sequences divided by total sequences) decreases. Hence we down-sampled the raw sequencing files (fastq) to match the smallest number of sequences per bone to allow for a direct comparison of sequencing efficiency with or without the new extraction methods.
We also investigated the data for signatures of DNA damage. This was done in part to confirm that the profiled human DNA was not modern contamination, and in part to measure if the pre-digestion treatment would result in any obvious compositional biases in the DNA, or damage it further.
Based on the sequence length distributions of the sequences identified as human, we estimated the decay constant k (representing the fraction of broken bonds in the DNA backbones) and average DNA fragment length in the extract (1/k), as previously described41,42. A large k value reflects a pronounced exponential accumulation of small DNA fragments as a consequence of post mortem DNA breakage which is a signature of highly degraded DNA. Following the approach described in42, we investigated only the declining part of the distribution for each sample (40–90 bp) since the ends of the distribution are biased respectively by poor recovery of <40 bp fragments during the DNA extractions (and the library building process), and the accumulation of >94 bp reads sequenced to the maximum length on the Illumina platform with the here applied chemistry.
Using standard parameters in the Bayesian approach implemented in mapDamage 2.043 we estimated the position-specific cytosine deamination probability (δs) as well as the probability of a base being positioned within a single-stranded overhang (λ), which thus relates directly to the average length of the overhangs, reflecting post mortem damage of aDNA35. In order to increase the accuracy of the damage estimates performed on the ancient human DNA fractions, these were based on the total mapped datasets and not the downsampled files. Outputs from mapDamage 2.0 were analysed and plotted with R.
Sequence quality control statistics were generated using fastqc on the retrieved human sequences from each (not downsampled) file, as well as the total sequences of the library (downsampled human + non-human sequences), and were used to check for abnormalities, and particularly to investigate for potential changes in GC-content following pre-digestion. The results were plotted with R.
Despite the implementation of strict aDNA protocols, it is difficult to completely avoid contamination from modern DNA when working with ancient human material - in particularly when dealing with samples that have been handled previously during excavation and while stored at museum collections44. It was therefore important to establish that any potential increase in endogenous DNA content following our treatments was not an effect of DNA contamination. Accurate estimates of contamination levels require large amounts of genomic data, so we here restricted this analysis to samples with a mitochondrial genome coverage above five times (5X). We used contamMix45 (Fu et al., 2013) to estimate the level of human DNA contamination in the mtDNA sequences. This method compares for each individual the mapping affinities of its mtDNA sequences to its own consensus mitogenome sequence, relative to the mapping affinity of its mtDNA sequences to a dataset of potential contaminants represented by 311 mitogenomes from worldwide populations. The mitogenome consensus sequences were made using the samtools mpileup function (Li et al., 2009) and filtering the variant list outputted with bcftools with a script previously used (Jacobsen et al., 2014), selecting only bases with a coverage >5X and >50% concordance between bases, in order not to incorporate bases with only limited depth-of-coverage and minimize biases from sequencing errors and DNA damage misincorporations. One sample, Rise479 A, displayed heavy contamination at informative sites (ie. ~50%) which prevented the determination of a reliable consensus sequence, thus challenging the contamination assessment. However, a sample from the same individual Rise479 B (the pre-digested extract from the same bone) revealed only minimal contamination (ie. 1.2%). We could therefore use the consensus sequence obtained from this extract as a representation of the true mitochondrial sequence of Rise479 A, and thereby produce a reliable contamination estimate in the latter (see Table 2 for results).
Table 2. Contamination estimates.
Experiment type | Sample | Mitogenome coverage | Estimated contaminated fraction | 95 % probability interval |
---|---|---|---|---|
Pre-digestion time | Rise479 A | 10.6 X | 45.0%1 | 37.5 to 53.7%1 |
Rise479 B | 16.7 X | 1.2% | 0.2 to 3.9 % | |
Rise483 A | 14.8 X | 2.5% | 0.4 to 5.7% | |
Rise483 B | 5.9 X | 0.3% | 0.0 to 6.0% | |
Dentine vs. cementum-enriched | ID-530_A | 8.8 X | 1.1% | 0.1 to 5.3% |
ID-530_B | 11.8 X | 0.9% | 0.1 to 3.5% | |
ID-532_A | 10.1 X | 1.4% | 0.2 to 4.5% | |
ID-532_B | 33.6 X | 0.7% | 0.2 to 2.3% | |
VHM00500 X7_A | 15.7 X | 0.3% | 0.0 to 4.8% | |
VHM00500 X7_B | 16.7 X | 0.2% | 0.0 to 3.1% | |
VHM00500 X81_A | 18.2 X | 19.5% | 15.3 to 25.5% | |
VHM00500 X81_B | 21.8 X | 0.1% | 0.0 to 1.3% |
The values represent the estimated fraction of human mtDNA contaminants as determined using contamMix. In the pre-digestion experiment, samples named “B” were pre-digested as opposed to samples named “A”. In the dentine vs. root surface experiments, samples named “B” represent the surface as opposed to the inner dentine named “A”.
1estimated using the consensus of Rise479 B as reference.
Results
Effects of pre-digestion time
Following 24 hours of digestion, the samples displayed a spectrum of colors, in which long pre-digestion times resulted in lighter colors of the final digest, likely attributable to the removal of dirt and contaminant particles with long pre-digestion treatments. This observation is accompanied by a decline in total DNA concentrations in the extracts as a function of pre-digestion time (Supplementary Fig. S1). In three cases, the DNA concentration as a function of pre-digestion time is described well by an exponential decline (R2 values 0.72–0.99), while the fit is more modest in two cases (R2 = 0.40 and 0.63) (Supplementary Fig. S1).
By aligning an average of ~17 million reads per library against the human reference genome Hg19, we tracked the endogenous DNA content as a function of pre-digestion time. Despite very low endogenous DNA contents (0.001% to 1.6%, see Table 1) the pre-digested extracts showed an increase in human DNA (Fig. 3), with an average fold-increase of c. 2 after 30 minutes of pre-digestion and c. 3 after 1 to 6 hours of pre-digestion. Despite sample to sample variation, and some extreme outlier values, the average increase in endogenous DNA content following pre-digestion appeared to be logarithmic (R2 = 0.87, p = 2.5 e-5) reaching the asymptotic maximum after c.1 hour of pre-digestion (Fig. 3).
Because DNA concentrations decrease with longer pre-digestion treatments, we tested if this could be tracked as an increase in sequence clonality among the human DNA sequences due to endogenous DNA loss. An increase in clonality would result in poor library sequencing efficiency despite a higher endogenous DNA content. Overall, clonality levels were low with average values ranging from 1.5% (no pre-digestion) to 4.8% (6 hours of pre-digestion) (see Supplementary Table S1), although we note that library complexity predictions from small datasets of shallow sequencing can give false estimates of library complexity (Daley and Smith, 2013). For bone EI8 the clonality increased from 1.3% (no pre-digestion) to 18% (6 hours of pre-digestion) causing a complete stagnation in the increase in library efficiency (see Supplementary Table S1).
Brief pre-digestions on 21 samples
Given that short pre-digestion times appeared to improve access to endogenous DNA content, we next compared the endogenous human DNA content of 17 bones and 4 teeth extracted with and without a brief pre-digestion of 15 or 30 minutes (Table 1), in order to confirm the efficiency of the method. The mean enrichment in endogenous DNA content was c. 2-fold (Fig. 4), and highly significant as revealed by a one-sided paired t-test (t = 2.56, df = 20, p-value = 0.009).
The overall sequencing efficiency increase was similar to the increase in endogenous DNA content, reflecting that clonality levels were nearly equivalent in the pre-digested and the non-pre-digested samples. Only one sample (Rise483) displayed a clear loss of sequence complexity when pre-digested (Fig. 4, Sample 15 and Supplementary Table S2) resulting in lower library efficiency (140353 normalized non-clonal human reads) compared to the non-pre-digested sample (165954 normalized non-clonal human reads).
Effects of pre-digestion on DNA composition
In general we observed a negligible change in the GC-content in the total datasets (human + non-human) following pre-digestion (Supplementary Fig. S3). The genomic GC-content in the identified human reads was ~50% for AmpliTaq Gold amplified libraries and ~40% GC for Kapa U + amplified libraries, but there was no correlation between GC-content in the human fraction and pre-digestion times (Supplementary Fig. S2).
Similarly, the DNA damage parameters δs and λ for the human reads displayed no general trend as a function of pre-digestion time and likewise no general pattern was discernible for the decay constants (k) and the estimated average fragment length (1/k) of the human DNA in the extracts (Supplementary Figure S4-S6, Supplementary Table S4).
Human DNA from the dentine core and cementum-rich surface of teeth roots
Finally, we investigated whether the outer layer of teeth roots contained higher endogenous DNA proportions than the dentine, which represents the inner part of the tooth. For 11 of 14 teeth, we observed a higher fraction of human DNA in the root surface when compared to the dentine (Fig. 5). The mean fold-increase in endogenous DNA proportion was c. 5-fold with values ranging from 0.3-fold to 14-fold. The one-sided paired t-test reveals a significant increase in endogenous DNA at the surface as compared to the dentine core (t = 2.10, df = 13, p-value = 0.05).
The library efficiency increase was almost identical to the endogenous DNA proportion increase (Fig. 5), signifying that extracting from the root surface will generally yield a higher proportion of human DNA without compromising complexity among the template molecules. Finally we note that in 11 out of 14 teeth the cytosine deamination ratio (δs) was significantly elevated in the dentine as compared to the cementum-enriched surface (Table 3).
Table 3. Comparing DNA composition.
Ancient DNA authentication
For all DNA extracts we observed elevated δs and λ values (see Supplementary Fig. S6) suggesting that the bulk of the DNA templates are of ancient origin. The level of DNA contamination was investigated for 12 extracts where we had sufficient data to meaningfully conduct this analysis (Table 2). Contamination levels proved negligible in 10 extracts, confirming that our observed enrichment in endogenous DNA was not driven by modern human contaminant DNA. Encouragingly, in two bone samples the pre-digestion resulted in reduced contamination levels from 45% to 1.2% in the Rise479 sample, and from 2.5% to 0.3% in Rise483. Likewise, while tooth VHM00500 × 81 appeared contaminated in the dentine fraction, contamination levels in the cementum fraction was estimated to be only 0.1% (Table 2).
Discussion
Pre-digestion time
This study has documented several improvements of immediate value to aDNA research. We show that pre-digestion is a simple and effective means to remove a proportion of non-target DNA from ancient samples. With an average value of 2.7-fold enrichment of the endogenous DNA content following 1 hour of pre-digestion, one would ultimately generate 2.7 times more usable data for the same price.
We interpret the asymptotic increase with longer pre-digestion treatments as a gradual change in the ratio of dissolved exogenous DNA over endogenous DNA. Although the endogenous DNA fraction increases with longer pre-digestions, the benefits of waiting this long are likely to be marginal.
We expected to observe an increase in DNA sequence clonality following longer pre-digestion times because of two complementary phenomena, namely that i) we have sequenced the human DNA fraction deeper with the same sequencing effort because exogenous DNA is now reduced, and/or ii) we have sequenced the human fraction deeper because there has been a significant reduction in the DNA library complexity - the human fraction included. The first phenomenon would be a positive outcome of pre-digestion, as it simply requires less sequencing effort before saturation is reached, similar to genomic capture methods19. The second phenomenon is problematic as it would reflect a loss of sequenceable genomic material. With long pre-digestion times we find saturation in the increase of endogenous DNA content, accompanied with a loss in DNA concentration (see Supplementary Fig. S1). In one case, we observe a considerable increase in sequence clonality. We deduce from these observations that it is advisable to apply this method conservatively using short pre-digestion times (15–30 mins).
We also stress that the optimal time of pre-digestion will depend on the temperature used during incubation. Here we have incubated at 50 °C, but if incubating at lower temperatures, it may be advantageous to apply a longer pre-digestion step.
Changes in genomic composition and DNA damage
It is conceivable that a pre-digestion treatment could result in DNA damage patterns different from those in non-pre-digested samples, either because the pre-digestion itself would induce more DNA damage or because it would increase accessibility to human DNA molecules in subniches with different preservation conditions. However, for the pre-digestion lengths tested in this study, we find no trends on the estimated DNA damage parameters or GC-contents (Supplementary Fig. S2, S4–S6, Supplementary Table S4). These observations provide evidence that the pre-digestion procedure will not damage the DNA or otherwise alter the genomic composition. This could be because the pre-digestions used in these experiments are relatively brief (<6 hours) and unspecific in dissolving both the organic and mineral phases of the samples. Hence they may not provide a basis for comparing endogenous DNA preserved within different subniches, such as that analyzed in Schwarz et al., (2009).
Comparing DNA in the surface and dentine core of teeth roots
In 11 of 14 samples we observe an increase in endogenous DNA content when extracting DNA from the hollow root cap compared to the drilled-out dentine. This demonstrates that the root surface is a highly advantageous substrate for aDNA extractions and we propose two possible explanations for this observation.
The thin cementum layer is located near the surface of the roots as shown on Fig. 2. Although we remove the external outermost surface in the initial sample preparation and cannot remove all of the dentine from the root cap, it seems highly likely that we enrich the sample for cementum with this method. Cementum has previously been shown to contain a higher concentration of human DNA compared to dentine29 and our results suggest that this is manifested in a higher proportion of human DNA reads in the total extract. Moreover, our DNA damage assessments determined that the human DNA preserved in the inner dentine displayed higher cytosine deamination ratios as compared to the DNA preserved in the cementum-enriched fraction, indicative of differential preservation conditions. While these observation are not necessarily related to altered access to bacterial invasion and hence endogenous levels, it underlines the impact of histological differences in DNA preservation.
The dentine is in direct contact with the pulp cavity (Fig. 2). With the traditional dentine drilling method it is therefore not unlikely that exogenous microbial DNA from the pulp cavity is co-extracted with the dentine, which then translates into a lower human DNA content in the sequencing.
The results of the contamination analyses (Table 2) showed that the elevated fraction of human DNA in the outer root layers is not simply due to DNA contamination. When teeth are available for aDNA studies, the results presented herein strongly support targeting the cementum-rich root surface. As a side note, using only the root of a tooth for the aDNA extraction facilitates leaving the crown intact, which can then be used for morphological analyses. The removed dentine remains a suitable material for stable isotope analyses or radiocarbon dating.
Recommendations and conclusion
We recommend following these five points when extracting aDNA from ancient bone or teeth:
Apply a brief pre-digestion step (15–30 mins).
If sufficient material is available, then run several extractions in parallel with differing pre-digestion lengths.
Do not discard the supernatant (the pre-digest) at first, as it will contain a fraction of endogenous DNA molecules.
Do not discard undigested bone pellets post-24-hour digestion, as they are likely to contain a higher endogenous DNA fraction than the first extraction (whether this was pre-digested or not).
Sample the surface of teeth roots in favor of the inner dentine but remove the outermost surface layer to minimize the risk of including DNA contamination.
We advocate caution when implementing a pre-digestion step if only a small amount of sample material is available (i.e. <50 mg). In such cases it is not recommendable to pre-digest the sample because the DNA concentration in the final extract may become critically low. Finally, we note that if pre-digestion is combined with other methods that have demonstrated an enrichment in the endogenous DNA fraction, such as the capture of mitogenomes46 or whole genomes19, single-stranded sequencing libraries47, or damage-enriched single-stranded sequencing libraries48, it is likely to result in a many-fold increase in the endogenous DNA proportion. We believe that an implementation of the simple and inexpensive procedures we have here described will increase the discovery rate of ancient samples that are suitable for genomic research, but the methods should prove equally useful for other disciplines working with degraded DNA such as forensic sciences.
Additional Information
How to cite this article: Damgaard, P. B. et al. Improving access to endogenous DNA in ancient bones and teeth. Sci. Rep. 5, 11184; doi: 10.1038/srep11184 (2015).
Supplementary Material
Acknowledgments
We thank our colleagues Anna-Sapfo Malaspinas, José Victor Moreno-Mayar, Morten Rasmussen, Thorfinn Korneliussen, Lasse Vinner, and Peter Ilsøe for insightful discussions, and the Danish National High-throughput DNA Sequencing Centre and Jesper Stenderup for technical assistance. We are grateful to Erik Thorsby (Dept. of Immunology, University of Oslo), Reidar Solsvik (The Kon-Tiki Museum), and Per Holck (The Anatomical Institute, University of Oslo) for access to the Easter Island samples. We thank Sidsel Wåhlin (The Historical Museum of Vendsyssel) for the Danish Iron Age samples, Jette Arneborg (The Greenland National Museum and Archives) and Niels Lynnnerup (Dept. of Forensic Medicine, University of Copenhagen) for the Greenlandic samples, Jacob Mosekilde (Museum of Copenhagen) for the Danish Trinitatis samples, and Menno Hoogland (Leiden University) for the samples from Guadeloupe. The Bronze Age samples from Hungary were collected and analysed as part of “The Rise” project funded by the European Research Council (FP/2007–2013, grant no. 269442 to Kristian Kristiansen). The Danish and Greenlandic samples (Trinitatis, Iron Age, and Norse Eastern Settlement in Greenland) were collected and analysed as part of “The Genomic History of Denmark” project (KU2016) funded by the University of Copenhagen. HS was supported by a Synergy Grant from the European Research Council (FP7/2007–2013, grant no. 319209). PBD was supported in part by the Siemens Foundation, and MEA was supported by the Marie Curie Actions of the European Union (FP7/2007–2013, grant no. 300554) and the Villum Foundation (Young Investigator Programme , grant no. 10120). GeoGenetics is supported by the Lundbeck Foundation and the Danish National Research Foundation.
Footnotes
Author Contributions M.E.A. and E.W. designed the study. P.B.D. carried out the lab work with assistance from M.E.A., H.S. and A.M. P.B.D. and M.E.A. analyzed the data and wrote the manuscript, with considerable input from L.O. and the remaining authors.
References
- Der Sarkissian C. et al. Ancient genomics. Philos. Trans. R. Soc. B Biol. Sci. 370, 20130387 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Green R. E. et al. A draft sequence of the Neandertal genome. Science 328, 710–722 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prüfer K. et al. The complete genome sequence of a Neanderthal from the Altai Mountains. Nature 505, 43–49 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krause J. et al. The complete mitochondrial DNA genome of an unknown hominin from southern Siberia. Nature 464, 894–897 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meyer M. et al. A high-coverage genome sequence from an archaic Denisovan individual. Science 338, 222–226 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reich D. et al. Genetic history of an archaic hominin group from Denisova Cave in Siberia. Nature 468, 1053–1060 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raghavan Maanasa, et al. “Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans.” Nature 505, 87–91 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raghavan M. et al. The genetic prehistory of the New World Arctic. Science 345, 1255832 (2014). [DOI] [PubMed] [Google Scholar]
- Rasmussen M. et al. Ancient human genome sequence of an extinct Palaeo-Eskimo. Nature 463, 757–762 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rasmussen M. et al. The genome of a Late Pleistocene human from a Clovis burial site in western Montana. Nature 506, 225–229 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orlando Ludovic et al. “Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse.” Nature 499, 74–78 (2013). [DOI] [PubMed] [Google Scholar]
- Fu Q. et al. Genome sequence of a 45,000-year-old modern human from western Siberia. Nature 514, 445–449 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seguin-Orlando A. et al. Genomic structure in Europeans dating back at least 36,200 years. Science 346, 1113–1118 (2014). [DOI] [PubMed] [Google Scholar]
- Gamba C. et al. Genome flux and stasis in a five millennium transect of European prehistory. Nat. Commun. 5, 5257 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keller A. et al. New insights into the Tyrolean Iceman’s origin and phenotype as inferred by whole-genome sequencing. Nat. Commun. 3, 698 (2012). [DOI] [PubMed] [Google Scholar]
- Lazaridis I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409–413 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olalde I. et al. Derived immune and ancestral pigmentation alleles in a 7,000-year-old Mesolithic European. Nature 507, 225–228 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skoglund P. et al. Genomic Diversity and Admixture Differs for Stone-Age Scandinavian Foragers and Farmers. Science 344, 747–750 (2014). [DOI] [PubMed] [Google Scholar]
- Carpenter M. L. et al. Pulling out the 1%: whole-genome capture for the targeted enrichment of ancient DNA sequencing libraries. Am. J. Hum. Genet. 93, 852–864 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rizzi E., Lari M., Gigli E., De Bellis G. & Caramelli D. Ancient DNA studies: new perspectives on old samples. Genet Sel Evol 44, 21 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poinar H. N. et al. Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA. Science 311, 392–394 (2006). [DOI] [PubMed] [Google Scholar]
- Sarkissian C. et al. Shotgun microbial profiling of fossil remains. Mol. Ecol. 23, 1780–1798 (2014). [DOI] [PubMed] [Google Scholar]
- Campos P. F. et al. DNA in ancient bone–Where is it located and how should we extract it? Ann. Anat.-Anat. Anz. 194, 7–16 (2012). [DOI] [PubMed] [Google Scholar]
- Schwarz Carsten, et al. “New insights from old bones: DNA preservation and degradation in permafrost preserved mammoth remains.” Nucleic Acids Research 37, 3215–3229 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jans Miranda ME. “Microbial bioerosion of bone–a review.” In Current developments in bioerosion, pp. 397–413. Springer Berlin Heidelberg, 2008 [Google Scholar]
- Collins M. J. et al. The survival of organic matter in bone: a review. Archaeometry 44, 383–394 (2002). [Google Scholar]
- Ginolhac A. et al. Improving the performance of true single molecule sequencing for ancient DNA. BMC Genomics 13, 177 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orlando L. et al. True single-molecule DNA sequencing of a pleistocene horse bone. Genome Res. 21, 1705–1719 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Adler C. J., Haak W., Donlon D. & Cooper A. Survival and recovery of DNA from ancient teeth and bones. J. Archaeol. Sci. 38, 956–964 (2011). [Google Scholar]
- Higgins D. & Austin J. J. Teeth as a source of DNA for forensic identification of human remains: a review. Sci. Justice 53, 433–441 (2013). [DOI] [PubMed] [Google Scholar]
- Higgins D., Kaidonis J., Townsend G., Hughes T. & Austin J. J. Targeted sampling of cementum for recovery of nuclear DNA from human teeth and the impact of common decontamination measures. Investig. Genet. 4, 18 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trivedi R., Chattopadhyay P. & Kashyap V. K. A new improved method for extraction of DNA from teeth for the analysis of hypervariable loci. Am. J. Forensic Med. Pathol. 23, 191–196 (2002). [DOI] [PubMed] [Google Scholar]
- Gilbert M. T. P., Bandelt H.-J., Hofreiter M. & Barnes I. Assessing ancient DNA studies. Trends Ecol. Evol. 20, 541–544 (2005). [DOI] [PubMed] [Google Scholar]
- Willerslev E. & Cooper A. Review paper. ancient dna. Proc. R. Soc. B Biol. Sci. 272, 3–16 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Briggs A. W. et al. Patterns of damage in genomic DNA sequences from a Neandertal. Proc. Natl. Acad. Sci. 104, 14616–14621 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meyer M. & Kircher M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 6, (2010). [DOI] [PubMed] [Google Scholar]
- Lindgreen S. AdapterRemoval: easy cleaning of next-generation sequencing reads. BMC Res. Notes 5, 337 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H. & Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schubert M. et al. Improving ancient DNA read mapping against modern reference genomes. BMC Genomics 13, 178 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deagle B. E., Eveson J. P. & Jarman S. N. Quantification of damage in DNA recovered from highly degraded samples–a case study on DNA in faeces. Front. Zool. 3, 11 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allentoft M. E. et al. The half-life of DNA in bone: measuring decay kinetics in 158 dated fossils. Proc. R. Soc. B Biol. Sci. 279, 4724–4733 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jónsson Hákon, et al. “mapDamage2. 0: fast approximate Bayesian estimates of ancient DNA damage parameters.” Bioinformatics 29, 1682–1684 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allentoft M. E. Recovering samples for ancient DNA research—guidelines for the field archaeologist. Antiquity 87, http://www.antiquity.ac.uk/projgall/allentoft338/ (2013). (Date of access:27/04/15) [Google Scholar]
- Fu Q. et al. A revised timescale for human evolution based on ancient mitochondrial genomes. Curr. Biol. 23, 553–559 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maricic T., Whitten M. & Pääbo S. Multiplexed DNA sequence capture of mitochondrial genomes using PCR products. PloS One 5, DOI: 10.1371/journal.pone.0014004 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gansauge M.-T. & Meyer M. Single-stranded DNA library preparation for the sequencing of ancient or damaged DNA. Nat. Protoc. 8, 737–748 (2013). [DOI] [PubMed] [Google Scholar]
- Gansauge M.-T. & Meyer M. Selective enrichment of damaged DNA molecules for ancient genome sequencing. Genome Res. 24, 1543–1549 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.