Ancient human genome sequence of an extinct Palaeo-Eskimo

Morten Rasmussen; Yingrui Li; Stinus Lindgreen; Jakob Skou Pedersen; Anders Albrechtsen; Ida Moltke; Mait Metspalu; Ene Metspalu; Toomas Kivisild; Ramneek Gupta; Marcelo Bertalan; Kasper Nielsen; M Thomas P Gilbert; Yong Wang; Maanasa Raghavan; Paula F Campos; Hanne Munkholm Kamp; Andrew S Wilson; Andrew Gledhill; Silvana Tridico; Michael Bunce; Eline D Lorenzen; Jonas Binladen; Xiaosen Guo; Jing Zhao; Xiuqing Zhang; Hao Zhang; Zhuo Li; Minfeng Chen; Ludovic Orlando; Karsten Kristiansen; Mads Bak; Niels Tommerup; Christian Bendixen; Tracey L Pierre; Bjarne Grønnow; Morten Meldgaard; Claus Andreasen; Sardana A Fedorova; Ludmila P Osipova; Thomas F G Higham; Christopher Bronk Ramsey; Thomas v O Hansen; Finn C Nielsen; Michael H Crawford; Søren Brunak; Thomas Sicheritz-Pontén; Richard Villems; Rasmus Nielsen; Anders Krogh; Jun Wang; Eske Willerslev

doi:10.1038/nature08835

. Author manuscript; available in PMC: 2014 Mar 13.

Published in final edited form as: Nature. 2010 Feb 11;463(7282):757–762. doi: 10.1038/nature08835

Ancient human genome sequence of an extinct Palaeo-Eskimo

Morten Rasmussen ^1,^2,^*, Yingrui Li ^2,^3,^*, Stinus Lindgreen ^1,^4,^*, Jakob Skou Pedersen ⁴, Anders Albrechtsen ⁴, Ida Moltke ⁴, Mait Metspalu ⁵, Ene Metspalu ⁵, Toomas Kivisild ^5,⁶, Ramneek Gupta ⁷, Marcelo Bertalan ⁷, Kasper Nielsen ⁷, M Thomas P Gilbert ^1,², Yong Wang ⁸, Maanasa Raghavan ^1,⁹, Paula F Campos ¹, Hanne Munkholm Kamp ^1,⁴, Andrew S Wilson ¹⁰, Andrew Gledhill ¹⁰, Silvana Tridico ^11,¹², Michael Bunce ¹², Eline D Lorenzen ¹, Jonas Binladen ¹, Xiaosen Guo ^2,³, Jing Zhao ^2,³, Xiuqing Zhang ^2,³, Hao Zhang ^2,³, Zhuo Li ^2,³, Minfeng Chen ^2,³, Ludovic Orlando ¹³, Karsten Kristiansen ^2,^3,⁴, Mads Bak ¹⁴, Niels Tommerup ¹⁴, Christian Bendixen ¹⁵, Tracey L Pierre ¹⁶, Bjarne Grønnow ¹⁷, Morten Meldgaard ¹⁸, Claus Andreasen ¹⁹, Sardana A Fedorova ^5,²⁰, Ludmila P Osipova ²¹, Thomas F G Higham ⁹, Christopher Bronk Ramsey ¹⁰, Thomas v O Hansen ²², Finn C Nielsen ²², Michael H Crawford ²³, Søren Brunak ^7,²⁴, Thomas Sicheritz-Pontén ⁷, Richard Villems ⁵, Rasmus Nielsen ^4,⁸, Anders Krogh ^2,⁴, Jun Wang ^2,^3,⁴, Eske Willerslev ^1,²

¹Centre for GeoGenetics, Natural History Museum of Denmark and Department of Biology, University of Copenhagen, Universitetsparken 15, DK-2100 Copenhagen, Denmark

²Sino-Danish Genomics Center, BGI-Shenzhen, Shenzhen 518083, China, and University of Copenhagen, DK-2100 Copenhagen, Denmark

³BGI-Shenzhen, Shenzhen 518083, China

⁴Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen, Denmark

⁵Department of Evolutionary Biology, Tartu University and Estonian Biocentre, 23 Riia Street, 510101 Tartu, Estonia

⁶Leverhulme Centre for Human Evolutionary Studies, Department of Biological Anthropology, Henry Wellcome Building, Fitzwilliam Street, University of Cambridge, Cambridge CB2 1QH, UK

⁷Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Lyngby, Denmark

⁸Departments of Integrative Biology and Statistics, UC-Berkeley, 4098 VLSB, Berkeley, California 94720, USA

⁹Research Laboratory for Archaeology and the History of Art, Dyson Perrins Building, South Parks Road, Oxford OX1 3QY, UK

¹⁰Department of Archaeological Sciences, School of Life Sciences, University of Bradford, West Yorkshire, Bradford BD7 1DP, UK

¹¹Biological Criminalistics, Australian Federal Police, 1 Unwin Place, Weston, ACT 2611, Australia

¹²Ancient DNA Laboratory, School of Biological Sciences and Biotechnology, Murdoch University, Perth 6150, Australia

¹³Paleogenetics and Molecular Evolution, Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Lyon 1, CNRS, INRA, Ecole Normale Supérieure de Lyon, 46 allée d'Italie, 69364 Lyon Cedex 07, France

¹⁴Wilhelm Johannsen Centre For Functional Genome Research, University of Copenhagen, Department of Cellular and Molecular Medicine, The Panum Institute, Blegdamsvej 3A, DK-2200 Copenhagen, Denmark

¹⁵Department of Genetics and Biotechnology, Aarhus University, Blichers Allé 20, PO BOX 50, DK-8830 Tjele, Denmark

¹⁶Department of Biological Anthropology, University of Cambridge, Cambridge CB2 3QY, UK

¹⁷Ethnographic Collections, National Museum of Denmark, Frederiksholms Kanal 12, DK-1220 Copenhagen, Denmark

¹⁸Natural History Museum of Denmark, University of Copenhagen, Øster Voldgade 5-7, DK-1350 Copenhagen, Denmark

¹⁹Greenland National Museum and Archives, PO Box 145, DK-3900 Nuuk, Greenland

²⁰Department of Molecular Genetics, Yakut Research Centre, Russian Academy of Medical Sciences, 4 Sergelyahonskoe Shosse, Yakutsk 677019, Sakha, Russia

²¹The Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Lavrentyeva Ave. Novosibirsk 630090, Russia

²²Department of Clinical Biochemistry, Rigshospitalet, University of Copenhagen, DK-2100 Copenhagen, Denmark

²³Department of Anthropology, University of Kansas, Lawrence, Kansas 66045, USA

²⁴Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3A, DK-2200 Copenhagen, Denmark

^✉

Correspondence and requests for materials should be addressed to E.W. (ewillerslev@snm.ku.dk) or J.W. (wangj@genomics.org.cn)

These authors contributed equally to this work.

PMCID: PMC3951495 NIHMSID: NIHMS467171 PMID: 20148029

Abstract

We report here the genome sequence of an ancient human. Obtained from ∼4,000-year-old permafrost-preserved hair, the genome represents a male individual from the first known culture to settle in Greenland. Sequenced to an average depth of 20×, we recover 79% of the diploid genome, an amount close to the practical limit of current sequencing technologies. We identify 353,151 high-confidence single-nucleotide polymorphisms (SNPs), of which 6.8% have not been reported previously. We estimate raw read contamination to be no higher than 0.8%. We use functional SNP assessment to assign possible phenotypic characteristics of the individual that belonged to a culture whose location has yielded only trace human remains. We compare the high-confidence SNPs to those of contemporary populations to find the populations most closely related to the individual. This provides evidence for a migration from Siberia into the New World some 5,500 years ago, independent of that giving rise to the modern Native Americans and Inuit.

Recent advances in DNA sequencing technologies have initiated an era of personal genomics. Eight human genome sequences have been reported so far, for individuals with ancestry in three distinct geographical regions: a Yoruba African^1,2, four Europeans^2–5, a Han Chinese⁶, and two Koreans^7,8, and soon this data set will expand significantly as the ‘1000 genomes’ project is completed.

From an evolutionary perspective, however, modern genomics is restricted by not being able to uncover past human genetic diversity and composition directly. To access such data, ancient genomic sequencing is needed. Presently no genome from an ancient human has been published, the closest being two data sets representing a few megabases (Mb) of DNA from a single Neanderthal^9,10. Contamination and DNA degradation have also compromised the possibility of obtaining high sequence depth¹¹, and no ancient nuclear genome has been sequenced deeper than about 0.7×¹²—a level insufficient for genotyping and exclusion of errors owing to sequencing or postmortem DNA damage¹³.

In 2008 we used permafrost-preserved hair from one of the earliest individuals that settled in the New World Arctic (northern Alaska, Canada and Greenland) belonging to the Saqqaq Culture (a component of the Arctic Small Tool tradition; approximately 4,750–2,500 ¹⁴C years before present (yr bp))^14,15 to generate the first complete ancient human mitochondrial DNA (mtDNA) genome¹⁶. A total of 80% of the recovered DNA was human, with no evidence of modern human contaminant DNA. Thus, the specimen is an excellent candidate upon which to sequence the first ancient human nuclear genome. Although cultural artefacts from the Arctic Small Tool tradition are found many places in the New World Arctic, few human remains have been recovered. Thus, the sequencing project described here is a direct test of the extent to which ancient genomics can contribute knowledge about now-extinct cultures, from which little is known about their phenotypic traits, genetic origin and biological relationship to present-day populations.

Sample characteristics, DNA quality and sequencing strategy

The specimen used for genomic sequencing is the largest (approximately 15 × 10 cm) of four human hair tufts excavated directly from culturally deposited permafrozen sediments at Qeqertasussuk (Fig. 1a, b). Stable light isotope analyses of the Saqqaq hair (carbon and nitrogen) revealed that the individual relied on high trophic level marine food resources (Fig. 1e and Supplementary information. Accelerator mass spectrometry (AMS) radiocarbon dating of the hair sample produced a date of 4,044 ± 31 ¹⁴C yr bp and 4,170–3,600 cal. yr bp when correcting for local marine reservoir effect (Supplementary information). Despite its age, morphological analysis of the hair tuft using light and scanning electron microscopes indicated excellent overall preservation (Fig. 1c, d and Supplementary information).

a, Location of the Saqqaq Culture site Qeqertasussuk, north-western Greenland (after ref. 15). b, Saqqaq hair sample. c, Saqqaq and modern hair shafts on a comparison microscope. d, Comparative cross-sections of modern Caucasian and Saqqaq hairs. e, Carbon and nitrogen isotope measurements on the Saqqaq hair (brown square, Qt 86 profile C 85/261:12 Oxford; pink triangle, Qt 86 profile C 85/261:12 Bradford), another Saqqaq hair sample from a similar context (green diamond, Qt 87 FB 20/20), six ancient Thule (Inuit) samples (purple circle), published data on modern Uummannaq (Greenland) omnivores (orange diamond), and modern Danish omnivores (blue square) and vegans (orange circle). Saqqaq and Thule are shown as averages of 2–3 replicates (Supplementary Information). Error bars, s.d. f, Calibrated ages (before present) on the Saqqaq hair and associated reindeer (Rangifer tarandus) bones, plotted using the INTCAL04 calibration curve⁴⁰, are shown. The human hair dates are calibrated twice, one using a correction for the marine reservoir effect (Supplementary information).

A major concern in ancient DNA studies is post-mortem damage, cytosine to uracil deamination, that can result in erroneous base incorporation^17,18. Such miscoding lesions make it difficult to distinguish true evolutionarily derived substitutions from those that are damage-based, especially if sequence depth is low. It is therefore preferential to exclude damaged DNA molecules before sequencing, if achievable without loss of significant amounts of starting templates. We established the practical feasibility of this, by comparing Illumina sequencing libraries that were initially enriched using two different DNA polymerase enzymes: (1) Phusion polymerase (Finnzymes) as suggested in Illumina's own library preparation protocol, which is not able to replicate through uracil¹⁹; and (2) Platinum Taq High Fidelity (HiFi, Invitrogen) polymerase, that can replicate through uracil (Supplementary information).

Results allowed us to estimate an overall deamination-based damage rate in the Saqqaq genome of <1%, which is, as expected, lower than the rate obtained from GS FLX sequencing¹⁶ (Supplementary information). We also found undamaged sequences to be slightly shorter on average than those containing damage (55 base pairs (bp) for Phusion versus 59 bp for HiFi). However, given that GS FLX shotgun sequencing shows an average molecular length of <76 bp in the Saqqaq hair sample¹⁶ (a known overestimate due to automatic computational filtering of short reads), and that quantitative polymerase chain reaction (qPCR) revealed high copy numbers of short fragments (approximately 1.8 million copies per microlitre DNA extract of 85-bp mtDNA), dropping roughly exponentially with sequence length (Supplementary Fig. 10), we concluded that excluding damaged molecules makes little difference to the number of starting DNA molecules available for initial sequence enrichment.

Ancient human DNA is particularly susceptible to contamination by modern DNA²⁰. Although the qPCR results confirmed that DNA preservation in the Saqqaq hair is excellent as judged by ancient DNA standards²¹, we undertook several actions before sequencing to minimize and control for contamination. In addition to using a decontamination protocol that has previously proven successful on the Saqqaq hair sample¹⁶, we also used indexing adaptors and primers in the library preparations¹³, such that any possible contamination entering the samples after they left the ancient DNA clean laboratory in Copenhagen could be easily detected (Supplementary information). This ensures that any possible human contamination should reveal itself as being of European origin, given that any handling steps before indexing were carried out only by ethnic northern Europeans (Supplementary information).

Sequencing and assembly

Twelve DNA libraries were built in the dedicated Copenhagen ancient DNA laboratory, several indexed enrichment PCRs were carried out, and each was sequenced on an average of three lanes using Illumina GAII sequencing platforms at BGI-Shenzhen. In addition, two sequencing runs were completed at Illumina's facilities in Hayward, California and Chesterford, England. With few exceptions, 70 cycles of single-read sequencing were performed, always followed by a 6-bp indexing read (Supplementary Information). The sequencing yielded a total of 3.5 billion reads, from a total of 242 lanes.

Sequences not carrying a 100% match in the index read were excluded from all downstream analyses. This allowed 93.17% of all reads to be attempted to be mapped to the human reference genome (hg18) using a suffix array-based mapping strategy that permits identification of residual primer sequence expected from the libraries of short ancient DNA fragments (Supplementary information). Primer trimming was carried out as an integrated part of the mapping during the alignment of each read to the genome. Specifically, for all positions a check was made as to whether a better alignment could be made between the remainder of the read and the primer. If found, this position in the read was used to cut off the primer (Supplementary information). This provided an average mapped read length of 55.27 nucleotides. Of the correctly indexed reads, 49.2% could be mapped uniquely (46% of total reads). Reads with multiple matches or no matches were discarded (Fig. 2a). Analysis of the reads with no matches indicated that most were unidentifiable, whereas the remainder were of microbial eukaryote, viral, or bacterial origin (Fig. 2b). Read sequences from the same library that were mapped to the reference genome with same start and end positions were considered clonal, and were collapsed to single sequences with higher quality scores (Supplementary information). This resulted in a final data set of 28.47% of all reads. Additionally, to avoid erroneous SNP calls due to insertions and deletions, we discarded the last seven nucleotides from the 3′ end of the mapped reads, yielding a final average read size of 48 nucleotides. This provides an average depth of 20× across 79% of the genome (Fig. 2a). Given a maximum read length of 70 bp and an average mapped read length of 55 bp, we estimate that it is theoretically possible to cover some 85–87% of the genome (Supplementary information), meaning that we are close to having sequenced all that is feasible with the technology at hand. Approximately one-half of the positions are covered with a depth. >7×, with some variation along the chromosomes, largely explained by repetitive structures in the genome, which can both artificially raise or lower the depth locally (Fig. 2c–f).

a, Flow diagram summarizing our data pipeline. b, Distribution of all reads among the major taxonomic groups. c, Cumulative distribution of reads across the genome, for all positions (black), only repeat regions (blue) or exclusive repeat regions (green). d, The read depth (green) varies along the chromosomes. e, Much of this variation can be attributed to the repetitive structure of the genome and is especially pronounced in highly repetitive regions, for example, flanking the centromere, but is also observed in regions with genes (shades of blue). f, Simple repeats, here (CAGC)_n (grey), are common in assembly gaps and therefore cause alleviated read depths. In contrast, a recent segmental duplication (black, with differences in red) will prevent reads from mapping uniquely and lower the read depth.

Genotyping and comparative genomic analyses

For genotyping, we developed a probabilistic model of the sampling of reads from the diploid genome, called SNPest, which takes quality scores and different sources of read errors into account. For the sex chromosomes and the mtDNA a haploid model was used. Given the mapped reads and their quality scores, we assigned the most probable genotype to each position (Supplementary information). We performed genotyping on all positions, using all available read information for depths ≤200×. For read depths >200×, we based the genotyping on 200 randomly sampled reads. This simplification was shown to have negligible effect on the results while speeding up the calculations markedly (Supplementary information). This resulted in 2.2 million SNPs (Fig. 2a), of which 86.2% have previously been reported (dbSNPv130).

We additionally defined a high-quality subset of SNPs, based on positions with read depth between 10× and 50×, to avoid poorly covered and repetitive regions with extreme read depth. We also demanded that these SNPs have posterior probabilities of >0.9999, not to be positioned in annotated repeat regions, and to have a distance of at least 5 bp to the closest neighbouring SNP to account for insertion and/or deletion (indel) errors⁶. This provided a total of 353,151 SNPs with a 93.2% overlap with dbSNP (v130) (Fig. 2a).

The mtDNA genome was sequenced to an average depth of 3,802×. The consensus was identical to that previously recovered by GS FLX sequencing, except that a single position previously called as a heterozygote¹⁶ was now called as a C. Using the diploid model, no high-confidence heterozygotes were found. Applying the diploid model to the X chromosome resulted in 1,707 homozygote (versus 3,071 with the haploid model) and 76 heterozygote high-confidence SNPs. Of the latter, 29% can be explained by known indels and structural variation, whereas the remaining can be referred to mapping errors in repetitive regions (Supplementary information). For the Saqqaq Y chromosome, we found 23 homozygote (versus 243 with the haploid model) and 445 heterozygote high-confidence SNPs. We explain the latter by the well-known fact that human Y chromosomes are difficult to assemble due to structural and repetitive regions²². Importantly, the number of heterozygote SNPs found in the X and Y chromosomes when changing to the diploid model are similar to those from modern human genome sequencing (Supplementary information).

Assessing contamination using the frequency of private European alleles (as defined in the human genome diversity project) as an estimator and a fixed error rate from the observed neighbouring bases, we estimate the raw read contamination to be at most 0.8% (standard error (s.e.) ± 0.2%) (Supplementary information), a level that will not affect our high-confidence genotype calls and will have a negligible effect otherwise.

We investigated the Saqqaq individual for signs of inbreeding using two new statistical approaches that circumvent the problem of uncertainty in the genotype calls of heterozygotes, using the Siberian populations from Supplementary Table 12 as a reference. The methods provide a genome-wide estimate of the inbreeding coefficient (F) and identify regions of identity by descent (IBD) across the genome (Supplementary Fig. 13). The estimated value of F is 0.06 (s.e. 0.011) assuming no genotyping errors, which is equivalent to an offspring of two first cousins, but could have been caused by other family relationships of the parents (Supplementary Information). A positive value of F could possibly also be explained by population subdivision between the Saqqaq population and the Siberian reference population, or by natural selection. However, as many IBD tracts are >10 Mb, far longer than the extent of linkage disequilibrium in the human genome, inbreeding within the Saqqaq population is more likely.

Functional SNP assessment

Although the relationship between risk allele and causation is still in its infancy²³, some phenotypic traits can possibly be inferred from the genome data (all functional SNPs discussed below are listed in Supplementary Table 14). We only included genotypes with a posterior probability above 99%.

Given the A1 antigen allele plus encoding of the rhesus factor in combination with lack of B antigen and the O antigen frameshift mutation, we conclude that the Saqqaq individual had blood type A+²⁴. Although common in all ethnic groups, this has very high frequencies in populations of the east coast of Siberia down to mid China²⁵. Furthermore, we find a combination of four SNPs at the HERC2-OCA2 locus, which among Asians is strongly associated with brown eyes²⁶. SNPs on chromosomes 2, 5, 15 and X suggest that he probably did not have a European light skin colour²⁷, had dark and thick hair^28,29 (in agreement with the morphological examination (Fig. 1b–d)), and an increased risk of baldness^30,31. The same SNP that is characteristic of hair thickness also suggests that he probably had shovel-graded front teeth—a characteristic trait of Asian and Native American populations³². An AA genotype SNP (forward strand) on chromosome 16 is consistent with the Saqqaq individual having earwax of the dry type that is typical of Asians and Native Americans, rather than the wet earwax type dominant in other ethnic groups³³. In addition, the combined influence of 12 SNPs on metabolism and body mass index indicates that the Saqqaq individual was adapted to a cold climate (see Supplementary information and Supplementary Table 14).

Population genetics context of the Saqqaq individual

The origin of the Saqqaq and other Palaeo-Eskimo cultures, and their relationship to present-day populations, has been debated since they were first discovered in the 1950s³⁴. Competing theories have attributed the origins to offshoots of the populations that gave rise to Native American populations such as the Na-Dene of North America, alternatively from the same source as the Inuit currently inhabiting the New World Arctic, or from still other sources entering the New World even later than both the Native American and Inuit ancestors (for summary see ref. 35).

A recent SNP genotyping study³⁶ of the HGDP-CEPH panel of 51 populations has provided comprehensive global coverage of modern human genomic variation, but is limited with respect to Arctic populations. Therefore, we carried out Illumina Bead-Array-based genotyping on four native North American and twelve north Asian populations (Supplementary Table 12). A total of 95,502 SNPs from the resulting combined data set of 35 Eurasian and American populations was covered by high-quality data in the Saqqaq genome and was subject to further analyses (Fig. 3a–c and below).

a, Locations of the studied populations are shown with the most relevant populations indicated by name (numbers in circles correspond to the nr column in Supplementary Table 12). b, PCA plot (PC1 versus PC2) of the studied populations and the Saqqaq genome. c, Ancestry proportions of the studied 492 individuals from 35 extant American and Eurasian populations and the Saqqaq individual as revealed by the ADMIXTURE program³⁹ with K = 5. Each individual is represented by a stacked column of the five proportions, with fractions indicated on the y axis. The analysis assumes no grouping information. The samples are sorted by region/population only after the analysis. For better readability the Saqqaq individual is shown in three columns. Populations added to the published collection³⁶ are shown in bold. Red dots in the expanded plot indicate four individuals whose ancestry proportion pattern showed the highest correlation (Kendall τ >0.95; P <0.05) with that of the Saqqaq individual. d, The phylogenetic tree of Y chromosome haplogroup Q. The position of the Saqqaq individual is ascertained by markers shown on the tree. Information for markers shown in parentheses is missing and their status is therefore inferred. Haplogroup names are according to ref. 38; hash symbol indicates error in reference (Supplementary information).

Principal component analysis (PCA) was used to capture genetic variation. PC1 distinguishes west Eurasians from east Asians and Native Americans, whereas the PC2 captures differentiation between native Asians and Americans (Fig. 3b). Importantly, the PC1 versus PC2 plot shows that the Saqqaq individual falls in the vicinity of three Old World Arctic populations—Nganasans, Koryaks and Chukchis, while being more distantly related to the New World groups (Amerinds, Na-Dene and Greenland Inuit). Koryaks and Chukchis inhabit Chukotka and northern Kamchatka of the Siberian far east. Ethnography describes these groups as having a diverse subsistence economy based on terrestrial and marine hunting as well as reindeer herding. The Nganasans inhabit the Taimyr Peninsula, some 2,000 km from the Bering Strait and are the northernmost living Old World population. Although historically Nganasans have been terrestrial rather than marine hunters, Zhokov, the oldest archaeological Arctic hunting site with a significant marine component (polar bear) on the New Siberian Islands (dating back some 7,000–8,000 yr bp³⁷), is found just east of the Nganasans' current occupation area. In addition, our analysis of more than two hundred Y chromosome SNPs (Supplementary Information) allowed us to assign the Saqqaq individual to Y chromosome haplogroup Q1a (Fig. 3d), commonly found among Siberian and Native American populations³⁸. The mtDNA genome shows close relatedness to Aleuts of Commander Islands (situated in the Bering Sea) and Siberian Sireniki Yuits (Asian Eskimos) as previously described¹⁶.

We explored the data using the algorithm ADMIXTURE³⁹, which assumes a specified number of hypothetical populations (K) and provides a maximum likelihood estimate of allele frequencies for each population and admixture proportion for each individual. We investigated values of K, from K = 2 to K = 10, repeating computing 100 times for each value of K to monitor convergence (Supplementary Information). Figure 3c shows the pattern of distinct colour-coded components at K =5. The analysis suggests that there is a significant amount of west Eurasian admixture in most of the Siberian, Greenland and North American populations. As with the other analyses, this analysis was unable to detect any west Eurasian admixture in the Saqqaq individual, in agreement with a very low level of contamination in our assembled genome. The Saqqaq individual is also practically devoid of the component distinctive to South and Central American populations (dark brown in Fig. 3c). Thus, at K = 5, the Saqqaq genome is comprised of three ethnic influences, specifically the ones characteristic of native populations in East Asia, Siberia in particular, and the Arctic, on both sides of the Bering Strait (Fig. 3c). In this respect the populations closest to the Saqqaq are Koryaks and Chukchis. Importantly, in contrast to Saqqaq and Koryaks, modern Greenlanders carry clear evidence of admixture or shared ancestry with Amerindians. Moreover, at K = 5, the Inuit do not display genetic components of Siberians other than the ‘Beringian’ seen in Chukchis and Koryaks. The admixture results are in agreement with the PCA plots and suggest shared common ancestry of Saqqaq and modern Inuit before the movement of the former to the New World.

We additionally used a population genetic model to obtain maximum likelihood estimates of the divergence times between the Saqqaq individual and the reference populations (Supplementary Information). The population with the shortest divergence time was Chukchis, with an estimated divergence time of approximately 0.043 (±0.08) N_e generations, where N_e is the effective population size. In contrast, the estimated divergence times to the other closely related populations—Na-Dene, Koryaks and Nganasans—were 0.093, 0.11 and 0.089, respectively. The estimated divergence time to the Han Chinese, a more distantly related population, was 0.20. These estimates can be converted to estimates of years or generations, by making assumptions regarding the effective population sizes of the reference populations. The effective population sizes are in general unknown, but can be estimated from DNA sequence data, and are generally much smaller than the census sizes (Supplementary information). We found no evidence in favour of changes in population size. Even when accounting for the uncertainty in the estimate of the mtDNA mutation rate, and possible biases related to the genotyping data, it is still unlikely that N_e >5,000, providing a maximal divergence time between Chukchis and Saqqaqs of 175–255 generations or between 4,400 and 6,400 years. The oldest archaeological evidence of the Arctic Small Tool tradition in the New World is from Kuzitrin Lake, Alaska, dating back ∼5,500 cal. yr bp¹⁴, indicating that the ancestral Saqqaq separated from their Old World relatives almost immediately before their migration into the New World.

Conclusion

We report the successful genome sequencing of a ∼4,000-year-old human. Data authenticity is supported by: (1) the private SNP analyses that indicate contamination levels in the raw sequence data to be ≤0.8%; (2) the mtDNA and Y-chromosome DNA haplotypes fit within haplogroups typical of north-east Asia; (3) population admixture analyses do not record any European component in the Saqqaq genome; and (4) the PCA plots clearly reveal close affiliation of the Saqqaq genome to those of contemporary north-east Siberian populations. These observations, coupled with evidence of excellent DNA preservation, and sample handling being restricted to northern Europeans before incorporation of a sequence indexing, indicate that contamination in the Saqqaq genome is not of concern. Our study thus demonstrates that it is possible to sequence the genome of an ancient human to a level that allows for SNP and population analyses to take place. It also reveals that such genomic data can be used to identify important phenotypic traits of an individual from an extinct culture that left only minor morphological information behind. Additionally, the ancient genomic data prove important in addressing past demographic history by unambiguously showing close relationship between Saqqaq and Old World Arctic populations (Nganasans, Koryaks and Chukchis). A single individual may, or may not, be representative of the extinct culture that inhabited Greenland some 4,000 yr bp. Nevertheless, we may conclude that he, and perhaps the group that once crossed the Bering Strait, did this independently from the ancestors of present-day Native Americans and Inuit, and that he shares ancestry with Arctic north-east Asians, genetic structure components of which can be identified in many of the present-day people on both sides of the Bering Sea. The next technical challenge will be to sequence an ancient human genome from material outside the permafrost regions. Although undoubtedly challenging, it will, if successful, take the emerging field of palaeogenomics to yet another level.

Methods Summary

DNA was extracted from a ∼4,000-year-old hair sample recovered from Qeqertasussuk, Greenland. Indexed Illumina libraries were sequenced following the manufacturer's protocol, and images processed using pipeline v1.4. Reads with correct index were mapped to the human genome (hg 18) with a suffix array-based method that allows for residual primer trimming (Supplementary Information). Genotyping was carried out using a probabilistic model, SNPest, designed to take into account errors specific for ancient samples (Supplementary Information).

Supplementary Material

Supporting Info

NIHMS467171-supplement-Supporting_Info_.pdf^{(3.4MB, pdf)}

Acknowledgments

Centre for Geogenetics, the Copenhagen branch of the Sino-Danish Genomic Centre and Wilhelm Johannsen Centre for Functional Genome Research were supported by Danish National Research Foundation, the Lundbeck Foundation, and the Danish Agency for Science, Technology and Innovation. Center for Biological Sequence Analysis was supported by Villum Kann Rasmussen Fonden; Center for Protein Reseaerch by the Novo Nordisk Foundation. E.W. thanks F. Paulsen for financial support to initiate the project. E.M. thanks Estonian Science Foundation for grant 7858, and R.V. EC DGR for FP7 Ecogene grant 205419 and EU RDF through Centre of Excellence in Genomics grant. J.W. thanks the Shenzhen Municipal Government, the Yantian District local government of Shenzhen, the National Natural Science Foundation of China (30725008), Ole Romer grant from the Danish Natural Science Research Council, the Solexa project (272-07-0196), and Danish Strategic Research Council (2106-07-0021). M.Bu. acknowledges the support of the Australian Research Council. A.K., S.L. and H.M.K. were supported by a grant from the Novo Nordisk Foundation and J.S.P. The Danish Council for Independent Research Medical Sciences. M.H.C. thanks the National Science Foundation for support of the Aleutian and Siberian projects through grants NSF OPP-990590 and OPP-0327676. We thank G. Hudjashov, M. Nelis, L. Anton, V. Soo, A. Wesolowska, H.-H. Staerfeldt, K. Rapacki, P. Wad Sackett, J. Li, H. Yu, Y. Huang, H. Zheng, H. Liang and T. Brand for technical help and Biobase for curation of selected phenotypes and Illumina's core facilities in Hayward, California and Chesterford, England. Use of HGMD Professional was licensed through DTV at the Technical University of Denmark.

Footnotes

Supplementary Information is linked to the online version of the paper at www.nature.com/nature.

Author Contributions E.W. initially conceived and headed the project (J.W. headed research at BGI). M. Rasmussen and E.W. designed the experimental research project setup. T.L.P., M.Mel., C.A., B.G., S.A.F., L.P.O., M.H.C., F.C.N. and R.V. provided samples and/or modern DNA extracts. R.V. carried out Illumina chip analyses on modern populations. T.F.G.H. and C.B.R did the AMS dating. A.S.W., A.G. and S.T. did the morphological analyses. M. Raghavan, A.S.W. and A.G. did the isotope analyses. M.T.P.G., M.Bu. and M.Ras. did ancient DNA extractions. M.Ras. did the library building and qPCR. M.Ras. and P.F.C. did polymerases analyses; ancient extracts were provided by E.D.L. and J.B. Y.L., J.Z., X.G., X.Z., H.Z., Z.L., M.C. and J.W. did the Illumina sequencing and the basic analysis for the sequence raw data. S.L., J.S.P., H.M.K. and A.K. did the method development and data analysis for the genome assembly and genotyping with input from M.Ras., T.S.-P., R.G., M.Be., K.N. and S.B. did the metagenomics, genomic phylogeny and the functional SNP assignment. A.A., I.M., Y.W. and R.N. did the PCA analysis (with input from R.V.), inbreeding estimates and maximum likelihood estimates of the divergence times. M.Met., E.M. and R.V. did the admixture analyses. T.K. did the Y-chromosome analyses. E.W. and J.W. each paid half the genome sequencing costs. E.W. paid the Illumina chip analyses. E.W. and M.Ras. wrote the majority of the manuscript, with critical input from A.K., S.L., J.S.P., R.V., T.K., M.Met., R.N., I.M., A.A., M.T.P.G., T.S.-P., Y.L., J.W. and the remaining authors.

Author Information Sequences have been deposited to the short read archive with accession number SRA010102; summary data are also available via http://www.ancientgenome.dk. Reprints and permissions information is available at www.nature.com/reprints. The authors declare no competing financial interests. This paper is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share-Alike license, and is freely available to all readers at www.nature.com/nature.

References

1.Bentley DR, et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–59. doi: 10.1038/nature07517. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Drmanac R, et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science. 2010;327:78–81. doi: 10.1126/science.1181498. [DOI] [PubMed] [Google Scholar]
3.Levy S, et al. The diploid genome sequence of an individual human. PLoS Biol. 2007;5:e254. doi: 10.1371/journal.pbio.0050254. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Wheeler DA, et al. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–876. doi: 10.1038/nature06884. [DOI] [PubMed] [Google Scholar]
5.Pushkarev D, et al. Single-molecule sequencing of an individual human genome. Nature Biotechnol. 2009;27:847–850. doi: 10.1038/nbt.1561. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Wang J, et al. The diploid genome sequence of an Asian individual. Nature. 2008;456:60–65. doi: 10.1038/nature07484. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Ahn SM, et al. The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group. Genome Res. 2009;19:1622–1629. doi: 10.1101/gr.092197.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Kim JI, et al. A highly annotated whole-genome sequence of a Korean individual. Nature. 2009;460:1011–1015. doi: 10.1038/nature08211. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Noonan JP, et al. Sequencing and analysis of neanderthal genomic DNA. Science. 2006;314:1113–1118. doi: 10.1126/science.1131412. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Green RE, et al. Analysis of one million base pairs of Neanderthal DNA. Nature. 2006;444:330–336. doi: 10.1038/nature05336. [DOI] [PubMed] [Google Scholar]
11.Wall JD, Kim SK. Inconsistencies in neanderthal genomic DNA sequences. PLoS Genet. 2007;3:1862–1866. doi: 10.1371/journal.pgen.0030175. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Miller W, et al. Sequencing the nuclear genome of the extinct woolly mammoth. Nature. 2008;456:387–390. doi: 10.1038/nature07446. [DOI] [PubMed] [Google Scholar]
13.Green RE, et al. The neandertal genome and ancient DNA authenticity. EMBO J. 2009;28:2494–2502. doi: 10.1038/emboj.2009.222. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Harritt R. Paleo-eskimo beginnings in North America a new discovery at Kuzitrin lake, Alaska. Etud Inuit. 1998;22:61–81. [Google Scholar]
15.Meldgaard M. Meddelelser om Grønland, Man&Society. Danish Polar Center; 2004. Ancient Harp Seal Hunters of Disko Bay. Subsistence and Settlement at the Saqqaq Culture Site Qeqertasussuk (2400–1400 BC), West Greenland. [Google Scholar]
16.Gilbert MTP, et al. Paleo-eskimo mtDNA genome reveals matrilineal discontinuity in Greenland. Science. 2008;320:1787–1789. doi: 10.1126/science.1159750. [DOI] [PubMed] [Google Scholar]
17.Pääbo S. Ancient DNA: extraction, characterization, molecular cloning, and enzymatic amplification. Proc Natl Acad Sci USA. 1989;86:1939–1943. doi: 10.1073/pnas.86.6.1939. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Brotherton P, et al. Novel high-resolution characterization of ancient DNA reveals C >U-type base modification events as the sole cause of post mortem miscoding lesions. Nucleic Acids Res. 2007;35:5717–5728. doi: 10.1093/nar/gkm588. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Fogg MJ, et al. Structural basis for uracil recognition by archaeal family b DNA polymerases. Nature Struct Biol. 2002;9:922–927. doi: 10.1038/nsb867. [DOI] [PubMed] [Google Scholar]
20.Willerslev E, Cooper A. Ancient DNA. Proc Biol Sci. 2005;272:3–16. doi: 10.1098/rspb.2004.2813. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Handt O, et al. The retrieval of ancient human DNA sequences. Am J Hum Genet. 1996;59:368–376. [PMC free article] [PubMed] [Google Scholar]
22.Skaletsky H, et al. The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature. 2003;423:825–837. doi: 10.1038/nature01722. [DOI] [PubMed] [Google Scholar]
23.Benfey PN, Mitchell-Olds T. From genotype to phenotype: systems biology meets natural variation. Science. 2008;320:495–497. doi: 10.1126/science.1153716. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Yamamoto F, et al. Molecular genetic basis of the histo-blood group ABO system. Nature. 1990;345:229–233. doi: 10.1038/345229a0. [DOI] [PubMed] [Google Scholar]
25.Cavalli-Sforza LL, et al. The History and Geography of Human Genes. Princeton Univ Press; 1994. [Google Scholar]
26.Iida R, et al. Genotyping of five single nucleotide polymorphisms in the OCA2 and HERC2 genes associated with blue-brown eye color in the Japanese population. Cell Biochem Funct. 2009;27:323–327. doi: 10.1002/cbf.1572. [DOI] [PubMed] [Google Scholar]
27.Soejima M, Koda Y. Population differences of two coding SNPs in pigmentation-related genes SLC24A5 and SLC45A2. Int J Legal Med. 2007;121:36–39. doi: 10.1007/s00414-006-0112-z. [DOI] [PubMed] [Google Scholar]
28.Branicki W, et al. Association of the SLC45A2 gene with physiological human hair colour variation. J Hum Genet. 2008;53:966–971. doi: 10.1007/s10038-008-0338-3. [DOI] [PubMed] [Google Scholar]
29.Sabeti PC, et al. Genome-wide detection and characterization of positive selection in human populations. Nature. 2007;449:913–918. doi: 10.1038/nature06250. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Prodi DA, et al. EDA2R is associated with androgenetic alopecia. J Invest Dermatol. 2008;128:2268–2270. doi: 10.1038/jid.2008.60. [DOI] [PubMed] [Google Scholar]
31.Ellis JA, et al. Baldness and the androgen receptor: the AR polyglycine repeat polymorphism does not confer susceptibility to androgenetic alopecia. Hum Genet. 2007;121:451–457. doi: 10.1007/s00439-006-0317-8. [DOI] [PubMed] [Google Scholar]
32.Kimura R, et al. A common variation in EDAR is a genetic determinant of shovel-shaped incisors. Am J Hum Genet. 2009;85:528–535. doi: 10.1016/j.ajhg.2009.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Yoshiura K, et al. A SNP in the ABCC11 gene is the determinant of human earwax type. Nature Genet. 2006;38:324–330. doi: 10.1038/ng1733. [DOI] [PubMed] [Google Scholar]
34.Meldgaard JA. Paleo-Eskimo culture in West Greenland. Am Antiq. 1952;17:222–230. [Google Scholar]
35.McGhee R. Canadian Prehistory Series. Canadian Museum of Civilization; 1990. Canadian Arctic Prehistory. [Google Scholar]
36.Li JZ, et al. Worldwide human relationships inferred from genome-wide patterns of variation. Science. 2008;319:1100–1104. doi: 10.1126/science.1153717. [DOI] [PubMed] [Google Scholar]
37.Pitul̀ko V, Makeyev V. Ancient Arctic Hunters. Nature. 1991;349:374. [Google Scholar]
38.Karafet T, et al. New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree. Genome Res. 2008;18:830–838. doi: 10.1101/gr.7172008. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Alexander DH, et al. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–1664. doi: 10.1101/gr.094052.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Reimer P, et al. IntCal04 terrestrial radiocarbon age calibration, 0–26 cal kyr BP. Radiocarbon. 2004;46:1029–1058. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Info

NIHMS467171-supplement-Supporting_Info_.pdf^{(3.4MB, pdf)}

[R1] 1.Bentley DR, et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–59. doi: 10.1038/nature07517. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Drmanac R, et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science. 2010;327:78–81. doi: 10.1126/science.1181498. [DOI] [PubMed] [Google Scholar]

[R3] 3.Levy S, et al. The diploid genome sequence of an individual human. PLoS Biol. 2007;5:e254. doi: 10.1371/journal.pbio.0050254. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Wheeler DA, et al. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–876. doi: 10.1038/nature06884. [DOI] [PubMed] [Google Scholar]

[R5] 5.Pushkarev D, et al. Single-molecule sequencing of an individual human genome. Nature Biotechnol. 2009;27:847–850. doi: 10.1038/nbt.1561. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Wang J, et al. The diploid genome sequence of an Asian individual. Nature. 2008;456:60–65. doi: 10.1038/nature07484. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Ahn SM, et al. The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group. Genome Res. 2009;19:1622–1629. doi: 10.1101/gr.092197.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Kim JI, et al. A highly annotated whole-genome sequence of a Korean individual. Nature. 2009;460:1011–1015. doi: 10.1038/nature08211. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Noonan JP, et al. Sequencing and analysis of neanderthal genomic DNA. Science. 2006;314:1113–1118. doi: 10.1126/science.1131412. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Green RE, et al. Analysis of one million base pairs of Neanderthal DNA. Nature. 2006;444:330–336. doi: 10.1038/nature05336. [DOI] [PubMed] [Google Scholar]

[R11] 11.Wall JD, Kim SK. Inconsistencies in neanderthal genomic DNA sequences. PLoS Genet. 2007;3:1862–1866. doi: 10.1371/journal.pgen.0030175. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Miller W, et al. Sequencing the nuclear genome of the extinct woolly mammoth. Nature. 2008;456:387–390. doi: 10.1038/nature07446. [DOI] [PubMed] [Google Scholar]

[R13] 13.Green RE, et al. The neandertal genome and ancient DNA authenticity. EMBO J. 2009;28:2494–2502. doi: 10.1038/emboj.2009.222. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Harritt R. Paleo-eskimo beginnings in North America a new discovery at Kuzitrin lake, Alaska. Etud Inuit. 1998;22:61–81. [Google Scholar]

[R15] 15.Meldgaard M. Meddelelser om Grønland, Man&Society. Danish Polar Center; 2004. Ancient Harp Seal Hunters of Disko Bay. Subsistence and Settlement at the Saqqaq Culture Site Qeqertasussuk (2400–1400 BC), West Greenland. [Google Scholar]

[R16] 16.Gilbert MTP, et al. Paleo-eskimo mtDNA genome reveals matrilineal discontinuity in Greenland. Science. 2008;320:1787–1789. doi: 10.1126/science.1159750. [DOI] [PubMed] [Google Scholar]

[R17] 17.Pääbo S. Ancient DNA: extraction, characterization, molecular cloning, and enzymatic amplification. Proc Natl Acad Sci USA. 1989;86:1939–1943. doi: 10.1073/pnas.86.6.1939. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Brotherton P, et al. Novel high-resolution characterization of ancient DNA reveals C >U-type base modification events as the sole cause of post mortem miscoding lesions. Nucleic Acids Res. 2007;35:5717–5728. doi: 10.1093/nar/gkm588. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Fogg MJ, et al. Structural basis for uracil recognition by archaeal family b DNA polymerases. Nature Struct Biol. 2002;9:922–927. doi: 10.1038/nsb867. [DOI] [PubMed] [Google Scholar]

[R20] 20.Willerslev E, Cooper A. Ancient DNA. Proc Biol Sci. 2005;272:3–16. doi: 10.1098/rspb.2004.2813. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Handt O, et al. The retrieval of ancient human DNA sequences. Am J Hum Genet. 1996;59:368–376. [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Skaletsky H, et al. The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature. 2003;423:825–837. doi: 10.1038/nature01722. [DOI] [PubMed] [Google Scholar]

[R23] 23.Benfey PN, Mitchell-Olds T. From genotype to phenotype: systems biology meets natural variation. Science. 2008;320:495–497. doi: 10.1126/science.1153716. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Yamamoto F, et al. Molecular genetic basis of the histo-blood group ABO system. Nature. 1990;345:229–233. doi: 10.1038/345229a0. [DOI] [PubMed] [Google Scholar]

[R25] 25.Cavalli-Sforza LL, et al. The History and Geography of Human Genes. Princeton Univ Press; 1994. [Google Scholar]

[R26] 26.Iida R, et al. Genotyping of five single nucleotide polymorphisms in the OCA2 and HERC2 genes associated with blue-brown eye color in the Japanese population. Cell Biochem Funct. 2009;27:323–327. doi: 10.1002/cbf.1572. [DOI] [PubMed] [Google Scholar]

[R27] 27.Soejima M, Koda Y. Population differences of two coding SNPs in pigmentation-related genes SLC24A5 and SLC45A2. Int J Legal Med. 2007;121:36–39. doi: 10.1007/s00414-006-0112-z. [DOI] [PubMed] [Google Scholar]

[R28] 28.Branicki W, et al. Association of the SLC45A2 gene with physiological human hair colour variation. J Hum Genet. 2008;53:966–971. doi: 10.1007/s10038-008-0338-3. [DOI] [PubMed] [Google Scholar]

[R29] 29.Sabeti PC, et al. Genome-wide detection and characterization of positive selection in human populations. Nature. 2007;449:913–918. doi: 10.1038/nature06250. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Prodi DA, et al. EDA2R is associated with androgenetic alopecia. J Invest Dermatol. 2008;128:2268–2270. doi: 10.1038/jid.2008.60. [DOI] [PubMed] [Google Scholar]

[R31] 31.Ellis JA, et al. Baldness and the androgen receptor: the AR polyglycine repeat polymorphism does not confer susceptibility to androgenetic alopecia. Hum Genet. 2007;121:451–457. doi: 10.1007/s00439-006-0317-8. [DOI] [PubMed] [Google Scholar]

[R32] 32.Kimura R, et al. A common variation in EDAR is a genetic determinant of shovel-shaped incisors. Am J Hum Genet. 2009;85:528–535. doi: 10.1016/j.ajhg.2009.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Yoshiura K, et al. A SNP in the ABCC11 gene is the determinant of human earwax type. Nature Genet. 2006;38:324–330. doi: 10.1038/ng1733. [DOI] [PubMed] [Google Scholar]

[R34] 34.Meldgaard JA. Paleo-Eskimo culture in West Greenland. Am Antiq. 1952;17:222–230. [Google Scholar]

[R35] 35.McGhee R. Canadian Prehistory Series. Canadian Museum of Civilization; 1990. Canadian Arctic Prehistory. [Google Scholar]

[R36] 36.Li JZ, et al. Worldwide human relationships inferred from genome-wide patterns of variation. Science. 2008;319:1100–1104. doi: 10.1126/science.1153717. [DOI] [PubMed] [Google Scholar]

[R37] 37.Pitul̀ko V, Makeyev V. Ancient Arctic Hunters. Nature. 1991;349:374. [Google Scholar]

[R38] 38.Karafet T, et al. New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree. Genome Res. 2008;18:830–838. doi: 10.1101/gr.7172008. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] 39.Alexander DH, et al. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–1664. doi: 10.1101/gr.094052.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] 40.Reimer P, et al. IntCal04 terrestrial radiocarbon age calibration, 0–26 cal kyr BP. Radiocarbon. 2004;46:1029–1058. [Google Scholar]

PERMALINK

Ancient human genome sequence of an extinct Palaeo-Eskimo

Morten Rasmussen

Yingrui Li

Stinus Lindgreen

Jakob Skou Pedersen

Anders Albrechtsen

Ida Moltke

Mait Metspalu

Ene Metspalu

Toomas Kivisild

Ramneek Gupta

Marcelo Bertalan

Kasper Nielsen

M Thomas P Gilbert

Yong Wang

Maanasa Raghavan

Paula F Campos

Hanne Munkholm Kamp

Andrew S Wilson

Andrew Gledhill

Silvana Tridico

Michael Bunce

Eline D Lorenzen

Jonas Binladen

Xiaosen Guo

Jing Zhao

Xiuqing Zhang

Hao Zhang

Zhuo Li

Minfeng Chen

Ludovic Orlando

Karsten Kristiansen

Mads Bak

Niels Tommerup

Christian Bendixen

Tracey L Pierre

Bjarne Grønnow

Morten Meldgaard

Claus Andreasen

Sardana A Fedorova

Ludmila P Osipova

Thomas F G Higham

Christopher Bronk Ramsey

Thomas v O Hansen

Finn C Nielsen

Michael H Crawford

Søren Brunak

Thomas Sicheritz-Pontén

Richard Villems

Rasmus Nielsen

Anders Krogh

Jun Wang

Eske Willerslev

Abstract

Sample characteristics, DNA quality and sequencing strategy

Figure 1. Sample details.

Sequencing and assembly

Figure 2. Data summary.

Genotyping and comparative genomic analyses

Functional SNP assessment

Population genetics context of the Saqqaq individual

Figure 3. Population genetics and phylogenetics.

Conclusion

Methods Summary

Supplementary Material

Acknowledgments

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases