Skip to main content
Biology Letters logoLink to Biology Letters
. 2015 Mar;11(3):20141058. doi: 10.1098/rsbl.2014.1058

Mitochondrial genomes reveal the extinct Hippidion as an outgroup to all living equids

Clio Der Sarkissian 1, Julia T Vilstrup 1, Mikkel Schubert 1, Andaine Seguin-Orlando 1,2, David Eme 3, Jacobo Weinstock 4, Maria Teresa Alberdi 5, Fabiana Martin 6, Patricio M Lopez 7, Jose L Prado 8, Alfredo Prieto 9, Christophe J Douady 3, Tom W Stafford 1, Eske Willerslev 1, Ludovic Orlando 1,10,
PMCID: PMC4387498  PMID: 25762573

Abstract

Hippidions were equids with very distinctive anatomical features. They lived in South America 2.5 million years ago (Ma) until their extinction approximately 10 000 years ago. The evolutionary origin of the three known Hippidion morphospecies is still disputed. Based on palaeontological data, Hippidion could have diverged from the lineage leading to modern equids before 10 Ma. In contrast, a much later divergence date, with Hippidion nesting within modern equids, was indicated by partial ancient mitochondrial DNA sequences. Here, we characterized eight Hippidion complete mitochondrial genomes at 3.4–386.3-fold coverage using target-enrichment capture and next-generation sequencing. Our dataset reveals that the two morphospecies sequenced (H. saldiasi and H. principale) formed a monophyletic clade, basal to extant and extinct Equus lineages. This contrasts with previous genetic analyses and supports Hippidion as a distinct genus, in agreement with palaeontological models. We date the Hippidion split from Equus at 5.6–6.5 Ma, suggesting an early divergence in North America prior to the colonization of South America, after the formation of the Panamanian Isthmus 3.5 Ma and the Great American Biotic Interchange.

Keywords: ancient DNA, Hippidion, mitochondrial genomes, Great American Biotic Interchange

1. Background

All contemporary equids belong to a single genus, Equus [1]. Their most recent common ancestor (MRCA) lived in North America approximately 4.0–4.5 million years ago (Ma) [2], before dispersing into other continents and radiating into a wide range of extant and now-extinct forms. The ancestors of South American equids are believed to have left North America through intercontinental migrations after the formation of the Panamanian Isthmus 3.0–3.7 Ma [3,4], which promoted the Great American Biotic Interchange. No American equid survived the megafaunal extinction approximately ten thousand years ago (kya) [2].

Two main equine lineages are found in the South American Pleistocene fossil record [5]. The first defines a subgenus of its own, Equus (Amerhippus; but see [6]), which appeared approximately 1 Ma and showed horse-like anatomical features and genetic affinities [6]. Hippidiforms represent the second lineage, which appeared approximately 2.5 Ma and includes two genetically similar generalists (Hippidion principale, H. saldiasi) [7] and one genetically distinct high-altitude specialist (H. devillei) [79].

Like Amerhippus, hippidiforms were stocky, but displayed very long nasal bones and extremely deep nasoincisival notches [5]. Palaeontological data support affinities with the northern American Pliohippus [3,8,10], which lived 6–14.5 Ma. This considerably predates the time when hippidiforms first entered South America [3], implying an early divergence from the Equus lineage. However, recent molecular studies based on short mitochondrial sequences from 20 fossil specimens have suggested clustering of hippidiforms within Equus [6,7,11,12], or possible affinities with Dinohippus [12], a North American lineage that lived 5–8 Ma [13]. Therefore, the evolutionary origin of Hippidion remains unclear.

Mitochondrial genomes (mitogenomes) have recently resolved the evolutionary tree underlying the main radiation of equids [1], which was confirmed by whole-exomes [14]. This established mitogenomes as reliable phylogenetic markers for equids. We therefore used target-enrichment capture and next-generation sequencing to characterize mitogenomes of Hippidion and resolve its evolutionary origin.

2. Methods

(a). Samples

We analysed nine Hippidion samples excavated in northern Chile, southern Chile and Argentina (table 1 and the electronic supplementary material, table S1). The specimens represent two morphospecies from the Late Pleistocene (H. principale and H. saldiasi) with six samples radiocarbon-dated to 12.6–17.4 calibrated kya.

Table 1.

Sample and sequencing information. cal. BP, calibrated years before present; #reads (#hits), total number of reads (high-quality hits); M, million; cov, average depth-of-coverage. #hits and cov are estimated following mapping against the sample's mitogenome; LP, Late Pleistocene.

sample species location age (cal. BP) #reads (M) #hits cov (fold)
AE57-17 H. saldiasi Cueva Nordenskjöld, Magallanes, Chile 16 809–17 400 34.6 57 640 293.1
CDM4 H. saldiasi Cueva del Milodón, Magallanes, Chile 13 575–13 810 18.6 37 151 186.6
CE99208-31 H. saldiasi Cueva Escondida, Magallanes, Chile 16 870–17 020 26.5 11 718 51.5
CM66 H. saldiasi Cueva del Medio, Magallanes, Chile LP 36.1 75 624 386.3
CM-A37 H. saldiasi Cueva del Medio, Magallanes, Chile 12 570–12 640 37.4 16 441 69.8
EQB H. sp. Quebrada Opache, Calama, Chile LP 5.5 138 0.6
EQF H. sp. Cueva Baño Nuevo-1, Chile 15 602–16 724 7.7 18 522 78.7
JW251 H. principale Provincia de Buenos Aires, Argentina LP 47.1 1028 3.4
OH H. saldiasi Cueva del Milodón, Magallanes, Chile 13 197–13 468 29.7 2286 7.1

(b). Molecular analyses

We drilled samples (94–485 mg powder), built DNA libraries and set up PCR amplification in ancient DNA facilities following published methods (electronic supplementary material, table S1). We first shotgun-sequenced one/two amplified libraries per sample on Illumina HiSeq 2000 platforms, except for libraries from EQB and EQF, which were directly processed for target-enrichment capture. As sequencing revealed a limited number of mitochondrial sequences, we performed target-enrichment capture for equine mitogenomes for all but JW251 and OH samples (electronic supplementary material, table S2). We followed [15] and built probes from mitochondrial PCR amplicons from living equids (electronic supplementary material). Enriched libraries were amplified for an additional 12 cycles post-capture before being quantified on an Agilent 2100 Bioanalyser, pooled and sequenced on an Illumina HiSeq 2000 platform. DNA contamination was monitored using mock controls during DNA extraction, library construction, PCR amplification and target enrichment. Contamination levels were estimated from sequence reads using the maximum-likelihood (ML) method from Fu et al. [16] and a database of 245 equine mitogenomes (electronic supplementary material, table S3).

(c). Sequence analyses

DNA read processing (adapter trimming), mapping (read alignment, PCR duplicate removal, indel realignment) and damage analyses were performed using the PALEOMIX pipeline [17]. Seeding was disabled for mapping, a relaxed edit distance (−n 0.03) was used, and reads showing mapping qualities less than 25 were excluded. To ensure that all equine-like reads could be identified in the absence of a mitochondrial reference for Hippidion, we first recovered, for each sample, high-quality hits mapping uniquely against at least one equine mitogenome and/or partial Hippidion sequence (electronic supplementary material, table S4). We then used a majority rule and a minimum depth-of-coverage of 2 to call a preliminary consensus sequence for each sample, which was used for read mapping with default parameters. A new and final consensus was then called using a majority rule, a minimum depth-of-coverage of 3 and filtering for base scores less than 30. Alignment statistics and DNA damage parameters were calculated based on a final mapping against each individual mitochondrial sequence.

(d). Phylogenetic analyses

We reconstructed the phylogenetic tree of equid complete mitogenomes using both ML (PhyML v. 3.0; [18]) and Bayesian algorithms (Beast v. 1.8.0; [19]). We constructed a 40-taxon mitochondrial dataset by aligning Hippidion mitochondrial sequences against equine orthologous sequences, adding the white and black rhinoceroses as outgroups (electronic supplementary material, table S5). As Beast allows the application of different models of substitution to partitions in datasets, we divided the alignment into six partitions (one per codon position, rRNA, tRNA, control region CR). For PhyML analyses, we merged these partitions into one alignment, as PhyML does not handle multiple partitions. The best substitution model for each partition/merged dataset (with and without outgroups) was determined with Modelgenerator v. 0.85 [20] on the basis of its Bayesian information criterion, and applied to the corresponding partition/dataset in Beast (unlinked substitution models) and PhyML analyses (electronic supplementary material, tables S6–S7). Beast analyses were performed assuming a birth–death serially sampled tree model, and divergence dates were estimated using a log-uncorrelated molecular clock model together with tip-sampling (electronic supplementary material, table S8) and assuming a time for the MRCA at 4.0–4.5 Ma for Equus [2].

To further support tree topology, we performed approximate unbiased (AU) and Shimodeira–Hasegawa (SH) topological tests in CONSEL [21] using five trees differing in their Hippidion placement and site-wise-likelihood values determined by PhyML.

Finally, in order to compare our Hippidion CR sequences with those characterized previously, we ran ML analyses under a GTR + I + Γ8 model (GTR, general time reversible model; I, invariant sites; Γ8, 8-class gamma distribution; electronic supplementary material, table S9).

3. Results and discussion

We characterized eight Hippidion mitogenomes at 3.4–386.3-fold coverage (table 1 and electronic supplementary material, table S2). Compared with shotgun sequencing, target-enriched libraries showed a 3.0–147.8-fold increase (average 42.4-fold; electronic supplementary material, table S2) in the number of high-quality hits mapping uniquely against the Hippidion mitogenome. An additional sample (EQB) showed poor DNA preservation and resulted in limited mitogenome coverage (0.6-fold). Several lines of evidence support our data as authentic: (i) presence of typical nucleotide mis-incorporation and ancient DNA fragmentation patterns [22], i.e. increasing C → T (G → A) substitutions toward read starts (ends), preferential read starts (ends) at genomic positions following purines (pyrimidines; electronic supplementary material, figures S1–S9), (ii) 6.9–32.5-fold higher cytosine deamination rates at overhangs than in double-stranded DNA (electronic supplementary material, table S10), (iii) contamination levels less than or equal to 4.5% (per library) and 1.4% (per sample; electronic supplementary material, table S11), (iv) ML phylogenetic clustering with all H. saldiasi and H. principale CR sequences previously characterized (figure 1a). In addition, albeit only partially characterized owing to limited coverage, sample JW251 CR sequence clustered with the sequence previously published for this specimen [12].

Figure 1.

Figure 1.

Maximum-likelihood phylogenetic reconstructions. (a) Control region. (b) Mitochondrial genome (rooted using rhinoceroses as outgroups, not shown). The Hippidion devillei clade is represented in brown. The other Hippidion clade including H. saldiasi, H. principale and H. sp., is shown in red. In bold, Hippidion sequences generated in this study. Node supports (aLRT SH-like) <0.80 are not shown. See table 2 for node supports and Bayesian divergence estimates. New World Stilt Legged Horse = extinct Pleistocene North American equid [12].

ML analyses (figure 1b) and Bayesian phylogenetic inference rooted on rhinoceros outgroups revealed Hippidion as a monophyletic assemblage basal to all Equus. Similar Bayesian phylogenetic placement was found when excluding rhinoceros outgroups (data not shown). We furthermore used AU and SH topological tests to show that this topology provided a significantly better alternative than four other topologies where Hippidion was forced to nest within different Equus clades (p-values ≤ 0.001 and ≤ 0.044; electronic supplementary material, figure S10), following all possible alternative topologies highlighted in reference [7]. Our analyses therefore contrast with previous results based on partial mitochondrial sequences [6,7,11,12] and support palaeontological models, which place Hippidion outside the range of variation of all extinct and extant Equus species. Additionally, we found that H. devillei clusters outside a paraphyletic assemblage consisting of H. principale and H. saldiasi, as shown in reference [7]. Whether the latter should still be described as two distinct morphospecies remains to be determined with nuclear data.

We estimated the divergence between Hippidion and Equus at approximately 5.6–6.5 (confidence range = 4.6–7.6 Ma; table 2), before Hippidion entered South America following the formation of the Panamanian Isthmus 3.0–3.7 Ma [5]. This is supported by fossils found in southern North America, which exhibit a more primitive nasal notch than South American hippidions, and could represent earlier hippidiforms [23].

Table 2.

Clade phylogenetic support and divergence dates. See figure 1 for nodes A–F. CR, control region; MG, mitochondrial genome; O, outgroups; n.a., not applicable. Node ages correspond to the median of Bayesian posterior distributions [5% and 95% quantiles].

ML inference
Bayesian inference
supportb
support
node age (Ma)
node CR MG + O MG – Oc MG + O MG– O MG + O MG – O
A n.a. 1.000 n.a. 1.000 1.000 6.502
[5.551–7.608]
5.596
[4.599–6.629]
B 0.891 1.000 0.915 1.000 1.000 0.377
[0.276–0.494]
0.322
[0.247–0.425]
C 0.994 0.860 1.000 1.000 1.000 (4.000–4.500)a (4.000–4.500)a
D 0.888 0.999 0.996 1.000 1.000 3.126
[2.648–3.603]
2.833
[2.314–3.331]
E 1.000 1.000 1.000 1.000 2.965
[2.600–3.340]
2.650
[2.290–3.017]
F 0.895 n.a. n.a. n.a. n.a. n.a. n.a.

aCalibration point.

baLRT SH-like.

cManually rooted according to the MG + O tree.

4. Conclusion and perspectives

In this study, we sequenced for the first time complete mitogenomes of Hippidion saldiasi and H. principale. We reject previous claims that Hippidion is nested within Equus, and estimate an early split between Hippidion and Equus 5.6–6.5 Ma. Although the Equidae family flourished during the Miocene, all present-day equids originate from an MRCA that lived 4.0–4.5 Ma [2]. With closest living relatives diverging 55 Ma, the early evolutionary stages of the genus Equus are difficult to reconstruct based on present-day genetic information alone. The survival of Hippidion until 10 kya and the possibility to sequence whole genomes from extinct species [24] offer a unique opportunity to access a closer phylogenetic outgroup for equids, and thereby reveal the genetic foundation of Equus.

Supplementary Material

Supplementary material
rsbl20141058supp1.pdf (130KB, pdf)

Supplementary Material

Supplementary Tables
rsbl20141058supp2.xlsx (59.1KB, xlsx)

Supplementary Material

Supplementary Figures
rsbl20141058supp3.pdf (1.8MB, pdf)

Acknowledgements

We thank the Danish National High-throughput DNA Sequencing Centre; Tina Brand and Pernille Selmer-Olsen for laboratory assistance; Philip Johnson for sharing R-scripts estimating contamination levels.

Data accessibility

Hippidion mitogenome sequences can be accessed from GenBank (KM881671–KM881679).

Authors' contributions

L.O. conceived and designed the study. C.D.S., J.T.V., A.S.O. and L.O. performed ancient DNA extraction, DNA library construction and amplification. C.D.S. performed target-enrichment capture experiments. M.S. and L.O. performed sequence analyses. D.E., J.W., M.T.A., F.M., P.M.L., J.L.P., A.P., C.J.D., T.W.S., E.W. and L.O. contributed samples, reagents and methods. All authors participated to the interpretation of the data. C.D.S., J.T.V. and L.O. wrote the article.

Funding statement

This research was supported by the Marie Curie Career-Integration Grant (293845), the Danish Council for Independent Research (Natural Sciences), the Danish National Research Foundation (DNRF94), the “Chaires d'Attractivité 2014” IDEX programme (University of Toulouse), and the Lundbeck Foundation (R52-A5062). F.M. was supported by the FONDECYT 1100822 and CD MAG0901 grants.

Competing interests

We have no competing interests.

References

  • 1.Vilstrup JT, et al. 2013. Mitochondrial phylogenomics of modern and ancient equids. PLoS ONE 8, e55950 ( 10.1371/journal.pone.0055950) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Orlando L, et al. 2013. Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature 499, 74–78. ( 10.1038/nature12323) [DOI] [PubMed] [Google Scholar]
  • 3.MacFadden BJ. 1997. Pleistocene horses from Tarija, Bolivia, and validity of the genus Onohippidium (Mammalia: Equidae). J. Vert. Paleontol. 17, 199–218. ( 10.1080/02724634.1997.10010964) [DOI] [Google Scholar]
  • 4.Ibaraki M. 1997. Closing of the Central American Seaway and Neogene coastal upwelling along the Pacific coast of South America. Tectonophysics 281, 99–104. ( 10.1016/S0040-1951(97)00161-3) [DOI] [Google Scholar]
  • 5.Alberdi MT, Prado JL, Prieto A. 2005. Considerations on the paper ‘Morphological convergence in Hippidion and Equus (Amerhippus) South American equids elucidated by ancient DNA analysis’. J. Mol. Evol. 61, 145–147. ( 10.1007/s00239-004-0216-3) [DOI] [PubMed] [Google Scholar]
  • 6.Orlando L, Male D, Alberdi MT, Prado JL, Prieto A, Cooper A, Hänni C. 2008. Ancient DNA clarifies the evolutionary history of American late Pleistocene equids. J. Mol. Evol. 66, 533–538. ( 10.1007/s00239-008-9100-x) [DOI] [PubMed] [Google Scholar]
  • 7.Orlando L, et al. 2009. Revising the recent evolutionary history of equids using ancient DNA. Proc. Natl Acad. Sci. USA 106, 21 754–21 759. ( 10.1073/pnas.0903672106) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Prado JL, Alberdi MT. 1996. A cladistic analysis of the horses of the tribe Equini. Palaeontology 39, 663–680. [Google Scholar]
  • 9.Borrero LA. 2009. The elusive evidence: the archeological record of the South American extinct megafauna. In American megafaunal extinctions at the end of the Pleistocene (ed. Haynes G.), pp. 145–168. Dordrecht, The Netherlands: Springer. [Google Scholar]
  • 10.Alberdi MT, Prado JL, López P, Labarca R, Martínez I. 1998. Late Pleistocene Hippidion saldiasi Roth, 1899 (Mammalia, Perissodactyla) from Calama, northern Chile. Rev. Chil. Hist. Nat. 80, 157–171. ( 10.4067/S0716-078X2007000200003) [DOI] [Google Scholar]
  • 11.Orlando L, Eisenmann V, Reynier F, Sondaar P. 2003. Morphological convergence in Hippidion and Equus (Amerhippus) South American equids elucidated by ancient DNA analysis. J. Mol. Evol. 57, S29–S40. ( 10.1007/s00239-003-0005-4) [DOI] [PubMed] [Google Scholar]
  • 12.Weinstock J, et al. 2005. Evolution, systematics, and phylogeography of Pleistocene horses in the New World: a molecular perspective. PLoS Biol. 3, 1373–1379. ( 10.1371/journal.pbio.0030241) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Groves C, Ryder OA. 2000. Systematics and phylogeny of the horse. In The genetics of the horse (eds Bowling AV, Ruvinsky A.), pp. 1–24. London, UK: CABI Publishing. [Google Scholar]
  • 14.Jónsson H, et al. 2014. Speciation with gene flow in equids despite extensive chromosomal plasticity. Proc. Natl Acad. Sci. USA 111, 18 655–18 660. ( 10.1073/pnas.1412627111) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Maricic T, Whitten M, Pääbo S. 2010. Multiplexed DNA sequence capture of mitochondrial genomes using PCR products. PLoS ONE 5, e14004 ( 10.1371/journal.pone.0014004) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Fu Q, et al. 2013. A revised timescale for human evolution based on ancient mitochondrial genomes. Curr. Biol. 23, 553–559. ( 10.1016/j.cub.2013.02.044) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Schubert M, et al. 2014. Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nat. Protoc. 9, 1056–1082. ( 10.1038/nprot.2014.063) [DOI] [PubMed] [Google Scholar]
  • 18.Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. 2010. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321. ( 10.1093/sysbio/syq010) [DOI] [PubMed] [Google Scholar]
  • 19.Drummond AJ, Suchard MA, Xie D, Rambaut A. 2012. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973. ( 10.1093/molbev/mss075) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Keane TM, Creevey CJ, Pentony MM, Naughton TJ, Mclnerney JO. 2006. Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol. Biol. 6, 29 ( 10.1186/1471-2148-6-29) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Shimodaira H, Hasegawa M. 2001. CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics 17, 1246–1247. ( 10.1093/bioinformatics/17.12.1246) [DOI] [PubMed] [Google Scholar]
  • 22.Dabney J, Meyer M, Pääbo S. 2013. Ancient DNA damage. Cold Spring Harb. Perspect. Biol. 5, a012567 ( 10.1101/cshperspect.a012567) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.MacFadden BJ, Skinner MF. 1979. Diversification and biogeography of the one-toed horses ‘Onohippidium’ and ‘Hippidion’. New Haven, CT: Peabody Museum of Natural History, Yale University. [Google Scholar]
  • 24.Miller W, et al. 2008. Sequencing the nuclear genome of the extinct woolly mammoth. Nature 456, 387–390. ( 10.1038/nature07446) [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material
rsbl20141058supp1.pdf (130KB, pdf)
Supplementary Tables
rsbl20141058supp2.xlsx (59.1KB, xlsx)
Supplementary Figures
rsbl20141058supp3.pdf (1.8MB, pdf)

Data Availability Statement

Hippidion mitogenome sequences can be accessed from GenBank (KM881671–KM881679).


Articles from Biology Letters are provided here courtesy of The Royal Society

RESOURCES