Abstract
Captive populations provide a valuable insurance against extinctions in the wild. However, they are also vulnerable to the negative impacts of inbreeding, selection and drift. Genetic information is therefore considered a critical aspect of conservation management. Recent developments in sequencing technologies have the potential to improve the outcomes of management programmes; however, the transfer of these approaches to applied conservation has been slow. The scimitar-horned oryx (Oryx dammah) is a North African antelope that has been extinct in the wild since the early 1980s and is the focus of a large-scale and long-term reintroduction project. To enable the selection of suitable founder individuals, facilitate post-release monitoring and improve captive breeding management, comprehensive genomic resources are required. Here, we used 10X Chromium sequencing together with Hi-C contact mapping to develop a chromosomal-level genome assembly for the species. The resulting assembly contained 29 chromosomes with a scaffold N50 of 100.4 Mb, and displayed strong chromosomal synteny with the cattle genome. Using resequencing data from six additional individuals, we demonstrated relatively high genetic diversity in the scimitar-horned oryx compared to other mammals, despite it having experienced a strong founding event in captivity. Additionally, the level of diversity across populations varied according to management strategy. Finally, we uncovered a dynamic demographic history that coincided with periods of climate variation during the Pleistocene. Overall, our study provides a clear example of how genomic data can uncover valuable insights into captive populations and contributes important resources to guide future management decisions of an endangered species.
Graphical Abstract.
Scimitar-horned oryx mother with calf in the Ouadi Rimé-Ouadi Achim Wildlife Reserve in central Chad. Copyright Justin Chuven.
Introduction
As human activities and habitat loss accelerate global species declines (Ceballos, Ehrlich, & Dirzo, 2017; Haipeng Li et al., 2016), captive and semi-captive populations are becoming increasingly important as potential sources for reintroductions (Fritz, Kramer, Hoffmann, Trobe, & Unsöld, 2017; Russell, Thorne, Oakleaf, & Ballou, 1994; Spalton, 1993). A central goal of ex-situ breeding programmes is therefore to achieve population viability through maintaining genetic diversity and minimising inbreeding (Frankham, Ballou, & Briscoe, 2002). Consequently, the value of genetic analysis in conservation management has long been recognised (Lacy, 1987). However, a lack of appropriate resources and baseline data has meant that in practice, genetic information is not always used. This has arguably contributed towards the failure of numerous reintroduction attempts (Robert, 2009; Tallmon, Luikart, & Waples, 2004; Weeks et al., 2011). Continued advances in sequencing technology have now made it possible to generate high resolution genomic data for practically any species, and the wider uptake of these approaches by the conservation community would undoubtedly increase the chance of successful management outcomes (Allendorf, Hohenlohe, & Luikart, 2010; Shafer et al., 2015; Supple & Shapiro, 2018; Wildt et al., 2019).
The advent of next-generation sequencing over the past decade has meant that reference genomes are now available for hundreds of species (Koepfli, Paten, Genome 10K Community of Scientists, & O’Brien, 2015). However, most genomes have been assembled using short-read sequencing technologies and as a result are highly fragmented into hundreds or thousands of scaffolds, often without any chromosomal assignment (Bradnam et al., 2013; Salzberg & Yorke, 2005). Consequently, there has been growing interest in sequencing technologies that incorporate long-range, chromosomal information to improve contiguity, reduce error rates and make downstream annotation more reliable (van Dijk, Jaszczyszyn, Naquin, & Thermes, 2018). For example, 10X Chromium sequencing uses Linked-Reads to provide long-range haplotype information, whilst Hi-C contact mapping uses structural information to build chromosome-length scaffolds (Dudchenko et al., 2017). These approaches show great promise for studies of threatened species where well characterised genomes are rarely available. Reference assemblies can aid in the development of SNP arrays (Humble et al., 2016), which provide a powerful approach for genotyping low quality samples (Carroll et al., 2018), whilst structural and annotation information provide the opportunity to elucidate the genetic basis of inbreeding depression, hybrid sterility and adaptation to captivity (Allendorf et al., 2010; Kardos, Taylor, Ellegren, Luikart, & Allendorf, 2016; Knief et al., 2016).
Alongside these developments in genome assembly, whole genome resequencing is increasingly being employed to generate high resolution datasets of mapped genomic markers (Dobrynin et al., 2015; Ekblom et al., 2018; Kardos, Qvarnström, & Ellegren, 2017; Robinson et al., 2016; Westbury, Petersen, Garde, Heide-Jørgensen, & Lorenzen, 2019). This has opened up the opportunity for precisely measuring genetic diversity, a critical aspect of conservation management, particularly when selecting founders for reintroduction (IUCN/SSC, 2013). However, only a handful of studies have employed genomic approaches for measuring genetic diversity in captive species (Çilingir et al., 2019; Robinson et al., 2019; Willoughby, Ivy, Lacy, Doyle, & DeWoody, 2017) and therefore most estimates are based on traditional markers such as microsatellites and fixed panels of SNPs. These can be associated with high sampling variance and ascertainment bias (Väli, Einarsson, Waits, & Ellegren, 2008), making comparisons across species and populations problematic. As the conservation community continues to integrate the management of captive breeding programmes and natural populations (Redford, Jensen, & Breheny, 2012), there is a growing need to reliably characterise the distribution of diversity across meta-populations.
As well as facilitating the assessment of genetic diversity, sequence data from a diploid genome assembly can be used for reconstructing demographic history. For example, studies are increasingly employing methods such as PSMC (Li & Durbin, 2011) to infer past periods of population instability in wild species (Nadachowska-Brzyska, Li, Smeds, Zhang, & Ellegren, 2015; Palkopoulou et al., 2015; Prado-Martinez et al., 2013) and whilst some have documented dynamic patterns that coincide with past ecological variation (Beichman et al., 2019; Mays et al., 2018), others have uncovered signals of persistent population decline (Dobrynin et al., 2015; Westbury et al., 2019). As contemporary levels of genetic diversity are largely the result of mutations and genetic drift that occurred in the past (Ellegren & Galtier, 2016), an understanding of past population dynamics can place current estimates of diversity into a historical context (Stoffel et al., 2018).
The scimitar-horned oryx (SHO), Oryx dammah, is a large iconic antelope and one of two mammalian species classified as extinct in the wild by the International Union for Conservation of Nature (IUCN SSC Antelope Specialist Group, 2016). This arid-land adapted species was once widespread across North Africa, however a combination of hunting and land-use competition resulted in rapid population decline until the last remaining individuals disappeared in the 1980s (Woodfine & Gilbert, 2016). Before they were declared extinct, captive populations were established from what is thought to be around 50 individuals, mostly originating from Chad (Woodfine & Gilbert, 2016). In the decades that followed, captive SHO numbers increased to reach approximately 15,000 individuals (Gilbert, 2019). These are primarily held in private collections without strict management schemes such as those in the United Arab Emirates (Environment Agency of Abu Dhabi, EAD) and southern USA (Wildt et al., 2019), but also within studbook managed breeding programmes including those in Europe (European Endangered Species Program, EEP) and North America (Species Survival Plan Program, SSP). Rapid reductions in population size, such as those associated with the founding of captive populations, are generally expected to lead to a substantial loss of genetic diversity (Frankham et al., 2002). However, an early study using mitochondrial DNA reported considerably high levels of variation in captive SHO populations (Iyengar et al., 2007). Furthermore, a recent analysis using both microsatellites and a small panel of SNPs found support for higher levels of genetic diversity in studbook managed populations, implying that diversity is not spread evenly across the globe (Ogden et al., 2020).
A programme of SHO reintroductions occurred in Tunisia between 1985–2007 (Woodfine & Gilbert, 2016) and since 2014, a large-scale effort to release the species back into its native range has been led by the Environment Agency of Abu Dhabi. To date, approximately 200 individuals have been released into Chad, and more animals are due to be reintroduced in the coming years until a self-sustaining population is reached. To facilitate an assessment of ex-situ genetic variation and inform the selection of suitable founder individuals, SNP genotyping using reduced representation sequencing (ddRAD) has been carried out across multiple populations (Ogden et al., 2020). However, to carry out analyses that rely on markers with known genomic coordinates, such as demographic inference (Li & Durbin, 2011) or characterising runs of homozygosity (Kardos, Luikart, & Allendorf, 2015), a genome assembly for the SHO is required. This would also support the future development of genotyping arrays, where optimising SNP genomic context has been shown to improve validation outcomes (Humble et al., 2016).
In this study, we used a combination of 10X Chromium sequencing and Hi-C based chromatin contact maps to generate a chromosomal-level genome assembly for the species. We additionally resequenced six individuals from across three captive populations to generate a panel of genome-wide SNPs. The resulting data were used to investigate the strength of chromosomal synteny between oryx and cattle (Bos taurus), elucidate patterns of diversity between mammalian species and across captive SHO populations, and reconstruct historical demography of the oryx. We hypothesised that: i) SHO and cattle would display strong chromosomal synteny given relatively recent divergence times; ii) levels of diversity in the SHO would be low compared to other mammals, considering the species is extinct in the wild; iii) individuals from intensively managed zoo populations would display higher levels of genetic diversity than those from unmanaged collections; and iv) patterns of past population disturbance would coincide with known periods of climatic change in North Africa.
Materials and Methods
Sampling and DNA extraction
Peripheral whole blood (male; international studbook #20612) and liver tissue (female; international studbook #42652) were collected from the captive herd at the National Zoological Park – Conservation Biology Institute in Front Royal, Virginia, USA. The male oryx represents approximately 15% of founders to the global population documented in the international studbook. Whole blood was collected into EDTA blood tubes (BD Vacutainer Blood Tube, Becton, Dickinson and Company, Franklin Lakes, NJ, USA) and stored frozen until analysis. Total genomic DNA was isolated and used to generate the de novo reference genome assembly (see below for details). Additional blood and tissue samples were obtained for whole genome resequencing from six individuals representing three of the main captive populations: the EEP (n = 2, international studbook numbers #35552 and #34412), the SSP (n = 2, international studbook numbers #33556 and #36948) and the EAD (n = 2, for further details, see Table S1). EEP blood samples were collected by qualified veterinarians during routine health procedures and protocols were approved by Marwell Wildlife Ethics Committee. Total genomic DNA was extracted between one and five times using either the Qiagen DNeasy Blood and Tissue Kit (Qiagen, Cat. No. 69504) or the QuickGene DNA Whole Blood or Tissue Kit (Kurabo Industries). Elutions were pooled and concentrated in an Eppendorf Concentrator Plus at 45°C and 1400 rpm until roughly 50 μl remained.
10X Genomics sequencing and assembly
Two technologies were employed to sequence and assemble the SHO reference genome: 10X Genomics linked-read sequencing and chromosome conformation capture (Hi-C). For the 10X assembly, high molecular weight genomic DNA was isolated from ~2 ml of whole blood from individual #20612 using Nanobind magnetic discs (Circulomics, Inc., MD, USA). Genomic DNA concentration and purity were assessed with a Qubit 2.0 Fluorometer (ThermoFisher Scientific, MA, USA) and NanoDrop 2000 spectrophotometer (ThermoFisher Scientific, MA, USA). Capillary electrophoresis was carried out using a Fragment Analyzer (Agilent Technologies, CA, USA) to ensure that the isolated DNA had a minimum molecule length of 40 kb. Genomic DNA was diluted to ~1.2 ng/μl and libraries were prepared using Chromium Genome Reagents Kits Version 2 and the 10X Genomics Chromium Controller instrument fitted with a micro-fluidic Genome Chip (10X Genomics, CA, USA). DNA molecules were captured in Gel Bead-In-Emulsions (GEMs) and nick-translated using bead-specific unique molecular identifiers (UMIs; Chromium Genome Reagents Kit Version 2 User Guide). Size and concentration were determined using an Agilent 2100 Bioanalyzer DNA 1000 chip (Agilent Technologies, CA, USA). Libraries were then sequenced on an Illumina NovaSeq 6000 System following the manufacturer’s protocols (Illumina, CA, USA) to produce >60X read depth using paired-end 150 bp reads. The reads were assembled into phased pseudohaplotypes using Supernova Version 2.0 (10X Genomics, CA, USA). This assembly will hereafter be referred to as the 10X assembly.
Hi-C sequencing and scaffolding
Using liver tissue from individual #42652, an in-situ Hi-C library was prepared as previously described (Rao et al., 2014). The Hi-C library was sequenced on a HiSeq X Platform (Illumina, CA, USA) to a coverage of 60X. The Hi-C data were aligned to the 10X Genomics linked-read assembly using Juicer (Durand et al., 2016). Hi-C genome assembly was then performed using the 3D-DNA pipeline (Dudchenko et al., 2017) and the output was reviewed using Juicebox Assembly Tools (Dudchenko et al., 2018). No bases were changed, added or removed during Hi-C scaffolding and therefore the genome assembly represents a single individual: #201612. Where alternative haplotypes were present due to allelic variation in the 10X assembly, hereafter referred to as under-collapsed heterozygosity, one variant was chosen at random and incorporated into the 29 chromosome-length scaffolds. Alternative haplotypes are reported as unanchored sequences so that the primary scaffolds were free from duplication. This assembly will hereafter be referred to as the 10X+HiC assembly.
Genome annotation and completeness
To identify and annotate interspersed repeat regions we used RepeatMasker v4.0.7 to screen the 10X assembly against both the Dfam_consensus (release 20170127, Wheeler et al., 2013) and RepBase Update (release 20170127, Bao, Kojima, & Kohany, 2015) repeat databases. Sequence comparisons were performed using RMBlastn v2.6.0+ with the -species option set to mammal. We next predicted protein-coding genes with AUGUSTUS version 3.3.2 (Stanke et al., 2006) using the gene model trained in humans. Prediction of untranslated regions of the genes was disabled. Functional annotation of the predicted genes was then performed using eggNOG-mapper v2.0 (Huerta-Cepas et al., 2017) against the eggNOG 5.0 orthology database (Huerta-Cepas et al., 2019). The alignment algorithm DIAMOND was specified as the search tool (Buchfink, Xie, & Huson, 2015). A final set of protein-coding genes was obtained by filtering the genes predicted by AUGUSTUS for those with gene names assigned by eggNOG-mapper. We then transferred the repeat and gene annotations from the 10X assembly to the 10X+HiC assembly using custom scripts. Genome completeness of both the 10X and 10X+Hi-C assemblies was assessed using BUSCO v2 with 4,104 genes from the Mammalia odb9 database (Simão, Waterhouse, Ioannidis, Kriventseva, & Zdobnov, 2015) and the gVolante web interface (Nishimura, Hara, & Kuraku, 2017).
Genome synteny
We aligned the SHO chromosomes from the 10X+HiC assembly to the cattle genome (Bos taurus assembly version 3.1.1, GenBank accession number GCA_000003055.5, Zimin et al., 2009) using LAST v746 (Kiełbasa, Wan, Sato, Horton, & Frith, 2011). The cattle assembly was first prepared for alignment using the command lastdb. Next, lastal and last-split commands in combination with parallel-fastq were used to align the SHO chromosomes to the cattle assembly. Coordinates for alignments over 10 kb were extracted from the resulting multiple alignment format file and visualised using the R package RCircos v1.2.0 (Zhang, Meltzer, & Davis, 2013) and the JavaScript library D3.js.
Whole-genome resequencing and alignment
Library construction was carried out for whole genome resequencing of the six focal individuals using the Illumina TruSeq Nano High Throughout library preparation kit (Illumina, CA, USA). Paired-end sequencing was performed on an Illumina HiSeq X Ten platform at a depth of coverage of 15X. Sequencing reads were mapped to the SHO 10X+HiC chromosomes using BWA MEM v0.7.17 (Heng Li, 2013) with the default parameters. Any unmapped reads were removed from the alignment files using SAMtools v1.9 (Heng Li, 2011). We then used Picard Tools to sort each bam file, add read groups and mark and remove duplicate reads. This resulted in a set of six filtered alignments, one for each of the resequenced individuals.
SNP calling and filtering
HaplotypeCaller in GATK v3.8 (Van der Auwera et al., 2013) was first used to call variants separately for each filtered bam file. GenomicVCF files for each individual were then used as input to GenotypeGVCFs for joint genotyping. The resulting SNP dataset was filtered to include only biallelic SNPs using BCFtools v1.9 (Heng Li, 2011). We then applied a set of filters to obtain a high-quality dataset of variants using VCFtools v0.1.13 (Danecek et al., 2011). First, loci with Phred-scaled quality scores of less than 50 and genotypes with a depth of coverage less than five or greater than 38 (twice the mean sequence read depth) were removed. Second, loci with any missing data were discarded. Finally, we removed loci with a minor allele frequency of less than 0.16 to ensure the minor allele was observed at least twice.
Mitochondrial genome assembly
Sequencing reads for the six resequenced individuals were mapped using BWA MEM v0.7.17 (Heng Li, 2013) to a published mitochondrial reference genome of an SHO originating from the Paris Zoological Park (NCBI accession number: JN632677, Hassanin et al., 2012). Alignment files were filtered to contain only reads that mapped with their proper pair. Variants were called using SAMtools mpileup and BCFtools (Heng Li, 2011) call commands and filtered to include only those with Phred quality scores over 200 using VCFtools (Danecek et al., 2011). The resulting VCF file was manually checked and sites where the called allele was supported by fewer reads than the alternative allele were corrected. Consensus sequences for each individual were extracted using the BCFtools consensus command. We next used Geneious Prime v2019.2.1 (https://www.geneious.com) to annotate the mitochondrial consensus sequences and extract the cytochrome b, 16S and control region from each individual. Sequence similarity and haplotype frequencies were calculated using the R package pegas (Paradis, 2010). To place the mitochondrial data into a broader geographic context, the six control region sequences were aligned to 43 previously described haplotypes (NCBI accession numbers DQ159406–DQ159445 and MN689133–MN689138, Iyengar et al. 2007; Ogden et al., 2020) using Geneious Prime. A median-joining haplotype network was generated using PopArt v1.7 (Leigh & Bryant, 2015).
Genetic diversity
We assessed genetic diversity of SHO using two genome-wide measures. First, we used VCFtools to estimate nucleotide diversity (π) across all six resequenced individuals based on high-quality variants called by GATK. Second, we estimated individual genome-wide heterozygosity as the proportion of polymorphic sites over the total number of sites using the site-frequency spectrum of each individual sample. For this, filtered bam files were used as input to estimate the observed folded site-frequency spectrum (SFS) using the -doSaf and -realSFS functions in the program ANGSD (Korneliussen, Albrechtsen, & Nielsen, 2014). We excluded the X chromosome and skipped any bases and reads with quality scores below 20. Genome-wide heterozygosity was then calculated as the second value of the SFS (number of heterozygous genotypes) over the total number of sites, for each chromosome separately. To compare the level of diversity in SHO with other species, we visualised genome-wide heterozygosity values for other mammalian species collected from the literature (Table S2) against census population size and International Union for Conservation of Nature (IUCN) status. Finally, assuming a per site/per generation mutation rate (μ) of 1.1×10−08, we used our estimate of nucleotide diversity (π) as a proxy for to infer long-term θ to infer long-term Ne, given that θ=4Neμ.
Demographic history
To reconstruct the historical demography of the SHO, we used the Pairwise Sequential Markovian Coalescent (PSMC, Li & Durbin, 2011). This method uses the presence of heterozygous sites across a diploid genome to infer the time to the most recent common ancestor between two alleles. The inverse distribution of coalescence events is referred to as the instantaneous inverse coalescence rate (IICR) and for an unstructured and panmictic population, can be interpreted as the trajectory of Ne over time (Chikhi et al., 2018). To estimate the PSMC trajectory, we first generated consensus sequences for all autosomes in each of the filtered bam files from the six re-sequenced individuals using SAMtools mpileup, bcftools call and vcfutils.pl vcf2fq. Sites with a root-mean-squared mapping quality less than 30, and a depth of coverage below four or above 40 were masked as missing data. PSMC inference was then carried out using the default input parameters to generate a distribution of IICR through time for each individual. To generate a measure of uncertainty around our PSMC estimates, we ran 100 bootstrap replicates per individual. For this, consensus sequences were first split into 47 non-overlapping segments using the splitfa function in PSMC. We then randomly sampled from these, 100 times with replacement, and reran PSMC on the bootstrapped datasets.
To determine the extent to which the PSMC trajectory could vary, we scaled the coalescence rates and time intervals to population size and years based on three categories of neutral mutation rate and generation time. Our middle scaling values corresponded to a mutation rate of 1.1 × 10−08 and a generation time of 6.2 years. These were based on the per site/per generation mutation rate recently estimated for gemsbok (Oryx gazella, L. Chen., 2019) and the generation time reported in the International Studbook for the SHO (Gilbert, 2019) and are therefore considered the most appropriate estimates. Low scaling values corresponded to a mutation rate of 0.8 × 10−08 and a generation time of three and high scaling values corresponded to a mutation rate of 1.3 × 10−08 and a generation time of ten. Finally, to test the reliability of our IICR trajectories, we simulated sequence data under the inferred PSMC models and compared estimates of genome-wide heterozygosity with empirical values (Beichman, Phung, & Lohmueller, 2017). To do this, we used the program MaCS (G. K. Chen, Marjoram, & Wall, 2009) to simulate 1000 × 25 Mb sequence blocks under the full demographic model of each individual, assuming a recombination rate of 1.0 × 10−8 base pair per generation and a mutation rate of 1.1 × 10−08. Simulated heterozygosity was then calculated as the number of segregating sites over the total number of sites for each 25 Mb sequence. Empirical heterozygosity was calculated for each individual as the number of variable sites over the total number of sites in 25 Mb non-overlapping sliding windows along the genome. This was carried out using the filtered SNP dataset and the R package windowscanr.
Results
Chromosomal-level genome assembly
The genome assembly of the SHO, generated using both 10X Chromium and Hi-C technologies, had a total length of 2.7 Gb (Table 1). The use of Hi-C data successfully incorporated scaffolds into 29 chromosomes and increased the scaffold N50 by almost three-fold from 35.2 Mb to 100.4 Mb, and the contig N50 by over two-fold from 378 kb to 852 kb (Table 1). Around 149 Mb of under-collapsed heterozygosity was identified and incorporated into the assembly as unanchored sequence. The estimated GC content of the 10X-Hi-C assembly was 41.8%. BUSCO analysis of gene completeness revealed that 93.3% of core genes were complete in the 10X+Hi-C assembly which represents a marginal improvement in gene completeness compared to the 10X assembly (Table 1). Repetitive sequence content based on LTR elements, SINEs, LINEs, DNA elements, small RNAs, low complexity sequences and tandem repeats corresponded to approximately 47.79% of the genome (Table S3). SINEs and LINEs were the most common repeat elements, representing around 38% of the overall repeat content. Gene prediction using AUGUSTUS identified a total of 29,666 candidate protein-coding genes, of which 14,110 were assigned common gene names using eggNOG-mapper.
Table 1:
Genome assembly statistics for both iterations of the SHO genome assembly.
10X | 10X+Hi-C | |
---|---|---|
| ||
Length (bp) | 2,720,895,635 | 2,720,101,635 |
Scaffold N50 (bp) | 35,228,849 | 100,398,400 |
Scaffold L50 | 21 | 11 |
Longest scaffold (bp) | 136,126,622 | 198,955,781 |
Contig N50 (bp) | 378,550 | 852,138 |
GC content (%) | 41.82 | 41.83 |
Complete core genes (%) | 92.76 | 93.25 |
Complete & partial core genes (%) | 95.98 | 96.15 |
Missing core genes (%) | 4.02 | 3.85 |
Average number of orthologs per core gene | 1.05 | 1.04 |
Complete core genes, complete and partial core genes, missing core genes and average number of orthologs per core gene were assessed using BUSCO v2 with the Mammalia odb9 database (4,104 genes).
Genome synteny
To explore genomic synteny between SHO and cattle, we aligned the 29 chromosomes from the 10X+Hi-C assembly to the cattle assembly (BosTaurus version 3.1.1). Visualisation of the full alignment identified one chromosomal fusion between cattle chromosomes C1 and C25 which was located on SHO chromosome SHO2 (Figure 1 and Figure S1). All remaining SHO chromosomes mapped mainly or exclusively to a single cattle chromosome, reflecting strong chromosomal synteny between the two species. Specifically, for 28 SHO chromosomes, over 90% of the total alignment length was to a single cattle chromosome, with 11 of these aligning exclusively to a single cattle chromosome.
Figure 1.
Synteny between the 29 SHO 10X+HiC chromosomes (prefixed with SHO) and the cattle chromosomes (prefixed with C). Mapping each SHO chromosome resulted in multiple alignment blocks (mean = 2.5 kb, range = 0.3 – 12.5 kb) and alignments over 10 kb are shown.
Whole genome resequencing and SNP discovery
Whole genome resequencing of the six focal individuals resulted in an average sequencing coverage of 18.9 (min = 15.5, max = 27.2). After variant calling, a total of 12,945,559 biallelic SNPs were discovered using GATK’s best practice workflow (see Materials and Methods for details). Of these, a total of 8,063,284 polymorphic SNPs remained after quality filtering, with a mean minor allele frequency of 0.29. A full breakdown of the number of variants remaining after each filtering step is provided in Figure S2.
Mitochondrial genome assembly
We used the whole genome resequencing data, together with a publicly available mitochondrial DNA reference sequence to assemble the mitochondrial genome for the six focal SHO individuals. An average of 1,211,796 reads per individual mapped to the reference sequence (min = 27,178, max = 5,663,594), equivalent to an average mitochondrial sequencing coverage of 3487 (min = 342, max = 7934). Across each of the six consensus sequences, a total of 125 substitutions were identified, with sequence similarity ranging between 99.5 to 100% (Table S4). Individuals from EEP and SSP breeding programmes each displayed a unique mitochondrial haplotype whilst the haplotypes of both EAD animals were identical. Furthermore, we identified a total of five control region haplotypes, five 16S haplotypes and three cytochrome b haplotypes. To place our mitochondrial data into a broader context, we compared the control region sequences for each individual with 43 previously published haplotypes. Visualization of the haplotype network revealed that all five haplotypes from this study corresponded to previously published sequences (Table S1). Haplotypes from the four EAD and SSP animals clustered together on the left-hand side of the haplotype network, whilst haplotypes from the two EEP animals clustered separately on the right-hand side of the network. This suggests that a reasonably wide proportion of the known genetic diversity for the species has been captured (Figure S3).
Genetic diversity
Next, we investigated the level of variation in the SHO using two genome-wide measures. Our estimate for nucleotide diversity (π), the average number of pairwise differences between sequences, was 0.0014. Average genome-wide heterozygosity across all six individuals was in line with this, at 0.0097 (Figure 2A). Whilst this is lower than values estimated for mammals such as the Eurasian wolf and the western gorilla, this is considerably higher than estimates for endangered species such as the baiji river dolphin and the cheetah. Furthermore, given a census population size of around 15,000 individuals, this level of diversity is in line with that of species with similar census sizes such as the orangutan and the bonobo. Among individuals, genome-wide heterozygosity ranged between 0.00076 and 0.0011, with animals from the EAD displaying the lowest levels of genome-wide heterozygosity (Figure 2B). Diversity estimates for animals from European and American captive breeding populations were similar, with American animals being slightly more diverse (Figure 2B). Genome-wide heterozygosity also varied across autosomes, with some individuals displaying larger variance in heterozygosity than others (Figure 2B). Using our estimate of genome-wide heterozygosity as a proxy for θ, and assuming a mutation rate of 1.1e−8, long-term Ne of the SHO was estimated to be approximately 22,237 individuals.
Figure 2.
(A) Relationship between genome-wide heterozygosity and census population size for a selection of mammals, with individual points colour coded according to IUCN status. Some species names have been removed for clarity. Vertical bars correspond to the range of genome-wide heterozygosity estimates when more than one was available. Figure modified from Ekblom et al., (2018). For sources, see Table S2. (B) Differences in genome-wide heterozygosity across SHO individuals with colours corresponding to population. Raw data points represent the average genome-wide heterozygosity of each chromosome in each individual. Centre lines of boxplots reflect the median, bounds of the boxes reflect the 25th and 75th percentiles and upper and lower whiskers reflect the largest and smallest values. Further details about individual animals can be found in Table S1.
Demographic history
To investigate historical demography of SHO, we characterised the temporal trajectory of coalescent rates using PSMC. The PSMC trajectory showed the same pattern across all six individuals and therefore the curve for only one individual (#34412 from the EEP) is presented here (Figure 3, see Figure S4 for all PSMC distributions). Assuming a generation time of 6.2 years and a mutation rate of 1.1 × 10−8, the trajectory could be reliably estimated from approximately two million years ago. It was characterised by an overall decline towards the present day, interspersed with multiple periods of elevated IICR during the Pleistocene. If IICR is assumed to be equivalent to Ne, the period of decline during the early-mid Pleistocene reached a minimum effective population size of approximately 21,000 individuals. There was a sharp increase immediately after this, which peaked approximately 150 ka before it gradually declined again at the onset of the Last Glacial Period. After the Last Glacial Maximum 22 ka, the trajectory underwent a period of increasing IICR, although at this point, estimates become unreliable as demonstrated by the larger variance in bootstrap estimates within individuals and the larger variation in IICR values across individuals (Figure S4). Under alternative generation and mutation rate scalings, population size and year estimates shift in either direction. For example, the peak in Ne around 150 ka could shift by around 15,000 individuals and by up to 70 ka. To test the reliability of our PSMC trajectories, we compared the distributions of genome-wide heterozygosity calculated from both simulated and empirical data. For all individuals, the distribution of simulated heterozygosity was highly similar to empirical values, with the average empirical heterozygosity lying within the 95% confidence intervals of the simulated distribution indicating that the PSMC models are a good fit to the data (Figure S5).
Figure 3.
PSMC inference of the instantaneous inverse coalescent rate (IICR) through time under different scalings for SHO individual #34412 from the EEP. See Figure S4 for PSMC distributions of all individuals. The orange trajectory was scaled by a mutation rate of 1.1 × 10−08 and a generation time of 6.2 (middle), the grey trajectory was scaled by a mutation rate of 0.8 × 10−08 and a generation time of three (low) and the gold trajectory was scaled by a mutation rate of 1.3 × 10−08 and a generation time of 10 (high). Fine lines around the orange trajectory represent 100 bootstrap replicates. The shaded grey area corresponds to the Last Glacial Period and the Last Glacial Maximum (LGM) is indicated by the dashed line.
Discussion
As captive populations become increasingly important for the preservation of species, it is essential that genetic resources and baseline data are available to inform population management and improve reintroduction planning. In this study, we utilised third-generation sequencing technologies to generate a chromosomal-level genome assembly for the scimitar-horned oryx, a species declared extinct in the wild and the focus of a long-term reintroduction programme. We combined this with whole genome resequencing data from six individuals to characterise synteny with the cattle genome, elucidate the level and distribution of genetic diversity, and reconstruct historical demography. These results improve our understanding of an endangered antelope species and provide an important foundation to guide future management decisions.
Genome assembly
One of the main outcomes of this study is a chromosomal-level genome assembly for the SHO, a species belonging to the subfamily Hippotraginae within the family Bovidae and superorder Cetartiodactyla. This was achieved using a combination of 10X Chromium sequencing and Hi-C contact mapping. The total assembly length was 2.7 Gb, similar to the hippotragine sable antelope (Hippotragus niger; Koepfli et al., 2019) and gemsbok (Oryx gazella; Farré et al., 2019) reference assemblies, which have total lengths of 2.9 and 3.2 Gb respectively. The use of Hi-C data successfully incorporated scaffolds into 29 chromosomes, increasing the scaffold N50 to 100.4 Mb. This is more than 20 times the scaffold N50 reported for the sable antelope (~4.6 Mb, Koepfli et al., 2019) and almost double that of the N50 reported for gemsbok (47 Mb, Farré et al., 2019). In contrast, the contig N50 of the 10X-Hi-C assembly was >850 kb which represents a substantial improvement over both sable antelope (45.5 kb) and gemsbok assemblies (17.2 kb). Repeat content (47.79%) was is in line with that of European bison (47.3%, Wang et al., 2017) and sable antelope assemblies (46.7%, Koepfli et al., 2019) but higher than that of the Tibetan antelope (37%, Ge et al., 2013), whilst GC content was identical to that reported for the sable antelope (41.8%, Koepfli et al., 2019). Furthermore, a larger number of protein-coding genes were predicted in the SHO assembly than in studies of sable and Tibetan antelope and BUSCO analysis identified 93.3% of core genes. Our SHO assembly is therefore of very high quality and will serve as an important resource for the wider antelope and bovid research community.
Genome synteny
To further evaluate genome completeness and to explore chromosomal synteny, we mapped the SHO chromosomes to the cattle reference genome. The resulting alignment revealed complete coverage to all chromosomes in the cattle assembly, including the X-chromosome. This is in line with the results of the BUSCO analysis and suggests that the SHO genome assembly is close to complete. Furthermore, all but one of the SHO chromosomes showed near-to, or complete chromosomal homology with cattle, indicating that the Hi-C contact mapping approach reliably anchored scaffolds into chromosomes. In general, while Bovidae genomes show a high degree of synteny, they can vary in their diploid chromosome number due to the occurrence of centric fusions (Gallagher Jr & Womack, 1992; Wurster & Benirschke, 1968). We clearly identified the fixed centric fusion between cattle chromosomes 1 and 25 that has previously been described in the oryx lineage using cytogenic approaches (Kumamoto, Charter, Kingswood, Ryder, & Gallagher, 1999). However, we found no evidence for the fusion between chromosomes 2 and 15 that has been karyotyped in some captive individuals (Kumamoto et al., 1999). Chromosomal rearrangements both within and between species have been implicated in poor reproductive performance due to the disruption of chromosomal segregation during meiosis (Hauffe & Searle, 1998; Steiner et al., 2015; Wallace, Searle, & Everett, 2002). Genotype data from additional individuals would facilitate a comprehensive assessment of structural polymorphism across captive populations of SHO using methods that utilise patterns of linkage and substructure (Knief et al., 2016).
Genetic diversity
To assess the level of genetic diversity in the SHO we used whole genome resequencing data from six individuals originating from three captive populations. A recent meta-analysis has demonstrated that threatened species harbour reduced genetic diversity compared to their non-threatened counterparts due to the elevated impacts of inbreeding and genetic drift in small populations (Willoughby et al., 2015). In contrast, a handful of studies have uncovered unexpectedly high levels of diversity in species thought to have experienced strong population declines (Busch, Waser, & DeWoody, 2007; Dinerstein & McCracken, 1990; Hailer et al., 2006). While the SHO has been kept in captivity for the last 50 years, equivalent to around eight generations, it is unclear to what extent this has impacted its genetic variation. We found several lines of evidence in support for considerably high genetic diversity in the scimitar-horned oryx. First, the SHO genome assembly contained approximately 150 Mb of under-collapsed heterozygosity due to the presence of numerous alternative haplotypes. Second, we detected over 8 million high quality SNP markers, which given the small discovery pool of six individuals is relatively high for a large mammalian genome. Third, our estimates of genetic diversity were appreciably higher than in other threatened mammalian species.
These results are in some respects surprising given that the SHO underwent a period of rapid population decline in the wild, followed by a strong founding event in captivity. However, the species has bred well in captivity, reaching approximately 15,000 individuals in the span of several decades. This is likely to have reduced the strength of genetic drift, which alongside individual-based management, may have prevented the rapid loss of genetic diversity. This is in line with theoretical expectations that only very severe (i.e. a few tens of individuals) and long-lasting bottlenecks will cause a substantial reduction in genetic variation (Nei, Maruyama, & Chakraborty, 1975). With this in mind, it is also possible that the original founder population size was larger than previously thought, particularly for the EAD population, where records of founders are sparse. Additionally, as contemporary levels of genetic diversity are largely determined by long-term Ne (Ellegren & Galtier, 2016), we cannot discount the possibility that historical patterns of abundance have contributed to the variation we see today.
Nevertheless, caution must be taken when comparing estimates of diversity across species as the total number of variable sites, and therefore genetic variation, is sensitive to SNP calling criteria (Hohenlohe et al., 2010; Shafer et al., 2017). Furthermore, there are multiple ways to measure molecular variation (Hahn, 2018). However, our results are broadly in line with similar species such as the sable antelope, where a comparable number of variants were called in a similar number of individuals (Koepfli et al., 2019). Additionally, our estimates of genome-wide heterozygosity were calculated based on genotype likelihoods and therefore should be robust to sensitivities resulting from filtering (Korneliussen et al., 2014). Finally, we took care to compare our estimates of genetic diversity with equivalent measures in the literature. Therefore, we expect our measures of genetic variation to reflect the true level of diversity in the species.
To characterize the distribution of diversity in the SHO we compared genome-wide heterozygosity among captive populations. Diversity estimates varied between groups, with animals from the EAD showing lower levels of diversity than those from European and American captive breeding populations. However, this comparison is based on estimates from a small number of individuals and therefore may not be a true reflection of the overall variation in genetic diversity. Nevertheless, this pattern is consistent with studies both in SHO and Arabian oryx (Oryx leucoryx) that found diversity to be lower in unmanaged populations than in studbook managed populations (El Alqamy, Senn, Roberts, McEwing, & Ogden, 2012) and suggests that captive breeding programmes have been successful at maintaining genetic diversity. We also observed variation in the genetic diversity of individual chromosomes, a pattern which has been demonstrated across a wide variety of taxa (Doniger et al., 2008; Nordborg et al., 2005; The International SNP Map Working Group, 2001). Chromosomal variation in heterozygosity can arise through numerous mechanisms including recombination rate variation, mutation rate variation and selection (Begun & Aquadro, 1992; Hodgkinson & Eyre-Walker, 2011; Martin et al., 2016) and further studies will be required to understand the biological significance of these patterns in more detail.
Historical demography
To provide insights into the historical demography of the SHO, we quantified the trajectory of coalescence rates using PSMC. This method does not necessarily provide a literal representation of past population size change as it assumes a panmictic Wright-Fisher population (Mazet, Rodríguez, Grusea, Boitard, & Chikhi, 2016). Nevertheless, fluctuations in the trajectory provide insights into periods of past population instability which may be attributed to factors including population decline, population structure, gene flow and selection (Beichman et al., 2017; Chikhi et al., 2018; Mazet et al., 2016; Schrider, Shanku, & Kern, 2016). The PSMC trajectory of the SHO was characterised by an initial expansion approximately two million years ago which coincides with the appearance of present day bovid tribes in the fossil record (Bibi, 2013). This was followed by periods of disturbance during the mid-Pleistocene and at the onset of the Last Glacial Period, although these time points shift in either direction under alternative scalings. Similar PSMC trajectories have been observed in other African grassland species such as the gemsbok, greater kudu and impala (L. Chen et al., 2019). Climatic variability in North Africa during these time periods was associated with repeated expansion and contraction of suitable grassland habitat (Dupont, 2011), which is likely to have driven population decline or fragmentation in the SHO. This is consistent with previous findings that ecological variation associated with Pleistocene climate change has shaped the population size and distribution of ungulates in Africa (Lorenzen, Heller, & Siegismund, 2012).
Interestingly, despite the expansion of suitable SHO habitat after the Last Glacial Maxima, the PSMC trajectory does not return to historic levels. PSMC has little power to detect demographic change less than 20,000 years ago (Li & Durbin, 2011), however it is possible that increased human activities during this time-period impacted population numbers. This is in line with a recent study that attributed widespread declines in ruminant populations during the late Pleistocene to increasing human effective population size (L. Chen et al., 2019). Sequencing data from additional individuals will facilitate the reliable estimation of recent population size parameters using either site-frequency based methods or approximate Bayesian computation (Excoffier, Dupanloup, Huerta-Sánchez, Sousa, & Foll, 2013; Pujolar, Dalén, Hansen, & Madsen, 2017; Stoffel et al., 2018).
Implications for management
The outcome of this study provides an important foundation for future management of scimitar-horned oryx and supports the strategy currently in place to optimise the level of genetic diversity within release herds. In particular, our finding that genetic diversity is higher in individuals from captive breeding programmes reinforces previous results that informed the reintroduction program being carried out by the EAD. Here, individuals from European and American breeding programmes are being used to maximise the representation of contemporary global variation and increase the adaptive potential of release herds. Furthermore, our chromosomal-level genome assembly will provide a reference for generating mapped genomic markers in additional animals. This will facilitate individual-based studies into inbreeding, relatedness and admixture which will help improve breeding recommendations and enable hybrid assessment. Moreover, access to genome annotations will open up the opportunity for identifying putative loci associated with functional adaptation in both the wild and captivity. For example, predicting deleterious mutations in individuals within breeding programmes could help minimise the impacts of inbreeding depression (Grossen, Guillaume, Keller, & Croll, 2020; Robinson et al., 2019) whilst population-wide screening of loci conferring disease resistance could help identify suitable source populations for future release (Epstein et al., 2016). Finally, access to a genome assembly will facilitate the development of complementary genetic resources such as genotyping arrays. These would provide the possibility to genotype thousands of non-invasive samples and therefore have direct benefits for post-release monitoring and pedigree reconstruction (Wildt et al., 2019). Overall, these approaches will contribute towards an integrated global management strategy for the scimitar-horned oryx and support the transfer of genomics into applied conservation.
Conclusions
We have generated a chromosomal-level genome assembly and used whole genome resequencing to provide insights into both the contemporary and historical population of an iconic species of antelope. We uncovered relatively high levels of genetic diversity and a dynamic demographic history, punctuated by periods of large effective population size. These insights provide support for the notion that only very extreme and long-lasting bottlenecks lead to substantially reduced levels of genetic diversity. At the population level, we characterised differences in genetic variation between captive and semi-captive collections that emphasise the importance of meta-population management for maintaining genetic diversity in the remaining populations of scimitar-horned oryx.
Supplementary Material
Acknowledgements
We dedicate this paper with great fondness to our co-author, Dr David Wildt, who passed away in January 2020. We are grateful for the encouragement, enthusiasm and wisdom that he provided during this research, which is a testament to the many important contributions he made to conservation science of many endangered species during his career. We would like to thank the EAD and all EAZA and AZA SSP institutions that provided samples for this study. We would like to acknowledge Tania Gilbert at Marwell Wildlife for advice and for access to the international studbook. We would also like to thank Jack Humble for generating the chromosome rearrangement figure using D3.js. ELA was supported by an NSF Physics Frontiers Center Award (PHY1427654), the Welch Foundation (Q-1866), a USDA Agriculture and Food Research Initiative Grant (2017-05741), an NIH 4D Nucleome Grant (U01HL130010), and an NIH Encyclopedia of DNA Elements Mapping Center Award (UM1HG009375). Whole-genome resequencing was carried out by Edinburgh Genomics.
Data accessibility
The scimitar-horned oryx Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number JABAEV000000000. The version described in this paper is version JABAEV010000000. The scimitar-horned oryx 10X+Hi-C assembly is also available on the DNA ZOO website (www.dnazoo.org/assemblies/Oryx_dammah). The 10X read data have been deposited on NCBI under BioProject accession number PRJNA622505 and run accession number SRR11529474. The Hi-C paired-end read data have been deposited to the short-read archive (SRA) under accession number SRX5415931. Whole genome resequencing data have been deposited to the European Nucleotide Archive under study accession number PRJEB37295. Mitogenome sequences have been deposited on NCBI under accession numbers MT248292–MT248297. Code for the analysis of resequencing data is available at https://github.com/elhumble/SHO_reseq_2020 and https://github.com/elhumble/SHO_analysis_2020. Additional supplementary data are available from the Dryad Digital Repository.
References
- Humble E, Dobrynin P, Senn H, Chuven J, Scott AF, Mohr DW, Dudchenko O, Omer AD, Colaric Z, Lieberman Aiden E, Salem Al Dhaheri S, Wildt D, Oliaji S, Tamazian G, Pukazhenthi B, Ogden R, Koepfli KP; 2019; SHO_genome; NCBI; Version 1; PRJNA622505 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humble E, Dobrynin P, Senn H, Chuven J, Scott AF, Mohr DW, Dudchenko O, Omer AD, Colaric Z, Lieberman Aiden E, Salem Al Dhaheri S, Wildt D, Oliaji S, Tamazian G, Pukazhenthi B, Ogden R, Koepfli KP; 2019; SHO_10X_data; NCBI SRA; PRJNA622505 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humble E, Dobrynin P, Senn H, Chuven J, Scott AF, Mohr DW, Dudchenko O, Omer AD, Colaric Z, Lieberman Aiden E, Salem Al Dhaheri S, Wildt D, Oliaji S, Tamazian G, Pukazhenthi B, Ogden R, Koepfli KP; 2019; SHO_HiC_data; NCBI SRA; SRX5415931 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humble E, Dobrynin P, Senn H, Chuven J, Scott AF, Mohr DW, Dudchenko O, Omer AD, Colaric Z, Lieberman Aiden E, Salem Al Dhaheri S, Wildt D, Oliaji S, Tamazian G, Pukazhenthi B, Ogden R, Koepfli KP; 2019; SHO_resequencing data; European Nucleotide Archive; PRJEB37295 [Google Scholar]
- Humble E, Dobrynin P, Senn H, Chuven J, Scott AF, Mohr DW, Dudchenko O, Omer AD, Colaric Z, Lieberman Aiden E, Salem Al Dhaheri S, Wildt D, Oliaji S, Tamazian G, Pukazhenthi B, Ogden R, Koepfli KP; 2019; SHO_resequencing data; NCBI; MT248292–MT248297 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allendorf FW, Hohenlohe PA, & Luikart G (2010). Genomics and the future of conservation genetics. Nature Reviews Genetics, 11(10), 697–709. [DOI] [PubMed] [Google Scholar]
- Bao W, Kojima KK, & Kohany O (2015). Repbase Update, a database of repetitive elements in eukaryotic genomes. Mobile DNA, 6, 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Begun DJ, & Aquadro CF (1992). Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster. Nature, 356(6369), 519–520. [DOI] [PubMed] [Google Scholar]
- Beichman AC, Koepfli K-P, Li G, Murphy W, Dobrynin P, Kilver S, … Wayne RK (2019). Aquatic adaptation and depleted diversity: a deep dive into the genomes of the sea otter and giant otter. Molecular Biology and Evolution, 29(12), 712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beichman AC, Phung TN, & Lohmueller KE (2017). Comparison of single genome and allele frequency data reveals discordant demographic histories. G3, 7(11), 3605–3620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bibi F (2013). A multi-calibrated mitochondrial phylogeny of extant Bovidae (Artiodactyla, Ruminantia) and the importance of the fossil record to systematics. BMC Evolutionary Biology, 13(1), 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bradnam KR, Fass JN, Alexandrov A, Baranay P, Bechner M, Birol I, … Korf IF (2013). Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. Gigascience, 2(1), 1–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buchfink B, Xie C, & Huson DH (2015). Fast and sensitive protein alignment using DIAMOND. Nature Methods, 12(1), 59–60. [DOI] [PubMed] [Google Scholar]
- Busch JD, Waser PM, & DeWoody JA (2007). Recent demographic bottlenecks are not accompanied by a genetic signature in banner-tailed kangaroo rats (Dipodomys spectabilis). Molecular Ecology, 16(12), 2450–2462. [DOI] [PubMed] [Google Scholar]
- Carroll EL, Bruford MW, DeWoody JA, Leroy G, Strand A, Waits L, & Wang J (2018). Genetic and genomic monitoring with minimally invasive sampling methods. Evolutionary Applications, 11(7), 1094–1119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ceballos G, Ehrlich PR, & Dirzo R (2017). Biological annihilation via the ongoing sixth mass extinction signaled by vertebrate population losses and declines. Proceedings of the National Academy of Sciences of the United States of America, 47(30), 201704949–E6096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen GK, Marjoram P, & Wall JD (2009). Fast and flexible simulation of DNA sequence data. Genome Research, 19(1), 136–142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen L, Qiu Q, Jiang Y, Wang K, Lin Z, Li Z, … Wang W (2019). Large-scale ruminant genome sequencing provides insights into their evolution and distinct traits. Science, 364(6446). [DOI] [PubMed] [Google Scholar]
- Chikhi L, Rodríguez W, Grusea S, Santos P, Boitard S, & Mazet O (2018). The IICR (inverse instantaneous coalescence rate) as a summary of genomic diversity: insights into demographic inference and model choice. Heredity, 120(1), 13–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Çilingir FG, Seah A, Horne BD, Som S, Bickford DP, & Rheindt FE (2019). Last exit before the brink: Conservation genomics of the Cambodian population of the critically endangered southern river terrapin. Ecology and Evolution, 10(6), 720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, … Grp, 1000 Genomes Project Anal. (2011). The variant call format and VCFtools. Bioinformatics, 27(15), 2156–2158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dinerstein E, & McCracken GF (1990). Endangered greater one-horned rhinoceros carry high levels of genetic variation. Conservation Biology, 4(4), 417–422. [Google Scholar]
- Dobrynin P, Liu S, Tamazian G, Xiong Z, Yurchenko AA, Krasheninnikova K, … O’Brien SJ (2015). Genomic legacy of the African cheetah, Acinonyx jubatus. Genome Biology, 16(1), 277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doniger SW, Kim HS, Swain D, Corcuera D, Williams M, Yang S-P, & Fay JC (2008). A catalog of neutral and deleterious polymorphism in yeast. PLoS Genetics, 4(8), e1000183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dudchenko O, Batra SS, Omer AD, Nyquist SK, Hoeger M, Durand NC, … Aiden EL (2017). De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science, 356(6333), 92–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dudchenko O, Shamim MS, Batra SS, Durand NC, Musial NT, Mostofa R, … Aiden EL (2018). The Juicebox Assembly Tools module facilitates de novo assembly of mammalian genomes with chromosome-length scaffolds for under $1000. BioRxiv, 254797. [Google Scholar]
- Dupont L (2011). Orbital scale vegetation change in Africa. Quaternary Science Reviews, 30(25–26), 3589–3602. [Google Scholar]
- Durand NC, Shamim MS, Machol I, Rao SSP, Huntley MH, Lander ES, & Aiden EL (2016). Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Systems, 3(1), 95–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ekblom R, Brechlin B, Persson J, Smeds L, Johansson M, Magnusson J, … Ellegren H (2018). Genome sequencing and conservation genomics in the Scandinavian wolverine population. Conservation Biology, 32(6), 1301–1312. [DOI] [PubMed] [Google Scholar]
- El Alqamy H, Senn H, Roberts M-F, McEwing R, & Ogden R (2012). Genetic assessment of the Arabian oryx founder population in the Emirate of Abu Dhabi, UAE: an example of evaluating unmanaged captive stocks for reintroduction. Conservation Genetics, 13(1), 79–88. [Google Scholar]
- Ellegren H, & Galtier N (2016). Determinants of genetic diversity. Nature Reviews Genetics, 17(7), 422–433. [DOI] [PubMed] [Google Scholar]
- Epstein B, Jones M, Hamede R, Hendricks S, McCallum H, Murchison EP, … Storfer A (2016). Rapid evolutionary response to a transmissible cancer in Tasmanian devils. Nature Communications, 7, 12684. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Excoffier L, Dupanloup I, Huerta-Sánchez E, Sousa VC, & Foll M (2013). Robust demographic inference from genomic and SNP data. PLoS Genetics, 9(10), e1003905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farré M, Li Q, Zhou Y, Damas J, Chemnick LG, Kim J, … Lewin HA (2019). A near-chromosome-scale genome assembly of the gemsbok (Oryx gazella): an iconic antelope of the Kalahari desert. Gigascience, 8(2), 18644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frankham R, Ballou J, & Briscoe DA (2002). Introduction to conservation genetics. Cambridge: Cambridge University Press. [Google Scholar]
- Fritz J, Kramer R, Hoffmann W, Trobe D, & Unsöld M (2017). Back into the wild: establishing a migratory Northern bald ibis Geronticus eremita population in Europe. International Zoo Yearbook, 51(1), 107–123. [Google Scholar]
- Gallagher DS Jr, & Womack JE (1992). Chromosome Conservation in the Bovidae. Journal of Heredity, 83(4), 287–298. [DOI] [PubMed] [Google Scholar]
- Ge R-L, Cai Q, Shen Y-Y, San A, Ma L, Zhang Y, … Wang J (2013). Draft genome sequence of the Tibetan antelope. Nature Communications, 4, 1858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gilbert T (2019). International studbook for the scimitar-horned oryx Oryx dammah (Fourteenth edition). Winchester: Marwell Wildlife. [Google Scholar]
- Grossen C, Guillaume F, Keller LF, & Croll D (2020). Purging of highly deleterious mutations through severe bottlenecks in Alpine ibex. Nature Communications, 11(1), 1–12. doi: 10.1038/s41467-020-14803-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hahn M (2018). Molecular Population Genetics. Oxford, New York: Oxford University Press. [Google Scholar]
- Hailer F, Helander B, Folkestad AO, Ganusevich SA, Garstad S, Hauff P, … Ellegren H (2006). Bottlenecked but long-lived: high genetic diversity retained in white-tailed eagles upon recovery from population decline. Biology Letters, 2(2), 316–319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hassanin A, Delsuc F, Ropiquet A, Hammer C, Jansen van Vuuren B, Matthee C, … Couloux A (2012). Pattern and timing of diversification of Cetartiodactyla (Mammalia, Laurasiatheria), as revealed by a comprehensive analysis of mitochondrial genomes. Comptes Rendus Biologies, 335(1), 32–50. [DOI] [PubMed] [Google Scholar]
- Hauffe HC, & Searle JB (1998). Chromosomal heterozygosity and fertility in house mice (Mus musculus domesticus) from northern Italy. Genetics, 150(3), 1143–1154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hodgkinson A, & Eyre-Walker A (2011). Variation in the mutation rate across mammalian genomes. Nature Reviews Genetics, 12(11), 756–766. [DOI] [PubMed] [Google Scholar]
- Hohenlohe PA, Bassham S, Etter PD, Stiffler N, Johnson EA, & Cresko WA (2010). Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genetics, 6(2), e1000862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, von Mering C, & Bork P (2017). Fast genome-wide functional annotation through orthology assignment by eggnog-mapper. Molecular Biology and Evolution, 34(8), 2115–2122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, … Bork P (2019). eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Research, 47(D1), D309–D314. doi: 10.1093/nar/gky1085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humble E, Martinez-Barrio A, Forcada J, Trathan PN, Thorne MAS, Hoffmann M, … Hoffman JI (2016). A draft fur seal genome provides insights into factors affecting SNP validation and how to mitigate them. Molecular Ecology Resources, 16(4), 909–921. [DOI] [PubMed] [Google Scholar]
- IUCN SSC Antelope Specialist Group. (2016). Oryx dammah. The IUCN Red List of Threatened Species. Retrieved from 10.2305/IUCN.UK.2016-2.RLTS.T15568A50191470.en. [DOI] [Google Scholar]
- IUCN/SSC. (2013). Guidelines for reintroductions and other conservation translocations. Version 1.0. Gland, Switzerland: IUCN Species Survival Commission. [Google Scholar]
- Iyengar A, Gilbert T, Woodfine T, Knowles JM, Diniz FM, Brenneman RA, … MaClean N (2007). Remnants of ancient genetic diversity preserved within captive groups of scimitar-horned oryx (Oryx dammah). Molecular Ecology, 16(12), 2436–2449. [DOI] [PubMed] [Google Scholar]
- Kardos M, Luikart G, & Allendorf FW (2015). Measuring individual inbreeding in the age of genomics: marker-based measures are better than pedigrees. Heredity, 115(1), 63–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kardos M, Qvarnström A, & Ellegren H (2017). Inferring individual inbreeding and demographic history from segments of identity by descent in Ficedula flycatcher genome sequences. Genetics, 205(3), 1319–1334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kardos M, Taylor HR, Ellegren H, Luikart G, & Allendorf FW (2016). Genomics advances the study of inbreeding depression in the wild. Evolutionary Applications, 9(10), 1205–1218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kiełbasa SM, Wan R, Sato K, Horton P, & Frith MC (2011). Adaptive seeds tame genomic sequence comparison. Genome Research, 21(3), 487–493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knief U, Hemmrich-Stanisak G, Wittig M, Franke A, Griffith SC, Bart K, & Forstmeier W (2016). Fitness consequences of polymorphic inversions in the zebra finch genome. Genome Biology, 17(1), 1–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koepfli K-P, Paten B, Genome 10K Community of Scientists, & O’Brien SJ. (2015). The Genome 10K Project: a way forward. Annual Review of Animal Biosciences, 3, 57–111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koepfli K-P, Tamazian G, Wildt D, Dobrynin P, Kim C, Frandsen PB, … Pukazhenthi BS (2019). Whole genome sequencing and re-sequencing of the sable antelope (Hippotragus niger): A resource for monitoring diversity in ex situ and in situ populations. G3, 9(6), 1785–1793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Korneliussen TS, Albrechtsen A, & Nielsen R (2014). ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics, 15(1), 356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumamoto AT, Charter SJ, Kingswood SC, Ryder OA, & Gallagher DS (1999). Centric fusion differences among Oryx dammah, O. gazella, and O. leucoryx (Artiodactyla, Bovidae). Cytogenetics and Cell Genetics, 86(1), 74–80. [DOI] [PubMed] [Google Scholar]
- Lacy RC (1987). Loss of Genetic Diversity from Managed Populations: Interacting Effects of Drift, Mutation, Immigration, Selection, and Population Subdivision. Conservation Biology, 1(2), 143–158. [Google Scholar]
- Leigh JW, & Bryant D (2015). POPART: full-feature software for haplotype network construction. Methods in Ecology and Evolution, 6(9), 1110–1116. [Google Scholar]
- Li Haipeng, Xiang-Yu J., Dai G., Gu Z., Ming C., Yang Z., … Zhang Y-P. (2016). Large numbers of vertebrates began rapid population decline in the late 19th century. Proceedings of the National Academy of Sciences of the United States of America, 113(49), 14079–14084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Heng. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics (Oxford, England), 27(21), 2987–2993. doi: 10.1093/bioinformatics/btr509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Heng. (2013). Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. ArXiv. [Google Scholar]
- Li Heng, & Durbin R. (2011). Inference of human population history from individual whole-genome sequences. Nature, 475(7357), 493–496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lorenzen ED, Heller R, & Siegismund HR (2012). Comparative phylogeography of African savannah ungulates. Molecular Ecology, 21(15), 3656–3670. [DOI] [PubMed] [Google Scholar]
- Martin SH, Möst M, Palmer WJ, Salazar C, McMillan WO, Jiggins FM, & Jiggins CD (2016). Natural selection and genetic diversity in the butterfly Heliconius melpomene. Genetics, 203(1), 525–541. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mays HL, Hung C-M, Shaner P-J, Denvir J, Justice M, Yang S-F, … Primerano DA (2018). Genomic analysis of demographic history and ecological niche modeling in the endangered sumatran rhinoceros Dicerorhinus sumatrensis. Current Biology, 28(1), 70–76.e4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mazet O, Rodríguez W, Grusea S, Boitard S, & Chikhi L (2016). On the importance of being structured: instantaneous coalescence rates and human evolution – lessons for ancestral population size inference? Heredity, 116(4), 362–371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nadachowska-Brzyska K, Li C, Smeds L, Zhang G, & Ellegren H (2015). Temporal dynamics of avian populations during pleistocene revealed by whole-genome sequences. Current Biology, 25(10), 1375–1380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nei M, Maruyama T, & Chakraborty R (1975). The bottleneck effect and genetic variability in populations. Evolution, 29(1), 1–10. [DOI] [PubMed] [Google Scholar]
- Nishimura O, Hara Y, & Kuraku S (2017). gVolante for standardizing completeness assessment of genome and transcriptome assemblies. Bioinformatics, 33(22), 3635–3637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nordborg M, Hu TT, Ishino Y, Jhaveri J, Toomajian C, Zheng H, … Bergelson J (2005). The pattern of polymorphism in Arabidopsis thaliana. PLoS Biology, 3(7), e196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ogden R, Chuven J, Gilbert T, Hosking C, Gharbi K, Craig M, … Senn H (2020). Benefits and pitfalls of captive conservation genetic management: Evaluating diversity in scimitar-horned oryx to support reintroduction planning. Biological Conservation, 241, 108244. doi: 10.1016/j.biocon.2019.108244 [DOI] [Google Scholar]
- Palkopoulou E, Mallick S, Skoglund P, Enk J, Rohland N, Li H, … Dalén L (2015). Complete genomes reveal signatures of demographic and genetic declines in the woolly mammoth. Current Biology, 25(10), 1395–1400. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paradis E (2010). pegas: an R package for population genetics with an integrated-modular approach. Bioinformatics, 26(3), 419–420. [DOI] [PubMed] [Google Scholar]
- Prado-Martinez J, Sudmant PH, Kidd JM, Li H, Kelley JL, Lorente-Galdos B, … Marques-Bonet T (2013). Great ape genetic diversity and population history. Nature, 499(7459), 471–475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pujolar JM, Dalén L, Hansen MM, & Madsen J (2017). Demographic inference from whole-genome and RAD sequencing data suggests alternating human impacts on goose populations since the last ice age. Molecular Ecology, 26(22), 6270–6283. [DOI] [PubMed] [Google Scholar]
- Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, … Aiden EL (2014). A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell, 159(7), 1665–1680. doi: 10.1016/j.cell.2014.11.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Redford KH, Jensen DB, & Breheny JJ (2012). Integrating the captive and the wild. Science, 338(6111), 1157–1158. [DOI] [PubMed] [Google Scholar]
- Robert A (2009). Captive breeding genetics and reintroduction success. Biological Conservation, 142(12), 2915–2922. [Google Scholar]
- Robinson JA, Ortega-Del Vecchyo D, Fan Z, Kim BY, VonHoldt BM, Marsden CD, … Wayne RK (2016). Genomic flatlining in the endangered island fox. Current Biology, 26(9), 1183–1189. [DOI] [PubMed] [Google Scholar]
- Robinson JA, Räikkönen J, Vucetich LM, Vucetich JA, Peterson RO, Lohmueller KE, & Wayne RK (2019). Genomic signatures of extensive inbreeding in Isle Royale wolves, a population on the threshold of extinction. Science Advances, 5(5), eaau0757. doi: 10.1126/sciadv.aau0757 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Russell WC, Thorne ET, Oakleaf R, & Ballou J (1994). The genetic basis of black-footed ferret reintroduction. Conservation Biology, 8(1), 263–266. [Google Scholar]
- Salzberg SL, & Yorke JA (2005). Beware of mis-assembled genomes. Bioinformatics, 21(24), 4320–4321. [DOI] [PubMed] [Google Scholar]
- Schrider DR, Shanku AG, & Kern AD (2016). Effects of linked selective sweeps on demographic inference and model selection. Genetics, 204(3), 1207–1223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shafer ABA, Peart CR, Tusso S, Maayan I, Brelsford A, Wheat CW, & Wolf JBW (2017). Bioinformatic processing of RAD-seq data dramatically impacts downstream population genetic inference. Methods in Ecology and Evolution, 8(8), 907–917. [Google Scholar]
- Shafer ABA, Wolf JBW, Alves PC, Bergström L, Bruford MW, Brännström I, … Zieliński P (2015). Genomics and the challenging translation into conservation practice. Trends in Ecology & Evolution, 30(2), 78–87. [DOI] [PubMed] [Google Scholar]
- Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, & Zdobnov EM (2015). BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics, 31(19), 3210–3212. [DOI] [PubMed] [Google Scholar]
- Spalton A (1993). A brief history of the reintroduction of the Arabian oryx Oryx leucoryx into Oman 1980–1992. International Zoo Yearbook, 32(1), 81–90. [Google Scholar]
- Stanke M, Keller O, Gunduz I, Hayes A, Waack S, & Morgenstern B (2006). AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Research, 34(Web Server issue), W435–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steiner CC, Charter SJ, Goddard N, Davis H, Brandt M, Houck ML, & Ryder OA (2015). Chromosomal variation and perinatal mortality in San Diego zoo Soemmerring’s gazelles. Zoo Biology, 34(4), 374–384. [DOI] [PubMed] [Google Scholar]
- Stoffel MA, Humble E, Paijmans AJ, Acevedo Whitehouse K, Chilvers BL, Dickerson B, … Hoffman JI (2018). Demographic histories and genetic diversity across pinnipeds are shaped by human exploitation, ecology and life-history. Nature Communications, 9(1), 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Supple MA, & Shapiro B (2018). Conservation of biodiversity in the genomics era. Genome Biology, 19(1), 131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tallmon D, Luikart G, & Waples R (2004). The alluring simplicity and complex reality of genetic rescue. Trends in Ecology & Evolution, 19(9), 489–496. [DOI] [PubMed] [Google Scholar]
- The International SNP Map Working Group. (2001). A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature, 409(6822), 928–933. [DOI] [PubMed] [Google Scholar]
- Väli Ü, Einarsson A, Waits L, & Ellegren H (2008). To what extent do microsatellite markers reflect genome-wide genetic diversity in natural populations? Molecular Ecology, 17(17), 3808–3817. [DOI] [PubMed] [Google Scholar]
- Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, … DePristo MA (2013). From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Current Protocols in Bioinformatics, 43, 11.10.1–11.10.33. doi: 10.1002/0471250953.bi1110s43 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Dijk EL, Jaszczyszyn Y, Naquin D, & Thermes C (2018). The third revolution in sequencing technology. Trends in Genetics, 34(9), 666–681. [DOI] [PubMed] [Google Scholar]
- Wallace BMN, Searle JB, & Everett CA (2002). The effect of multiple simple Robertsonian heterozygosity on chromosome pairing and fertility of wild-stock house mice (Mus musculus domesticus). Cytogenetic and Genome Research, 96(1–4), 276–286. [DOI] [PubMed] [Google Scholar]
- Wang K, Wang L, Lenstra JA, Jian J, Yang Y, Hu Q, … Liu J (2017). The genome sequence of the wisent (Bison bonasus). Gigascience, 6(4), 1–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weeks AR, Sgrò CM, Young AG, Frankham R, Mitchell NJ, Miller KA, … Hoffmann AA (2011). Assessing the benefits and risks of translocations in changing environments: a genetic perspective. Evolutionary Applications, 4(6), 709–725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Westbury MV, Petersen B, Garde E, Heide-Jørgensen MP, & Lorenzen ED (2019). Narwhal genome reveals long-term low genetic diversity despite current large abundance size. IScience, 15, 592–599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wheeler TJ, Clements J, Eddy SR, Hubley R, Jones TA, Jurka J, … Finn RD (2013). Dfam: a database of repetitive DNA based on profile hidden Markov models. Nucleic Acids Research, 41(D1), D70–D82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wildt D, Miller P, Koepfli K-P, Pukazhenthi B, Palfrey K, Livingston G, … Snodgrass K (2019). Breeding centers, private ranches, and genomics for creating sustainable wildlife populations. BioScience, 69(11), 928–943. doi: 10.1093/biosci/biz091 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Willoughby JR, Ivy JA, Lacy RC, Doyle JM, & DeWoody JA (2017). Inbreeding and selection shape genomic diversity in captive populations: Implications for the conservation of endangered species. PLoS ONE, 12(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Willoughby JR, Sundaram M, Wijayawardena BK, Kimble SJA, Ji Y, Fernandez NB, … DeWoody JA (2015). The reduction of genetic diversity in threatened vertebrates and new recommendations regarding IUCN conservation rankings. Biological Conservation, 191, 495–503. [Google Scholar]
- Woodfine T, & Gilbert T (2016). The Fall and Rise of the Scimitar-Horned Oryx. In Antelope Conservation (pp. 280–296). doi: 10.1002/9781118409572.ch14 [DOI] [Google Scholar]
- Wurster DH, & Benirschke K (1968). Chromosome studies in the superfamily Bovoidea. Chromosoma, 25(2), 152–171. [DOI] [PubMed] [Google Scholar]
- Zhang H, Meltzer P, & Davis S (2013). RCircos: an R package for Circos 2D track plots. BMC Bioinformatics, 14, 244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zimin AV, Delcher AL, Florea L, Kelley DR, Schatz MC, Puiu D, … Salzberg SL (2009). A whole-genome assembly of the domestic cow, Bos taurus. Genome Biology, 10(4), R42. doi: 10.1186/gb-2009-10-4-r42 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The scimitar-horned oryx Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number JABAEV000000000. The version described in this paper is version JABAEV010000000. The scimitar-horned oryx 10X+Hi-C assembly is also available on the DNA ZOO website (www.dnazoo.org/assemblies/Oryx_dammah). The 10X read data have been deposited on NCBI under BioProject accession number PRJNA622505 and run accession number SRR11529474. The Hi-C paired-end read data have been deposited to the short-read archive (SRA) under accession number SRX5415931. Whole genome resequencing data have been deposited to the European Nucleotide Archive under study accession number PRJEB37295. Mitogenome sequences have been deposited on NCBI under accession numbers MT248292–MT248297. Code for the analysis of resequencing data is available at https://github.com/elhumble/SHO_reseq_2020 and https://github.com/elhumble/SHO_analysis_2020. Additional supplementary data are available from the Dryad Digital Repository.