Skip to main content
G3: Genes | Genomes | Genetics logoLink to G3: Genes | Genomes | Genetics
. 2019 Dec 5;10(2):437–442. doi: 10.1534/g3.119.400747

Genome Assembly and Analysis of the North American Mountain Goat (Oreamnos americanus) Reveals Species-Level Responses to Extreme Environments

Daria Martchenko *,1, Rayan Chikhi , Aaron B A Shafer *,
PMCID: PMC7003085  PMID: 31806764

Abstract

The North American mountain goat (Oreamnos americanus) is an iconic alpine species that faces stressors from climate change, industrial development, and recreational activities. This species’ phylogenetic position within the Caprinae lineage has not been resolved and their phylogeographic history is dynamic and controversial. Genomic data could be used to address these questions and provide valuable insights to conservation and management initiatives. We sequenced short-read genomic libraries constructed from a DNA sample of a 2.5-year-old female mountain goat at 80X coverage. We improved the short-read assembly by generating Chicago library data and scaffolding using the HiRise approach. The final assembly was 2,506 Mbp in length with an N50 of 66.6 Mbp, which is within the length range and in the upper quartile for N50 published ungulate genome assemblies. Comparative analysis identified 84 gene families unique to the mountain goat. The species demographic history in terms of effective population size generally mirrored climatic trends over the past one hundred thousand years and showed a sharp decline during the last glacial maximum. This genome assembly will provide a reference basis for future population and comparative genomic analyses.

Keywords: de novo genome assembly, HiRise scaffolding, demography, PSMC, mountain goat, Caprinae


Mountain goats (Oreamnos americanus) are a symbol of alpine wilderness and belong to the Caprinae subfamily of ungulates (hoofed mammals) that are known for their exceptional climbing ability (Figure 1). As northern alpine specialists, mountain goats are vulnerable to climate change (Pettorelli et al. 2007) and face pressures from industrial development, recreational activities, and hunting (Côté and Festa-Bianchet 2003). There is a need to provide novel tools and information to support conservation and management initiatives as it pertains to this enigmatic species.

Figure 1.

Figure 1

North American mountain goat female stomping to assert her dominance at Eulachon Creek, Alaska, USA. Mountain goats display a strong social structure and one of the highest levels of aggressiveness among ungulates. Photo by K.S. White.

Mountain goats are not true goats (i.e., genus Capra), but diverged early within the Caprinae lineage and are typically associated with the chamois (Rupicapra rupicapra) and goral (Naemorhedus spp.); their phylogenetic position within the Caprinae is not resolved, but sampling of nuclear and mitochondrial genes suggested an early and independent divergence within the lineage (Shafer and Hall 2010). Similarly, the mountain goats phylogeographic history appears dynamic and is contentious (Shafer et al. 2011a, 2011b). Throughout the last glacial period most of the present-day range was covered with ice (Shafer et al. 2010), indicative of multiple refugia and contraction and expansion events of subpopulations (Shafer et al. 2011a). Within their present-day range, mountain goat populations are not continuous and dispersal is limited (Côté and Festa-Bianchet 2003), that combined with regional variability in habitat and climate, support the hypothesis of locally adapted populations across the range.

With the rapid development and lowering costs of sequencing technologies, the use of genomics in conservation and wildlife management is becoming more widespread. Over 20 ungulate genomes have been assembled (Martchenko et al. 2018), with chromosome-level assemblies for several commercially important ungulate species (ARS-UCD1.2, Oar_v3.1, ARS1), and an increasing number of draft assemblies for wild ungulates (Li et al. 2017; Zhang et al. 2018; Bana et al. 2018; Wang et al. 2019). This draft genome of the North American mountain goat adds to this list of ungulate genomes, and will be used for demographic analyses (e.g., Miller et al. 2015), comparative genomic studies (e.g., Orlando et al. 2013; Agaba et al. 2016), and testing for putative local adaptation (e.g., Savolainen et al. 2013; Fitzpatrick and Keller 2015).

Materials and Methods

Library construction

A tissue sample was obtained from a spleen of a 2.5-year-old female harvested on Revillagigedo Island, Alaska, USA and frozen promptly after harvest. High quality molecular DNA was extracted using the phenol chlorophorm method (Sambrook and Russell 2006). Two TruSeq Nano genomic library preparations with an insert size of 500 bp were constructed and sequenced on an Illumina HiSeqX platform with 150 bp paired-end reads. A total of 214 Gbp of raw short-read data were generated.

We trimmed the adapters and the low quality bases from the reads with BBDuk (part of BBTools) (Bushnell 2018) to a minimum base quality 20 and a minimum read length 50 bp after a quality assessment with FastQC (Andrews 2010). To avoid any potential contamination of the genome with bacterial or viral sequences, we screened the trimmed reads with Kraken (Wood and Salzberg 2014) using the full standard database (Oct. 2017 release). A total of 0.95% of the reads were classified as belonging to a known bacterial or viral taxon and were removed. The final short-read dataset comprised 176 Gbp.

Three Chicago libraries were prepared (Dovetail Genomics) following the approach in Putnam et al. (2016). Briefly, Chicago library preparation involves in vitro chromatin reconstitution to generate libraries with separations between reads up to maximum fragment size of the input DNA. These libraries were sequenced on one lane of the Illumina HiSeqX instrument generating a total of 104 Gbp of sequence data.

Genome assembly

A k-mer size of 101 was selected for the short-read dataset with KmerGenie (Chikhi and Medvedev 2014). The draft de novo assembly was completed with the Meraculous assembler v. 2.0.4 (Chapman et al. 2011) using the following options: ‘diploid_mode 1’, ‘min_depth_cutoff 0’ to allow for auto-detection of k-mer frequency cutoff by the assembler, ‘no_read_validation 0’ to decrease the runtime as the reads were trimmed and screened prior to the assembly, and ‘gap_close_aggressive 1’ to remove the uniqueness requirement and use the most common sequence obtained from potential gap-closing reads. For comparison purposes, the short-read data were also assembled with MEGAHIT v. 1.1.1 (Li et al. 2015) using ‘k-list 101’ option.

We improved the short-read assembly by scaffolding with the Chicago library data using the HiRise pipeline software (Putnam et al. 2016). The HiRise software uses a likelihood model of the Chicago data to infer the genomic distance between the read pairs, which is then used to scaffold and check the orientation of draft scaffolds.

Quality assessment

To assess the assembly quality, we mapped the raw short-insert library reads to the assembled genome with bowtie2 (Langmead and Salzberg 2012). We also aligned the genome against itself with minimap2 (Li 2018) with options ‘-ax asm5 -X’ to check for putatively artifactual duplications, reflected by sequence overlaps between scaffolds (excluding self-hits). We ordered the SAM file by decreasing alignment scores in order to identify large regions that are similar between different scaffolds.

To validate the genome assembly and assess potential contamination and sequencing bias we analyzed the contig and scaffold assemblies with KAT (Mapleson et al. 2017). KAT plots the k-mer spectra – the number of distinct k-mers seen once, twice, three times etc. and compares it between the raw reads and the genome assembly. We calculated an assembly quality value (QV) following the method of Bickhart et al. (2017) and ran FRC_align (Vezzi et al. 2012) to identify any problematic regions in the genome. We also evaluated the quality of the assembled genome using BUSCO V3 and the mammalia odb9 database.

Genome annotation

We identified and classified the repeat regions of the assembled genome using RepeatMasker v. 4.0.8 (Smit et al. 2013). We configured RepeatMasker with RMBlast v. 2.6.0 sequence search engine, Tandem Repeat Finder v. 4.0.9 (Benson 1999), Dfam_Consensus database (20181026 release) and RepBase RepeatMasker edition (20181026 release) (Bao et al. 2015) and used ‘-species artiodactyl’ parameter for the analysis.

We then annotated scaffolds that were greater than 2500 bp (n = 482) using both homology-based and de novo predictions. Proteins from Homo sapiens, Equus caballus, Bos taurus, and Ovis aries (all Ensembl 89 release (Hunt et al. 2018)) were aligned to genome using blastx v. 2.7.1. We also used NCBI mammalian RefSeq transcipt database v. 1.0 and BLAT v. 1.0 to help identify exon structure. For de novo prediction we first mapped RNAseq data (SRX1947618) from Capra hircus, the most closely related species available on NCBI Sequence Read Archive, to the genome using HISAT2 v. 2.10 (Kim et al. 2015) and used these data for prediction with Augustus v. 3.1.1 (Stanke et al. 2006). EVidenceModeler v. 1.1.1 (Haas et al. 2008) was used to generate a consensus gene set with the following default weighting scheme (Humann et al. 2018): gene prediction via Augustus (1x weight); protein alignments via blastx (5x weight); transcript alignment via blat (10x weight). Lastly, we used PASA v. 2.3.3 (Haas et al. 2008) to refine our gene identifications.

To assign function to the newly annotated genes we aligned them to the NCBI mammalian RefSeq protein database using blastp v. 2.7.1 with a maximum HSP distance of 30,000 and e-value of 1e-8. The Interpro v 5.29-68 and KEGG databases were screened to annotate domains and identify pathways based on the top blast hit. We used Infernal v. 1.1.1 (Nawrocki and Eddy 2013) and the Rfam database release 14.1 to annotate miRNA, rRNA, and snRNA genes; tRNAs were detected using tRNAscan-SE version 2.0.1 (Lowe and Eddy 1997).

Species-specific genes, phylogeny and demographic history

Using the predicted protein sequences of Oreamnos americanus, we analyzed the orthologs shared between Capra hircus, Equus caballus, Bos taurus, Sus scrofa, Ovis aries and Oreamnos americanus with OrthoVenn (Wang et al. 2015). We also used PHYLUCE v. 1.5.0 (Faircloth 2016) to identify orthologous UCE sequences between Sus scrofa, Equus caballus, Bos taurus, Capra hircus, and Ovis aries and Oreamnos americanus using the 5k amniote probe set (Faircloth et al. 2012). We extracted UCEs with 250 bp flanks totaling 11.9 aligned Mbp. We conducted a maximum likelihood phylogenetic analysis using RAxML v. 8 (Stamatakis 2014) under a GTRGAMMAI substitution model selected with jModelTest 2 (Darriba et al. 2012). The phylogenetic tree was constructed with PAML MCMCtree v. 4.9 (Yang 2007) and calibrated with the divergence time of goat and cow (18.3–28.5 Mya) (Benton and Donoghue 2007).

Lastly, using a random 30X subset of the sequencing data, we modeled the historical effective population size (Ne) for the North American mountain goat using PSMC [18]. We used the default parameters of 64 atomic time intervals (-p “4+25*2+4+6”), a generation time of 6 years (Li and Durbin 2011) and mutation rate µ= 1.33*10−8 mutations/site/generation, calculated as the average mammalian mutation rate of 2.22*10−9 mutations/site/year (Kumar and Subramanian 2002) multiplied by the generation time of 6 years.

Data availability

The raw sequence data have been deposited in the Short Read Archive (SRA) under accession number PRJNA510081. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession WJNR00000000. The version described in this paper is version WJNR01000000. Supplemental material available at figshare: https://doi.org/10.25387/g3.9884003.

Results and Discussion

Genome assembly

The short-read assembly with Meraculous (Chapman et al. 2011) produced a genome consisting of 172,540 scaffolds with an N50 of 29.0 Kb and total genome size of 2.5 Gbp. For comparison purposes, the short-read data were also assembled with MEGAHIT v. 1.1.1 (Li et al. 2015), which produced a less continuous contig assembly of 2.6 Gbp compared to the Meraculous contig assembly (Tables S1 and S2).

The final assembly including Chicago libraries was 2.5 Gbp in length and consisted of 3,217 scaffolds that have an N50 of 66.6 Mbp (Table 1). The assembled genome is 93% of the predicted size estimated by KmerGenie (Chikhi and Medvedev 2014). Compared to the other ungulate species, the N50 of the mountain goat assembly is in the top quartile and the assembly length is mid-range (Table S3) (Miller et al. 2015; Bickhart et al. 2017; Williams et al. 2017; Bana et al. 2018). The genomic GC content is 41.67%, compared to 41.97% for non-primate mammalian animals (Li and Du 2014).

Table 1. Genome assembly statistics for the North American mountain goat for the short-read data and subsequent scaffolding.

Short-read assembly Scaffolded + Chicago assembly
Total length 2,489 Mbp 2,506 Mbp
N50 / L50 29.0 Kbp / 24,137 scaffolds 66,617 Kbp / 13 scaffolds
N90 / L90 6.5 Kbp / 92,814 scaffolds 18,734 Kbp / 37 scaffolds
Number of scaffolds 172,540 3,217
N count (% of genome) 530,049 (0.02%) 17,462,549 (0.70%)
Gaps 23,758 193,083

Quality assessment

Mapping the sequencing reads back to the assembled genome can be used to identify misassemblies and check assembly quality (Muller et al. 2013). The overall alignment rate of mapped the raw short-insert library reads to the final scaffolded assembly was 97%; 80% of the reads aligned concordantly exactly 1 time and 14% of the reads aligned concordantly more than once (Table S4), indicating the high quality of the assembly. For reference, when the same analysis was completed for the high quality assembly of the domestic goat, 89% of the raw sequencing reads mapped back to the assembled genome (Dong et al. 2013). To detect potential long duplicated regions in the final scaffolded assembly, we aligned the genome assembly against itself. Sixteen out of 30,472 entries in the SAM file had scores over 1000, ranging from 1011 to 1488. By examining the CIGAR strings of each of those entries, we concluded that the assembly did not contain duplicated segments across scaffolds longer than 2 Kbp. The QV score of the genome was 41.8 and the FRC output was consistent with short-read domestic goat genome assemblies (Bickhart et al. 2017; Table S6).

For both contig and scaffold assemblies for the mountain goat (Fig. S1 and S2) the number of distinct k-mers (k = 27) with over 1X coverage, i.e., k-mers that are not single-copy in the assembly, is low, which suggests low levels of sequencing bias and contamination. The 1X k-mer distribution has a much larger first peak, suggestive of an inbred individual, which is ideal for assembly purposes and why our sample was selected. This mountain goat is descended from a founding population of less than 17 individuals (Smith and Nichols 1984); further the mating system of mountain goats is such that only a small number of males sire most offspring (Mainguy et al. 2009), suggesting this individual should have low levels of heterozygosity.

A total of 4,104 single-copy orthologs were screened against the assembled mountain goat genome with 3,817 (93%) being complete, and 287 missing or fragmented BUSCOS (7%); this is in the same range as the BUSCO scores reported for other ungulate species (Li et al. 2017; Zhang et al. 2018). Similarly, we detected 471 of 481 ultraconserved elements (UCEs) (Bejerano et al. 2004) using scripts available in PHYLUCE v. 1.5.0 (Faircloth 2016) which is comparable to high quality mammalian genomes (Seemann et al. 2016).

Genome annotation

Long interspersed nuclear elements (LINEs) comprised 27.2% of the assembled genome, short interspersed nuclear elements (SINEs) comprised 11.4%, and in total repeats represented 46.7% of the assembly (Table 2). The repeat content is consistent with other ungulate genomes (Dong et al. 2013; Li et al. 2017; Zhang et al. 2018).

Table 2. Summary of repeats in the assembled genome of Oreamnos americanus.

Length (bp) Percentage of sequence
Total interspersed repeats: 1,146,349,974 45.74%
 SINEs 286,943,974 11.45%
 LINEs 681,473,093 27.19%
 LTR elements 122,655,871 4.89%
 DNA elements 54,579,895 2.18%
 Unclassified 697,141 0.03%
Satellites 3,429,492 0.14%
Simple repeats 17,168,694 0.69%
Low complexity 3,546,216 0.14%
Total: 1,170,891,890 46.72%

Our annotation pipeline of the longest scaffolds resulted in 22,292 total genes with 16,012 being protein coding. Of these, the Interpro v 5.29-68 databases identified 13,874 (74%) genes with information, and 9,460 (51%) having a gene ontology assignment. Similarly, RefSeq protein database detected a a total of 14,470 genes (78%) with NCBI RefSeq match. Lastly, the Interpro v 5.29-68 databases annotated 13,874 (74%) genes with information, with 9,460 (51%) having a gene ontology assignment. A total of 34,819 putative noncoding RNA sequences were identified encompassing 0.1% of the genome and is comparable to other ungulates (Zhang et al. 2018).

Species-specific genes, phylogeny and demographic history

A total of 84 gene families were found to be specific to the mountain goat (Figure 2a). Seventy-three orthologous clusters were uniquely shared between Ovis aries and O. americanus, 33 between Capra hircus and O. americanus, and 41 between Bos taurus and O. americanus. The gene families unique to mountain goat were enriched for ferroxidase activity, transcription regulation and protein folding and stability; we hypothesize that enrichment for the ferroxidase activity could potentially have allowed mountain goats to adapt to their alpine environment due to its link to erythropoiesis and altitude (Cherukuri et al. 2004).

Figure 2.

Figure 2

a) A Venn diagram of orthologous clusters shared among mountain goat (Oreamnos americanus), pig (Sus scrofa), horse (Equus caballus), cow (Bos taurus), goat (Capra hircus) and sheep (Ovis aries). The numbers represent the number of orthologous clusters and only the clusters unique to each species and shared with O. americanus are labeled for clarity. b) Phylogenetic relationships and divergence times between the six species estimated from ultraconserved elements (UCEs) and MCMCtree. The calibration points are indicated by orange dots; node bars encompass the 95% credible intervals.

The phylogenetic analysis (Figure 2b) supports the mountain goat lineage as ancestral to sheep and goats as expected (Shafer and Hall 2010). There was no relationship between the assembly strategy, number of indels in UCEs, and phylogenetic placement (Table S7). The mitochondrial genome phylogeny confirmed a similar relationship (Fig. S3). As the ice sheets progressed during the last glacial period, the Earth experienced a decrease in average surface temperature; the mountain goat Ne reflects these temperature patterns, with a general downward trend in Ne from 200 kya to 35 kya (Figure 3). PSMC analysis suggested a near 10-fold decrease in the Ne of mountain goats at the end of the last glacial period and the start of the Holocene likely due to warming and loss of suitable habitat.

Figure 3.

Figure 3

Historical effective population size (Ne) of the North American mountain goat estimated with PSMC analysis assuming a mutation rate µ of 1.33*10−8 mutations/site/generation and a generation time of 6 years. Atmospheric and ocean temperatures in °C are given on the vertical right axis, Ne in units of 10,000 individuals on the left vertical axis, and time in Mya on the logarithmic horizontal axis.

Conclusion

This is the first genome assembly and annotation for the North American mountain goat, and the first de novo assembly of a wild caprid. The biological sample came from an island with a small founding population (Smith and Nichols 1984) making it ideal for genome assembly. Using the newly assembled genome we identified species-specific genes and modeled the historical population demography of the species that showed a dramatic decline at the height of the last glaciation. Relative to other ungulate genomes, including economically valuable domesticated species that incorporated long-read sequencing strategies, this is among the highest quality wild ungulate genomes to date (Martchenko et al. 2018).

Acknowledgments

This research was supported by NSERC Discovery Grant, Alaska Department of Fish and Game Federal Aid, Compute Canada award to ABAS and INCEPTION grant (PIA/ANR-16-CONV-0005) to RC. We thank Maël Kerbiriou and Dr. Antoine Limasset from Université de Lille who provided valuable insights for this study. DM is supported by an Ontario Graduate Scholarship; the Globalink Travel Award that made this collaboration possible was provided by Mitacs and Inria. We thank two anonymous reviewers for comments that improved the manuscript.

Footnotes

Supplemental material available at figshare: https://doi.org/10.25387/g3.9884003.

Communicating editor: M. Hufford

Literature Cited

  1. Agaba M., Ishengoma E., Miller W. C., McGrath B. C., Hudson C. N. et al. , 2016.  Giraffe genome sequence reveals clues to its unique morphology and physiology. Nat. Commun. 7: 11519 10.1038/ncomms11519 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Andrews, S., 2010 Babraham Bioinformatics - FastQC A Quality Control tool for High Throughput Sequence Data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
  3. Bana N. Á., Nyiri A., Nagy J., Frank K., Nagy T. et al. , 2018.  The red deer Cervus elaphus genome CerEla1.0: sequencing, annotating, genes, and chromosomes. Mol. Genet. Genomics 293: 665–684. 10.1007/s00438-017-1412-3 [DOI] [PubMed] [Google Scholar]
  4. Bao W., Kojima K. K., and Kohany O., 2015.  Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 6: 11 10.1186/s13100-015-0041-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bejerano G., Pheasant M., Makunin I., Stephen S., Kent W. J. et al. , 2004.  Ultraconserved elements in the human genome. Science 304: 1321–1325. 10.1126/science.1098119 [DOI] [PubMed] [Google Scholar]
  6. Benson G., 1999.  Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27: 573–580. 10.1093/nar/27.2.573 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Benton M. J., and Donoghue P. C. J., 2007.  Paleontological evidence to date the tree of life. Mol. Biol. Evol. 24: 26–53. Erratum: 889–891. 10.1093/molbev/msl150 [DOI] [PubMed] [Google Scholar]
  8. Bickhart D. M., Rosen B. D., Koren S., Sayre B. L., Hastie A. R. et al. , 2017.  Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome. Nat. Genet. 49: 643–650. 10.1038/ng.3802 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bushnell, B., 2018 BBTools. Available online at: https://jgi.doe.gov/data-and-tools/bbtools/.
  10. Chapman J. A., Ho I., Sunkara S., Luo S., Schroth G. P. et al. , 2011.  Meraculous: de novo genome assembly with short paired-end reads. PLoS One 6: e23501 10.1371/journal.pone.0023501 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cherukuri S., Tripoulas N. A., Nurko S., and Fox P. L., 2004.  Anemia and impaired stress-induced erythropoiesis in aceruloplasminemic mice. Blood Cells Mol. Dis. 33: 346–355. 10.1016/j.bcmd.2004.07.003 [DOI] [PubMed] [Google Scholar]
  12. Chikhi R., and Medvedev P., 2014.  Informed and automated k-mer size selection for genome assembly. Bioinformatics 30: 31–37. 10.1093/bioinformatics/btt310 [DOI] [PubMed] [Google Scholar]
  13. Côté S. D., and Festa-Bianchet M., 2003.  Mountain goat, Wild Mammals of North America: Biology, Management, Conservation, edited by Feldhamer G. A., Thompson B., and Chapman J.. The John Hopkins University Press, Baltimore, Maryland. [Google Scholar]
  14. Darriba D., Taboada G. L., Doallo R., and Posada D., 2012.  jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9: 772 10.1038/nmeth.2109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Dong Y., Xie M., Jiang Y., Xiao N., Du X. et al. , 2013.  Sequencing and automated whole-genome optical mapping of the genome of a domestic goat (Capra hircus). Nat. Biotechnol. 31: 135–141. 10.1038/nbt.2478 [DOI] [PubMed] [Google Scholar]
  16. Faircloth B. C., 2016.  PHYLUCE is a software package for the analysis of conserved genomic loci. Bioinformatics 32: 786–788. 10.1093/bioinformatics/btv646 [DOI] [PubMed] [Google Scholar]
  17. Faircloth B. C., McCormack J. E., Crawford N. G., Harvey M. G., Brumfield R. T. et al. , 2012.  Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales. Syst. Biol. 61: 717–726. 10.1093/sysbio/sys004 [DOI] [PubMed] [Google Scholar]
  18. Fitzpatrick M. C., and Keller S. R., 2015.  Ecological genomics meets community-level modelling of biodiversity: mapping the genomic landscape of current and future environmental adaptation. Ecol. Lett. 18: 1–16. 10.1111/ele.12376 [DOI] [PubMed] [Google Scholar]
  19. Haas B. J., Salzberg S. L., Zhu W., Pertea M., Allen J. E. et al. , 2008.  Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9: R7 10.1186/gb-2008-9-1-r7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Humann, J., T. Lee, S. Ficklin, C. Cheng, H. Hough et al., 2018 Next Generation Genome Annotation and Analysis Workshop., in San Diego, CA, USA. [Google Scholar]
  21. Hunt S. E., McLaren W., Gil L., Thormann A., Schuilenburg H. et al. , 2018.  Ensembl variation resources. Database (Oxford) 2018: bay119 10.1093/database/bay119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Kim D., Langmead B., and Salzberg S. L., 2015.  HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12: 357–360. 10.1038/nmeth.3317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kumar S., and Subramanian S., 2002.  Mutation rates in mammalian genomes. Proc. Natl. Acad. Sci. USA 99: 803–808. 10.1073/pnas.022629899 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Langmead B., and Salzberg S. L., 2012.  Fast gapped-read alignment with Bowtie 2. Nat. Methods 9: 357–359. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Li H., 2018.  Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34: 3094–3100. 10.1093/bioinformatics/bty191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Li X.-Q., and Du D., 2014.  Variation, Evolution, and Correlation Analysis of C+G Content and Genome or Chromosome Size in Different Kingdoms and Phyla. PLoS One 9: e88339 10.1371/journal.pone.0088339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Li H., and Durbin R., 2011.  Inference of human population history from individual whole-genome sequences. Nature 475: 493–496. 10.1038/nature10231 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Li Z., Lin Z., Ba H., Chen L., Yang Y. et al. , 2017.  Draft genome of the reindeer (Rangifer tarandus). Gigascience 6: gix102 10.1093/gigascience/gix102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Li D., Liu C.-M., Luo R., Sadakane K., and Lam T.-W., 2015.  MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31: 1674–1676. 10.1093/bioinformatics/btv033 [DOI] [PubMed] [Google Scholar]
  30. Lowe T. M., and Eddy S. R., 1997.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25: 955–964. 10.1093/nar/25.5.955 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Mainguy J., Côté S. D., and Coltman D. W., 2009.  Multilocus heterozygosity, parental relatedness and individual fitness components in a wild mountain goat, Oreamnos americanus population. Mol. Ecol. 18: 2297–2306. 10.1111/j.1365-294X.2009.04197.x [DOI] [PubMed] [Google Scholar]
  32. Mapleson D., Garcia Accinelli G., Kettleborough G., Wright J., and Clavijo B. J., 2017.  KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies. Bioinformatics 33: 574–576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Martchenko D., Prewer E., Latch E. K., Kyle C. J., and Shafer A. B. A., 2018.  Population genomics of ungulates, Population Genomics, edited by P.A. Hohenlohe Springer, Cham: 10.1007/13836_2018_30 [DOI] [Google Scholar]
  34. Miller J. M., Moore S. S., Stothard P., Liao X., and Coltman D. W., 2015.  Harnessing cross-species alignment to discover SNPs and generate a draft genome sequence of a bighorn sheep (Ovis canadensis). BMC Genomics 16: 397 10.1186/s12864-015-1618-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Muller E. E. L., Glaab E., May P., Vlassis N., and Wilmes P., 2013.  Condensing the omics fog of microbial communities. Trends Microbiol. 21: 325–333. 10.1016/j.tim.2013.04.009 [DOI] [PubMed] [Google Scholar]
  36. Nawrocki E. P., and Eddy S. R., 2013.  Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29: 2933–2935. 10.1093/bioinformatics/btt509 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Orlando L., Ginolhac A., Zhang G., Froese D., Albrechtsen A. et al. , 2013.  Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature 499: 74–78. 10.1038/nature12323 [DOI] [PubMed] [Google Scholar]
  38. Pettorelli N., Pelletier F., von Hardenberg A., Festa-Bianchet M., and Côté S. D., 2007.  Early onset of vegetation growth vs. rapid green-up: impacts on juvenile mountain ungulates. Ecology 88: 381–390. 10.1890/06-0875 [DOI] [PubMed] [Google Scholar]
  39. Putnam N. H., O’Connell B. L., Stites J. C., Rice B. J., Blanchette M. et al. , 2016.  Chromosome-scale shotgun assembly using an in vitro method for long-range linkage. Genome Res. 26: 342–350. 10.1101/gr.193474.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Sambrook, J., and D. W. Russell, 2006 Purification of Nucleic Acids by Extraction with Phenol:Chloroform. Cold Spring Harb. Protoc. 2006: pdb.prot4455. [DOI] [PubMed]
  41. Savolainen O., Lascoux M., and Merilä J., 2013.  Ecological genomics of local adaptation. Nat. Rev. Genet. 14: 807–820. 10.1038/nrg3522 [DOI] [PubMed] [Google Scholar]
  42. Seemann S. E., Anthon C., Palasca O., and Gorodkin J., 2016.  Quality Assessment of Domesticated Animal Genome Assemblies. Bioinform. Biol. Insights 9: 49–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Shafer A. B. A., Côté S. D., and Coltman D. W., 2011a Hot spots of genetic diversity descended from multiple Pleistocene refugia in an alpine ungulate. Evolution 65: 125–138. 10.1111/j.1558-5646.2010.01109.x [DOI] [PubMed] [Google Scholar]
  44. Shafer A. B. A., Cullingham C. I., Côté S. D., and Coltman D. W., 2010.  Of glaciers and refugia: a decade of study sheds new light on the phylogeography of northwestern North America. Mol. Ecol. 19: 4589–4621. 10.1111/j.1365-294X.2010.04828.x [DOI] [PubMed] [Google Scholar]
  45. Shafer A. B. A., and Hall J. C., 2010.  Placing the mountain goat: A total evidence approach to testing alternative hypotheses. Mol. Phylogenet. Evol. 55: 18–25. 10.1016/j.ympev.2010.01.015 [DOI] [PubMed] [Google Scholar]
  46. Shafer A. B. A., White K. S., Côté S. D., and Coltman D. W., 2011b Deciphering translocations from relicts in Baranof Island mountain goats: is an endemic genetic lineage at risk? Conserv. Genet. 12: 1261–1268. 10.1007/s10592-011-0227-8 [DOI] [Google Scholar]
  47. Smit, A., R. Hubley, and P. Green, 2013 RepeatMasker Open-4.0.
  48. Smith, C. A., and L. Nichols, 1984 Mountain goat transplants in Alaska: restocking depleted herds and mitigating mining impacts, pp. 467–480 in Proceedings of Biennial Symposium, Northern Wild Sheep and Goat Council, Whitehorse YK, Canada. [Google Scholar]
  49. Stamatakis A., 2014.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30: 1312–1313. 10.1093/bioinformatics/btu033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Stanke, M., A. Tzvetkova, and B. Morgenstern, 2006 AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome. Genome Biol. 7 Suppl 1: S11.1–8. [DOI] [PMC free article] [PubMed]
  51. Vezzi F., Narzisi G., and Mishra B., 2012.  Reevaluating assembly evaluations with feature response curves: GAGE and assemblathons. PLoS One 7: e52210 10.1371/journal.pone.0052210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Wang Y., Coleman-Derr D., Chen G., and Gu Y. Q., 2015.  OrthoVenn: a web server for genome wide comparison and annotation of orthologous clusters across multiple species. Nucleic Acids Res. 43: W78–W84. 10.1093/nar/gkv487 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Wang Y., Zhang C., Wang N., Li Z., Heller R. et al. , 2019.  Genetic basis of ruminant headgear and rapid antler regeneration. Science 364: eaav6335. 10.1126/science.aav6335 [DOI] [PubMed] [Google Scholar]
  54. Williams J. L., Iamartino D., Pruitt K. D., Sonstegard T., Smith T. P. L. et al. , 2017.  Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50). Gigascience 6: gix088 10.1093/gigascience/gix088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Wood D. E., and Salzberg S. L., 2014.  Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 15: R46 10.1186/gb-2014-15-3-r46 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Yang Z., 2007.  PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol. Biol. Evol. 24: 1586–1591. 10.1093/molbev/msm088 [DOI] [PubMed] [Google Scholar]
  57. Zhang C., Chen L., Zhou Y., Wang K., Chemnick L. G. et al. , 2018.  Draft genome of the milu (Elaphurus davidianus). Gigascience 7: gix130 10.1093/gigascience/gix130 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The raw sequence data have been deposited in the Short Read Archive (SRA) under accession number PRJNA510081. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession WJNR00000000. The version described in this paper is version WJNR01000000. Supplemental material available at figshare: https://doi.org/10.25387/g3.9884003.


Articles from G3: Genes|Genomes|Genetics are provided here courtesy of Oxford University Press

RESOURCES