Abstract
The black-footed ferret (Mustela nigripes) narrowly avoided extinction to become an oft-cited example of the benefits of intensive management, research, and collaboration to save a species through ex situ conservation breeding and reintroduction into its former range. However, the species remains at risk due to possible inbreeding, disease susceptibility, and multiple fertility challenges. Here, we report the de novo genome assembly of a male black-footed ferret generated through a combination of linked-read sequencing, optical mapping, and Hi-C proximity ligation. In addition, we report the karyotype for this species, which was used to anchor and assign chromosome numbers to the chromosome-length scaffolds. The draft assembly was ~2.5 Gb in length, with 95.6% of it anchored to 19 chromosome-length scaffolds, corresponding to the 2n = 38 chromosomes revealed by the karyotype. The assembly has contig and scaffold N50 values of 148.8 kbp and 145.4 Mbp, respectively, and is up to 96% complete based on BUSCO analyses. Annotation of the assembly, including evidence from RNA-seq data, identified 21,406 protein-coding genes and a repeat content of 37.35%. Phylogenomic analyses indicated that the black-footed ferret diverged from the European polecat/domestic ferret lineage 1.6 million yr ago. This assembly will enable research on the conservation genomics of black-footed ferrets and thereby aid in the further restoration of this endangered species.
Keywords: annotation, black-footed ferret, conservation, genome assembly, karyotype, Mustela nigripes
Introduction
Black-footed ferrets once occupied the vast grassland habitats across the North American Great Plains. However, starting around the time the species was first described based on a specimen collected near Fort Laramie, Wyoming, USA (Audubon and Bachman 1851), much of these grasslands were being converted to croplands and pastures for agriculture and livestock grazing, which continued into the 20th century (Klein Goldewijk et al. 2011). The consequent habitat loss coincided with the extermination of the ferrets’ primary prey, prairie dogs (Cynomys), which were further decimated by the introduction and eventual spread of the Yersinia pestis bacterium in 1900, the causative agent of sylvatic plague (Biggins and Godbey 2003; Adjemian et al. 2007). Habitat and prey loss, along with the ferrets’ own morbidity due to sylvatic plague, led to a severe decline in their numbers, such that black-footed ferrets were first listed endangered in 1967. They were then presumed extinct in 1979 when the last ferret in the first small captive breeding colony died (Jachowski 2014).
However, in 1981, a single, small population of ferrets was discovered near Meeteetse, Wyoming. Subsequent studies showed that this population numbered ~129 in 1984 but then declined in 1985 to ~58 individuals, likely as a result of an outbreak of canine distemper as well as sylvatic plague (Forrest et al. 1988). To prevent extinction, 18 of the last remaining ferrets were captured between 1985 and 1987 to commence an ex situ conservation breeding program and Black-footed ferret Recovery Plan coordinated by U.S. Fish & Wildlife Service (USFWS) and multiple Association of Zoos & Aquariums (AZA) facilities (Jachowski 2014). An AZA Species Survival Plan (SSP) was also established to implement a minimization of kinship approach using a studbook pedigree to mitigate the loss of genetic diversity and avoid inbreeding. The SSP was initiated by 7 founders from the remaining 18 ferrets from Meeteetse (Marinari and Lynch 2022). Despite the small founder number, more than 10,500 black-footed ferret kits have been produced since 1986, a large proportion of which were reintroduced at 29 sites across the species’ historic range, only 14 of which remain active (USFWS Black-footed Ferret Recovery Program 2019). Current census numbers indicate a total of ~650 ferrets now living in captivity (310 as of 2022) or in the wild (~340 as of 2019) (USFWS Black-footed Ferret Recovery Program 2019; Marinari and Lynch 2022).
Previous genetic analyses based on allozymes, microsatellites, and mitochondrial sequences revealed low levels of genetic variation in black-footed ferrets following the founder bottleneck (O’Brien 1989; Wisely et al. 2002, 2008). Data derived from genomic approaches can provide a more comprehensive understanding of where genetic diversity has been lost or retained across the genome and help inform the conservation management of threatened species like the black-footed ferret (Russello and Jensen 2021). Such analyses are facilitated by annotated reference genome assemblies, which empower investigation of other features important for species recovery. These include inbreeding levels as assessed by runs of homozygosity, mutational load that might affect individual and population fitness, the basis for disease susceptibility, and adaptation to captivity (Brandies et al. 2019; Totikov et al. 2021; Formenti et al. 2022; Paez et al. 2022). Furthermore, having a black-footed ferret high-quality reference genome assembly becomes an essential resource in supporting ongoing efforts to enhance the species’ recovery and genetic rescue through biotechnology tools such as interspecies somatic cell nuclear transfer and genome editing (Wisely et al. 2015; Adams and Redford 2021).
Here, we report the chromosomal-level assembly and annotation of a reference genome of the black-footed ferret using a combination of linked-read sequencing, optical mapping, and high-throughput chromosome conformation capture (Hi-C). We integrate the assembly with karyotype data to anchor the chromosome-length scaffolds, thereby providing a scaffotype assembly (sensu Lewin et al. 2019) assigned to physical chromosomes. We envision that this annotated draft assembly will provide a rich resource for in-depth studies on the conservation genomics of this iconic species.
Methods
Biological material
Veterinary pathologists in the Center for Animal Care Sciences at Smithsonian’s National Zoological Park-Conservation Biology Institute collected fresh tissues during necropsy from a male black-footed ferret named Capone (studbook #6536), aged 7.6 yr (Fig. 1A). Capone was born in 2009 in a litter that was the product of artificial insemination using cryobanked semen collected from Scarface (studbook #18) (Howard et al. 2016), one of the original 7 founders of the conservation breeding program (Marinari and Lynch 2022). Collected pieces of tissues from multiple organs were temporarily stored on dry ice, and then cryopreserved in liquid nitrogen until DNA or RNA isolation.
Fig. 1.
Genome assembly and karyotype of the black-footed ferret. A) Photograph of Capone (studbook #6536) taken at the Smithsonian’s National Zoo & Conservation Biology Institute. B) Hi-C contact density map of the black-footed ferret genome assembly showing 19 chromosome-length scaffolds. C) To the left: G-banded karyotype of a wild-caught female black-footed ferret (MNI) “Willa” (studbook #10) aligned with the karyotype of a male steppe polecat, Mustela eversmanni (MEV). The X and Y chromosomes of a male black-footed ferret are also shown (black square). To the right: C-banding of the chromosomes with additional heterochromatin blocks. The order of G-banded chromosomes follows the nomenclature for mustelids outlined in Graphodatsky et al. (2002). The G-banding for chromosome pairs 6 and 7 depicted in the figure differ from the corresponding karyotypes of M. nigripes and M. eversmanni in the Atlas of Mammalian Chromosomes (Graphodatsky et al. 2020). Asterisk designates the additional heterochromatin blocks forming an entire chromosome arm. The number sign (#) indicates the presence of one more heterochromatin block of smaller size.
De novo genome sequencing
Tissues were shipped to the Genomic Services Lab at the HudsonAlpha Institute for Biotechnology in Huntsville, Alabama, USA. High molecular weight (HMW) genomic DNA was isolated using the MagAttract HMW DNA Kit (Qiagen, Germantown, Maryland, USA) following the manufacturer’s protocol. The gDNA was quantitated using the Qubit dsDNA HS Assay Kit (Invitrogen, Waltham, Massachusetts, USA) and subsequently profiled on a 0.75% pulsed-field agarose gel, run for 17 h using the 7 to 430 kb protocol on a Pippin Pulse (Sage Science Inc., Beverly, Massachusetts, USA). The samples showed the presence of gDNA fragments >60 kb with minimal to no smearing.
Chromium genomic libraries were prepared and purified with the Chromium Genome Reagent Kit version2 and the microfluidic Genome Chip run in a Chromium Controller instrument according to the manufacturer’s instructions (10× Genomics, Pleasanton, California, USA). Fragment sizes of the libraries were quantitated using a 2100 Bioanalyzer with the High Sensitivity DNA Assay (Agilent Technologies, Santa Clara, California, USA) and by qPCR with the KAPA Library Quantification Kit (Roche, Indianapolis, Indiana, USA) following the manufacturer’s instructions. Libraries were 150 bp paired-end sequenced on an Illumina HiSeq X Ten instrument to a minimum depth of 60×, generating a total of 849,993,416 reads.
Optical mapping
Tissue samples were fixed in formaldehyde, blended with a rotor-stator homogenizer, treated with EtOH, and pelleted at 1,500 × g for 5 min. The resulting pellet was resuspended in Bionano Homogenization Buffer (part no. 20278) and added to prewarmed agarose to make 0.8% agarose plugs. gDNA was extracted from the agarose plugs following the Bionano Prep Animal Tissue DNA Isolation Soft Tissue Protocol, revision C (document no. 30077).
UltraHMW DNA was fluorescently tagged at the recognition sequence Nt.BspQI and Nb.BssSI using the Bionano Genomics NLRS kit (catalog no. 80001), following the Bionano Prep Label—NLRS Protocol, revision K (document no. 30024). Samples were loaded on a Saphyr instrument according to manufacturer specifications (document no. 30247), and run until at least 70× effective coverage (total DNA [molecules >150 kbp] × map rate/reference genome size) and at least 100× raw coverage (total DNA [molecules >150 kbp]/genome size) data were collected, as defined by the manufacturer (document no. 30173, Rev. E).
Hi-C sequencing
In situ Hi-C library preparation and sequencing were performed following the protocol by (Rao et al. 2014). Briefly, gDNA and proteins in intact nuclei were crosslinked with formaldehyde and the gDNA digested using restriction enzymes. Biotinylated nucleotides were then added to the 5ʹ end overhangs and the blunt-end fragments ligated. The gDNA was then sheared by sonication and the fragments with biotinylated DNA ligation junctions captured with streptavidin beads. The library was PCR amplified, and 841.4 million of 150 bp paired-end reads were sequenced on an Illumina HiSeq X Ten instrument.
RNA sequencing
RNA sequencing data were generated at the Genomic Services Lab at the HudsonAlpha Institute of Biotechnology from cerebellum, kidney, liver, lung, spleen, and testes tissue samples obtained from black-footed ferret SB#6536. RNA isolation was performed using 350 µl of lysate and the RNAeasy Plus Mini Kit (Qiagen) with DNase digestion. Extracts were quantitated with the Qubit RNA HS and BR Assay Kits (Invitrogen) and evaluated with a HS RNA Fragment Analyzer chip (Agilent Technologies). PolyA RNA-seq libraries were prepared using the TruSeq RNA Library Preparation Kit (Illumina) following the manufacturer’s protocol. The libraries were quantitated by qPCR using the KAPA Library Quantification Kit (Roche) following the manufacturer’s instructions and then 100 bp paired-end sequenced on an Illumina NovaSeq6000 instrument.
Karyotyping
We generated a karyotype from the cell line of a female black-footed ferret named Willa (studbook #10), which has been stored in liquid nitrogen since 1988 in the FrozenZoo at the Beckman Center for Conservation Research of the San Diego Zoo Wildlife Alliance, USA. Cell culturing, harvesting, and chromosome banding followed the techniques described in Kumamoto et al. (1996). Briefly, metaphase preparations were made from primary fibroblast cell lines grown in minimal essential medium enriched with fetal bovine serum and antibiotics. Cells were treated with colcemid and hypotonic solution (KCl), followed by fixation with methanol and glacial acetic acid. G-banded chromosomes were produced through trypsin treatment. C-banding followed the method of Sumner (1972) with the modification of 2.5% BaOH and a shortened staining time (20 to 30 min) in Giemsa.
Genome size and assembly
Quality control of reads was performed using FastQC v0.11.9 (Andrews 2010). Distributions of 23-mers after read filtration were counted using Jellyfish v2.2.10 (Marçais and Kingsford 2011) and visualized using KrATER v2.5 (https://github.com/mahajrod/krater) to assess genome size by GenomeScope2 (Ranallo-Benavidez et al. 2020, p. 2) and to check for possible abnormalities. Prior genome size estimation barcodes were trimmed from raw linked reads using the EMA pipeline (Shajii et al. 2017).
Genome assembly was performed in 3 stages. First, raw 10× Genomics linked reads were assembled using Supernova v1 (Weisenfeld et al. 2017) to generate the initial draft. Next, resolution of repetitive regions and scaffolding were performed using the 2-enzyme optical mapping data and Bionano Solve package (Lam et al. 2012). Then Hi-C-scaffolding was performed using Juicer and the 3D-DNA pipeline (Durand et al. 2016; Dudchenko et al. 2017). Finally, the assembly was manually corrected using Juicebox Assembly Tools (Dudchenko et al. 2018). We utilized BUSCO v.5.2.2 (Manni et al. 2021) to evaluate the completeness of the black-footed ferret assembly in relation to the assemblies of 3 other mustelid species (domestic ferret, sea otter, giant otter), using the mammalia_odb10.2021-02-19, laurasiatheria_odb10.2021-02-19, and carnivora_odb10.2021-02-19 single-copy orthologs databases.
Repeat annotation
Dispersed repeats were detected in the assembly using Windowmasker (Morgulis et al. 2006) and RepeatMasker (Smit et al. 2015) with the Carnivora repeat library from RepBase (Jurka 2000; Bao et al. 2015). Tandem repeats were annotated using TRF (Benson 1999) with default options. Bedtools (Quinlan and Hall 2010) was used to softmask detected repeats.
Gene annotation
The annotation of protein-coding genes was performed using a combination of homology-based, transcriptome-based, and de novo predictions. RNA-seq reads from the 6 libraries were aligned to the genome assembly using STAR v2.5.2 (Dobin et al. 2013) followed by extraction of splice site coordinates. Proteins of 3 reference species, Mustela putorius furo, H. sapiens, and Canis familiaris, were aligned to a black-footed ferret assembly using Exonerate v 2.2.4 (Slater and Birney 2005) with the protein2genome model. The obtained alignments were classified into primary and secondary hits; the CDS fragments were cut from each side by 3 bp for the top hits and by 9 bp for secondary hits. Coordinates of these truncated fragments and RNA-seq-based splice sites were used as external evidence for the AUGUSTUS software package v3.2.1 (Stanke et al. 2006) and genes were annotated in a soft-masked assembly. Proteins were extracted from the predicted genes and aligned by HMMER v3.1 (Johnson et al. 2010) and BLAST (Altschul et al. 1990) to Pfam v33.1 (Punta et al. 2012) and Swiss-Prot v2017_8 (The UniProt Consortium 2017) databases, respectively. Only genes supported by hits to protein databases and external evidence were retained. Common gene names were assigned using the EggNOG mapper v1.03 and maNOG database (Huerta-Cepas et al. 2017; Table 1).
Table 1.
Tools used in the assembly, annotation, and analysis of the black-footed ferret genome.
Chromosome name assignment
To assign chromosome names to chromosome-scale scaffolds or C-scaffolds (sensu Lewin et al. 2019) of the black-footed ferret (M. nigripes) and domestic ferret (M. putorius furo), we compared whole-genome alignments and cytogenetic data derived from cross-species fluorescent in situ hybridization or ZooFISH. No ZooFISH results have been published for black-footed ferret, however, such data are available for the steppe polecat, M. eversmannii (Graphodatsky et al. 2002) and domestic ferret (Cavagna et al. 2000). Steppe polecat and black-footed ferret are closely related (Koepfli et al. 2008; Law et al. 2018), have similar karyotypes (2n = 38), and the karyotype of the black-footed ferret was arranged according to that of the steppe polecat based on G-banding patterns. We first generated a whole-genome alignment between the ferret genomes and the domestic cat genome, Felis catus (which was used as reference) using LAST v981 (Frith and Kawaguchi 2015). Next, the chain correspondence between chromosomes of 3 species (domestic cat → American mink [Neogale vison] → steppe polecat/black-footed ferret/domestic ferret) was used to identify the correspondence between C-scaffolds and karyotype.
Whole-genome alignment and synteny analysis
To analyze synteny between the genomes of the black-footed ferret and domestic ferret, we generated a whole-genome alignment using Progressive Cactus (Armstrong et al. 2020). Next, synteny blocks were extracted from the whole-genome alignment using halSynteny v2.2 (Krasheninnikova et al. 2020) with the options --minBlockSize 50000 --maxAnchorDistance 50000. Finally, we visualized the results using the draw_synteny.py script from the MACE (https://github.com/mahajrod/mace) package.
Phylogenomic reconstruction
We reconstructed a phylogenetic tree using BUSCO-based gene predictions to unify the quality of the datasets and methodology derived from the assemblies of 15 selected species. These were downloaded from NCBI or www.dnazoo.org: Panthera leo (GCA_008795835.1), F. catus (GCA_000181335.4), Canis lupus familiaris (GCA_000002285.3), M. putorius furo (MusPutFur1.0_HiC), Mustela nigripes (GCA_022355385.1, this study), N. vison (GCA_900108605.1), Lontra canadensis (GSC_riverotter_1.0_HiC), Martes zibellina (GCA_012583365.1), Enhydra lutris (ASM228890v2_HiC), Mustela nivalis (GCA_019141155.1), Ailurus styani (ASM200746v1_HiC), Erignathus barbatus (Erignathus_barbatus_HiC), Neomonachus schauinslandi (GCA_002201575.2), Odobenus rosmarus (Oros_1.0_HiC), and Ursus arctos (ASM358476v1_HiC). Sequences were generated using BUSCO v.5.2.2 (Manni et al. 2021) with the mammalia_odb v.10.2021-02-19 database of 9,226 orthologs. Only single-copy sequences common for all species were included in the analysis. Codon-based sequence alignment was performed separately for each BUSCO using PRANK v.170427 (Löytynoja 2014) followed by filtration of the hypervariable and poorly aligned regions using GBlocks v.0.91b (Castresana 2000; Talavera and Castresana 2007). The processed alignments were then concatenated into a superalignment using a custom script. A maximum likelihood phylogenetic tree was generated using IQ-TREE v.2.2.0 (Minh et al. 2020) with automatic selection of the best-fitting substitution model using ModelFinder (Kalyaanamoorthy et al. 2017). support was evaluated using 1,000 bootstrap replicates. We also quantified genealogical concordance using the site concordance factor based on maximum likelihood (--scfl) for the superalignment (Mo et al. 2022) and the gene concordance factor for the individual gene trees (n = X,XXX) in IQ-TREE v.2.2.0.
For divergence time estimation, we extracted 4-fold degenerate sites from the concatenated sequence alignment and used the MCMCTree tool from the PAML4 package (Yang 2007) with the HKY85+G substitution model and 220,000 MCMC generations, with the first 20,000 generations discarded as burn-in. We used 6 fossil-based priors to calibrate the molecular clock (Supplementary Table S5). The analysis was performed twice according to the recommendation in the MCMCTree manual and no discordance was found between the 2 runs. We verified satisfactory convergence and mixing of the MCMCs for both runs using Tracer v1.7 (Rambaut et al. 2018). All tools and databases used in the study and their versions are listed in Table 1.
Results and discussion
Karyotype
The black-footed ferret karyotype comprises 19 chromosome pairs (2n = 38), including 12 (sub)metacentric, 6 telo/acrocentric autosome pairs and sex chromosomes (Fig. 1C). The diploid chromosome number conforms to the common number found among the Mustelidae and is the same as that found in several close relatives such as the steppe polecat (Mustela eversmanni) and the European mink (M. lutreola), while differing from the domestic ferret (2n = 40) (Perelman et al. 2012; Graphodatsky et al. 2020). Black-footed ferret chromosomes are morphologically similar to those of the steppe polecat except for chr12, which is longer in the former species, and chr18, which is longer in the latter species. These differences appear to be due to the presence of heterochromatin blocks that form additional chromosome arms in chr12 and chr18 of the black-footed ferret and steppe polecat, respectively. Furthermore, C-banding (Supplementary Fig. S1) revealed the presence of additional dense (dark) heterochromatin blocks on chr8, chr15, and chrY in both species (right part of Fig. 1C). Telomere-to-telomere assemblies based on long-read sequencing data of these and related species will help elucidate the evolution of these differences, since such assemblies can resolve the sequence of heterochromatin regions (Nurk et al. 2022).
Genome assembly and repeat content
A total of 849,993,416 reads were generated from sequencing the Chromium genomic libraries, with an effective read coverage of 75.4×. The total sequence length of the chromosome-length assembly is 2,498,532,951 bp, which is encompassed in 53,614 contigs and 20,574 scaffolds. The contig and scaffold N50s are 148,763 and 145,433,501 bp, respectively. The assembly size approximates that of other mustelid species such as the domestic ferret (2.4 Gb; Peng et al. 2014), sea otter (2.43 Gb; Jones et al. 2017; Beichman et al. 2019), and wolverine (2.42 Gb; Ekblom et al. 2018).
The chromosome-length assembly of the black-footed ferret includes 19 C-scaffolds with one-to-one correspondence to chromosomes of the karyotype (Fig. 1B, Supplementary Tables S1 and S2). C-scaffolds range between 39.3 Mbp (chr11) and 270 Mbp (chr1) and encompass 2.39 Gbp or 95.6% of the assembly. Total assembly length (~2.5 Gbp) is slightly lower than the genome size estimated from filtered reads (2.79 Gbp, Supplementary Fig. S2). This difference can be explained by the presence of additional heterochromatin blocks which are well known to be an issue for genome assemblies. The completeness of the assembly is similar to other available chromosome-length assemblies of mustelids (Supplementary Table S3). We detected 94.6% to 96.2% complete BUSCOs using the Laurasiatheria_odb or Mammalia_odb databases.
Interspersed repetitive elements comprised 37.35% of the assembly, which is similar to the 36.83% found in the domestic ferret MusPutFur1.0 assembly (Peng et al. 2014). This class of repeats included 20.36% LINEs, 9.8% SINEs, 4.62% Long Terminal Repeats (LTRs), and 2.53% DNA elements. Unclassified repeats constituted 0.03% and tandem repeats made up 1.12% of the assembly.
RNA sequencing and annotation
RNA sequencing generated between 210.3 and 594.2 million reads from the 6 tissue samples (Supplementary Table S4). Using homology-based, RNA-seq-based, and de novo approaches, we predicted 21,406 models of protein-coding genes and 28,268 corresponding proteins. The number of detected coding genes is slightly higher than the 19,910 reported for the initial assembly of the domestic ferret based on similar lines of evidence (Peng et al. 2014). Median support (% of gene elements supported) by external evidence was 50% for exons (CDS), 94.74% for introns and 67.7% for introns + exons (Supplementary Fig. S3). The dramatically higher value for introns implies the crucial role of RNA-seq data for annotation, as in our pipeline homology-based external evidence support CDS predictions only while RNA-seq support intron predictions only, respectively. We detected 91% to 93.2% complete BUSCOs in our annotation (Supplementary Table S3) using the same databases used to assess assembly completeness and successfully assigned common gene names for 18,004 (84.1%) of the gene models (Supplementary File S2). The full annotation of gene models is available in Supplementary File S3.
Synteny
Whole-genome alignment of the black-footed ferret and domestic ferret assemblies revealed a Robertsonian translocation involving chromosomes 17 and 14 in the latter species (Fig. 2A and B), which was previously detected from karyotypic data comparing the domestic ferret and steppe polecat (Graphodatsky et al. 2002) and explains the difference in diploid chromosome number in the 2 species (2n = 38 versus 40). We also detected 3 megabase-scale candidate inversions at the ends of chr5, chr6, and chr17 in the black-footed ferret and dozens of smaller inversions (Fig. 2B, Supplementary File S4). We compared G-banding of corresponding chromosomes, but found no inverted patterns. However, even the largest candidates are relatively small from a cytogenetic perspective and may be located within a single band, making them undetectable.
Fig. 2.
Synteny between the black-footed ferret and domestic ferret. A) Synteny map between the domestic ferret (M. putorius furo) and black-footed ferret (M. nigripes) viewed at the macro level. Note that domestic ferret chr1 to chr5, chr8, chr12, chr13, chr15 to chr19, and chrX (labeled with ‘) were reverse complemented to fit orientation of homologous M. putorius furo C-scaffolds. Blue, red, light blue, and magenta colors highlight large-scale chromosome rearrangements between domestic ferret and black-footed ferret. B) Synteny blocks in the genome assemblies of M. putorius furo and M. nigripes mapped onto the C-scaffolds of M. nigripes. Each C-scaffold is split into the top and bottom segment by the line in the middle of the corresponding polygon on the plot. Blocks with the same orientation in both assemblies are in the top segment, with different—below, respectively. Colors of the blocks correspond to the C-scaffold of M. putorius furo which they belong to (see legend to the right). Note the large inversions at the ends of M.nigripes chr5, chr6, chr17, and multiple small inversion in all C-scaffolds.
Phylogenomic analysis
The maximum likelihood phylogenetic tree positioned the black-footed ferret as sister to the domestic ferret along with other Mustelinae species (Mustela and Neogale) and within the clade containing the sampled mustelid species (Fig. 3). All nodes received 100% bootstrap support out of 1,000 replicates. Gene concordance factors were higher than site concordance factors across all nodes except the one uniting the black-footed ferret and domestic ferret, which were both 86%. Our dating analysis based on 6 fossil calibrations (Supplementary File S5) revealed that black-footed ferret and European polecat/domestic ferret lineages diverged 1.6 Mya (95% confidence interval [CI] 2.4 to 0.9 Mya), followed by a 4.5 Mya (95% CI 6.4 to 2.7 Mya) divergence between the least weasel (M. nivalis) and the ferret clade, and a 10.0 Mya (95% CI 11.7 to 7.4 Mya) divergence between the American mink (N. vison) and the genus Mustela. These divergence times are slightly older compared with previous analyses based on smaller multilocus datasets (Koepfli et al. 2008; Law et al. 2018) but slightly younger relative to analyses based on mitochondrial genome-only data (Hassanin et al. 2021).
Fig. 3.
Timetree among 15 species of Carnivora. Phylogenetic tree showing divergence times among 15 species of Carnivora including M. nigripes and M. putorius furo. The tree was estimated using MCMCTree with the maximum likelihood topology generated with IQ-TREE v.2.2.0 used as input. All nodes received 100% support based on 1,000 bootstrap replicates. Blue bars indicate 95% CIs for age estimates. The timescale at the bottom corresponds to millions of years ago. Numbers above and below branches correspond to gene concordance factors (blue) and site concordance factors (green), respectively.
Conclusions
We report the first chromosome-length genome assembly for the black-footed ferret. The addition of the karyotype allowed us to anchor chromosomal scaffolds to their respective chromosomes, representing about 96% of the total assembly, thus providing a precise resource enabling in-depth conservation genomic and molecular ecology studies that may aid the ongoing restoration of this well-studied species (Wisely et al. 2015; Totikov et al. 2021). For example, our assembly can be used as a reference for the alignment of whole-genome resequencing data from multiple individuals and the detection of deleterious mutations that may underlie decreased seminal quality due to inbreeding (Santymire et al. 2019) or help understand their high susceptibility to plague relative to their Eurasian congeners (Biggins and Eads 2017). Our assembly might also help elucidate the speciation mechanisms among ferret lineages. We detected 3 candidate megabase-scale inversions between black-footed ferret and domestic ferret (Fig. 2B). Such inversions are known to play a role in at least partial reproductive isolation (Kemppainen et al. 2015; Wellenreuther and Bernatchez 2018) and, therefore, speciation (Cursino et al. 2014). Finally, the contiguity and completeness of our assembly can be improved through the addition of long-read sequencing data. Such data could be used to verify genome rearrangements among species in the Mustelinae lineage, including the candidate inversions mentioned above, as well as provide a foundation for analyses of structural variants and the pangenome of this endangered species (Whibley et al. 2021).
Supplementary material
Supplementary material is available at Journal of Heredity online.
Supplementary File S1. Supplementary tables and figures.
Supplementary File S2. Common gene names of the annotated protein-coding genes of the black-footed ferret genome.
Supplementary File S3. Full EggNOG annotations of the protein-coding genes found in the black-footed ferret genome.
Supplementary File S4. Synteny of chromosome locations (start and end sites) between the black-footed ferret and domestic ferret.
Supplementary File S5. Fossil-based calibrations priors used to date the nodes in the phylogenetic tree.
Acknowledgments
We thank Braden Boone, Erin Fledderman, Deanna Fulford, Salina Hall, Angela Jones, John Mote, Nripesh Prasad, Melanie Robinson, Christopher Savell, Geetika Trivedi, and Shawn Levy at HudsonAlpha Discovery, Huntsville, Alabama, for their excellent services in preparing and sequencing the DNA (10× Genomics Chromium) and RNA libraries. The unpublished genome assembly of domestic ferret, North American river otter, sea otter, red panda, bearded seal, walrus, and polar bear was used with permission from the DNA Zoo Consortium (https://www.dnazoo.org/).
Contributor Information
Sergei Kliver, Center for Evolutionary Hologenomics, The Globe Institute, The University of Copenhagen, Copenhagen, Denmark.
Marlys L Houck, Beckman Center for Conservation Research, San Diego Zoo Wildlife Alliance, Escondido, CA, United States.
Polina L Perelman, Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia.
Azamat Totikov, Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia; Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia.
Andrey Tomarovsky, Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia; Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia.
Olga Dudchenko, The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States; Center for Theoretical Biological Physics and Department of Computer Science, Rice University, Houston, TX, United States.
Arina D Omer, The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States.
Zane Colaric, The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States.
David Weisz, The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States.
Erez Lieberman Aiden, The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States; Center for Theoretical Biological Physics and Department of Computer Science, Rice University, Houston, TX, United States; Broad Institute of MIT and Harvard, Cambridge, MA, United States.
Saki Chan, Department of Research and Development, Bionano Genomics, San Diego, CA, United States.
Alex Hastie, Department of Research and Development, Bionano Genomics, San Diego, CA, United States.
Aleksey Komissarov, Applied Genomics Laboratory, SCAMT Institute, ITMO University, Saint Petersburg, Russia.
Oliver A Ryder, Beckman Center for Conservation Research, San Diego Zoo Wildlife Alliance, Escondido, CA, United States.
Alexander Graphodatsky, Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia.
Warren E Johnson, Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States; The Walter Reed Biosystematics Unit, Museum Support Center MRC-534, Smithsonian Institution, Suitland, MD, United States; Walter Reed Army Institute of Research, Silver Spring, MD, United States; Loyola University Maryland, Baltimore, MD, United States.
Jesús E Maldonado, Center for Conservation Genomics, Smithsonian’s National Zoo and Conservation Biology Institute, Washington, DC, United States.
Budhan S Pukazhenthi, Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States.
Paul E Marinari, Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States.
David E Wildt, Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States.
Klaus-Peter Koepfli, Center for Species Survival, Smithsonian’s National Zoo and Conservation Biology Institute, Front Royal, VA, United States; Smithsonian-Mason School of Conservation, George Mason University, Front Royal, VA, United States.
Funding
The sequencing and assembly of the black-footed ferret genome were funded by a grant from the Smithsonian Institution Competitive Pell Grants Program in Science awarded to WEJ, K-PK, JEM, PEM, BSP, and DEW. This study was also partly funded by the RSF grant no. 19-14-00034-P. SK was funded from Carlsbergfondet Research Infrastructure Grant CF22-0680. ELA was supported by the Welch Foundation (Q-1866), a McNair Medical Institute Scholar Award, an NIH Encyclopedia of DNA Elements Mapping Center Award (UM1HG009375), a US-Israel Binational Science Foundation Award (2019276), the Behavioral Plasticity Research Institute (NSF DBI-2021795), NSF Physics Frontiers Center Award (NSF PHY-2019745 and PHY-2210291), and an NIH CEGS (RM1HG011016-01A1).
Data availability
The genome assembly, linked reads, and RNA-seq data are available on NCBI from Bioproject PRJNA634921. Corresponding SRA accessions for the DNA and RNA data are listed in Supplementary Table S1. Hi-C reads are available from Bioproject PRJNA512907 under accession SRR8616954.
References
- Adams WM, Redford KH. Fix that genome? Oryx. 2021;55(4):481–482. [Google Scholar]
- Adjemian JZ, Foley P, Gage KL, Foley JE. Initiation and spread of traveling waves of plague, Yersinia pestis, in the western United States. Am J Trop Med Hyg. 2007;76(2):365–375. [PubMed] [Google Scholar]
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–410. [DOI] [PubMed] [Google Scholar]
- Andrews S. FASTQC. A quality control tool for high throughput sequence data. Cambridge, United Kingdom: Babraham Bioinformatics, Babraham Institute; 2010. [Google Scholar]
- Armstrong J, Hickey G, Diekhans M, Fiddes IT, Novak AM, Deran A, Fang Q, Xie D, Feng S, Stiller J, et al. Progressive Cactus is a multiple-genome aligner for the thousand-genome era. Nature. 2020;587(7833):246–251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Audubon JJ, Bachman J. The viviparous quadrupeds of North America. Vol. 2. New York, USA: Wiley and Putnam; 1851. [Google Scholar]
- Bao W, Kojima KK, Kohany O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mobile DNA. 2015;6(1):11–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beichman AC, Koepfli K-P, Li G, Murphy W, Dobrynin P, Kliver S, Tinker MT, Murray, MJ, Johnson J, Lindblad-Toh K, et al. Aquatic adaptation and depleted diversity: a deep dive into the genomes of the sea otter and giant otter. Mol Biol Evol. 2019:36:msz101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27(2):573–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biggins DE, Eads DA. Evolution, natural history, and conservation of black-footed ferrets. In: Macdonald DW, Newman C, Harrington LA, editors. Biology and conservation of musteloids. Oxford, England: Oxford University Press; 2017. [Google Scholar]
- Biggins DE, Godbey JL. Challenges to reestablishment of free-ranging populations of black-footed ferrets. C R Biol. 2003;326:104–111. [DOI] [PubMed] [Google Scholar]
- Brandies P, Peel E, Hogg CJ, Belov K. The value of reference genomes in the conservation of threatened species. Genes. 2019;10(11):846–846. doi: 10.3390/genes10110846 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000;17(4):540–552. [DOI] [PubMed] [Google Scholar]
- Cavagna P, Menotti A, Stanyon R. Genomic homology of the domestic ferret with cats and humans. Mamm Genome. 2000;11(10):866–870. [DOI] [PubMed] [Google Scholar]
- Cursino MS, Salviano MB, Abril VV, Zanetti E dos S, Duarte JMB. The role of chromosome variation in the speciation of the red brocket deer complex: the study of reproductive isolation in females. BMC Evol Biol. 2014;14(1):40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dudchenko O, Batra SS, Omer AD, Nyquist SK, Hoeger M, Durand NC, Shamim MS, Machol I, Lander ES, Aiden AP, et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017;356(6333):92–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dudchenko O, Shamim MS, Batra SS, Durand NC, Musial NT, Mostofa R, Pham M, Glenn St Hilaire, B, Yao, W, et al. The Juicebox Assembly Tools module facilitates de novo assembly of mammalian genomes with chromosome-length scaffolds for under $1000. Genomics. 10.1101/254797, 2018, preprint: not peer reviewed. [DOI] [Google Scholar]
- Durand NC, Shamim MS, Machol I, Rao SSP, Huntley MH, Lander ES, Aiden EL. Juicer provides a One-Click System for analyzing loop-resolution Hi-C experiments. Cell Syst. 2016;3(1):95–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ekblom R, Brechlin B, Persson J, Smeds L, Johansson M, Magnusson J, Flagstad Ø, Ellegren H. Genome sequencing and conservation genomics in the Scandinavian wolverine population. Conserv Biol. 2018;32(6):1301–1312. [DOI] [PubMed] [Google Scholar]
- Formenti G, Theissinger K, Fernandes C, Bista I, Bombarely A, Bleidorn C, Ciofi C, Crottini A, Godoy, JA, Höglund J, et al. The era of reference genomes in conservation genomics. Trends Ecol Evol. 2022;37(3):197–202. [DOI] [PubMed] [Google Scholar]
- Forrest SC, Biggins DE, Richardson L, Clark TW, Campbell TM III, Fagerstone KA, Thorne ET. Population attributes for the black-footed ferret (Mustela nigripes) at Meeteetse, Wyoming, 1981–1985. J Mammal. 1988;69(2):261–273. [Google Scholar]
- Frith MC, Kawaguchi R. Split-alignment of genomes finds orthologies more accurately. Genome Biol. 2015;16(1):106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Graphodatsky A, Perelman P, O’Brien SJ. Atlas of mammalian chromosomes. John Wiley & Sons, Incorporated; 2020. [Google Scholar]
- Graphodatsky AS, Yang F, Perelman PL, O’Brien PCM, Serdukova NA, Milne BS, Biltueva LS, Fu B, Vorobieva NV, Kawada S-I, et al. Comparative molecular cytogenetic studies in the order Carnivora: mapping chromosomal rearrangements onto the phylogenetic tree. Cytogenet Genome Res. Hoboken, New Jersey, U.S.; 2002;96(1–4):137–145. [DOI] [PubMed] [Google Scholar]
- Hassanin A, Veron G, Ropiquet A, Jansen van Vuuren B, Lécu A, Goodman SM, Haider J, Nguyen TT. Evolutionary history of Carnivora (Mammalia, Laurasiatheria) inferred from mitochondrial genomes. PLoS ONE. 2021;16(2):e0240770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Howard JG, Lynch C, Santymire RM, Marinari PE, Wildt DE. Recovery of gene diversity using long-term cryopreserved spermatozoa and artificial insemination in the endangered black-footed ferret: black-footed ferret gene restoration. Anim Conserv. 2016;19(2):102–111. [Google Scholar]
- Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, von Mering C, Bork P. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol Biol Evol. 2017;34(8):2115–2122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jachowski DS. Wild again: the struggle to save the black-footed ferret. Oakland, CA, USA; 2014. [Google Scholar]
- Johnson LS, Eddy SR, Portugaly E. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics. 2010;11(1):431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones SJ, Haulena M, Taylor GA, Chan S, Bilobram S, Warren RL, Hammond SA, Mungall KL, Choo C, Kirk H, et al. The genome of the northern sea otter (Enhydra lutris kenyoni). Genes. 2017;8(12):379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jurka J. Repbase Update: a database and an electronic journal of repetitive elements. Trends Genet. 2000;16(9):418–420. [DOI] [PubMed] [Google Scholar]
- Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14(6):587–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kemppainen P, Knight CG, Sarma DK, Hlaing T, Prakash A, Maung Maung YN, Somboon P, Mahanta J, Walton C. Linkage disequilibrium network analysis (LDna) gives a global view of chromosomal inversions, local adaptation and geographic structure. Mol Ecol Resour. 2015;15(5):1031–1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klein Goldewijk K, Beusen A, van Drecht G, de Vos M. The HYDE 3.1 spatially explicit database of human-induced global land-use change over the past 12,000 years. Glob Ecol Biogeogr. 2011;20(1):73–86. [Google Scholar]
- Koepfli K-P, Deere KA, Slater GJ, Begg C, Begg K, Grassman L, Lucherini M, Veron G, Wayne RK. Multigene phylogeny of the Mustelidae: resolving relationships, tempo and biogeographic history of a mammalian adaptive radiation. BMC Biol. 2008;6(1):10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krasheninnikova K, Diekhans M, Armstrong J, Dievskii A, Paten B, O’Brien S. halSynteny: a fast, easy-to-use conserved synteny block construction method for multiple whole-genome alignments. GigaScience. 2020;9(6):giaa047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumamoto AT, Charter SJ, Houck ML, Frahm M. Chromosomes of Damaliscus (Artiodactyla, Bovidae): simple and complex centric fusion rearrangements. Chromosome Res. 1996;4(8):614–621. [DOI] [PubMed] [Google Scholar]
- Lam ET, Hastie A, Lin C, Ehrlich D, Das SK, Austin MD, Deshpande P, Cao, H, Nagarajan N, Xiao M, et al. Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly. Nat Biotechnol. 2012;30(8):Article 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Law CJ, Slater GJ, Mehta RS. Lineage diversity and size disparity in Musteloidea: testing patterns of adaptive radiation using molecular and fossil-based methods. Syst Biol. 2018;67(1):127–144. [DOI] [PubMed] [Google Scholar]
- Lewin HA, Graves JAM, Ryder OA, Graphodatsky AS, O’Brien SJ. Precision nomenclature for the new genomics. GigaScience. 2019;8(8):giz086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Löytynoja A. Phylogeny-aware alignment with PRANK. In: Russell DJ, editor. Multiple sequence alignment methods. Humana Press; 2014. p. 155–170. [DOI] [PubMed] [Google Scholar]
- Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol. 2021;38(10):4647–4654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marçais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27(6):764–770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marinari P, Lynch C. Population analysis and breeding and transfer plan for black-footed ferret (Mustela nigripes). Chicago (IL): AZA Population Management Center; 2022. [Google Scholar]
- Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37(5):1530–1534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mo YK, Lanfear R, Hahn MW, Minh BQ. Updated site concordance factors minimize effects of homoplasy and taxon sampling. Bioinformatics. 2022;39(1):btac741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morgulis A, Gertz EM, Schäffer AA, Agarwala R. WindowMasker: window-based masker for sequenced genomes. Bioinformatics. 2006;22(2):134–141. [DOI] [PubMed] [Google Scholar]
- Nurk S, Koren S, Rhie A, Rautiainen M, Bzikadze AV, Mikheenko A, Vollger MR, Altemose N, Uralsky L, Gershman A, et al. The complete sequence of a human genome. Science. 2022;376(6588):44–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Brien SJ. Genetic variation and molecular systematics of the black-footed ferret. In: Thorne ET, Bogan MA, Anderson SH, editors. Conservation biology and the black-footed ferret. Yale University Press; 1989. p. 21–33. 10.2307/j.ctt2250xjr [DOI] [Google Scholar]
- Paez S, Kraus RHS, Shapiro B, Gilbert MTP, Jarvis ED, Al-Ajli FO, Ceballos G, Crawford AJ, Fedrigo O, Johnson RN, et al. ; Vertebrate Genomes Project Conservation Group PROJECT CONSERVATION GROUP. Reference genomes for conservation. Science. 2022;377(6604):364–366. [DOI] [PubMed] [Google Scholar]
- Peng X, Alföldi J, Gori K, Eisfeld AJ, Tyler SR, Tisoncik-Go J, Brawand D, Law GL, Skunca N, Hatta M, et al. The draft genome sequence of the ferret (Mustela putorius furo) facilitates study of human respiratory disease. Nat Biotechnol. 2014;32(12):1250–1255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perelman PL, Beklemisheva VR, Yudkin DV, Petrina TN, Rozhnov VV, Nie W, Graphodatsky AS. Comparative chromosome painting in Carnivora and Pholidota. Cytogenet Genome Res. 2012;137(2–4):174–193. [DOI] [PubMed] [Google Scholar]
- Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric, G, Clements J, et al. The Pfam protein families database. Nucleic Acids Res. 2012;40(D1):D290–D301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26(6):841–842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst Biol. 2018;67(5):901–904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ranallo-Benavidez TR, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun. 2020;11(1). doi: 10.1038/s41467-020-14998-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, Sanborn AL, Machol I, Omer AD, Lander ES, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159(7):1665–1680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Russello MA, Jensen EL. Ex situ wildlife conservation in the age of population genomics. In: Hohenlohe PA, Rajora OP, editors. Population genomics: wildlife. New York, USA: Springer International Publishing; 2021. p. 473–492. [Google Scholar]
- Santymire RM, Lonsdor EV, Lynch CM, Wildt DE, Marinari PE, Kreeger JS, Howard JG. Inbreeding causes decreased seminal quality affecting pregnancy and litter size in the endangered black-footed ferret. Anim Conserv. 2019;22(4):331–340. [Google Scholar]
- Shajii A, Numanagić I, Berger B. Latent variable model for aligning barcoded short-reads improves downstream analyses. In: Raphael BJ, editor. Research in computational molecular biology: 22nd Annual International Conference, RECOMB 2018, Paris, France, April 21-24, 2018, Proceedings. Springer International Publishing; Vol. 10812. 2017. p. 280–282. 10.1007/978-3-319-89929-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Slater GSC, Birney E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005;6(1):31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smit A, Hubley R, Green P. RepeatMasker Open-4.0. 2013–2015. Seattle, USA; 2015. [Google Scholar]
- Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 2006;34(suppl 2):W435–W439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sumner AT. A simple technique for demonstrating centromeric heterochromatin. Exp Cell Res. 1972;75(1):304–306. [DOI] [PubMed] [Google Scholar]
- Talavera G, Castresana J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007;56(4):564–577. [DOI] [PubMed] [Google Scholar]
- The UniProt Consortium. UniProt: the universal protein knowledgebase. Nucleic Acids Res. 2017;45(D1):D158–D169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Totikov A, Tomarovsky A, Prokopov D, Yakupova A, Bulyonkova T, Derezanin L, Rasskazov D, Wolfsberger WW, Koepfli K-P, Oleksyk TK, et al. Chromosome-level genome assemblies expand capabilities of genomics for conservation biology. Genes. 2021;12(9). doi: 10.3390/genes12091336 [DOI] [PMC free article] [PubMed] [Google Scholar]
- USFWS Black-footed Ferret Recovery Program. Species status assessment report for the black-footed ferret (Mustela nigripes), version 1.0. Bailey's Crossroads, Virginia, United States: U.S. Fish and Wildlife Service; 2019. [Google Scholar]
- Weisenfeld NI, Kumar V, Shah P, Church DM, Jaffe DB. Direct determination of diploid genome sequences. Genome Res. 2017;27(5):757–767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wellenreuther M, Bernatchez L. Eco-evolutionary genomics of chromosomal inversions. Trends Ecol Evol. 2018;33(6):427–440. [DOI] [PubMed] [Google Scholar]
- Whibley A, Kelley JL, Narum SR. The changing face of genome assemblies: guidance on achieving high-quality reference genomes. Mol Ecol Resour. 2021;21(3):641–652. [DOI] [PubMed] [Google Scholar]
- Wisely SM, Buskirk SW, Fleming MA, McDonald DB, Ostrander EA. Genetic diversity and fitness in black-footed ferrets before and during a bottleneck. J Hered. 2002;93(4):231–237. [DOI] [PubMed] [Google Scholar]
- Wisely SM, Ryder OA, Santymire RM, Engelhardt JF, Novak BJ. A road map for 21st century genetic restoration: gene pool enrichment of the black-footed ferret. J Hered. 2015;106(5):581–592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wisely SM, Statham MJ, Fleischer RC. Pleistocene refugia and Holocene expansion of a grassland-dependent species, the black-footed ferret (Mustela nigripes). J Mammal. 2008;89(1):87–96. [Google Scholar]
- Yang Z. PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol Biol Evol. 2007;24(8):1586–1591. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The genome assembly, linked reads, and RNA-seq data are available on NCBI from Bioproject PRJNA634921. Corresponding SRA accessions for the DNA and RNA data are listed in Supplementary Table S1. Hi-C reads are available from Bioproject PRJNA512907 under accession SRR8616954.



