Abstract
The polar bear (Ursus maritimus) occupies a relatively narrow ecological niche, with many traits adapted for cold temperatures, movement across snow, ice and open water, and for consuming highly lipid-dense prey species. The divergence of polar bears from brown bears (Ursus arctos) and their adaptation to their Arctic lifestyle is a well-known example of rapid evolution. Previous research investigating whole genomes uncovered twelve key genes that are highly differentiated between polar and brown bears, show signatures of selection in the polar bear lineage, and are associated with polar bear adaptation to the Arctic environment. Further research suggested fixed derived alleles in these genes arose from selection on both standing variation and de novo mutations in the evolution of polar bears. Here, we reevaluate these findings based on a larger and geographically more representative dataset of 119 polar bears and 135 brown bears, and assess the timing of derived allele fixation in polar bears by incorporating the genomes of two Late Pleistocene individuals (aged 130–100,000 years old and 100–70,000 years old). In contrast with previous results, we found no evidence of derived alleles fixed in present-day polar bears within the key genes arising from de novo mutation. Most derived alleles fixed in present-day polar bears were also fixed in the Late Pleistocene polar bears, suggesting selection occurred prior to 70,000 years ago. However, some derived alleles fixed in present-day polar bears were not fixed in the two Late Pleistocene polar bears, including at sites within APOB, LYST, and TTN. These three genes are associated with cardiovascular function, metabolism, and pigmentation, suggesting selection may have acted on different loci at different times.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12864-024-10617-3.
Keywords: Adaptation, Ancient DNA, Arctic, Genomics, Polar bear
Introduction
The polar bear (Ursus maritimus) is uniquely adapted to the extreme conditions of life in the High Arctic and spends most of its life out on sea ice. In cold Arctic climates, energy is in high demand. As a result, the polar bear feeds on a lipid-rich diet throughout its life [1]. Polar bears are most closely related to the brown bear (Ursus arctos), a widely distributed omnivore found in a variety of habitats across the Holarctic [2]. The two species differ fundamentally in their ecology, behaviour, and morphology, reflecting adaptations to different ecological niches. Polar bears diverged from brown bears relatively recently – within the past ~ 1,000,000 years [3–5].
A previous study reported twelve key genes showing a signal of strong positive selection in the polar bear lineage [3]. These genes may have played significant roles in the ability of polar bears to rapidly adapt to their new Arctic environment. They included APOB, LYST, and TTN, which are related to cardiovascular functions (APOB, TTN), metabolism (APOB, LYST), and pigmentation (APOB, LYST).
Further research utilising the 109 polar and 33 brown bear genomes available at the time (Fig. 1) investigated whether derived alleles fixed in eleven of those same genes identified as under selection in the polar bear lineage were due to selection on standing variation, or on de novo mutations. Evidence was found of both, suggesting variation present in the ancestral polar/brown bear gene pool and de novo mutations played a role in the evolution of polar bears [6].
Here, we build on this work by incorporating additional, recently published polar bear (n = 10) and brown bear (n = 102) genomes from previously unstudied populations [7–9] (Fig. 1). The expanded geographic coverage of our data allowed us to more generally characterise whether the fixed derived alleles in the previously identified key genes [3] originate from standing variation or de novo mutation. By analysing a larger dataset [7–9], we minimise the possibility of missing data and/or population structure having influenced previous inferences. Furthermore, we incorporate genomic data from two Late Pleistocene polar bears aged 130–100,000 years old (‘Poolepynten’, Svalbard) and 100–70,000 years old (‘Bruno’, Alaska) [2, 10], to investigate the timing of fixation. Establishing a reliable time frame for when derived alleles in the previously identified key genes [3] become fixed can improve our understanding of what evolutionary processes drove speciation, and the rates in which novel adaptations to extreme environments can arise.
Results
Ancient DNA damage in the Late Pleistocene polar bears
Investigations into whether the ancient DNA (aDNA) found in Poolepynten and Bruno were authentic showed typical DNA degradation patterns. We observed high levels of C-T substitution towards the ends of the reads and G-A on the reverse complement in both Late Pleistocene samples (Supplementary Fig. 1). Bruno displayed less damage, with ~ 5% of the sites at the read ends experiencing aDNA damage. Poolepynten had more damage, with ~ 20% of the sites at the read ends showing aDNA damage patterns.
Genomic differentiation
As gene flow is known to occur between polar and brown bears [4, 11], we investigated whether the genes and their surrounding regions were still highly differentiated between polar and brown bears, despite the increase in sample size. To do this we performed independent principal component analysis for each gene, including their 50 kb flanking regions. In all eleven principal component analyses we observed clear differentiation between polar and brown bear individuals, suggesting no interspecific admixture at these loci (Supplementary Figs S2-S12). We note that a lack of differentiation between polar and brown bears in the loci could not only be caused by gene flow, but also other factors such as incomplete lineage sorting. However, the main purpose of this analysis was to see if these regions were still differentiated; if not, they could not be considered responsible for the phenotypic differences observed between polar and brown bears.
Fixed derived alleles in polar bears
To understand whether selection may have occurred on standing variation or de novo mutations in the polar bear lineage, we investigated the presence of fixed derived alleles in the polar bear lineage. We consider a derived allele as de novo in the polar bear lineage if it is not found in either the brown bear, or the outgroups. When comparing genotype calls between all present-day polar and brown bears, we found no sites fixed for the polar bear reference genome allele in polar bear, and fixed for the alternative allele in the brown bear, giant panda (Ailuropoda melanoleuca), and spectacled bear (Tremarctos ornatus) (Fig. 2, Supplementary Table S1). Thus, we found no evidence for any of the fixed derived alleles in the eleven focus genes to have arisen by de novo mutations in the polar bear lineage. Four genes (CUL7, FCGBP, LAMC3, XIRP1) contained no sites fixed for the derived allele in the present-day polar bears. A lack of fixed derived alleles suggests no specific allele within said gene was a necessity in the evolution of polar bears. We therefore did not consider these genes in future interpretations. In the remaining seven genes, we found 48 sites that were fixed for the derived allele in all present-day polar bears.
Timing of allele fixation in polar bears
To determine the timing of allelic fixation, we investigated whether the two Late Pleistocene polar bears were also fixed for the derived allele, as seen in the present-day polar bears. Seven genes remained after filtering out the gene that was previously shown to not be highly differentiated between species (EDH3) [6] and the four genes that did not contain any fixed derived alleles (CUL7, FCGBP, LAMC3, and XIRP1). These seven genes had a total of 34 sites where the Late Pleistocene individuals were also fixed for the derived allele (Fig. 2, Table 1, Supplementary Table S1). Only APOB, LYST and TTN contained sites (a total of 14) where at least one of the Late Pleistocene individuals also contained the ancestral allele (Table 1, Supplementary Table S1).
Table 1.
Gene | Associated Phenotype | Coding Length (bp) | Number of alleles fixed before the age of Late Pleistocene individuals | Number of alleles fixed after the age of Late Pleistocene individuals |
---|---|---|---|---|
ABCC6 | Cardiovascular | 4551 | 1 | 0 |
AIM1 | Pigmentation | 5484 | 5 | 0 |
APOB | Cardiovascular, metabolism, pigmentation | 13,305 | 3 | 7 |
COL5A3 | Adipose tissue, metabolism | 5256 | 2 | 0 |
LYST | Metabolism, pigmentation | 11,403 | 5 | 3 |
POLR1A | Cardiovascular | 5172 | 1 | 0 |
TTN | Cardiovascular | 102,861 | 17 | 4 |
Assessing heterozygous base call reliability
As ancient DNA damage can lead to an increase in C-T and A-G transitions, we considered whether these could bias our results by increasing the number of heterozygous sites in the Late Pleistocene polar bears. Therefore, we set the threshold of minor allele frequency of 30% to determine whether a heterozygous base call in the Late Pleistocene individuals was a false positive. We found 14 heterozygous sites in the Late Pleistocene individuals (Supplementary Table S2), two of which may be false positives in the TTN gene (minor allele frequencies of 25% and 29%). However, through a manual visualisation of the mapped reads we found the minor alleles always occurred within the read as opposed to the end. As aDNA damage occurs mostly towards the ends of the reads, and they were close to the 30% cutoff, we designated these as true heterozygous sites.
Discussion
By including the genomes of two Late Pleistocene polar bears in an analysis of 119 present-day polar bears and 135 present-day brown bears (Fig. 1), we infer when derived alleles in genes previously proposed to have been key in polar bear evolution [3], become fixed in the polar bear lineage.
We identified 34 sites fixed for the derived allele in all polar bears – present-day and Late Pleistocene – suggesting fixation of these derived alleles occurred prior to the ages of the Late Pleistocene polar bears (> 130,000 years ago). This is congruent with morphometric measurements of Poolepynten, which is a well-preserved mandible. In comparison with present-day and other fossil polar bears, as well as brown bears, Poolepynten falls within the range of present-day polar bears [13]. Stable isotopes also revealed it to be subsisting on a marine diet. Therefore, we can assume this individual already possessed key polar bear traits and was adapted to the Arctic environment.
However, we also identified 14 sites with derived alleles that were fixed in all present-day polar bears but not in both Late Pleistocene bears, suggesting their fixation occurred after the age of the Late Pleistocene polar bears (< 70,000 years ago). These derived alleles were found in only three investigated genes: APOB, LYST, and TTN. Although it is difficult to identify the determinant allele for a phenotype, this result suggests these alleles may not have been important initially for polar bear adaptation to an Arctic existence. However, as these three genes have broad overlapping associations with the cardiovascular system, metabolism, and pigmentation (Table 1), as the other genes investigated (ABCC6, AIM1, COL5A3, POLR1A), we suggest they may have played a later but also vital role in refining polar bears’ Arctic adaptation.
The gene showing the highest number of derived alleles (7/10), which our analyses suggest became fixed more recently, was APOB (Fig. 2). APOB encodes apolipoprotein B (apoB), which is associated with the cardiovascular system [14]. It has been suggested that selection on the APOB gene may have played a role in the novel adaptation of polar bears to a lipid-rich diet, and increased the efficacy of cholesterol clearance from the blood [3, 6]. The feeding ecology of Poolepynten was shown to fall within the range of present-day polar bears, who prey mainly on ringed seals and bearded seals [13]. Therefore, we can assume that the ability to process a lipid-rich diet was required more than 70,000 years ago, suggesting selection cannot have driven this phenotype within the last 70,000 years. This could suggest that the variants we discuss here may not have been essential in the early adaptation of polar bears, but may have been driven by increased selective pressures during the later stages of the last glacial period. Other genes previously shown to have strong signals of selection in the polar bear lineage [3], such as ABCC6, POLR1A and COL5A3, also have functions related to the cardiovascular system and metabolism [6, 12]. As these genes only have derived alleles fixed in the Late Pleistocene and present-day polar bears, they may have played a key role in driving the early adaptation of polar bears to a lipid-rich diet.
Similar to APOB, TTN is associated with the cardiovascular system. TTN encodes Titin, an abundant protein of striated muscle, which includes cardiac muscle tissue; mutations in TTN are linked with human cardiac physiology [15]. The genes AIM1 and LYST are both associated with pigmentation [16, 17]. Pigmentation is not preserved in the fossil record, which consists only of skeletal remains, and thus we have no pre-historic evidence of a white phenotype. In LYST, the majority of fixed derived alleles in present-day polar bears (five alleles) are also fixed in the Late Pleistocene polar bears. The three alleles fixed in present-day polar bears, but not in the Late Pleistocene polar bears, may have been driven to fixation in the former by selection in the last ~ 70,000 years, or by linkage disequilibrium.
In contrast with previous findings [6], we did not observe any indication of de novo mutation in the eleven genes investigated. All derived alleles fixed in present-day polar bears were present in brown bears, suggesting their presence in the ancestral brown/polar bear gene pool. The increase from 33 to 135 brown bear individuals in this study relative to previous work (Fig. 1) decreased the chances of allelic drop out. As polar bears rapidly adapted to their novel Arctic environment, the lack of de novo mutations in the polar bear lineage is perhaps not surprising. While standing variation and de novo mutation both provide the raw material for evolution, standing variation is already present in the gene pool for selection to act upon, allowing for immediate use in adaptation. De novo mutations arise randomly, segregate at an initially low frequency, and therefore require more time to reach fixation under the same selective pressure [18]. Thus, standing variation was key to the ability of polar bears to survive the Arctic environment – no matter when selection occurred. De novo mutations that would convey a selective advantage may not have been rapid enough during their transition to the Arctic. This result supports that maintaining high levels of standing variation is key to the long-term survival of a species, and may aid in their adaptation to rapidly changing environments.
Our study provides novel evidence of the timing and modes of selection in the polar bear lineage, but is not without caveats. We mapped all raw reads to the polar bear reference genome. Therefore, there may be a bias towards the reference allele [19, 20] and decreased mapping efficiency for the more distant outgroup individuals [21], which may cause some relevant sites to not be considered. However, as polar and brown bears are relatively closely related [3–5], and as we only considered individuals if the site of interest had a minimum read depth of four, we do not think reference bias would have played a large role in our inferences. We focused on derived allele fixation within coding regions of the genome. However, novel mutations in non-coding regions, e.g. in regulatory elements, may have also played a role and are an interesting avenue for future research. Palaeogenomic data are only available from two Late Pleistocene polar bear individuals. Consequently, inferences regarding the timing of allele fixation must be interpreted with caution, especially when it comes to the generalisability of the alleles fixed in the Late Pleistocene individuals and present-day polar bears. Specifically, the fixation of a given allele in only two individuals cannot be confidently extrapolated to the wider polar bear population that existed during the Late Pleistocene, although our two individuals are geographically disparate and therefore may be representative of polar bear ancestry at the time. Despite this, we are more confident that if an allele is not fixed in the two Late Pleistocene polar bears, then it is highly likely to have only become fixed after the age of the youngest specimen (70,000 years ago), as we observe for alleles in APOB, LYST, TTN. As additional ancient data from a wider temporal and spatial array of polar bears may become available in future, it may be possible to further pinpoint the timing of allelic fixation and these crucial adaptations, which have enabled polar bears to inhabit one of the coldest environments on Earth.
Methods
Late Pleistocene polar bear individuals
Available data from two Pleistocene polar bears were included in the study. Genomic data of Bruno (110–70,000 years old) was previously generated from the skull of a juvenile polar bear sample that was found in 2009 on the coast of the Beaufort Sea, near Point McLeod in Arctic Alaska [2]. Genomic data from Poolepynten (130–110,000 years old) was previously extracted from a left mandible, which was found in Svalbard [10, 13]. Age determination with infrared-stimulated luminescence suggested that it is probably the oldest polar bear fossil discovered [10, 22].
Present-day individuals
Following the previous study by Castruita, Westbury, and Lorenzen [6], our analysis included the data set from Liu et al. of 89 genomes [3] (79 polar bear, 10 brown bear) and the 30 polar bear and 23 brown bear genomes published elsewhere [11, 23–26]. We obtained the mapped files from the Castruita, Westbury, and Lorenzen publication which utilised raw reads from NCBI (Bioproject IDs: PRJNA169236, PRJNA196978, PRJNA210951, PRJNA271471, PRJNA395974, and PRJEB27491).
New to the present study, we incorporated available genomic data from populations of polar bears in Southeast Greenland (n = 10) [7], and brown bears from Hokkaido, Japan (n = 6) [9] and across their Holarctic distribution (n = 96) [8]. We downloaded the SRA files from NCBI from the Bioproject IDs: PRJNA669153, PRJDB11280, and PRJNA913591. Information on the newly incorporated individuals can be found in Supplementary Table S3.
Raw data processing
For the 142 individuals from Castruita, Westbury, and Lorenzen [6], raw sequencing reads were previously processed with the PALEOMIX [27] pipeline. Internally, adapter sequences, stretches of Ns, and low-quality bases were trimmed and filtered with AdapterRemovalv2 [28] using default parameters. BWA v0.7.15 [29] aln was used to map the cleaned reads to the pseudo-chromosomal polar bear genome (Genbank accession: GCA_000687225.1) from Liu et al. [3], with default parameters. We chose the pseudo-chromosome assembly as the reference genome to keep our analyses consistent with the previous studies [3, 6]. Reads with mapping quality of less than 30 were filtered using SAMtools v1.6 [30]. Duplicates were removed with picard v2.6.0 [31]. Possible paralogs were filtered using SAMtools. Finally, local realignment around indels was performed using GATK (v 3.3) [32].
For the 112 newly incorporated individuals, we trimmed adapter sequences and polyG sequences (-g) and merged overlapping read pairs (-m) with Fastp v0.23.2 [33]. To the exclusion of the -g and -m parameters, we otherwise used default parameters. We mapped the cleaned reads to the same pseudochromosome polar bear genome with BWA v0.7.15 [29] aln with the seed disabled (-l 690). We used SAMtools v1.6 [30] to filter the reads with mapping quality of less than 30 and remove duplicates. We assessed aDNA damage in the two Late Pleistocene polar bear individuals and adjusted base quality scores around damage using Mapdamage2 (–rescale) [34, 35]. To determine the ancestral allele, we included single individual representatives of the spectacled bear [36] and the giant panda [37] (Bioprojects PRJNA472085 and PRJNA38683). We mapped the reads to the same pseudochromosome polar bear genome following the same approach as the 112 newly incorporated present-day individuals.
Genomic differentiation
To investigate whether there was still clear genomic differentiation between polar bears and brown bears at the eleven genes of interest, we performed independent principal component analyses (PCAs) for each gene including the 50 kb regions upstream and downstream of the gene. We used a genotype likelihood approach to construct the PCAs: input genotype likelihood files were constructed using ANGSD v0.929 [38], with the SAMtools genotype likelihood algorithm (− GL 1), and specifying the following parameters: remove reads that have multiple mapping best hits (− uniqueonly), remove reads with a flag above 255/secondary hits (− remove_bads), include only read pairs with both mates mapping correctly (− only_proper_pairs), adjust mapQ for reads with excessive mismatches (− C 50), adjust quality scores around indels (− baq 1), a minimum mapping quality of 20 (− minMapQ 20), a minimum base quality of 20 (− minQ 20), determine the major allele based on the genotype likelihoods (-doMajorMinor 1), calculate allele frequencies assuming a fixed major allele and an unknown minor allele (-doMaf 2), generate beagle output file (-doGlf 2), discard sites where there is no data in at least 95% of the individuals (− minInd), skip tri-allelic sites (− skipTriallelic), and remove SNP sites with a p-value larger than 1e − 6 (− SNP_pval 1e-6). The ANGSD output beagle file was run through PCAngsd v0.95 [39] to generate a covariance matrix.
Genotype calling
We investigated eleven of the genes previously inferred using population genomics and demographic modelling to have the strongest signals of positive selection in the polar bear [3]. These included ABCC6, AIM1, APOB, COL5A3, CUL7, FCGBP, LAMC3, LYST, POLR1A, TTN, and XIRP1. We excluded EDH3 due to potential for admixture between polar and brown bears in the genomic region containing the gene [6].
We called genotypes using ANGSD v0.921 [38] following the approach of [6]. To call genotypes we used the SAMtools genotype likelihood algorithm (-GL 1) and the following parameters; remove reads that have multiple mapping best hits (-unique_only 1), remove reads with a flag above 255/secondary hits (-remove_bads 1), adjust quality scores around indels (-baq 1), a minimum mapping quality of 20 (-minMapQ 20), a minimum base quality of 20 (-minQ 20), write major and minor alleles and the genotype directly (-doGeno 5), estimate the posterior genotype probability based on the allele frequency as a prior (-doPost 1), use the reference allele as the major allele (-doMajorMinor 4), and calculate allele frequencies assuming a fixed major allele and an unknown minor allele (-doMaf 2). In order to decrease biases that could arise when calling heterozygous alleles from the low-coverage genomes, we only called genotypes from individuals that had at least 4 × coverage at the site of interest (-geno_minDepth 4). We only included biallelic sites where each allele led to a different amino acid (non-synonymous differences).
To determine which allele was the ancestral allele, we used the outgroup spectacled bear and giant panda sequences. If the allele fixed in all polar bears was found in either of these individuals, we removed that site from further consideration.
We further investigated for false positive heterozygous sites that may have arisen due to aDNA damage in the Late Pleistocene individuals. We investigated the read count for each of the four bases at each site of interest, focusing on heterozygous sites which might be caused by aDNA damage (C-T and G-A). Read counts were generated in ANGSD using the -dumpcount parameter. We calculated the proportion of the minor base of each heterozygous site and only if the ratio is more than 30%, would we assume that this site is heterozygous and not a false positive.
Supplementary Information
Acknowledgements
Not applicable.
Author’s contributions
E.D.L and M.V.W conceptualised the study. Y.S performed the computational analyses. Y.S and M.V.W interpreted the results. Y.S and M.V.W wrote the initial draft of the manuscript. All authors read and approved the final manuscript.
Funding
Open access funding provided by Copenhagen University The work was supported by the Villum Fonden grant no. 37352 and the Independent Research Fund Denmark grant no. 9064-00025B.
Availability of data and materials
All polar and brown bear short read data can be found under the following NCBI Bioproject IDs: PRJNA169236, PRJNA196978, PRJNA210951, PRJNA271471, PRJNA395974, PRJEB27491, PRJNA669153, PRJDB11280, and PRJNA913591. The polar bear genome used as the mapping reference can be found under the Genbank accession: GCA_000687225.1. The pseudo-chromosome version of the above polar bear genome produced by Liu et al. 2013, can be found on the University of Copenhagen’s Electronic Research Data Archive (ERDA) under the following link: https://sid.erda.dk/share_redirect/amLYDcI3uJ The spectacled bear and giant panda short read data found under the following NCBI Bioproject IDs PRJNA472085 and PRJNA38683.
Declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.McKinney MA, Atwood TC, Iverson SJ, Peacock E. Temporal complexity of southern Beaufort Sea polar bear diets during a period of increasing land use. Ecosphere. 2017;8:e01633. 10.1002/ecs2.1633 [DOI] [Google Scholar]
- 2.Wang M-S, Murray GGR, Mann D, Groves P, Vershinina AO, Supple MA, et al. A polar bear paleogenome reveals extensive ancient gene flow from polar bears into brown bears. Nat Ecol Evol. 2022;6:936–44. 10.1038/s41559-022-01753-8 [DOI] [PubMed] [Google Scholar]
- 3.Liu S, Lorenzen ED, Fumagalli M, Li B, Harris K, Xiong Z, et al. Population genomics reveal recent speciation and rapid evolutionary adaptation in polar bears. Cell. 2014;157:785–94. 10.1016/j.cell.2014.03.054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kumar V, Lammers F, Bidon T, Pfenninger M, Kolter L, Nilsson MA, et al. The evolutionary history of bears is characterized by gene flow across species. Sci Rep. 2017;7:46487. 10.1038/srep46487 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zou T, Kuang W, Yin T, Frantz L, Zhang C, Liu J, et al. Uncovering the enigmatic evolution of bears in greater depth: The hybrid origin of the Asiatic black bear. Proc Natl Acad Sci U S A. 2022;119:e2120307119. 10.1073/pnas.2120307119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Castruita JAS, Westbury MV, Lorenzen ED. Analyses of key genes involved in Arctic adaptation in polar bears suggest selection on both standing variation and de novo mutations played an important role. BMC Genomics. 2020;21:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Laidre KL, Supple MA, Born EW, Regehr EV, Wiig Ø, Ugarte F, et al. Glacial ice supports a distinct and undocumented polar bear subpopulation persisting in late 21st-century sea-ice conditions. Science. 2022;376:1333–8. 10.1126/science.abk2793 [DOI] [PubMed] [Google Scholar]
- 8.de Jong MJ, Niamir A, Wolf M, Kitchener AC, Lecomte N, Seryodkin IV, et al. Range-wide whole-genome resequencing of the brown bear reveals drivers of intraspecies divergence. Commun Biol. 2023;6:153. 10.1038/s42003-023-04514-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Endo Y, Osada N, Mano T. Demographic history of the brown bear (Ursus arctos) on Hokkaido Island, Japan, based on whole-genomic sequence analysis. Genome Biol. 2021;13:evab195. [DOI] [PMC free article] [PubMed]
- 10.Lan T, Leppälä K, Tomlin C, Talbot SL, Sage GK, Farley SD, et al. Insights into bear evolution from a Pleistocene polar bear genome. Proc Natl Acad Sci U S A. 2022;119:e2200016119. 10.1073/pnas.2200016119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cahill JA, Green RE, Fulton TL, Stiller M, Jay F, Ovsyanikov N, et al. Genomic evidence for island population conversion resolves conflicting theories of polar bear evolution. PLoS Genet. 2013;9:e1003345. 10.1371/journal.pgen.1003345 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Blake JA, Baldarelli R, Kadin JA, Richardson JE, Smith CL, Bult CJ. Mouse Genome Database (MGD): Knowledgebase for mouse–human comparative biology. Nucleic Acids Res. 2020;49:D981–7. 10.1093/nar/gkaa1083 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lindqvist C, Schuster SC, Sun Y, Talbot SL, Qi J, Ratan A, et al. Complete mitochondrial genome of a Pleistocene jawbone unveils the origin of polar bear. Proc Natl Acad Sci U S A. 2010;107:5053–7. 10.1073/pnas.0914266107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Benn M. Apolipoprotein B levels, APOB alleles, and risk of ischemic cardiovascular disease in the general population, a review. Atherosclerosis. 2009;206:17–30. 10.1016/j.atherosclerosis.2009.01.004 [DOI] [PubMed] [Google Scholar]
- 15.Gerull B. The Rapidly Evolving Role of Titin in Cardiac Physiology and Cardiomyopathy. Can J Cardiol. 2015;31:1351–9. 10.1016/j.cjca.2015.08.016 [DOI] [PubMed] [Google Scholar]
- 16.Runkel F, Büssow H, Seburn KL, Cox GA, Ward DM, Kaplan J, et al. Grey, a novel mutation in the murine Lyst gene, causes the beige phenotype by skipping of exon 25. Mamm Genome. 2006;17:203–10. 10.1007/s00335-005-0015-1 [DOI] [PubMed] [Google Scholar]
- 17.Du J, Fisher DE. Identification of Aim-1 as the underwhiteMouse Mutant and Its Transcriptional Regulation by MITF *. J Biol Chem. 2002;277:402–6. 10.1074/jbc.M110229200 [DOI] [PubMed] [Google Scholar]
- 18.Barrett RDH, Schluter D. Adaptation from standing genetic variation. Trends Ecol Evol. 2008;23:38–44. 10.1016/j.tree.2007.09.008 [DOI] [PubMed] [Google Scholar]
- 19.Brandt DYC, Aguiar VRC, Bitarello BD, Nunes K, Goudet J, Meyer D. Mapping Bias Overestimates Reference Allele Frequencies at the HLA Genes in the 1000 Genomes Project Phase I Data. G3. 2015;5:931–41. 10.1534/g3.114.015784 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ros-Freixedes R, Battagin M, Johnsson M, Gorjanc G, Mileham AJ, Rounsley SD, et al. Impact of index hopping and bias towards the reference allele on accuracy of genotype calls from low-coverage sequencing. Genet Sel Evol. 2018;50:64. 10.1186/s12711-018-0436-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Prasad A, Lorenzen ED, Westbury MV. Evaluating the role of reference-genome phylogenetic distance on evolutionary inference. Mol Ecol Resour. 2022;22:45–55. 10.1111/1755-0998.13457 [DOI] [PubMed] [Google Scholar]
- 22.Ingólfsson Ó, Wiig Ø. Late Pleistocene fossil find in Svalbard: the oldest remains of a polar bear (Ursus maritimus Phipps, 1744) ever discovered. Polar Res. 2009;28:455–62. 10.1111/j.1751-8369.2008.00087.x [DOI] [Google Scholar]
- 23.Miller W, Schuster SC, Welch AJ, Ratan A, Bedoya-Reina OC, Zhao F, et al. Polar and brown bear genomes reveal ancient admixture and demographic footprints of past climate change. Proc Natl Acad Sci U S A. 2012;109:E2382–90. 10.1073/pnas.1210506109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Cahill JA, Stirling I, Kistler L, Salamzade R, Ersmark E, Fulton TL, et al. Genomic evidence of geographically widespread effect of gene flow from polar bears into brown bears. Mol Ecol. 2015;24:1205–17. 10.1111/mec.13038 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Benazzo A, Trucchi E, Cahill JA, Maisano Delser P, Mona S, Fumagalli M, et al. Survival and divergence in a small group: The extraordinary genomic history of the endangered Apennine brown bear stragglers. Proc Natl Acad Sci U S A. 2017;114:E9589–97. 10.1073/pnas.1707279114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Barlow A, Cahill JA, Hartmann S, Theunert C, Xenikoudakis G, Fortes GG, et al. Partial genomic survival of cave bears in living brown bears. Nat Ecol Evol. 2018;2:1563–70. 10.1038/s41559-018-0654-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Schubert M, Ermini L, Der Sarkissian C, Jónsson H, Ginolhac A, Schaefer R, et al. Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nat Protoc. 2014;9:1056–82. 10.1038/nprot.2014.063 [DOI] [PubMed] [Google Scholar]
- 28.Schubert M, Lindgreen S, Orlando L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res Notes. 2016;9:88. 10.1186/s13104-016-1900-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Broad institute. Picard Toolkit. 2019. http://broadinstitute.github.io/picard.
- 32.McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303. 10.1101/gr.107524.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–90. 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ginolhac A, Rasmussen M, Gilbert MTP, Willerslev E, Orlando L. mapDamage: testing for damage patterns in ancient DNA sequences. Bioinformatics. 2011;27:2153–5. 10.1093/bioinformatics/btr347 [DOI] [PubMed] [Google Scholar]
- 35.Jónsson H, Ginolhac A, Schubert M, Johnson PLF, Orlando L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics. 2013;29:1682–4. 10.1093/bioinformatics/btt193 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Saremi NF, Oppenheimer J, Vollmers C, O’Connell B, Milne SA, Byrne A, et al. An annotated draft genome for the Andean bear, Tremarctos ornatus. J Hered. 2021;112:377–84. [DOI] [PMC free article] [PubMed]
- 37.Li R, Fan W, Tian G, Zhu H, He L, Cai J, et al. The sequence and de novo assembly of the giant panda genome. Nature. 2010;463:311–7. 10.1038/nature08696 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics. 2014;15:356. 10.1186/s12859-014-0356-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Meisner J, Albrechtsen A. Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data. Genetics. 2018;210:719–31. 10.1534/genetics.118.301336 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All polar and brown bear short read data can be found under the following NCBI Bioproject IDs: PRJNA169236, PRJNA196978, PRJNA210951, PRJNA271471, PRJNA395974, PRJEB27491, PRJNA669153, PRJDB11280, and PRJNA913591. The polar bear genome used as the mapping reference can be found under the Genbank accession: GCA_000687225.1. The pseudo-chromosome version of the above polar bear genome produced by Liu et al. 2013, can be found on the University of Copenhagen’s Electronic Research Data Archive (ERDA) under the following link: https://sid.erda.dk/share_redirect/amLYDcI3uJ The spectacled bear and giant panda short read data found under the following NCBI Bioproject IDs PRJNA472085 and PRJNA38683.