Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2022 Apr 12;12:6118. doi: 10.1038/s41598-022-09868-5

High genomic diversity in the endangered East Greenland Svalbard Barents Sea stock of bowhead whales (Balaena mysticetus)

José Cerca 1,6,#, Michael V Westbury 2,#, Mads Peter Heide-Jørgensen 3, Kit M Kovacs 4, Eline D Lorenzen 2, Christian Lydersen 4, Olga V Shpak 5,7, Øystein Wiig 1, Lutz Bachmann 1,
PMCID: PMC9005726  PMID: 35414162

Abstract

The East Greenland-Svalbard-Barents Sea (EGSB) bowhead whale stock (Balaena mysticetus) was hunted to near extinction and remains Endangered on the International Union of Conservation of Nature Red List. The intense, temporally extensive hunting pressure may have left the population vulnerable to other perturbations, such as environmental change. However, the lack of genomic baseline data renders it difficult to evaluate the impacts of various potential stressors on this stock. Twelve EGSB bowhead whales sampled in 2017/2018 were re-sequenced and mapped to a previously published draft genome. All individuals were unrelated and void of significant signs of inbreeding, with similar observed and expected homo- and heterozygosity levels. Despite the small population size, mean autosome-wide heterozygosity was 0.00102, which is higher than that of most mammals for which comparable estimates are calculated using the same parameters, and three times higher than a conspecific individual from the Eastern-Canada-West-Greenland bowhead whale stock. Demographic history analyses indicated a continual decrease of Ne from ca. 1.5 million to ca. 250,000 years ago, followed by a slight increase until ca. 100,000 years ago, followed by a rapid decrease in Ne between 50,000 and 10,000 years ago. These estimates are lower than previously suggested based on mitochondrial DNA, but suggested demographic patterns over time are similar.

Subject terms: Genomics, Population genetics, Molecular evolution, Ecological genetics, Ecology, Evolution, Genetics

Introduction

The bowhead whale (Balaena mysticetus) is the only baleen whale that lives its entire life in Arctic and subarctic regions, often in association with sea ice1. Four bowhead whale stocks are currently recognized: (1) the Bering/Chukchi/Beaufort Seas (BCB) stock; (2) the Okhotsk Sea (OKH) stock; (3) the Eastern Canada-West Greenland (ECWG) stock; and (4) the East Greenland-Svalbard-Barents Sea (EGSB) stock, earlier referred to as the Spitsbergen stock2. The bowhead whale, as a species, is listed as Least Concern in the International Union of Conservation of Nature (IUCN) Red List. However, the EGSB stock, which is distributed from the East Greenland Sea across the northern Barents Region into Russia (including Severnaya Zemlya and Franz Josef Land waters), is classified as Endangered2.

There has been ongoing discussion regarding the census size of the EGSB stock. This population was thought to be large prior to the onset of extensive hunting, which commenced circa 1611. Estimates of the preharvest stock size range from 25,000 to 100,000 individuals3. When hunting ceased in 1932, the population was thought to be depleted to near extinction4,5. Recent estimates of the current size of this stock have ranged from a few tens6 to several hundred individuals7. In support of these low estimates, only a few bowhead whale sightings were reported from ship-based surveys in the Fram Strait during spring 2006, 2008 and 2010810. A combined helicopter- and ship-based line transect covering the marginal ice zone (MIZ) from the Russian border westward to the northwest corner of the Svalbard Archipelago estimated some 350 individuals in this region in late summer11. Surveys conducted in East Greenland waters in spring and late summer/early fall documented similar numbers in the western parts of the stock’s range12. However, individual animals move across the whole range; bowhead whales tagged in East Greenland shelf waters have travelled eastward to Franz Josef Land and beyond1.

Several genetic studies have focussed on elucidating the genetic structure and differentiation of the four recognized bowhead whale stocks1317. For the EGSB stock specifically, four studies, all targeting mitochondrial markers, have been conducted. Borge et al.14 assessed mitochondrial haplotype diversity by sequencing parts of the mitochondrial D-loop region in bone remains from 99 historic bowhead whales. They found relatively high genetic diversity with levels similar to the genetic diversity of the extant BCB stock, which has also been depleted by extensive hunting. Three further studies generated full mitochondrial genome sequences from historic18 and contemporary individuals17,19. The contemporary mitochondrial genome sequences supported the earlier findings of Borge et al.14 that the EGBS population is genetically differentiated from the other three stocks. Inferences regarding the demographic history of the EGSB bowhead whale stock suggested a boom–bust scenario that included Late Pleistocene population growth 25,000–35,000 years ago, followed by a Holocene decline17. However, the confidence limits of the estimates were broad and overlapping throughout the time frame of the study, making it impossible to define numerical trends for this population. Since mitochondrial DNA is a single, maternally inherited, non-recombining marker, and thus particularly sensitive to drift in small populations, inferences based on just mitochondrial DNA have to be interpreted cautiously, as the data may not fully represent the diversity and demography of the EGSB stock. Accordingly, it is important to obtain nuclear genomic data that offer more refined insights.

The current study addresses levels of genetic diversity in the EGSB stock of bowhead whales based on nuclear genomes from the 12 individuals that had been previously analysed using mitochondrial genomes17. Nuclear high-throughput sequencing data were analysed for heterozygosity, population structure, inbreeding and demography. The heterozygosity estimates were compared to those obtained for archived sequence data for a conspecific individual from the ECWG stock and a North Atlantic right whale (Eubalaena glacialis). These genomes and the findings provide important baseline data for future conservation and management of bowhead whale stocks.

Materials and methods

Ethics declaration

Permits for animal handling were issued by the Norwegian Animal Research Authority (FOTS ID: 11,821), the Governor of Svalbard (Sysselmannen, permit ID: 16/01,600–6) and the Greenland Ministry of Fisheries, Hunting and Agriculture (ref. 2017–2551, akt. no.8267820). All methods were carried out in accordance with relevant guidelines and regulations. All methods are reported in accordance with ARRIVE guidelines (https://arriveguidelines.org) for the reporting of animal experiments.

Sampling

Twelve bowhead whale skin biopsies collected during field expeditions in 2017 (ten samples) and 2018 (two samples) using a crossbow and custom-made darts were used in this study (for details see Bachmann et al.17). The sampling sites are depicted in Fig. 1. In addition, archived genome sequence data were downloaded from seven bowhead whale samples collected in 2006 and 2010 (for details see Nyhus et al.17, GenBank Bioproject: PRJNA798027).

Figure 1.

Figure 1

Sampling locations of the bowhead whale skin biopsies used in this study. The map was generated based on publicly available ArcMap polar projections documents using ArcGIS 10.1 (www.esri.com).

DNA extraction and sequencing

Total genomic DNA was extracted using the E.Z.N.A. Tissue DNA kit (Omega Bio-Tek) following the Tissue DNA-Spin Protocol provided with the kit. Building of Illumina sequencing libraries and sequenced using 150 bp paired-end read chemistry on an Illumina NextSeq 500 was outsourced to StarSEQ GmbH, Mainz, Germany. The raw reads have been deposited in NCBI's Sequence Read Archive (GenBank Bioproject: PRJNA643010; accessions listed in Supplementary Table S1).

Sequencing read mapping, variant calling and data filtering

This study aimed at a sequencing depth of > 10 for the 12 bowhead whale samples in order to allow for reliable estimates of genetic diversity. Raw reads were quality checked using FASTQC v0.11.820 and trimmed and filtered for Illumina adapters and low-quality bases using Trimmomatic v0.3921 with a default error rate (-e 0.1), an adaptor overlap of 1 (-O 1) and a quality cut-off of 20 (-q 20). Filtered reads were aligned to the Balaena mysticetus draft genome22 using the BWA-mem algorithm23. An overview of the mapping statistics and depth is provided in Supplementary Table S1. The sequences of the resulting alignments were sorted, marked and indexed using Picard tools v2.10.4 (http://broadinstitute.github.io/picard/). The GATK v3.7 (https://gatk.broadinstitute.org/hc/en-us) RealignerTargetCreator and subsequently IndelRealigner tools were used to remove PCR-duplicates and to account for insertion-deletion (indels) polymorphisms, which can affect downstream variant calling. Variant calling was performed both for variant and invariant sites using the GATK v3.7 HaplotypeCaller tool, resulting in a gVCF (genome Variant Call Format file) for each sample. gVCFs were then merged using the GenotypeGVCFs tool, generating a raw VCF file for downstream analyses. This raw VCF file was then filtered for QualByDepth (QD) < 5.0, FisherStrand (FS) > 10.0, RMSMappingQuality < 40.0, MQRankSum < -2.5, and ReadPosRankSum < -2.5. Finally, to include only single nucleotide polymorphism (SNP) variants, the VariantFiltration tool included in GATK and vcftools v0.1.1324 was used.

Population structure and phylogenetic network

A Principal Component Analysis (PCA) and a phylogenetic network were computed from the above variant calls to address population structure, branching pattern and the distribution of individuals within the population. The VCF file generated by the the GenotypeGVCFs tool was further filtered to remove SNPs with a minimum allele frequency < 0.05 and a mean depth value (across all included individuals; > 10 × and < 100x) using vcftools v0.1.1324. For the phylogenetic network analysis, the VCF file was converted into phylip format using vcf2phylip v2.025 and the analysis was performed using Splitstree v426. For the PCA, a linkage disequilibrium (LD) filter was applied with a window size of 50,000 bp and a step size of 10,000 (defining a r2 threshold of 0.18) using plink v1.90b5.227. This high-quality, unlinked set of variants was loaded into R v3.628 using the vcfR package29, and PCA was performed using adegenet30.

Relatedness and inbreeding

Observed and expected numbers of homozygous sites were determined from the GATK filtered variants using vcftools (–het)24. Significance of the differences between the observed and expected homozygous values was tested using a Welch two sample t-test. Additionally, individual inbreeding (F) and relatedness (R) levels between samples were estimated using NGSrelate v2, which uses identity by descent to calculate these metrics31. In contrast to vcftools, this method does not rely on directly calling genotypes, but instead gives the likelihood that any of the four nucleotides could be called at a certain position. Therefore, this method can also produce reliable results for low-coverage data (down to ca. 1x). Furthermore, NGSrelate relies on a population-level reference panel to compute relatedness and inbreeding, the more individuals included in the analysis—the more reliable the results are. Thus, for the relatedness analysis, the seven samples collected in 2006 and 2010 (previously sequenced to low coverage (0.8× to 2.45×), for sample details see SRA; Bioproject: PRJNA798027; accessions listed in Supplementary Table S1, and Nyhus et al.19) were also included. As input for NGSrelate, genotype likelihoods (GL) of the dataset were calculated from the mapped bam files using ANGSD v0.92132. The following filtering options were used: minimum mapping and base quality of 30 (-minmapQ and -minQ 30); calculate genotype likelihoods using the GATK algorithm (-GL 2); output binary genotype likelihoods (-doGlf 3); infer major and minor alleles using GL (-doMajorMinor 1); calculate per site frequencies using a fixed major and unknown minor allele (-doMaf 2); only include SNPs with a p-value less than 1 × 10–6 (-SNP_pval 1e-6), and a minimum minor allele frequency of 0.05 (-minmaf 0.05). The number of sites included in the pairwise comparisons between individuals to calculate relatedness ranged from 2,118,337 to 12,161,246. Further, runs of homozygosity (ROH) as a proxy for inbreeding were determined in each of the 12 newly generated genomes (samples collected in 2017 and 2018) using ROHan33. The program was run using only autosomal scaffolds > 1 Mb in length to avoid biases due to the fragmented assembly. However, it should be noted that although the analysis still contained 968,853,266 bp of data, ROH may be downwardly biased from the fragmented assembly. ROHan was run twice, once specifying a ROH if 1 Mb of data contains an average proportion of heterozygous sites less than 1 × 10–5 and once specifying the average proportion of heterozygous sites less than 5 × 10–5. Bowhead whale X and Y-linked scaffolds were identified by aligning the published bowhead whale genome assembly to the cow X chromosome (GenBank accession number: AC_000187.1) and the human Y chromosome (GenBank accession number: NC_000024.10) using satsuma synteny34. The latter were subsequently removed because they can bias heterozygosity estimates in males.

Genetic diversity

Autosome-wide heterozygosity was calculated for the seven samples with > 20 × average read depth (Supplementary Table S1). All individuals were down-sampled to 20 × read depth using SAMtools35 in order to avoid coverage linked biases and to ensure comparability between previous estimates. Genetic diversity estimates were subsequently determined using ANGSD v0.92132 with the filtering steps following Westbury et al.36 and only compared against estimates from other species calculated using the same software and parameters. The filtering parameters were as follows: calculated genotype likelihoods using the SAMtools algorithm (-GL 1); only consider sites with at least 5 × coverage (-setMinDepthInd 5); minimum mapping and quality scores were defined as 25 (-minmapq 25 and -minq 25); only consider reads mapping to a single region uniquely (-uniqueonly 1); remove secondary alignments (-remove_bads 1); calculate allele frequencies (-doSaf 1); only use scaffolds > 100 kb (-rf); perform indel realignments (-baq 1); and do not specify an ancestral allele (-fold 1). The genomic raw reads (GenBank biosample ID: SAMN14122067) of the North Atlantic right whale (Eubalaena glacialis) were downloaded and mapped to a North Atlantic right whale reference genome retrieved from https://www.dnazoo.org/assemblies/Eubalaena_glacialis using BWA mem and default parameters. From a total of 773,071,534 reads (386,535,767 read pairs) 596,115,335 resulted in a mean read depth of 38.8x. Autosome-wide heterozygosity for the right whale was calculated using the same approach as described above for the bowhead whales.

Demographic analysis

Deep time demographic analyses (changes in effective population size (Ne) through time) were run on each individual using a Pairwise Sequentially Markovian Coalescent (PSMC) model37. As input for PSMC, indel-realigned diploid consensus sequences were built from the mapped bam files using SAMtools35 and BCFtools (http://samtools.github.io/bcftools/bcftools.html). The atomic time interval setting 4 + 25*2 + 4 + 6 was used38. Mutation rate was set to 2.69 × 10–8 substitutions per generation (i.e. 7.69 × 10–10 per site per year), a value specifically calculated for bowhead whales by Westbury et al.38. Following Rooney et al.13 the generation time was set to 35 years, an estimate that is based on information about lifespan, sexual maturity and reproduction intervals of female bowhead whales. However, generation time of bowhead whales is a topic of debate, and estimates may be considered 'guesstimates'. Other studies39,40 have used ca 50 years, and we repeated the analyses with a generation time of 50 years. Different generation times will affect the timelines of demographic analyses but not the overall trajectories. To estimate the more recent demographic history of the EGSB stock stairway plots v241 was implemented using the unfolded site frequency spectrum (SFS) calculated using the 12 newly generated genomes (samples collected in 2017 and 2018). The right whale was mapped to the bowhead whale reference genome using BWA mem to determine the ancestral state. A total of 542,282,179 reads mapped, resulting in a mean read depth of 38.8x. The SFS was calculated from allele frequencies in ANGSD v0.92132 (-doSaf 1) from all scaffolds > 100 kb in length with the following parameters: -minmapQ 25, -minQ 25; -GL 2, -doMajorMinor 1; -doMaf 2, only use sites where all individuals have coverage (-minind 12); skip triallelic sites (-skipTriallelic 1); and -setMinDepthInd 5. The same generation times and mutation rate were used as for the PSMC analyses. An analysis using a folded SFS was also run to investigate whether the ancestral state could impact the results. ANGSD was run in the same fashion as for the unfolded SFS but with the -fold option and the -anc sequence replaced with the bowhead whale reference genome instead of the right whale.

Results

Variant calling and SNPs

Direct variant calling of the mapped reads across all individuals yielded a total of 11,928,111 variable sites after initial GATK filtering. After further filtering for coverage and minimum allele frequency, and after removing indels, the dataset retained 7,750,241 SNPs. The specific numbers of SNPs for each individual included in the downstream analyses are listed in Table 1. After further filtering for linkage disequilibrium (LD) the SNP set used for the PCA numbered 332,255 SNPs across the 12 EGSB stock samples.

Table 1.

Observed and expected numbers of homo- and heterozygous sites and autosome-wide heterozygosity proportions for the EGSB stock samples with an average coverage > 20 × calculated with ANGSD32.

Bowhead whale Number of assessed SNPs Observed homozygous SNPs Expected homozygous SNPs Observed heterozygous SNPs Expected heterozygous SNPs Heterozygosity proportions
17–05 7,707,648 5,191,417 5,114,415 2,516,231 2,593,233
17–07 7,713,101 5,100,155 5,117,900 2,612,946 2,595,201 0.00102
17–08 7,717,794 5,063,352 5,120,835 2,653,842 2,596,959 0.00102
17–10 7,711,971 5,209,113 5,117,141 2,502,858 2,594,830
17–12 7,717,953 5,077,716 5,120,931 2,640,237 2,597,022 0.00102
17–17 7,717,011 5,200,933 5,120,305 2,516,078 2,596,706 0.00098
17–18 7,717,338 5,043,461 5,120,541 2,673,877 2,596,797 0.00103
17–19 7,713,315 5,127,866 5,118,025 2,585,449 2,595,290 0.00105
17–20 7,700,325 5,322,498 5,109,892 2,377,827 2,590,433
17–21 7,718,136 5,072,339 5,121,074 2,645,797 2,597,062 0.00103
18–05 7,713,449 5,144,031 5,117,923 2,569,418 2,595,526
18–06 7,714,979 5,105,321 5,118,945 2,609,658 2,596,034

Population structure

The PCA revealed no population structure within the EGSB stock (Fig. 2). This was supported by the phylogenetic network (Fig. 3), which showed a star-like (branching-from-the-centre) pattern, but no structuring of more closely related individuals.

Figure 2.

Figure 2

PCA exploring population structure in EGSB stock bowhead whales sampled in 2017 and 2018. No obvious population structuring was identified.

Figure 3.

Figure 3

Phylogenetic network based on genomic diversity data for 12 bowhead whales from the EGSB stock indicating a star-like (branching-from-the-centre) pattern, but no structuring of more closely related individuals.

Relatedness and inbreeding

For the 12 bowhead whale individuals sampled in 2017 and 2018, coefficients of relatedness were all smaller than ~ 5% (Supplementary Table S2), suggesting that none of the individuals were closely related. Consistent with this result, none of the individuals sampled in 2006 or 2010 showed any signs of relatedness to the individuals collected in 2017 and 2018 that were analysed in the present study. However, six of the seven samples collected in 2006 (individuals A-F, Supplementary Table S1) showed high degrees of relatedness to each other, which is best explained by re-sampling of the same individual (see below). Coefficients of relatedness between these individuals ranged from 50.5% (as expected for parent–offspring or full siblings), to 87.6% (approaching monozygotic twins). Observed and expected homozygosity did not differ significantly in any individual (p < 0.066). The NGSrelate inbreeding coefficient for the 12 individuals was 0. ROHan did not find any significant ROH in any of the 12 individuals when using the default ROHan setting of 1 × 10–5 for the definition of a ROH. However, when increasing this to 5 × 10–5 there was some ROH in all individuals except for 17–18 and 17–19 (Supplementary Table S3).

Genetic diversity

The heterozygosity proportions (i.e. average proportion of sites within the autosomes that are heterozygous) ranged from 0.00098 to 0.00105 (Table 1) with a mean autosomal heterozygosity of 0.00102. A comparison of these heterozygosity estimates with those of a variety of mammalian species for which comparable data are available (Fig. 4) revealed that the EGSB stock bowhead whales display relatively high levels of heterozygosity. Furthermore, the EGSB individuals display heterozygosity proportions approximately three times higher than estimated for a conspecific from West Greenland21.

Figure 4.

Figure 4

Mean autosome-wide heterozygosity proportions for EGSB stock bowhead whales (mean for seven individuals as listed in Table 1) compared to a variety of other mammalian species36,36,42,43 and the ECWG bowhead whale individual analysed by Keane et al.22, for which estimates are available calculated using the same software and parameters. Red dots show bowhead whales, blue dots other cetacean species and black dots other mammalian species.

Demographic analysis

Demographic analyses using PSMC suggested a more or less constant decrease in effective population size (Ne) from ca. 30,000 individuals some 1.5 million years ago to ca. 7000 individuals ca. 250,000 years ago. Subsequently, there was a slight increase of Ne up to ca. 9000 at 75,000 to 100,000 years ago. Figure 5 illustrates the demographic analysis for the last ca 1,500,000 years for both generation times of 35 and 50 years. Supplementary Table S4 provides the various estimates through time for a generation time of 35 years. For the period 1.5—5 million years ago, Ne was estimated as more or less stable between 20,000 and 30,000 individuals (Supplementary Fig. S1 and Table S4). The demographic histories of all 12 individuals included in this study demonstrated similar demographic trajectories, and supported the results of the population structure analyses that all samples arise from a single population.

Figure 5.

Figure 5

PSMC demographic analysis model outputs for 12 bowhead whales from the EGSB reaching back 1.5 million years. (A) using a generation time of 35 years. (B) using a generation time of 50 years.

The stairway plots (Fig. 6 and Supplementary Fig. S2) revealed several instances of change in Ne. There was a decrease beginning approximately ~ 300,000 years ago that lasted approximately 100,000 years, followed by an increase over an approximately 100,000-year period beginning ~ 200,000 years (until 100,000 years ago), followed by a stable Ne until around 50,000 years ago before a dramatic decrease between 50,000 and 20,000 years ago. The plots were similar when using both the folded and unfolded SFS as inputs (Fig. 6 and Supplementary Fig. S2).

Figure 6.

Figure 6

Stairway plot of the more recent demographic history of the EGSB stock with 2.5 and 97.5% confidence intervals. A: using an unfolded and B: using a folded site frequency spectrum as input. Generation time was set to 35 years.

Discussion

The potential consequences of more than three centuries of extensive whaling on the EGSB bowhead whale stock (previously known as the Spitzbergen stock) have been a concern raised by both researchers and conservation and management bodies such as the International Whaling Commission (IWC) and the IUCN14,17,44. In the Svalbard Archipelago, the hunt for bowhead whales started shortly after the discovery of the archipelago, and by the early nineteenth century few bowhead whales remained in the surrounding waters45. The EGSB stock became officially protected in 19324547. Recent studies suggest that the census size of the EGSB stock is at least in the low hundreds, and that it is either increasing, or has never been as low as the worst-case scenario of tens suggested based on decades with virtually no sightings11,12. Nevertheless, it is undisputed that population size is still far below the estimates calculated for the pre-whaling stock. Moreover, current climate change, and in particular sea ice declines, and Arctic marine traffic intensification may further negatively affect the already heavily depleted EGSB stock1.

Based on the analyses reported here, there are only a few significant runs of homozygosity (ROH) for the 12 individuals sampled in 2017 and 2018. Furthermore, autosomal heterozygosity proportion estimates for seven bowhead whales sampled in 2017/18 (Fig. 4) were relatively high compared to estimates for other mammalian species for which comparable estimates obtained by the same methods are available36,38,42,43. However, the bowhead whale is a long-lived mammal, and its longevity may extend past 200 years21,48. Even if hunting was extreme, the effective population size would have needed to be extremely small over the entire period of hunting to observe substantial reductions in genetic diversity in the present. The findings were somewhat different for six (samples A–F) of the seven individuals sampled in 2006, which showed very high relatedness values. However, as has been speculated earlier19, this 2006 collection is likely biased; all samples were collected within a very small area on the same day from a group of bowhead whales that were moving in synchrony8. Accordingly, re-sampling of the same individual may best explain the finding of identical mitochondrial genome sequences for the six individuals in question18. Importantly, the low coverage data for the samples collected in 2006 and 2010 were only used for the relatedness analysis; accordingly, the most likely duplicate samples did not bias any other analyses.

Low levels of relatedness between individuals and ROH, in addition to high levels of autosomal heterozygosity proportions, is often indicative of large population size. The most recent population estimate of ~ 350 individuals for EGSB bowheads, which is based on a combined helicopter- and ship-based line transect survey from the Russian border westward to the northwest corner of the Svalbard Archipelago stock, is still rather low11. It must be noted, however, that this estimate only represents part of the stock’s range, and the genomes of the sequenced individuals are highly diverse genetically speaking. Of note, autosomal heterozygosity levels were similarly high in all seven bowhead whale individuals analysed in this context. However, high genetic diversity does not always translate to large population size. Factors such as changes in population size, gene flow or inbreeding levels can also strongly affect the estimates. For various remnant populations of Cape buffalos (Syncerus caffer) and Giant panda (Ailuropoda melanoleuca), for example, relatively high genome-wide heterozygosity has been reported while population sizes are known to be low49. This may not be surprising because the population decline is recent in these cases. However, in other Arctic whale species such as the beluga whale (Delphinapterus leucas) and narwhal (Monodon monoceros), which both have a substantially larger population sizes, lower mean autosomal heterozygosity than the average EGSB bowhead whale value derived herein have been observed; for the narwhal, a recent rapid expansion from a much smaller founding population has been suggested as a likely driver of the low genetic diversity found for this species38. Unfortunately, except for the one ECWG bowhead whale individual sequenced by Keane et al.22, there are to date no other genome-wide estimates for individuals from other bowhead whale stocks. As pointed out above, it remains open as to what extent the estimates presented here reflect genetic diversity of bowhead whales across the entire distribution range of the species; though BCB whales have also retained high levels of diversity despite heavy historical harvests. Previous studies on mitochondrial genome diversity of contemporary EGSB stock individuals also indicated relatively high levels of genetic diversity (haplotype diversity (HD): 0.858, nucleotide diversity (π): 0.0027)17,19. Yet, continued very small population size, or even further decline in population size would mean an increasingly rapid accumulation of inbreeding and inbreeding depression.

When attempting to interpret the genetic diversity data in light of census and/or stock size of the EGSB stock, it has to be noted that census size estimates to date do not cover the entire range of this stock (see above). It is very unlikely that the stock consists of just one aggregation of animals moving as a group across the range. Rather, the stock probably segregates seasonally, and observers in different parts of the range see different feeding aggregations over the Arctic summer. Particularly little is known about bowhead whales in the western Russian Arctic, although bowheads are thought to reside in the Franz Josef Land region year-round50,51. Heide-Jørgensen et al.52 conclude that the waters around Franz Josef Land may be occupied by at least a hundred EGSB bowhead whales. Further east, bowhead whales have also been observed in the western Laptev Sea53, and the north of the Novosibirskie Islands Archipelago, which separates the Laptev and East Siberian seas54. Bowhead whales from the Bering-Chukchi-Beaufort (BCB) population are also believed to occasionally penetrate the East Siberian Sea, at least to 170° E longitude55. It is thus possible that individuals from the EGSB and BCB populations may meet in the Russian Arctic seas52. Briefly summarized, the knowledge on the current population size of the EGSB stock remains limited. The survey-based estimates of several hundred individuals are preliminary and further information is needed. The estimates of genetic diversity and an effective population size Ne in the low ten-thousands in the Pleistocene provide important information regarding EGSB stock size.

The demographic analyses of the genome data shed some light on the demographic trajectories through time of effective population size (Ne) of the EGSB stock. They revealed a long-term (past ca. 1.5 M years), gradual decline in Ne, a pattern similar to what has been reported for other baleen whale species56. Several fluctuations were also uncovered in the past 0.5 M years. In contrast, demographic analyses based on mitochondrial genomes from the same bowhead whale individuals sampled in 2017 and 2018 revealed a higher female effective population size (Ne(female)) during the Pleistocene, ranging from ~ 70,000 to ~ 110,000 individuals17. However, as with genetic diversity estimates, estimates of Ne must be interpreted with caution; importantly, numerical estimates from different studies that are based on different genetic markers (nuclear vs mitochondrial genome) cannot be directly compared. Earlier studies have reported that the relative nuclear diversity of cetacean species is correlated with population size, while mitochondrial diversity is not5759. For mitochondrial diversity, social structure and matrilineal social systems were emphasized as important drivers of range-wide and regional mitochondrial genetic diversity. Despite all these uncertainties, the mitochondrial genome analyses also supported an increase in Ne for EGBS bowhead whales during the late Pleistocene17, in agreement with the demographic pattern obtained in this study with nuclear genome data.

Estimates of Ne depend heavily on key parameters such as generation time and mutation rate as well as assumptions on gene flow. Accordingly, they cannot be considered exact values and must be interpreted with caution. The mutation rate of 7.69 × 10–10 substitutions per site per year for nuclear genomic DNA, which was used in the present study, and which was estimated specifically for bowheads36, may be considered low. It is substantially lower than the average mammalian genome mutation rate of 2.23 × 10–9 per site per year suggested by Kumar & Subramanian60. However, this discrepancy is not surprising, as the latter estimate was not based on genome-wide data, but rather on the analyses of some protein coding genes. The mutation rate used in the present study is in line with reduced substitution rates observed within Cetacea61, and is similar to the average nuclear mutation rate of 4.5 × 10–10 per site per year calculated by Jackson et al.62 for baleen whales. Although using another mutation rate would have resulted in different numerical estimates of Ne, the order of magnitude of change in effective population size for the late Pleistocene EGSB stock estimated in the current study lies within the range of census size estimates based on information from other sources, for example, 25,000–100,000 individuals as suggested by Allen & Keay3. The extent of gene flow between stocks is a further uncertainty for estimates of Ne. Given the long period of several millions of years encompassed by the demographic analysis, it is possible that free migration between stocks might have been possible in periods with minimal sea ice, as well as significant changes in ranges. Fossils of bowhead whales have, for example, been discovered along the Norwegian, Swedish and Danish coasts63,64, well south of the species' current distribution, indicating range shifts over time in response to climate change.

Conclusion

Genetic differentiation and population sizes of historic and contemporary bowhead whale populations have been the subject of considerable research effort. Most molecular studies on these topics have used relatively short mitochondrial D-loop sequences to address levels of differentiation. Despite high rates of gene-flow between the EGSB, ECWG and BCB populations in the recent past58, small, but significant genetic differentiation between the four recognized stocks have been observed14,16,17,65. This study provides the first nuclear genomic baseline data set for the EGSB stock and reveals high genetic diversity in bowhead whales in this region. This information calls for more comprehensive analyses of genetic differentiation and for the reconstruction of the demographic history of bowhead whale stocks in the future.

Supplementary Information

Supplementary Figure S1. (97.7KB, pdf)
Supplementary Figure S2. (61.9KB, pdf)
Supplementary Table S1. (27.3KB, pdf)
Supplementary Table S2. (60.3KB, pdf)
Supplementary Table S3. (55.2KB, pdf)
Supplementary Table S4. (70.4KB, pdf)

Acknowledgements

This work was funded by The Norwegian Research Council (ICE-whales Project No. 244488/E10), the Russian-Norwegian Environment Commission, the Greenland Institute of Natural Resources, the Natural History Museum, University of Oslo, the Norwegian Polar Institute and the Independent Research Fund Denmark, Natural Sciences (Forskningsprojekt 1, Grant No. 8021-00218B, to EDL). Access to the Abel and SAGA high computing facilities was granted through the Norwegian Metacenter for Computational Science (NOTUR; project k9201nn). We thank K. Andaur, N. Cobbing, H. Edorsen, J. Engelstad, E. Grønningsæter, H. Hansen, M. Haupt, J. Jones, S. Scotter and the crews on RV Lance and RV Kronprins Haakon for help with the fieldwork. We also thank Anders Skoglund for creation of the map and Mark Ravinet for suggestions for the variant calling and analyses.

Author contributions

J.C., M.V.W. and L.B. designed the study. K.M.K., C.L., O.V.S. and Ø.W. did the fieldwork. J.C. and M.V.W. carried out the sequence analysis. J.C., M.V.W. and L.B. did the downstream data analyses. And drafted the manuscript. All authors interpreted the results and contributed to and reviewed the final manuscript.

Data availability

Data are available as electronic supplementary material. The raw 150 bp paired-end Illumina NextSeq 500 reads are deposited in NCBI's Sequence Read Archives (Bioproject: PRJNA643010 Sand Bioproject: PRJNA798027).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: José Cerca and Michael V. Westbury.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-022-09868-5.

References

  • 1.Kovacs KM, Lydersen C, Vacquiè-Garcia J, Shpak O, Glazov D, Heide-Jørgensen MP. The endangered Spitsbergen bowhead whales’ secrets revealed after hundreds of years in hiding. Biol. Lett. 2020 doi: 10.1098/rsbl.2020.0148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cooke, J. & Reeves, R. Balaena mysticetus (East Greenland-Svalbard-Barents Sea subpopulation). The IUCN Red List of Threatened Species 2018, e.T2472A50348144 (2018). 10.2305/IUCN.UK.2018-1.RLTS.T2472A50348144.en
  • 3.Allen RC, Keay I. Bowhead whales in the eastern Arctic, 1611–1911: Population reconstruction with historical whaling records. Environ. Hist. 2006;12:89–113. doi: 10.3197/096734006776026791. [DOI] [Google Scholar]
  • 4.Reeves RR. Spitsbergen bowhead stock: A short review. Mar. Fish. Rev. 1980;42:65–69. [Google Scholar]
  • 5.Shelden KEW, Rugh DJ. The Bowhead Whale, Balaena mysticetus: Its Historic and Current Status. Mar. Fish. Rev. 1995;57:1–20. [Google Scholar]
  • 6.Gilg O, Born EW. Recent sightings of the bowhead whale (Balaena mysticetus) in Northeast Greenland and the Greenland Sea. Polar Biol. 2005;28:796–801. doi: 10.1007/s00300-005-0001-9. [DOI] [Google Scholar]
  • 7.Boertmann D, Kyhn LA, Witting L, Heide-Jørgensen MP. A hidden getaway for bowhead whales in the Greenland Sea. Polar Biol. 2015;38:1315–1319. doi: 10.1007/s00300-015-1695-y. [DOI] [Google Scholar]
  • 8.Wiig Ø, Bachmann L, Janik V, Kovac K, Lydersen C. Spitsbergen bowhead whales revisited. Mar. Mamm. Sci. 2007;23:688–693. doi: 10.1111/j.1748-7692.2007.02373.x. [DOI] [Google Scholar]
  • 9.Wiig Ø, Bachmann L, Øien N, Kovacs K, Lydersen C. Observations of bowhead whales (Balaena mysticetus) in the Svalbard area 1940–2009. Polar Biol. 2010;33:979–984. doi: 10.1007/s00300-010-0776-1. [DOI] [Google Scholar]
  • 10.Lydersen C, et al. Lost highway not forgotten: Satellite tracking of a bowhead whale (Balaena mysticetus) from the critically endangered Spitsbergen stock. Arctic. 2012;65:76–86. doi: 10.14430/arctic4167. [DOI] [Google Scholar]
  • 11.Vacquié-Garcia J, et al. Late summer distribution and abundance of ice-associated whales in the Norwegian High Arctic. Endang. Spec. Res. 2017;32:59–70. doi: 10.3354/esr00791. [DOI] [Google Scholar]
  • 12.Givens GH, Heide-Jørgensen MP. Abundance. In: George JC, Thewissen JGM, editors. The Bowhead Whale: Balaena Mysticetus: Biology and Human Interactions. Academic Press; 2020. pp. 77–86. [Google Scholar]
  • 13.Rooney AP, Honeycutt RL, Derr JN. Historical population size change of bowhead whales inferred from DNA sequence polymorphism data. Evolution. 2001;55:1678–1685. doi: 10.1111/j.0014-3820.2001.tb00687.x. [DOI] [PubMed] [Google Scholar]
  • 14.Borge T, Bachmann L, Bjørnstad G, Wiig Ø. Genetic variation in Holocene bowhead whales from Svalbard. Mol. Ecol. 2007;16:2223–2235. doi: 10.1111/j.1365-294X.2007.03287.x. [DOI] [PubMed] [Google Scholar]
  • 15.LeDuc RG, et al. Genetic analyses (mtDNA and microsatellites) of Okhotsk and Bering/Chukchi/Beaufort Seas populations of bowhead whales. J. Cetacean Res. Manag. 2005;7:107–111. [Google Scholar]
  • 16.Meschersky IG, Chichkina AN, Shpak OV, Rozhnov VV. Molecular genetic analysis of the Shantar Summer Group of bowhead whales (Balaenamysticetus L.) in the Okhotsk Sea. Russ. J. Genet. 2014;50:395–405. doi: 10.1134/S1022795414040097. [DOI] [PubMed] [Google Scholar]
  • 17.Bachmann L, et al. Mitogenomics and the genetic differentiation of contemporary Balaena mysticetus (Cetacea) from Svalbard. Zool. J. Linn. Soc. 2021;191:1192–1203. doi: 10.1093/zoolinnean/zlaa082. [DOI] [Google Scholar]
  • 18.Grond J, Płecha M, Hahn C, Wiig Ø, Bachmann L. Mitochondrial genomes of ancient bowhead whales (Balaena mysticetus) from Svalbard. Mitochondrial DNA Part B. 2019;4:4152–4154. doi: 10.1080/23802359.2019.1693284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Nyhus ES, et al. Mitogenomes of contemporary Spitsbergen stock bowhead whales (Balaena mysticetus) Mitochondrial DNA Part B. 2016;1:898–900. doi: 10.1080/23802359.2016.1258345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (2010).
  • 21.Bolger AM, Lohse M, Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Keane M, et al. Insights into the evolution of longevity from the bowhead whale genome. Cell Rep. 2015;10:112–122. doi: 10.1016/j.celrep.2014.12.008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27:2987–2993. doi: 10.1093/bioinformatics/btr509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Danecek P, et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–2158. doi: 10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ortiz, E. M. vcf2phylip v2.0: Convert a VCF matrix into several matrix formats for phylogenetic analysis. zenodo.org, https://zenodo.org/record/2540861#.YDUOKy1Q0f0 (2019).
  • 26.Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 2006;23:254–267. doi: 10.1093/molbev/msj030. [DOI] [PubMed] [Google Scholar]
  • 27.Purcell S, et al. PLINK: A tool set for whole-genome and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–576. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/ (2020).
  • 29.Knaus, B. J. & Grunwald, N. J. VcfR: An R package to manipulate and visualize VCF format data. bioRxiv, 041277 (2016). 10.1101/041277 [DOI] [PubMed]
  • 30.Jombart T, Ahmed I. adegenet 1.3–1: New tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27:3070–3071. doi: 10.1093/bioinformatics/btr521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Hanghøj K, Moltke I, Alstrup Andersen P, Manica A, Korneliussen TS. Fast and accurate relatedness estimation from high-throughput sequencing data in the presence of inbreeding. GigaScience. 2019;8:giz034. doi: 10.1093/gigascience/giz034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of next generation sequencing data. BMC Bioinform. 2014;15:356. doi: 10.1186/s12859-014-0356-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Renaud G, Hanghøj K, Korneliussen TS, Willerslev E, Orlando L. Joint estimates of heterozygosity and runs of homozygosity for modern and ancient samples. Genetics. 2019;212:587–614. doi: 10.1534/genetics.119.302057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Grabherr MG, et al. Genome-wide synteny through highly sensitive sequence alignment: Satsuma. Bioinformatics. 2010;26:1145–1151. doi: 10.1093/bioinformatics/btq102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Li H, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Westbury MV, et al. Extended and continuous decline in effective population size results in low genomic diversity in the world’s rarest hyena species, the brown hyena. Mol. Biol. Evol. 2018;35:1225–1237. doi: 10.1093/molbev/msy037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Li H, Durbin R. Inference of human population history from whole genome sequence of a single individual. Nature. 2011;475:493–496. doi: 10.1038/nature10231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Westbury MV, Petersen B, Garde E, Heide-Jørgensen MP, Lorenzen ED. Narwhal genome reveals long-term low genetic diversity despite current large abundance size. iScience. 2019;15:592–599. doi: 10.1016/j.isci.2019.03.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Taylor, B. et al. Synthesis of lines of evidence for population structure for bowhead whales in the Bering-Chukchi-Beaufort region. Paper SC/59/BRG35 presented to the IWC Scientific Committee, Anchorage, Alaska (2007).
  • 40.Phillips CD, et al. Molecular insights into the historic demography of bowhead whales: Understanding the evolutionary basis of contemporary management practices. Ecol. Evol. 2012;3:18–37. doi: 10.1002/ece3.374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Liu X, Fu YX. Stairway Plot 2: Demographic history inference with folded SNP frequency spectra. Genome Biol. 2020;21:280. doi: 10.1186/s13059-020-02196-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Westbury, M. V. et al. Speciation in the face of gene flow within the toothed whale superfamily Delphinoidea. bioRxiv, 10.1101/2020.10.23.352286 (2020).
  • 43.Westbury MV, et al. Ecological specialisation and evolutionary reticulation in extant Hyaenidae. Mol. Biol. Evol. 2021 doi: 10.1093/molbev/msab055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.IWC. Report of the Scientific Committee Virtual Meeting, 11–26 May 2020. J. Cetacean Res. Manag. (Supplement) 22, 1–122 (2021).
  • 45.Jonsgård Å. A right whale (Balaena sp.), in all probability a Greenland right whale (Balaena mysticetus) observed in the Barents Sea. Norsk Hvalfangst-Tidende. 1964;53:311–313. [Google Scholar]
  • 46.De Jong C. The hunt of the Greenland whale: A short history and statistical sources. Rep. Int. Whaling Comm. Spec. Issue. 1983;5:83–106. [Google Scholar]
  • 47.Weslawski JM, Hacquebord L, Stempniewicz L, Malinga M. Greenland whales and walruses in the Svalbard food web before and after exploitation. Oceanologia. 2000;2:37–56. [Google Scholar]
  • 48.George JC, et al. Age and growth estimates of bowhead whales (Balaena mysticetus) via aspartic acid racemization. Can. J. Zool. 1999;77:571–580. doi: 10.1139/z99-015. [DOI] [Google Scholar]
  • 49.de Jager D, et al. High diversity, inbreeding and a dynamic Pleistocene demographic history revealed by African buffalo genomes. Sci. Rep. 2021;11:4540. doi: 10.1038/s41598-021-83823-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Belikov SE, Gorbunov YA, Shil'nikov VI. Distribution of pinnipedia and cetacea in Soviet arctic seas and the Bering Sea in winter. Sov. J. Marine Biology. 1989;15:251–257. [Google Scholar]
  • 51.Gavrilo, M. V. Status of the bowhead whale Balaena mysticetus in the waters of Franz Josef Land Archipelago. Paper SC/66a/BRG20 Presented to the IWC Scientific Committee, May 2015, San Diego, USA (2015).
  • 52.Heide-Jorgensen MP, Hansen RG, Shpak OV. Distribution, migrations, and ecology of the Atlantic and the Okhotsk Sea Populations. In: George JC, Thewissen JGM, editors. The Bowhead Whale: Balaena Mysticetus: Biology and Human Interactions. Academic Press; 2020. pp. 57–75. [Google Scholar]
  • 53.Petrov, S. A. et al. The results of marine mammal countins during the four expeditions in the Arctic in 2014 and 2015. Collection of scientific papers 9th International Conference 'Marine mammals of the Holarctic', Astrakhan, Russia, 2016. 91–102 (2018).
  • 54.Gavrilo, M. V. & Tretiakov V. Y. Observation of bowhead whales (Balaena mysticetus) in the East-Siberian Sea during 2007 season with record-low ice cover - Marine mammals of the Holarctic. In: Collection of Scientific Papers. Odessa, 191–194 (2008).
  • 55.Citta JJ, Quakenbush L, George JC. Distribution and behavior of Bering-Chukchi-Beaufort bowhead whales as inferred by telemetry. In: George JC, Thewissen JGM, editors. The Bowhead Whale: Balaena Mysticetus: Biology and Human Interactions. Academic Press; 2021. pp. 31–56. [Google Scholar]
  • 56.Arnason Ú, Lammers F, Kumar V, Nilsson MA, Janke A. Whole-genome sequencing of the blue whale and other rorquals finds signatures for introgressive gene flow. Sci. Adv. 2018;4:eaap9873. doi: 10.1126/sciadv.aap9873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Bazin E, Glémin S, Galtier N. Population size does not influence mitochondrial genetic diversity in animals. Science. 2006;312:570–572. doi: 10.1126/science.1122033. [DOI] [PubMed] [Google Scholar]
  • 58.Corbett-Detig R, Hartl DL, Sackton TB. Natural selection constrains neutral diversity across a wide range of species. PLoS Biol. 2015;13:e1002112. doi: 10.1371/journal.pbio.1002112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Vachon F, Whitehead H, Frasier TR. What factors shape genetic diversity in cetaceans? Ecol. Evol. 2018;8:1554–1572. doi: 10.1002/ece3.3727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Kumar S, Subramanian S. Mutation rates in mammalian genomes. Proc. Natl. Acad. Sci. U.S.A. 2002;99:803–808. doi: 10.1073/pnas.022629899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Bininda-Emonds ORP. Fast genes and slow clades: Comparative rates of molecular evolution in mammals. Evol. Bioinf. 2007;3:59–85. doi: 10.1177/117693430700300008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Jackson JA, et al. Big and slow: Phylogenetic estimates of molecular evolution in baleen whales (Suborder Mysticeti) Mol. Biol. Evol. 2009;26:2427–2440. doi: 10.1093/molbev/msp169. [DOI] [PubMed] [Google Scholar]
  • 63.Foote AD, et al. Ancient DNA reveals that bowhead whale lineages survived Late Pleistocene climate change and habitat shifts. Nat. Commun. 2013;4:1667. doi: 10.1038/ncomms2714. [DOI] [PubMed] [Google Scholar]
  • 64.Wiig Ø, Bachmann L, Hufthammer AK. Late Pleistocene and Holocene occurrence of bowhead whales (Balaena mysticetus) along the coasts of Norway. Polar Biol. 2018;42:645–656. doi: 10.1007/s00300-019-02460-0. [DOI] [Google Scholar]
  • 65.Alter SE, et al. Gene flow on ice: The role of sea ice and whaling in shaping Holarctic genetic diversity and population differentiation in bowhead whales (Balaena mysticetus) Ecol. Evol. 2012;2:2895–2911. doi: 10.1093/zoolinnean/zlaa082. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figure S1. (97.7KB, pdf)
Supplementary Figure S2. (61.9KB, pdf)
Supplementary Table S1. (27.3KB, pdf)
Supplementary Table S2. (60.3KB, pdf)
Supplementary Table S3. (55.2KB, pdf)
Supplementary Table S4. (70.4KB, pdf)

Data Availability Statement

Data are available as electronic supplementary material. The raw 150 bp paired-end Illumina NextSeq 500 reads are deposited in NCBI's Sequence Read Archives (Bioproject: PRJNA643010 Sand Bioproject: PRJNA798027).


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES