Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
. 2024 Jul 17;16(8):evae155. doi: 10.1093/gbe/evae155

Chromosome-Level Genome Assembly of the Viviparous Eelpout Zoarces viviparus

Nico Fuhrmann 1,#, Marie V Brasseur 2,✉,#, Christina E Bakowski 3, Lars Podsiadlowski 4, Stefan Prost 5,6,7,#, Henrik Krehenwinkel 8,#, Christoph Mayer 9,#
Editor: Bonnie Fraser
PMCID: PMC11331339  PMID: 39018026

Abstract

The viviparous eelpout Zoarces viviparus is a common fish across the North Atlantic and has successfully colonized habitats across environmental gradients. Due to its wide distribution and predictable phenotypic responses to pollution, Z. viviparus is used as an ideal marine bioindicator organism and has been routinely sampled over decades by several countries to monitor marine environmental health. Additionally, this species is a promising model to study adaptive processes related to environmental change, specifically global warming. Here, we report the chromosome-level genome assembly of Z. viviparus, which has a size of 663 Mb and consists of 607 scaffolds (N50 = 26 Mb). The 24 largest represent the 24 chromosomes of the haploid Z. viviparus genome, which harbors 98% of the complete Benchmarking Universal Single-Copy Orthologues defined for ray-finned fish, indicating that the assembly is highly contiguous and complete. Comparative analyses between the Z. viviparus assembly and the chromosome-level genomes of two other eelpout species revealed a high synteny, but also an accumulation of repetitive elements in the Z. viviparus genome. Our reference genome will be an important resource enabling future in-depth genomic analyses of the effects of environmental change on this important bioindicator species.

Keywords: Zoarcidae, reference genome, bioindicator, marine environmental pollution, viviparous blenny


Significance.

Zoarces viviparus is a bioindicator species of international significance and a highly promising model system to study the mechanisms underlying the differential thermal tolerance of marine ectotherms. In this study, we present a chromosome-level reference genome of Z. viviparus, an essential resource to study the genomic imprints of habitat pollution, ocean warming, and other environmental changes in marine fish.

Introduction

The viviparous eelpout Zoarces viviparus L. is a common fish species in the North Atlantic. Its wide latitudinal distribution area ranges from the English Channel to the northern coast of Norway, including the brackish water habitats of the Baltic Sea (Fig. 1a). As the species is philopatric throughout its life, local populations are exposed to differential habitat conditions related to both natural factors such as temperature and salinity gradients and anthropogenic activities, e.g. specific types of pollution. Given its sensitivity to environmental pollution, which results in distinct physiological responses and phenotypic alterations, its philopatric lifestyle, and its wide distribution, Z. viviparus is used as an ideal marine bioindicator organism (Jacobsson et al. 1986; Hedman et al. 2011). Moreover, its viviparous mode of reproduction allows assessing the effect of contaminants directly on sensitive life stages, i.e. larvae, and quantifying the reproductive success of female individuals (e.g. Bergek et al. 2012; Brande-Lavridsen et al. 2013). For these reasons, Z. viviparus has been extensively used in ecotoxicological studies (Hedman et al. 2011) and is the selected sentinel species in national and international marine monitoring projects (OSPAR, https://www.ospar.org; HELCOM, https://helcom.fi). Environmental specimen banks (ESBs) have sampled and stored tissue from Z. viviparus populations over decades (Klein et al. 2018). These samples are cryo-conserved in liquid nitrogen, ensuring an excellent preservation of DNA and RNA (Krehenwinkel et al. 2022), and provide the invaluable opportunity to study the molecular signatures of habitat degradation in the North Atlantic through time and space. To exploit the full potential of present and future samples, a high-quality reference genome of Z. viviparus, serving as a backbone for genomic and transcriptomic analyses, is essential.

Fig. 1.

Fig. 1.

Zoarces viviparus and its distribution area in the North Atlantic (a), Hi-C contact density map of the Z. viviparus assembly indicating 24 main scaffolds that represent the 24 chromosomes of the haploid Z. viviparus genome (b) and sequencing success and genome assembly statistics (c). The photograph of the Z. viviparus specimens was kindly provided by Joachim Sturve.

Such a reference genome additionally constitutes an important resource for molecular ecologists and evolutionary biologists studying the genomic mechanisms involved in acclimatization and adaptation to changing environments, specifically global warming. Although being considered cold-adapted, Z. viviparus has colonized habitats across a large temperature gradient. Therefore, this species has been used as a model system to show that the thermal limit of marine ectotherms is mainly driven by temperature-limited oxygen supply (Pörtner and Knust 2007). This supports previous studies reporting an elevated hypoxia tolerance of Z. viviparus (Fischer et al. 1992; Zakhartsev et al. 2003), which might have contributed to its ability to cope better with heat stress than, for example, its stenothermic relative Pachycara brachycephalum (Van Dijk et al. 1999). As eelpouts (Zoarcidae) are found in various types of marine habitats and have a cosmopolitan distribution, they are ideally suited for comparative genomic studies focusing on adaptive processes. Yet, only three eelpout genomes are available at the National Center for Biotechnology Information (NCBI) genome database, and only two are chromosome-level genomes (NCBI Genome database, accessed 2024 April 3). In this study, we contribute to closing this resource gap and present the first, high-quality chromosome-level reference genome of Z. viviparus, a bioindicator species of international significance.

Results and Discussion

Genome Sequencing and Assembly Metrics

DNA sequencing produced 114 giga base pairs (gbp) PacBio HiFi data (N50 = 12,396 bp) and 156 gbp Illumina Hi-C data (Fig. 1c). Using the average assembly coverage of 164.04 ± 75.71 (mean ± SD) derived from long-read alignments, we calculated an estimated genome size of approximately 694 mega base pairs (mbp). Our final genome assembly consists of 663 mbp, which is similar to the assembly sizes of the other chromosome-level eelpout genomes from Melanostigma gelatinosum (662 mbp) (Bista et al. 2024) and Lycodes pacificus (646.4 mbp). We assembled 607 scaffolds (N50 = 26 mbp), whereby 94% of all bases are present in the 24 largest scaffolds (Fig. 1b and supplementary figs. S1a and S2, Supplementary Material online). These likely represent the 24 chromosomes of the haploid Z. viviparus genome previously identified in karyograms (Yershov 2005). The GC content among the scaffolds is consistent, and no evidence for contamination was found (supplementary figs. S1b and S2, Supplementary Material online). From the searched Benchmarking Universal Single-Copy Orthologues (BUSCOs) (Manni et al. 2021) defined for ray-finned fish (Actinopterygii_odb10), 97.8% were identified as being complete (Fig. 1c and supplementary fig. S2, Supplementary Material online). Additionally, we assembled the mitochondrial genome with a total size of 16,833 bp and annotated its 13 protein-coding genes, 22 tRNAs, 2 rRNAs, and the D-loop control region (supplementary fig. S4, Supplementary Material online).

Transcriptome Sequencing and Genome Annotation

Illumina RNA sequencing (RNA-seq) produced 583,912,676 reads (115 gbp), from which 574,213,064 reads were retained after quality trimming. Of these, 96.2% ± 0.3 per library aligned to the assembled genome. PacBio Iso-Seq yielded 3,634,738 reads (7 gbp) and 4,334,131 reads (6 gbp) from the muscle (N50 = 1.8 kilo base pairs) and the liver (N50 = 1.7 kilo base pairs) transcriptomes, respectively, of which >99% aligned to the genome. This mapping information was used to predict a set of 30,226 proteins, which includes 98.3% complete BUSCO protein models.

Comparative Analysis

Although the genomic architecture in fish is highly diverse, the synteny in Actinopterygii genomes is well-conserved (Stemshorn et al. 2005; Nolte 2020) and no major structural rearrangements between the Z. viviparus genome and the two other eelpout genomes were observed (Fig. 2a). Interestingly, their genomes show distinct repeat landscapes: the Z. viviparus genome harbors many long tandem repeats (TRs), the longest being a compound repeat with a total length of 1,831,213 bp. The 29 TRs with a length of >100,000 bp are based only on a small number of repeat units (supplementary table S1, Supplementary Material online). While the TR content is comparable between the genomes of Z. viviparus and L. pacificus (∼7%), the genome of M. gelatinosum shows a relatively high TR content of >10% (supplementary figs. S5 and S6, Supplementary Material online). At the same time, we found evidence for a relatively recent expansion of long terminal repeat (LTR) elements in Z. viviparus, accounting for almost 13% of its genome, as compared to <2% in the two other species (Fig. 2b). While transposon bursts may disturb gene regulation by elements being copied into introns or regulatory elements adjacent to the genes, these events can also provide an opportunity for rapid adaptation to environmental change by, for example, enhancing a population's diversity in gene expression (Casacuberta and González 2013).

Fig. 2.

Fig. 2.

A comparison of the Z. viviparus genome with the two other chromosome-level genome assemblies of eelpouts present in the NCBI. While the synteny is highly conserved between the eelpout genomes (a), a recent expansion of LTRs was observed in the repeat landscape of Z. viviparus (b).

Conclusion

We present a high-quality genome assembly for the viviparous eelpout Z. viviparus, an important marine bioindicator organism. This chromosome-level reference genome paves the way for future research focusing on the molecular mechanisms shaping the acclimatization and adaptation of Z. viviparus to environmental change.

Materials and Methods

Specimen Sampling, DNA Extraction and Sequencing

The specimen used for DNA extraction was caught by fisherman in May 2022 in the Meldorf Bay, Germany (location: latitude 54.10, longitude 8.80), transported to the German ESB, and killed according to German animal protection laws (for details, see Klein et al. 2018). Muscle and liver tissues of the specimen were dissected and preserved in liquid nitrogen for long-term storage at the ESB.

For long-read sequencing, DNA was extracted from ∼20 mg of liver tissue using the Monarch HMW DNA extraction kit (New England Biolabs). A 1.5% agarose gel was used to check whether the DNA was successfully extracted. For further library preparation and sequencing, the extracted DNA was sent to the West German Genome Center (WGGC) in Düsseldorf, Germany. Library preparation was performed using the SMRTbell prep kit 3.0 (PacBio), and the sample was sequenced on a PacBio Sequel II device using the HiFi mode.

For chromatin conformation capture, DNA was extracted from ∼100 mg of muscle tissue and a Hi-C proximity ligation library was prepared using the Arima Hi-C+ kit (Arima Genomics). The library was indexed with the Accel-NGS 2S Plus DNA library kit (Swift Biosciences) and sent to the WGGC for paired-end sequencing (150 bp) on an Illumina NextSeq 2000.

Mitochondrial Genome Assembly and Annotation

The Vulcan pipeline v.1.0.3 (Fu et al. 2021) was used to extract mitochondrial reads in the PacBio HiFi data. First, mitochondrial reads were identified by aligning all reads to the mitogenome of Pholis gunnellus (GenBank NC_052755, v1). These reads were fastq-transformed with seqtk v.1.4-r122 (Li 2024, https://github.com/lh3/seqtk) and assembled using Flye v.2.9.2-b1786 (Kolmogorov et al. 2019) with an expected genome size of 17 kilo base pairs and an asm-coverage of 50. The mitogenome annotation was performed with MitoAnnotator v.3.92 (Iwasaki et al. 2013; Sato et al. 2018; Zhu et al. 2023).

Genome Assembly and Repetitive Sequence Content

PacBio HiFi and Illumina Hi-C reads were assembled with Hifiasm v.0.19.4-r575 (Cheng et al. 2021). Assembly scaffolding was performed by using the Arima pipeline (https://github.com/ArimaGenomics/mapping_pipeline), which involves processing and mapping of Hi-C reads with samtools v.1.9 (Danecek et al. 2021) and bwa v.0.7.17-r1188 (Li and Durbin 2009), respectively, and deduplication with Picard v.2.27.5 (http://broadinstitute.github.io/picard/). The final scaffolding step was performed with YaHS v.1.2 (Zhou et al. 2023), followed by a manual processing step with JuicerTools v.2.20.00 (https://github.com/aidenlab/JuicerTools), gap closing with TGS-GapCloser v.1.2.1 (Xu et al. 2020), and a manual curation step with Juicebox desktop v.2.17.00 (https://aidenlab.gitbook.io/juicebox/desktop). The resulting assembly was used for a second round of Hi-C scaffolding and gap closing to obtain the final genome assembly. Mitochondrial contigs were identified by performing BLAST v.2.13.0 (Altschul et al. 1990) searches of the contigs against the Z. viviparus mitochondrial genome (with -max_target_seqs 100 -max_hsps 100 -evalue 1e−25) and removed from the nuclear genome assembly.

Repeat families for the three fish species were independently identified and classified with RepeatModeler2 v2.0.5 (Flynn et al. 2020) and resources from Dfam (Storer et al. 2021). After combining the three repeat libraries, redundancy was reduced and false positives were omitted using MCHelper (Orozco-Arias et al. 2023). Repeats were then masked in all genomes with the combined repeat library using RepeatMasker v4.1.8 (Smit et al. 2013–2015, http://repeatmasker.org) and helper scripts from TETools v1.88 (https://github.com/Dfam-consortium/TETools) to create a soft-masked genome assembly. Tandem repeats were identified with Phobos v3.3.12 (Mayer 2007; Mayer et al. 2010) for a unit size range of 1 to 50 bp, and the telomeric repeat motif (TTAGGG/CCCTAA)n (Moyzis et al. 1988) was searched at the beginning/end of the 24 main scaffolds to ascertain whether the telomeric regions were successfully assembled (supplementary tables S1–S3, Supplementary Material online; for details, see supplementary section S3, Supplementary material online).

RNA Extraction and Sequencing

For transcriptomic long-read sequencing, RNA was extracted from muscle (∼25 mg) and liver (∼20 mg) tissues using the RNeasy Plus Mini kit (Qiagen). A TapeStation 2200 and the RNA ScreenTape Analysis kit (both Agilent) were used for RNA quantification and quality assessment. Further sample processing and sequencing were performed at the WGGC. Full-length cDNA sequences were produced using the TeloPrime Full-Length cDNA Amplification kit V2 (Lexogen). SMRTbell adapters were added using the SMRTbell prep kit 3.0 (PacBio), followed by RNA-seq on PacBio Sequel II systems.

These transcriptomes were complemented with Illumina short-read data generated from 12 Swedish specimens, caught in May 2023 in the Gullmarn Fjord, Lysekil, Sweden (location: latitude 58.23, longitude 11.41). Total RNA was extracted from 5 and 10 mg of liver tissue using the RNeasy Plus Micro and RNeasy Plus Mini kits (Qiagen), respectively. RNA concentrations were quantified with a Quantus fluorometer and the QuantiFluor RNA System kit (both Promega). The sample quality was checked on a Fragment Analyzer with the RNA kit (15 nt) (both Agilent). Library preparation and sequencing were performed at the WGGC. The VAHTS Universal V8 kit (Vazyme) was used for poly-A selection of mRNA from 100 ng of total RNA and cDNA library construction. The 12 libraries were paired-end sequenced (100 bp) on an Illumina NextSeq2000.

Gene Prediction

We employed a customized version of the BRAKER3 v3.0.4 (Hoff et al. 2016; Brůna et al. 2021; Gabriel et al. 2024) protocol to integrate evidence from short- and long-read transcriptomic data as well as from protein data of the vertebrate OrthoDB partition (Kuznetsov et al. 2023).

First, the Illumina RNA-seq libraries were used for gene prediction. Homopolymers at the end of the reads were removed with a custom C++ program, followed by quality trimming using the cutadapt v3.4 (Martin 2011) wrapper script TrimGalore! v0.6.10 (Krueger 2024, https://github.com/FelixKrueger/TrimGalore) in paired-end mode, retaining only reads with a minimum length of 25 bp. HISAT2 v2.1.0 (Kim et al. 2019) was used to align the trimmed reads against the soft-masked genome assembly in strand-specific mode. The resulting .bam files were sorted with samtools and used to run GeneMark-ETP (Bruna et al. 2024).

A second gene set was predicted based on evidence gained from PacBio Iso-Seq reads, which were splice-aware aligned against the soft-masked genome using minimap2 v2.26 (Li 2018), followed by .bam file sorting with samtools. A singularity container specifically set up for long-reads (Hoff, https://hub.docker.com/r/katharinahoff/playground) was used to run GeneMark-ETP.

The final structural genome annotation was created by combining both gene set predictions using TSEBRA v1.1.2 (Gabriel et al. 2021). No functional annotation was performed.

Assembly Evaluation and Comparative Analyses

Assembly statistics were calculated using the bbmap v38.82 (Bushnell 2014) script stats.sh and QUAST v.5.2.0 (Gurevich et al. 2013). K-mer distributions in the Hifi reads as well as in the haplotype resolved assembly before and after scaffolding (supplementary fig. S3, Supplementary Material online) were estimated with Merqury v.1.3 (Rhie et al. 2020). The final assembly was screened for contamination with BlobToolKit v.4.3.3 (Challis et al. 2020).

As no randomly generated Illumina data were available, a genome size estimation based on k-mer frequencies was not possible. Therefore, we estimated the genome size (G) from the PacBio HiFi reads as

G=B/Cov

where B refers to the cumulative length sum from all HiFi reads incorporated in the assembly and Cov to the average genome coverage of HiFi reads. Both parameters were obtained with bash shell commands and samtools.

To evaluate the assembly completeness, the BUSCO program v.5.4.6 and database were used to search for BUSCO genes defined for ray-finned fish (Actinopterygii_odb10) in the genome (-m genome) and in the predicted protein set (-m transcriptome or -m protein).

Genome comparisons were made between Z. viviparus and its closest relatives with chromosome-level genome assemblies i.e. M. gelatinosum (GenBank GCA_949748355.1) and L. pacificus (GenBank GCA_028022725.1). Pairwise genome alignments were conducted with minimap2, and assembly consistency plots were produced with the Circos (Krzywinski et al. 2009) wrapper script Jupiter (Chu 2024, https://github.com/JustinChu/JupiterPlot).

Supplementary Material

evae155_Supplementary_Data

Acknowledgments

The authors thank Diana Teubner from the ESB project group at Trier University for providing the Z. viviparus specimen used for genome sequencing and long-read transcriptome sequencing. The authors also thank Alexander Ben Hamadou for his help with the Hi-C library preparation and Joachim Sturve for his support during the experiment which resulted in the RNA-seq Illumina data as well as provisioning of the Z. viviparus photograph. The project used the high-performance computing resources of the LIB.

Contributor Information

Nico Fuhrmann, Department of Biogeography, Trier University, Trier 54296, Germany.

Marie V Brasseur, Department of Biogeography, Trier University, Trier 54296, Germany.

Christina E Bakowski, Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change (LIB), Bonn 53113, Germany.

Lars Podsiadlowski, Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change (LIB), Bonn 53113, Germany.

Stefan Prost, Ecology and Genetics Research Unit, University of Oulu, Oulu 90014, Finland; South African National Biodiversity Institute, National Zoological Garden, Pretoria 0002, South Africa; Central Research Laboratories, Natural History Museum Vienna, Vienna 1010, Austria.

Henrik Krehenwinkel, Department of Biogeography, Trier University, Trier 54296, Germany.

Christoph Mayer, Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change (LIB), Bonn 53113, Germany.

Supplementary Material

Supplementary material is available at Genome Biology and Evolution online.

Funding

This work was primarily funded by a grant from the Deutsche Forschungsgemeinschaft (project 497783367) to H.K. (KR 4623/5-1) and C.M. (MA 3684/6-1) as part of the DFG Sequencing Cost in Projects initiative. Furthermore, this work was supported by the Research Infrastructure West German Genome Center (WGGC) (project 407493903) funded by the Deutsche Forschungsgemeinschaft as part of the Next Generation Sequencing Competence Network (project 423957469). Next-generation sequencing was carried out at the WGGC, Düsseldorf. N.F. was also funded by the TrendDNA project of the German Federal Environment Agency (Umweltbundesamt) (AZ 93 404/0025). S.P. was funded by the University of Oulu and the Academy of Finland Profi6 336449 program “Biodiverse Anthropocenes.” The publication was supported by the Open Access Fund of Universität Trier and by the Deutsche Forschungsgemeinschaft.

Data Availability

All data are available at the NCBI under Bioproject PRJNA1068064. Raw sequencing data SRA accession numbers: SRR27896878 (PacBio HiFi reads), SRR27896879 (Illumina Hi-C reads), SRR28471744-SRR28471745 (PacBio Iso-Seq reads), and SRR27885821-SRR27885832 (Illumina RNA-seq reads). Genome accession numbers: The annotated genome assembly is deposited under GCA_040110945.1 at GenBank and the Whole Genome Shotgun project is deposited at DDBJ/ENA/GenBank under the master accession JBCEZU000000000. Mitogenome accession number: The mitochondrial genome is deposited at GenBank under PP556339.

Literature Cited

  1. Altschul  SF, Gish  W, Miller  W, Myers  EW, Lipman  DJ. Basic local alignment search tool. J Mol Biol. 1990:215(3):403–410. 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  2. ArimaGenomics/mapping_pipeline . https://github.com/ArimaGenomics/mapping_pipeline [accessed 2024 Apr 3].
  3. Bergek  S, Ma  Q, Vetemaa  M, Franzén  F, Appelberg  M. From individuals to populations: impacts of environmental pollution on natural eelpout populations. Ecotoxicol Environ Saf. 2012:79:1–12. 10.1016/j.ecoenv.2012.01.019. [DOI] [PubMed] [Google Scholar]
  4. Bista  I, Collins  M, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory Team . The genome sequence of the limp eelpout, Melanostigma gelatinosum Günther, 1881. Wellcome Open Res. 2024:9:192. 10.12688/wellcomeopenres.21268.1. [DOI] [Google Scholar]
  5. Brande-Lavridsen  N, Korsgaard  B, Dahllöf  I, Strand  J, Tairova  Z, Bjerregaard  P. Abnormalities in eelpout Zoarces viviparus upon chemical exposure. Mar Environ Res. 2013:92:87–94. 10.1016/j.marenvres.2013.09.004. [DOI] [PubMed] [Google Scholar]
  6. Brůna  T, Hoff  KJ, Lomsadze  A, Stanke  M, Borodovsky  M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genom Bioinform. 2021:3(1):lqaa108. 10.1093/nargab/lqaa108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bruna  T, Lomsadze  A, Borodovsky  M. GeneMark-ETP significantly improves the accuracy of automatic annotation of large eukaryotic genomes. Genome Res. 2024:34(5):757–768. 10.1101/gr.278373.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bushnell  B. BBMap: a fast, accurate, splice-aware aligner. Berkeley (CA): Lawrence Berkeley National Lab. (LBNL); 2014. https://www.osti.gov/biblio/1241166 [accessed 2023 Oct 20].
  9. Casacuberta  E, González  J. The impact of transposable elements in environmental adaptation. Mol Ecol. 2013:22(6):1503–1517. 10.1111/mec.12170. [DOI] [PubMed] [Google Scholar]
  10. Challis  R, Richards  E, Rajan  J, Cochrane  G, Blaxter  M. BlobToolKit—interactive quality assessment of genome assemblies. G3 GenesGenomesGenetics. 2020:10(4):1361–1374. 10.1534/g3.119.400908. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cheng  H, Concepcion  GT, Feng  X, Zhang  H, Li  H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021:18(2):170–175. 10.1038/s41592-020-01056-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chu  J. JustinChu/JupiterPlot. 2024. https://github.com/JustinChu/JupiterPlot [accessed 2024 Apr 3].
  13. Danecek  P, Bonfield  JK, Liddle  J, Marshall  J, Ohan  V, Pollard  MO, Whitwham  A, Keane  T, McCarthy  SA, Davies  RM, et al.  Twelve years of SAMtools and BCFtools. GigaScience. 2021:10(2):giab008. 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dfam-consortium/TETools . 2024. https://github.com/Dfam-consortium/TETools [accessed 2024 Apr 3].
  15. Fischer  P, Rademacher  K, Kils  U. In situ investigations on the respiration and behaviour of the eelpout Zoarces viviparus under short-term hypoxia. Mar Ecol Prog Ser. 1992:88:181–184. 10.3354/meps088181. [DOI] [Google Scholar]
  16. Flynn  JM, Hubley  R, Goubert  C, Rosen  J, Clark  AG, Feschotte  C, Smit  AF. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci U S A. 2020:117(17):9451–9457. 10.1073/pnas.1921046117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fu  Y, Mahmoud  M, Muraliraman  VV, Sedlazeck  FJ, Treangen  TJ. Vulcan: improved long-read mapping and structural variant calling via dual-mode alignment. GigaScience. 2021:10(9):giab063. 10.1093/gigascience/giab063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gabriel  L, Brůna  T, Hoff  KJ, Ebel  M, Lomsadze  A, Borodovsky  M, Stanke  M. BRAKER3: fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS and TSEBRA. Genome Res. 2024:34(5):769–777. 10.1101/gr.278090.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gabriel  L, Hoff  KJ, Brůna  T, Borodovsky  M, Stanke  M. TSEBRA: transcript selector for BRAKER. BMC Bioinformatics. 2021:22(1):566. 10.1186/s12859-021-04482-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gurevich  A, Saveliev  V, Vyahhi  N, Tesler  G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013:29(8):1072–1075. 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hedman  JE, Rüdel  H, Gercken  J, Bergek  S, Strand  J, Quack  M, Appelberg  M, Förlin  L, Tuvikene  A, Bignert  A. Eelpout (Zoarces viviparus) in marine environmental monitoring. Mar Pollut Bull.  2011:62(10):2015–2029. 10.1016/j.marpolbul.2011.06.028. [DOI] [PubMed] [Google Scholar]
  22. HELCOM | Helsinki Commission . Baltic marine environment protection commission. https://helcom.fi/ [accessed 2024 Apr 3].
  23. Hoff  KJ. katharinahoff/playground—Docker image | Docker hub. https://hub.docker.com/r/katharinahoff/playground [Accessed 2024 Apr 3].
  24. Hoff  KJ, Lange  S, Lomsadze  A, Borodovsky  M, Stanke  M. BRAKER1: unsupervised RNA-seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics. 2016:32(5):767–769. 10.1093/bioinformatics/btv661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Iwasaki  W, Fukunaga  T, Isagozawa  R, Yamada  K, Maeda  Y, Satoh  TP, Sado  T, Mabuchi  K, Takeshima  H, Miya  M, et al.  MitoFish and MitoAnnotator: a mitochondrial genome database of fish with an accurate and automatic annotation pipeline. Mol Biol Evol.  2013:30(11):2531–2540. 10.1093/molbev/mst141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Jacobsson  A, Neuman  E, Thoresson  G. The viviparous blenny as an indicator of environmental effects of harmful substances. Ambio. 1986:15:236–238. http://www.jstor.org/stable/4313258. [Google Scholar]
  27. Juicebox (Desktop) | Juicebox . https://aidenlab.gitbook.io/juicebox/desktop [accessed 2024 Apr 3].
  28. JuicerTools . https://github.com/aidenlab/JuicerTools [accessed 2024 Apr 3].
  29. Kim  D, Paggi  JM, Park  C, Bennett  C, Salzberg  SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol.  2019:37(8):907–915. 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Klein  R, Paulus  M, Tarricone  K, Teubner  D. Guideline for sampling and sample processing: Eelpout (Zoarces viviparus). Umweltprobenbank. 2018. https://www.umweltprobenbank.de/upb_static/fck/download/SOP_ESB_Eelpout_V2.0.3_2018_en.pdf.
  31. Kolmogorov  M, Yuan  J, Lin  Y, Pevzner  PA. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol.  2019:37(5):540–546. 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]
  32. Krehenwinkel  H, Weber  S, Broekmann  R, Melcher  A, Hans  J, Wolf  R, Hochkirch  A, Kennedy  SR, Koschorreck  J, Künzel  S, et al.  Environmental DNA from archived leaves reveals widespread temporal turnover and biotic homogenization in forest arthropod communities. eLife. 2022:11:e78521. 10.7554/eLife.78521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Krueger  F. FelixKrueger/TrimGalore. 2024. https://github.com/FelixKrueger/TrimGalore [accessed 2024 Apr 3].
  34. Krzywinski  M, Schein  J, Birol  I, Connors  J, Gascoyne  R, Horsman  D, Jones  SJ, Marra  MA. Circos: an information aesthetic for comparative genomics. Genome Res. 2009:19(9):1639–1645. 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kuznetsov  D, Tegenfeldt  F, Manni  M, Seppey  M, Berkeley  M, Kriventseva  EV, Zdobnov  EM. OrthoDB v11: annotation of orthologs in the widest sampling of organismal diversity. Nucleic Acids Res. 2023:51(D1):D445–D451. 10.1093/nar/gkac998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Li  H. lh3/seqtk. 2024. https://github.com/lh3/seqtk [accessed 2024 Apr 3].
  37. Li  H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018:34(18):3094–3100. 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Li  H, Durbin  R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009:25(14):1754–1760. 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Manni  M, Berkeley  MR, Seppey  M, Simão  FA, Zdobnov  EM. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol. 2021:38(10):4647–4654. 10.1093/molbev/msab199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Martin  M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011:17(1):10–12. 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  41. Mayer  C. Phobos: highly accurate search for perfect and imperfect tandem repeats in complete genomes. 2007. https://www.ruhr-uni-bochum.de/ecoevo/cm/cm_phobos.htm [accessed 2024 Apr 3].
  42. Mayer  C, Leese  F, Tollrian  R. Genome-wide analysis of tandem repeats in Daphnia pulex - a comparative approach. BMC Genomics. 2010:11(1):277. 10.1186/1471-2164-11-277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Moyzis  RK, Buckingham  JM, Cram  LS, Dani  M, Deaven  LL, Jones  MD, Meyne  J, Ratliff  RL, Wu  JR. A highly conserved repetitive DNA sequence, (TTAGGG)n, present at the telomeres of human chromosomes. Proc Natl Acad Sci U S A. 1988:85(18):6622–6626. 10.1073/pnas.85.18.6622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. NCBI Genome database . NCBI. https://www.ncbi.nlm.nih.gov/datasets/genome/?taxon=8193 [accessed 2024 Apr 3].
  45. Nolte  AW. Genomic access to the diversity of fishes. Methods Mol Biol. 2020:2090:397–411. 10.1007/978-1-0716-0199-0_16. [DOI] [PubMed] [Google Scholar]
  46. Orozco-Arias  S, Sierra  P, Durbin  R, González  J. MCHelper automatically curates transposable element libraries across species. October 20, 2023. 10.1101/2023.10.17.562682, preprint: not peer reviewed. [DOI]
  47. OSPAR | Oslo-Paris Commission . Protecting and conserving the North-East Atlantic and its resources. OSPAR Comm. https://www.ospar.org/ [accessed 2024 Apr 3].
  48. Picard Tools—by Broad Institute . http://broadinstitute.github.io/picard/ [accessed 2024 Apr 3].
  49. Pörtner  HO, Knust  R. Climate change affects marine fishes through the oxygen limitation of thermal tolerance. Science. 2007:315(5808):95–97. 10.1126/science.1135471. [DOI] [PubMed] [Google Scholar]
  50. Rhie  A, Walenz  BP, Koren  S, Phillippy  AM. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 2020:21(1):245. 10.1186/s13059-020-02134-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Sato  Y, Miya  M, Fukunaga  T, Sado  T, Iwasaki  W. MitoFish and MiFish pipeline: a mitochondrial genome database of fish with an analysis pipeline for environmental DNA metabarcoding. Mol Biol Evol. 2018:35(6):1553–1555. 10.1093/molbev/msy074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Smit  AFA, Hubley  R, Green  P. RepeatMasker Open-4.0. 2013–2015. http://repeatmasker.org.
  53. Stemshorn  KC, Nolte  AW, Tautz  D. A genetic map of Cottus gobio (Pisces, Teleostei) based on microsatellites can be linked to the physical map of Tetraodon nigroviridis. J Evol Biol. 2005:18(6):1619–1624. 10.1111/j.1420-9101.2005.00929.x. [DOI] [PubMed] [Google Scholar]
  54. Storer  J, Hubley  R, Rosen  J, Wheeler  TJ, Smit  AF. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob DNA. 2021:12(1):2. 10.1186/s13100-020-00230-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Van Dijk  PLM, Tesch  C, Hardewig  I, Pörtner  HO. Physiological disturbances at critically high temperatures: a comparison between stenothermal Antarctic and eurythermal temperate eelpouts (Zoarcidae). J Exp Biol. 1999:202(24):3611–3621. 10.1242/jeb.202.24.3611. [DOI] [PubMed] [Google Scholar]
  56. Xu  M, Guo  L, Gu  S, Wang  O, Zhang  R, Peters  BA, Fan  G, Liu  X, Xu  X, Deng  L, et al.  TGS-GapCloser: a fast and accurate gap closer for large genomes with low coverage of error-prone long reads. GigaScience. 2020:9(9):giaa094. 10.1093/gigascience/giaa094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Yershov  PN. Chromosomal studies of Zoarces viviparus L. (Zoarcidae) and Myoxocephalus scorpius L. (Cottidae) from different parts of distribution area. Ecohydrol Hydrobiol. 2005:5(3):237–243. [Google Scholar]
  58. Zakhartsev  MV, De Wachter  B, Sartoris  FJ, Pörtner  HO, Blust  R. Thermal physiology of the common eelpout (Zoarces viviparus). J Comp Physiol B. 2003:173(5):365–378. 10.1007/s00360-003-0342-z. [DOI] [PubMed] [Google Scholar]
  59. Zhou  C, McCarthy  SA, Durbin  R. YaHS: yet another Hi-C scaffolding tool. Bioinformatics. 2023:39(1):btac808. 10.1093/bioinformatics/btac808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Zhu  T, Sato  Y, Sado  T, Miya  M, Iwasaki  W. MitoFish, MitoAnnotator, and MiFish pipeline: updates in 10 years. Mol Biol Evol. 2023:40(3):msad035. 10.1093/molbev/msad035. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

evae155_Supplementary_Data

Data Availability Statement

All data are available at the NCBI under Bioproject PRJNA1068064. Raw sequencing data SRA accession numbers: SRR27896878 (PacBio HiFi reads), SRR27896879 (Illumina Hi-C reads), SRR28471744-SRR28471745 (PacBio Iso-Seq reads), and SRR27885821-SRR27885832 (Illumina RNA-seq reads). Genome accession numbers: The annotated genome assembly is deposited under GCA_040110945.1 at GenBank and the Whole Genome Shotgun project is deposited at DDBJ/ENA/GenBank under the master accession JBCEZU000000000. Mitogenome accession number: The mitochondrial genome is deposited at GenBank under PP556339.


Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES