Skip to main content
DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes logoLink to DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes
. 2022 Jul 21;29(4):dsac028. doi: 10.1093/dnares/dsac028

Insights into chromosomal evolution and sex determination of Pseudobagrus ussuriensis (Bagridae, Siluriformes) based on a chromosome-level genome

Chuankun Zhu 1,, Haiyang Liu 2,, Zhengjun Pan 3, Lei Cheng 4,, Yanhong Sun 5, Hui Wang 6, Guoliang Chang 7, Nan Wu 8, Huaiyu Ding 9, Haitao Zhao 10, Lei Zhang 11,, Xiangsheng Yu 12
PMCID: PMC9358014  PMID: 35861402

Abstract

Pseudobagrus ussuriensis is an aquaculture catfish with significant sexual dimorphism. In this study, a chromosome-level genome with a size of 741.97 Mb was assembled for female P. ussuriensis. A total of 26 chromosome-level contigs covering 97.34% of the whole-genome assembly were obtained with an N50 of 28.53 Mb and an L50 of 11. A total of 24,075 protein-coding genes were identified, with 91.54% (22,039) genes being functionally annotated. Based on the genome assembly, four chromosome evolution clusters of catfishes were identified and the formation process of P. ussuriensis chromosomes was predicted. A total of 55 sex-related quantitative trait loci (QTLs) with a phenotypic variance explained value of 100% were located on chromosome 8 (chr08). The QTLs and other previously identified sex-specific markers were located in a sex-determining region of 16.83 Mb (from 6.90 to 23.73 Mb) on chr08, which was predicted as the X chromosome. The sex-determining region comprised 554 genes, with 135 of which being differently expressed between males and females/pseudofemales, and 16 candidate sex-determining genes were screened out. The results of this study provided a useful chromosome-level genome for genetic, genomic and evolutionary studies of P. ussuriensis, and also be useful for further studies on sex-determination mechanism analysis and sex-control breeding of this fish.

Keywords: Pseudobagrus ussuriensis, whole-genome sequencing, chromosome evolution, sex-determining region, sex chromosome

1. Introduction

Siluriformes contains more than 3,000 known living species,1 which are commonly called catfishes. Karyotypic studies revealed that chromosome numbers of catfishes ranged from 2n = 24 in Liobagrus marginatus to 2n = 100 in Clarias batrachus, with 2n = 52 being the dominant type.2 In some cases, chromosome numbers even differ among different populations of the same species.3,4 As chromosome numbers showed continuous changes in the interval from 2n = 48 to 2n = 60, and Robertsonian translocation, chromosomal fission and fusion, and even the fourth round whole genome duplication were believed to occur during genome evolution of Siluriformes species,5,6 catfishes were considered as potential models for chromosomal and genomic evolutionary studies of teleost.2 Additionally, visible sex chromosomes were identified in a few species of the Clariidae and Amblycipitidae families,2 but not in most of other catfish families.

In bony fishes, the sex determination mechanisms are diverse, with transitions between different genetic and environmental determination systems, and often differ among close-related species and even different populations of the same fish species.7,8 Scholars have strong and persistent interests in studying the sex determination mechanisms and sex chromosomes of fishes, which can provide significant information for breeding through sex control and revealing the molecular and evolutionary mechanisms of vertebrate sex chromosomes.9,10 Previous karyotypic studies indicate that most fishes have morphologically undifferentiated homomorphic sex chromosomes, and about only 10% species contain heteromorphic sex chromosomes.11,12 The male heterogametic XX/XY and female heterogametic ZW/ZZ sex determination systems are the two most common categories in teleosts, although there are other systems, including XX/XO, ZO/ZZ and X1X1X2X2/X1X2Y.12 Compared with birds and mammals, sex chromosomes in fishes studied so far are believed to be much younger, which make them potential models to analyze the evolution and differentiation of sex chromosomes at the molecular level.13,14

For fishes without visible sex chromosomes, sex determination systems can be deduced using sex-specific markers.15–17 In recent years, different types of sex-specific markers including sequence characterized amplified regions (SCARs),18,19 single sequence repeats (SSRs),20,21 single nucleotide polymorphism (SNPs),22,23 restriction site-associated DNA (RAD)/2b-RAD tags,17,24 and quantitative trait loci (QTLs)25,26 have been identified in aquatic animals. Through mapping these sex-specific markers to genetic linkage maps, physical maps and assembled genomes, sex-determining regions have been identified in many fish species.23,27,28

Chromosome-level genomes are essential for better localizations of sex determination regions and identifications of potential sex-determining genes. With the development of sequencing technologies, the long-read third-generation sequencing technologies, including the Oxford Nanopore and the PacBio methods, are widely used and genome assemblies are continuously reported in aquatic animals in recent years. Through combining the third-generation sequencing method (Oxford Nanopore or PacBio) and the high-through chromosome conformation capture (Hi-C) technology, genomes can be assembled into the chromosome level, which have been successfully used in many fish species.29–31 Chromosome-level genomes have also been reported in catfishes, such as channel catfish (Ictalurus punctatus),32 yellow catfish (Pelteobagrus fulvidraco),33 giant devil catfish (Bagarius yarrelli),34 Southern catfish (Silurus meridionalis),35 Chinese longsnout catfish (Leiocassis longirostris),36 striped catfish (Pangasianodon hypophthalmus),37 and redtail catfish (Hemibagrus wyckioides).38 These chromosome-level assemblies have been widely used as reliable reference genomes for studies of many fields, including ecological adaptation,35,39,40 genome evolution,41–43 gene evolution,44,45 immunology,46–48 and identification of target-trait-related genes,49,50 in aquatic animals. Moreover, chromosome-level genomes have also been used for studies on sex determination mechanisms, including identification of sex determination regions and genes,51–53 sex reversal,54,55 and sex chromosome evolutions.10,14,56

The Ussuri catfish Pseudobagrus ussuriensis belongs to the Bagridae family of Siluriformes. This catfish naturally distributes from the northern Heilongjiang River basin to the southern Pearl River basin of China.57 In recent decades, due to overfishing, habitat destruction and water pollution, natural resources of P. ussuriensis have sharply declined, making it a rare species in its natural distribution areas.58 In order to protect and sustainably use natural resources, artificial reproduction, culturing and breeding have been performed in this fish, and it is now cultured in more than one third provinces of China. Additionally, the sexual dimorphism in P. ussuriensis is extremely obvious,19 with males growing approximately three times faster and larger than females, according to our measurement data of 2-year-old individuals. Given the growth advantages of males, all-male breeding through artificial sex control is a promising approach to increase the aquaculture yields of P. ussuriensis.

Previous studies of P. ussuriensis mainly focused on artificial reproduction and culture,59,60 interspecific hybridization breeding,61,62 and nutrition.63 Although studies are increasing in recent years, available genetic and genomic information of this fish is still limited, and studies on its sex-determination mechanism are seldom reported. According to our previous studies, the sex determination system of P. ussuriensis was preliminarily deduced as the XX/XY type based on two validated male-specific markers,19,21 and more male-specific tags were also identified.64 Nevertheless, our understandings on the sex determination mechanism of P. ussuriensis are still at the preliminary level. In the present study, a chromosome-level genome of P. ussuriensis was intended to be assembled using a combined method of Oxford Nanopore, Illumina, and Hi-C technologies, and the chromosomal evolution of P. ussuriensis was planned to analyze based on the genome assembly. The sex determination region was intended to be identified through mapping newly detected sex-related QTLs and previously identified sex-specific markers to the assembled genome. The results obtained in this study will be useful for genetic, genomic, and evolutionary studies, as well as sex-determination mechanism analysis and sex-control breeding of P. ussuriensis.

2. Materials and methods

2.1. Sample preparation

All applicable international, national, and/or institutional guidelines for the care and use of animals were strictly followed. All animal sample collection protocols complied with the current laws of China. Fish used in this study were provided by Huai’an Fisheries Technical Guidance Station and were reared in tanks of the Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huai’an, China. An adult female, which was identified through gametes, was randomly selected in May 2020 for genome sequencing. The muscle tissue of this fish was collected and stored in liquid nitrogen for following experiments. Total genomic DNA was extracted using a QIAGEN® Genomic DNA extraction kit (Qiagen, Germany), and the DNA quality was measured using a NanoDrop™ One UV-Vis spectrophotometer (Thermo, USA), and was quantified using a Qubit® 3.0 fluorometer (Invitrogen, USA).

2.2. Library construction and sequencing

The nanopore library was constructed using an Oxford Nanopore Ligation Sequencing kit SQKLSK109 (Oxford Nanopore Technologies, UK) following the manufacturer’s instructions. Briefly, size-selection (10–50 kb) was initially performed using ∼10 μg of gDNA by a BluePippin pulsed-field gel apparatus (Sage Science, USA). After damage and end repairs, adenines were added at the ends of DNA segments and then adapters were ligated to finish the construction of the nanopore library. After testing using a Qubit® 3.0 fluorometer (Invitrogen), the library was sequenced on R9.4 Flowcell using a PromethION sequencer (Oxford Nanopore Technologies, UK) at NextOmics Bioscience Co., Ltd (Wuhan, China). Then the ONT software Albacore v1.2.665 was used for base calling on fast5 files, and reads passed the filtration were considered as high-quality data, which were used for further analyses.

The Illumina library, which was used for polishing of the assembled nanopore genome, was constructed using a Truseq Nano DNA HT Sample Preparation Kit (Illumina, USA) according to the manufacturer’s recommendations. In brief, the genomic DNA was fragmented by sonication into short segments with a size of 350 bp. Then DNA segments were end polished, A-tailed, ligated with Illumina sequencing adapters and amplified through PCR. After purification (using the AMPure XP system) of PCR amplicons, the library was analyzed for size distribution using an Agilent2100 Bioanalyzer and quantified through real-time PCR. Finally, the library was sequenced by an Illumina NovaSeq platform at NextOmics Bioscience Co., Ltd (Wuhan) though 150 bp paired-end sequencing.

The Hi-C library was prepared according to a previously reported procedure66 with slight modifications. Firstly, ∼100 mg of muscle sample was shattered and treated with nuclei isolation buffer supplemented with 2% formaldehyde to induce crosslinking. Secondly, 2.5 M glycine was added to stop the crosslinking reaction. After purification, the nuclei were digested using 100 units of DpnII restriction enzyme and marked with biotin-14-dCTP, and then ligated with T4 DNA Ligase. The ligated DNA was sheared into fragments with a size range of 300–600 bp, and then these fragments were blunt-end repaired and A-tailed, followed by purification through biotin–streptavidin-mediated pull down. Finally, after quantification, the Hi-C library was sequenced using an Illumina Hiseq X Ten platform (Illumina) at NextOmics Bioscience Co., Ltd (Wuhan) though 150 bp paired-end sequencing.

A transcriptome library used for gene annotation of the P. ussuriensis genome was constructed using 11 tissues, including liver, spleen, heart, kidney, gills, brain, pituitary, muscle, skin, gonads, and intestine, from the fish used for genome sequencing. Total RNA of each tissue was extracted using TRIzol Reagent Kit (Invitrogen), according to the manufacturer’s protocol. Then 100 ng RNA from each tissue was pooled together for the cDNA library construction. The NEBNext® Ultra™ RNA Library Prep Kit for Illumina® (NEB, USA) was used to construct the library following the manufacturer’s protocol. Finally, the 150 bp paired-end sequencing was performed for the library through a high-throughput Illumina HiSeq 4000 platform at Novogene Co., Ltd., Beijing, China, following the manufacturer’s instructions (Illumina).

2.3. Genome assembly

Before assembly, the filtered Oxford Nanopore sequencing data were corrected by Nextdenovo (https://github.com/Nextomics/NextDenovo, 10 May 2021, date last accessed), using the following parameters: read_cuoff = 1 k, seed_cutoff = 10 k, blocksize = 2 g. Then, the WTDBG v1.2.8 (WTDBG, RRID: SCR_017225) software was used for genome assembly (parameters settings: wtdbg-1.2.8 -k 0 -p 17 -S 2; kbm-1.2.8 -k 0 -p 15 -S 2 -O 0; wtdbg-cns -k 13 -c 3). The assembled contigs were polished through three rounds of consensus correction by BWA v0.7.1267 (BWA, RRID: SCR 010910) and Pilon68 (Pilon, RRID: SCR 014731) using Illumina reads.

Raw Hi-C data were trimmed using HiC-Pro (v2.8.0) according to the reported method.69 Then Bowtie2 (v2.3.2)70 was used to map the clean paired-end reads to the draft assembled contigs. Both reads of the paired-end data were mapped independently, and only read pairs with both ends being uniquely aligned to the assembled contigs were used for further analysis. The valid Hi-C reads were used to assemble the chromosomal-level genome using LACHESIS69 (parameters: CLUSTER MIN RE SITES = 100, CLUSTER MAX LINK DENSITY = 2.5, LUSTER NONINFORMATIVE RATIO = 1.4, ORDER MIN N RES IN TRUNK = 60, and ORDER MIN N RES IN SHREDS = 60) through clustering, ordering and orienting of contigs. Finally, a genome-wide Hi-C heatmap was constructed through ggplot2 in the R package to evaluate the quality of the assembled chromosomal-level genome, and genome characters were plotted using Circos v0.69-6.71

2.4. Annotations of repeat sequences and non-coding RNAs

Tandem repeat elements were searched using tandem repeats finder v4.07b72 with default parameters throughout the genome, and SSRs were identified using the software GMATA v2.2.73 All tandem repeats were softmasked before the identification of transposable elements (TE), which were performed through combining the de novo and homology-based methods. In brief, a de novo repeat library was initially constructed using MITE-hunter74 and RepeatModeler v1.0.1175 with default parameter settings. Then the library was aligned against the TEclass Repbase76 to classify TEs into different repeat families. Finally, the software RepeatMasker v1.33175 was used to search both known and novel TEs by mapping sequences against the de novo repeat library and Repbase TE library, and overlapped TEs belonging to the same repeat class were combined.

Two strategies, which were searching against database and prediction with model, were used to identify non-coding RNAs (ncRNAs) in the assembled genome. Transfer RNAs (tRNAs) were predicted using the software tRNAscan-SE v2.077 with eukaryote parameters. The Rfam database78 was applied to detect MicroRNAs (miRNAs), ribosomal RNAs (rRNAs), small nuclear RNAs (snRNAs), and small nucleolar RNAs (snoRNAs) using cmscan of the software Infernal v1.1.2.79 Additionally, the software RNAmmer v1.280 was used to predict rRNAs and their subunits.

2.5. Gene predictions and functional annotations

All repeat sequences of the assembled genome were masked before gene prediction. The homology search, reference guided transcriptome assembly and de novo prediction approaches were independently used for gene predictions. For the homolog prediction method, protein sequences of five fishes, including yellow catfish (P. fulvidraco) (http://gigadb.org/dataset/100506, 15 June 2021, date last accessed), channel catfish (I. punctatus) (https://www.ncbi.nlm.nih.gov/genome/198, 15 June 2021, date last accessed), grass carp (Ctenopharyngodon idellus) (http://www.ncgr.ac.cn/grasscarp/, 15 June 2021, date last accessed), zebrafish (Danio rerio) (https://www.ncbi.nlm.nih.gov/genome/?term=zebrafish, 15 June 2021, date last accessed), and medaka (Oryzias latipes) (https://www.ncbi.nlm.nih.gov/genome/?term=medaka, 15 June 2021, date last accessed), were downloaded from public databases. Then these sequences were aligned to the genome assembly of P. ussuriensis using GeMoMa v1.6.181 to detect homologous peptides. For the RNA-seq-based prediction approach, raw reads were filtered using fastp v0.12.682 and clean reads were aligned to the genome assembly using STAR v2.7.3a.83 The transcripts were then assembled using Stringtie v1.3.4d84 and open reading frames were predicted using PASA v2.3.3.85 Using predicted genes of the RNA-seq-based method as the training set, Augustus v3.3.186 was applied for de novo gene prediction with default parameters. Finally, an integrated gene set was produced by EVidenceModeler v1.1.1,85 and genes being homologous to transposons were filtered using TransposonPSI v08222010 (http://transposonpsi. sourceforge.net/, 10 May 2021, date last accessed) and the miscoded genes were also removed. Furthermore, based on RNA-seq assemblies, untranslated regions and alternative splicing regions were determined using PASA. The accuracy of gene prediction and completeness of the genome assembly were assessed using Benchmarking Universal Single-Copy Orthologs v3.0.1 (BUSCO)87 by searching the predicted genes in the assembly for the conserved single-copy genes in Actinopterygii.

Five public databases, including NCBI non-redundant protein sequence database (NR), SwissProt, Clusters of orthologous groups for eukaryotic complete genomes (KOG), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO), were used for functional annotation of predicted protein-coding genes. Putative domains and GO terms of these genes were identified using InterProScan v5.32-71.0,88 while BLASTp v2.7.1 was used to compare the protein sequences of the P. ussuriensis genome against the remaining four public databases with an E value criterion of 1e−5. Finally, annotation results from the five databases were integrated to obtain the final gene set.

2.6. Phylogenetic analyses of the P. ussuriensis genome

Genome data of 10 teleosts, including P. fulvidraco, I. punctatus, D. rerio, O. latipes, Astyanax mexicanus (https://www.ncbi.nlm.nih.gov/genome/13073?genome_assembly_id=334665, 15 June 2021, date last accessed), Takifugu rubripes (https://www.ncbi.nlm.nih.gov/genome/63, 15 June 2021, date last accessed), Oreochromis niloticus (https://www.ncbi.nlm.nih.gov/genome/?term=Oreochromis+niloticus, 15 June 2021, date last accessed), Monopterus albus (https://www.ncbi.nlm.nih.gov/genome/24053?genome_assembly_id=302095, 15 June 2021, date last accessed), Glyptosternum maculatum (http://gigadb.org/dataset/100489, 15 June 2021, date last accessed) and S. meridionalis (https://www.ncbi.nlm.nih.gov/genome/9650, 15 June 2021, date last accessed), and an outgroup species Latimeria chalumnae (https://www.ncbi.nlm.nih.gov/genome/?term=coelacanth, 15 June 2021, date last accessed) were downloaded from open-source databases to identify gene families for the P. ussuriensis genome using OrthoMCL v2.0.9.89 Through the all-to-all BLASTP method with an E-value cutoff of 1e−5, protein-sequence similarities of the longest transcript were determined between P. ussuriensis and other species for all genes, which were then classified into three groups: orthologs, paralogs, and single copy orthologs.

The phylogenetic relationships between P. ussuriensis and another 11 fishes were analyzed using the single-copy orthologs shared by them. In brief, multiple alignments were performed through Mafft v7.31390 for each ortholog group, using their coding sequences. After elimination of poorly aligned sequences by Gblocks 0.91b,91 a phylogenetic tree was constructed using the GTRGAMMA model of RAxML v8.2.1092 with a bootstrap replicate of 1,000 times. Finally, the phylogenetic tree was visualized using MEGA v10.1.8.

Based on the phylogenetic tree, the MCMCTree software of the PAML v4.8 package93 was used to estimate divergent times between P. ussuriensis and the other 11 fish species. Four fossil calibration times, including divergence times between D. rerio and I. punctatus (126–179 Mya), D. rerio and O. latipes (205–255 Mya), O. latipes and O. niloticus (87–151 Mya), and O. latipes and L. chalumnae (425–446 Mya), were obtained from the TimeTree database (http://www.timet ree.org/, 15 June 2021, date last accessed) as the time control. Based on the divergent time tree, expansions and contractions of gene families were analyzed using the CAFE software.94

2.7. Comparative genomic analysis

To analyze chromosomal evolution of the P. ussuriensis (Bagridae) genome, syntenic comparisons were performed between P. ussuriensis and another four fishes of Siluriformes, including P. fulvidraco (Bagridae), I. punctatus (Ictaluridae), S. meridionalis (Siluridae), and P. hypophthalmus (Pangasiidae). Alignments of entire genomic DNA sequences between P. ussuriensis and other fishes were performed through Mummer v3.0,95 and syntenic relationships were plotted using the JCVI package.96

2.8. Identification of the sex-determination region

The correspondences between linkage groups (LGs) of the genetic linkage map97 and chromosomes (chrs) of the genome assembly were determined through mapping the SNP-containing 2b-RAD tags from the linkage map to the chromosomal-level genome using the Bowtie v2.4.0 software.98 The sex of individuals used for the genetic linkage map construction of P. ussuriensis97 were identified using the previously identified male-specific microsatellite marker PuGT54.21 Then sex-related QTLs were identified through MapQTL 6.099 using the previously reported method.97 Only the QTLs with phenotypic variance explained (PVE) values of 100% were used for further analyses.

The detected sex-related QTLs and sex-specific 2b-RAD tags identified in a previous study64 were mapped to the assembled P. ussuriensis genome using Bowtie v2.4.0. Moreover, male-specific markers identified in our previous studies19,21 were also mapped on the genome using Local BLASTn with an E-value cutoff of 1e−10. The chromosomal region containing more than 90% of the above segments was considered as the sex-determination region, and the chromosome containing the sex-determination region was predicted to be the sex chromosome of P. ussuriensis. Sequences of genes in the sex-determination region were compared with 15,495 differently expressed genes (DEGs) between gonads of males and females, as well as 14,022 DEGs between males and estradiol-treated males (pseudofemales), which were identified in a previous study.100 The genes corresponded to the DEGs that were shared between the two datasets were used for KOG, GO, and KEGG classifications to detect potential sex-determining genes.

3. Results

3.1. Genome sequencing and assembly

A total of 38.7 Gb Illumina clean data were obtained and used for estimations of the size and heterozygosity of the P. ussuriensis genome. The estimated genome size was 633.69 Mb, and the heterozygosity was 0.32%. Nanopore sequencing resulted in 75.0 Gb clean data, which were preliminarily assembled into 1,142 contigs. The assembled genome size was 741.89 Mb, with the N50 length of 5.61 Mb, the longest contig of 24.25 Mb, and the average contig length of 6.50 Mb (Table 1). The guanine–cytosine (GC) analysis resulted in an average GC content of 39.33% and a unimodal GC content distribution for the P. ussuriensis genome. The completeness of the genome assembly was evaluated to be 94.6% by BUSCO.

Table 1.

Genome assembly statistics of Pseudobagrus ussuriensis

Nanopore Hi-C
Total contigs 1142 819
Total base (bp) 741,894,228 741,974,678
Total N (bp) 0 80,450
Average length (bp) 649,644 905,951
Longest contig (bp) 24,251,496 44,389,119
Shortest contig (bp) 1032 1000
L50 42 11
N50 5,612,645 28,525,251
L75 98 18
N75 1,908,622 22,812,222
L90 207 23
N90 482,052 18,557,309
GC (%) 39.33 39.33
Complete BUSCOs 4335 (94.6%) 3544 (97.4%)
Complete and single-copy BUSCOs 4163 (90.8%) 3484 (95.7%)
Complete and duplicated BUSCOs 172 (3.8%) 60 (1.6%)
Fragmented BUSCOs 83 (1.8%) 24 (0.7%)
Missing BUSCOs 166 (3.6%) 72 (2.0%)
Total BUSCO groups searched 4584 3640

The Hi-C sequencing resulted in 78.60 Gb clean data, containing 524,470,464 clean paired-end reads, among which, 164.62 million unique mapped reads and 118.88 million valid reads (45.33% of clean reads and 72.21% of unique mapped reads) were obtained. Using the Hi-C data, 819 contigs with a size of 741.97 Mb were assembled, and the N50 and average contig lengths were 28.53 and 9.06 Mb (Table 1), respectively. The genome assembly was anchored and orientated into 26 chromosomes with a size range of 17.45–44.39 Mb and a total size of 722.22 Mb, which covered 97.34% of the whole assembled genome (Table 2). The GC contain of the Hi-C genome assembly was 39.33% (Table 1), which was identical with the nanopore assembly. The result of BUSCO estimation showed that the completeness of the Hi-C genome assembly was 97.4% (Table 1). The chromosomal-crosstalk heatmap indicated the complete and robust assembly of the P. ussuriensis genome (Fig. 1A), and characteristics of this genome assembly were also analyzed (Fig. 1B).

Table 2.

Correspondence between the physical genome and the genetic linkage map of Pseudobagrus ussuriensis

Chromosome Linkage group Physical length (Mb) Genetic size (cM) No. of genes Gene density (genes/Mb)
chr01 LG4 44.39 100.28 1555 35.03
chr02 LG3 42.31 121.38 1376 32.52
chr03 LG2 40.92 134.97 1231 30.08
chr04 LG5 37.81 103.10 1210 32.00
chr05 LG8 34.58 85.47 1074 31.06
chr06 LG7 33.77 103.94 1058 31.33
chr07 LG9 33.32 102.64 1090 32.71
chr08 LG1 32.06 108.52 912 28.45
chr09 LG6 29.58 116.98 967 32.69
chr10 LG13 29.44 91.20 751 25.51
chr11 LG12 28.53 151.36 978 34.28
chr12 LG20 28.21 106.71 965 34.21
chr13 LG11 27.56 110.15 648 23.51
chr14 LG23 26.75 109.60 860 32.15
chr15 LG10 26.48 87.00 868 32.78
chr16 LG15 25.77 84.26 805 31.24
chr17 LG14 22.93 118.50 879 38.33
chr18 LG16 22.81 98.47 822 36.04
chr19 LG21 22.37 90.78 848 37.91
chr20 LG22 22.30 92.32 881 39.51
chr21 LG19 19.40 79.15 649 33.45
chr22 LG25 19.03 105.93 681 35.79
chr23 LG18 18.56 93.55 559 30.12
chr24 LG24 18.43 96.61 684 37.11
chr25 LG17 17.46 82.53 633 36.25
chr26 LG26 17.45 81.31 680 38.97
Total 722.22 2656.72 23664 32.77
Coverage rate (%) 97.34 98.93

Figure 1.

Figure 1.

Genome assembly of Pseudobagrus ussuriensis. (A) The chromosomal-crosstalk heatmap based on Hi-C data. The blocks represent the contacts between one location and another. The color reflects the intensity of each contact, with deeper colors representing higher intensity. (B) Genome characteristics of P. ussuriensis, from outer to inner circles: a, the 26 chromosomes at the Mb scale; b–e, GC content, SNP density, repeat density and gene density across the genome, respectively, drawn in 1 Mb nonoverlapping windows.

3.2. Genome annotation

A total of 2,477,914 repeat sequences were identified with a size of 296,239,694 bp, which accounted for 39.93% of the P. ussuriensis genome, and TEs were the most abundant class with a percentage of 32.99% (Supplementary Table S1). Identification of ncRNAs resulted in 14,393 sequences, including 10,755 tRNAs, 337 miRNAs, 312 rRNAs, and 216 snRNAs (Supplementary Table S2).

Among the three gene-prediction methods, the homolog method resulted in 34,685 genes (Table 3), the RNA-seq method identified 18,081 genes, and the de novo method predicted 26,128 genes (Table 3). Through integration of the three gene sets, a total of 24,075 genes with an average length of 14,909 bp and an average CDS length of 1,657 bp (Table 3) were finally predicted for the P. ussuriensis genome. There were ∼10 exons for these genes on average, and the average length of exons was 164 bp with the average intron length of 1,451 bp (Table 3). The length distribution trends of gene, CDS, exon and intron, as well as number distributions of exon and intron were shown in Supplementary Fig. S1. Chromosome 1 (chr1) contained the most genes (1,555), while chr25 had the fewest (633), and the gene density was the highest in chr20 (39.51 genes/Mb) and lowest in chr13 (23.51 genes/Mb, Table 2). The 26 chromosomes comprised 23,664 (98.29%) of the 24,075 genes (Table 2), with the other 411 genes being located on unanchored contigs. According to the BUSCO assessments, 94.51% complete BUSCO genes were successfully predicted, which comprised 92.66% of complete single-copy BUSCOs (S) and 1.84% of complete and duplicated BUSCOs (D) from the 3,640 assessed genes (Supplementary Table S3). The results of BUSCO estimations indicated that the assembly and annotation of the P. ussuriensis genome were high in integrity and quality. Additionally, through annotations in five public databases, 22,039 (91.54%) of the predicted genes were annotated in at least one database (Supplementary Table S4). The assembly and annotation data of P. ussuriensis were compared with those of some reported catfishes (Table 4), including P. fulvidraco,33L. longirostris,36H. wyckioides,38G. maculatum,101B. yarrelli,34S. meridionalis,35Silurus asotus,102Silurus glanis,103Clarias batrachus,104I. punctatus,105 and P. hypophthalmus.37

Table 3.

Gene prediction of the Pseudobagrus ussuriensis genome

No. of genes Average gene length (bp) Average CDS length (bp) Average No. of exons per gene Average exon length (bp) Average intron length (bp)
RNA-seq NGS 18,081 21,022.24 3,781.77 11.09 340.99 1,708.55
Predicted AUGUSTUS 26,128 13,012.66 1,570.09 9.73 161.38 1,310.86
PASA 16,838 17,872.69 3,804.68 11.15 341.2 1,385.87
Homology Pelteobagrus fulvidraco 57,395 21,779.86 1,264.01 6.72 188.2 3,589.04
Ictalurus punctatus 46,565 28,501.24 1,620.15 9.18 176.58 3,288.14
Danio rerio 136,807 18,850.56 1,591.23 6.79 234.31 2,980.32
Oryzias latipes 53,967 44,948.92 1,683.61 9.49 177.4 5,095.86
Ctenopharyngodon idellus 75,906 10,971.60 1,106.53 3.83 289.03 3,487.81
Total Evidence modeler 24,075 14,909.21 1,656.70 10.13 163.5 1,451.06

Table 4.

Comparisons between the Pseudobagrus ussuriensis genome and assembled genomes of other Siluriformes species

Species Family Assembly size (Mb) Number of chrs Contig N50 (Mb) Genome coverage (%) GC (%) Identified genes Annotated genes Repetitive elements (%) Sources
Pseudobagrus ussuriensis Bagridae 741.97 26 28.53 97.34 39.33 24,075 22,039 39.93 This study
Pelteobagrus fulvidraco Bagridae 732.80 26 25.80 94.20 39.60 24,552 24,552 38.47 Gong et al.33
Leiocassis longirostris Bagridae 703.19 26 28.03 97.44 39.67 23,708 23,170 33.99 He et al.36
Hemibagrus wyckioides Bagridae 789.79 29 22.08 97.70 22,794 21,142 40.12 Shao et al.38
Glyptosternon maculatum Sisoridae 704.80 24 27.80 91.70 39.60 22,066 20,234 32.76 Xiao et al.101
Bagarius yarrelli Sisoridae 571.00 3.10 95.30 19,027 17,740 35.26 Jiang et al.34
Silurus meridionalis Siluridae 738.90 29 28.04 99.60 39.00 22,965 22,519 40.12 Zheng et al.35
Silurus asotus Siluridae 744.12 29 28.82 94.00 22,894 29.80 Chen et al.102
Silurus glanis Siluridae 793.40 3.20 84.20 39.20 21,316 20,532 39.50 Ozerov et al.103
Clarias batrachus Clariidae 821.00 0.36 95.20 39.20 22,914 22,587 30.30 Li et al.104
Ictalurus punctatus Ictaluridae 845.00 29 7.73 96.80 39.69 21,556 20,370 32.56 Chen et al.105
Pangasianodon hypophthalmus Pangasiidae 731.70 30 29.50 98.46 38.89 18,895 18,604 36.90 Gao et al.37

3.3. Genomic evolution analysis

Results of gene family clustering with other teleost showed that 21,150 (87.85%) genes, including 2,698 single copy orthologs and 556 unique genes, in the P. ussuriensis genome could be assigned into 15,636 gene families with 152 of which being unique in P. ussuriensis (Supplementary Table S5). Using the 2,698 single copy orthologs, a phylogenetic tree was constructed between P. ussuriensis and another 11 fish species. The phylogenetic tree showed that P. ussuriensis firstly clustered with P. fulvidraco (Bagridae), and then successively clustered with G. maculatum (Sisoridae), I. punctatus (Ictaluridae), and S. meridionalis (Siluridae) forming the Siluriformes clade (Fig. 2A). Then the Siluriformes clade formed the Ostariophysan fish clade with A. mexicanus (Characiformes) and D. rerio (Cypriniformes) (Fig. 2A). Based on fossil calibration times of known fishes, ancestors of P. ussuriensis and P. fulvidraco have separated in ∼11.04 Mya (Fig. 2B). The results of gene family expansions and contractions showed that 318 gene families experienced expansions and 581 underwent contractions (Fig. 2C).

Figure 2.

Figure 2.

Phylogenomic trees between Pseudobagrus ussuriensis and other fishes. (A) Phylogenetic tree between P. ussuriensis and another 11 fishes. (B) Divergence time tree between P. ussuriensis and another 11 fish species. Numbers on the branches represent estimated diverge times in millions of years ago, and nodes marked with red circles indicate the calibration time from fossil. (C) Expansion and contraction of gene families in P. ussuriensis. MRCA stands for the most recent common ancestor; pie charts and numbers represent the proportion and numbers of gene families with expansion marked with green and contraction marked with red, respectively.

Syntenic relationships of the P. ussuriensis chromosomes with four catfish species were analyzed through genomic comparisons. The results between genomes of P. ussuriensis and P. fulvidraco showed that 20 chrs of P. ussuriensis (Pu_chr) exhibited a one-to-one relationship with chrs of P. fulvidraco (Pf_chr), while the remaining six chrs involved three one-to-two relationships (Pu_chr1 to Pf_chr2 and Pf_chr23, Pu_chr2 to Pf_chr7 and Pf_chr8, and Pu_chr3 to Pf_chr7 and Pf_chr16) and two two-to-one relationships (Pu_chr4 and Pu_chr23 to Pf_chr1, and Pu_chr1 and Pu_chr21 to Pf_chr2) (Fig. 3A). Comparisons between Pu_chrs and chrs of I. punctatus (Ip_chr) resulted in 20 one-to-one relationships, five one-to-two relationships (Pu_chr1 to Ip_chr10 and Ip_chr20, Pu_chr2 to Ip_chr13 and Ip_chr29, Pu_chr3 to Ip_chr12 and Ip_chr25, Pu_chr5 to Ip_chr11 and Ip_chr12, and Pu_chr9 to Ip_chr2 and Ip_chr22), and one two-to-one relationship (Pu_chr9 and Pu_chr18 to Ip_chr2) (Fig. 3A). Syntenic relationships between Pu_chrs and chrs of S. meridionalis (Sm_chr) included 21 one-to-one, four one-to-two (Pu_chr1 to Sm_chr9 and Sm_chr20, Pu_chr2 to Sm_chr7 and Sm_chr28, Pu_chr3 to Sm_chr23 and Sm_chr29, and Pu_chr9 to Sm_chr18 and Sm_chr27) and one two-to-one (Pu_chr9 and Pu_chr13 to Sm_chr18) relationships (Fig. 3B). The comparison results between Pu_chrs and chrs of P. hypophthalmus (Ph_chr) showed 22 one-to-one relationships and four one-to-two relationships (Pu_chr1 to Ph_chr11 and Ph_chr21, Pu_chr2 to Ph_chr12 and Ph_chr28, Pu_chr3 to Ph_chr19 and Ph_chr29, and Pu_chr9 to Ph_chr24 and Ph_chr30) (Fig. 3B).

Figure 3.

Figure 3.

Chromosomal syntenic relationships and estimated chromosome evolution processes between Pseudobagrus ussuriensis and another four catfishes. (A) The syntenic relationships among chromosomes of P. ussuriensis (P. uss), Pelteobagrus fulvidraco (P. ful), and Ictalurus punctatus (I. pun). (B) The syntenic relationships among chromosomes of P. ussuriensis (P. uss), Silurus meridionalis (S. mer), and Pangasianodon hypophthalmus (P. hyp). (C)–(F) Estimated catfish chromosome evolution clusters 1–4, Pu P. ussuriensis, Pf P. fulvidraco, Ip I. punctatus, Sm S. meridionalis, and Ph P. hypophthalmus. (G) The estimated evolution process of 26 chromosomes in Pseudobagrus ussuriensis.

Through the chromosomal syntenic relationships between P. ussuriensis and another four catfish species, four evolutionary chromosome clusters (Fig. 3C–F) of the five catfishes were identified, which caused different chromosome numbers of the five fishes. The first cluster involved two chromosomes from P. ussuriensis (Pu_chr1 and Pu_chr21) and P. fulvidraco (Pf_chr2 and Pf_chr23), and three chromosomes from I. punctatus (Ip_chr10, Ip_chr20 and Ip_chr21), S. meridionalis (Sm_chr9, Sm_chr19 and Sm_chr20), and P. hypophthalmus (Ph_chr11, Ph_chr20 and Ph_chr21) (Fig. 3C). The second cluster was the most complex one which contained three chromosomes of P. ussuriensis (Pu_chr2, Pu_chr3 and Pu_chr5), four of P. fulvidraco (Pf_chr5, Pf_chr7, Pf_chr8, and Pf_chr16) and five of I. punctatus (Ip_chr11, Ip_chr12, Ip_chr13, Ip_chr25 and Ip_chr29), S. meridionalis (Sm_chr1, Sm_chr7, Sm_chr23, Sm_chr28 and Sm_chr29) and P. hypophthalmus (Ph_chr2, Ph_chr12, Ph_chr19, Ph_chr28 and Ph_chr29) (Fig. 3D). The third cluster included three chromosomes of P. ussuriensis (Pu_chr9, Pu_chr13 and Pu_chr18), P. fulvidraco (Pf_chr9, Pf_chr15 and Pf_chr19), I. punctatus (Ip_chr2, Ip_chr19 and Ip_chr22) and S. meridionalis (Sm_chr14, Sm_chr18 and Sm_chr27), and four of P. hypophthalmus (Ph_chr13, Ph_chr18, Ph_chr24 and Ph_chr30) (Fig. 3E). The fourth cluster was the simplest, containing two chromosomes from P. ussuriensis (Pu_chr4 and Pu_chr23), I. punctatus (Ip_chr7 and Ip_chr27), S. meridionalis (Sm_chr6 and Sm_chr26), and P. hypophthalmus (Ph_chr4 and Ph_chr25), and one from P. fulvidraco (Pf_chr1) (Fig. 3F). Based on these evolutionary chromosome clusters and the syntenic relationships between channel catfish and zebrafish chromosomes,106 the formation of chromosomes was deduced for P. ussuriensis (Fig. 3G).

3.4. Identification of the sex-determination region

Through homologous mapping of SNP-containing 2b-RAD tags from the linkage map to the assembled genome, each LG of the genetic linkage map corresponded to a chromosome of the genome assembly (Table 2). The results of QTL-mapping indicated that sex-related QTLs were all located on LG1 (chr08 of the genome assembly). A total of 546 sex-related QTLs with an LOD range of 5.24–2,247.55 (the chromosome-wide LOD threshold was 5.2) were finally identified with 539 of which being genome-wide level QTLs (the LOD threshold was 7.0). The PVE values of these QTLs varied from 15.0% to 100.0%, and PVE values of 317 QTLs were higher than 90.0%, among which 143 had PVE values of 95.0–99.9% and 62 owned a PVE value of 100.0% (Supplementary Table S6). The 62 sex-related QTLs with the PVE value of 100.0% were located on an interval of 69.93 cM (from 11.56 to 81.49 cM) on LG1 of the male genetic linkage map (Supplementary Fig. S2).

The 62 sex-related QTLs were then compared to the assembled genome, and 57 of them were mapped on the genome assembly, with 55 (96.5%) of which being mapped on a 15.76 Mb region (from 7.20 to 22.96 Mb) of chr08 (Supplementary Table S6). In one of our previous study, 327 potential male-specific tags were identified through 2b-RAD sequencing,64 and they were also located in the assembled genome. The results showed that only one of the 327 tags were not mapped, and 314 (96.3%) of the mapped 326 tags were all located at a 16.83 Mb region (from 6.90 to 23.73 Mb) of chr08 (Supplementary Table S7). The previously reported male-specific microsatellite PuGT5421 was located at the position of 19.48 Mb, and the male-specific sequence19 was located at the region of 8.36–8.37 Mb on chr08 (Supplementary Table S7). All of the above regions were then integrated and formed a final potential sex-determination region of 16.83 Mb (from 6.90 to 23.73 Mb) on chr08.

3.5. Screening of potential sex-determining genes

According to the results of gene prediction on chr08, a total of 554 genes were identified in the sex-determination region (Supplementary Table S8). Among the 554 predicted genes, 510 were annotated in at least one of the five public databases (Supplementary Table S8). The results of KEGG annotations showed that 353 of the 554 genes could be classified into 260 pathways (Supplementary Table S9), among which some sex-related pathways, such as TGF-beta signaling pathway, Wnt signaling pathway, estrogen signaling pathway and ovarian steroidogenesis, were identified (Supplementary Table S9). Through referring to the comparative transcriptome data between males and females,100 181 of the 554 predicted genes were found as DEGs with 85 being highly expressed in males and 96 highly expressed in females (Supplementary Table S8). The 554 genes were then compared with DEGs between males and pseudofemales,100 and 150 DEGs were found, among which 135 coincided with DEGs between males and females and had the same expression mode (Supplementary Table S8). Therefore, the 135 DEGs, with 65 highly expressed in males and 70 highly expressed in females/pseudofemales, may contain potential sex-determining genes. Results of annotations showed that 130 of the 135 genes could be found as homologs in at least one public database (Supplementary Table S8).

Through GO analysis, 86 of the 135 DEGs were classified into 35 level2 GO terms, among which, 13 terms showed sex-specific expression trends with seven in males and six in females (Supplementary Fig. S3). The seven male-specific terms contained eight genes including Chr08.223 (nuclear receptor subfamily 4 group A member 2, nr4a2), Chr08.261 (insulin-like growth factor-binding protein 5, igfbp5), Chr08.262 (insulin-like growth factor-binding protein 2-B, igfbp2b), Chr08.312 (indian hedgehog B protein, ihhb), Chr08.326 (neuropilin-2, nrp2), Chr08.368 (interferon-induced helicase C domain-containing protein 1, ifih1), Chr08.375 (bromodomain adjacent to zinc finger domain protein 2B, baz2b), and Chr08.409 (homeobox protein Hox-D3, hoxd3a), while the six female-specific terms contained six genes, which were Chr08.269 (gap junction gamma-1 protein, gjc1), Chr08.320 (tubulin alpha chain, tba), Chr08.430 (rap guanine nucleotide exchange factor 4, rapgef4), Chr08.435 (histone acetyltransferase type B catalytic subunit, hat1), Chr08.470 (Cis-aconitate decarboxylase, acod1), and Chr08.475 (general transcription factor IIF subunit 2, gtf2f2) (Supplementary Table S10). The results of KEGG analysis showed that 88 DEGs were classified into six class 1 and 33 class 2 terms involving 126 pathways (Supplementary Fig. S4 and Table S11), and PI3K-Akt signaling pathway, aldosterone synthesis and secretion and cholesterol metabolism were the top three sex-related pathways (Supplementary Table S11). Through integrating the results of GO and KEGG enrichments, 16 candidate sex-determining genes, including ten male-highly expressed and six female-highly expressed genes, were finally identified (Table 5).

Table 5.

Candidate sex-determining genes in the sex-determination region of Pseudobagrus ussuriensis

Gene log2(m/f) NR_Annotation Sex-related KEGG pathway
Chr08.223 8.3753 Nuclear receptor subfamily 4 group A member 2 (nr4a2) Aldosterone synthesis and secretion
Chr08.303 8.5218 Low-density lipoprotein receptor-related protein 2 (lrp2) Hedgehog signaling pathway; cholesterol metabolism
Chr08.358 −2.4828 Acyl-CoA-binding protein (acbp) PPAR signaling pathway
Chr08.368 4.4330 Interferon-induced helicase C domain-containing protein 1 (ifih1) RIG-I-like receptor signaling pathway
Chr08.399 4.3033 Caspase-8 (casp8) TNF signaling pathway
Chr08.409 10.8953 Homeobox protein Hox-D3 (hoxd3a)
Chr08.428 −7.3091 Cell division cycle-associated protein 7 (cdca7)
Chr08.520 −4.4996 Serine/threonine-protein phosphatase 2A regulatory subunit B″ subunit beta (ppp2r3b) PI3K-Akt signaling pathway
Chr08.527 −4.2231 Cell division cycle protein 16 homolog (cdc16) Cell cycle; oocyte meiosis; progesterone-mediated oocyte maturation
Chr08.566 3.5367 Sterol 26-hydroxylase, mitochondrial (cyp27a1) PPAR signaling pathway; cholesterol metabolism
Chr08.570 3.4892 Collagen alpha-2(IV) chain (col4a2) PI3K-Akt signaling pathway
Chr08.571 4.4797 Collagen alpha-2(IV) chain (col4a2) PI3K-Akt signaling pathway
Chr08.572 3.5372 Collagen alpha-1(IV) chain (col4a1) PI3K-Akt signaling pathway
Chr08.622 3.2926 3-Beta-hydroxysteroid dehydrogenase/Delta 5—> 4-isomerase (hsd3b) Aldosterone synthesis and secretion; steroid hormone biosynthesis; ovarian steroidogenesis; cortisol synthesis and secretion
Chr08.746 −4.8826 Testis-expressed protein 30 (tex30)
Chr08.760 −4.7110 Frizzled-5 (fzd5) Wnt signaling pathway

4. Discussion

In the present study, a chromosome-level genome of P. ussuriensis was successfully assembled. The genome size of P. ussuriensis was 741.97 Mb, containing 26 chromosomes, which was identical to the previous karyotypic results of this fish.107 The size of the P. ussuriensis genome was similar to those of reported catfish species (Table 4), and was closest to those of two Siluridae fishes: 738.90 Mb in S. meridionalis35 and 744.12 Mb in S. asotus.102 When compared with genome sizes of three reported Bagridae fishes,33,36,38 the genome size of P. ussuriensis was closest to that of P. fulvidraco (Table 4). The GC content in the genome of P. ussuriensis was similar to other catfishes (Table 4). In fact, the GC content of Siluriformes species seems very stable, with a percentage of about 39% (Table 4). Through three prediction methods, 24,075 protein-coding genes were identified in the P. ussuriensis genome, which was similar to those of reported Bagridae fishes and more than catfishes in families of Sisoridae, Siluridae, Clariidae, Ictaluridae, and Pangasiidae (Table 4). Therefore, this result indicated that Bagridae fishes may contain more genes than the families mentioned above. Moreover, the two Bagridae fishes P. ussuriensis and P. fulvidraco also had more unique genes and gene families than S. meridionalis (Siluridae), I. punctatus (Ictaluridae), and G. maculatum (Sisoridae) (Supplementary Table S5). The more frequent expansions and contractions of gene families may be the key reasons for the larger number of genes and more unique genes and gene families in Bagridae fishes than catfishes of other families (Fig. 2C).

Four different genera of Bagridae, including Pseudobagrus, Pelteobagrus, Leiocassis, and Hemibagrus, have been reported to occur in East Asia.108 The results of phylogenetic analyses showed that P. ussuriensis had the closest relationship to P. fulvidraco, and these two species separated in ∼11.04 Mya (Fig. 2B). According to previously reported studies, separation times were 41.73 Mya between H. wyckioides and P. fulvidraco,38 and 26.2 Mya between L. longirostris and P. fulvidraco.36 Therefore, among the four genera of the Bagridae family, Pseudobagrus and Pelteobagrus had the closet relationship. The phylogenetic relationships within the four genera of Bagridae are still equivocal and controversial. Based on skeleton characters, Dai et al. pointed out that Leiocassis was the most original genus, followed by Pseudobagrus and Pelteobagrus, and Hemibagrus was the most developed genus.109 While Hemibagrus was believed to be at a basal position based on mitochondrial DNA sequences, and Pseudobagrus, Pelteobagrus, and Leiocassis were found to be polyphyly in many studies.108,110,111 Moreover, Ku et al. proposed that Hemibagrus and Pseudobagrus were the only two valid genera of East Asian bagrids, and the Chinese species assigned to Pelteobagrus and Leiocassis should be considered members of the genus Pseudobagrus.108 The assembled genome of P. ussuriensis obtained in this study will be helpful for further investigations on classifications of Bagridae species at the whole genome level.

Previous studies have found that majority fishes from families of Bagridae, Sisoridae, Horabagridae, and Amblycipitidae were involved in a predominant Asian group, which were called the ‘Big Asia’.111,112 The phylogenetic tree obtained in this study also accorded with this opinion, as Bagridae and Sisoridae formed a clade with strong supports (Fig. 2A). The estimated divergent times of Bagridae and Sisoridae in this study (Fig. 2B) also conformed to the viewpoint that these two major families within the ‘Big Asia’ clade occurred just after the K-Pg boundary (<66 Mya).111 The ‘Big Asia’ clade in this study was closer to the family Ictaluridae than Siluridae, and the similar phenomenon has also been reported in previous studies.38,111 It has been reported that, although Ictaluridae is endemic in North American, it can be connected with Asian catfishes through its close relative in Asia, the Cranoglanididae family.111–113 Hardman deduced that the ancestor of Ictaluridae speciated from the ancestor of Cranoglanididae, and have invaded freshwaters of North America at the close of the Cretaceous through northeastern.113 While Siluridae has been reported to be a monophyletic family, and its connections to other catfish families were still unresolved.111,113 Additionally, P. ussuriensis and other catfishes formed the Siluriformes clade, and showed a closer relationship with Characiformes than Cypriniformes in the Ostariophysi, which coincided with results of previous studies.34,38,114

It has been reported that the ancient chromosome number of bony fish was 12 pairs, and formations of the present various chromosome numbers were the results of whole genome duplication, chromosomal fission/fusion, and intra/inter-chromosomal rearrangements.115–118 In Siluriformes species, chromosome numbers varied from 2n = 24 to 2n = 100,2 and 2n = 58 was deduced to be the ancestral number.5 Our results also support this deduction, as the two species S. meridionalis and I. punctatus, both of which were at earlier divergent statuses of the phylogenetic tree (Fig. 2A), had a haploid chromosome number of 29. As both of the two close-related fishes (in orders of Characiformes and Cypriniformes) of Siluriformes had 25 chromosome pairs, the ancient 29 chromosome pairs of Siluriformes may evolved from 25 chromosome pairs. The chromosomal syntenic relationships between I. punctatus and zebrafish showed that each of Ip_chr2 and Ip_chr12 corresponded to a sole zebrafish chromosome instead of two,106 therefore, I. punctatus may preserve more ancestral chromosome structures than S. meridionalis and P. hypophthalmus (Fig. 3D and E). Using chromosomes of I. punctatus as the ancestral type, chromosome number changes among five catfish species were analyzed, which mainly involved four chromosome clusters (Fig. 3C−F). In the first cluster, S. meridionalis and P. hypophthalmus preserved the original chromosomes, Pu_chr1 fused Ip_chr10 and Ip_chr20, while Pf_chr2 fused Ip_chr10 and Ip_chr21 (Fig. 3C). In the second cluster, Ip_chr12 was broken into two segments, one of which solely formed a new chromosome, and the other one fused with Ip_chr11 in S. meridionalis and P. hypophthalmus; in P. ussuriensis, Pu_chr3 fused Ip_chr25 and one segment of Ip_chr12, Pu_chr5 fused Ip_chr11 and the other segment of Ip_chr12, Pu_chr2 fused Ip_chr13 and Ip_chr29; in P. fulvidraco, Pf_chr5 fused Ip_chr11 and an Ip_chr12 segment, Pf_chr7 fused Ip_chr29 and the other Ip_chr12 segment (Fig. 3D). In the third cluster, chromosomal fission was occurred on Ip_chr2, and formed two new chromosomes in P. hypophthalmus, which was also the direct causation of 30 chromosomes in P. hypophthalmus; in S. meridionalis, one segment of Ip_chr2 solely formed a new chromosome and the other segment fused with Ip_chr19; while in P. ussuriensis and P. fulvidraco, an Ip_chr2 segment formed a new chromosome solely and the other segment fused with Ip_chr22 (Fig. 3E). In the fourth cluster, Ip_chr7 and Ip_chr27 fused in P. fulvidraco but not in the other four species (Fig. 3F). Based on these chromosome clusters and chromosomal syntenic relationships between I. punctatus and zebrafish,106 the formation of the haploid 26 chromosome number in P. ussuriensis was inferred, which experienced a process of 25–29–26, and chromosomal fissions and fusions were dominant events for chromosome number changes as reported in other studies.118–120

Sex chromosomes are the most concerned chromosomes in teleosts, and origins and evolutions of them are considered as central topics in evolutionary genetics.14 However, as sex chromosomes are morphologically undifferentiated and homomorphic in most teleosts, identifications of sex chromosomes should rely on sex-determining genes or sex-specific molecular markers. QTL-mapping is a useful method to find sex-related loci and identify potential sex chromosomes, which has been successfully used in many fishes.25,26,121 In the present study, 546 sex-related QTLs including 62 QTLs with a PVE value of 100.0% were detected on LG1 of the P. ussuriensis genetic linkage map, which corresponded to chr08 of the assembled genome. The number of identified sex-related QTLs was much more than that of many fishes.26,121,122 In many previously reported fishes, sex-related QTLs were located at short regions of LGs, such as 5.70 cM in Pundamilia pundamilia,121 3.86 cM in Hypophthalmichthys nobilis,123 and 0.40 cM in Channa argus,122 however, the sex-related QTL region in this study was quite large, which spanned 69.93 cM of LG1. Although large sex-related QTL regions have also been identified in other fishes, including 66.03 cM in channel catfish,106 70.88 cM in Scophthalmus maximus124 and 124.72 cM in Nibea albiflora,26 the values of LOD and PVE of QTLs in P. ussuriensis were much higher. Given the larger number, higher LOD and PVE values and wider spanning regions of sex-related QTLs in P. ussuriensis, the differentiation level of sex chromosomes may be higher than fishes mentioned above.

These QTLs were mapped to the assembled chromosome-level genome of P. ussuriensis, and located at a 15.76-Mb region on chr08. Furthermore, previously identified male-specific 2b-RAD tags,64 the male-specific microsatellite PuGT5421 and a male-specific sequence19 were also located on chr08, forming a combined potential sex-determination region of 16.83 Mb (from 6.90 to 23.73 Mb). It has been reported that to increase the association of sexually antagonistic alleles with the sex-determining locus, recombination on sex chromosomes was usually reduced,125 therefore, sex-biased genetic polymorphisms can be found on sex chromosomes.27 The results in the present study also support this viewpoint, as hundreds of male-specific loci concentrated in the potential sex-determination region. As chr08 was the only chromosome that contained the potential sex-determination region, it was deduced to be the sex chromosome, to be precise, the X chromosome, of P. ussuriensis. Sex chromosomes were believed to be the most difficult part of the genomes to study, thus most fish genome projects have purposely avoided the heterogametic sex, even if new sequencing technologies have been used.14 In the present study, although the homogametic female was used to obtain the genome assembly of P. ussuriensis, the sex chromosome and potential sex-determination region were still identified with the help of sex-specific QTLs, tags, markers and sequences. In our future studies, the YY super-male individuals will be bred to obtain the Y chromosome sequence, which will be compared with the sequence of X chromosome obtained in this study, and the sex-determination region will be located on a more accurate segment of the Y chromosome. By then, answers of many questions, including the evolution process of sex chromosomes, whether there were male-specific sex-determining genes on the Y chromosome, whether some male-specific sites/genes on chr08 were possible fast-evolving sites/genes, will be revealed.

Sex-determining genes are greatly diverse in fish, and genes from classical transcription factors such as dmy126 and sox3,127 to genes related to TGF-β such as gsdf128 and amh129,130 and its receptor amhr2,131 or steroidogenic pathway-related genes such as hsd17b1132 have been identified as sex-determining genes. Among these genes, dmrt1, amh, sox3, and gsdf have conserved roles in the vertebrate sex-determination pathway, and are called ‘usual suspects’.133 However, in recent years, more ‘newcomers’ of sex-determining genes, which had no previously known sex determination functions, have been identified, such as Paics in tilapia,10bcar1 in channel catfish,134 and sdY in salmonids.135 In the present study, 554 genes were identified in the potential sex-determination region of P. ussuriensis. However, none of the sex-determining genes that previously reported in other fishes were detected in this region. In fact, homologs of these genes were not even located on the deduced sex chromosome chr08, which indicated that the sex-determination mechanism of P. ussuriensis may be complex. Although comparisons between transcriptomes of female and male gonads indicated that dmrt1 and gdf9 may play essential roles in sex determination of P. ussuriensis,100 the sex-determining genes should be reconsidered, since the two genes were, respectively, mapped on chr10 and chr15, instead of on the sex-determination region of chr08. Expression data of the 554 genes from the sex-determination region were extracted from a transcriptomic study,100 in which transcriptomes between males and females, and between males and pseudofemales were compared. Finally, 135 of the 554 genes were identified as DEGs. Sixteen potential sex-determining genes were screened out, but their roles in sex determination of P. ussuriensis were still unknown. Studies on expression patterns of these candidate genes at different developmental stages of the gonads will be performed in our following work, which may provide supportive evidences to determine truly potential candidate sex-determining genes. Moreover, further functional analyses will be performed for these candidate sex-determining genes, moreover, other ‘newcomers’ will also be considered in our future studies.

In conclusion, a chromosome-level genome was assembled and annotated for female P. ussuriensis. Chromosomal evolutions between P. ussuriensis and other catfishes were analyzed with four evolutionary chromosome clusters being identified, and the formation of the P. ussuriensis chromosome number was deduced. A sex-determination region was focused on chr08, which was predicted as the X chromosome, and sixteen genes in this region were considered as candidate sex-determination genes. The chromosome-level genome obtained herein will provide basic data for genetic and genomic studies of P. ussuriensis, moreover, results of chromosomal evolution and sex-determination analyses will be helpful to analyze genomic evolution of Siluriformes species and clarify the sex-determination mechanism of P. ussuriensis.

Supplementary data

Supplementary data are available at DNARES online.

Supplementary Material

dsac028_Supplementary_Data

Acknowledgements

This research was supported by Natural Science Foundation of Jiangsu Province (BK20211367), funds of the Key Laboratory of Freshwater Aquatic Biotechnology and Breeding, Ministry of Agriculture and Rural Affairs, Heilongjiang Fisheries Research Institute, Chinese Academy of Fishery Sciences (FBB2020-01), National Natural Science Foundation of China (31602146), and the Start-up Funds of Scientific Research of Huaiyin Normal University (31ZCK00). Authors thank Xiaogang Qiang and Wenyong Chen for material preparation.

Conflict of interest

None declared.

Contributor Information

Chuankun Zhu, Jiangsu Key Laboratory for Eco-Agriculture Biotechnology Around Hongze Lake, Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huaiyin Normal University, Huai’an 223300, China.

Haiyang Liu, Key Laboratory of Tropical and Subtropical Fishery Resources Application and Cultivation, Ministry of Agriculture and Rural Affairs, Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou 510380, China.

Zhengjun Pan, Jiangsu Key Laboratory for Eco-Agriculture Biotechnology Around Hongze Lake, Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huaiyin Normal University, Huai’an 223300, China.

Lei Cheng, Key Laboratory of Freshwater Aquatic Biotechnology and Breeding, Ministry of Agriculture and Rural Affairs, Heilongjiang Fisheries Research Institute, Chinese Academy of Fishery Sciences, Harbin 150070, China.

Yanhong Sun, Wuhan Aquaculture Science Research Institute, Wuhan 430207, China.

Hui Wang, Jiangsu Key Laboratory for Eco-Agriculture Biotechnology Around Hongze Lake, Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huaiyin Normal University, Huai’an 223300, China.

Guoliang Chang, Jiangsu Key Laboratory for Eco-Agriculture Biotechnology Around Hongze Lake, Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huaiyin Normal University, Huai’an 223300, China.

Nan Wu, Jiangsu Key Laboratory for Eco-Agriculture Biotechnology Around Hongze Lake, Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huaiyin Normal University, Huai’an 223300, China.

Huaiyu Ding, Jiangsu Key Laboratory for Eco-Agriculture Biotechnology Around Hongze Lake, Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huaiyin Normal University, Huai’an 223300, China.

Haitao Zhao, Jiangsu Key Laboratory for Eco-Agriculture Biotechnology Around Hongze Lake, Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Jiangsu Engineering Laboratory for Breeding of Special Aquatic Organisms, Huaiyin Normal University, Huai’an 223300, China.

Lei Zhang, Key Laboratory of Fishery Sustainable Development and Water Environment Protection of Huai’an City, Huai’an Sub Center of the Institute of Hydrobiology, Chinese Academy of Sciences, Huai’an 223002, China.

Xiangsheng Yu, Huai’an Fisheries Technical Guidance Station, Huai’an 223001, China.

Data Availability

The raw data of genome and RNA sequencing have been deposited in the Sequence Read Archive (SRA) database under the BioProject PRJNA813030, and the genome assembly is available at GenBank under the BioProject PRJNA810258.

References

  • 1. Eschmeyer W.N., Fong J.D.. 2021, Catalog of fishes: species by family/subfamily electronic version. https://www.calacademy.org/scientists/projects/catalog-of-fishes (4 September, date last accessed).
  • 2. Zhu C., Pan Z.. 2021, Progress of karyotype studies on Siluriformes in China, J. Fish. China, 46, 659–75. [Google Scholar]
  • 3. Ueno K. 1974, Chromosomal polymorphism and variant of isozymes in geographical populations of Pseudobagrus aurantiacus, Bagridae, Japan. J. Ichthyol., 21, 158–64. [Google Scholar]
  • 4. Ma Y., Zhang F., Mu P., Yuan W.. 2013, Analysis the karotype of Mystus macropterus Bleeker in the Jialin River, Modern Agricul. Sci. Technol., 2013, 260–1. [Google Scholar]
  • 5. LeGrande W. 1981, Chromosomal evolution in North American catfishes (Siluriformes: Ictaluridae) with particular emphasis on the Madtoms, Noturus, Copeia, 1981, 33–52. [Google Scholar]
  • 6. Xu P., Xu J., Liu G., et al. 2019, The allotetraploid origin and asymmetrical genome evolution of the common carp Cyprinus carpio, Nat. Commun., 10, 4625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Purcell C.M., Seetharam A.S., Snodgrass O., Ortega-García S., Hyde J.R., Severin A.J.. 2018, Insights into teleost sex determination from the Seriola dorsalis genome assembly, BMC Genomics, 19, 31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Li X.Y., Gui J.F.. 2018, Diverse and variable sex determination mechanisms in vertebrates, Sci. China Life Sci., 61, 1503–14. [DOI] [PubMed] [Google Scholar]
  • 9. Gammerdinger W.J., Kocher T.D.. 2018, Unusual diversity of sex chromosomes in African cichlid fishes, Genes., 9, 480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Tao W., Xu L., Zhao L., et al. 2021, High-quality chromosome-level genomes of two tilapia species reveal their evolution of repeat sequences and sex chromosomes, Mol. Ecol. Resour., 21, 543–60. [DOI] [PubMed] [Google Scholar]
  • 11. Ellegren H. 2011, Sex-chromosome evolution: recent progress and the influence of male and female heterogamety, Nat. Rev. Genet., 12, 157–66. [DOI] [PubMed] [Google Scholar]
  • 12. Mei J., Gui J.F.. 2015, Genetic basis and biotechnological manipulation of sexual dimorphism and sex determination in fish, Sci. China Life Sci., 58, 124–36. [DOI] [PubMed] [Google Scholar]
  • 13. Cioffi M.B., Bertollo L.A.. 2010, Initial steps in XY chromosome differentiation in Hoplias malabaricus and the origin of an X(1)X(2)Y sex chromosome system in this fish group, Heredity., 105, 554–61. [DOI] [PubMed] [Google Scholar]
  • 14. Li M., Zhang R., Fan G., et al. 2021, Reconstruction of the origin of a Neo-Y sex chromosome and its evolution in the spotted knifejaw, Oplegnathus punctatus, Mol. Biol. Evol., 38, 2615–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Liu H., Pang M., Yu X., Zhou Y., Tong J., Fu B.. 2018, Sex-specific markers developed by next-generation sequencing confirmed an XX/XY sex determination system in bighead carp (Hypophthalmichthys nobilis) and silver carp (Hypophthalmichthys molitrix), DNA Res., 25, 257–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Zhou Y., Wu J., Wang Z., et al. 2019, Identification of sex-specific markers and heterogametic XX/XY sex determination system by 2b-RAD sequencing in redtail catfish (Mystus wyckioides), Aquac. Res., 50, 2251–66. [Google Scholar]
  • 17. Xue L.-Z., Guo X.-F., Zhou Y-L., et al. 2020, Screening and characterization of sex-specific markers by 2b-RAD sequencing in zig-zag eel (Mastacembelus armatus) with implication of XY sex determination system, Aquaculture, 528, 735550. [Google Scholar]
  • 18. Dan C., Mei J., Wang D., Gui J.F.. 2013, Genetic differentiation and efficient sex-specific marker development of a pair of Y- and X-linked markers in yellow catfish, Int. J. Biol. Sci., 9, 1043–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Pan Z.J., Li X.Y., Zhou F.J., Qiang X.G., Gui J.F.. 2015, Identification of sex-specific markers reveals male heterogametic sex determination in Pseudobagrus ussuriensis, Mar. Biotechnol. (NY), 17, 441–51. [DOI] [PubMed] [Google Scholar]
  • 20. Xu D., Lou B., Li S., et al. 2015, A novel sex-linked microsatellite marker for molecular sexing in rock bream fish Oplegnathus fasciatus, Biochem. Syst. Ecol., 62, 66–8. [Google Scholar]
  • 21. Zhu C., Pan Z., Wang H., Chang G., Ding H.. 2018, A novel male-associated marker for sex identification in Ussuri catfish Pseudobagrus ussuriensis, J. Ocean. Limnol., 36, 850–2. [Google Scholar]
  • 22. Fang S., Zhang Y., Shi X., et al. 2020, Identification of male-specific SNP markers and development of PCR-based genetic sex identification technique in crucifix crab (Charybdis feriatus) with implication of an XX/XY sex determination system, Genomics, 112, 404–11. [DOI] [PubMed] [Google Scholar]
  • 23. Koyama T., Ozaki A., Yoshida K., et al. 2015, Identification of sex-linked SNPs and sex-determining regions in the yellowtail genome, Mar. Biotechnol. (NY), 17, 502–10. [DOI] [PubMed] [Google Scholar]
  • 24. Shi X., Waiho K., Li X., et al. 2018, Female-specific SNP markers provide insights into a WZ/ZZ sex determination system for mud crabs Scylla paramamosain, S. tranquebarica and S. serrata with a rapid method for genetic sex identification, BMC Genomics, 19, 981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Wei J., Chen Y., Wang W.. 2019, A high-density genetic linkage map and qtl mapping for sex and growth-related traits of large-scale loach (Paramisgurnus dabryanus), Front. Genet., 10, 1023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Qiu C., Han Z., Li W., Ye K., Xie Y., Wang Z.. 2018, A high-density genetic linkage map and QTL mapping for growth and sex of yellow drum (Nibea albiflora), Sci. Rep., 8, 17271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Wen M., Feron R., Pan Q., et al. 2020, Sex chromosome and sex locus characterization in goldfish, Carassius auratus (Linnaeus, 1758), BMC Genomics, 21, 552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Lin H., Zhou Z., Zhao J., et al. 2021, Genome-wide association study identifies genomic loci of sex determination and gonadosomatic index traits in large yellow croaker (Larimichthys crocea), Mar. Biotechnol. (NY), 23, 127–39. [DOI] [PubMed] [Google Scholar]
  • 29. Zhou Z., Liu B., Chen B., et al. 2019, The sequence and de novo assembly of Takifugu bimaculatus genome using PacBio and Hi-C technologies, Sci. Data., 6, 187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Ge H., Lin K., Shen M., et al. 2019, De novo assembly of a chromosome-level reference genome of red-spotted grouper (Epinephelus akaara) using nanopore sequencing and Hi-C, Mol. Ecol. Resour., 19, 1461–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Tian H.F., Hu Q.M., Li Z.. 2021, A high-quality de novo genome assembly of one swamp eel (Monopterus albus) strain with PacBio and Hi-C sequencing data, G3, 11, jkaa032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Liu Z., Liu S., Yao J., et al. 2016, The channel catfish genome sequence provides insights into the evolution of scale formation in teleosts, Nat. Commun., 7, 11757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Gong G., Dan C., Xiao S., et al. 2018, Chromosomal-level assembly of yellow catfish genome using third-generation DNA sequencing and Hi-C analysis, GigaScience., 7, giy120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Jiang W., Lv Y., Cheng L., et al. 2019, Whole-genome sequencing of the giant devil Catfish, Bagarius yarrelli, Genome Biol. Evol., 11, 2071–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Zheng S., Shao F., Tao W., et al. 2021, Chromosome-level assembly of southern catfish (Silurus meridionalis) provides insights into visual adaptation to nocturnal and benthic lifestyles, Mol. Ecol. Resour., 21, 1575–92. [DOI] [PubMed] [Google Scholar]
  • 36. He W.P., Zhou J., Li Z., et al. 2021, Chromosome-level genome assembly of the Chinese longsnout catfish Leiocassis longirostris, Zool. Res., 42, 417–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Gao Z., You X., Zhang X., et al. 2021, A chromosome-level genome assembly of the striped catfish (Pangasianodon hypophthalmus), Genomics, 113, 3349–56. [DOI] [PubMed] [Google Scholar]
  • 38. Shao F., Pan H., Li P., Ni L., Xu Y., Peng Z.. 2021, Chromosome-level genome assembly of the Asian red-tail catfish (Hemibagrus wyckioides), Front. Genet., 12, 747684. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Zhou Y., Qin W., Zhong H., Zhang H., Zhou L.. 2021, Chromosome-level assembly of the Hypophthalmichthys molitrix (Cypriniformes: Cyprinidae) genome provides insights into its ecological adaptation, Genomics, 113, 2944–52. [DOI] [PubMed] [Google Scholar]
  • 40. Han Z., Liu M., Liu Q., Zhai H., Xiao S., Gao T.. 2021, Chromosome-level genome assembly of burbot (Lota lota) provides insights into the evolutionary adaptations in freshwater, Mol. Ecol. Resour., 21, 2022–33. [DOI] [PubMed] [Google Scholar]
  • 41. Thompson A.W., Hawkins M.B., Parey E., et al. 2021, The bowfin genome illuminates the developmental evolution of ray-finned fishes, Nat. Genet., 53, 1373–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Krabbenhoft T.J., MacGuigan D.J., Backenstose N.J.C., et al. 2021, Chromosome-level genome assembly of Chinese sucker (Myxocyprinus asiaticus) reveals strongly conserved synteny following a catostomid-specific whole-genome duplication, Genome Biol. Evol., 13, evab190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Chen L., Li B., Chen B., et al. 2021, Chromosome-level genome of Poropuntius huangchuchieni provides a diploid progenitor-like reference genome for the allotetraploid Cyprinus carpio, Mol. Ecol. Resour., 21, 1658–69. [DOI] [PubMed] [Google Scholar]
  • 44. Liu H., Chen C., Lv M., et al. 2021, A chromosome-level assembly of blunt snout bream (Megalobrama amblycephala) genome reveals an expansion of olfactory receptor genes in freshwater fish, Mol. Biol. Evol., 38, 4238–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Liu D., Wang X., Guo H., Zhang X., Zhang M., Tang W.. 2021, Chromosome-level genome assembly of the endangered humphead wrasse Cheilinus undulatus: insight into the expansion of opsin genes in fishes, Mol. Ecol. Resour., 21, 2388–406. [DOI] [PubMed] [Google Scholar]
  • 46. Zhang Y., Gao H., Li H., et al. 2020, The white-spotted bamboo shark genome reveals chromosome rearrangements and fast-evolving immune genes of Cartilaginous Fish, iScience., 23, 101754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Peng J., Li Q., Xu L., et al. 2020, Chromosome-level analysis of the Crassostrea hongkongensis genome reveals extensive duplication of immune-related genes in bivalves, Mol. Ecol. Resour., 20, 980–94. [DOI] [PubMed] [Google Scholar]
  • 48. Zhou Q., Gao H., Zhang Y., et al. 2019, A chromosome-level genome assembly of the giant grouper (Epinephelus lanceolatus) provides insights into its innate immunity and rapid growth, Mol. Ecol. Resour., 19, 1322–32. [DOI] [PubMed] [Google Scholar]
  • 49. Yin Y.H., Zhang X.H., Wang X.A., et al. 2021, Construction of a chromosome-level genome assembly for genome-wide identification of growth-related quantitative trait loci in Sinocyclocheilus grahami (Cypriniformes, Cyprinidae), Zool. Res., 42, 262–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Xu Z., Gao T., Xu Y., et al. 2021, A chromosome-level reference genome of red swamp crayfish Procambarus clarkii provides insights into the gene families regarding growth or development in crustaceans, Genomics, 113, 3274–84. [DOI] [PubMed] [Google Scholar]
  • 51. Martinez P., Robledo D., Taboada X., et al. 2021, A genome-wide association study, supported by a new chromosome-level genome assembly, suggests sox2 as a main driver of the undifferentiatiated ZZ/ZW sex determination of turbot (Scophthalmus maximus), Genomics, 113, 1705–18. [DOI] [PubMed] [Google Scholar]
  • 52. Einfeldt A.L., Kess T., Messmer A., et al. 2021, Chromosome level reference of Atlantic halibut Hippoglossus hippoglossus provides insight into the evolution of sexual determination systems, Mol. Ecol. Resour., 21, 1686–96. [DOI] [PubMed] [Google Scholar]
  • 53. Cai M., Zou Y., Xiao S., et al. 2019, Chromosome assembly of Collichthys lucidus, a fish of Sciaenidae with a multiple sex chromosome system, Sci. Data., 6, 132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Zhu K.C., Zhang N., Liu B.S., et al. 2021, A chromosome-level genome assembly of the yellowfin seabream (Acanthopagrus latus; Hottuyn, 1782) provides insights into its osmoregulation and sex reversal, Genomics, 113, 1617–27. [DOI] [PubMed] [Google Scholar]
  • 55. Xiao L., Guo Y., Wang D., et al. 2020, Beta-hydroxysteroid dehydrogenase genes in orange-spotted grouper (Epinephelus coioides): genome-wide identification and expression analysis during sex reversal, Front. Genet., 11, 161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Xiao Y., Xiao Z., Ma D., et al. 2020, Chromosome-level genome reveals the origin of Neo-Y chromosome in the male barred knifejaw Oplegnathus fasciatus, iScience., 23, 101039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Chu X., Zheng B., Dai D.. Fauna Sinica. Science Press, Beijing, 1999, p.60–1. [Google Scholar]
  • 58. Wu J., Zhang Y., Huo T., Sun X.. 2013, Genetic structure of Pseudobagrus ussuriensis from Heilongjiang and Songhua River based on mtDNA sequences, Freshwater Fisheries, 43, 16–20. [Google Scholar]
  • 59. Pan W., Wang P., Zhao C., Qi J.. 2001, Preliminary study on propagation biology and artificial breeding of Pseudobagras ussuriensis, Chin. J. Fish., 14, 1–3. [Google Scholar]
  • 60. Wu Y. 2016, Fingerling cultivation and cage culture techniques of Pseudobagrus ussuriensis, Scientific Fish Farming, 5, 37–9. [Google Scholar]
  • 61. Chen Y., Chen X., Pan Y., et al. 2010, RAPD analysis of genetic variation between Pelteobagrus vachelli (♀) and Pseudobagrus ussuriensis (♂) and their hybrids, J. Shanghai Ocean Univ., 19, 12–8. [Google Scholar]
  • 62. Wang M., Cai Y., Chen X., et al. 2013, Growth comparison of offsprings in ussuri bullhead Pseudobagrus ussuriensis and darkbarble catfish Pelteobagrus vachelli and their reciprocal hybrids, Fish. Sci., 32, 50–4. [Google Scholar]
  • 63. Bu X., Chen A., Lian X., et al. 2017, An evaluation of replacing fish meal with cottonseed meal in the diet of juvenile Ussuri catfish Pseudobagrus ussuriensis: growth, antioxidant capacity, nonspecific immunity and resistance to Aeromonas hydrophila, Aquaculture, 479, 829–37. [Google Scholar]
  • 64. Zhu C., Liu H., Cheng L., et al. 2021, Identification of sex-specific sequences through 2b-RAD sequencing in Pseudobagrus ussuriensis, Aquaculture, 539, 736639. [Google Scholar]
  • 65. Sutton M.A., Burton A.S., Zaikova E., et al. 2019, Radiation tolerance of nanopore sequencing technology for life detection on Mars and Europa, Sci. Rep., 9, 5370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Belton J.M., McCord R.P., Gibcus J.H., Naumova N., Zhan Y., Dekker J.. 2012, Hi-C: a comprehensive technique to capture the conformation of genomes, Methods., 58, 268–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Li H., Durbin R.M.. 2009, Fast and accurate short read alignment with Burrows-Wheeler transform, Bioinformatics., 25, 1754–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Walker B.J., Abeel T., Shea T., et al. 2014, Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement, PLoS One., 9, e112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Burton J.N., Adey A., Patwardhan R.P., Qiu R., Kitzman J.O., Shendure J.. 2013, Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions, Nat. Biotechnol., 31, 1119–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Langmead B., Trapnell C., Pop M., Salzberg S.L.. 2009, Ultrafast and memory-efficient alignment of short DNA sequences to the human genome, Genome Biol., 10, R25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Krzywinski M., Schein J., Birol I., et al. 2009, Circos: an information aesthetic for comparative genomics, Genome Res., 19, 1639–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Benson G. 1999, Tandem repeats finder: a program to analyze DNA sequences, Nucleic Acids Res., 27, 573–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Wang X., Wang L.. 2016, GMATA: an integrated software package for genome-scale SSR mining, marker development and viewing, Front. Plant Sci., 7, 1350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Han Y., Wessler S.R.. 2010, MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences, Nucleic Acids Res., 38, e199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Bedell J.A., Korf I., Gish W.. 2000, MaskerAid: a performance enhancement to RepeatMasker, Bioinformatics., 16, 1040–1. [DOI] [PubMed] [Google Scholar]
  • 76. Bao W., Kojima K.K., Kohany O.. 2015, Repbase update, a database of repetitive elements in eukaryotic genomes, Mob. DNA., 6, 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Lowe T.M., Eddy S.R.. 1997, tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence, Nucleic Acids Res., 25, 955–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Kalvari I., Argasinska J., Quinones-Olvera N., et al. 2018, Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families, Nucleic Acids Res., 46, D335–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Nawrocki E.P., Eddy S.R.. 2013, Infernal 1.1: 100-fold faster RNA homology searches, Bioinformatics., 29, 2933–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Lagesen K., Hallin P., Rodland E.A., Staerfeldt H.H., Rognes T., Ussery D.W.. 2007, RNAmmer: consistent and rapid annotation of ribosomal RNA genes, Nucleic Acids Res., 35, 3100–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Keilwagen J., Hartung F., Grau J.. 2019, GeMoMa: homology-based gene prediction utilizing intron position conservation and RNA-seq data, Methods Mol. Biol., 1962, 161–77. [DOI] [PubMed] [Google Scholar]
  • 82. Chen S., Zhou Y., Chen Y., Gu J.. 2018, fastp: an ultra-fast all-in-one FASTQ preprocessor, Bioinformatics, 34, i884–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Dobin A., Davis C.A., Schlesinger F., et al. 2013, STAR: ultrafast universal RNA-seq aligner, Bioinformatics., 29, 15–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Kovaka S., Zimin A.V., Pertea G.M., Razaghi R., Salzberg S.L., Pertea M.. 2019, Transcriptome assembly from long-read RNA-seq alignments with StringTie2, Genome Biol., 20, 278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Haas B.J., Salzberg S.L., Zhu W., et al. 2008, Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments, Genome Biol., 9, R7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Stanke M., Diekhans M., Baertsch R., Haussler D.. 2008, Using native and syntenically mapped cDNA alignments to improve de novo gene finding, Bioinformatics., 24, 637–44. [DOI] [PubMed] [Google Scholar]
  • 87. Simão F.A., Waterhouse R.M., Ioannidis P., Kriventseva E.V., Zdobnov E.M.. 2015, BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs, Bioinformatics., 31, 3210–2. [DOI] [PubMed] [Google Scholar]
  • 88. Zdobnov E.M., Apweiler R.. 2001, InterProScan–an integration platform for the signature-recognition methods in InterPro, Bioinformatics, 17, 847–8. [DOI] [PubMed] [Google Scholar]
  • 89. Li L., Stoeckert C.J. Jr, Roos D.S.. 2003, OrthoMCL: identification of ortholog groups for eukaryotic genomes, Genome Res., 13, 2178–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Katoh K., Standley D.M.. 2013, MAFFT multiple sequence alignment software version 7: improvements in performance and usability, Mol. Biol. Evol., 30, 772–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. Castresana J. 2000, Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis, Mol. Biol. Evol., 17, 540–52. [DOI] [PubMed] [Google Scholar]
  • 92. Stamatakis A. 2006, RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models, Bioinformatics, 22, 2688–90. [DOI] [PubMed] [Google Scholar]
  • 93. Yang Z. 2007, PAML 4: phylogenetic analysis by maximum likelihood, Mol. Biol. Evol., 24, 1586–91. [DOI] [PubMed] [Google Scholar]
  • 94. De Bie T., Cristianini N., Demuth J.P., Hahn M.W.. 2006, CAFE: a computational tool for the study of gene family evolution, Bioinformatics, 22, 1269–71. [DOI] [PubMed] [Google Scholar]
  • 95. Kurtz S., Phillippy A., Delcher A.L., et al. 2004, Versatile and open software for comparing large genomes, Genome Biol., 5, R12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. Tang H., Krishnakumar V., Li J.. 2015, jcvi: JCVI utility libraries. Zenodo. https://doi.org/10.5281/zenodo.31631.
  • 97. Zhu C., Liu H., Pan Z., et al. 2019, Construction of a high-density genetic linkage map and QTL mapping for growth traits in Pseudobagrus ussuriensis, Aquaculture, 511, 734213. [Google Scholar]
  • 98. Langmead B. 2010, Aligning short sequencing reads with Bowtie, Curr. Protoc. Bioinformatics, Chapter 11, Unit 11.17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99. Van Ooijen J. 2009, MapQTL 6, Sofware for the mapping of quantitative trait loci in experimental populations of diploid species. Kyazma BV, Wageningen, The Netherlands.
  • 100. Pan Z., Zhu C., Chang G., Wu N., Ding H., Wang H.. 2021, Differential expression analysis and identification of sex-related genes by gonad transcriptome sequencing in estradiol-treated and non-treated Ussuri catfish Pseudobagrus ussuriensis, Fish Physiol. Biochem., 47, 565–81. [DOI] [PubMed] [Google Scholar]
  • 101. Xiao S.J., Mou Z.B., Yang R.B., et al. 2021, Genome and population evolution and environmental adaptation of Glyptosternon maculatum on the Qinghai-Tibet Plateau, Zool. Res., 42, 502–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102. Chen W., Zou M., Li Y., Zhu S., Li X., Li J.. 2021, Sequencing an F1 hybrid of Silurus asotus and S. meridionalis enabled the assembly of high-quality parental genomes, Sci. Rep., 11, 13797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103. Ozerov M.Y., Flajshans M., Noreikiene K., Vasemagi A., Gross R.. 2020, Draft genome assembly of the freshwater apex predator Wels catfish (Silurus glanis) using linked-read sequencing, G3 (Bethesda), 10, 3897–906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104. Li N., Bao L., Zhou T., et al. 2018, Genome sequence of walking catfish (Clarias batrachus) provides insights into terrestrial adaptation, BMC Genomics, 19, 952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105. Chen X., Zhong L., Bian C., et al. 2016, High-quality genome assembly of channel catfish, Ictalurus punctatus, GigaScience, 5, 39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106. Zhang S., Zhang X., Chen X., et al. 2019, Construction of a high-density linkage map and QTL fine mapping for growth- and sex-related traits in channel catfish (Ictalurus punctatus), Front. Genet., 10, 251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107. Xue S., Yin H.. 2008, Karyotype analyses of Pseudobagrus ussuriensis, Chin. J. Fish., 21, 75–8. [Google Scholar]
  • 108. Ku X., Peng Z., Diogo R., He S.. 2007, MtDNA phylogeny provides evidence of generic polyphyleticism for East Asian bagrid catfishes, Hydrobiologia, 579, 147–59. [Google Scholar]
  • 109. Dai F., Su J.. 1998, Study on isozymes, skeleton characters of eight bargrid catfishes and comments on phylogenetic relationship (Siluriformes: Bargridae), Acta Zootaxon. Sin., 23, 432–9. [Google Scholar]
  • 110. Liang H., Meng Y., Luo X., Li Z., Zou G.. 2018, Species identification of DNA barcoding based on COI gene sequences in Bagridae catfishes, J. Fish. Sci. China, 25, 772–82. [Google Scholar]
  • 111. Kappas I., Vittas S., Pantzartzi C.N., Drosopoulou E., Scouras Z.G.. 2016, A time-calibrated mitogenome phylogeny of catfish (Teleostei: Siluriformes), PLoS One, 11, e0166988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112. Sullivan J.P., Lundberg J.G., Hardman M.. 2006, A phylogenetic analysis of the major groups of catfishes (Teleostei: Siluriformes) using rag1 and rag2 nuclear gene sequences, Mol. Phylogenet. Evol., 41, 636–62. [DOI] [PubMed] [Google Scholar]
  • 113. Hardman M. 2005, The phylogenetic relationships among non-diplomystid catfishes as inferred from mitochondrial cytochrome b sequences; the search for the ictalurid sister taxon (Otophysi: Siluriformes), Mol. Phylogenet. Evol., 37, 700–20. [DOI] [PubMed] [Google Scholar]
  • 114. Hughes L.C., Orti G., Huang Y., et al. 2018, Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data, Proc. Natl. Acad. Sci. U S A., 115, 6249–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115. Jaillon O., Aury J.M., Brunet F., et al. 2004, Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype, Nature., 431, 946–57. [DOI] [PubMed] [Google Scholar]
  • 116. Woods I.G., Wilson C., Friedlander B., et al. 2005, The zebrafish gene map defines ancestral vertebrate chromosomes, Genome Res., 15, 1307–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117. Kasahara M., Naruse K., Sasaki S., et al. 2007, The medaka draft genome and insights into vertebrate genome evolution, Nature, 447, 714–9. [DOI] [PubMed] [Google Scholar]
  • 118. Ravi V., Venkatesh B.. 2018, The divergent genomes of teleosts, Annu. Rev. Anim. Biosci., 6, 47–68. [DOI] [PubMed] [Google Scholar]
  • 119. Volff J.N. 2005, Genome evolution and biodiversity in teleost fish, Heredity. (Edinb)., 94, 280–94. [DOI] [PubMed] [Google Scholar]
  • 120. Damas J., Corbo M., Lewin H.A.. 2021, Vertebrate chromosome evolution, Annu. Rev. Anim. Biosci., 9, 1–27. [DOI] [PubMed] [Google Scholar]
  • 121. Feulner P.G.D., Schwarzer J., Haesler M.P., Meier J.I., Seehausen O.. 2018, A dense linkage map of lake Victoria Cichlids improved the Pundamilia genome assembly and revealed a major QTL for sex-determination, G3 (Bethesda), 8, 2411–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122. Liu H., Luo Q., Ou M., Zhu X., Zhao J., Chen K.. 2020, High-density genetic linkage map and QTL fine mapping of growth and sex in snakehead (Channa argus), Aquaculture, 519, 734760. [Google Scholar]
  • 123. Zhou Y., Liu H., Wang X., Fu B., Yu X., Tong J.. 2020, QTL fine mapping for sex determination region in bighead carp (Hypophthalmichthys nobilis) and comparison with silver carp (Hypophthalmichthys molitrix), Mar. Biotechnol. (NY)., 22, 41–53. [DOI] [PubMed] [Google Scholar]
  • 124. Wang W., Hu Y., Ma Y., Xu L., Guan J., Kong J.. 2015, High-density genetic linkage mapping in turbot (Scophthalmus maximus L.) based on SNP markers and major sex- and growth-related regions detection, PLoS One, 10, e0120410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125. Conte M.A., Clark F.E., Roberts R.B., et al. 2021, Origin of a giant sex chromosome, Mol. Biol. Evol., 38, 1554–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126. Matsuda M., Nagahama Y., Shinomiya A., et al. 2002, DMY is a Y-specific DM-domain gene required for male development in the medaka fish, Nature, 417, 559–63. [DOI] [PubMed] [Google Scholar]
  • 127. Takehana Y., Matsuda M., Myosho T., et al. 2014, Co-option of Sox3 as the male-determining factor on the Y chromosome in the fish Oryzias dancena, Nat. Commun., 5, 4157. [DOI] [PubMed] [Google Scholar]
  • 128. Myosho T., Otake H., Masuyama H., et al. 2012, Tracing the emergence of a novel sex-determining gene in medaka, Oryzias luzonensis, Genetics, 191, 163–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129. Hattori R.S., Murai Y., Oura M., et al. 2012, A Y-linked anti-Mullerian hormone duplication takes over a critical role in sex determination, Proc. Natl. Acad. Sci. USA., 109, 2955–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130. Pan Q., Feron R., Yano A., et al. 2019, Identification of the master sex determining gene in Northern pike (Esox lucius) reveals restricted sex chromosome differentiation, PLoS Genet., 15, e1008013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131. Kamiya T., Kai W., Tasumi S., et al. 2012, A trans-species missense SNP in Amhr2 is associated with sex determination in the tiger pufferfish, PLoS Genet., 8, e1002798. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132. Koyama T., Nakamoto M., Morishima K., et al. 2019, A SNP in a steroidogenic enzyme is associated with phenotypic sex in seriola fishes, Curr. Biol., 29, 1901–9.e8. [DOI] [PubMed] [Google Scholar]
  • 133. Herpin A., Schartl M.. 2015, Plasticity of gene-regulatory networks controlling sex determination: of masters, slaves, usual suspects, newcomers, and usurpators, EMBO Rep., 16, 1260–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134. Bao L., Tian C., Liu S., et al. 2019, The Y chromosome sequence of the channel catfish suggests novel sex determination mechanisms in teleost fish, BMC Biol., 17, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135. Yano A., Nicol B., Jouanno E., et al. 2013, The sexually dimorphic on the Y-chromosome gene (sdY) is a conserved male-specific Y-chromosome sequence in many salmonids, Evol. Appl., 6, 486–96. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

dsac028_Supplementary_Data

Data Availability Statement

The raw data of genome and RNA sequencing have been deposited in the Sequence Read Archive (SRA) database under the BioProject PRJNA813030, and the genome assembly is available at GenBank under the BioProject PRJNA810258.


Articles from DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes are provided here courtesy of Oxford University Press

RESOURCES