The pangenome of Acanthopagrus provides genomic variation evidence for genetic diversity

Yan Hu; Wenhao Wang; Hao Wang; Zhanyuan Gao; Xinru Zhu; Yuchen Yang; Jianguo Lu

doi:10.1016/j.isci.2026.115539

2026 Mar 30;29(4):115539. doi: 10.1016/j.isci.2026.115539

PMCID: PMC13091521 PMID: 42006376

Summary

Acanthopagrus species are ecologically and economically important sparid fishes with strong environmental adaptability. To characterize the genomic architecture underlying their adaptation and phenotypic diversity, a population-scale survey of intraspecific variation in yellowfin seabream (A. latus) was conducted using whole-genome resequencing data from 80 wild individuals. Among the genomic variations identified, structural variations (SVs) contribute disproportionately to genomic diversity, defining 11,463 variable genes associated with immunity, ion transport, and environmental adaptation. Compared to the core genes with conserved functions in basic metabolism, these variable genes exhibit lower expression levels but higher transcriptional variance. Furthermore, a pangenome graph for Acanthopagrus species identifies a 24-bp deletion in the gch2 promoter of blackhead seabream (A. schlegelii) as a candidate variant for the loss of xanthophore pigmentation. This deletion disrupts a conserved motif, potentially impairing Pax7a-mediated regulation. These findings uncover genetic mechanisms driving adaptation and phenotypic divergence in Acanthopagrus species of high aquaculture value.

Subject area: Zoology, Ichthyology, Genetics, Genomics

Highlights

• A graph pangenome of Acanthopagrus reveals genomic diversity beyond the reference
• Structural variants, rather than single-nucleotide polymorphisms (SNPs), dominantly shape the variable gene landscape
• Transcriptional flexibility of variable genes is a key source of adaptive plasticity
• Variants in gch2 may contribute to xanthophore pigmentation divergence in Acanthopagrus

Introduction

The genus Acanthopagrus, a cornerstone of the family Sparidae, possesses substantial economic and ecological value.¹ These fishes are euryhaline and adapt to a broad range of water salinities across coastal and estuarine habitats,²,³ and most species are protandrous hermaphrodites.⁴,⁵ Furthermore, Acanthopagrus species are primary targets for commercial fisheries and recreational angling, underpinning significant socio-economic interests.⁶,⁷,⁸ Within this genus, yellowfin seabream (A. latus) and blackhead seabream (A. schlegelii) are of paramount economic importance. While they share many biological traits, they can be distinguished by their fin color: yellowfin seabream features vivid yellow fins, while blackhead seabream displays black-colored fins. This phenotypic divergence is not only taxonomically significant but also serves as a critical quality trait that directly influences consumer preference and market valuation. However, the genetic architecture underlying these distinct color phenotypes remains largely uncharacterized.

Recent advancements in genomics have revolutionized our understanding of genomic variations in Acanthopagrus and other marine taxa. Linear reference genomes, assembled via next-generation sequencing (NGS) and long-read sequencing technologies, have greatly propelled population genomics research.⁹,¹⁰,¹¹ These resources facilitate precise identification of single-nucleotide polymorphisms (SNPs), the most prevalent form of genetic variation, and have been instrumental in deciphering population structure, adaptive processes, and the genetic basis of key traits.¹²,¹³,¹⁴ For instance, SNP-based approaches have successfully identified genetic markers associated with growth rate, disease resistance, and environmental adaptation of aquaculture species, offering critical insights into aquaculture and conservation management.¹⁵,¹⁶,¹⁷ However, while SNPs are highly informative, they capture only a fraction of total genetic diversity; other forms of genomic variations remain comparatively understudied despite their potential functional significance.

Beyond SNPs, genomic variations also encompass insertions and deletions (INDELs, <50 bp) and structural variations (SVs, >50 bp). The latter include deletions (DELs), insertions (INSs), duplications (DUPs), and inversions (INVs).¹⁸,¹⁹,²⁰,²¹ These variations have been shown to substantially influence an organism’s phenotypes by increasing genome diversity, rewiring epigenetic modifications, and altering gene transcription, thereby playing a pivotal role in adaptation and speciation.²²,²³,²⁴,²⁵ For instance, in Chrysophrys auratus, SVs and INDELs collectively accounted for three times more variation than SNPs; this suggests they may play a more significant role in driving the genetic divergence in marine teleost genomes than SNPs.²⁶ However, single-reference genome approaches may introduce bias by overlooking non-reference alleles, leading to underestimated genetic diversity and flawed biological interpretations across species.²⁷,²⁸,²⁹

To address these limitations, pangenomics has emerged as a powerful framework for capturing the full spectrum of genetic diversity.³⁰,³¹ A pangenome represents the collective sequence entities of multiple individuals within a species or closely related taxa. It comprises the core genome, representing the genome regions shared across all individuals, and the accessory genome, which consists of sequences specific to certain individuals or lineages.³²,³³ Among the mathematical frameworks developed for pangenome analysis, the pangenome graph has been widely used to integrate multiple genomic datasets into a single data structure.³⁴ In this model, individual-specific genetic variations are represented as unique paths, which significantly reduces the reference bias inherent in linear reference frameworks while facilitating the discovery of genetic variations associated with adaptation, disease resistance, and other critical traits.³⁵,³⁶,³⁷,³⁸ However, the application of pangenome graphs remains largely unexplored in marine biology, despite its extensive use in studies of humans, crops, and livestock.

Here, we present the pangenome graph for the genus Acanthopagrus, integrating SNPs, INDELs, and SVs to provide a holistic view of its genetic landscape. Using the PanGenome Graph Builder (PGGB) pipeline, we constructed a reference-free, bidirected sequence variation graph incorporating genomic data from yellowfin seabream and blackhead seabream.³⁹ This resource facilitated the identification of associations between pigmentation genes and phenotypic divergence between the two species, shedding light on the genetic mechanisms driving adaptive evolution and speciation within Acanthopagrus. These findings represent a significant advancement in marine genomics, laying a strong foundation for understanding species divergence and supporting genomic-assisted selection and breeding in aquaculture.

Results

Characterization of SV in wild yellowfin seabream

From the 80 yellowfin seabream individuals, we initially detected 61,215,259 genomic variations (STAR Methods, Figure S1), including 54,304,264 SNPs, 6,709,422 INDELs, 189,852 DELs, 9,277 DUPs, and 2,444 INVs. Following filtration for a minor allele frequency (MAF) > 0.1, 33,662,926 SNPs (62.0%), 3,205,257 INDELs (47.8%), 65,898 DELs (34.7%), 4,428 DUPs (47.8%), and 1,376 INVs (56.3%) were retained for downstream analyses (Figures 1A–1F). A total of 1,639 high-confidence chromatin interaction regions were found to co-localize with SVs. Notably, 1,920 SVs were anchored at one end of an interaction, while their corresponding distal anchors overlapped with gene promoter regions (≤2 kb upstream of the transcription start site, TSS). This indicates that these SVs may affect chromatin spatial architecture and gene regulation (Figure 1G). Across most variation types, the majority (∼60.5–71.9%) occurred in intergenic and intronic regions. Only 18.1% of the variants (11,094,093) were identified in exonic and promoter regions of genes, potentially exerting direct or indirect effects on their protein coding and expression regulation (Figure 1H). In contrast, DUPs exhibited a distinct genomic distribution, with a pronounced enrichment (∼70%) within promoter regions (Figure 1H). Regarding size distribution, over 80% of the identified SVs were relatively small (<1,000 bp); nonetheless, DUPs displayed a significantly greater average length than DELs and INVs (Figure 1I).
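The MAF filter described above can be illustrated with a minimal sketch. It assumes biallelic sites with diploid, VCF-style GT strings and takes MAF to be the frequency of the less common allele; the function names and toy genotypes are hypothetical and are not taken from the authors' pipeline (Data S1).

```python
from collections import Counter


def minor_allele_frequency(genotypes):
    """Return the MAF of a biallelic site from VCF-style GT strings.

    Missing calls ("./.") are skipped; phased ("|") and unphased ("/")
    separators are both accepted.
    """
    counts = Counter()
    for gt in genotypes:
        for allele in gt.replace("|", "/").split("/"):
            if allele in ("0", "1"):
                counts[allele] += 1
    total = counts["0"] + counts["1"]
    if total == 0:
        return 0.0  # no called alleles at this site
    return min(counts["0"], counts["1"]) / total


def passes_maf(genotypes, threshold=0.1):
    """Keep a site only when its MAF is strictly above the threshold."""
    return minor_allele_frequency(genotypes) > threshold


# Toy sites (hypothetical genotypes for 10 individuals).
common = ["0/1", "0/0", "1/1", "0/1", "0/0", "0/0", "0/1", "0/0", "0/0", "0/0"]
rare = ["0/1"] + ["0/0"] * 9

for name, site in (("common", common), ("rare", rare)):
    print(name, minor_allele_frequency(site), passes_maf(site))

# Retained fraction of SNPs, using the counts reported in the text.
snp_retained = 33_662_926 / 54_304_264
print(f"SNPs retained: {snp_retained:.1%}")
```

In the toy data the first site carries 5 alternate alleles out of 20 and is kept, while the second carries 1 out of 20 and is dropped.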

Landscape of genomic variations in yellowfin seabream (*Acanthopagrus latus*) population

(A–F) Circos plot illustrates the genomic distribution of five variant types (SNP, INDEL, DEL, DUP, and INV) across the 24 chromosomes. All densities were calculated using a 100 kb non-overlapping sliding window. Tracks from outer to inner: (A) Gene density (black), with darker shades representing higher density.

(B) SNP density (orange-red).

(C) INDEL density (pink).

(D) Deletion (DEL) density (orange histogram) and minor allele frequency (MAF) (scatterplot, range: 0.1–1.0).

(E) Duplication (DUP) density (green histogram) and MAF (scatterplot, range: 0.1–1.0).

(F) Inversion (INV) density (blue histogram) and MAF (scatterplot, range: 0.1–1.0).

(G) Chromatin interaction map shows significant inter- and intra-chromosomal interactions (links) identified by Hi-C data. Link colors correspond to the source chromosomes.

(H) Distribution of the five variant types across various genomic features (e.g., promoter, UTR, and exon). The y axis represents the relative proportion of each variant type within specific regions.

(I) Density plots show the length distribution for large structural variants (SVs), including DELs (orange), DUPs (green), and INVs (blue). The x axis represents the SV length (bp), and the y axis represents the density estimation.

Functional enrichment of core genes and variable genes

Core genes were defined as those lacking any high-impact variants (STAR Methods) across all individuals (all GT values are “0/0”), whereas variable genes were defined as those harboring high-impact variants (GT value ≠ “0/0”). We then performed iterative random sampling to simulate the dynamic fluctuations in core and variable gene counts relative to population size (rarefaction analysis, Figure S2). However, the resulting saturation curves failed to converge smoothly, regardless of whether SNPs or SVs were utilized (Figures S3 and S4). To investigate this heterogeneity, we reconstructed a neighbor-joining (NJ) phylogenetic tree based on high-impact SNPs to screen for potential outliers. The analysis revealed 21 significant outliers, predominantly from the Zhanjiang (ZJ) population (18/21), that formed a divergent clade (Figure S5). This suggests that these individuals experienced distinct evolutionary dynamics from the rest of the cohort, leading to the failure of rarefaction analysis.¹⁷ After removing these outliers, the rarefaction analysis yielded smooth saturation curves.
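The core/variable definition and the random-subsampling idea can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: the input layout (`gene_gts`), the function names, the iteration count, and the toy genotypes are all hypothetical, and a sample with no recorded genotype for a gene is treated as reference.

```python
import random


def classify_genes(gene_gts, samples):
    """Split genes into core and variable sets for the chosen samples.

    gene_gts maps gene -> {sample: [GT strings of its high-impact variants]}.
    A gene is core when every GT in every chosen sample is "0/0".
    """
    core, variable = set(), set()
    for gene, per_sample in gene_gts.items():
        is_core = all(
            gt == "0/0"
            for sample in samples
            for gt in per_sample.get(sample, [])
        )
        (core if is_core else variable).add(gene)
    return core, variable


def rarefaction(gene_gts, all_samples, n_iter=100, seed=0):
    """Mean core and variable gene counts for each subsample size 1..N."""
    rng = random.Random(seed)
    curve = []
    for n in range(1, len(all_samples) + 1):
        core_total = var_total = 0
        for _ in range(n_iter):
            subset = rng.sample(all_samples, n)
            core, variable = classify_genes(gene_gts, subset)
            core_total += len(core)
            var_total += len(variable)
        curve.append((n, core_total / n_iter, var_total / n_iter))
    return curve


# Toy input: three genes, three samples (hypothetical genotypes).
gene_gts = {
    "geneA": {"s1": ["0/0"], "s2": ["0/0"], "s3": ["0/0"]},
    "geneB": {"s1": ["0/1"], "s2": ["0/0"], "s3": ["0/0"]},
    "geneC": {"s1": ["0/0"], "s2": ["1/1"], "s3": ["0/1"]},
}
samples = ["s1", "s2", "s3"]
curve = rarefaction(gene_gts, samples, n_iter=50)
for n, mean_core, mean_variable in curve:
    print(n, mean_core, mean_variable)
```

As more samples are drawn, the mean core count can only shrink and the variable count can only grow; a curve that flattens indicates that additional samples add few new variable genes.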

SVs achieved faster convergence in rarefaction analysis than SNPs (Figures 2A and 2B), and we obtained the final set of core genes and variable genes at a sample size of 50. Among the 11,463 variable genes identified (38.1% of total genes), 55.7% (6,380 genes) were exclusively influenced by SVs and 25.7% (2,949 genes) were only affected by SNPs (Figure 2C). Only a small proportion (2,134 genes, 18.6%) was affected by both SNPs and SVs. Functional enrichment analysis revealed distinct biological roles for core and variable genes in the growth, development, and environmental adaptation of yellowfin seabream (Figure 2D). Core genes were primarily involved in fundamental processes such as the respiratory chain, energy metabolism, ribosome biogenesis, and protein synthesis. In contrast, variable genes were significantly enriched for Gene Ontology (GO) terms related to cell junctions, the nervous system, immunity, and ion channels. Notably, there were no overlapping terms between the two groups (Figure 2D). Furthermore, core genes exhibited significantly higher expression levels and lower inter-individual variance than variable genes (Figures 2E and 2F). A linear mixed-effects model (LMM) confirmed this trend: Elevated expression of core genes persists after controlling for sample- and gene-specific random effects (estimate = 0.092, t = 4.965, p = 6.87e−7, Figure 2F).
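The three-way split of variable genes (Figure 2C) amounts to simple set arithmetic. The sketch below uses hypothetical function names and toy gene identifiers, and then recomputes the percentages from the counts reported in the text.

```python
def partition_variable_genes(snp_genes, sv_genes):
    """Partition variable genes by the variant class that affects them."""
    snp_genes, sv_genes = set(snp_genes), set(sv_genes)
    return {
        "sv_only": sv_genes - snp_genes,
        "snp_only": snp_genes - sv_genes,
        "both": snp_genes & sv_genes,
    }


def percentages(counts):
    """Express each category count as a percentage of the total."""
    total = sum(counts.values())
    return {key: round(100 * value / total, 1) for key, value in counts.items()}


# Toy gene sets (hypothetical identifiers).
toy = partition_variable_genes({"g1", "g2"}, {"g2", "g3", "g4"})
print({key: sorted(value) for key, value in toy.items()})

# Counts reported in the text for the 11,463 variable genes.
reported = {"sv_only": 6380, "snp_only": 2949, "both": 2134}
total_variable = sum(reported.values())
reported_pct = percentages(reported)
print(total_variable, reported_pct)
```

The reported counts sum to the stated total, and the recomputed shares agree with the percentages given above.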

Characterization and functional divergence of core and variable genes

(A and B) Rarefaction analysis (saturation curves) depicts the relationship between the number of identified core and variable genes and sample size (n = 1 to 50). Genes are defined by (A) SNPs and (B) SVs, respectively. The curves demonstrate faster convergence and greater stability when utilizing SVs.

(C) Venn diagram illustrates the overlap of variable genes defined by SVs and SNPs. Values indicate the number of genes and their corresponding percentages relative to the total number of variable genes.

(D) Dot plot of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment for core and variable genes. The x axis denotes the gene category, dot size indicates the number of genes, and color intensity represents statistical significance (log₁₀(adjusted P-value)).

(E) Density plot shows the distribution of expression variance for core and variable genes across 56 transcriptome samples. Dashed lines indicate the mean variance for each group.

(F) Raincloud plots compare the mean expression levels (log₂(TPM+1)) between core and variable genes. Boxplots show the median (center line) and interquartile ranges (box limits). Statistical significance was determined using a linear mixed-effects model (LMM), accounting for random effects for samples and genes (Estimate = 0.092, t = 4.965, p = 6.87e−7).

Construction of the pangenome of Acanthopagrus

A linear pangenome was constructed for yellowfin seabream and its close relative, blackhead seabream. Over 70% of the genomic regions exhibited synteny across the four assemblies (Figure S6). Notably, large-scale INVs were identified at chromosome termini between the two species. For example, on chromosome 3, a 10 Mb fragment and 74 loci exceeding 10 kb displayed inverted orientations in blackhead seabream relative to yellowfin seabream. These discrepancies may stem from either authentic SV or assembly artifacts in the blackhead seabream genome, necessitating further validation (Figure 3A). Among the three yellowfin seabreams, we identified a higher density of syntenic regions with high sequence conservation, encompassing over 95% of core genes.

Architecture and genomic features of the *Acanthopagrus* pangenome

(A) Visualization of a large-scale inversion (∼10 Mb) identified on chromosome 3 between yellowfin seabream (*A. latus*, fAcaLat) and blackhead seabream (*A. schlegelii*, fAcaSch). The linear pangenome alignment depicts syntenic blocks, inversions, and other structural rearrangements. The top track represents the coordinates along chromosome 3.

(B) Structural evaluation of the pangenome graph. The plot displays the distribution of node counts (y axis) and total sequence length (color gradient) across different genome combinations. The “U-shaped” distribution indicates that a large number of nodes are either shared by all genomes (core) or specific to a single genome (unique). The bottom panel matrix indicates the presence (black dot) or absence of each assembly within specific node subsets.

Following rigorous correction of chromosomal orientations to ensure consistency (Figures S7–S11), we constructed a pangenome graph with a total length of 806,572,107 bp, comprising 38,639,018 nodes and 52,724,247 edges. Each node is connected by an average of 1.36 edges, with a mean edge length of 21 bp. While only 32.8% of the nodes (12,656,023) are shared across all individuals, these nodes accounted for the majority (74.4%) of total nucleotides. As expected, blackhead seabream harbored the highest proportion of species-specific sequences. Intriguingly, one yellowfin seabream assembly contained approximately 46.8 Mb of unique sequences (5.9%) absent in the other two assemblies (Figure 3B). These findings underscore that integrating multiple genomes can provide a far more comprehensive landscape of sequence variation than a single reference.
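Summary statistics of this kind can be derived directly from a graph file. The sketch below assumes a GFA v1 file with `S` (segment), `L` (link), and `P` (path) lines and, as a simplification, treats each path as one assembly; the function name and the toy graph are hypothetical. It also recomputes the ratios implied by the totals reported above.

```python
def gfa_summary(gfa_text):
    """Summarize a GFA v1 graph: nodes, edges, length, and core nodes."""
    seg_len = {}
    n_edges = 0
    paths = {}
    for line in gfa_text.splitlines():
        fields = line.split("\t")
        if fields[0] == "S":
            seg_len[fields[1]] = len(fields[2])
        elif fields[0] == "L":
            n_edges += 1
        elif fields[0] == "P":
            # Path steps look like "1+,2-,3+"; drop the orientation sign.
            paths[fields[1]] = {step[:-1] for step in fields[2].split(",")}
    # Core nodes are those visited by every path (none if there are no paths).
    core = set(seg_len) if paths else set()
    for visited in paths.values():
        core &= visited
    return {
        "nodes": len(seg_len),
        "edges": n_edges,
        "total_bp": sum(seg_len.values()),
        "core_nodes": len(core),
        "core_bp": sum(seg_len[node] for node in core),
    }


# Toy graph: one SNP-like bubble (nodes 2 and 3) between two shared nodes.
TOY_GFA = "\n".join([
    "H\tVN:Z:1.0",
    "S\t1\tACGT",
    "S\t2\tG",
    "S\t3\tT",
    "S\t4\tCCGGAA",
    "L\t1\t+\t2\t+\t0M",
    "L\t1\t+\t3\t+\t0M",
    "L\t2\t+\t4\t+\t0M",
    "L\t3\t+\t4\t+\t0M",
    "P\thapA\t1+,2+,4+\t*",
    "P\thapB\t1+,3+,4+\t*",
])
summary = gfa_summary(TOY_GFA)
print(summary)

# Ratios implied by the totals reported in the text.
nodes, edges, total_bp, shared = 38_639_018, 52_724_247, 806_572_107, 12_656_023
print(f"edges per node: {edges / nodes:.2f}")
print(f"total length per node: {total_bp / nodes:.1f} bp")
print(f"shared nodes: {shared / nodes:.1%}")
```

In the toy graph, half of the nodes are core but they hold 10 of the 12 bp, the same pattern as in the reported graph, where a minority of nodes carries most of the sequence.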

Pangenome graph reveals genetic loci associated with xanthophore pigmentation of yellowfin seabream and blackhead seabream

The primary phenotypic distinctions between yellowfin seabream and blackhead seabream lie in the distribution of yellow pigmentation in pelvic, anal, and lower caudal fins (Figures 4A and 4B). To elucidate the genetic basis of this difference, we leveraged the pangenome graph to examine sequence divergence in candidate genes associated with xanthophore formation and yellow pigmentation, specifically gch2, csf1ra, pax7b, and bco2b.⁴⁰,⁴¹,⁴²,⁴³ While the promoter and exon regions of csf1ra, pax7b, and bco2b were highly conserved (Figure S12), several genomic variations were identified in the gch2 locus of blackhead seabream (Figures 4C–4H), including a notable 24 bp INDEL within its promoter region (Figures 4E and 4F).

Genomic variation at the *gch2* locus associated with fin pigmentation

(A and B) Representative photographs and close-up views of the pelvic, anal, and lower caudal fins of (A) yellowfin seabream (*A. latus*, yellow fins) and (B) blackhead seabream (*A. schlegelii*, black fins).

(C) Linear genome alignment of the *gch2* locus. The top schematic depicts the gene structure, while the bottom tracks show alignment coverage for three yellowfin seabream genomes (orange) and one blackhead seabream genome (gray). Yellow highlights denote two major genomic variations: a 24-bp deletion (INDEL) in the promoter region and a 70-bp insertion (INS) near the transcription end site (TES).

(D) Bandage visualization of the local pangenome graph topology for the *gch2* gene. Blue loops represent complex variation bubbles.

(E) Close-up of the graph bubble corresponding to the 24-bp INDEL in the promoter. The two paths represent the reference (upper) and deletion (lower) alleles.

(F) Haplotype sequences aligned to the graph paths in (E), illustrating the specific 24-bp INDEL in blackhead seabream.

(G) Close-up of the graph bubble corresponding to the 70-bp INS.

(H) Haplotype sequences aligned to the graph paths in (G), showing the insertion sequence.

To determine if the sequence divergences identified by the pangenome graph were evolutionarily conserved, we extended our analysis to the genome resequencing data from an additional 24 yellowfin seabream and blackhead seabream individuals. The 24 bp INDEL exhibited complete fixation between the two species, characterized by distinct, non-overlapping distribution patterns (Figures 4E, 4F, 5A, and 5B). This finding was further validated by the sequencing results from fin rays (Figure S13). In contrast, other variants displayed polymorphic or ambiguous patterns (Figures 5C and 5D). These results highlight the superiority of graph-based genotyping in mitigating biases inherent in linear pangenomes, providing a more rigorous framework for precise variant identification (Figures 4C, 4G, 4H, 5C, and 5D).
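The distinction between a fixed and a polymorphic variant can be made explicit with a short sketch. It assumes biallelic, diploid GT strings, with "1" denoting the deletion allele; the function names and genotypes are hypothetical, and only the group sizes (11 yellowfin and 13 blackhead individuals) follow the resequencing set described in the data availability statement.

```python
def alt_allele_frequency(genotypes):
    """Frequency of the alternate ("1") allele among called alleles."""
    alleles = [
        allele
        for gt in genotypes
        for allele in gt.replace("|", "/").split("/")
        if allele in ("0", "1")
    ]
    if not alleles:
        raise ValueError("no called genotypes")
    return alleles.count("1") / len(alleles)


def is_fixed_difference(group_a, group_b):
    """True when one group carries only the reference allele and the
    other carries only the alternate allele."""
    freqs = {alt_allele_frequency(group_a), alt_allele_frequency(group_b)}
    return freqs == {0.0, 1.0}


# Hypothetical genotypes mimicking a fixed variant.
yellowfin_fixed = ["0/0"] * 11
blackhead_fixed = ["1/1"] * 13

# Hypothetical genotypes mimicking a polymorphic variant.
yellowfin_poly = ["0/0"] * 6 + ["0/1"] * 3 + ["1/1"] * 2
blackhead_poly = ["1/1"] * 13

print(is_fixed_difference(yellowfin_fixed, blackhead_fixed))
print(is_fixed_difference(yellowfin_poly, blackhead_poly))
```

A variant that passes this test in both species is a candidate species-diagnostic marker; one that segregates within either species is not.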

Genotypic distribution of genomic variations at the *gch2* gene across yellowfin seabream and blackhead seabream populations

(A and B) Sequence Tubemap shows read alignment support for the 24-bp INDEL. Rows represent individual samples. The green gradient represents alignment scores for the reference (non-deletion) allele. (A) In yellowfin seabream populations, high alignment scores indicate the absence of the deletion (yellowfin-type: fAcaLat_1, fAcaLat_2, and fAcaLat_3 genotype).

(B) In blackhead seabream populations, high alignment scores indicate the presence of the 24-bp deletion (blackhead-type: fAcaSch_1 genotype).

(C and D) Sequence Tubemap illustrates read alignment support for the 70-bp downstream insertion, revealing a complex and ambiguous pattern.

(C) In yellowfin seabream, reads with high alignment scores are mapped to both the insertion and non-insertion paths, indicating that this locus is polymorphic and not fixed within the yellowfin seabream population.

(D) In blackhead seabream, while the highest-scoring reads supported the insertion (blackhead-type), reads supporting the non-insertion (yellowfin-type) contained numerous SNPs (indicating low sequence identity). This conflicting alignment pattern suggests genotyping ambiguity for this locus, precluding a definitive conclusion regarding its fixation.

The SV in the promoter region of gch2 as a candidate driver of xanthophore pigmentation divergence in Acanthopagrus

To assess the functional significance of the 24 bp INDEL in blackhead seabream, we performed transcription factor binding site (TFBS) scanning within the gch2 promoter. A binding motif “ATTCAT” for the transcription factor Pax3/7 was identified to overlap with the 24 bp INDEL, with a JASPAR specificity score >0.85 (Figure 6A). Four domains were identified in the Pax7a protein, specifically the Pax3, Homeodomain, Pax7, and OAR domains (Figure 6B). The 3D structure of the Pax3 domain was then modeled using AlphaFold3. The resulting model demonstrated high reliability (ranking score = 0.84; local pLDDT >90) and excellent stereochemical quality, with 95.4% of residues in favored regions and none in disallowed regions. Structural alignment against the established paired domain crystal structure (PDB: 1MDM) confirmed a highly conserved topological fold (TM-score >0.70; RMSD = 2.58 Å). This TM-score exceeds the 0.5 threshold for structural homology, thereby validating the model for subsequent docking (Figures 6C and 6D). Molecular docking analysis showed favorable electrostatic potential compatibility at the binding interface, where the Pax3 domain forms multiple hydrogen bonds with the “ATTCAT” motif (Figures 6E and 6F). These data support a model in which Pax7a binds directly to the 24 bp INDEL sequence, suggesting that the loss of this binding site might drive the observed divergence in gch2 expression between the two species and the resulting differences in xanthophore pigmentation.
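The logic of "the deletion removes the binding motif" can be shown with a simplified sketch. It replaces the position-weight-matrix scoring used by JASPAR-based scanners with exact string matching on both strands, and the promoter sequence, flanks, and coordinates are entirely hypothetical; only the motif “ATTCAT” and the 24-bp deletion length come from the text.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_motif(seq, motif):
    """Return (start, strand) for every exact match on either strand."""
    seq, motif = seq.upper(), motif.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)


def apply_deletion(seq, start, length):
    """Remove `length` bases beginning at 0-based position `start`."""
    return seq[:start] + seq[start + length:]


# Hypothetical promoter fragment: flank + 24-bp block + flank.
left_flank = "GGCGCGCC"
deleted_block = "CAGTGATTCATGACCTGAGCTAGC"  # invented 24-bp sequence
right_flank = "GGCCGGCC"

yellowfin_like = left_flank + deleted_block + right_flank
blackhead_like = apply_deletion(yellowfin_like, len(left_flank), 24)

print(find_motif(yellowfin_like, "ATTCAT"))
print(find_motif(blackhead_like, "ATTCAT"))
```

In this toy case the motif is found once in the intact sequence and not at all after the deletion. A real scan would score each window against the Pax3/7 matrix and apply a score threshold instead of requiring an exact match.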

Proposed molecular mechanism of *gch2 cis*-regulation by the Pax7a transcription factor

(A) Motif scanning of the 24-bp sequence deleted in blackhead seabream. The highlighted “ATTCAT” sequence matches the binding motif of the Pax3/7 transcription factor family (JASPAR ID: MA2114.1) with a high specificity score (0.86).

(B) Domain architecture of the yellowfin seabream Pax7a protein. Domains are color-coded: Pax3 DNA-binding domain (orange), homeodomain (green), Pax7 domain (blue), and OAR domain (yellow).

(C) Predicted 3D structure of the interaction between Pax7a protein and the 24-bp DNA fragment via molecular docking, where the Pax3 domain (orange) directly interacts with the DNA major groove.

(D) Close-up of the interface between the Pax3 domain residues and the DNA helix.

(E) Electrostatic potential surface of the Pax3 domain (ranging from −5 kT/e [red] to +5 kT/e [blue]), illustrating charge complementarity with the negatively charged DNA backbone.

(F) Detailed view of hydrogen bond interactions. Black dashed lines denote hydrogen bonds between specific amino acid residues of Pax7a and the nucleotide bases of the 24-bp deletion. Bond distances range from 2.47 Å to 3.74 Å.

Discussion

SVs mediate sequence and regulation divergence between yellowfin seabream individuals

In this study, we performed a genome-wide investigation of SVs in yellowfin seabream populations, identifying 70,712 SVs across 80 individuals. The reliability of our dataset was established through rigorous cross-validation against external datasets and PCR experiments (Figures 5A, 5B, and S13). Although non-SNP variations were less numerous than SNPs, they affected a disproportionately larger number of genes (Figures 1A–1F and 2C), which is consistent with observations in other species.⁴⁴,⁴⁵,⁴⁶,⁴⁷ In humans, for example, rare SVs are 841- and 341-fold more likely to be strongly deleterious than rare SNPs and rare INDELs, respectively.⁴⁸ Despite their lower frequency, SVs exert a profound biological impact because a single structural event can span thousands of base pairs, frequently disrupting entire exons or critical regulatory elements. Studies in C. auratus revealed that SVs are significantly enriched in genomic regions exhibiting strong signatures of selection (e.g., extreme Tajima’s D and π values) and possess substantial predictive power for complex traits such as growth.²⁶,⁴⁹ These findings reinforce the hypothesis that SVs are not merely genomic noise but are potent drivers of phenotypic adaptation, often carrying greater functional weight than the more numerous but smaller-scale SNPs.

In the yellowfin seabream’s genome, 13,499 SVs were located within the gene body, which may disrupt coding sequences or regulatory elements, leading to direct alterations in protein structure and function.⁵⁰ Functional enrichment analysis revealed that these SV-associated genes are predominantly involved in cell junctions, nervous system, immunity, and ion channels (Figure 2D). This enrichment pattern aligns with the “genomic island of divergence” hypothesis observed in other marine teleosts, where structural variants preferentially target loci mediating environmental interactions to drive rapid adaptation.⁵¹ Given that yellowfin seabream is a euryhaline species, the high prevalence of SVs in the genes of ion channels and cell junctions likely reflects an adaptive requirement for flexible osmoregulation and signal transduction under fluctuating salinity.⁵² Similarly, the high variability in immune-related genes suggests a mechanism for maintaining high allelic diversity to cope with diverse pathogen pressures.⁵³ Thus, SV-mediated alterations in these genes likely provide the genomic plasticity necessary for population survival in heterogeneous environments.

In contrast, core genes are primarily associated with fundamental “housekeeping” processes, including the respiratory chain, energy metabolism, ribosome biogenesis, and protein synthesis (Figure 2D). Our transcriptomic analysis further revealed that these core genes exhibited significantly higher expression abundance but lower variance than the genes harboring SVs (Figures 2E and 2F). This distinct architecture mirrors the evolutionary theory of expression level-evolutionary rate (E-R) anticorrelation, which posits that highly expressed genes are subject to intense selective constraints against deleterious mutations, because such mutations can cause protein misfolding and non-specific interactions that are harmful to cell survival.⁵⁴,⁵⁵,⁵⁶

The majority of SVs in this study were located in non-coding regions (Figure 1H), which is consistent with the observation in humans, where more than 88% of causal SVs reside in non-coding regions.²³ These non-coding SVs may cause the gain or loss of cis-regulatory elements (e.g., enhancers or promoters) or alter the 3D chromatin conformation, ultimately changing gene transcription.²²,⁵⁷ Specifically, the co-localization of SVs with promoter regions and chromatin interaction anchors highlights a physical mechanism by which these variants modulate long-range gene regulation. By potentially disrupting the spatial contact between enhancers and promoters, these SVs could alter the transcriptional output and expression plasticity (Figure 1G). Furthermore, such variants may affect the binding activities of TFs to promoters or distal elements, thereby driving expression differences between individuals and contributing to the rapid adaptation across diverse habitats.

Potential role of genomic variations in mediating pigmentation differences between yellowfin seabream and blackhead seabream

Phenotypic diversity in teleosts is a key driver of adaptation to heterogeneous habitats and lineage divergence. In shallow-water environments, where short-wavelength light penetrates more effectively, fish with vibrant blue, green, and yellow colors are often highly conspicuous, facilitating intraspecific signaling, reproductive ornamentation, or aposematic displays.⁵⁸ Gaining a comprehensive understanding of the genomic mechanisms underlying these color variations is essential for elucidating the dynamics of phenotypic evolution and environmental adaptation between different species.

In the case of yellowfin seabream and blackhead seabream, the primary phenotypic difference is localized to fin coloration (Figure 4A). We hypothesize that this pigmentation divergence is driven by distinct selective pressures associated with habitat partitioning and visual communication. The bright yellow fins of yellowfin seabream likely serve as critical visual cues for schooling or mate recognition in clear, shallow coastal waters. Similar adaptive strategies have been widely documented in other teleosts; for instance, in Lake Victoria cichlids (e.g., Pundamilia nyererei) and bluefin killifish (Lucania goodei), bright yellow and red pigmentations are strongly favored by sexual selection to enhance conspecific signaling in well-lit environments.⁵⁹,⁶⁰ Conversely, the loss of yellow coloration in blackhead seabream, which results in predominantly dark or black fins, may represent an adaptation to rocky reefs or deeper habitats. Reduced conspicuousness in these environments enhances camouflage and crypsis, thereby mitigating predation risk and conferring a fitness advantage. This strategy mirrors the evolutionary trajectories observed in other benthic and reef-associated marine organisms; for instance, juvenile shore crabs (Carcinus maenas) exhibit environmentally induced color changes that improve background matching, while reef-associated gobies (Gobiidae spp.) modulate body coloration and melanin distribution to enhance crypsis in visually complex rocky habitats.⁵⁸,⁶¹

Teleost color diversity is predominantly driven by six types of neural crest-derived chromatophores: xanthophores (yellow), erythrophores (red), melanophores (black), cyanophores (blue), leucophores (white), and iridocytes (iridescent).⁶² Among these, xanthophore pigmentation is primarily regulated by the pteridine biosynthetic pathway,⁶³ in which the gch gene serves as a key rate-limiting factor.⁶⁴ Studies in red tilapia (Oreochromis spp.) demonstrated that the spatial expression of gch1 was strongly correlated with xanthophore distribution.⁶⁵,⁶⁶ Its paralog, gch2, is essential for xanthophore differentiation and pigment synthesis; gch2 deficiency in zebrafish larvae impairs pteridine production, causing xanthophores to appear pale gray despite the physical presence of these cells.⁴⁰ Conversely, gch2 overexpression in these mutants partially rescues the wild-type phenotype. In this study, we observed high sequence conservation of gch1 between yellowfin seabream and blackhead seabream, but identified seven SVs within the promoter and gene body of gch2 between the two species. Notably, a 24 bp INDEL in the blackhead seabream genome causes the loss of a Pax7 transcription factor binding motif (“ATTCAT”) in the gch2 promoter, which may alter the expression of gch2 and suppress xanthophore pigmentation in blackhead seabream (Figure 6A). This finding underscores the critical role of genomic variations in driving phenotypic differentiation between the two species and broadens our understanding of evolutionary divergence within the genus Acanthopagrus.

Beyond providing evolutionary insights, the high-resolution SV catalog and pangenome graph developed in this study can also provide a technical foundation for addressing practical challenges in aquaculture.⁶⁷ This resource helps resolve taxonomic ambiguities and preserve species integrity through the identification of species-diagnostic markers. For example, the fixed 24 bp INDEL in the gch2 promoter can be utilized to verify species purity and monitor hybrid stocks.⁶⁸ Furthermore, the pangenome framework enhances stock management and traceability; by offering higher sensitivity than traditional SNP-based methods, it enables the detection of population-specific signatures necessary to distinguish wild stocks from farmed escapees.²⁶ The pangenome graph also aids in mitigating inbreeding depression in hatchery-reared populations by identifying deleterious SVs and large-scale genomic erosions, which often accumulate under intense selection and are frequently overlooked by single linear reference genomes.⁶⁹ Finally, this work accelerates targeted molecular breeding by localizing SVs within functional gene clusters associated with cell junctions, the nervous system, and immunity, alongside identifying candidate loci for pigmentation. These SV-anchored genes represent a prioritized candidate pool for marker-assisted selection and fast-tracking genetic improvement in the aquaculture of yellowfin seabream and blackhead seabream.⁷⁰

Limitations of the study

Despite the insights provided by this pangenomic analysis, several limitations remain. First, the pangenome graph was constructed exclusively using short-read sequencing data, which may limit the discovery and characterization of SVs in complex repetitive regions with low sequence mappability. The future integration of long-read sequencing (e.g., PacBio or Oxford Nanopore) will be essential to overcome this constraint and improve the structural resolution of the pangenome. Second, while our current sample size provided a robust overview, particularly for yellowfin seabream, it may lack sufficient power to capture low-frequency accessory variants in blackhead seabream. Expanding the sample size across a broader geographic range would enhance the detection of rare genetic variations and provide a higher-fidelity map of the evolutionary landscape of these species. Finally, our findings are primarily based on in silico bioinformatic analyses. Larger-scale PCR and gene-editing validation will be necessary to confirm the functional impact of the identified SVs on the adaptation and speciation of Acanthopagrus species.

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Jianguo Lu (lujianguo@mail.sysu.edu.cn).

Materials availability

This study did not generate new unique reagents.

Data and code availability

•
Data: All 80 whole-genome resequencing datasets reported in this study have been deposited at NCBI and are publicly available. Accession numbers are listed in the key resources table. The Hi-C dataset is available under the SRA accession number SRR12328045. The 56 transcriptome datasets of yellowfin seabream are shown in Table S3. The remaining 24 resequencing datasets consist of 11 yellowfin seabreams and 13 blackhead seabreams, as shown in Table S4.
•
Code: All original code is available in this paper’s supplemental information (Data S1).
•
All other items: Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Acknowledgments

This work was supported in part by the R&D Project for Jinwan Yellowfin Seabream Breeding System Construction, China [no. K20-42000-018], the Innovation Group Project of Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), China [no. 311021006], and the National Natural Science Foundation of China [no. 31902427]. The funding bodies were not involved in the design of the study; the collection, analysis, and interpretation of data; or the writing of the manuscript.

Author contributions

Conceptualization: J. L. and Y. Y. Investigation: Y. H., W. W., and X. Z. Data analysis and visualization: Y. H., H. W. and Z. G. Funding: J. L. Writing: Y. H. and J. L. All authors read and approved the final manuscript.

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

STAR★Methods

Key resources table

REAGENT or RESOURCE	SOURCE	IDENTIFIER
Biological samples

Yellowfin seabream (A. latus) fin tissues	This paper	Sampling sites: Zhuhai, Guangdong, China
Blackhead seabream (A. schlegelii) fin tissues	This paper	Sampling sites: Zhuhai, Guangdong, China

Chemicals, peptides, and recombinant proteins

Eugenol	Sigma-Aldrich	N/A
AMPure XP beads (Agencourt SPRIselect)	Beckman Coulter	N/A

Critical commercial assays

Magnetic bead–based genomic DNA extraction kit	Tiangen Biotech (Beijing)	DP705
NEBNext® Ultra™ II DNA Library Prep Kit	New England Biolabs (NEB)	E7645
E.Z.N.A.® Tissue DNA Kit	OMEGA Bio-tek	D3396
2 × Taq Plus Master Mix (Dye Plus)	Vazyme	P211
FastPure Gel DNA Extraction Mini Kit	Vazyme	DC301

Deposited data

Raw whole-genome resequencing data (A. latus)	This paper	NCBI BioProject: PRJNA1380438, Table S1
Genome assembly (A. latus & A. schlegelii)	NCBI GenBank	See Table S2
Transcriptome datasets (56 samples)	NCBI SRA	See Table S3
Published resequencing data (A. latus & A. schlegelii, 24 samples)	NCBI SRA	See Table S4
Hi-C datasets (A. latus)	NCBI SRA	SRR12328045

Software and algorithms

iSeq (v1.1.0)	Chao et al.⁷¹	https://github.com/BioOmics/iSeq
fastp (v0.23.4)	Chen et al.⁷²	https://github.com/OpenGene/fastp
BWA-MEM (v0.7.17)	Li,⁷³	http://bio-bwa.sourceforge.net/
Sambamba (v1.0.1)	Tarasov et al.⁷⁴	https://github.com/biod/sambamba
DeepVariant (v1.5.0)	Poplin et al.⁷⁵	https://github.com/google/deepvariant
GLnexus (v1.4.1)	Yun et al.⁷⁶	https://github.com/dnanexus-rnd/GLnexus
SpeedSeq (v0.1.2)	Chiang et al.⁷⁷	https://github.com/hall-lab/speedseq
Lumpy-SV (v0.3.1)	Layer et al.⁷⁸	https://github.com/arq5x/lumpy-sv
CNVnator (v0.3.3)	Abyzov et al.⁷⁹	https://github.com/abyzovlab/CNVnator
SVTyper (v0.7.1)	Chiang et al.⁷⁷	https://github.com/hall-lab/svtyper
Svtools (v0.5.1)	Larson et al.⁸⁰	https://github.com/hall-lab/svtools
Bcftools (v1.21)	Li,⁸¹	http://samtools.github.io/bcftools/
snpEff (v5.1)	Cingolani et al.⁸²	http://pcingola.github.io/SnpEff/
GenomicRanges (R package, v1.54.1)	Lawrence et al.⁸³	https://bioconductor.org/packages/GenomicRanges
Circos (v0.69.9)	Krzywinski et al.⁸⁴	http://circos.ca/
ChIPseeker (v1.42.0)	Yu et al.⁸⁵	https://bioconductor.org/packages/ChIPseeker
Trim Galore (v0.6.10)	Babraham Bioinformatics,⁸⁶^,⁸⁷	https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/
HiC-Pro (v3.1.0)	Servant et al.⁸⁸	https://github.com/nservant/HiC-Pro
Armatus (v2.3)	Filippova et al.⁸⁹	https://github.com/kingsfordgroup/armatus
Chrom3D (v1.0.2)	Paulsen et al.⁹⁰^,⁹¹	https://github.com/Chrom3D/Chrom3D
EggNOG-mapper (v6.0)	Hernández-Plaza et al.⁹²	https://github.com/eggnogdb/eggnog-mapper
AnnotationForge (R package, v1.48.0)	Carlson & Pagès,⁹³	https://bioconductor.org/packages/AnnotationForge
clusterProfiler (R package, v4.18.1)	Xu et al.⁹⁴	https://bioconductor.org/packages/clusterProfiler
ggplot2 (R package, v4.0.0)	Wickham,⁹⁵	https://ggplot2.tidyverse.org/
Trimmomatic (v0.40)	Bolger et al.⁹⁶	https://github.com/usadellab/Trimmomatic
HISAT2 (v2.2.1)	Kim et al.⁹⁷	https://daehwankimlab.github.io/hisat2/
StringTie (v3.0.0)	Pertea et al.⁹⁸	https://ccb.jhu.edu/software/stringtie/
lme4 (R package, v1.1-37)	Bates et al.⁹⁹	https://cran.r-project.org/web/packages/lme4/
lmerTest (R package, v3.1-3)	Kuznetsova et al.¹⁰⁰	https://cran.r-project.org/web/packages/lmerTest/
minimap2 (v2.27)	Li,¹⁰¹	https://github.com/lh3/minimap2
fixchr (default version)	Goel et al.¹⁰²	https://github.com/schneebergerlab/fixchr
SyRI (v1.7.1)	Goel et al.¹⁰²	https://github.com/schneebergerlab/syri
plotsr (v1.1.0)	Goel et al.¹⁰³	https://github.com/schneebergerlab/plotsr
PGGB (v0.7.2)	Garrison et al.¹⁰⁴	https://github.com/pangenome/pggb
PanSN-spec (v0.1.0)	N/A	https://github.com/pangenome/PanSN-spec
MashMap (v3.1.3)	Jain et al.¹⁰⁵	https://github.com/marbl/MashMap
wfmash (v0.9.2)	Guarracino et al.¹⁰⁶	https://github.com/waveygang/wfmash
Seqwish (v0.7.11)	Garrison,¹⁰⁷	https://github.com/ekg/seqwish
smoothxg (v0.6.6)	Garrison et al.¹⁰⁸	https://github.com/pangenome/smoothxg
GFAffix (v0.2.1)	codia-lab,¹⁰⁹	https://github.com/codialab/GFAffix
vg toolkit (v1.57.0)	Hickey et al.¹¹⁰	https://github.com/vgteam/vg
Bandage-NG (v2025.4.1)	Wick et al.¹¹¹	https://github.com/asl/BandageNG
ODGI (v0.9.3)	Guarracino et al.¹¹²	https://github.com/pangenome/odgi
SequenceTubeMap (default version)	Beyer et al.¹¹³	https://github.com/vgteam/sequenceTubeMap
Miniprot (v0.18)	Li,¹¹⁴	https://github.com/lh3/miniprot
BLAST+ (v2.16.0)	Camacho et al.¹¹⁵	https://blast.ncbi.nlm.nih.gov/Blast.cgi
gffread (v0.12.7)	Pertea and Pertea,¹¹⁶	https://github.com/gpertea/gffread
JASPAR (database online service)	Castro-Mondragon et al.¹¹⁷	https://jaspar.elixir.no/
InterPro (online service)	Paysan-Lafosse et al.¹¹⁸	https://www.ebi.ac.uk/interpro/
AlphaFold3 (online service)	Abramson et al.¹¹⁹	https://alphafoldserver.com/
SAVES (v6.1)	UCLA-DOE LAB	https://saves.mbi.ucla.edu/
TM-align (online service)	Zhang et al.¹²⁰	https://aideepmed.com/TM-align/
UCSF ChimeraX (v1.10.1)	Meng et al.¹²¹	https://www.cgl.ucsf.edu/chimerax/


Experimental model and study participant details

The yellowfin seabream and blackhead seabream samples used in this study were sourced from fishery production. All experimental fish were 2-year-old individuals at the functional hermaphroditic stage, characterized by the simultaneous presence of both testicular and ovarian tissues. All animal handling and experimental procedures complied with the regulations and were approved by the Experimental Animal Ethics Committee of Sun Yat-sen University.

Method details

Sampling and DNA sequencing

A total of 80 yellowfin seabreams were sampled from four coastal sites (Table S1). Fish were anesthetized with 10 mg/L eugenol, and pectoral fin tissues were collected and preserved in ethanol for subsequent DNA extraction. Genomic DNA was extracted using a Magnetic Universal Genomic DNA Kit (DP705, Tiangen Biotech, Beijing, China). DNA quality and concentration were assessed using a NanoDrop spectrophotometer and a Qubit 3.0 fluorometer. Samples yielding >1.5 μg of DNA were fragmented to an average size of ∼350 bp using a Covaris ultrasonicator. Sequencing libraries were constructed using the NEBNext® Ultra™ II DNA Library Prep Kit (New England Biolabs, E7645, Ipswich, MA, USA), purified with AMPure XP beads (Agencourt SPRIselect, Brea, CA, USA), and PCR-amplified. Library quality was verified by quantitative PCR, and libraries with an effective concentration of 3 nM were then sequenced on the Illumina HiSeq 2500 platform (150-bp paired-end reads). The average sequencing depth per individual exceeded 10×, with a genome coverage rate of at least 96%, ensuring high data quality for downstream analyses.

Genomic variations identification using whole-genome resequencing data

Genome resequencing datasets for yellowfin seabream and blackhead seabream were retrieved from the NCBI, with data integrity verified using iSeq. Raw reads were filtered for quality using fastp (-z 4 -q 20 -u 30 -n 5) and aligned to the reference genome of yellowfin seabream (GCF_904848185.1, Table S2) using BWA-MEM. Alignment results were sorted, deduplicated, and indexed using Sambamba. SNPs and INDELs were called for each sample using DeepVariant (--model_type=WGS) and merged via GLnexus for joint variant calling across the population. To ensure robust discovery of SVs (DEL, DUP, and INV), the SpeedSeq pipeline was employed, which uses a multi-signal integration strategy to minimize false positives and false negatives. Specifically, Lumpy-SV was used to integrate discordant read-pair (RP) and split-read (SR) signals to cross-validate breakpoints. Split reads were extracted from BAM files using the extractSplitReads_BwaMem script. Read-depth (RD) analysis was conducted to capture copy number variants (CNVs) potentially missed by RP/SR signals. Sample genotypes were inferred using SVTyper, and the resulting variants were merged and sorted using the lmerge and lsort commands of svtools. Finally, svtools was used for population-scale genotyping, copy number annotation of non-breakend (non-BND) variants, and filtering of redundant variants.

Filtering and annotation of population variation

For each yellowfin seabream population, rare variants with a MAF <0.1 were filtered using Bcftools. SNPs and INDELs were annotated using snpEff, with a custom database constructed from the NCBI RefSeq genome assembly GCF_904848185.1 and its corresponding gene annotation. SVs, except for BND variants, were annotated using the findOverlaps function of the GenomicRanges R package. Genomic distribution and allele frequency profiles of all five variant types (SNPs, INDELs, DELs, DUPs and INVs) were analyzed via custom scripts, and visualized using Circos. Variant locations in the genome were obtained using the ChIPseeker package.
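The MAF filter described above can be illustrated with a minimal sketch. It is not the Bcftools implementation; it only shows the arithmetic for diploid, biallelic sites, assuming genotypes are encoded as allele pairs (0 = reference, 1 = alternate, None = missing). The function names and the encoding are hypothetical and chosen for illustration only.

```python
from typing import Optional, Sequence, Tuple

# One diploid genotype, e.g. (0, 1); None marks a missing allele call.
Genotype = Tuple[Optional[int], Optional[int]]


def minor_allele_frequency(genotypes: Sequence[Genotype]) -> Optional[float]:
    """Return the MAF of one biallelic site, or None if no allele was called."""
    alleles = [a for gt in genotypes for a in gt if a is not None]
    if not alleles:
        return None
    alt_freq = sum(1 for a in alleles if a == 1) / len(alleles)
    # The minor allele is whichever of ref/alt is rarer.
    return min(alt_freq, 1.0 - alt_freq)


def passes_maf_filter(genotypes: Sequence[Genotype], min_maf: float = 0.1) -> bool:
    """Keep a site only if its MAF reaches the threshold (0.1 in the text)."""
    maf = minor_allele_frequency(genotypes)
    return maf is not None and maf >= min_maf


if __name__ == "__main__":
    # Toy site: one heterozygote among ten individuals (1 alt allele out of 20).
    rare_site = [(0, 1)] + [(0, 0)] * 9
    print(minor_allele_frequency(rare_site), passes_maf_filter(rare_site))
```

In this toy case the site carries 1 alternate allele among 20 called alleles, so it falls below the 0.1 threshold and would be removed.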

Inference of significant interactions from Hi-C data

Hi-C data of yellowfin seabream (see data and code availability) were first quality-controlled using Trim Galore. To balance genomic resolution against signal-to-noise ratio, a 40 kb resolution was used for detecting genome-wide chromatin interactions using the HiC-Pro suite. The Armatus tool was then applied to call topologically associating domains (TADs) for each chromosome. The identified TADs were converted into a segmented genome using Chrom3D, and only the significant interactions between genomic segments (FDR < 0.05) were retained for visualization.

Identification of core genes and variable genes

SNPs and INDELs categorized as ‘HIGH IMPACT’ by snpEff, along with SVs overlapping gene promoters or exons, were defined as high-impact variants. These variants were used to classify genes across populations. As illustrated in Figure S2, rarefaction analysis was performed to robustly distinguish core and variable genes. For each sample size (n, ranging from 2 to 50), we randomly drew n individuals from the total population (N=80) 1,000 times. In each iteration, genes maintaining a reference genotype (GT=0/0) across all sampled individuals were identified. The final core gene set (Cⁿ) was defined as the intersection of these sets across all 1,000 iterations ($C^{n} = \bigcap_{i=1}^{1000} L_{i}^{n}$, where $L_{i}^{n}$ is the gene set identified in iteration i), ensuring the rigorous identification of stable genes. The variable gene set (Vⁿ) was obtained as the set difference between the total gene universe and the core set (U − Cⁿ). The convergence of core and variable gene counts was evaluated as the sample size (n) increased; the final core and variable gene sets were obtained at the saturation point of n=50 and used for all subsequent analyses.
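The rarefaction procedure can be sketched as follows. This is an illustrative reading of the description above, not the authors' script (the original code is in Data S1). It assumes that high-impact variants have already been summarised as a mapping from each individual to the set of genes it carries such variants in; the function name, argument names, and toy gene IDs are hypothetical.

```python
import random
from typing import Dict, Iterable, Set, Tuple


def core_and_variable(
    affected: Dict[str, Set[str]],
    universe: Iterable[str],
    n: int,
    iterations: int = 1000,
    seed: int = 0,
) -> Tuple[Set[str], Set[str]]:
    """Return (core, variable) gene sets for sample size n.

    affected maps each individual to the genes in which it carries a
    high-impact variant; every other gene is treated as reference (0/0).
    """
    rng = random.Random(seed)
    all_genes = set(universe)
    individuals = sorted(affected)
    core = set(all_genes)
    for _ in range(iterations):
        sample = rng.sample(individuals, n)
        # Genes hit by a high-impact variant in at least one sampled individual.
        hit = set().union(*(affected[ind] for ind in sample))
        # L_i^n: genes that stay reference in every sampled individual.
        reference_only = all_genes - hit
        # C^n is the running intersection of L_i^n over all iterations.
        core &= reference_only
    return core, all_genes - core


if __name__ == "__main__":
    toy = {"ind1": {"g1"}, "ind2": {"g1", "g2"}, "ind3": set(), "ind4": {"g2"}}
    core, variable = core_and_variable(toy, ["g1", "g2", "g3", "g4"], n=4, iterations=10)
    print(sorted(core), sorted(variable))
```

Repeating the call for each n from 2 to 50 and plotting the sizes of the two sets would give the kind of saturation curve referred to in Figure S2.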

Enrichment analysis of gene function

Functional annotation of the yellowfin seabream genome was performed by searching protein sequences against the eggNOG database, followed by construction of a custom gene annotation database using the AnnotationForge package. GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were conducted for genes of interest using the clusterProfiler R package, with an adjusted p-value cutoff of 0.05 and a q-value cutoff of 0.01. Enrichment results were visualized using the ggplot2 package.

Expression analysis of core genes and variable genes

Fifty-six transcriptome datasets of yellowfin seabream under normal growth conditions were retrieved from NCBI (Table S3). For each sample, raw read quality was assessed with fastp, and low-quality bases and adapter sequences were trimmed using Trimmomatic. High-quality reads were aligned to the reference genome of yellowfin seabream using HISAT2. The mapped reads were further assembled and quantified via StringTie. Gene expression levels were normalized to Transcripts Per Million (TPM) to enable comparisons across samples and genes. TPM values were log2-transformed to stabilize variance. To rigorously assess the expression differences between core and variable genes, we constructed a linear mixed-effects model (LMM) using the lme4 and lmerTest R packages. In this model, gene category (core vs. variable) was specified as a fixed effect, while sample identity and gene ID were included as random effects (formula: expr ∼ category + (1|sample) + (1|gene)) to account for inter-individual variability and baseline expression heterogeneity.
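For readers unfamiliar with the normalization, the sketch below shows how TPM is defined and how a log2 transform is applied. In the study, TPM values were produced by StringTie, so this code only illustrates the definition. The pseudocount of 1 added before the log2 transform is an assumption, since the text does not state how zero values were handled; the function names and toy values are hypothetical.

```python
import math
from typing import Dict


def tpm(counts: Dict[str, float], lengths_bp: Dict[str, float]) -> Dict[str, float]:
    """Convert per-gene read counts to Transcripts Per Million for one sample."""
    # Length-normalised rate: reads per kilobase of transcript.
    rates = {g: counts[g] / (lengths_bp[g] / 1000.0) for g in counts}
    total = sum(rates.values())
    if total == 0:
        return {g: 0.0 for g in counts}
    # Scale so that the values of one sample sum to one million.
    return {g: rate / total * 1e6 for g, rate in rates.items()}


def log2_transform(values: Dict[str, float], pseudocount: float = 1.0) -> Dict[str, float]:
    """log2(TPM + pseudocount); the pseudocount is an assumed convention."""
    return {g: math.log2(v + pseudocount) for g, v in values.items()}


if __name__ == "__main__":
    counts = {"geneA": 100, "geneB": 300, "geneC": 0}
    lengths = {"geneA": 1000, "geneB": 3000, "geneC": 2000}
    sample_tpm = tpm(counts, lengths)
    print(sample_tpm)
    print(log2_transform(sample_tpm))
```

In the toy sample, geneA and geneB have the same read density per kilobase and therefore receive the same TPM, even though their raw counts differ threefold. The log2-transformed values would then serve as the response variable (expr) in the mixed model.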

Construction of the linear pangenome of Acanthopagrus

Autosome sequences were extracted from three chromosome-level genome assemblies of yellowfin seabream and one gap-free genome assembly of blackhead seabream (Table S2). All-versus-all pairwise genome alignments were conducted using minimap2 (-ax asm5 --eqx). Homologous chromosomes between each pair of genomes were identified using fixchr, and the chromosomes were reoriented where necessary (Figures S4–S8). Synteny and SVs were then detected for each genome pair using SyRI, and the linear pangenome was visualized using plotsr.

Construction of the pangenome graph of Acanthopagrus

The Acanthopagrus pangenome graph was established using the PGGB pipeline, which fulfills two primary objectives: (1) creating a reference-free, unbiased graph structure derived from symmetric all-to-all alignments for accurate representation of non-reference sequences, and (2) preserving base-level resolution throughout the graph. The four assemblies were renamed according to PanSN-spec conventions (Table S2) and partitioned by chromosome using partition-before-pggb. Corresponding chromosomes from the four assemblies were aligned using the MashMap step of wfmash (-s 5000 -l 25000 -p 90 -n 1 -k 19 -H 0.001 -Y) to assess sequence similarity. Base-level alignments were performed using the wavefront alignment algorithm. A pangenome variation graph was built for each chromosome using seqwish (-k 23 -f 0 -B 10000000) and subsequently normalized using smoothxg (--chop-to 100 -I .9000 -R 0 -j 0 -e 0 -l 700,900,1100 -P 1,19,39,3,81,1 -O 0.001 -Y 300) and GFAffix to refine the graph structure. These graphs were converted to VCF files using vg deconstruct with default parameters and indexed for downstream genotyping. Short-read data from 24 samples (Table S4) were mapped to these graphs using vg giraffe with default parameters to generate Graph Alignment Map files. Read support was computed using vg pack, filtering for mapping quality ≥5 and base quality ≥5 to ensure data quality. Finally, variants were called for each sample using vg call, leveraging the graph snarl structure and specifying 'GCF_904848185.1' as the reference path. Graphs were visualized using Bandage-NG, ODGI, and SequenceTubeMap.
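The PanSN-spec renaming step can be illustrated with a short sketch. PanSN names each sequence as sample, haplotype, and contig joined by a delimiter, commonly "#". The helper below rewrites FASTA headers accordingly. The sample label and haplotype number used here are placeholders; the names actually used in the study are those listed in Table S2, and the function names are hypothetical.

```python
from typing import Iterable, Iterator


def pansn_name(sample: str, haplotype: int, contig: str, delim: str = "#") -> str:
    """Build a PanSN-style sequence name: sample#haplotype#contig."""
    if delim in sample or delim in contig:
        raise ValueError("sample and contig names must not contain the delimiter")
    return f"{sample}{delim}{haplotype}{delim}{contig}"


def rename_fasta_headers(lines: Iterable[str], sample: str, haplotype: int) -> Iterator[str]:
    """Yield FASTA lines with every header rewritten to its PanSN name."""
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            fields = line[1:].split()
            if not fields:
                raise ValueError("empty FASTA header")
            # Keep only the sequence ID; any free-text description is dropped.
            yield ">" + pansn_name(sample, haplotype, fields[0])
        else:
            yield line


if __name__ == "__main__":
    fasta = [">chr1 example description\n", "ACGTACGT\n", ">chr2\n", "TTGACA\n"]
    for out in rename_fasta_headers(fasta, sample="sampleA", haplotype=1):
        print(out)
```

Consistent prefixes of this kind let downstream tools group paths by sample and haplotype, which is also how a reference path such as the one passed to vg call can be selected by name.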

Verification of gene structure of gch2

Genomic DNA (gDNA) was extracted from the caudal fins of three yellowfin seabreams and three blackhead seabreams using the E.Z.N.A.® Tissue DNA Kit (D3396, OMEGA, USA). DNA concentration and quality were assessed using a NanoDrop-2000 spectrophotometer. PCR amplifications were performed using 2 × Taq Plus Master Mix (Dye Plus) (P211, Vazyme, China). Primers used for PCR are listed in Table S5. Amplification was performed in a 40 μL reaction volume containing 50 ng of gDNA, 0.5 μM of each primer, and 20 μL of 2 × Taq Plus Master Mix. The thermal cycling conditions consisted of an initial denaturation at 95°C for 3 min; 40 cycles of denaturation at 95°C for 15 s, annealing at 58°C for 15 s, and extension at 72°C for 30 s; and a final extension at 72°C for 10 min. Target products were verified via 1.5% agarose gel electrophoresis, purified using the FastPure Gel DNA Extraction Mini Kit (Vazyme, DC301, China), and subjected to Sanger sequencing using the corresponding primers.
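As a worked example of the reaction setup, the sketch below computes pipetting volumes for the 40 μL reaction. The final concentrations and the master mix volume come from the text; the primer stock concentration (10 μM) and the gDNA stock concentration (25 ng/μL) are assumptions made only for illustration, as the text does not report them.

```python
from typing import Dict


def pcr_mix_volumes(
    total_ul: float = 40.0,              # reaction volume (from the text)
    master_mix_ul: float = 20.0,         # 2x master mix (from the text)
    primer_final_um: float = 0.5,        # final concentration per primer (from the text)
    gdna_ng: float = 50.0,               # template amount (from the text)
    primer_stock_um: float = 10.0,       # ASSUMED primer stock concentration
    gdna_stock_ng_per_ul: float = 25.0,  # ASSUMED gDNA stock concentration
) -> Dict[str, float]:
    """Return the volume (in microlitres) of each reaction component."""
    # C1 * V1 = C2 * V2 for each primer.
    primer_ul = primer_final_um * total_ul / primer_stock_um
    gdna_ul = gdna_ng / gdna_stock_ng_per_ul
    water_ul = total_ul - master_mix_ul - 2 * primer_ul - gdna_ul
    if water_ul < 0:
        raise ValueError("components exceed the total reaction volume")
    return {
        "master_mix": master_mix_ul,
        "forward_primer": primer_ul,
        "reverse_primer": primer_ul,
        "gdna": gdna_ul,
        "water": water_ul,
    }


if __name__ == "__main__":
    for component, volume in pcr_mix_volumes().items():
        print(f"{component}: {volume:.1f} uL")
```

With these assumed stock concentrations, each primer and the gDNA template come to 2 μL, and 14 μL of water brings the reaction to 40 μL.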

Protein sequence prediction and structural modeling

Using the protein sequences of GCF_904848185.1 as a reference (Table S2), sequence alignment and homology-based annotation were performed for the fAcaSch_1 assembly using miniprot. Protein sequences of blackhead seabream were derived from the predicted CDS using the gffread utility from Cufflinks. Transcription factor binding motifs were scanned against the JASPAR database, and protein domains were predicted via the InterPro web service. Three-dimensional protein structures were modeled using AlphaFold3, with model reliability evaluated through predicted Local Distance Difference Test (pLDDT) and ranking scores. The stereochemical quality of the predicted structures was validated using the Structure Analysis and Verification Server (SAVES, UCLA), and structural conservation was assessed by aligning the models against the reference crystal structure (PDB: 1MDM) using TM-align. Molecular docking simulations and structural visualizations, including electrostatic potential analysis and hydrogen bond identification, were performed using UCSF ChimeraX.

Quantification and statistical analysis

For the population-scale genomic variation survey, the sample size n=80 represents the number of wild yellowfin seabream individuals. For the gene expression analysis, n=56 represents the number of independent transcriptome datasets. To compare the expression levels between core and variable genes (Figure 2F), a Linear Mixed-Effects Model was implemented using the lme4 and lmerTest R packages to account for sample identity and gene ID as random effects. Statistical significance for functional enrichment (GO and KEGG) was determined using the clusterProfiler R package, applying an adjusted p-value cutoff of 0.05 and a q-value cutoff of 0.01. In all relevant figures (e.g., Figures 1I, 2E, and 2F), the definition of center and dispersion is provided: box plots represent the median (center line) and interquartile ranges (box limits). Exact p-values, estimates, and t-values are specified directly within the figures or their corresponding legends.

Published: March 30, 2026

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.isci.2026.115539.

Contributor Information

Yuchen Yang, Email: yangych68@mail.sysu.edu.cn.

Jianguo Lu, Email: lujianguo@mail.sysu.edu.cn.

Supplemental information

Document S1. Figures S1–S13, Tables S1–S5, and Data S1

mmc1.pdf (2.5 MB, pdf)

References

1.Lin Y.J. Phenotypic divergence may facilitate co-occurrence in Acanthopagrus species ( Family : Sparidae ) J. Fish. Biol. 2025;jfb doi: 10.1111/jfb.70311. [DOI] [PubMed] [Google Scholar]
2.Mozanzadeh M.T., Safari O., Oosooli R., Mehrjooyan S., Najafabadi M.Z., Hoseini S.J., Saghavi H., Monem J. The effect of salinity on growth performance, digestive and antioxidant enzymes, humoral immunity and stress indices in two euryhaline fish species: Yellowfin seabream (Acanthopagrus latus) and Asian seabass (Lates calcarifer) Aquaculture. 2021;534 doi: 10.1016/j.aquaculture.2020.736329. [DOI] [Google Scholar]
3.Li X., Shen Y., Bao Y., Wu Z., Yang B., Jiao L., Zhang C., Tocher D.R., Zhou Q., Jin M. Physiological responses and adaptive strategies to acute low-salinity environmental stress of the euryhaline marine fish black seabream (Acanthopagrus schlegelii) Aquaculture. 2022;554 doi: 10.1016/j.aquaculture.2022.738117. [DOI] [Google Scholar]
4.Wu G.-C., Dufour S., Chang C.-F. Molecular and cellular regulation on sex change in hermaphroditic fish, with a special focus on protandrous black porgy, Acanthopagrus schlegelii. Mol. Cell. Endocrinol. 2021;520 doi: 10.1016/j.mce.2020.111069. [DOI] [PubMed] [Google Scholar]
5.Li S., Lin G., Fang W., Huang P., Gao D., Huang J., Xie J., Lu J. Gonadal Transcriptome Analysis of Sex-Related Genes in the Protandrous Yellowfin Seabream (Acanthopagrus latus) Front. Genet. 2020;11:709. doi: 10.3389/fgene.2020.00709. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Grandcourt E.M., Al Abdessalaam T.Z., Francis F., Al Shamsi A.T. Biology and stock assessment of the Sparids, Acanthopagrus bifasciatus and Argyrops spinifer (Forsskål, 1775), in the Southern Arabian Gulf. Fish. Res. 2004;69:7–20. doi: 10.1016/j.fishres.2004.04.006. [DOI] [Google Scholar]
7.Ochwada-Doyle F., Roberts D., Gray C., Barnes L., Haddy J., Fearman J. Characterizing the biological traits and life history of Acanthopagrus (Sparidae) hybrid complexes: implications for conservation and management. J. Fish. Biol. 2012;81:1540–1558. doi: 10.1111/j.1095-8649.2012.03401.x. [DOI] [PubMed] [Google Scholar]
8.Al-Husaini M., Bishop J.M., Al-Foudari H.M., Al-Baz A.F. A review of the status and development of Kuwait’s fisheries. Mar. Pollut. Bull. 2015;100:597–606. doi: 10.1016/j.marpolbul.2015.07.053. [DOI] [PubMed] [Google Scholar]
9.Lu J., Gao D., Sims Y., Fang W., Collins J., Torrance J., Lin G., Xie J., Liu J., Howe K. Chromosome-level Genome Assembly of Acanthopagrus latus Provides Insights into Salinity Stress Adaptation of Sparidae. Mar. Biotechnol. 2022;24:655–660. doi: 10.1007/s10126-022-10119-x. [DOI] [PubMed] [Google Scholar]
10.Zhu K.C., Zhang N., Liu B.S., Guo L., Guo H.Y., Jiang S.G., Zhang D.C. A chromosome-level genome assembly of the yellowfin seabream (Acanthopagrus latus; Hottuyn, 1782) provides insights into its osmoregulation and sex reversal. Genomics. 2021;113:1617–1627. doi: 10.1016/j.ygeno.2021.04.017. [DOI] [PubMed] [Google Scholar]
11.Pérez-Sánchez J., Naya-Català F., Soriano B., Piazzon M.C., Hafez A., Gabaldón T., Llorens C., Sitjà-Bobadilla A., Calduch-Giner J.A. Genome Sequencing and Transcriptome Analysis Reveal Recent Species-Specific Gene Duplications in the Plastic Gilthead Sea Bream (Sparus aurata) Front. Mar. Sci. 2019;6:760. doi: 10.3389/fmars.2019.00760. [DOI] [Google Scholar]
12.Vignal A., Milan D., SanCristobal M., Eggen A. A review on SNP and other types of molecular markers and their use in animal genetics. Genet. Sel. Evol. 2002;34:275–305. doi: 10.1051/gse:2002009. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Morin P.A., Martien K.K., Taylor B.L. Assessing statistical power of SNPs for population structure and conservation studies. Mol. Ecol. Resour. 2009;9:66–73. doi: 10.1111/j.1755-0998.2008.02392.x. [DOI] [PubMed] [Google Scholar]
14.Wellenreuther M., Mérot C., Berdan E., Bernatchez L. Going beyond SNPs: The role of structural genomic variants in adaptive evolution and species diversification. Mol. Ecol. 2019;28:1203–1209. doi: 10.1111/mec.15066. [DOI] [PubMed] [Google Scholar]
15.Luo Z., Yu Y., Xiang J., Li F. Genomic selection using a subset of SNPs identified by genome-wide association analysis for disease resistance traits in aquaculture species. Aquaculture. 2021;539 doi: 10.1016/j.aquaculture.2021.736620. [DOI] [Google Scholar]
16.Salem M., Vallejo R.L., Leeds T.D., Palti Y., Liu S., Sabbagh A., Rexroad C.E., 3rd, Yao J. RNA-Seq Identifies SNP Markers for Growth Traits in Rainbow Trout. PLoS One. 2012;7 doi: 10.1371/journal.pone.0036264. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Wang W., Huang J., Hu Y., Feng J., Gao D., Fang W., Xu M., Ma C., Fu Z., Chen Q., et al. Seascapes Shaped the Local Adaptation and Population Structure of South China Coast Yellowfin Seabream (Acanthopagrus latus) Mar. Biotechnol. 2024;26:60–73. doi: 10.1007/s10126-023-10277-6. [DOI] [PubMed] [Google Scholar]
18.1000 Genomes Project Consortium. Abecasis G.R., Altshuler D., Auton A., Brooks L.D., Durbin R.M., Gibbs R.A., Hurles M.E., McVean G.A. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Alkan C., Coe B.P., Eichler E.E. Genome structural variation discovery and genotyping. Nat. Rev. Genet. 2011;12:363–376. doi: 10.1038/nrg2958. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Sudmant P.H., Rausch T., Gardner E.J., Handsaker R.E., Abyzov A., Huddleston J., Zhang Y., Ye K., Jun G., Fritz M.H.Y., et al. An integrated map of structural variation in 2,504 human genomes. Nature. 2015;526:75–81. doi: 10.1038/nature15394. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Collins R.L., Brand H., Karczewski K.J., Zhao X., Alföldi J., Francioli L.C., Khera A.V., Lowther C., Gauthier L.D., Wang H., et al. A structural variation reference for medical and population genetics. Nature. 2020;581:444–451. doi: 10.1038/s41586-020-2287-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Weischenfeldt J., Symmons O., Spitz F., Korbel J.O. Phenotypic impact of genomic structural variation: insights from and for human disease. Nat. Rev. Genet. 2013;14:125–138. doi: 10.1038/nrg3373. [DOI] [PubMed] [Google Scholar]
23.Chiang C., Scott A.J., Davis J.R., Tsang E.K., Li X., Kim Y., Hadzic T., Damani F.N., Ganel L. The impact of structural variation on human gene expression. Nat. Genet. 2017;49:692–699. doi: 10.1038/ng.3834. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Spielmann M., Lupiáñez D.G., Mundlos S. Structural variation in the 3D genome. Nat. Rev. Genet. 2018;19:453–467. doi: 10.1038/s41576-018-0007-0. [DOI] [PubMed] [Google Scholar]
25.Mérot C., Oomen R.A., Tigano A., Wellenreuther M. A Roadmap for Understanding the Evolutionary Significance of Structural Genomic Variation. Trends Ecol. Evol. 2020;35:561–572. doi: 10.1016/j.tree.2020.03.002. [DOI] [PubMed] [Google Scholar]
26.Catanach A., Crowhurst R., Deng C., David C., Bernatchez L., Wellenreuther M. The genomic pool of standing structural variation outnumbers single nucleotide polymorphism by threefold in the marine teleost Chrysophrys auratus. Mol. Ecol. 2019;28:1210–1223. doi: 10.1111/mec.15051. [DOI] [PubMed] [Google Scholar]
27.Sherman R.M., Salzberg S.L. Pan-genomics in the human genome era. Nat. Rev. Genet. 2020;21:243–254. doi: 10.1038/s41576-020-0210-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Ballouz S., Dobin A., Gillis J.A. Is it time to change the reference genome? Genome Biol. 2019;20:159. doi: 10.1186/s13059-019-1774-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Wong K.H.Y., Ma W., Wei C.-Y., Yeh E.-C., Lin W.-J., Wang E.H.F., Su J.-P., Hsieh F.-J., Kao H.-J., Chen H.-H., et al. Towards a reference genome that captures global genetic diversity. Nat. Commun. 2020;11:5482. doi: 10.1038/s41467-020-19311-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Wang T., Antonacci-Fulton L., Howe K., Lawson H.A., Lucas J.K., Phillippy A.M., Popejoy A.B., Asri M., Carson C., Chaisson M.J.P., et al. The Human Pangenome Project: a global resource to map genomic diversity. Nature. 2022;604:437–446. doi: 10.1038/s41586-022-04601-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Zhou Y., Zhang Z., Bao Z., Li H., Lyu Y., Zan Y., Wu Y., Cheng L., Fang Y., Wu K., et al. Graph pangenome captures missing heritability and empowers tomato breeding. Nature. 2022;606:527–534. doi: 10.1038/s41586-022-04808-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Vernikos G., Medini D., Riley D.R., Tettelin H. Ten years of pan-genome analyses. Curr. Opin. Microbiol. 2015;23:148–154. doi: 10.1016/j.mib.2014.11.016. [DOI] [PubMed] [Google Scholar]
33. Collins R.E., Higgs P.G. Testing the Infinitely Many Genes Model for the Evolution of the Bacterial Core Genome and Pangenome. Mol. Biol. Evol. 2012;29:3413–3425. doi: 10.1093/molbev/mss163.
34. Secomandi S., Gallo G.R., Rossi R., Rodríguez Fernandes C., Jarvis E.D., Bonisoli-Alquati A., Gianfranceschi L., Formenti G. Pangenome graphs and their applications in biodiversity genomics. Nat. Genet. 2025;57:13–26. doi: 10.1038/s41588-024-02029-6.
35. Paten B., Novak A.M., Eizenga J.M., Garrison E. Genome graphs and the evolution of genome inference. Genome Res. 2017;27:665–676. doi: 10.1101/gr.214155.116.
36. Liao W.-W., Asri M., Ebler J., Doerr D., Haukness M., Hickey G., Lu S., Lucas J.K., Monlong J., Abel H.J., et al. A draft human pangenome reference. Nature. 2023;617:312–324. doi: 10.1038/s41586-023-05896-x.
37. Schreiber M., Jayakodi M., Stein N., Mascher M. Plant pangenomes for crop improvement, biodiversity and evolution. Nat. Rev. Genet. 2024;25:563–577. doi: 10.1038/s41576-024-00691-4.
38. Leonard A.S., Crysnanto D., Mapel X.M., Bhati M., Pausch H. Graph construction method impacts variation representation and analyses in a bovine super-pangenome. Genome Biol. 2023;24:124. doi: 10.1186/s13059-023-02969-y.
39. Zhang K., Guo S., Yang S., Zhou W., Wu J., Zhang X., Shi Q., Deng L. A telomere-to-telomere genome assembly of the protandrous hermaphrodite blackhead seabream, Acanthopagrus schlegelii. Sci. Data. 2025;12:350. doi: 10.1038/s41597-025-04602-y.
40. Lister J.A. Larval but not adult xanthophore pigmentation in zebrafish requires GTP cyclohydrolase 2 (gch2) function. Pigment Cell Melanoma Res. 2019;32:724–727. doi: 10.1111/pcmr.12783.
41. Chen J., Wang H., Wu S., Zhang A., Qiu Z., Huang P., Qu J.Y., Xu J. col1a2+ fibroblasts/muscle progenitors finetune xanthophore countershading by differentially expressing csf1a/1b in embryonic zebrafish. Sci. Adv. 2024;10 doi: 10.1126/sciadv.adj9637.
42. Nord H., Dennhag N., Muck J., von Hofsten J. Pax7 is required for establishment of the xanthophore lineage in zebrafish embryos. Mol. Biol. Cell. 2016;27:1853–1862. doi: 10.1091/mbc.e15-12-0821.
43. Wang C., Lu B., Li T., Liang G., Xu M., Liu X., Tao W., Zhou L., Kocher T.D., Wang D. Nile Tilapia: A Model for Studying Teleost Color Patterns. J. Hered. 2021;112:469–484. doi: 10.1093/jhered/esab018.
44. Dorant Y., Cayuela H., Wellband K., Laporte M., Rougemont Q., Mérot C., Normandeau E., Rochette R., Bernatchez L. Copy number variants outperform SNPs to reveal genotype–temperature association in a marine species. Mol. Ecol. 2020;29:4765–4782. doi: 10.1111/mec.15565.
45. Hämälä T., Wafula E.K., Guiltinan M.J., Ralph P.E., dePamphilis C.W., Tiffin P. Genomic structural variants constrain and facilitate adaptation in natural populations of Theobroma cacao, the chocolate tree. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2102914118.
46. Scott A.J., Chiang C., Hall I.M. Structural variants are a major source of gene expression differences in humans and often affect multiple nearby genes. Genome Res. 2021;31:2249–2257. doi: 10.1101/gr.275488.121.
47. Zhang Y., Yang Z., He Y., Liu D., Liu Y., Liang C., Xie M., Jia Y., Ke Q., Zhou Y., et al. Structural variation reshapes population gene expression and trait variation in 2,105 Brassica napus accessions. Nat. Genet. 2024;56:2538–2550. doi: 10.1038/s41588-024-01957-7.
48. Abel H.J., Larson D.E., Regier A.A., Chiang C., Das I., Kanchi K.L., Layer R.M., Neale B.M., Salerno W.J., Reeves C., et al. Mapping and characterization of structural variation in 17,795 human genomes. Nature. 2020;583:83–89. doi: 10.1038/s41586-020-2371-0.
49. Ruigrok M., Xue B., Catanach A., Zhang M., Jesson L., Davy M., Wellenreuther M. The Relative Power of Structural Genomic Variation versus SNPs in Explaining the Quantitative Trait Growth in the Marine Teleost Chrysophrys auratus. Genes. 2022;13:1129. doi: 10.3390/genes13071129.
50. Bertolotti A.C., Layer R.M., Gundappa M.K., Gallagher M.D., Pehlivanoglu E., Nome T., Robledo D., Kent M.P., Røsæg L.L., Holen M.M., et al. The structural variation landscape in 492 Atlantic salmon genomes. Nat. Commun. 2020;11:5176. doi: 10.1038/s41467-020-18972-x.
51. Tine M., Kuhl H., Gagnaire P.-A., Louro B., Desmarais E., Martins R.S.T., Hecht J., Knaust F., Belkhir K., Klages S., et al. European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation. Nat. Commun. 2014;5:5770. doi: 10.1038/ncomms6770.
52. Kültz D. Physiological mechanisms used by fish to cope with salinity stress. J. Exp. Biol. 2015;218:1907–1914. doi: 10.1242/jeb.118695.
53. Iula L., Keitelman I.A., Sabbione F., Fuentes F., Guzman M., Galletti J.G., Gerber P.P., Ostrowski M., Geffner J.R., Jancic C.C., Trevani A.S. Autophagy Mediates Interleukin-1β Secretion in Human Neutrophils. Front. Immunol. 2018;9:269. doi: 10.3389/fimmu.2018.00269.
54. Drummond D.A., Bloom J.D., Adami C., Wilke C.O., Arnold F.H. Why highly expressed proteins evolve slowly. Proc. Natl. Acad. Sci. USA. 2005;102:14338–14343. doi: 10.1073/pnas.0504070102.
55. Drummond D.A., Wilke C.O. Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution. Cell. 2008;134:341–352. doi: 10.1016/j.cell.2008.05.042.
56. Zhang J., Yang J.-R. Determinants of the rate of protein sequence evolution. Nat. Rev. Genet. 2015;16:409–420. doi: 10.1038/nrg3950.
57. Shanta O., Noor A., Sebat J., Zhao X., Malhotra A., Porubsky D., Rausch T., Gardner E.J., Rodriguez O.L. The effects of common structural variants on 3D chromatin structure. BMC Genom. 2020;21:95. doi: 10.1186/s12864-020-6516-1.
58. Price A.C., Weadick C.J., Shim J., Rodd F.H. Pigments, Patterns, and Fish Behavior. 2009. https://home.liebertpub.com/zeb
59. Fuller R.C. Lighting environment predicts the relative abundance of male colour morphs in bluefin killifish (Lucania goodei) populations. Proc. Biol. Sci. 2002;269:1457–1465. doi: 10.1098/rspb.2002.2042.
60. Maan M.E., Seehausen O., Söderberg L., Johnson L., Ripmeester E.A.P., Mrosso H.D.J., Taylor M.I., van Dooren T.J.M., van Alphen J.J.M. Intraspecific sexual selection on a speciation trait, male coloration, in the Lake Victoria cichlid Pundamilia nyererei. Proc. Biol. Sci. 2004;271:2445–2452. doi: 10.1098/rspb.2004.2911.
61. Stevens M., Lown A.E., Wood L.E. Color change and camouflage in juvenile shore crabs Carcinus maenas. Front. Ecol. Evol. 2014;2 doi: 10.3389/fevo.2014.00014.
62. Cal L., Suarez-Bregua P., Moran P., Cerdá-Reverter J.M., Rotllant J. In: Emerging Issues in Fish Larvae Research. Yúfera M., editor. Springer International Publishing; 2018. Fish Pigmentation. A Key Issue for the Sustainable Development of Fish Farming; pp. 229–252.
63. Ziegler I. The Pteridine Pathway in Zebrafish: Regulation and Specification during the Determination of Neural Crest Cell-Fate. Pigment Cell Res. 2003;16:172–182. doi: 10.1034/j.1600-0749.2003.00044.x.
64. Braasch I., Schartl M., Volff J.-N. Evolution of pigment synthesis pathways by gene and genome duplication in fish. BMC Evol. Biol. 2007;7:74. doi: 10.1186/1471-2148-7-74.
65. Fang W., Huang J., Li S., Lu J. Identification of pigment genes (melanin, carotenoid and pteridine) associated with skin color variant in red tilapia using transcriptome analysis. Aquaculture. 2022;547 doi: 10.1016/j.aquaculture.2021.737429.
66. Huang J., Fang W., Li J., Cai W., Lu J. Full-length transcriptome reveals alternative splicing regulation pattern of skin color variant in red tilapia (Oreochromis spp.) Aquaculture. 2025;598 doi: 10.1016/j.aquaculture.2024.741963.
67. Liu Z., Gao D. Current State of Fish Reference Genome and Pangenome: Methodologies, Sampling Strategies, Quality Assessment and Future Perspectives to Aquaculture Breeding. Mar. Biotechnol. 2025;27:158. doi: 10.1007/s10126-025-10535-9.
68. Pan C., Gao C., Chen T., Chen X., Yang C., Zeng D., Feng P., Jiang W., Peng M. The complete mitochondrial genome of yellowfin seabream, Acanthopagrus latus (Percoiformes, Sparidae) from Beibu Bay. Mitochondrial DNA Part B. 2021;6:1313–1314. doi: 10.1080/23802359.2021.1907804.
69. Blommaert J., Sandoval-Castillo J., Beheregaray L.B., Wellenreuther M. Peering into the gaps: Long-read sequencing illuminates structural variants and genomic evolution in the Australasian snapper. Genomics. 2024;116 doi: 10.1016/j.ygeno.2024.110929.
70. Houston R.D., Bean T.P., Macqueen D.J., Gundappa M.K., Jin Y.H., Jenkins T.L., Selly S.L.C., Martin S.A.M., Stevens J.R., Santos E.M., et al. Harnessing genomics to fast-track genetic improvement in aquaculture. Nat. Rev. Genet. 2020;21:389–409. doi: 10.1038/s41576-020-0227-y.
71. Chao H., Li Z., Chen D., Chen M. iSeq: an integrated tool to fetch public sequencing data. Bioinformatics. 2024;40 doi: 10.1093/bioinformatics/btae641.
72. Chen S., Zhou Y., Chen Y., Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560.
73. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at arXiv. 2013. doi: 10.48550/arXiv.1303.3997.
74. Tarasov A., Vilella A.J., Cuppen E., Nijman I.J., Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015;31:2032–2034. doi: 10.1093/bioinformatics/btv098.
75. Poplin R., Chang P.-C., Alexander D., Schwartz S., Colthurst T., Ku A., Newburger D., Dijamco J., Nguyen N., Afshar P.T., et al. A universal SNP and small-indel variant caller using deep neural networks. Nat. Biotechnol. 2018;36:983–987. doi: 10.1038/nbt.4235.
76. Yun T., Li H., Chang P.-C., Lin M.F., Carroll A., McLean C.Y. Accurate, scalable cohort variant calls using DeepVariant and GLnexus. Bioinformatics. 2021;36:5582–5589. doi: 10.1093/bioinformatics/btaa1081.
77. Chiang C., Layer R.M., Faust G.G., Lindberg M.R., Rose D.B., Garrison E.P., Marth G.T., Quinlan A.R., Hall I.M. SpeedSeq: ultra-fast personal genome analysis and interpretation. Nat. Methods. 2015;12:966–968. doi: 10.1038/nmeth.3505.
78. Layer R.M., Chiang C., Quinlan A.R., Hall I.M. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. 2014;15:R84. doi: 10.1186/gb-2014-15-6-r84.
79. Abyzov A., Urban A.E., Snyder M., Gerstein M. CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011;21:974–984. doi: 10.1101/gr.114876.110.
80. Larson D.E., Abel H.J., Chiang C., Badve A., Das I., Eldred J.M., Layer R.M., Hall I.M. svtools: population-scale analysis of structural variation. Bioinformatics. 2019;35:4782–4787. doi: 10.1093/bioinformatics/btz492.
81. Danecek P., Bonfield J.K., Liddle J., Marshall J., Ohan V., Pollard M.O., Whitwham A., Keane T., McCarthy S.A., Davies R.M., Li H. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10:giab008. doi: 10.1093/gigascience/giab008.
82. Cingolani P., Platts A., Wang L.L., Coon M., Nguyen T., Wang L., Land S.J., Lu X., Ruden D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695.
83. Lawrence M., Huber W., Pagès H., Aboyoun P., Carlson M., Gentleman R., Morgan M.T., Carey V.J. Software for Computing and Annotating Genomic Ranges. PLoS Comput. Biol. 2013;9 doi: 10.1371/journal.pcbi.1003118.
84. Krzywinski M., Schein J., Birol İ., Connors J., Gascoyne R., Horsman D., Jones S.J., Marra M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19:1639–1645. doi: 10.1101/gr.092759.109.
85. Yu G., Wang L.-G., He Q.-Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics. 2015;31:2382–2383. doi: 10.1093/bioinformatics/btv145.
86. Krueger F. Trim Galore. 2023. https://www.bioinformatics.babraham.ac.uk/projects/trim_galore
87. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10. doi: 10.14806/ej.17.1.200.
88. Servant N., Varoquaux N., Lajoie B.R., Viara E., Chen C.-J., Vert J.-P., Heard E., Dekker J., Barillot E. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015;16:259. doi: 10.1186/s13059-015-0831-x.
89. Filippova D., Patro R., Duggal G., Kingsford C. Identification of alternative topological domains in chromatin. Algorithms Mol. Biol. 2014;9:14. doi: 10.1186/1748-7188-9-14.
90. Paulsen J., Liyakat Ali T.M., Collas P. Computational 3D genome modeling using Chrom3D. Nat. Protoc. 2018;13:1137–1152. doi: 10.1038/nprot.2018.009.
91. Paulsen J., Sekelja M., Oldenburg A.R., Barateau A., Briand N., Delbarre E., Shah A., Sørensen A.L., Vigouroux C., Buendia B., Collas P. Chrom3D: three-dimensional genome modeling from Hi-C and nuclear lamin-genome contacts. Genome Biol. 2017;18:21. doi: 10.1186/s13059-016-1146-2.
92. Hernández-Plaza A., Szklarczyk D., Botas J., Cantalapiedra C.P., Giner-Lamia J., Mende D.R., Kirsch R., Rattei T., Letunic I., Jensen L.J., et al. eggNOG 6.0: enabling comparative genomics across 12 535 organisms. Nucleic Acids Res. 2023;51:D389–D394. doi: 10.1093/nar/gkac1022.
93. Carlson M., Pagès H. AnnotationForge. Bioconductor; 2017.
94. Xu S., Hu E., Cai Y., Xie Z., Luo X., Zhan L., Tang W., Wang Q., Liu B., Wang R., et al. Using clusterProfiler to characterize multiomics data. Nat. Protoc. 2024;19:3292–3320. doi: 10.1038/s41596-024-01020-z.
95. Wilkinson L. ggplot2: Elegant Graphics for Data Analysis by WICKHAM, H. Biometrics. 2011;67:678–679. doi: 10.1111/j.1541-0420.2011.01616.x.
96. Bolger A.M., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170.
97. Kim D., Paggi J.M., Park C., Bennett C., Salzberg S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4.
98. Pertea M., Pertea G.M., Antonescu C.M., Chang T.-C., Mendell J.T., Salzberg S.L. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015;33:290–295. doi: 10.1038/nbt.3122.
99. Bates D., Mächler M., Bolker B., Walker S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Soft. 2015;67:1. doi: 10.18637/jss.v067.i01.
100. Kuznetsova A., Brockhoff P.B., Christensen R.H.B. lmerTest Package: Tests in Linear Mixed Effects Models. J. Stat. Soft. 2017;82 doi: 10.18637/jss.v082.i13.
101. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191.
102. Goel M., Sun H., Jiao W.-B., Schneeberger K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 2019;20:277. doi: 10.1186/s13059-019-1911-0.
103. Goel M., Schneeberger K. plotsr: visualizing structural similarities and rearrangements between multiple genomes. Bioinformatics. 2022;38:2922–2926. doi: 10.1093/bioinformatics/btac196.
104. Garrison E., Guarracino A., Heumos S., Villani F., Bao Z., Tattini L., Hagmann J., Vorbrugg S., Marco-Sola S., Kubica C., et al. Building pangenome graphs. Nat. Methods. 2024;21:2008–2012. doi: 10.1038/s41592-024-02430-3.
105. Jain C., Koren S., Dilthey A., Phillippy A.M., Aluru S. A fast adaptive algorithm for computing whole-genome homology maps. Bioinformatics. 2018;34:i748–i756. doi: 10.1093/bioinformatics/bty597.
106. Guarracino A., Mwaniki N., Marco-Sola S., Garrison E. Zenodo; 2021. Wfmash: A Pangenome-Scale Aligner.
107. Garrison E., Guarracino A. Unbiased pangenome graphs. Bioinformatics. 2023;39 doi: 10.1093/bioinformatics/btac743.
108. Garrison E., Guarracino A., Heumos S., Novak A., Hickey G., Eizenga J., Prins P. Zenodo; 2022. Pangenome/Smoothxg: Citation Release.
109. Doerr D., Marijon P. GFAffix. 2022. https://github.com/codialab/GFAffix
110. Hickey G., Heller D., Monlong J., Sibbesen J.A., Sirén J., Eizenga J., Dawson E.T., Garrison E., Novak A.M., Paten B. Genotyping structural variants in pangenome graphs using the vg toolkit. Genome Biol. 2020;21:35. doi: 10.1186/s13059-020-1941-7.
111. Wick R.R., Schultz M.B., Zobel J., Holt K.E. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015;31:3350–3352. doi: 10.1093/bioinformatics/btv383.
112. Guarracino A., Heumos S., Nahnsen S., Prins P., Garrison E. ODGI: understanding pangenome graphs. Bioinformatics. 2022;38:3319–3326. doi: 10.1093/bioinformatics/btac308.
113. Beyer W., Novak A.M., Hickey G., Chan J., Tan V., Paten B., Zerbino D.R. Sequence tube maps: making graph genomes intuitive to commuters. Bioinformatics. 2019;35:5318–5320. doi: 10.1093/bioinformatics/btz597.
114. Li H. Protein-to-genome alignment with miniprot. Bioinformatics. 2023;39 doi: 10.1093/bioinformatics/btad014.
115. Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., Madden T.L. BLAST+: architecture and applications. BMC Bioinf. 2009;10:421. doi: 10.1186/1471-2105-10-421.
116. Pertea G., Pertea M. GFF Utilities: GffRead and GffCompare. F1000Res. 2020;9 doi: 10.12688/f1000research.23297.2. ISCB Comm J-304.
117. Castro-Mondragon J.A., Riudavets-Puig R., Rauluseviciute I., Lemma R.B., Turchi L., Blanc-Mathieu R., Lucas J., Boddie P., Khan A., Manosalva Pérez N., et al. JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2022;50:D165–D173. doi: 10.1093/nar/gkab1113.
118. Paysan-Lafosse T., Blum M., Chuguransky S., Grego T., Pinto B.L., Salazar G.A., Bileschi M.L., Bork P., Bridge A., Colwell L., et al. InterPro in 2022. Nucleic Acids Res. 2023;51:D418–D427. doi: 10.1093/nar/gkac993.
119. Abramson J., Adler J., Dunger J., Evans R., Green T., Pritzel A., Ronneberger O., Willmore L., Ballard A.J., Bambrick J., et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630:493–500. doi: 10.1038/s41586-024-07487-w.
120. Zhang Y., Skolnick J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 2005;33:2302–2309. doi: 10.1093/nar/gki524.
121. Meng E.C., Goddard T.D., Pettersen E.F., Couch G.S., Pearson Z.J., Morris J.H., Ferrin T.E. UCSF ChimeraX: Tools for structure building and analysis. Protein Sci. 2023;32 doi: 10.1002/pro.4792.

Associated Data


Supplementary Materials

Document S1. Figures S1–S13, Tables S1–S5, and Data S1

mmc1.pdf (2.5 MB, PDF)

Data Availability Statement

•
Data: All 80 whole-genome resequencing datasets reported in this study have been deposited at NCBI and are publicly available. Accession numbers are listed in the key resources table. The Hi-C dataset is available under SRA accession number SRR12328045. The 56 transcriptome datasets of yellowfin seabream are listed in Table S3. The remaining 24 resequencing datasets, comprising 11 yellowfin seabream and 13 blackhead seabream individuals, are listed in Table S4.
•
Code: All original code is available in this paper’s supplemental information (Data S1).
•
All other items: Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

[bib33] 33. Collins R.E., Higgs P.G. Testing the Infinitely Many Genes Model for the Evolution of the Bacterial Core Genome and Pangenome. Mol. Biol. Evol. 2012;29:3413–3425. doi: 10.1093/molbev/mss163.

[bib34] 34. Secomandi S., Gallo G.R., Rossi R., Rodríguez Fernandes C., Jarvis E.D., Bonisoli-Alquati A., Gianfranceschi L., Formenti G. Pangenome graphs and their applications in biodiversity genomics. Nat. Genet. 2025;57:13–26. doi: 10.1038/s41588-024-02029-6.

[bib35] 35. Paten B., Novak A.M., Eizenga J.M., Garrison E. Genome graphs and the evolution of genome inference. Genome Res. 2017;27:665–676. doi: 10.1101/gr.214155.116.

[bib36] 36. Liao W.-W., Asri M., Ebler J., Doerr D., Haukness M., Hickey G., Lu S., Lucas J.K., Monlong J., Abel H.J., et al. A draft human pangenome reference. Nature. 2023;617:312–324. doi: 10.1038/s41586-023-05896-x.

[bib37] 37. Schreiber M., Jayakodi M., Stein N., Mascher M. Plant pangenomes for crop improvement, biodiversity and evolution. Nat. Rev. Genet. 2024;25:563–577. doi: 10.1038/s41576-024-00691-4.

[bib38] 38. Leonard A.S., Crysnanto D., Mapel X.M., Bhati M., Pausch H. Graph construction method impacts variation representation and analyses in a bovine super-pangenome. Genome Biol. 2023;24:124. doi: 10.1186/s13059-023-02969-y.

[bib39] 39. Zhang K., Guo S., Yang S., Zhou W., Wu J., Zhang X., Shi Q., Deng L. A telomere-to-telomere genome assembly of the protandrous hermaphrodite blackhead seabream, Acanthopagrus schlegelii. Sci. Data. 2025;12:350. doi: 10.1038/s41597-025-04602-y.

[bib40] 40. Lister J.A. Larval but not adult xanthophore pigmentation in zebrafish requires GTP cyclohydrolase 2 (gch2) function. Pigment Cell Melanoma Res. 2019;32:724–727. doi: 10.1111/pcmr.12783.

[bib41] 41. Chen J., Wang H., Wu S., Zhang A., Qiu Z., Huang P., Qu J.Y., Xu J. col1a2+ fibroblasts/muscle progenitors finetune xanthophore countershading by differentially expressing csf1a/1b in embryonic zebrafish. Sci. Adv. 2024;10 doi: 10.1126/sciadv.adj9637.

[bib42] 42. Nord H., Dennhag N., Muck J., von Hofsten J. Pax7 is required for establishment of the xanthophore lineage in zebrafish embryos. Mol. Biol. Cell. 2016;27:1853–1862. doi: 10.1091/mbc.e15-12-0821.

[bib43] 43. Wang C., Lu B., Li T., Liang G., Xu M., Liu X., Tao W., Zhou L., Kocher T.D., Wang D. Nile Tilapia: A Model for Studying Teleost Color Patterns. J. Hered. 2021;112:469–484. doi: 10.1093/jhered/esab018.

[bib44] 44. Dorant Y., Cayuela H., Wellband K., Laporte M., Rougemont Q., Mérot C., Normandeau E., Rochette R., Bernatchez L. Copy number variants outperform SNPs to reveal genotype–temperature association in a marine species. Mol. Ecol. 2020;29:4765–4782. doi: 10.1111/mec.15565.

[bib45] 45. Hämälä T., Wafula E.K., Guiltinan M.J., Ralph P.E., dePamphilis C.W., Tiffin P. Genomic structural variants constrain and facilitate adaptation in natural populations of Theobroma cacao, the chocolate tree. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2102914118.

[bib46] 46. Scott A.J., Chiang C., Hall I.M. Structural variants are a major source of gene expression differences in humans and often affect multiple nearby genes. Genome Res. 2021;31:2249–2257. doi: 10.1101/gr.275488.121.

[bib47] 47. Zhang Y., Yang Z., He Y., Liu D., Liu Y., Liang C., Xie M., Jia Y., Ke Q., Zhou Y., et al. Structural variation reshapes population gene expression and trait variation in 2,105 Brassica napus accessions. Nat. Genet. 2024;56:2538–2550. doi: 10.1038/s41588-024-01957-7.

[bib48] 48. Abel H.J., Larson D.E., Regier A.A., Chiang C., Das I., Kanchi K.L., Layer R.M., Neale B.M., Salerno W.J., Reeves C., et al. Mapping and characterization of structural variation in 17,795 human genomes. Nature. 2020;583:83–89. doi: 10.1038/s41586-020-2371-0.

[bib49] 49. Ruigrok M., Xue B., Catanach A., Zhang M., Jesson L., Davy M., Wellenreuther M. The Relative Power of Structural Genomic Variation versus SNPs in Explaining the Quantitative Trait Growth in the Marine Teleost Chrysophrys auratus. Genes. 2022;13:1129. doi: 10.3390/genes13071129.

[bib50] 50. Bertolotti A.C., Layer R.M., Gundappa M.K., Gallagher M.D., Pehlivanoglu E., Nome T., Robledo D., Kent M.P., Røsæg L.L., Holen M.M., et al. The structural variation landscape in 492 Atlantic salmon genomes. Nat. Commun. 2020;11:5176. doi: 10.1038/s41467-020-18972-x.

[bib51] 51. Tine M., Kuhl H., Gagnaire P.-A., Louro B., Desmarais E., Martins R.S.T., Hecht J., Knaust F., Belkhir K., Klages S., et al. European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation. Nat. Commun. 2014;5:5770. doi: 10.1038/ncomms6770.

[bib52] 52. Kültz D. Physiological mechanisms used by fish to cope with salinity stress. J. Exp. Biol. 2015;218:1907–1914. doi: 10.1242/jeb.118695.

[bib53] 53. Iula L., Keitelman I.A., Sabbione F., Fuentes F., Guzman M., Galletti J.G., Gerber P.P., Ostrowski M., Geffner J.R., Jancic C.C., Trevani A.S. Autophagy Mediates Interleukin-1β Secretion in Human Neutrophils. Front. Immunol. 2018;9:269. doi: 10.3389/fimmu.2018.00269.

[bib54] 54. Drummond D.A., Bloom J.D., Adami C., Wilke C.O., Arnold F.H. Why highly expressed proteins evolve slowly. Proc. Natl. Acad. Sci. USA. 2005;102:14338–14343. doi: 10.1073/pnas.0504070102.

[bib55] 55. Drummond D.A., Wilke C.O. Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution. Cell. 2008;134:341–352. doi: 10.1016/j.cell.2008.05.042.

[bib56] 56. Zhang J., Yang J.-R. Determinants of the rate of protein sequence evolution. Nat. Rev. Genet. 2015;16:409–420. doi: 10.1038/nrg3950.

[bib57] 57. Shanta O., Noor A., Sebat J., Zhao X., Malhotra A., Porubsky D., Rausch T., Gardner E.J., Rodriguez O.L. The effects of common structural variants on 3D chromatin structure. BMC Genom. 2020;21:95. doi: 10.1186/s12864-020-6516-1.

[bib58] 58. Price A.C., Weadick C.J., Shim J., Rodd F.H. Pigments, Patterns, and Fish Behavior. 2009. https://home.liebertpub.com/zeb

[bib59] 59. Fuller R.C. Lighting environment predicts the relative abundance of male colour morphs in bluefin killifish (Lucania goodei) populations. Proc. Biol. Sci. 2002;269:1457–1465. doi: 10.1098/rspb.2002.2042.

[bib60] 60. Maan M.E., Seehausen O., Söderberg L., Johnson L., Ripmeester E.A.P., Mrosso H.D.J., Taylor M.I., van Dooren T.J.M., van Alphen J.J.M. Intraspecific sexual selection on a speciation trait, male coloration, in the Lake Victoria cichlid Pundamilia nyererei. Proc. Biol. Sci. 2004;271:2445–2452. doi: 10.1098/rspb.2004.2911.

[bib61] 61. Stevens M., Lown A.E., Wood L.E. Color change and camouflage in juvenile shore crabs Carcinus maenas. Front. Ecol. Evol. 2014;2 doi: 10.3389/fevo.2014.00014.

[bib62] 62. Cal L., Suarez-Bregua P., Moran P., Cerdá-Reverter J.M., Rotllant J. In: Emerging Issues in Fish Larvae Research. Yúfera M., editor. Springer International Publishing; 2018. Fish Pigmentation. A Key Issue for the Sustainable Development of Fish Farming; pp. 229–252.

[bib63] 63. Ziegler I. The Pteridine Pathway in Zebrafish: Regulation and Specification during the Determination of Neural Crest Cell-Fate. Pigment Cell Res. 2003;16:172–182. doi: 10.1034/j.1600-0749.2003.00044.x.

[bib64] 64. Braasch I., Schartl M., Volff J.-N. Evolution of pigment synthesis pathways by gene and genome duplication in fish. BMC Evol. Biol. 2007;7:74. doi: 10.1186/1471-2148-7-74.

[bib65] 65. Fang W., Huang J., Li S., Lu J. Identification of pigment genes (melanin, carotenoid and pteridine) associated with skin color variant in red tilapia using transcriptome analysis. Aquaculture. 2022;547 doi: 10.1016/j.aquaculture.2021.737429.

[bib66] 66. Huang J., Fang W., Li J., Cai W., Lu J. Full-length transcriptome reveals alternative splicing regulation pattern of skin color variant in red tilapia (Oreochromis spp.) Aquaculture. 2025;598 doi: 10.1016/j.aquaculture.2024.741963.

[bib67] 67. Liu Z., Gao D. Current State of Fish Reference Genome and Pangenome: Methodologies, Sampling Strategies, Quality Assessment and Future Perspectives to Aquaculture Breeding. Mar. Biotechnol. 2025;27:158. doi: 10.1007/s10126-025-10535-9.

[bib68] 68. Pan C., Gao C., Chen T., Chen X., Yang C., Zeng D., Feng P., Jiang W., Peng M. The complete mitochondrial genome of yellowfin seabream, Acanthopagrus latus (Percoiformes, Sparidae) from Beibu Bay. Mitochondrial DNA Part B. 2021;6:1313–1314. doi: 10.1080/23802359.2021.1907804.

[bib69] 69. Blommaert J., Sandoval-Castillo J., Beheregaray L.B., Wellenreuther M. Peering into the gaps: Long-read sequencing illuminates structural variants and genomic evolution in the Australasian snapper. Genomics. 2024;116 doi: 10.1016/j.ygeno.2024.110929.

[bib70] 70. Houston R.D., Bean T.P., Macqueen D.J., Gundappa M.K., Jin Y.H., Jenkins T.L., Selly S.L.C., Martin S.A.M., Stevens J.R., Santos E.M., et al. Harnessing genomics to fast-track genetic improvement in aquaculture. Nat. Rev. Genet. 2020;21:389–409. doi: 10.1038/s41576-020-0227-y.

[bib71] 71. Chao H., Li Z., Chen D., Chen M. iSeq: an integrated tool to fetch public sequencing data. Bioinformatics. 2024;40 doi: 10.1093/bioinformatics/btae641.

[bib72] 72. Chen S., Zhou Y., Chen Y., Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560.

[bib73] 73. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at arXiv. 2013. doi: 10.48550/arXiv.1303.3997.

[bib74] 74. Tarasov A., Vilella A.J., Cuppen E., Nijman I.J., Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015;31:2032–2034. doi: 10.1093/bioinformatics/btv098.

[bib75] 75. Poplin R., Chang P.-C., Alexander D., Schwartz S., Colthurst T., Ku A., Newburger D., Dijamco J., Nguyen N., Afshar P.T., et al. A universal SNP and small-indel variant caller using deep neural networks. Nat. Biotechnol. 2018;36:983–987. doi: 10.1038/nbt.4235.

[bib76] 76. Yun T., Li H., Chang P.-C., Lin M.F., Carroll A., McLean C.Y. Accurate, scalable cohort variant calls using DeepVariant and GLnexus. Bioinformatics. 2021;36:5582–5589. doi: 10.1093/bioinformatics/btaa1081.

[bib77] 77. Chiang C., Layer R.M., Faust G.G., Lindberg M.R., Rose D.B., Garrison E.P., Marth G.T., Quinlan A.R., Hall I.M. SpeedSeq: ultra-fast personal genome analysis and interpretation. Nat. Methods. 2015;12:966–968. doi: 10.1038/nmeth.3505.

[bib78] 78. Layer R.M., Chiang C., Quinlan A.R., Hall I.M. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. 2014;15:R84. doi: 10.1186/gb-2014-15-6-r84.

[bib79] 79. Abyzov A., Urban A.E., Snyder M., Gerstein M. CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011;21:974–984. doi: 10.1101/gr.114876.110.

[bib80] 80. Larson D.E., Abel H.J., Chiang C., Badve A., Das I., Eldred J.M., Layer R.M., Hall I.M. svtools: population-scale analysis of structural variation. Bioinformatics. 2019;35:4782–4787. doi: 10.1093/bioinformatics/btz492.

[bib81] 81. Danecek P., Bonfield J.K., Liddle J., Marshall J., Ohan V., Pollard M.O., Whitwham A., Keane T., McCarthy S.A., Davies R.M., Li H. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10:giab008. doi: 10.1093/gigascience/giab008.

[bib82] 82. Cingolani P., Platts A., Wang L.L., Coon M., Nguyen T., Wang L., Land S.J., Lu X., Ruden D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695.

[bib83] 83. Lawrence M., Huber W., Pagès H., Aboyoun P., Carlson M., Gentleman R., Morgan M.T., Carey V.J. Software for Computing and Annotating Genomic Ranges. PLoS Comput. Biol. 2013;9 doi: 10.1371/journal.pcbi.1003118.

[bib84] 84. Krzywinski M., Schein J., Birol İ., Connors J., Gascoyne R., Horsman D., Jones S.J., Marra M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19:1639–1645. doi: 10.1101/gr.092759.109.

[bib85] 85. Yu G., Wang L.-G., He Q.-Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics. 2015;31:2382–2383. doi: 10.1093/bioinformatics/btv145.

[bib86] 86. Krueger F. Trim Galore. 2023. https://www.bioinformatics.babraham.ac.uk/projects/trim_galore

[bib87] 87. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10. doi: 10.14806/ej.17.1.200.

[bib88] 88. Servant N., Varoquaux N., Lajoie B.R., Viara E., Chen C.-J., Vert J.-P., Heard E., Dekker J., Barillot E. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015;16:259. doi: 10.1186/s13059-015-0831-x.

[bib89] 89. Filippova D., Patro R., Duggal G., Kingsford C. Identification of alternative topological domains in chromatin. Algorithms Mol. Biol. 2014;9:14. doi: 10.1186/1748-7188-9-14.

[bib90] 90. Paulsen J., Liyakat Ali T.M., Collas P. Computational 3D genome modeling using Chrom3D. Nat. Protoc. 2018;13:1137–1152. doi: 10.1038/nprot.2018.009.

[bib91] 91. Paulsen J., Sekelja M., Oldenburg A.R., Barateau A., Briand N., Delbarre E., Shah A., Sørensen A.L., Vigouroux C., Buendia B., Collas P. Chrom3D: three-dimensional genome modeling from Hi-C and nuclear lamin-genome contacts. Genome Biol. 2017;18:21. doi: 10.1186/s13059-016-1146-2.

[bib92] 92. Hernández-Plaza A., Szklarczyk D., Botas J., Cantalapiedra C.P., Giner-Lamia J., Mende D.R., Kirsch R., Rattei T., Letunic I., Jensen L.J., et al. eggNOG 6.0: enabling comparative genomics across 12 535 organisms. Nucleic Acids Res. 2023;51:D389–D394. doi: 10.1093/nar/gkac1022.

[bib93] 93. Carlson M., Pagès H. AnnotationForge. (Bioconductor) 2017.

[bib94] 94. Xu S., Hu E., Cai Y., Xie Z., Luo X., Zhan L., Tang W., Wang Q., Liu B., Wang R., et al. Using clusterProfiler to characterize multiomics data. Nat. Protoc. 2024;19:3292–3320. doi: 10.1038/s41596-024-01020-z.

[bib95] 95. Wilkinson L. ggplot2: Elegant Graphics for Data Analysis by WICKHAM, H. Biometrics. 2011;67:678–679. doi: 10.1111/j.1541-0420.2011.01616.x.

[bib96] 96. Bolger A.M., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170.

[bib97] 97. Kim D., Paggi J.M., Park C., Bennett C., Salzberg S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4.

[bib98] 98. Pertea M., Pertea G.M., Antonescu C.M., Chang T.-C., Mendell J.T., Salzberg S.L. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015;33:290–295. doi: 10.1038/nbt.3122.

[bib99] 99. Bates D., Mächler M., Bolker B., Walker S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Soft. 2015;67:1. doi: 10.18637/jss.v067.i01.

[bib100] 100. Kuznetsova A., Brockhoff P.B., Christensen R.H.B. lmerTest Package: Tests in Linear Mixed Effects Models. J. Stat. Soft. 2017;82 doi: 10.18637/jss.v082.i13.

[bib101] 101. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191.

[bib102] 102. Goel M., Sun H., Jiao W.-B., Schneeberger K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 2019;20:277. doi: 10.1186/s13059-019-1911-0.

[bib103] 103. Goel M., Schneeberger K. plotsr: visualizing structural similarities and rearrangements between multiple genomes. Bioinformatics. 2022;38:2922–2926. doi: 10.1093/bioinformatics/btac196.

[bib104] 104. Garrison E., Guarracino A., Heumos S., Villani F., Bao Z., Tattini L., Hagmann J., Vorbrugg S., Marco-Sola S., Kubica C., et al. Building pangenome graphs. Nat. Methods. 2024;21:2008–2012. doi: 10.1038/s41592-024-02430-3.

[bib105] 105. Jain C., Koren S., Dilthey A., Phillippy A.M., Aluru S. A fast adaptive algorithm for computing whole-genome homology maps. Bioinformatics. 2018;34:i748–i756. doi: 10.1093/bioinformatics/bty597.

[bib106] 106. Guarracino A., Mwaniki N., Marco-Sola S., Garrison E. Zenodo; 2021. Wfmash: A Pangenome-Scale Aligner.

[bib107] 107. Garrison E., Guarracino A. Unbiased pangenome graphs. Bioinformatics. 2023;39 doi: 10.1093/bioinformatics/btac743.

[bib108] 108. Garrison E., Guarracino A., Heumos S., Novak A., Hickey G., Eizenga J., Prins P. Zenodo; 2022. Pangenome/Smoothxg: Citation Release.

[bib109] 109. Doerr D., Marijon P. GFAffix. 2022. https://github.com/codialab/GFAffix

[bib110] 110. Hickey G., Heller D., Monlong J., Sibbesen J.A., Sirén J., Eizenga J., Dawson E.T., Garrison E., Novak A.M., Paten B. Genotyping structural variants in pangenome graphs using the vg toolkit. Genome Biol. 2020;21:35. doi: 10.1186/s13059-020-1941-7.

[bib111] 111. Wick R.R., Schultz M.B., Zobel J., Holt K.E. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015;31:3350–3352. doi: 10.1093/bioinformatics/btv383.

[bib112] 112. Guarracino A., Heumos S., Nahnsen S., Prins P., Garrison E. ODGI: understanding pangenome graphs. Bioinformatics. 2022;38:3319–3326. doi: 10.1093/bioinformatics/btac308.

[bib113] 113. Beyer W., Novak A.M., Hickey G., Chan J., Tan V., Paten B., Zerbino D.R. Sequence tube maps: making graph genomes intuitive to commuters. Bioinformatics. 2019;35:5318–5320. doi: 10.1093/bioinformatics/btz597.

[bib114] 114. Li H. Protein-to-genome alignment with miniprot. Bioinformatics. 2023;39 doi: 10.1093/bioinformatics/btad014.

[bib115] 115. Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., Madden T.L. BLAST+: architecture and applications. BMC Bioinf. 2009;10:421. doi: 10.1186/1471-2105-10-421.

[bib116] 116. Pertea G., Pertea M. GFF Utilities: GffRead and GffCompare. F1000Res. 2020;9:ISCB Comm J-304. doi: 10.12688/f1000research.23297.2.

[bib117] 117. Castro-Mondragon J.A., Riudavets-Puig R., Rauluseviciute I., Lemma R.B., Turchi L., Blanc-Mathieu R., Lucas J., Boddie P., Khan A., Manosalva Pérez N., et al. JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2022;50:D165–D173. doi: 10.1093/nar/gkab1113.

[bib118] 118. Paysan-Lafosse T., Blum M., Chuguransky S., Grego T., Pinto B.L., Salazar G.A., Bileschi M.L., Bork P., Bridge A., Colwell L., et al. InterPro in 2022. Nucleic Acids Res. 2023;51:D418–D427. doi: 10.1093/nar/gkac993.

[bib119] 119. Abramson J., Adler J., Dunger J., Evans R., Green T., Pritzel A., Ronneberger O., Willmore L., Ballard A.J., Bambrick J., et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630:493–500. doi: 10.1038/s41586-024-07487-w.

[bib120] 120. Zhang Y., Skolnick J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 2005;33:2302–2309. doi: 10.1093/nar/gki524.

[bib121] 121. Meng E.C., Goddard T.D., Pettersen E.F., Couch G.S., Pearson Z.J., Morris J.H., Ferrin T.E. UCSF ChimeraX: Tools for structure building and analysis. Protein Sci. 2023;32 doi: 10.1002/pro.4792.
