Skip to main content
iScience logoLink to iScience
. 2020 Oct 9;23(11):101662. doi: 10.1016/j.isci.2020.101662

African Arowana Genome Provides Insights on Ancient Teleost Evolution

Shijie Hao 1,2,9, Kai Han 2,9, Lingfeng Meng 1,2, Xiaoyun Huang 2, Wei Cao 3, Chengcheng Shi 1,2, Mengqi Zhang 2, Yilin Wang 2, Qun Liu 2, Yaolei Zhang 2,5, Haixi Sun 3, Inge Seim 6,7, Xun Xu 2,3,8, Xin Liu 2,3,4,, Guangyi Fan 2,3,4,10,∗∗
PMCID: PMC7586111  PMID: 33134892

Summary

Osteoglossiformes is a basal clade of teleost, evolving since the Jurassic period. The genomes of Osteoglossiformes species would shed light on the evolution and adaptation of teleost. Here, we established a chromosome-level genome of African arowana. Together with the genomes of pirarucu and Asian arowana, we found that they diverged at ∼106.1 million years ago (MYA) and ∼59.2 MYA, respectively, which are coincident with continental separation. Interestingly, we identified a dynamic genome evolution characterized by a fast evolutionary rate and a high pseudogenization rate in African arowana and pirarucu. Additionally, more transposable elements were found in Asian arowana which confer more gene duplications. Moreover, we found the contraction of olfactory receptor and the expansion of UGT in African arowana might be related to its transformation from carnivore to be omnivore. Taken together, we provided valuable genomic resource of Osteoglossidae and revealed the correlation of biogeography and teleost evolution.

Subject Areas: Evolutionary Biology, Phylogenetics, Paleobiology, Paleogenetics

Graphical Abstract

graphic file with name fx1.jpg

Highlights

  • An evolutionary model of Osteoglossidae along the continental drift is provided

  • A faster evolving rate of African arowana than Asian arowana is revealed

  • The gene duplications of Asian arowana are related to more class I TE insertions

  • A mechanism of African arowana’s feeding habits transition is proposed.


Evolutionary Biology; Phylogenetics; Paleobiology; Paleogenetics

Introduction

Osteoglossiformes is an ancient group of teleosts, which comprises five living groups including Hiodontidae, Osteoglossidae, Pantodontidae, Notopteridae and Mormyridae. Osteoglossidae contains two clades of Osteoglossinae and Heterotidinae, with species distributing in Asia, America, Africa, and Australia (Wilson and Murray, 2008). The existence of Osteoglossiformes can be dated back to the Jurassic period according to fossil evidences (Lavoue and Sullivan, 2004; Wilson and Murray, 2008), thus current species in Osteoglossiformes should had witnessed the break-up of the Gondwana supercontinent (Cioffi et al., 2019; Kumazawa and Nishida, 2000; Lavoue, 2016). Therefore, Osteoglossiformes species, serving as models for biogeography, have been extensively studied in morphological and molecular evolution (Hilton, 2001, 2003; Kumazawa and Nishida, 2000; Lavoue and Sullivan, 2004; TANG et al., 2004), and also provide evidences for paleogeology. Previous efforts have been made to decode genomes of Osteoglossiformes species (Bian et al., 2016; Du et al., 2019; Gallant et al., 2017), while more whole-genome sequences, especially those with chromosome information and comprehensive genome comparisons, which would further illustrate the evolutionary process of Osteoglossiformes.

African arowana (or African bonytongue, Heterotis niloticus), pirarucu (Arapaima gigas), and Asian arowana (Scleropages formosus) are three representative species of Osteoglossidae in Osteoglossiformes with some morphological differences (Adite et al., 2017; Axelrod et al., 1986; saint-Paul, 1986). African arowana is the only omnivore in Osteoglossiformes (Adite et al., 2013; Oliveira et al., 2019), distributing majorly in Africa, compared to pirarucu mainly in South America and Asian arowana in Southeast Asia. Despite their differences in habitats and morphology, these three species are relatively closely related with similar behaviors and physiological characters (Monentcham et al., 2009; Núñez et al., 2011; Scott and Fuller, 1976), making them good representative species for investigating the genetic basis of the ancient teleost clade (Betancur et al., 2017). In this study, we assembled the genome of African arowana using advanced sequencing and library-building technologies. In addition, together with the available genome sequences of Asian arowana and pirarucu, we comprehensively analyzed the genome evolution of Osteoglossiformes and illustrated the evolutionary features of Osteoglossiformes.

Results

Sequencing and Assembly of a Chromosome-Level African Arowana Genome

In order to sequence and assemble the African arowana genome, we applied single tube long fragment read (stLFR) technology (Wang et al., 2018) on BGISEQ-500 sequencing platform and generated 144.36 Gb (∼186×) data (Table S1). In total, ∼669.7 Mb (∼99% of the estimated genome size, 673.41Mb, Figure S1) genomic sequences were assembled with a scaffold N50 of ∼9.62 Mb. To further improve the continuity, we sequenced ∼10.2 Gb (∼13.1×) Nanopore long reads to fill the gaps. With these long reads, the contig N50 was further improved from 255.6 Kb to 2.31 Mb (Table S2) using TGS-GapCloser (Xu et al., 2019). To anchor the scaffold sequences to chromosomes, we constructed a Hi-C library and sequenced ∼21.2 Gb Hi-C data and thus ∼650.44 Mb sequences were anchored to 20 chromosomes (Figures 1A, S2, and Table S3), which was consistent with the previous report on African arowana karyotype (Oliveira et al., 2019). The complete African arowana mitochondria genome was assembled using MitoZ (Meng et al., 2019), and it was almost identical to the published African arowana mitochondrial genome (Figure S3), thus indicating the correctness of sampling and species identification. Finally, by using BUSCO (Benchmarking Universal Single-Copy Orthologs) (Simao et al., 2015), we found that ∼97.6% of the complete vertebrate BUSCO genes were covered by our assembly(Table S4), providing further evidence for the fine quality of the assembled genome.

Figure 1.

Figure 1

The Evolution History of Asian arowana, African arowana and Pirarucu

(A) The characteristics of the assembled H. niloticus genome. The tracks from outer to inner represent the gene density, TE density, tandem repeat density, GC content and non-coding RNA respectively.

(B) The phylogenetic relationships of 10 teleost fishes with L. oculatus as outgroup. The numbers on clades represent the evolutionary rates (dN + dS). The numbers beside the inner nodes represent the support values of the nodes (The asterisk represents a support value of 100).

(C) pirarucu and Asian arowana’ divergence pattern through the continents drift.

We then carried out genome annotation to identify repeats and protein-coding genes of African arowana. About 18.74% of this genome was annotated and identified as “repetitive sequences”, and DNA transposable elements (TEs) are the most abundant. We also predicted 24,146 protein-coding genes with combinational annotation methods (de novo prediction, homology-based prediction and RNA-seq-based prediction) in this genome (Table S3), of which ∼89.5% were found to have homologs in public databases (Kyoto Encyclopedia of Genes and Genomes, Swiss-Prot, Translated EMBL, NCBI non-redundant proteins) with known functions (Table S5). Clustering with 10 other fishes, we identified 15,432 gene families in African arowana, of which 30 are unique to African arowana.

Speciation of Osteoglossiformes along with the Geographic Drift

To validate and add to the previously proposed model of Gondwana origin and plate tectonic transportation of Osteoglossiformes species (Kumazawa and Nishida, 2000), we constructed the phylogenetic tree of the representative species in Osteoglossiformes. Collecting the ten teleost species (African arowana, Asian arowana, pirarucu, American eel (Anguilla rostrate), Atlantic salmon (Salmo salar), Kingsley kuhikpea (Paramormyrops kingsleyae), Japanese puffer (Takifugu rubripes), Japanese rice fish (Oryzias latipes), three-spined stickleback (Gasterosteus aculeatus), Zebrafish (Danio rerio)) with available genome sequences and spotted gar (Lepisosteus oculatus) as an outgroup species, we identified 355 single-copy gene families (one orthologous gene in each species) and then used them to build the phylogenetic tree, reflecting the relationship and evolution of teleost (Figure 1B). The divergence time of each internal node was calculated using MCMCTree (Yang, 2007) with the calibration of previous molecular or fossil researches obtained from TimeTree (http://www.timetree.org/, Table S6). In this phylogenetic tree, the divergence time between Asian arowana and the common ancestor of African arowana and pirarucu was ∼106.1 million years ago (MYA), which was moderate to previous researches (Cioffi et al., 2019; Du et al., 2019; Vialle et al., 2018). And the divergence time estimated here was close to the final separation time of South American and African continents in Afro-South American drift of Gondwana supercontinent happened at ∼110 MYA (Rogers and Santosh, 2004). Considering the previous evidences supporting that (1) Osteoglossinae fishes speciated along with the separation of South America, Antarctic, Australia, and Southeast Asia from ∼50.3MYA (Cioffi et al., 2019); (2) Presently, Asian arowana only lives in Southeast Asia (Mu et al., 2012); and (3) Africa has been identified as the taxonomic diversity center of Osteoglossiformes (Wilson and Murray, 2008), we proposed that the ancestor of Asian arowana had migrated from Africa to Southeast Asia before or during the tectonic-mediated Gondwanan fragmentation (especially the fragmentation of Africa-South America, South America-Antarctica-Australia and the fragmentation of Southeast Asia-Australia) (Figure 1C). Then, the divergence time between African arowana and pirarucu was estimated as ∼59.2 MYA (Figure 1B), which was also close to the split time of North American and Eurasian continents (∼65 MYA or later) (Rafferty, 2010; Rogers and Santosh, 2004). Additionally, a Paleocene (56–65Ma) Heterotidinae fossil was discovered in North America (Guo-Qing and Wilson, 1996) and classified as an outgroup of African arowana and pirarucu (Wilson and Murray, 2008). Thus, we concluded that the ancestor of African arowana and pirarucu might live in both North America and Eurasia continents of Gondwana supercontinent, and after the split of these continents, the two species have evolved separately, while pirarucu had spread to South America after the formation of Isthmus of Panama (Figure 1C). Hence, by using the whole-genome data analysis, we proposed the association between the speciation of Osteoglossiformes species and the paleo-geographical changes and improved the previous model for speciation of these species.

Main Distinct Genomic Evolution Events of Three Osteoglossidae Fishes during Their Adaptations to New Environments

Dynamic Evolution Rate

In addition to phylogenetic analysis, we further investigated these three genomes in detail to reveal the genome evolution during the long period after speciation. First, we calculated the dN and dS (the substitution rates at non-synonymous and synonymous sites) of 355 single-copy gene families in each clade. Comparing to the common ancestor, we found that the dN and dS values of Asian arowana (average dN: 0.024; average dS: 0.253) were lower than those of African arowana (average dN: 0.032; average dS: 0.317) and pirarucu (average dN: 0.032; average dS 0.307), indicating higher mutation rate and faster evolution in African arowana and pirarucu compared to Asian arowana (Figure 1B). In order to further validate these results, we identified 7,699 single-copy gene families among three Osteoglossidae fishes and spotted gar. For these orthologous gene families, we calculated the average dN and dS of these three Osteoglossidae species and also found that the dN and dS of African arowana (average dN: 0.039; average dS: 0.335) and pirarucu (average dN: 0.040; average dS: 0.346) were significantly greater than those of Asian arowana (average dN: 0.027; average dS: 0.242). The Wilcoxon rank-sum test showed that the differences between Asian arowana and either African arowana or pirarucu were statistically significant (p-value<0.05, Figure 2A). Besides, we investigated the number of pseudogenes (the remains of malfunctional genes because of mutation accumulation) of these three Osteoglossidae species and found more pseudogenes in African arowana (350) and pirarucu (399) than in Asian arowana (242).

Figure 2.

Figure 2

The Evolution Rate of Asian arowana, African arowana and Pirarucu

(A) The dS and dN distribution of Asian arowana, African arowana, and pirarucu. The statistic significancy was calculated by Wilcoxon rank-sum test and three asterisks indicate a p value that less than 2.22∗e−16 and two asterisks indicate a p value that equal to 2.5 × 10−11 while one asterisks indicate a p value that equal to 0.9.

(B) Syntenic pattern of African arowana and Asian arowana.

(C) Syntenic pattern of intra-African arowana.

(D) Syntenic pattern of intra-Asian arowana.

We also explored the evolution rate of these three Osteoglossidae species' conserved regions. Whole-genome alignments of these three Osteoglossidae species were implemented firstly with spotted gar as a reference. Then, we calculated the genetic distance (TN93 model) in the resulting 30 Mb conserved regions (remain in four species) between each Osteoglossidae species and spotted gar, respectively. And this analysis revealed that the average distance of African arowana (0.358) was significantly larger than Asian arowana (0.353) and pirarucu (0.351) (Wilcoxon rank-sum test, p value<2.22 × 10−16, Figure S4). Moreover, we identified syntenic blocks of African arowana and Asian arowana to further investigate the evolution of their genomes. We found a distinct colinear relationship between African arowana and Asian arowana, indicating slight chromosomal structural variations occurred between their genomes (Figure 2B). We also found a rough chromosomal one-to-one colinear relationship in Asian arowana itself, whereas more scattered self-syntenic blocks were detected in African arowana genome than in Asian arowana (Figures 2C and 2D). These results provided additional evidences supporting the faster evolution rate of African arowana, which had resulted in more variations in the paralogous chromosomes inheriting from the TS-WGD events (∼350 MYA) (Glasauer and Neuhauss, 2014).

Extra Class I TE Insertion in Asian Arowana

Other than the faster evolution rate in African arowana and pirarucu genomes, the genome sizes of these two species (669 Mb and 667 Mb, respectively) are also notably smaller than that of the Asian arowana genome (785 Mb). Thus, we then investigated the possible mechanisms underlying the smaller genome sizes. Performing the same combinational annotation methods (de novo- and homology-based methods), we annotated the TEs in all three Osteoglossidae fishes. Looking into the repeat content, we found it was substantially less in African arowana and pirarucu genomes (18.7% and 18.2%, respectively) than in Asian arowana genome (29.5%). The extra insertion in repeat content (∼100 Mb) may explain the differences of genome sizes (Table S7). Further investigating the different categories of TEs, we found the LINEs (long interspersed nucleotide elements) and LTRs (long terminal repeats) proportions were notably different among African arowana (LINEs: 5.54%, 37.10 Mb and LTRs: 3.08%, 20.63 Mb), pirarucu (LINEs: 3.31%, 22.11 Mb and LTRs: 4.61%, 30.74 Mb) and Asian arowana (LINEs: 16.36%, 128.34 Mb and LTRs: 12.33%, 96.77 Mb) (Figure 3A). Although, DNA TE proportions were comparable in 3 species, the most abundant DNA TE class of Asian arowana was TcMar-Tigger while TcMar-Tc1 was the most abundant in both African arowana and pirarucu, and the distinctly expanded clade of TcMar-Tigger of Asian arowana indicated a recent fast insertion event (Figures 3B and 3C).

Figure 3.

Figure 3

Transposable Element (TE) Dynamics of Pirarucu, African Arowana and Asian Arowana

(A) The distribution of three Osteoglossidae fishes' TEs.

(B) The phylogenetic trees of three Osteoglossidae fishes' Tc1 TEs.

(C) The phylogenetic trees of three Osteoglossidae fishes' Tigger TEs.

(D) The insertion timeline of LINE TEs of three Osteoglossidae fishes.

(E) The insertion timeline of LTR TEs of three Osteoglossidae fishes.

(F) An example of the positional relationship of Asian arowana's genes and TEs, in which the blue bars indicate the genes and the red bar indicate the TEs. In this case, all of the five genes are olfactory receptors.

Additionally, we estimated the relative insertion time of LINEs and LTRs through the Ty3 reverse transcriptase (RT) genetic distance to the outgroup (all the Ty1 LTR RTs sequences of Asian arowana) and found that Asian arowana had a different LINE insertion time peak compared to African arowana and pirarucu, as well as an extra LTR insertion time peak which was close to its LINE insertion peak, indicating that Asian arowana had experienced a specific period (Figures 3D and 3E). To further figure out the function of the additional TEs insertion event, we investigated the TE coverage of all genes of these 3 species. In Asian arowana, we identified 1,261 genes located in TE-inserted regions, while this number in pirarucu and African arowana was only 464 and 73, respectively. The genes covered by TEs were concatenated together and had the same function (Figures 3F and S5). Functional enrichment analysis showed that these genes in Asian arowana were mainly related to olfactory transduction, NOD-like receptor signaling and phagosome pathways (Table S8, q-value<0.01). In summary, we found the genome of Asian arowana had gone through more changes due to the extra insertion of TEs, and also multiple gene families had expanded along with the copy and paste of TEs, which might be related to their adaptions to more variable environment after the first Gondwana split event than African arowana and pirarucu.

Dynamic Evolution of Gene Families

Since we have observed a higher evolution rate in African arowana and more TEs insertion in Asian arowana, their effects on gene content were further evaluated by gene family analysis. We found the average gene family size of African arowana (1.38 gene per family) was less than Asian arowana and pirarucu (both 1.47 gene per family), possibly indicating a higher evolutionary rate based on a published research (Chen et al., 2010). Overall, 14,125 gene families were shared by all these three species, and 1,040 gene families were unique to pirarucu (Figure 4A) which was more than that in Asian arowana (384) and African arowana (226). The unique gene families of Asian arowana were related to salivary secretion and olfactory transduction, whereas those of pirarucu were related to cell growth and death (necroptosis and apoptosis), and cellular community (tight junction, adherens junction, gap junction, and focal adhesion). Additionally, eight UGT (K00699) genes were found in the unique gene families of African arowana. UGT genes have been shown to play a critical role in many metabolism pathways such as ascorbate and aldarate metabolism, retinol metabolism, steroid hormone biosynthesis, and porphyrin & chlorophyll metabolism, which might be related to its special omnivorous diet character (Kakehi et al., 2015; Kawai et al., 2018) (Table S9, q-value<0.01).

Figure 4.

Figure 4

Gene Family Dynamics of Three Osteoglossidae Fishes

(A) The venn diagram represents the overlap relationship of three Osteoglossidae fishes' gene families.

(B) The gene tree of three Osteoglossidae fishes' OR genes in which the bold clades indicated two gene expansions of pirarucu.

(C) The insertion timeline of OR genes.

We then detected 1,210, 424, and 829 expanded gene families in pirarucu, African arowana, and Asian arowana, respectively, (Figure S6). KEGG functional enrichment analysis of these expanded gene families showed that both pirarucu and Asian arowana had experienced significant expansion in 14 pathways including olfactory transduction, salivary secretion, cell adhesion molecules (CAMs) and NOD-like receptor signaling, which were not found in African arowana (Table S10, q-value<0.01). In contrast, the gene family contraction of African arowana (2,510) was more than Asian arowana (2,010) and pirarucu (1,151), which was related to olfactory transduction, tight junction, cellular senescence, TGF-beta signaling pathway (Table S11, q-value<0.01). More importantly, the expansion of UGT genes in African arowana further led to the expansion of several metabolic pathways.

Possible Genetic Mechanisms Underpinning Diet Change of African Arowana

African arowana is an omnivore that has a wide range of preys including small benthic fishes, shrimps, plants, and insects (Adite et al., 2013; Monentcham et al., 2009). In contrast, its closely related species, pirarucu and Asian arowana, are predominantly dependent on fish preys (Natalia, 2004; Saint-Paul, 1986). In order to explore the genetic mechanisms of diet change in African arowana, we comprehensively examined the taste receptors of all tastes including sweet, umami, bitter, sour, and salty in these three genomes and found no significant differences in expansion or contraction (Table S11). The vertebrates have three kinds of odorant receptors including olfactory receptors (ORs), vomeronasal receptors V1R and V2R (Alioto and Ngai, 2005). Thus, then we carried out a comparative analysis of the odorant receptors of these three genomes. The V1R genes and V2R genes in pirarucu, African arowana, and Asian arowana showed no obvious expansion (Table S12). However, we found that OR genes (K04257) were significantly contracted in African arowana (40) comparing with pirarucu (70) and Asian arowana (160). Through the gene tree of ORs, we further observed that Asian arowana had kept more gene copies in most clades of OR, while pirarucu had experienced contraction in several clades except for two clades (marked by bold clades, Figure 4B). In addition, in African arowana, almost all OR gene clades contained fewer members than that of Asian arowana and pirarucu. Estimation of the expanding time for OR genes showed that Asian arowana had gained much more OR gene copies during the whole timeline and had been through an extra insertion period, which can also be observed in pirarucu while only ancient OR expanding events were inferred in African arowana (Figure 4C). Given the results that the olfactory transduction pathway gene family underwent dynamic evolution in history and the expansion of UGT genes, we concluded that the change of OR genes and UGT genes might play a key role in the diet transition of African arowana.

Discussion

Continental drift leads to severe environmental change and geographical separation which would be the reason and driving force for differentiation and speciation (Chen et al., 2018; Dodd and Afzal Rafii, 2001). Phylogenetic relationships among those species which spread across different continents are of great interests for scientist from the establishment of plate tectonics (Casadevall et al., 2017; Sterrer, 1973; Wolfson, 1948). The evolutionary history of freshwater fish was also associated with the continental motion and discussed lively because of the ability to migrate across ocean for some species (Nakatani et al., 2011; Sparks and Smith, 2005). The Osteoglossidae species spread across all continents except Antarctica, serving as a typical subject to investigate the association between its speciation and continental drift. Here, we chose three representative Osteoglossidae species to investigate their divergence and revealed their genomics differences related to the continental motion.

We first assembled the genome of African arowana with advanced library-building and sequencing technologies because the genomes of Asian arowana and pirarucu were available. The stLFR technology was applied because of its cost-saving and convenience to get a better assembly without the need of multiple insertion libraries (Wang et al., 2018). The nanopore sequencing technology was applied to extend the genome continuity for its long reads. Also, we employed the power of Hi-C technology to link the scaffolds into pseudo-chromosomes. Eventually, we got the final African arowana genome with a better quality than recent researches about genome assemblies (Li et al., 2020; Xie et al., 2020). The genome sequences of African arowana provided important genetic resources for further researches.

The phylogenetic analysis in this study revealed a possible speciation track of Osteoglossidae species. Within 11 fishes' genome, we used conserved single-copy gene families which is reliable for phylogeny construction (Aguileta et al., 2008) to reveal the speciation history. By using MCMCtree and the calibration of published speciation time, we surprisingly found both the divergence time between African arowana & Asian arowana, and African arowana & pirarucu were consistent with the time of continental separation. The coincidences were supported by their geographical distribution and the fact that Africa is the biodiversity center of Osteoglossiformes and Heterotidinae (includes Africa arowana and pirarucu) fossil record found in North America. Together with the model of published Asian arowana's speciation history, we proposed a more complete model to reveal the speciation history of Osteoglossidae. Our model not only provides the possible speciation path of Asian arowana and pirarucu but will also guide the researches on paleontology in the future. Moreover, our results also hint a possibility of population genetic research to investigate their population history.

To reveal the evolutionary process they experienced, we focused on the genomic difference of three Osteoglossidae species. The faster evolutionary rate, less expansion and more contraction of gene families of African arowana were found. The association of evolutionary rate and dynamic changes of gene families was investigated previously, hinting a possible causality between African arowana's faster evolution rate and gene family contraction (Chen et al., 2010). We also found more class I TE insertions together with more genes covered by TE insertions which are concatenated in position and duplicated in function in Asian arowana genome. Published researches had reported the relation of genes and TEs in human genome and plant genome and had interpreted the effect of TEs on gene creation, gene evolution and genome rearrangement (Bennetzen, 2005; Nekrutenko and Li, 2001). Therefore, the evolutionary dynamics of gene families in African arowana and more TE insertions of Asian arowana probably play a key role in their adaptive process to new environments. Moreover, we identified an expansion of SINE/5S elements in pirarucu whose function need to be further characterized (Figure S7).

We observed a significant difference in gene number of OR family among the three Osteoglossidae fishes, while the taste receptors, other odorant receptors V1R and V2R were conserved among them. Together with the expansion of UGT family in African arowana genome, we proposed a possible genetic mechanism underlying the diet change of African arowana. Diet change and the genomic evolution process had been investigated broadly (Perry et al., 2007; Schondube et al., 2001). Our research provides a case to investigate this phenomenon and a view to explain this process. However, when this change started and whether it happened on pirarucu is unknown because pirarucu experienced a similar contraction to African arowana in several clades of OR genes.

Limitations of the Study

The data provided here are not sufficient to answer all questions we put forward. More researches will be required to conduct in the future such as the fossils evidence searching in Africa and South America. The impact of TEs insertion on new gene and gene expression regulation in Asian arowana also needs further study. The diet transition of African arowana and the inter-continental emigration of bonytongues should be transferable environments of freshwater fishes particularly for those living in the same period and similar environment with Osteoglossidae. Therefore, more evidences should be revealed in the future. Molecular biological experiments such as RNAi, gene knockout or genetic modification to verify the genetic mechanism underpinning the diet transition are also needed. Moreover, the additional de novo genomic researches and comparative genomic researches on Osteoglossidae and other Osteoglossiformes fish, which will help us to understand the evolution of this ancient teleost clade such as 10000 fish genome project (Fish10K) (Fan et al., 2020). The resequencing genomic and ancient genomic studies about these three Osteoglossiformes fishes were required to disclose the population structure, migration history, and genomic changes in their own genome along with the time.

Resource Availability

Lead Contact

Further information and requests for resources should be directed to and will be fulfilled by the Lead Contact, Guangyi Fan (fanguangyi@genomics.cn).

Materials Availability

There is no resulting materials generated by this study.

Data and Code Availability

The accession numbers for the genome sequencing data, RNA sequencing data, and genome assembly reported in this paper are CNGBdb: CNP0001313 and NCBI: PRJNA665338.

Methods

All methods can be found in the accompanying Transparent Methods supplemental file.

Acknowledgments

This work was supported by the National Key Research and Development Program of China (No. 2018YFD0900301-05). The work also received the technical support from China National GeneBank and Guangdong Provincial Key Laboratory of Genome Read and Write (No. 2017B030301011).

Author Contributions

Shijie Hao and Guangyi Fan conceived and designed the study. Mengqi Zhang and Yilin Wang performed sample collection and sequencing. Xiaoyun Huang performed assembly. Lingfeng Meng performed genome annotation and partial phylogenetic analysis. Kai Han performed pseudogene related analysis and partial phylogenetic analysis and designed the Figures 1 and 4. Shijie Hao performed partial genome assembly, genome annotation and phylogenetic analysis, designed the Figures 2 and 3, wrote the manuscript. All other authors revised and read the manuscript.

Declaration of Interests

The authors declare no competing interests.

Published: November 20, 2020

Footnotes

Supplemental Information can be found online at https://doi.org/10.1016/j.isci.2020.101662.

Contributor Information

Xin Liu, Email: liuxin@genomics.cn.

Guangyi Fan, Email: fanguangyi@genomics.cn.

Supplemental Information

Document S1. Transparent Methods, Figures S1–S7, and Tables S1–S13
mmc1.pdf (1.3MB, pdf)

References

  1. Adite A., Ediye M.M., Toko I.I., Abou Y., Imorou R.S., Sonon S.P. Morphological and meristic characterization of the african bonytongue, Heterotis niloticus (cuvier, 1829), from lake hlan and sô river, southern Benin, west Africa: the need for habitat protection and species conservation. Int. J. Fish. Aquat. Res. 2017;2:16–28. [Google Scholar]
  2. Adite A., Gbankoto A., Toko I.I., Fiogbe E.D. Diet breadth variation and trophic plasticity behavior of the African bonytongue Heterotis niloticus (Cuvier, 1829) in the Sô River-Lake Hlan aquatic system (Benin, West Africa): implications for species conservation and aquaculture development. Nat. Sci. 2013;05:1219–1229. [Google Scholar]
  3. Aguileta G., Marthey S., Chiapello H., Lebrun M.H., Rodolphe F., Fournier E., Gendrault-Jacquemard A., Giraud T. Assessing the performance of single-copy genes for recovering robust phylogenies. Syst. Biol. 2008;57:613–627. doi: 10.1080/10635150802306527. [DOI] [PubMed] [Google Scholar]
  4. Alioto T.S., Ngai J. The odorant receptor repertoire of teleost fish. BMC Genomics. 2005;6:173. doi: 10.1186/1471-2164-6-173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Axelrod H.R., Burgess W.E., Pronek N., Walls J.G. TFH Publications Neptune City; 1986. Dr. Axelrod's Atlas of Freshwater Aquarium Fishes. [Google Scholar]
  6. Bennetzen J.L. Transposable elements, gene creation and genome rearrangement in flowering plants. Curr. Opin. Genet. Dev. 2005;15:621–627. doi: 10.1016/j.gde.2005.09.010. [DOI] [PubMed] [Google Scholar]
  7. Betancur R.R., Wiley E.O., Arratia G., Acero A., Bailly N., Miya M., Lecointre G., Orti G. Phylogenetic classification of bony fishes. BMC Evol. Biol. 2017;17:162. doi: 10.1186/s12862-017-0958-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bian C., Hu Y., Ravi V., Kuznetsova I.S., Shen X., Mu X., Sun Y., You X., Li J., Li X. The Asian arowana (Scleropages formosus) genome provides new insights into the evolution of an early lineage of teleosts. Sci. Rep. 2016;6:24501. doi: 10.1038/srep24501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Casadevall A., Freij J.B., Hann-Soden C., Taylor J. Continental drift and speciation of the cryptococcus neoformans and cryptococcus gattii species complexes. mSphere. 2017;2:e00103-17. doi: 10.1128/mSphere.00103-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chen F.C., Chen C.J., Li W.H., Chuang T.J. Gene family size conservation is a good indicator of evolutionary rates. Mol. Biol. Evol. 2010;27:1750–1758. doi: 10.1093/molbev/msq055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chen J., Hao Z., Guang X., Zhao C., Wang P., Xue L., Zhu Q., Yang L., Sheng Y., Zhou Y. Liriodendron genome sheds light on angiosperm phylogeny and species–pair differentiation. Nat. Plants. 2018;5:18–25. doi: 10.1038/s41477-018-0323-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Cioffi M.B., Rab P., Ezaz T., Bertollo L.A.C., Lavoue S., Oliveira E.A., Sember A., Molina W.F., Souza F.H.S., Majtanova Z. Deciphering the evolutionary history of arowana fishes (teleostei, Osteoglossiformes, Osteoglossidae): insight from comparative cytogenomics. Int. J. Mol. Sci. 2019;20:4296. doi: 10.3390/ijms20174296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Dodd R.S., Afzal Rafii Z. Evolutionary genetics of mangroves: continental drift to recent climate change. Trees. 2001;16:80–86. [Google Scholar]
  14. Du K., Wuertz S., Adolfi M., Kneitz S., Stock M., Oliveira M., Nobrega R., Ormanns J., Kloas W., Feron R. The genome of the arapaima (Arapaima gigas) provides insights into gigantism, fast growth and chromosomal sex determination system. Sci. Rep. 2019;9:5293. doi: 10.1038/s41598-019-41457-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fan G., Song Y., Yang L., Huang X., Zhang S., Zhang M., Yang X., Chang Y., Zhang H., Li Y. Initial data release and announcement of the 10,000 fish genomes project (Fish10K) Gigascience. 2020;9:giaa080. doi: 10.1093/gigascience/giaa080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gallant J.R., Losilla M., Tomlinson C., Warren W.C. The genome and adult somatic transcriptome of the mormyrid electric fish paramormyrops kingsleyae. Genome Biol. Evol. 2017;9:3525–3530. doi: 10.1093/gbe/evx265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Glasauer S.M., Neuhauss S.C. Whole-genome duplication in teleost fishes and its evolutionary consequences. Mol. Genet. Genomics. 2014;289:1045–1060. doi: 10.1007/s00438-014-0889-2. [DOI] [PubMed] [Google Scholar]
  18. Guo-Qing L., Wilson M.V.H. The discovery of Heterotidinae (teleostei: Osteoglossidae) from the Paleocene paskapoo formation of alberta, Canada. J. Vertebr. Paleontol. 1996;16:198–209. [Google Scholar]
  19. Hilton E.J. Tongue bite apparatus of osteoglossomorph fishes: variation of a character complex. Copeia. 2001;2:372–381. [Google Scholar]
  20. Hilton E.J. Comparative osteology and phylogenetic systematics of fossil and living bony-tongue fishes (Actinopterygii, Teleostei, Osteoglossomorpha) Zool. J. Linn. Soc. 2003;137:1–100. [Google Scholar]
  21. Kakehi M., Ikenaka Y., Nakayama S.M., Kawai Y.K., Watanabe K.P., Mizukawa H., Nomiyama K., Tanabe S., Ishizuka M. Uridine diphosphate-glucuronosyltransferase (UGT) xenobiotic metabolizing activity and genetic evolution in pinniped species. Toxicol. Sci. 2015;147:360–369. doi: 10.1093/toxsci/kfv144. [DOI] [PubMed] [Google Scholar]
  22. Kawai Y.K., Ikenaka Y., Ishizuka M., Kubota A. The evolution of UDP-glycosyl/glucuronosyltransferase 1E (UGET1E) genes in bird lineages is linked to feeding habits but UGT2 genes is not. PLoS One. 2018;13:e0205266. doi: 10.1371/journal.pone.0205266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kumazawa Y., Nishida M. Molecular phylogeny of osteoglossoids: a new model for gondwanian origin and plate tectonic transportation of the asian arowana. Mol. Biol. Evol. 2000;17:1869–1878. doi: 10.1093/oxfordjournals.molbev.a026288. [DOI] [PubMed] [Google Scholar]
  24. Lavoue S. Was Gondwanan breakup the cause of the intercontinental distribution of Osteoglossiformes? A time-calibrated phylogenetic test combining molecular, morphological, and paleontological evidence. Mol. Phylogenet. Evol. 2016;99:34–43. doi: 10.1016/j.ympev.2016.03.008. [DOI] [PubMed] [Google Scholar]
  25. Lavoue S., Sullivan J.P. Simultaneous analysis of five molecular markers provides a well-supported phylogenetic hypothesis for the living bony-tongue fishes (Osteoglossomorpha: teleostei) Mol. Phylogenet. Evol. 2004;33:171–185. doi: 10.1016/j.ympev.2004.04.021. [DOI] [PubMed] [Google Scholar]
  26. Li F., Bian L., Ge J., Han F., Zhihong L., Xuming L., Yongsheng L., Zhishu L., Huilai S., Liu C. Chromosome-level genome assembly of the East Asian common octopus (Octopus sinensis) using PacBio sequencing and Hi-C technology. Mol. Ecol. Resour. 2020 doi: 10.1111/1755-0998.13216. [DOI] [PubMed] [Google Scholar]
  27. Meng G., Li Y., Yang C., Liu S. MitoZ: a toolkit for animal mitochondrial genome assembly, annotation and visualization. Nucleic Acids Res. 2019;47:e63. doi: 10.1093/nar/gkz173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Monentcham S.-E., Kouam J., Pouomogne V., Kestemont P. Biology and prospect for aquaculture of African bonytongue, Heterotis niloticus (Cuvier, 1829): a review. Aquaculture. 2009;289:191–198. [Google Scholar]
  29. Mu X.-d., Song H.-m., Wang X.-j., Yang Y.-x., Luo D., Gu D.-e., Luo J.-r., Hu Y.-c. Genetic variability of the Asian arowana, Scleropages formosus, based on mitochondrial DNA genes. Biochem. Syst. Ecol. 2012;44:141–148. [Google Scholar]
  30. Nakatani M., Miya M., Mabuchi K., Saitoh K., Nishida M. Evolutionary history of Otophysi (Teleostei) a major clade of the modern freshwater fishes Pangaean origin and Mesozoic radiation. BMC Evol. Biol. 2011;11:1–25. doi: 10.1186/1471-2148-11-177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Natalia Y. Characterization of digestive enzymes in a carnivorous ornamental fish, the Asian bony tongue Scleropages formosus (Osteoglossidae) Aquaculture. 2004;233:305–320. [Google Scholar]
  32. Nekrutenko A., Li W.-H. Transposable elements are found in a large number of human protein-coding genes. Trends Genet. 2001;17:619–621. doi: 10.1016/s0168-9525(01)02445-3. [DOI] [PubMed] [Google Scholar]
  33. Núñez J., Chu-Koo F., Berland M., Arévalo L., Ribeyro O., Duponchelle F., Renno J.F. Reproductive success and fry production of the paiche or pirarucu, Arapaima gigas (Schinz), in the region of Iquitos. Perú. Aquac. Res. 2011;42:815–822. [Google Scholar]
  34. Oliveira E.A., Bertollo L.A.C., Rab P., Ezaz T., Yano C.F., Hatanaka T., Jegede O.I., Tanomtong A., Liehr T., Sember A. Cytogenetics, genomics and biodiversity of the south American and african arapaimidae fish family (teleostei, Osteoglossiformes) PLoS One. 2019;14:e0214225. doi: 10.1371/journal.pone.0214225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Perry G.H., Dominy N.J., Claw K.G., Lee A.S., Fiegler H., Redon R., Werner J., Villanea F.A., Mountain J.L., Misra R. Diet and the evolution of human amylase gene copy number variation. Nat. Genet. 2007;39:1256–1260. doi: 10.1038/ng2123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Rafferty J.P. Rosen Education Service; 2010. Plate Tectonics, Volcanoes, and Earthquakes. [Google Scholar]
  37. Rogers J.J.W., Santosh M. Oxford University Press; 2004. Continents and Supercontinents. [Google Scholar]
  38. Saint-Paul U. Potential for aquaculture of south american freshwater fishes: a review. Aquaculture. 1986;54:205–240. [Google Scholar]
  39. Schondube J.E., Herrera M.L., Martinez del Rio C. Diet and the evolution of digestion and renal function in phyllostomid bats. Zoology (Jena) 2001;104:59–73. doi: 10.1078/0944-2006-00007. [DOI] [PubMed] [Google Scholar]
  40. Scott D.B.C., Fuller J.D. The reproductive biology of Scleropages formosus (Müller & Schlegel) (Osteoglossomorpha, Osteoglossidae) in Malaya, and the morphology of its pituitary gland. Fish Biol. 1976;8:45–53. [Google Scholar]
  41. Simao F.A., Waterhouse R.M., Ioannidis P., Kriventseva E.V., Zdobnov E.M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
  42. Sparks J., Smith W. Freshwater fishes, dispersal ability, and nonevidence: "gondwana life rafts" to the rescue. Syst. Biol. 2005;54:158–165. doi: 10.1080/10635150590906019. [DOI] [PubMed] [Google Scholar]
  43. Sterrer W. Plate tectonics as a mechanism for dispersal and speciation in interstitial sand fauna. Neth. J. Sea Res. 1973;7:200–222. [Google Scholar]
  44. Tang P.Y., Sivananthan J., Pillay S.O., Muniandy S. Genetic structure and biogeography of asian arowana (scleropages formosus) determined by microsatellite and mitochondrial DNA analysis. Asian Fish. Sci. 2004;17:81–92. [Google Scholar]
  45. Vialle R.A., de Souza J.E.S., Lopes K.P., Teixeira D.G., Alves Sobrinho P.A., Ribeiro-Dos-Santos A.M., Furtado C., Sakamoto T., Oliveira Silva F.A., Herculano Correa de Oliveira E. Whole genome sequencing of the pirarucu (Arapaima gigas) supports independent emergence of major teleost clades. Genome Biol. Evol. 2018;10:2366–2379. doi: 10.1093/gbe/evy130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Wang O., Chin R., Cheng X., Ka Wu M., Mao Q., Tang J., Sun Y., Anderson E., Lam H.K., Chen D. Single tube bead-based DNA co-barcoding for cost effective and accurate sequencing, haplotyping, and assembly. BioRxiv. 2018 doi: 10.1101/324392. [DOI] [Google Scholar]
  47. Wilson M.V.H., Murray A.M. Osteoglossomorpha: phylogeny, biogeography, and fossil record and the significance of key African and Chinese fossil taxa. Geol. Soc. Lond. Spec. Publ. 2008;295:185–219. [Google Scholar]
  48. Wolfson A. Bird migration and the concept of continental drift. Science. 1948;108:23–30. doi: 10.1126/science.108.2793.23. [DOI] [PubMed] [Google Scholar]
  49. Xie J., Zhao H., Li K., Zhang R., Jiang Y., Wang M., Guo X., Yu B., Kong H., Jiao Y. A chromosome-scale reference genome of Aquilegia oxysepala var. kansuensis. Hortic. Res. 2020;7:113. doi: 10.1038/s41438-020-0328-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Xu M., Guo L., Gu S., Wang O., Zhang R., Fan G., Xu X., Deng L., Liu X. TGS-GapCloser: fast and accurately passing through the Bermuda in large genome using error-prone third-generation long reads. bioRxiv. 2019 doi: 10.1101/831248. [DOI] [Google Scholar]
  51. Yang Z. Paml 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Transparent Methods, Figures S1–S7, and Tables S1–S13
mmc1.pdf (1.3MB, pdf)

Data Availability Statement

The accession numbers for the genome sequencing data, RNA sequencing data, and genome assembly reported in this paper are CNGBdb: CNP0001313 and NCBI: PRJNA665338.


Articles from iScience are provided here courtesy of Elsevier

RESOURCES