Abstract
The complete mitochondrial genome sequence of Acheilognathus hypselonotus is firstly described in this article. The total length of the mitogenome is 16,706 bp. It contains 13 protein-coding genes, 22 tRNA genes and 2 ribosomal RNA genes. The overall base composition of the H-strand is 29.03% A, 25.6% C, 28.13% T, and 17.25% G, with an A+T bias of 57.16%. The phylogenetic analysis result showed that the A. hypselonotus, A. typus and A. macropterus were a close relationship.
Keywords: Mitochondrial genome, Acheilognathus hypselonotus, phylogenetic analysis
Acheilognathus hypselonotus belongs to genus Acheilognathus in the family Cyprinidae of Cypriniformes, mainly distributed in the Qiantang River system in China. The body is high lateral flat and oval with a small and pointed head. In the upper part of its body, the posterior margin of each scale is grayish black.
The complete mitochondrial genome of A. hypselonotus first determined in this paper was expected to provide help on population genetics of A. hypselonotus and further molecular phylogenetic studies. The sample of A. hypselonotus in this article was collected from the Huangpu River system in China (121°0′46″E,31°8′35″N). And the specimen was deposited at the Key Laboratory of Fisher Equipment and Engineering, Ministry of Agriculture (contact person Lin Zhu, zhulin@fmiri.ac.cn), under the voucher number Ah20190702023. DNA libraries were prepared using Nextera DNA Flex Library Prep (Illumina, San Diego, CA, USA) and paired-end (150 bp each) sequencing on an Illumina HiSeq X Ten platform. Subsequently, the clean paired-end reads were assembled by the soft ARC (v1.1.4-beta).
The whole length of A. hypselonotus mitogenome was 16,706 bp. The nucleotide composition of the heavy strand was 29.03% A, 25.6% C, 28.13% T, and 17.25% G, with an A+T bias of 57.16%. It contains 13 protein-coding genes, 22 tRNAs and 2 rRNAs. Most genes were located on the heavy strand, but ND6 and 8 tRNA genes (tRNAGln, RNAAla, tRNAAsn, tRNACys, tRNATyr, tRNASer, tRNAGlu, tRNAPro) were encoded on the light strand. Most protein-coding genes initiated with ATG except for COI starting with GTG. It is also important to note that the majority of protein-coding genes is inferred to terminate with an incomplete stop codon T or TA- (ND2, COII, ATPase 6, COIII, ND3, ND4 and Cyt b), five protein-coding genes share the typical termination codon TAA (ND1, COI, ATPase 8, ND4L and ND6); ND5 uses TAG as a stop codon. The length of 12S (located between tRNAPhe and tRNAVal) and 16S (located between tRNAVal and tRNALeu) rRNA genes were 957 bp and 1643 bp, respectively.
To investigate the phylogenetic relationship, we downloaded the mitochondrial genome sequences of 16 currently available Cyprinidae species. The concatenated sequences of 13 protein-coding genes, 2 rRNAs genes and 22 tRNAs genes were aligned with the ClustalW program (Larkin et al. 2007). Using the Maximum Likelihood (ML) method (Stamatakis 2006), the phylogenetic tree was constructed (Figure 1) by MEGA6 (Tamura et al. 2013). The best-fitting model (GTR + I + G) was obtained as the optimization model by jModelTest (Posada 2008). The result indicates that the A. hypselonotus, A. typus and A. macropterus were a close relationship (Figure 1).
Figure 1.
The phylogenetic tree based on the 13 protein-coding genes, 2 rRNAs genes and 22 tRNAs genes of sixteen species. The bootstrap (1000 replicates test) supports for maximum likelihood (ML) method was indicated at each branch. A. hypselonotus was marked with a black arrow.
Funding Statement
This work was supported by Ministry of Science and Technology of the People's Republic of China, National Key Research and Development Projects of China [2020YFD0900502], the National Infrastructure of Fishery Germplasm Resources under grant number [2021DKA30470]; China Agriculture Research System of MOF MARA under grant [CARS-48]; Natural Science Foundation of Shanghai under grant [21ZR1479900] and Shanghai Sailing Program under grant [18YF1407500].
Disclosure statement
No potential conflict of interest was reported by the authors.
Data availability statement
The genome sequence data that support the findings of this study was openly available on the NCBI website with the accession number MZ334545 (https://www.ncbi.nlm.nih.gov/nuccore/MZ334545). The associated Bioproject, SRA, and Biosample numbers are PRJNA767253, SRR16219791 and SAMN21895032, respectively.
References
- Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. 2007. Clustal W and clustal X version 2.0. Bioinformatics. 23(21):2947–2948. [DOI] [PubMed] [Google Scholar]
- Posada D. 2008. jModelTest: phylogenetic model averaging. Mol Biol Evol. 25(7):1253–1256. [DOI] [PubMed] [Google Scholar]
- Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 22(21):2688–2690. [DOI] [PubMed] [Google Scholar]
- Tamura K, Stecher G, Peterson D, Filipski A, Kumar S.. 2013. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 30(12):2725–2729. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequence data that support the findings of this study was openly available on the NCBI website with the accession number MZ334545 (https://www.ncbi.nlm.nih.gov/nuccore/MZ334545). The associated Bioproject, SRA, and Biosample numbers are PRJNA767253, SRR16219791 and SAMN21895032, respectively.

