Abstract
To understand the phylogenetic position of Larimichthys polyactis within the family Sciaenidae and the phylogeny of this family, the organization of the mitochondrial genome of small yellow croaker was determined herein. The complete, 16,470 bp long, mitochondrial genome contains 37 mitochondrial genes (13 protein-coding, 2 ribosomal RNA and 22 transfer RNA genes), as well as a control region (CR), as in other bony fishes. Comparative analysis of initiation/termination codon usage in mitochondrial protein-coding genes of Percoidei species, indicated that COI in Sciaenidae entails an ATG/AGA codon usage different from other Percoidei fishes, where absence of a typical conserved domain or motif in the control regions is common. Partitioned Bayesian analysis of 618 bp of COI sequences data were used to infer the phylogenetic relationships within the family Sciaenidae. An improvement in harmonic mean -lnL was observed when specific models and parameter estimates were assumed for partitions of the total data. The phylogenetic analyses did not support the monophyly of Otolithes, Argyrosomus, and Argyrosominae. L. polyactis was found to be most closely related to Collichthys niveatus, whereby, according to molecular systematics studies, the relationships within the subfamily Pseudosciaenidae should be reconsidered.
Keywords: Larimichthys polyactis, mitochondrial genome, control region, partitioned Bayesian analysis, Sciaenidae
Introduction
Animal mitochondrial DNA, a 16–19 kb long circular molecule, comprises 37 genes encoding 22 transfer RNAs (tRNAs), 13 proteins, 2 ribosomal RNAs, and a putative control region (CR) (Brown, 1985; Boore, 1999). Due to its compactness, maternal inheritance, fast evolutionary rate compared to nuclear DNA, and the resulting short coalescence time, mitochondrial DNA sequence data have been widely used in phylogeny (Miya et al., 2003), taxonomic resolution (Poulin et al., 2004), and population genetic studies (Habib et al., 2010).
The small yellow croaker, Larimichthys polyactis, a very popular fish among consumers, is one of the most important commercial benthopelagic fishes in China and Korea. The species is extensively distributed in the Bohai, Yellow and East China Seas, global landing having reached 320 thousand metric tons in 2000 (Seikai National Fisheries Research Institute, 2001). Nevertheless, intense fishing has deflated catches in the Yellow and East China Seas, to such an extent that resources are now considered over-exploited (Lin et al., 2008). In the past, the focus has been on catch statistics, size composition, early life history and feeding habits (Xue et al., 2004; Yan et al., 2006; Wan and Sun, 2006). However, over recent years, the genetic characteristics of the small yellow croaker have also come under investigation by means of molecular markers (Meng et al., 2003; Lin et al., 2009; Xiao et al., 2009), although, information is still limited, and the complete mitochondrial genome sequence, critical in studies of genome evolution and species phylogeny, continues unavailable. Thus, much additional work is required to furnish important genetic and evolutionary data, essential for species conservation, management and phylogenetic analysis.
Sciaenidae is a diverse and commercially important family, comprising 68 genera and about 311 species (Nelson, 2006). Despite specific studies of morphological and molecular phylogeny, instabilities of the phylogenetic relationships within the group have not, as yet, been resolved, the methods employed in previous research on phylogenetic reconstruction based on molecular data having been, to some extent, empirical and simple. The non-implementation of advanced methods, such as ML or BI, best-fit evolutionary models for specific data, and the statistical testing of the different topologies derived from the same data matrix, has probably contributed to mismodeling and systematic error in analysis. Mismodeling itself commonly occurs when a single model, inappropriate for multiple-gene regions involving several models of evolution, as stem and loop in RNA, or codon positions in protein genes, is employed for those multiple data partitions best explained by separate models of DNA evolution. Another form of mismodeling is the case, when multiple data partitions, defined by the same general model, differ drastically in the specific model parameter estimates that maximize the likelihood score (Reeder, 2003; Brandley et al., 2005). An overall solution would be to apply appreciate models and their specified parameter estimates to each data partition, and subsequently incorporate this into a single ML tree-search (Yang, 1996). Methods for reconstructing phylogeny, based on partitioned data using Bayesian/Markov chain Monte Carlo (MCMC) methods, are now available (Huelsenbeck and Ronquist, 2001; Nylander et al., 2004). Since it more accurately models the data, the use of partition-specific modeling - in other words, partitioned Bayesian analysis - should reduce systematic error, thereby resulting in better likelihood scores and more accurate posterior probability estimates (Brandley et al., 2005).
In the present study, pre-defined complete mitochondrial genome sequences of the small yellow croaker were compared with those reported for Percoidei species, whereupon partitioned Bayesian analysis was applied to infer the phylogeny of Sciaenidae fishes.
Materials and Methods
Fish sample and DNA extraction
L. polyactis individuals were collected by trawling in the Zhoushan fishing grounds, East China Sea (Zhejiang Province, China). They were identified by morphology. Muscle tissue was removed and immediately preserved at −80 °C. Total DNA was extracted as per standard phenol-chloroform method (Sambrook and Russell, 2001) and visualized on 1.0% agrarose gels.
PCR amplification and sequencing
As shown in Table S1, 11 sets of primers that amplify contiguous, overlapping segments of the complete mitochondrial genome of L. polyactis, were used. The primers were designed from reported complete mitochondrial genome sequences for the large yellow croaker (Cui et al., 2009). Worthy of note, these primers are also very useful for amplifying the mitochondrial genomes of two other Sciaenidae species, Miichthys miiuy and Collichthys lucidus. Of the 11 pairs of primers, six (1F/R, 4F/R, 6F/R, 7F/R, 8F/R, and 10F/R) are capable of perfectly amplifying the mitochondrial genomes of both species. The total length of amplified products is approximately 12,300 bp. PCR assays were carried out in a final volume of 50 μL, each containing 5.0 μL of a 10xTaq Plus polymerase buffer, 0.2 mM of dNTP, 0.2 μM of the forward and reverse primers, 2 units of Taq Plus DNA polymerase with proof-reading characteristic (TIANGEN, Beijing, China), and 1 μL of DNA template. Cycling conditions were 94 °C for 4 min, followed by 35 cycles of 94 °C for 50 s, 60 °C for 60 s and 72 °C for 2–3 min, followed by 1 cycle of 72 °C for 10 min, PCR was performed on a PTC-200 thermocycler (MJ Research, USA). The resultant PCR products, first electrophoresed on a 1% agarose gel to check integrity, were then visualized by the Molecular Imager Gel Doc XR system (BioRad), followed by purification using a QIAEX II Gel Extraction Kit (Qiagen). The purified fragments were ligated into PMD18-T vectors (Takara, Japan) used to transform to TOP10 E. coli cells, according to standard protocol. Positive clones were screened via PCR with M13+/− primers. Amplicons were sequenced using an ABI 3730 automated sequencer (Applied Biosystems) with M13+/− primers.
Sequence analysis
The sequence fragments so obtained were edited in the Seqmen program (DNAstar, Madison, WI, USA) for contig assembly to obtain a complete mitochondrial genome sequence. Annotation of protein-coding and ribosomal RNA genes, and definition of their respective gene boundaries were carried out with DOGMA software (Wyman et al., 2004) through reference sequences of Percoidei, available in GenBank. tRNA genes and their secondary structures, were identified by means of tRNAscan-SE 1.21 software (Lowe and Eddy, 1997). Base composition, genetic distances, and codon usage were calculated in MEGA 4.0 software (Tamura et al., 2007). Putative OL and CR, and conserved motifs were identified via sequence homology and proposed secondary structure. The complete mitochondrial genome sequence, deposited in the GenBank database, can be accessed through Accession Number GU586227.
Phylogenetic analysis
Multiple alignments of the COI sequences from 30 Sciaenidae species were performed using the MEGA 4.1 version. The data matrix of COI sequences was partitioned by codon position (Table 1). The appropriate model of sequence evolution for each partition (Table 2) was determined using the jModeltest program (Guindon and Gascuel, 2003; Posada, 2008), under the Akaike Information Criterion (AIC). Bayesian analysis for each data partitioning strategy consisted of two separate runs with four Markov chains in the MrBayes3.1 program (Huelsenbeck and Ronquist, 2001). Each run, comprising 10 million generations, was systematically sampled at every 100. The first 25% of the trees were discarded as part of a burn-in procedure, and the remainder employed in constructing a 50% majority rule consensus tree. The results for each partitioning strategy were then compared using the Bayes factor as an aid in accepting the best phylogeny hypothesis for the sequence data matrix.
Table 1.
Partition strategy | Partition identity |
---|---|
P1 | All codon positions together |
P2 | First and second codon positions; third codon position |
P3 | First codon position; second codon position; third codon position |
Table 2.
Partition | Evolutionary model | Number of characters in partiton |
---|---|---|
COI | HKY+I+G | 618 |
COI 1st codon | GTR+G | 206 |
COI 2nd codon | GTR+I | 206 |
COI 3rd codon | HKY+I+G | 206 |
COI 1st +2nd codon | GTR+I+G | 412 |
Results and Discussion
Gene content, arrangement and base composition
The complete mitochondrial genome of L. polyactiswas 16, 470 bp long (Table 3), which is similar to those of not only teleost species but also terrestrial vertebrates. Its gene content conforms to the vertebrate consensus, by containing the highly conserved set of 37 genes encoding 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and 13 proteins, that are essential in mitochondrion respiration and adenosine triphosphate (ATP) production. Although, as in other vertebrates, most of the genes are encoded on the H-strand, besides ND6 and eight tRNAs (Glu, Ala, Asn, Cys, Tyr, Ser-UCN, Gln, and Pro) being also encoded on the L-strand, the genes are all similar in length to those of bony fishes (Oh et al., 2007, 2008). As reported in other vertebrates, there are four notable ‘overlaps’ between genes, as reported in other vertebrates, with the lengths of these overlaps are generally being fixed. Whereas ATPase 8 and ATPase 6 overlap by 10 bp, overlapping was also observed between ND4L and ND4 (seven bp), ND5 and ND6 (four bp), and ATPase 6 and COIII (one bp). The remainders are located between tRNA genes themselves, and between tRNA and protein-coding genes. The sizes of non-coding spacers range from 1 to 37 bp (Table 3). The largest of these, located between tRNA-Asn and tRNA-Cys, was recognized as the putative replicate origin of the L-strand|s. These non-coding spacers are interesting in the study of mtDNA evolutionary mechanisms. The base composition of L. polyactis was analyzed separately for rRNA, tRNA, and protein-coding genes (Table S2). In the latter, pronounced anti-G bias was observed at the third codon positions (8.5%), which are free from selective constraints on nucleotide substitution. Besides the A+T composition of the second codon position being relatively higher than in most Percoidei fishes, pyrimidines were over-represented in this position (61.5%). Already observed in other vertebrate mitochondrial genomes, this has been attributed to the hydrophobic character of the proteins (Naylor et al., 1995). L. polyactis tRNA genes are A+T rich (54.5%), as in other vertebrates, whereas rRNAs are A+C rich (59.3%), as in other bony fishes (Zardoya and Meyer, 1997; Cheng et al., 2010).
Table 3.
Gene | Position | Size (bp) | Codon | Intergenic nucleotideII | Strand | |||
---|---|---|---|---|---|---|---|---|
From | To | Nucleotide | Amino acid | Initiation | StopI | |||
tRNA-Phe | 1 | 69 | 69 | H | ||||
12S rRNA | 70 | 1019 | 950 | H | ||||
tRNA-Val | 1020 | 1090 | 71 | 2 | H | |||
16S rRNA | 1093 | 2789 | 1697 | 1 | H | |||
tRNA-Leu(UUR) | 2791 | 2864 | 74 | H | ||||
ND1 | 2865 | 3839 | 975 | 324 | ATG | TAG | 4 | H |
tRNA-Ile | 3844 | 3913 | 70 | −1 | H | |||
tRNA-Gln | 3913 | 3984 | 72 | −1 | L | |||
tRNA-Met | 3984 | 4053 | 70 | H | ||||
ND2 | 4054 | 5099 | 1046 | 348 | ATG | TA- | H | |
tRNA-Trp | 5100 | 5170 | 71 | 1 | H | |||
tRNA-Ala | 5172 | 5240 | 69 | 2 | L | |||
tRNA-Asn | 5243 | 5315 | 73 | 37 | L | |||
tRNA-Cys | 5353 | 5419 | 67 | −1 | L | |||
tRNA-Tyr | 5419 | 5488 | 70 | 1 | L | |||
CO1 | 5490 | 7046 | 1557 | 518 | ATG | AGA | −5 | H |
tRNA-Ser(UCN) | 7042 | 7113 | 72 | 3 | L | |||
tRNA-Asp | 7117 | 7185 | 69 | 8 | H | |||
CO2 | 7194 | 7884 | 691 | 230 | ATG | T-- | H | |
tRNA-Lys | 7885 | 7959 | 75 | 1 | H | |||
ATPase8 | 7961 | 8128 | 168 | 55 | ATG | TAA | −10 | H |
ATPase6 | 8119 | 8802 | 684 | 227 | ATG | TAA | −1 | H |
CO3 | 8802 | 9586 | 785 | 261 | ATG | TA- | H | |
tRNA-Gly | 9587 | 9657 | 71 | H | ||||
ND3 | 9658 | 10006 | 349 | 116 | ATG | T-- | H | |
tRNA-Arg | 10007 | 10075 | 69 | H | ||||
ND4L | 10076 | 10372 | 297 | 98 | ATG | TAA | −7 | H |
ND4 | 10366 | 11746 | 1381 | 460 | ATG | T-- | H | |
tRNA-His | 11747 | 11815 | 69 | H | ||||
tRNA-Ser(AGY) | 11816 | 11882 | 67 | 5 | H | |||
tRNA-Leu(CUN) | 11888 | 11960 | 73 | H | ||||
ND5 | 11961 | 13799 | 1839 | 612 | ATG | TAA | −4 | H |
ND6 | 13796 | 14314 | 519 | 172 | ATG | TAA | L | |
tRNA-Glu | 14315 | 14383 | 69 | 4 | L | |||
Cytb | 14388 | 15528 | 1141 | 380 | ATG | T-- | H | |
tRNA-Thr | 15529 | 15600 | 72 | 2 | H | |||
tRNA-Pro | 15603 | 15672 | 70 | L | ||||
Control region | 15673 | 16470 | 798 | H |
I: TA- and T-- represent incomplete stop codons.
II: Numbers correspond to the nucleotides separating adjacent genes. Negative numbers indicate overlapping nucleotides.
Protein-coding genes
As expected, 13 large open-reading frames were detected in the mitochondrial genome of L. polyactis. The T:C:A:G base composition of the mitochondrial 13 protein-coding-gene sequence, 26.9: 32.4: 24.8: 15.9, is summarized in Table S2. Bias of nucleotide frequencies is strand specific (Broughton and Reneau, 2006). Moreover, in contrast to H-stranded genes, in L-stranded ND6 genes, the most prominent anti-C bias is at the third position (7.5%). The lengths of 13 protein-coding genes of L. polyactis mitochondrial DNA were compared with the corresponding sequences of other Percoidei species, whereat it was found that these are conserved. Moreover, there is almost no variation among species. By comparing predicted initiation and termination codons of the 13 protein-coding genes among 23 percoidei species (Table S3), it was apparent that most use ATG as the initiation codon (92.6%), with GTG in second place. In a few species, initiation codons are even absent (shown by “?”). The situation in termination codons is also similar. TAA, TAG and incomplete TA- or T-- are commonly used, but AGA and AGG rarely so. This condition is apparently common among vertebrate mitochondrial genome, and it also appears that TAA stop codons are created via posttranscriptional polyadenylation (Ojala et al., 1981). Furthermore, each protein has its preferable initiation and termination codons, although there are exceptions. For example, in most species, COII, COIII, Cytb, ND4L and ND4 choose ATG/T--, ATG/TA-, ATG/T--, ATG/TAA and ATG/T--, respectively. But in COI there are two types of initiation/termination codon usage. COI proteins in Sciaenidae fish mitochondrial DNA bear ATG/AGA as initiation and termination codons, whereas in other Percoidei fishes, this is not the case (they possess GTG/TAA, GTG/AGG or GTG/T--). Studies in insects have shown positive correlations between the incidence of canonical initiation and termination codons and the relative rate of gene evolution (Szafranski, 2009). Whether this relationship also applies to fishes requires confirmation.
Codons in the 13 protein-coding genes identified in L. polyactis are shown in Table S4. As regards amino acids with the fourfold degenerate third position, codons ending in C are mostly seen, followed by codons ending in A and T for alanie, proline, glycine, valine and threonine. However, for arginine and serine, A is more frequent than C. Among codons with twofold degenerate positions, C appears to be more used than T in the pyrimidine codon family, whereas the purine codon family ends mostly with A. Except for arginine, G is the least common third position nucleotide in all the codon families. All these features are very similar to those observed in vertebrates (Hu et al., 2010; Yang et al., 2010).
Ribosomal RNA gene and transfer RNA gene
As with other mitochondrial genomes, the genome contains a small (12S) and large (16S) subunits of rRNA, which are 950 bp and 1697 bp long, respectively (Table S2). As in the other vertebrates, they are located between tRNA-Phe and tRNA-Leu(UUR), and are separated by tRNA-Val (Figure S1, Table 3). When compared with other genes reported for Sciaenidae, and similar to other vertebrates, both rRNA genes are conserved either in A+T content or gene length and location. As with Gonostoma gracile (Miya and Nishida, 1999), preliminary assessment of the secondary structure of L. polyactis indicated that the present sequence could be reasonably superimposed on the proposed secondary structures of carp 12S and cow 16S rRNA (Gutell et al., 1993). Twenty-two tRNA genes, with lengths varying from 67 bp (tRNA-Cys and tRNA-Ser(AGY) to 75 bp(tRNA-Lys), were interspersed throughout the entire genome. As reported in some other vertebrates (Miya et al., 2003; Kim and Lee, 2004; Oh et al., 2007), with the known exception of the tRNA-Ser (AGY) gene, all tRNA gene transcripts can be folded into typical cloverleaf secondary structures (Figure S2). Besides harboring anticodons identical to those used in other vertebrate mitogenomes, they conserve the aminoacyl, DHU (dihy-drouridine), anticodon and TΨC (thymidine-pseudouridine-cytidine) stems. As shown in the rock bream (Oh et al., 2007) and Pseudolabrus fishes (Oh et al., 2008), the tRNA-Ser (AGY) found in the L. polyactis mitochondrial genome bore no complete DHU arm. Similar to usual tRNAs (Ohtsuki et al., 2002), aberrant tRNAs can also fit into the ribosome by adjusting their structural conformation and function.
Main non-coding regions of Percoidei species
The putative OL was confirmed in L. polyactis. When compared with other Percoidei fishes, they were almost identical and are located in a cluster of five tRNA genes (the WANCY region) between the tRNA-Asn and tRNA-Cys gene. The putative OL, besides serving as the initiation site of Light-strand replication, is capable of folding into a stable stem-loop secondary structure with 13 bp in the stem and 11 bp in the loop. Furthermore, there is a C-rich sequence in the loop, whereby RNA primer synthesis can be initiated. This C-rich sequence has also been found in the OL loop of other fishes, such as Gadus morhua (Johansen et al., 1990) and Oncorynchus mykiss (Zardoya et al., 1995). This feature supports the hypothesis that in vertebrates, primer synthesis is most probably initiated by a polypyrimidine tract (Taanman, 1999), and not by a stretch of thymines, as previously suggested (Wang and Clayton, 1985). The conserved sequence motif, 5′-GCCGG-3′, was found at the base of the stem within tRNA-Cys. This motif seems to be involved in the transition from RNA to DNA synthesis (Hixson and Brown, 1986).
The mitochondrial control region is located between tRNA-Pro and tRNA-Phe in mitochondrial DNA. Besides being the most variable region, it also contains certain conserved motifs that are associated with the initiation of DNA replication and transcription (Zhao et al., 2006). The control region of L. polyactis was identified and compared with those of other Percoidei fishes. They are also located between the two tRNAs (Pro and Phe), and range in size from 533 bp (Parargyrops edita, EF107158) to 1354 bp (Pagrus major, AP002949), all having an overall base composition rich in A and T (A+T= 60%). The variation in length is largely due to the number of conserved domains inserted in these species. Long tandem repeats were recognized in Monodactylus argenteus and Pagellus bogaraveo, with lengths of 56 bp and 183 bp, respectively. Slippage and mispairing during mitogenome replication may explain tandem repeats in the control region (Brought and Dowling, 1997). Although this region is a unique and highly variable area in mitochondrial DNA, conserved domains and motifs are recognizable by multiple homologous sequence alignment and recognition site comparison. Control regions are also divided into a typical tripartite structure with an extent termination association sequence (ETAS), central conserved-block domains (CSB-F, CSB-D, and CSB-E), and conserved sequence block domains (CSB-1, CSB-2, and CSB-3) (Sbisa et al., 1997). The conserved ETAS motif in most fishes is TACAT, with one palindromic sequence, ATGTA. In Coreoperca kawamebari, there is some variation in ETAS having the conserved TGCAT motif. The consensus sequences of ETAS in Percoidei fishes was identified as TACAT-TATGTAT---CACCAT----ATATTAAC CAT, where “−” indicating nucleotide variations such as transitions, transversions, or deletions, similar to reported in sinipercine fishes (Zhao et al., 2006). CSB domains that are associated with the initiation of mitochondrial DNA replication, and other important functions of control regions, were detected. Consensus sequences have been summarized in Table S5. While all these conserved blocks can be easily identified in most of the Percodei species, the incomplete structure of control regions with the absence of conserved domains, was also detected (Table 4). Such obvious insertions and deletions implied the rapid evolution of the control region in Percoidei fishes, a possible source of information for dissecting the structure-function-evolution relationships of control regions.
Table 4.
Species | Accession number | Length | A+T content | Recognized conserved domains |
---|---|---|---|---|
Carangoides armatus | AP004444 | 865 | 64 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Caranx melampygus | AP004445 | 893 | 65 | CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Centropyge loricula | AP006006 | 651 | 60 | ETAS, CSB-E, CSB-D |
Chaetodon auripes | AP006004 | 860 | 65 | ETAS, CSB-F, CSB-E, CSB-D, CSB-2, CSB-3 |
Chaetodontoplus septentrionalis | AP006007 | 1007 | 64 | CSB-E, CSB-D, CSB-2, CSB-3 |
Collichthys lucida | NC_014350 | 771 | 63 | ETAS, CSB-1, CSB-2, CSB-3 |
Collichthys niveatus | HM219223 | 799 | 63 | ETAS, CSB-1, CSB-2, CSB-3 |
Coreoperca kawamebari | AP005990 | 849 | 62 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Diagramma pictum | AP009167 | 840 | 61 | ETAS, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Doederleinia berycoides | AP009181 | 835 | 63 | ETAS, CSB-F, CSB-E, CSB-D, CSB-2, CSB-3 |
Emmelichthys struhsakeri | AP004446 | 832 | 62 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Etheostoma radiosum | AY341348 | 938 | 62 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Heniochus diphreutes | AP006005 | 1000 | 65 | CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Larimichthys crocea | EU339149 | 795 | 61 | ETAS, CSB-1, CSB-2, CSB-3 |
Larimichthys polyactis | GU586227 | 798 | 62 | ETAS, CSB-1, CSB-2, CSB-3 |
Lates calcarifer | DQ010541 | 768 | 64 | CSB-E, CSB-D, CSB-1, CSB-2 |
Lethrinus obsoletus | AP009165 | 967 | 60 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Lutjanus rivulatus | AP006000 | 842 | 62 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Miichthys miiuy | HM447240 | 845 | 63 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Monodactylus argenteus | AP009169 | 873 | 65 | ETAS, ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Pagellus bogaraveo | AB305023 | 1195 | 63 | ETAS, CSB-F, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Pagrus auriga | AB124801 | 948 | 61 | CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Pagrus major | AP002949 | 1354 | 61 | CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Parapristipoma trilineatum | AP009168 | 845 | 63 | ETAS, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Parargyrops edita | EF107158 | 533 | 62 | CSB-D |
Percina macrolepida | DQ536430 | 941 | 64 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Plectropomus leopardus | DQ101270 | 1065 | 66 | CSB-F, CSB-E, CSB-D, CSB-2 |
Pterocaesio tile | AP004447 | 822 | 61 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Spicara maena | AP009164 | 961 | 64 | ETAS, CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Trachurus japonicus | AP003091 | 862 | 63 | CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Trachurus trachurus | AB108498 | 862 | 62 | CSB-F, CSB-E, CSB-D, CSB-1, CSB-2, CSB-3 |
Effect of partitioning on harmonic mean -lnL, topology, posterior probabilities and bayes factors
Harmonic mean -lnL was used when measuring the aptitude of data partitioning in defining the entire data set. Partitioning the COI data set by codon position greatly improved harmonic mean -lnL (Table 5). The same results were reported (Brandley et al., 2005; Brown and Lemmon, 2007), when partitioning by codon position and RNA gene specific stems and loops. The inference is that, as the different data partitions may evolve quite variably, partitioning can be considered as a useful method for accommodating heterogeneity in the processes of molecular evolution. Consensus tree topologies inferred from the three analyses differed, yet all of these difference involved alternative placements of weakly supported nodes (Bayesian posterior probabilities < 95%). These distinct differences were dependent on whether the COI sequences were partitioned. The most dramatic differences could be noted, not only in the deep nodes in the tree, but also in the relationships among the three main groups (Groups1, 2 and 3 in Figures 1–3). No obvious differences were observed in posterior probabilities among analyses depending on different partition-strategies All Bayes-factor estimates were much higher than the criterion for strong evidence against a hypothesis. According to the Bayes factors, analysis employing the P3 partition-strategy provided a decisively better explanation of the data than the remainder (Table 5 and Table 6). Thus, as this is the preferred hypothesis of the phylogeny of Sciaenidae fishes based on the present data, subsequent discussion will be limited to this tree (Figure 3).
Table 5.
Partition strategy | |||
---|---|---|---|
P1 | P2 | P3 | |
Harmonic mean -lnL | 5427.68 | 5124.09 | 5058.92 |
P1 | - | ||
P2 | 607.18 | - | |
P3 | 737.52 | 130.34 | - |
Table 6.
2ln Bayes factor | Evidence for H1 |
---|---|
< 0 | Negative (supports H0) |
0 to 2 | Barely worth mentioning |
2 to 6 | Positive |
6 to 10 | Strong |
> 10 | Very strong |
The phylogeny of the Sciaenidae family
Based on the characters of the gas bladder, sagitta, and mental pores, Zhu et al. (1963) divided the family into seven subfamilies, viz., Johniinae, Megalonibinae, Bahabinae, Sciaeninae, Otolithinae, Argyrosominae and Pseudosciaeniae. In this study, phylogenetic analysis revealed three distinct monophyletic groups (Groups1, 2 and 3), thus very different from the results of Zhu et al. (1963). Monophyly of the genera Otolithes and Argyrosomus is not supported. The proposition (Zhu et al., 1963) of grouping Argyrosomus and Nibea into the subfamily Argyrosominae is also without support, for, based on the phylogenetic tree presented herein, Argyrosomus and Nibea have been placed into two distinct groups (Figure 3). Even though monophyly of Pseudosciaeniae is supported, herein, Bayesian posterior probability is relatively weak. Notably, within the subfamily Pseudosciaeninae, L. polyactis was found to be most closely related to Collichthys niveatus, and then to its congeneric species, L. crocea. Although previous molecular phylogenetic analyses had the same opinion on the phylogenetic positions of Otolithes, Argyrosomus, and Argyrosominae, the phylogentic relationships within the subfamily Pseudosciaeninae are still far from clear (Meng et al., 2004; Xu et al., 2010). Based on different data and methods, once again our results suggested that Collichthys and Larimichthys may be merged into a single genus. These results are consistent with Chen (Chen QM, 2007, Dissertation, Jinan University, China) and Tong et al. (2007), where, respectively, non-monophyletic Larimichthys and Collichthys were recovered. Nevertheless, Cheng et al. (2011) recently recovered monophyletic Collichthys and Larimichthys, thus, in common with the morphological results of Zhu et al. (1963). Sampling errors, scarce data and mismodeling may have contributed to these disputes. Thus, the inclusion of further data from the mitochondrial and nuclear genomes, more accurate evolutionary models, and extensive taxonomic sampling, with careful identification integrated with information on morphological characters, is required for reconstructing the phylogeny of Sciaenidae.
Supplementary Material
The following online material is available for this article:
This material is available as part of the online version of this article from http://www.scielo.br/gmb
Acknowledgments
This study was supported by the Nation Nature Science Foundation of China (31001120), Zhejiang Provincial Natural Science Foundation of China (Y3100013) and Important Science and Technology Specific Projects of Zhejiang Province (2011C14012).
References
- Boore JL. Animal mitochondrial genomes. Nucleic Acids Res. 1999;27:1767–1780. doi: 10.1093/nar/27.8.1767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brandley MC, Schmitz A, Reeder TW. Partitioned Bayesian analyses, partition choice, and the phylogenetic relationships of scincid lizards. Syst Biol. 2005;54:373–390. doi: 10.1080/10635150590946808. [DOI] [PubMed] [Google Scholar]
- Brought RE, Dowling TE. Evolutionary dynamics of tandem repeats in the mitochondrial DNA control region of minnow Cyprinella spiloptera. Mol Biol Evol. 1997;14:1187–1196. doi: 10.1093/oxfordjournals.molbev.a025728. [DOI] [PubMed] [Google Scholar]
- Broughton RE, Reneau PC. Spatial covariation of mutation and nonsynonymous substitution rates in vertebrate mitochondrial genomes. Mol Biol Evol. 2006;23:1516–1524. doi: 10.1093/molbev/msl013. [DOI] [PubMed] [Google Scholar]
- Brown WM. The mitochondrial genome of animals. In: McIntyre RJ, editor. Molecular Evolutionary Genetics. Plenum Press; New York: 1985. pp. 95–130. [Google Scholar]
- Brown JM, Lemmon AR. The importance of data partitioning and the utility of Bayes factors in Bayesian phylogenetics. Syst Biol. 2007;56:643–655. doi: 10.1080/10635150701546249. [DOI] [PubMed] [Google Scholar]
- Cheng YZ, Xu TJ, Shi G, Wang RX. Complete mitochondrial genome of the miiuy croaker Miichthys miiuy (Perciformes, Sciaenidae) with phylogenetic consideration. Mar Genomics. 2010;3:201–209. doi: 10.1016/j.margen.2010.10.003. [DOI] [PubMed] [Google Scholar]
- Cheng J, Ma GQ, Miao ZJ, Shui BN, Gao TX. Complete mitochondrial genome sequence of the spinyhead croaker Collichthys lucidus (Perciformes, Sciaenidae) with phylogenetic considerations. Mol Biol Rep. 2011 doi: 10.1007/s11033-011-1211-6. (Epub) [DOI] [PubMed] [Google Scholar]
- Cui ZX, Liu Y, Li CP, You F, Chu KH. The complete mitochondrial genome of the large yellow croaker, Larimichthys cracea (Perciformes, Sciaenidae): Unusual features of its control region and the phylogenetic position of the Sciaenidae. Gene. 2009;432:33–43. doi: 10.1016/j.gene.2008.11.024. [DOI] [PubMed] [Google Scholar]
- Guindon S, Gascuel O. A simple, fast and accurate method to estimate large phylogenies by maximum-likelihood. Syst Biol. 2003;52:696–704. doi: 10.1080/10635150390235520. [DOI] [PubMed] [Google Scholar]
- Gutell RR, Gray MW, Schnare MN. A compilation of large subunit (23S and 23S-like) ribosomal RNA structures. Nucleic Acids Res. 1993;21:3055–3074. doi: 10.1093/nar/21.13.3055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Habib M, Lakra WS, Mohindra V, Khare P, Barman AS, Singh A, Lal KK, Punia P, Khan AA. Evaluation of cyto-chrome b mtDNA sequences in genetic diversity studies of Channa marulius (Channidae, Perciformes) Mol Biol Rep. 2010;38:841–846. doi: 10.1007/s11033-010-0175-2. [DOI] [PubMed] [Google Scholar]
- Hixson JE, Brown WM. A comparison of the small ribosomal RNA genes from the mitochondrial DNA of the great apes and humans: Sequence, structure, evolution and phylogenetic implications. Mol Biol Evol. 1986;3:1–18. doi: 10.1093/oxfordjournals.molbev.a040379. [DOI] [PubMed] [Google Scholar]
- Hu J, Zhang DX, Hao JS, Huang DY, Cameron S, Zhu CD. The complete mitochondrial genome of the yellow coaster, Acraea issoria (Lepidoptera, Nymphalidae, Heliconiinae, Acraeini): Sequence, gene organization and a unique tRNA translocation event. Mol Biol Rep. 2010;37:3431–3438. doi: 10.1007/s11033-009-9934-3. [DOI] [PubMed] [Google Scholar]
- Huelsenbeck JP, Ronquist F. MRBAYES: Bayesian inference of phylogeny. Bioinformatics. 2001;17:754–755. doi: 10.1093/bioinformatics/17.8.754. [DOI] [PubMed] [Google Scholar]
- Johansen S, Guddal PH, Johansen T. Organization of the mitochondrial genome of atlantic cod, Gadus morhua. Nucleic Acids Res. 1990;18:411–419. doi: 10.1093/nar/18.3.411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kass RE, Raftery AE. Bayes factors. J Am Stat Assoc. 1995;90:773–795. [Google Scholar]
- Kim IC, Lee JS. The complete mitochondrial genome of the rockfish Sebastes schlegeli (Scorpaeniformes, Scorpaenidae) Mol Cell. 2004;17:322–328. [PubMed] [Google Scholar]
- Lin LS, Chen JH, Li HY. The fishery biology of Trichiurus japonicus and Larimichthys polyactis in the East China Sea region. Mar Fish. 2008;30:126–134. [Google Scholar]
- Lin SL, Ying YP, Han ZQ, Xiao YS, Gao TX. AFLP analysis on genetic diversity and population structure of small yellow croaker Larimichthys polyactis. Afr J Biotechnol. 2009;8:2700–2706. [Google Scholar]
- Lowe TM, Eddy SR. tRNAscan-SE: A program for improved detection of transfer RNA genes in genome sequence. Nucleic Acids Res. 1997;25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meng ZN, Zhuang ZP, Ding SX, Jin XS, Su YQ, Tang QS. Molecular phylogeny of eight Sciaenid species (Perciformes, Sciaenidae) in the China Sea based on mitochondrial 16S rRNA sequence. Prog Nat Sci. 2004;14:514–521. (in Chinese with abstract in English) [Google Scholar]
- Miya M, Nishida M. Organization of the mitochondrial genome of a deep-sea fish, Gonostoma gracile (Teleostei, Stomiiformes): First example of transfer RNA gene rearrangements in bony fishes. Mar Biotechnol. 1999;1:416–426. doi: 10.1007/pl00011798. [DOI] [PubMed] [Google Scholar]
- Miya M, Takeshima H, Endo H, Ishiguro NB, Inoue JG, Mukai T, Satoh TP, Yamaguchi M, Kawaguchi A, Mabuchi K, et al. Major patterns of higher teleostean phylogenies: A new perspective based on 100 complete mitochondrial DNA sequences. Mol Phylogenet Evol. 2003;26:121–138. doi: 10.1016/s1055-7903(02)00332-9. [DOI] [PubMed] [Google Scholar]
- Naylor GJ, Collins TM, Brown WM. Hydrophobicity and phylogeny. Nature. 1995;373:555–556. doi: 10.1038/373565b0. [DOI] [PubMed] [Google Scholar]
- Nelson JS. Fishes of the World. 4th edition. John Wiley and Sons; New York: 2006. p. 600. [Google Scholar]
- Nylander JAA, Ronquist F, Huelsenbeck JP, Nieves-Aldrey JL. Bayesian phylogenetic analysis of combined data. Syst Biol. 2004;53:47–67. doi: 10.1080/10635150490264699. [DOI] [PubMed] [Google Scholar]
- Oh DJ, Kim JY, Lee JA, Yoon WJ, Park SY, Jung YH. Complete mitochondrial genome of the rock bream Oplegnathus fasciatus (Perciformes, Oplegnathidae) with phylogenetic considerations. Gene. 2007;392:174–180. doi: 10.1016/j.gene.2006.12.007. [DOI] [PubMed] [Google Scholar]
- Oh DJ, Kim SJ, Jung YH. Comparison of the mitochondrial genomes of East Asian Pseudolabrus fishes. Mitochondrial DNA. 2008;19:453–460. [PubMed] [Google Scholar]
- Ohtsuki T, Kawai G, Watanabe K. The minimal tRNA: Unique structure of Ascaris suum mitochondrial tRNAser-UCU having a short T arm and lacking the entire D arm. FEBS Lett. 2002;514:37–43. doi: 10.1016/s0014-5793(02)02328-1. [DOI] [PubMed] [Google Scholar]
- Ojala D, Montoya J, Attardi G. tRNA punctuation model of RNA processing in human mitochondria. Nature. 1981;290:470–474. doi: 10.1038/290470a0. [DOI] [PubMed] [Google Scholar]
- Posada D. jModelTest: Phylogenetic model averaging. Mol Biol Evol. 2008;25:1253–1256. doi: 10.1093/molbev/msn083. [DOI] [PubMed] [Google Scholar]
- Poulin E, Cardenas L, Hernandez CE, Kornfield I, Ojeda FP. Resolution of the taxonomic status of Chilean and Californian jack mackerels using mitochondrial DNA sequence. J Fish Biol. 2004;65:1160–1164. [Google Scholar]
- Reeder TW. A phylogeny of the Australian Sphenomorphus group (Scincidae, Squamata) and the phylogenetic placement of the crocodile skinks (Tribolonotus): Bayesian approaches to assessing congruence and obtaining condence in maximum likelihood inferred relationships. Mol Phylogenet Evol. 2003;27:384–397. doi: 10.1016/s1055-7903(02)00448-7. [DOI] [PubMed] [Google Scholar]
- Sambrook J, Russell DW. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press; New York: 2001. [Google Scholar]
- Sbisa E, Tanzariello F, Reyes A, Pesole G, Saccone C. Mammalian mitochondrial D-loop region structure analysis: Identification of new conserved sequence and their functional and evolutionary implications. Gene. 1997;205:125–140. doi: 10.1016/s0378-1119(97)00404-6. [DOI] [PubMed] [Google Scholar]
- Seikai National Fisheries Research Institute. Biological and Ecological Characteristics of Valuable Fisheries Resources from the East China Sea and the Yellow Sea, Comparison between the Chinese and Japanese Knowledges. Seikai National Fisheries Research Institute; Nagasaki: 2001. p. 503. [Google Scholar]
- Szafranski P. The mitochondrial trn-cox1 locus: Rapid evolution in Pompilidae and evidence of bias in cox1 initiation and termination codon usage. Mitochondrial DNA. 2009;20:15–25. doi: 10.1080/19401730802644986. [DOI] [PubMed] [Google Scholar]
- Taanman JW. The mitochondrial genome: Structure, transcription, translation and replication. Biochim Biophys Acta. 1999;1410:103–123. doi: 10.1016/s0005-2728(98)00161-3. [DOI] [PubMed] [Google Scholar]
- Tamura K, Dudley J, Nei M, Kumar S. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) ver. 4.0. Mol Biol Evol. 2007;24:1596–1599. doi: 10.1093/molbev/msm092. [DOI] [PubMed] [Google Scholar]
- Tong X, Du B, Yu DH, Gong SY, Guo YH, Huang GJ, Li LH. Sequence analysis of mitochondrial 16S rRNA gene fragment in Chups croaker (Nibea coibor) Mar Fish Res. 2007;28:85–91. (in Chinese with abstract in English) [Google Scholar]
- Wan RJ, Sun S. The category composition and abundance of ichthyoplankton in the ecosystem of the Yellow Sea and the East China Sea. Acta Zool Sin. 2006;52:28–44. (in Chinese with abstract in English) [Google Scholar]
- Wang TW, Clayton DA. In vitro replication of human mitochondrial DNA: Accurate initiation at the origin of light-strand synthesis. Cell. 1985;42:951–958. doi: 10.1016/0092-8674(85)90291-0. [DOI] [PubMed] [Google Scholar]
- Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3253–3255. doi: 10.1093/bioinformatics/bth352. [DOI] [PubMed] [Google Scholar]
- Xiao YS, Zhang Y, Gao TX, Takashi Y, Yabe M, Sakurai Y. Genetic diversity in the mtDNA control region and population structure in the small yellow croaker Larimichthys polyactis. Environ Biol Fish. 2009;85:303–314. [Google Scholar]
- Xu TJ, Cheng YZ, Sun YN, Shi G, Wang RX. The complete mitochondrial genome of bighead croaker, Collichthys niveatus (Perciformes, Sciaenidae): Structure of control region and phylogenetic considerations. Mol Biol Rep. 2010;38:4673–4685. doi: 10.1007/s11033-010-0602-4. [DOI] [PubMed] [Google Scholar]
- Xue Y, Jin XS, Zhang B, Liang ZL. Diet composition and seasonal variation in feeding habits of small yellow croaker Pseudosciaena polyactis Bleeker in the central Yellow Sea. J Fish Sci China. 2004;3:237–243. [Google Scholar]
- Yan LP, Hu F, Ling JZ, Li SF. Study on age and growth of Larimichthys polctis in the East China Sea. Per Ocean Univ China. 2006;36:95–100. (in Chinese with abstract in English) [Google Scholar]
- Yang R, Wu XB, Yan P, Su X, Yang BH. Complete mitochondrial genome of Otis tarda (Gruiformes, Otididae) and phylogeny of Gruiformes inferred from mitochondrial DNA sequences. Mol Biol Rep. 2010;37:3057–3066. doi: 10.1007/s11033-009-9878-7. [DOI] [PubMed] [Google Scholar]
- Yang Z. Maximum likelihood models for combined analyses of multiple sequence data. J Mol Evol. 1996;42:587–596. doi: 10.1007/BF02352289. [DOI] [PubMed] [Google Scholar]
- Zardoya R, Meyer A. The complete DNA sequence of the mitochondrial genome of a ‘living fossil’, the coelacanth (Latimeria chalumnae) Genetics. 1997;146:995–1010. doi: 10.1093/genetics/146.3.995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zardoya R, Garrido-Pertierra A, Bautista JM. The complete nucleotide sequence of the mitochondrial DNA genome of the rainbow trout, Oncorhynchus mykiss. J Mol Evol. 1995;411:942–951. doi: 10.1007/BF00173174. [DOI] [PubMed] [Google Scholar]
- Zhao JL, Wang WW, Li SF, Cai WQ. Structure of the mitochondrial DNA control region of the sinipercine fishes and their phylogenetic relationship. Acta Genet Sin. 2006;33:793–799. doi: 10.1016/S0379-4172(06)60112-1. [DOI] [PubMed] [Google Scholar]
- Zhu YT, Lo YL, Wu HL. A Study on the Classication of the Sciaenoid Fishes of China, with Description of New Genera and Species. 1st edition. Shanghai Science and Technology Press; Shanghai: 1963. pp. 13–14. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.