Abstract
Lycium chinense Mill, an important Chinese herbal medicine, is widely used as a dietary supplement and food. Here the chloroplast (CP) genome of L. chinense was sequenced and analyzed, revealing a size of 155,756 bp and with a 37.8% GC content. The L. chinense CP genome comprises a large single copy region (LSC) of 86,595 bp and a small single copy region (SSC) of 18,209 bp, and two inverted repeat regions (IRa and IRb) of 25,476 bp separated by the single copy regions. The genome encodes 114 genes, 16 of which are duplicated. Most of the 85 protein-coding genes (CDS) had standard ATG start codons, while 3 genes including rps12, psbL and ndhD had abnormal start codons (ACT and ACG). In addition, a strong A/T bias was found in the majority of simple sequence repeats (SSRs) detected in the CP genome. Analysis of the phylogenetic relationships among 16 species revealed that L. chinense is a sister taxon to Lycium barbarum. Overall, the complete sequence and annotation of the L. chinense CP genome provides valuable genetic information to facilitate precise understanding of the taxonomy, species and phylogenetic evolution of the Solanaceae family.
Keywords: Lycium chinense Mill, chloroplast genome, phylogenetic analysis
1. Introduction
Lycium chinense plays an important role in Chinese traditional medicine and has been used as a food and herbal medicine in China for thousands of years [1,2]. The fruits of L. chinense have potential pharmacological effects such as anti-aging, reducing blood glucose and serum lipids, immune regulation, among others [3]. Moreover, the dry root bark of L. chinense is widely used for the treatment of night sweats, diabetes, coughs, vomiting blood, high blood pressure, and ulcers, and is officially listed in the Chinese Pharmacopoeia (2015 version) [4].
L. chinense belongs to the Solanaceae family, which is composed of around 27,000 species belonging to 24 genera. Among them, Lycium is one of the most important genus because the fruit, leaf, root bark, and young shoots of many species of this genus have been used as local foods and/or medicines for many years [5]. The Lycium genus comprises approximately 70 species, which are widely distributed throughout the world, including southern Africa, Europe, Asia, America and Australia [6,7], however, seven are unique to China. These seven Chinese species are all deciduous shrubs with highly similar morphologies and structures, making it difficult to distinguish among them according to their appearance. Consequently, they are often confused in the market [8]. There has been an attempt to define DNA barcode sequences to identify and analyze the phylogenetic relationships within the Lycium genus, however, this has been unsuccessful [8]. Therefore, more effective molecular markers are required to facilitate the investigation of the relationships within the Lycium genus [9].
In recent years, chloroplast (CP) genomes have been widely used to reconstruct phylogenetic relationships among various land plants. As an important plastid, CPs play an indispensable role in a plant cell’s utilization of photosynthesis, carbon fixation. With the rapid development of next-generation sequencing technologies, sequencing of entire genomes is feasible for most laboratories [10]. According to published reports, the structure of CP genomes in angiosperms are highly conserved, with an approximate length of 150 kb comprised of two inverted repeat regions (IRa and, IRb) and large and small single copy regions (LSC and SSC), with the two inverted repeats (IRs) separated by the two single copy regions (SCs) [11,12]. CP genomes can be used to study evolutionary relationships at the taxonomic level, as they are maternally inherited, haploid, and highly conserved in gene content and genome structure [13,14].
Transcriptome analysis of L. chinense leaves has been reported [15], however, information about the structure of its CP genome is still lacking. Here we report the CP genome of L. chinense based on data generated using next-generation sequencing. Further, gene structure characteristics, RNA editing sites, codon usage as well as phylogenetic position of L. chinense were analyzed and compared with those of several related species. To the best of our knowledge, this is the third comprehensive analysis of a CP genome from the Lycium genus, along with the two recently reported CP genomes from Lycium barbarum and Lycium ruthenicum.
2. Results
2.1. CP Genome Organization and Gene Content
The CP genome of L. chinense is 155,756 bp in length consisting of large and small single-copy regions of length 86,595 and 18,209 bp, separated by two IR regions of 25,476 bp (Figure 1 and Table 1). The total G + C content was similar to that of other species in the Solanaceae family, which is about 37.8% [11,12,16,17]. And the G + C contents of the LSC (35.8%) and SSC regions (32.3%) were lower than those of the IR regions (43.1%). In the protein-coding regions (CDS), the G + C contents of the first, second and third codon positions were 43.9%, 37.9%, and 33.1%, respectively (Table 1), demonstrating that the A + T content is much higher at the third codon position than those of the other two. This phenomenon, which is commonly detected in other plant CP genomes, can be used to separate nuclear and mitochondrial DNA from CP DNA [18,19,20,21].
Figure 1.
Gene map of the L. chinense chloroplast (CP) genome. Genes inside the circle are transcribed clockwise, and counterclockwise transcribed otherwise. Filled colors represent different functional groups according to the legend on the left bottom. The gray arrow represents gene direction. The darker gray in the inner circle corresponds to GC content, whereas the lighter gray corresponds to AT content.
Table 1.
Base composition in the chloroplast genome of L. chinense.
| Region | Positions | T/U (%) | C (%) | A (%) | G (%) | Total (bp) |
|---|---|---|---|---|---|---|
| LSC | 32.7 | 18.3 | 31.4 | 17.5 | 86,595 | |
| IRB | 28.4 | 20.7 | 28.5 | 22.4 | 25,476 | |
| SSC | 33.9 | 16.8 | 33.7 | 15.5 | 18,209 | |
| IRA | 28.5 | 22.4 | 28.4 | 20.7 | 25,476 | |
| Total | 31.5 | 19.2 | 30.7 | 18.6 | 155,756 | |
| CDS | 31.3 | 17.9 | 30.5 | 20.3 | 79,700 | |
| 1st position | 25.0 | 18.6 | 30.9 | 25.3 | 26,567 | |
| 2nd position | 34.0 | 19.6 | 28.5 | 18.3 | 26,567 | |
| 3rd position | 35.0 | 15.5 | 32.7 | 17.6 | 26,566 |
The L. chinense CP genome is predicted to encode a total of 130 functional genes. Among them, 16 are duplicated in the IR region, including 5 CDS, 7 tRNAs and 4 rRNAs. Annotation revealed 80 distinct protein-coding genes, 30 separate tRNA genes and 4 distinct rRNA genes (Table 2). In addition, one gene (ycf1) without a stop codon was annotated as a pseudogene. Intron-containing genes were also analyzed. In total, there were 17 genes containing one or two introns found in the L. chinense CP genome (Table 3), including 12 CDS and 5 tRNA genes, two of which (ycf3 and clpP) contain two intron. While there was a single intron in the other 15 genes, as has also been reported in other plants [22,23].
Table 2.
List of genes annotated in the chloroplast genomes of L. chinense.
| Classification of Genes | Gene Names | Number |
|---|---|---|
| Photosystem I | psaA, psaB, psaC, psaI, psaJ | 5 |
| Photosystem II | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ | 15 |
| Cytochrome b/f complex | petA, petB *, petD *, petG, petL, petN | 6 |
| ATP synthase | atpA, atpB, atpE, atpF, atpH, atpI | 6 |
| NADH dehydrogenase | ndhA *, ndhB * (×2), ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK | 12 (1) |
| RubisCO large subunit | rbcL | 1 |
| RNA polymerase | rpoA, rpoB, rpoC1, rpoC2 | 4 |
| Ribosomal proteins (SSC) | rps2, rps3, rps4, rps7 (×2), rps8, rps11, rps12 ** (×2), rps14, rps15, rps16 *, rps18, rps19 | 14 (2) |
| Ribosomal proteins (LSC) | rpl2 (×2), rpl14, rpl16, rpl20, rpl22, rpl23 (×2), rpl32, rpl33, rpl36 | 11 |
| Ribosomal RNAs | rrn 4.5 (×2), rrn 5 (×2), rrn 16 (×2), rrn 23 (×2) | 8 (4) |
| Protein of unkown function | ycf1 (×2), ycf2 (×2), ycf3 **, ycf4 | 6 (2) |
| Transfer RNAs | 37 tRNAs (8 contain an intron, 7 in the inverted repeats region) | 37 (7) |
| Other genes | accD, ccsA, cemA, clpP, matK | 5 |
| Total | 130 |
* indicates gene containing one intron; while ** indicates gene with two introns.
Table 3.
Gene in the chloroplast genome of L. chinense with introns and exons.
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|
| atpF | LSC | 410 | 704 | 145 | ||
| clpP | LSC | 228 | 640 | 292 | 808 | 71 |
| ndhA | SSC | 539 | 1154 | 553 | ||
| ndhB | IR | 777 | 679 | 756 | ||
| petB | LSC | 6 | 750 | 642 | ||
| petD | LSC | 8 | 742 | 475 | ||
| rpl16 | LSC | 396 | 1016 | 9 | ||
| rpl2 | IR | 434 | 666 | 391 | ||
| rpoC1 | LSC | 1616 | 737 | 430 | ||
| rps12 | IR | 26 | 536 | 232 | ||
| rps16 | LSC | 227 | 822 | 40 | ||
| trnA-UGC | IR | 38 | 681 | 35 | ||
| trnI-GAU | IR | 34 | 717 | 37 | ||
| trnK-UUU | LSC | 36 | 2513 | 37 | ||
| trnL-UAA | LSC | 35 | 497 | 50 | ||
| trnV-UAC | LSC | 35 | 565 | 38 | ||
| ycf3 | LSC | 154 | 756 | 229 | 744 | 124 |
2.2. Repeat Structure and Simple Sequence Repeat (SSR) Analysis
The SSRs detected in L. chinense are represented in Table 4. A total of 231 SSR loci were detected, including 107 mononucleotide, 46 dinucleotide, 67 trinucleotide and 11 tetranucleotide repeat units. In addition, of the mononucleotide SSRs, 99.1% constituted A/T sequences, while only one belongs to G/C motif. Interestingly, 63.0% of the dinucleotide SSRs were also A/T motifs.
Table 4.
Types and numbers of simple sequence repeats (SSRs) in the L. chinense chloroplast genome.
| SSR Type | Repeat Unit | Amount | Ratio (%) |
|---|---|---|---|
| Mono | A/T | 106 | 99.1 |
| C/G | 1 | 0.9 | |
| Di | AC/GT | 1 | 2.2 |
| AG/CT | 16 | 34.8 | |
| AT/TA | 29 | 63.0 | |
| Tri | AAC/GTT | 9 | 13.4 |
| AAG/CTT | 20 | 30.0 | |
| AAT/ATT | 21 | 31.3 | |
| ACC/GGT | 1 | 1.4 | |
| ACG/CGT | 1 | 1.4 | |
| ACT/AGT | 2 | 3.0 | |
| AGC/CTG | 5 | 7.5 | |
| AGG/CCT | 4 | 6.0 | |
| ATC/ATG | 4 | 6.0 | |
| Tetra | AAAC/GTTT | 3 | 27.3 |
| AAAG/CTTT | 1 | 9.1 | |
| AAAT/ATTT | 5 | 45.4 | |
| AATC/ATTG | 1 | 9.1 | |
| AGAT/ATCT | 1 | 9.1 |
Long repeats are defined as sequence repeats ≥30 bp, which might function to increase population genetic diversity and promote chloroplast genome rearrangement [24]. It must be noted that the repeat types found are all forward and palindromic among the four species which all belong to the Solanaceae family. There are 49 (25 forward, 24 palindromic), 46 (23 forward, 23 palindromic), 39 (20 forward, 19 palindromic), 40 (20 forward, 20 palindromic) large repeats in the CP genomes of L. chinense, L. barbarum, L. ruthenicum, A. belladonna, respectively (Figure 2).
Figure 2.
Repeat sequences in four chloroplast genomes. F, forward, P, palindrome, R, reverse, and C, complement, respectively. Colors represent repeats of different lengths, as indicated.
2.3. Comparative Chloroplast Genomic Analysis
To facilitate subsequent phylogenetic analyses and plant identification smoothly, we used the mVISTA program to analyze the entire CP genome sequence of L. chinense in comparison with those of A. belladonna (NC_004561.1), L. barbarum (MH032560.1), L. ruthenicum (NC_039651.1) (Figure 3). These four species all belong to the Solanaceae family. The data presented in Figure 3 clearly demonstrate that the IR regions are more conserved than the SC regions. Copy correction caused by gene conversion between the two IR region sequences could be the main cause of this phenomenon [25]. Moreover, coding regions were more strongly conserved than the non-coding regions, which is a very common finding in the CP genomes of many other angiosperms [26,27,28].
Figure 3.
Sequence identity plot comparing five chloroplast genomes using mVISTA with L. chinense as the reference. Grey arrows and thick black lines above the alignment indicate genes with their orientation and the position of the inverted repeats (IRs), respectively. A cut-off of 70% identity was used for the plots, and the Y-scale represents the percentage identity ranging from 50% to 100%.
2.4. IR Contraction and Expansion in the L. chinense. CP Genome.
As shown in Figure 4, the IR-SSC and IR-LSC boundaries of L. chinense were compared with those of three other species, A. belladonna (NC_004561.1), L. barbarum (MH032560.1) and L. ruthenicum (NC_039651). The length of the IR regions in the four CP genomes ranged from 25,395 to 25,906 bp, demonstrating a modest expansion. Due to the expansion, the rps19 and ycf1 genes were partially included in the IR regions of the Solanoideae family. Consequently, there is a truncated rps19 pseudogene and a ycf1 pseudogene copy found at the junction of LSC/IRB and SSC/IRA, respectively. The ndhF and trnH genes, which are entirely located in the SSC and LSC regions, respectively, were at varying distances from the LSC/SSC borders within the Solanoideae family; the longest distance (54 bp) was observed in the L. chinense. In contrast, 33 bp of the rps19 gene and 979 bp of the ycf1 gene are extended into the IR regions in the L. ruthenicum CP genome, which is the shortest among the four species. Moreover, in L. chinense, L. barbarum, and A. belladonna, 47, 49 and 60 bp of the rps19 gene and 995, 998 and 1438 bp of the ycf1 gene are extended into the IR regions, respectively. Taken together, these data indicate that the expansion and contraction of the IR/SC regions exhibit similar patterns within the family, with slight variations.
Figure 4.
Comparison of the borders of large single copy regions (LSC), small single copy (LSC) and inverted repeat (IR) regions among four chloroplast genomes (not to scale). Number above the gene features means the distance between the ends of genes and the borders sites. The IRb/SSC border extended into the ycf1 genes to create various lengths of ycf1 pseudogenes among four chloroplast genomes, while the IRa/LSC border extended into the rps19 genes to create various lengths of rps19 pseudogenes.
2.5. Codon Usage and RNA Editing Sites
The results of analysis of codon usage in the L. chinense CP genome are summarized in Figure 5 and Table S1. Twenty amino acids that can be transported for protein biosynthesis by tRNA molecules were found in the CP genome. All the CDS in the L. chinense CP genome combined included 26,569 codons. Among them, codons encoding leucine was the most common, accounting for 13.16% of total usage. While at the same time, codons encoding cysteine were the least frequent, accounting for 1.82% of total usage. Furthermore, as the number of codons encoding a particular amino acid increased, the relative synonymous codon usage (RSCU) value also increased (Figure 5). Interestingly, most amino acid codons, other than two non-redundant codons (for methionine and tryptophan), exhibit preferential use, as observed in other species [29,30,31,32,33].
Figure 5.
Codon content for 20 amino acids and stop codons in all protein-coding genes in the of L. chinense CP genome.
The Predictive RNA Editor for Plants (PREP) suite was used to detect potential RNA editing sites in the L. chinense CP genome, and the results were summarized in Table S2. A total of 50 RNA editing sites in 35 genes were identified. All the nucleotide changes found were cytidine (C) to thymine (T) edits, which are typical of the transcripts of land plant CP genomes [34]. Furthermore, the results also revealed that the conversion from the amino acid S to L is the most frequent, accounting for approximately 40%; while H to Y, and I to F switches were the least common, accounting for only 2%.
2.6. Phylogenetic Analysis
The results of phylogenetic analyses are presented in Figure 6 and Figure 7 and Table S3; most nodes were strongly supported with bootstrap values of 100%. Furthermore, all 16 species included in this analysis were split into two clades, with 14 Solanaceae plants comprising a unique clade, and the rest two outgroup species from the Labiatae grouped into the other clade. Within the Solanaceae, each genus basically clustered alone, demonstrating good monophyletic separation of this family. L. chinense (Lycium) was a sister species to L. barbarum, which is strongly supported by both the maximum likelihood and BEAST trees. Further, the Lycium genus is sister to the Atropa genus.
Figure 6.
Phylogenetic relationships among 16 species inferred using maximum likelihood analyses of the complete chloroplast genome, excluding IRa regions. Numbers at nodes are values for bootstrap support. The position of L. chinense is indicated in red font. Sequences from Salvia japonica Thunb and Pogostemon cablin (Blanco) Ben were set as out group.
Figure 7.
Phylogenetic relationships among 16 species inferred from Bayesian evolutionary analysis by sample tree (BEAST) analyses of complete chloroplast genome sequences, excluding IRa regions. The position of L. chinense is indicated in blue font. Salvia japonica Thunb and Pogostemon cablin (Blanco) Bent were used as outgroups.
3. Discussion
This is the first report of the complete CP genome of L. chinense. Meanwhile, it has been the third species of the Lycium genus to have its CP genome fully sequenced and analyzed, with those of the other two (L. barbarum and L. ruthenicum) published recently [35,36]. Interestingly, the size of the L. chinense CP genome (155,756 bp) is the largest among the three Lycium species, with 180 bp longer than that of L. barbarum and 763 bp longer than that of L. ruthenicum. This is consistent with the hypothesis that IR contraction and expansion is the main reason explaining size differences between CP genomes [35,36,37,38].
Analysis of the gene content indicated that there are 82 protein-coding genes with a standard ATG start codon among the total of 85 protein-coding genes in the L. chinense CP genome. While the remaining three genes contain abnormal ACG (psbL and ndhD) and ACT (rps12) start codons. ACG and ACT are meaningless as start codons, whereas in general they are synonymous codons which can still encode threonine. This phenomenon has also been reported in the model plant Nicotiana tabacum, where the start codon for psbL and ndhD are ACG as well [39]. Furthermore, an ACT start codon for rps12 is also present in the mitochondrial genome of tube-dwelling diatom Berkeleya fennica [40]. Therefore, we hypothesize that the start codons of some genes such as rps12, psbL and ndhD might be mutated during the evolution. RNA editing is a common phenomenon in plant CP genomes and causes modifying mutations, changes reading frames, and regulates the expression of CP genes [41]. The ndhD and psbL can be normally transcribed due to RNA editing, and this phenomenon has also been described in Linum usitatissimum L [42], tobacco [43], spinach [44], and Ampelopsis brevipedunculata [45]. Amino acid transitions caused by RNA editing in the L. chinense CP genome exhibited the highest frequency of conversion from serine to leucine (S to L), representing a change from a hydrophilic into a hydrophobic amino acid. This is consistent with the general characteristics of high-level plant CP RNA editing [46,47]. In protein structures, hydrophobic residues are usually enclosed in the center If a hydrophilic mutation occurs, the stability of the protein structure will tend to decrease. While in severe cases, correct folding could be hindered. Therefore, we hypothesize that the detected changes induced by RNA editing might promote the formation of hydrophobic cores, making the proteins more stable [48,49].
SSRs, also called microsatellites, which are widely distributed across genomes, refer to a group of tandem sequences [50]. SSRs play a key role in genomes and have been widely used in genetic and genomic studies because of their extreme variability within species [51,52,53,54]. The analysis results of SSRs in our study are consistent with the hypothesis that CP SSRs have a strong A/T bias, which is a common finding in many plant species [55,56,57]. The abundance of AT nucleotides in CP genomes might be related to the presence of SSRs, which is related to the stability of AT and GC nucleotides, to some extent [11]. The SSRs in the L. chinense CP genome could be used as molecular markers resource in future genetic identification studies.
With their great potential for application in studies of phylogenetics, evolution and molecular systematics, CP genomes have been widely used to solve phylogenetic issues in various land plants [16,57,58]. To identify the evolutionary position of L. chinense within the Solanaceae family, we conducted multiple sequence alignments using 13 other Solanaceae CP genome sequences (contain only one IR region). Two species from Labiatae family were chosen as outgroups. The results of phylogenetic analysis using two types of software both showed that L. chinense is the most closely related to the species of L. barbarum, which is identical in appearance to L. chinense. However, their therapeutic efficacies and active ingredients differ from each other [59,60]. Nevertheless, much more research has been carried out on L. barbarum, which is also officially listed in the Chinese Pharmacopoeia (2015 version), while only a few papers have been published reporting L. chinense. Overall, this study has generated abundant information valuable for the investigation of the genetic diversity of L. chinense and the Solanaceae family more broadly; and it will facilitate the use of CP genome sequences for species identification.
4. Materials and Methods
4.1. Plant Material and DNA Extraction
Fresh leaves of the L. chinense were obtained from the Medicinal Plant Garden of Guangzhou University of Chinese Medicine. Total genomic DNA (gDNA) was extracted from those leaves using a DNeasy Plant Mini Kit (Qiagen, German). A DNA sample of good integrity and with both optical density (OD) 260/280 and OD 260/230 ratio greater than 1.8 was submitted to subsequent experiments.
4.2. Chloroplast Genome Sequencing and Assembly
The sequencing library was constructed using this gDNA, after being ultrasonically sheered into 250 bp fragments, and then be submitted to Next-generation Sequencing on an Illumina HiSeq 2000 platform. NGS platform generated 6.82 G of raw sequencing data, then the raw data were filtered by removing low quality reads with a sliding window quality cutoff of Q20 using Trimmomatic [61]. Using the complete sequence of A. belladonna chloroplast genome as a reference, CP-like reads were extracted from those clean reads and then be assembled using the Abyss2.0 program [62], resulting in a complete chloroplast genome sequence of L. chinense. PCR amplification was performed to verify the four junction regions between the IR regions and the LSC/SSC region.
4.3. Gene Annotation and Genome Structure
Gene annotation of the L. chinense CP genome was conducted using the online program GeSeq-Annotation of Organellar Genomes (https://chlorobox.mpimp-golm.mpg.de/geseq.html) [63] with default parameters, which annotates not only protein-coding genes, but also tRNAs and rRNAs. Genious Pro. (version 4.8.4) was used to correct the annotation result by adjusting the open reading frame of coding genes. Further examination and revision of the annotation information were manually performed with the assistance of the CLC Sequence Viewer (version 8). After that the complete chloroplast genome was submitted using the program Sequin and an accession number of MK040922 was assigned from the GenBank. The Organellar Genome DRAW (OGDRAW) program was used to draw the physical map with default parameters and subsequent manual editing. [64].
GC content of CDS regions as well as the distribution of codon usage was analyzed using the Molecular Evolutionary Genetics Analysis (MEGA 6.06) [65,66]. The online program Predictive RNA Editor for Plants (PREP) suite was used, with a cutoff value set at 0.8, to predict potential RNA editing sites in 35 genes of the CP genome of L. chinense using [67]. Furthermore, by using MISA [68], SSRs were detected with default parameters. The forward and inverted repeats were determined by setting the parameter of Hamming Distance to 3 and the Minimal Repeat Size parameter to 30 with the REPuter program (https://bibiserv2.cebitec.uni-bielefeld.de/reputer) [69].
4.4. Genome Comparison and Phylogenetic Analysis
In order to compare the CP genome of L. chinense with others of A. belladonna (NC_004561.1), L. barbarum (MH032560.1) and L. ruthenicum (NC_039651), the mVISTA program (http://genome.lbl.gov/vista/index.shtml) [70] in the Shuffle-LAGAN mode [71] was carried out using the annotation of L. chinense as the reference.
In order to identify the phylogenetic position of L. chinense within the tubiflorae lineages, two kinds of phylogenetic analysis software, MEGA 6.06 and Bayesian evolutionary analysis by sample tree (BEAST). First, 16 complete CP genome sequences were downloaded from the GenBank of National Center for Biotechnology Information (NCBI) database. Those cp genomes underwent a sequence alignment by the Multiple Alignment using Fast Fourier Transform (MAFFT) program [72] (https://www.ebi.ac.uk/Tools/msa/mafft/). Then by using Mega 6.06 [65], maximum likelihood (ML) analysis of those 16 CP genomes was performed [65] to find the best models, which is GTR + G + I, and then construct ML tree, taking the CP genome sequences of Salvia japonica, Pogostemon cablin as the outgroup. Support was estimated through 1000 bootstrap replicates to assess the reliability of the phylogenetic tree. BEAST 2.0 analysis was conducted under the guidance of the online tutorial (http://www.beast2.org/tutorials/). The first step is to upload the sequence alignment document to the Bayesian Evolutionary Analysis Utility (BEAUti), which is a user-friendly program for setting the evolutionary model and options for the Markov chain Monte Carlo (MCMC) analysis. The second step is to actually run BEAST using the input file that contains the data, model and settings. The phylogenetic tree was constructed using the figtree (http://tree.bio.ed.ac.uk/software/figtree/).
5. Conclusions
The CP genome of L. chinense is 155,756 bp in length and has a relatively conserved genome structure as well as gene content. Forty-nine repeated sequences and 231 SSRs, which are informative sources for the development of new molecular markers, were determined and analyzed. The genome structure and composition are similar among the four species belonging to the Solanaceae family. Comparing the CP genome organization of different species helps us to more deeply understand the process of chloroplast evolution. The results of the phylogenetic analysis, which is conducted among 16 species, demonstrate that L. chinense has a close relationship with L. barbarum. In a word, this paper will promote the further investigation of L. chinense.
Supplementary Materials
The following are available online at https://www.mdpi.com/2223-7747/8/4/87/s1, Table S1: Codon usage in the L. chinense chloroplast genome, Table S2: RNA editing predicted in the chloroplast genomes of L. chinense by the PREP, Table S3: Chloroplast genomes used to study their phylogeny relationship in this study.
Author Contributions
Conceptualization, Z.Y.; Data curation, Z.Y., Y.H., X.Z. and S.H.; Formal analysis, Z.Y., W.A., S.H. and L.L.; Funding acquisition, Y.H.; Investigation, Y.H., X.Z. and L.L.; Methodology, Z.Y., Y.H., W.A., S.H. and L.L.; Project administration, Z.Y., Y.H., W.A., X.Z. and S.H.; Resources, Z.Y.; Software, Z.Y., S.H. and L.L.; Supervision, Z.Y. and L.L.; Validation, W.A.; Writing – review & editing, L.L.
Funding
Education Department Science Foundation of Guangxi Zhuang Autonomous Region, No. 2019KY0564.
Conflicts of Interest
The authors declare that they have no conflict of interest.
References
- 1.Potterat O. Goji (Lycium barbarum and L. chinense): Phytochemistry, pharmacology and safety in the perspective of traditional uses and recent popularity. Planta Med. 2010;76:7–19. doi: 10.1055/s-0029-1186218. [DOI] [PubMed] [Google Scholar]
- 2.Kim M.H., Kim E.J., Choi Y.Y., Hong J., Yang W.M. Lycium chinense Improves Post-Menopausal Obesity via Regulation of PPAR-gamma and Estrogen Receptor-alpha/beta Expressions. Am. J. Chin. Med. 2017;45:269–282. doi: 10.1142/S0192415X17500173. [DOI] [PubMed] [Google Scholar]
- 3.Qin X., Yamauchi R., Aizawa K., Inakuma T., Kato K. Structural features of arabinogalactan-proteins from the fruit of Lycium chinense Mill. Carbohydr. Res. 2001;333:79–85. doi: 10.1016/S0008-6215(01)00118-5. [DOI] [PubMed] [Google Scholar]
- 4.Chinese Pharmacopoeia Commission . Chinese Pharmacopoeia. China Medical Science Press; Beijing, China: 2015. [Google Scholar]
- 5.Yao R., Heinrich M., Weckerle C.S. The genus Lycium as food and medicine: A botanical, ethnobotanical and historical review. J. Ethnopharmacol. 2018;212:50–66. doi: 10.1016/j.jep.2017.10.010. [DOI] [PubMed] [Google Scholar]
- 6.Zhang J.X., Guan S.H., Feng R.H., Wang Y., Wu Z.Y., Zhang Y.B., Chen X.H., Bi K.S., Guo D.A. Neolignanamides, lignanamides, and other phenolic compounds from the root bark of Lycium chinense. J. Nat. Prod. 2013;76:51–58. doi: 10.1021/np300655y. [DOI] [PubMed] [Google Scholar]
- 7.Turchetto C., Fagundes N.J., Segatto A.L., Kuhlemeier C., Solis Neffa V.G., Speranza P.R., Bonatto S.L., Freitas L.B. Diversification in the South American Pampas: The genetic and morphological variation of the widespread Petunia axillaris complex (Solanaceae) Mol. Ecol. 2014;23:374–389. doi: 10.1111/mec.12632. [DOI] [PubMed] [Google Scholar]
- 8.Ni L.L., Zhao Z.L., Lu J.N. DNA barcoding construction of medicinal plants in genus Lycium L. based on multiple genomic segments. Chin. Tradit. Herb. Drugs. 2016;47:5. [Google Scholar]
- 9.Hebert P.D., Cywinska A., Ball S.L., de Waard J.R. Biological identifications through DNA barcodes. Proc. Biol. Sci. 2003;270:313–321. doi: 10.1098/rspb.2002.2218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Nielsen A.Z., Ziersen B., Jensen K., Lassen L.M., Olsen C.E., Moller B.L., Jensen P.E. Redirecting photosynthetic reducing power toward bioactive natural product synthesis. ACS Synth. Biol. 2013;2:308–315. doi: 10.1021/sb300128r. [DOI] [PubMed] [Google Scholar]
- 11.Yang Y., Dang Y., Li Q., Lu J., Li X., Wang Y. Complete chloroplast genome sequence of poisonous and medicinal plant Datura stramonium: Organizations and implications for genetic engineering. PLoS ONE. 2014;9:e110656. doi: 10.1371/journal.pone.0110656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sanchez-Puerta M.V., Abbona C.C. The chloroplast genome of Hyoscyamus niger and a phylogenetic study of the tribe Hyoscyameae (Solanaceae) PLoS ONE. 2014;9:e98353. doi: 10.1371/journal.pone.0098353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Shaw J., Lickey E.B., Schilling E.E., Small R.L. Comparison of whole chloroplast genome sequences to choose noncoding regions for phylogenetic studies in angiosperms: The tortoise and the hare III. Am. J. Bot. 2007;94:275–288. doi: 10.3732/ajb.94.3.275. [DOI] [PubMed] [Google Scholar]
- 14.Shaw J., Lickey E.B., Beck J.T., Farmer S.B., Liu W., Miller J., Siripun K.C., Winder C.T., Schilling E.E., Small R.L. The tortoise and the hare II: Relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis. Am. J. Bot. 2005;92:142–166. doi: 10.3732/ajb.92.1.142. [DOI] [PubMed] [Google Scholar]
- 15.Wang G., Du X., Ji J., Guan C., Li Z., Josine T.L. De novo characterization of the Lycium chinense Mill. leaf transcriptome and analysis of candidate genes involved in carotenoid biosynthesis. Gene. 2015;555:458–463. doi: 10.1016/j.gene.2014.10.058. [DOI] [PubMed] [Google Scholar]
- 16.Amiryousefi A.H.J., Poczai P. The chloroplast genome sequence of bittersweet (Solanum dulcamara): Plastid genome structure evolution in Solanaceae. PLoS ONE. 2018;13:23. doi: 10.1371/journal.pone.0196069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cho K.S., Cheon K.S., Hong S.Y., Cho J.H., Im J.S., Mekapogu M., Yu Y.S., Park T.H. Complete chloroplast genome sequences of Solanum commersonii and its application to chloroplast genotype in somatic hybrids with Solanum tuberosum. Plant Cell Rep. 2016;35:2113–2123. doi: 10.1007/s00299-016-2022-y. [DOI] [PubMed] [Google Scholar]
- 18.He L., Qian J., Li X., Sun Z., Xu X., Chen S. Complete Chloroplast Genome of Medicinal Plant Lonicera japonica: Genome Rearrangement, Intron Gain and Loss, and Implications for Phylogenetic Studies. Molecules. 2017;22 doi: 10.3390/molecules22020249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Xiang B., Li X., Qian J., Wang L., Ma L., Tian X., Wang Y. The Complete Chloroplast Genome Sequence of the Medicinal Plant Swertia mussotii Using the PacBio RS II Platform. Molecules. 2016;21 doi: 10.3390/molecules21081029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Clegg M.T., Gaut B.S., Learn G.H., Jr., Morton B.R. Rates and patterns of chloroplast DNA evolution. Proc. Natl. Acad. Sci. USA. 1994;91:6795–6801. doi: 10.1073/pnas.91.15.6795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Morton B.R. Selection on the codon bias of chloroplast and cyanelle genes in different plant and algal lineages. J. Mol. Evol. 1998;46:449–459. doi: 10.1007/PL00006325. [DOI] [PubMed] [Google Scholar]
- 22.Shen X., Guo S., Yin Y., Zhang J., Yin X., Liang C., Wang Z., Huang B., Liu Y., Xiao S., et al. Complete Chloroplast Genome Sequence and Phylogenetic Analysis of Aster tataricus. Molecules. 2018;23 doi: 10.3390/molecules23102426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Guo H., Liu J., Luo L., Wei X., Zhang J., Qi Y., Zhang B., Liu H., Xiao P. Complete chloroplast genome sequences of Schisandra chinensis: Genome structure, comparative analysis, and phylogenetic relationship of basal angiosperms. Sci. China Life Sci. 2017;60:1286–1290. doi: 10.1007/s11427-017-9098-5. [DOI] [PubMed] [Google Scholar]
- 24.Qian J., Song J., Gao H., Zhu Y., Xu J., Pang X., Yao H., Sun C., Li X., Li C., et al. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza. PLoS ONE. 2013;8:e57607. doi: 10.1371/journal.pone.0057607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Khakhlova O., Bock R. Elimination of deleterious mutations in plastid genomes by gene conversion. Plant J. Cell Mol. Biol. 2006;46:85–94. doi: 10.1111/j.1365-313X.2006.02673.x. [DOI] [PubMed] [Google Scholar]
- 26.Cheng H., Li J., Zhang H., Cai B., Gao Z., Qiao Y., Mi L. The complete chloroplast genome sequence of strawberry (Fragaria x ananassa Duch.) and comparison with related species of Rosaceae. PeerJ. 2017;5:e3919. doi: 10.7717/peerj.3919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Chen C., Zheng Y., Liu S., Zhong Y., Wu Y., Li J., Xu L.A., Xu M. The complete chloroplast genome of Cinnamomum camphora and its comparison with related Lauraceae species. PeerJ. 2017;5:e3820. doi: 10.7717/peerj.3820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kong W.Q., Yang J.H. The complete chloroplast genome sequence of Morus cathayana and Morus multicaulis, and comparative analysis within genus Morus L. PeerJ. 2017;5:e3037. doi: 10.7717/peerj.3037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Zhang Y., Du L., Liu A., Chen J., Wu L., Hu W., Zhang W., Kim K., Lee S.C., Yang T.J., et al. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses. Front. Plant Sci. 2016;7:306. doi: 10.3389/fpls.2016.00306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Reginato M., Neubig K.M., Majure L.C., Michelangeli F.A. The first complete plastid genomes of Melastomataceae are highly structurally conserved. PeerJ. 2016;4:e2715. doi: 10.7717/peerj.2715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chang C.C., Lin H.C., Lin I.P., Chow T.Y., Chen H.H., Chen W.H., Cheng C.H., Lin C.Y., Liu S.M., Chang C.C., et al. The chloroplast genome of Phalaenopsis aphrodite (Orchidaceae): Comparative analysis of evolutionary rate with that of grasses and its phylogenetic implications. Mol. Biol. Evol. 2006;23:279–291. doi: 10.1093/molbev/msj029. [DOI] [PubMed] [Google Scholar]
- 32.Pan I.C., Liao D.C., Wu F.H., Daniell H., Singh N.D., Chang C., Shih M.C., Chan M.T., Lin C.S. Complete chloroplast genome sequence of an orchid model plant candidate: Erycina pusilla apply in tropical Oncidium breeding. PLoS ONE. 2012;7:e34738. doi: 10.1371/journal.pone.0034738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wu F.H., Chan M.T., Liao D.C., Hsu C.T., Lee Y.W., Daniell H., Duvall M.R., Lin C.S. Complete chloroplast genome of Oncidium Gower Ramsey and evaluation of molecular markers for identification and breeding in Oncidiinae. BMC Plant Biol. 2010;10:68. doi: 10.1186/1471-2229-10-68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Tsudzuki T., Wakasugi T., Sugiura M. Comparative analysis of RNA editing sites in higher plant chloroplasts. J. Mol. Evol. 2001;53:327–332. doi: 10.1007/s002390010222. [DOI] [PubMed] [Google Scholar]
- 35.Yisilam G., Mamut R., Li J., Li P., Fu C. Characterization of the complete chloroplast genome of Lycium ruthenicum (Solanaceae) Mitochondrial DNA Part B. 2018;3:361–362. doi: 10.1080/23802359.2018.1450681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Jia G., Xin G., Ren X., Du X., Liu H., Hao N., Wang C., Ni X., Liu W. Characterization of the complete chloroplast genome of Lycium barbarum (Solanales: Solanaceae), a unique economic plant to China. Mitochondrial DNA Part B. 2018;3:1062–1063. doi: 10.1080/23802359.2018.1509930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Li R., Ma P.F., Wen J., Yi T.S. Complete sequencing of five araliaceae chloroplast genomes and the phylogenetic implications. PLoS ONE. 2013;8:e78568. doi: 10.1371/journal.pone.0078568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Wang R.J., Cheng C.L., Chang C.C., Wu C.L., Su T.M., Chaw S.M. Dynamics and evolution of the inverted repeat-large single copy junctions in the chloroplast genomes of monocots. BMC Evol. Biol. 2008;8:36. doi: 10.1186/1471-2148-8-36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kahlau S., Aspinall S., Gray J.C., Bock R. Sequence of the tomato chloroplast DNA and evolutionary comparison of solanaceous plastid genomes. J. Mol. Evol. 2006;63:194–207. doi: 10.1007/s00239-005-0254-5. [DOI] [PubMed] [Google Scholar]
- 40.An S.M., Noh J.H., Choi D.H., Lee J.H., Yang E.C. Repeat region absent in mitochondrial genome of tube-dwelling diatom Berkeleya fennica (Naviculales, Bacillariophyceae) Mitochondrial DNA Part Adna Mapp. Seq. Anal. 2016;27:2137–2138. doi: 10.3109/19401736.2014.982594. [DOI] [PubMed] [Google Scholar]
- 41.Freyer R., Hoch B., Neckermann K., Maier R.M., Kossel H. RNA editing in maize chloroplasts is a processing step independent of splicing and cleavage to monocistronic mRNAs. Plant J. Cell Mol. Biol. 1993;4:621–629. doi: 10.1046/j.1365-313X.1993.04040621.x. [DOI] [PubMed] [Google Scholar]
- 42.De Santana Lopes A., Pacheco T.G., Santos K.G.D., Vieira L.D.N., Guerra M.P., Nodari R.O., de Souza E.M., de Oliveira Pedrosa F., Rogalski M. The Linum usitatissimum L. plastome reveals atypical structural evolution, new editing sites, and the phylogenetic position of Linaceae within Malpighiales. Plant Cell Rep. 2018;37:307–328. doi: 10.1007/s00299-017-2231-z. [DOI] [PubMed] [Google Scholar]
- 43.Hirose T., Sugiura M. Both RNA editing and RNA cleavage are required for translation of tobacco chloroplast ndhD mRNA: A possible regulatory mechanism for the expression of a chloroplast operon consisting of functionally unrelated genes. EMBO J. 1997;16:6804–6811. doi: 10.1093/emboj/16.22.6804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Maier R.M., Zeltz P., Kossel H., Bonnard G., Gualberto J.M., Grienenberger J.M. RNA editing in plant mitochondria and chloroplasts. Plant Mol. Biol. 1996;32:343–365. doi: 10.1007/BF00039390. [DOI] [PubMed] [Google Scholar]
- 45.Raman G., Park S. The Complete Chloroplast Genome Sequence of Ampelopsis: Gene Organization, Comparative Analysis, and Phylogenetic Relationships to Other Angiosperms. Front. Plant Sci. 2016;7:341. doi: 10.3389/fpls.2016.00341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Jain B.P., Chauhan P., Tanti G.K., Singarapu N., Ghaskadbi S., Goswami S.K. Tissue specific expression of SG2NA is regulated by differential splicing, RNA editing and differential polyadenylation. Gene. 2015;556:119–126. doi: 10.1016/j.gene.2014.11.045. [DOI] [PubMed] [Google Scholar]
- 47.Corneille S., Lutz K., Maliga P. Conservation of RNA editing between rice and maize plastids: Are most editing events dispensable? Mol. Gen. Genet. MGG. 2000;264:419–424. doi: 10.1007/s004380000295. [DOI] [PubMed] [Google Scholar]
- 48.Robinson A.C., Castaneda C.A., Schlessman J.L., Garcia-Moreno E.B. Structural and thermodynamic consequences of burial of an artificial ion pair in the hydrophobic interior of a protein. Proc. Natl. Acad. Sci. USA. 2014;111:11685–11690. doi: 10.1073/pnas.1402900111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Loladze V.V., Ermolenko D.N., Makhatadze G.I. Thermodynamic consequences of burial of polar and non-polar amino acid residues in the protein interior. J. Mol. Biol. 2002;320:343–357. doi: 10.1016/S0022-2836(02)00465-5. [DOI] [PubMed] [Google Scholar]
- 50.Chen C., Zhou P., Choi Y.A., Huang S., Gmitter F.G., Jr. Mining and characterizing microsatellites from citrus ESTs. Theor. Appl. Genet. 2006;112:1248–1257. doi: 10.1007/s00122-006-0226-1. [DOI] [PubMed] [Google Scholar]
- 51.Huang H., Shi C., Liu Y., Mao S.Y., Gao L.Z. Thirteen Camellia chloroplast genome sequences determined by high-throughput sequencing: Genome structure and phylogenetic relationships. BMC Evol. Biol. 2014;14:151. doi: 10.1186/1471-2148-14-151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zhao Y., Yin J., Guo H., Zhang Y., Xiao W., Sun C., Wu J., Qu X., Yu J., Wang X., et al. The complete chloroplast genome provides insight into the evolution and polymorphism of Panax ginseng. Front. Plant Sci. 2014;5:696. doi: 10.3389/fpls.2014.00696. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Wheeler G.L., Dorman H.E., Buchanan A., Challagundla L., Wallace L.E. A review of the prevalence, utility, and caveats of using chloroplast simple sequence repeats for studies of plant biology. Appl. Plant Sci. 2014;2 doi: 10.3732/apps.1400059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Vieira Ldo N., Faoro H., Rogalski M., Fraga H.P., Cardoso R.L., de Souza E.M., de Oliveira Pedrosa F., Nodari R.O., Guerra M.P. The complete chloroplast genome sequence of Podocarpus lambertii: Genome structure, evolutionary aspects, gene content and SSR detection. PLoS ONE. 2014;9:e90618. doi: 10.1371/journal.pone.0090618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Firetti F., Zuntini A.R., Gaiarsa J.W., Oliveira R.S., Lohmann L.G., Van Sluys M.A. Complete chloroplast genome sequences contribute to plant species delimitation: A case study of the Anemopaegma species complex. Am. J. Bot. 2017;104:1493–1509. doi: 10.3732/ajb.1700302. [DOI] [PubMed] [Google Scholar]
- 56.Wang W., Chen S., Zhang X. Whole-Genome Comparison Reveals Divergent IR Borders and Mutation Hotspots in Chloroplast Genomes of Herbaceous Bamboos (Bambusoideae: Olyreae) Molecules. 2018;23 doi: 10.3390/molecules23071537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Zhou T., Wang J., Jia Y., Li W., Xu F., Wang X. Comparative Chloroplast Genome Analyses of Species in Gentiana section Cruciata (Gentianaceae) and the Development of Authentication Markers. Int. J. Mol. Sci. 2018;19 doi: 10.3390/ijms19071962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.De Las Rivas J., Lozano J.J., Ortiz A.R. Comparative analysis of chloroplast genomes: Functional annotation, genome-based phylogeny, and deduced evolutionary patterns. Genome Res. 2002;12:567–583. doi: 10.1101/gr.209402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Mocan A., Vlase L., Raita O., Hanganu D., Paltinean R., Dezsi S., Gheldiu A.M., Oprean R., Crisan G. Comparative studies on antioxidant activity and polyphenolic content of Lycium barbarum L. and Lycium chinense Mill. leaves. Pak. J. Pharm. Sci. 2015;28:1511–1515. [PubMed] [Google Scholar]
- 60.Skenderidis P., Lampakis D., Giavasis I., Leontopoulos S., Petrotos K., Hadjichristodoulou C., Tsakalof A. Chemical Properties, Fatty-Acid Composition, and Antioxidant Activity of Goji Berry (Lycium barbarum L. and Lycium chinense Mill.) Fruits. Antioxidants. 2019;8 doi: 10.3390/antiox8030060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jackman S.D., Vandervalk B.P., Mohamadi H., Chu J., Yeo S., Hammond S.A., Jahesh G., Khan H., Coombe L., Warren R.L., et al. ABySS 2.0: Resource-efficient assembly of large genomes using a Bloom filter. Genome Res. 2017;27:768–777. doi: 10.1101/gr.214346.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Tillich M., Lehwark P., Pellizzer T., Ulbricht-Jones E.S., Fischer A., Bock R., Greiner S. GeSeq—Versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017;45:W6–W11. doi: 10.1093/nar/gkx391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Lohse M., Drechsel O., Kahlau S., Bock R. OrganellarGenomeDRAW—A suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res. 2013;41:W575–W581. doi: 10.1093/nar/gkt289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Tamura K., Stecher G., Peterson D., Filipski A., Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol. Biol. Evol. 2013;30:2725–2729. doi: 10.1093/molbev/mst197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Kumar S., Nei M., Dudley J., Tamura K. MEGA: A biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief. Bioinform. 2008;9:299–306. doi: 10.1093/bib/bbn017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Mower J.P. The PREP suite: Predictive RNA editors for plant mitochondrial genes, chloroplast genes and user-defined alignments. Nucleic Acids Res. 2009;37:W253–W259. doi: 10.1093/nar/gkp337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Yang X.M., Sun J.T., Xue X.F., Zhu W.C., Hong X.Y. Development and characterization of 18 novel EST-SSRs from the western flower Thrips, Frankliniella occidentalis (Pergande) Int. J. Mol. Sci. 2012;13:2863–2876. doi: 10.3390/ijms13032863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Kurtz S., Choudhuri J.V., Ohlebusch E., Schleiermacher C., Stoye J., Giegerich R. REPuter: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001;29:4633–4642. doi: 10.1093/nar/29.22.4633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Mayor C., Brudno M., Schwartz J.R., Poliakov A., Rubin E.M., Frazer K.A., Pachter L.S., Dubchak I. VISTA: Visualizing global DNA sequence alignments of arbitrary length. Bioinformatics. 2000;16:1046–1047. doi: 10.1093/bioinformatics/16.11.1046. [DOI] [PubMed] [Google Scholar]
- 71.Frazer K.A., Pachter L., Poliakov A., Rubin E.M., Dubchak I. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004;32:W273–W279. doi: 10.1093/nar/gkh458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Katoh K., Rozewicki J., Yamada K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2017 doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.







