Skip to main content
Genes logoLink to Genes
. 2022 Sep 19;13(9):1673. doi: 10.3390/genes13091673

Comparative Analysis of Chloroplast Genomes within Saxifraga (Saxifragaceae) Takes Insights into Their Genomic Evolution and Adaption to the High-Elevation Environment

Zhuyifu Chen 1,2, Xiaolei Yu 1,2, Yujiao Yang 1,2, Pei Wei 1,2, Wencai Zhang 3, Xinzhong Li 3, Chenlai Liu 1,2, Shuqi Zhao 1,2, Xiaoyan Li 4, Xing Liu 1,2,3,*
Editor: Zhiqiang Wu
PMCID: PMC9498722  PMID: 36140840

Abstract

Saxifraga species are widely distributed in alpine and arctic regions in the Northern hemisphere. Highly morphological diversity within this genus brings great difficulties for species identification, and their typical highland living properties make it interesting how they adapt to the extreme environment. Here, we newly generated the chloroplast (cp) genomes of two Saxifraga species and compared them with another five Saxifraga cp genomes to understand the characteristics of cp genomes and their potential roles in highland adaptation. The genome size, structure, gene content, GC content, and codon usage pattern were found to be highly similar. Cp genomes ranged from 146,549 bp to 151,066 bp in length, most of which comprised 130 predicted genes. Yet, due to the expansion of IR regions, the second copy of rps19 in Saxifraga stolonifera was uniquely kept. Through sequence divergence analysis, we identified seven hypervariable regions and detected some signatures of regularity associated with genetic distance. We also identified 52 to 89 SSRs and some long repeats among seven Saxifraga species. Both ML and BI phylogenetic analyses confirmed that seven Saxifraga species formed a monophyletic clade in the Saxifragaceae family, and their intragenus relationship was also well supported. Additionally, the ndhI and ycf1 genes were considered under positive selection in species inhabiting relatively high altitudes. Given the conditions of intense light and low CO2 concentration in the highland, the products of these two genes might participate in the adaptation to the extreme environment.

Keywords: chloroplast genome, Saxifraga, comparative analysis, hypervariable regions, high-elevation adaptation

1. Introduction

A chloroplast is a unique, semiautonomous plant organelle responsible for providing nutrition and oxygen by converting light energy into chemical energy through oxygenic photosynthesis [1,2]. The cp genome is independent of the nuclear genome and contains plenty of genetic information. It is known for genetic stability characterized by the conserved circular structure, highly similar gene content, gene order, maternal inheritance, and lack of recombination [3]. For instance, the size of cp genomes for land plants ranges from 100 to 200 kb. The typical cp genomes share the quadripartite structure consisting of one large single copy region (LSC), one small single copy region (SSC), and two isometric inverted regions (IR), within which there are 120–130 genes for which products perform basic functions of various life activities, such as photosynthesis, transcription, translation, and so forth [2,4]. Predominantly maternal inheritance ensures the haplotype and identity of the cp genomes, while a large number of copies for the chloroplast in the cytoplasm also makes the cp genome more detectable. On the other hand, cp genomes were widely found to be variable in various genera or families, even in some closely related species [5,6]. It has been pointed out that some of these changes could reflect the genetic distance of species and be related to environmental adaptations [7]. Based on these traits, and with the improvements in next-generation sequencing technologies, cp genomes are more available and considered informative for intractable phylogenetic issues with high resolution [8]. Moreover, it is also usually applied for barcoding development and adaptive evolution [9].

Saxifraga, comprising more than 400 species, is the most species-abundant and classification-complex genus in the Saxifragaceae family [10,11,12]. As cold-adapted perennial herbs usually growing in alpine meadows and scrub, many Saxifraga species have been widely cultivated and studied for both ornamental and medical value. According to reports, extracts of several Saxifraga species possess antiparasitic, antioxidant, and antitumor properties [13,14,15]. Saxifraga is distributed mainly in the Northern hemisphere. It originated from North America and was further dispersed to Europe and the Qinghai–Tibet plateau (QTP) region. In China, since the rapid uplift and the forming of the extreme environment in the QTP, rapid radiation has occurred multiple times, and over 200 Saxifraga species, represented by Sect. Ciliatae and Sect. Porphyrion, have turned into a significant component of the QTP [16]. Some organisms have radiated out of the QTP. The difference in habitats also gives rise to multiple variations in morphology (e.g., cushion form, and lime-secreting hydathodes), karyotype, and reproduction, making it difficult to classify only by macroscopic features [17,18,19]. Therefore, it is necessary to seek more reliable and powerful molecular tools for better species identification and understanding of the phylogenetic relationship in the genus. Many efforts have been made for the phylogeny of Saxifraga [20,21]. Through large-scale sampling, Tkach et al. provided a relatively complete phylogeny of the Saxifraga genus [22]. Nevertheless, as multiple studies focused on the medical value and phylogeny of Saxifraga species, research discussing their adaptation to the unique alpine environment they inhabit is rare.

During diffusion to various alpine and arctic regions, the Saxifraga genus undergoes rapid diversification, which makes it the perfect material to investigate how the alpine area influences speciation and differentiation and to learn how these species adapt to the alpine environment [17]. Habitats with high altitudes usually have specific environmental characteristics, such as intense radiation, low temperature, and low CO2 concentration, which could leave some traces among environmentally adapted genes [23,24]. With topology reconstructed with protein-coding genes (PCGs), phylogenetic methods could be utilized to estimate the evolutionary rates of genes and detect the footprints of natural selection [25]. As the genome within an organelle that is essential to the life activities of green plants, the conservation and the irreplaceability of the cp genome provide a good opportunity to explore such footprints and their adaptative contribution. Many genome-wide adaptations to the high altitude have been reported [26,27,28]. However, there was only a minority of research working on the accelerating evolution among cp genomes induced by altitude adaptation [29,30]. For high-altitude species, genes in the ndh (ndhA, ndhH, and ndhF) and ycf (ycf4) families frequently showed a higher evolutionary rate when compared to low-altitude species. Considering the alpine environment that most Saxifraga species inhabit, it is feasible to study the selection their cp genomes go through during environmental adaptation.

In the present study, we would like to seek some clues from the cp genome that help to better understand genetic features and adaptive evolution in the Saxifraga genus. For the first time, we conducted a comprehensive cp genome characteristics and adaptation analysis of Saxifraga. First of all, cp genomes of two Saxifraga species, Saxifraga sessiliflora and Saxifraga saginoides, were newly sequenced. Together with four available cp genomes (Saxifraga sinomontana, Saxifraga umbellulata var. pectinata, Saxifraga umbellulata var. umbellulata, and S. stolonifera) in GenBank and one newly assembled cp genome (Saxifraga granulata) with DNA data from NCBI, several comparative analyses were performed to investigate the genetic features of the cp genome within Saxifraga, including the genome size, structure, gene content, GC content, IR boundary, nucleotide diversity, codon usage, and SSR distribution. Then, another forty-three cp genomes in the Saxifragales order were combined to reconstruct a partial topology of the Saxifragaceae family and investigate the phylogenetic relationships of seven Saxifraga species within the Saxifragaceae family. Moreover, some genes and sites accelerating evolution that were probably induced by the harsh environment were investigated at the genus level. Above, in this study, we aimed to (1) take a glimpse of the characteristics and evolution of the cp genome within the Saxifraga genus and (2) figure out whether natural selection has exerted an influence on the cp genome of Saxifraga to make it adapt to the extreme high-elevation environment.

2. Materials and Methods

2.1. Plant Material Sampling and DNA Sequencing

Seven Saxifraga species were included in this study, two of which were newly collected from the QTP, China, at an altitude of 4898 m in 2021 (Table S8). Both samples were identified by Professor Xing Liu from Wuhan University in China, and the voucher specimens were deposited in the herbarium of Wuhan University. The young, fresh, and healthy leaves were collected and instantly frozen in liquid nitrogen and later restored at −80 °C. Total genomic DNA was extracted by the modified CTAB method. After the integrity and concentration test and library construction, genomic DNA was fragmented and ligated with adapters. The genomic DNA was sequenced on Illumina Novaseq 6000 platform through further amplification and purification. Raw data were deposited in the NCBI Sequence Read Archive (SRA) under accession numbers SRR20740825 and SRR20740826, respectively.

2.2. De Novo Assembly and Annotation

Raw data were filtered with fastp software by removing adapter sequences and low-quality reads [31]. GetOrganelle was used for de novo assembly of the cp genome for S. saginoides and S. sessiliflora [32]. Subsequently, the assembled contig was corrected by pilon and confirmed with short-read mapping to contig through bowtie2 [33,34]. The result was visualized on geneious 8.0.4. Through alignment, the whole contig was fully covered by clean data. Then, the well-assembled sequence was annotated by Geseq [35]. tRNA genes were identified using tRNAscanSE with the default setting [36]. After aligning with a set of reference cp genomes, initial annotations were manually checked and adjusted to validate each gene’s initial codon, terminal codon, and intron position. The newly obtained cp genomes were deposited in GenBank under accession numbers ON458148 and ON458149. These two cp genomes are also accessible in the figshare database [37].

2.3. Sequence Divergence Analysis and Visualization

To visualize the position, transcriptional direction of genes, and the structure feature of each cp genome, OrganellarGenomeDRAW was used to generate the physical maps of S. saginoides and S. sessiliflora [38]. Variation region of cp genome among Saxifraga genus was identified and visualized utilizing mVISTA program with a LAGAN mode [39,40]. The online tool IRscope was used for further exhibiting gene distribution at the boundaries of SSC, LSC, IRa, and IRb [41]. Seven cp genomes were aligned together. The nucleotide polymorphism (Pi) among cp genomes was calculated and visualized using DnaSP software, with a window length of 600 bp and a step size of 200 bp [42].

2.4. Calculation of Codon Usage

Mega X was applied to calculate the relative synonymous codon usage (RSCU) of all the PCGs [43]. RSCU represented the preference for codon usage. RSCU of one codon greater than 1 means it was preferred when coding the same amino acid. GC3s and ENC of each PCG among cp genomes were calculated using CodonW v1.4.4 (JF Peden, Nottingham, UK), and then both values were compared, with the standard curve calculated and visualized using R script. ENC represents the effective number of codons, and it is one of the most informative parameters to estimate the degree of imbalanced usage for synonymous codons [44]. Greatly preferred synonymous codons hold lower ENC values. GC3 means the GC content of synonymous codons at the third position. The standard curve in the ENC-GC3 plot represents the fit of the formula ENC -GC3 content. If the calculated ENC value of a gene was approximate to the standard curve, the observed codon bias was due primarily to nucleotide composition difference at the third codon position, which was mainly influenced by mutation [45]. On the contrary, it was much influenced by natural selection and other factors. Some genes, such as psbL and psbM, were not calculated because their length was overly short [46].

2.5. Identification of Repeat Sequences in Organelle Genomes

Simple sequence repeats (SSRs) of cp genomes were identified using MISA, and the parameters were as follows: 10, 5, 4, 3, 3, and 3 for mono-, di-, tri-, tetra-, penta-, and hexanucleotides, respectively [47]. The size and location of long repeats were determined by REPuter, with a minimum repeat size of 30 bp and a hamming distance of 3 [48].

2.6. Phylogenetic Analysis

Fifty species (forty-seven species among Saxifragaceae and three outgroups) were selected to reconstruct a phylogenetic tree. After manually checking and adjusting, 79 shared PCGs were extracted, aligned, and concatenated to a matrix using PhyloSuite [49]. Then, the phylogenetic tree was reconstructed by setting Myriophyllum spicatum, Ribes nevadense, and Ribes roezlii as outgroups. Maximum likelihood (ML) analyses were made using IQ-TREE with the GTR+R3+F model automatically selected by ModelFinder for 5000 ultrafast bootstraps [50]. Then, Bayesian inference (BI) phylogenies were carried out using MrBayes 3.2.6 with the GTR+I+G+F model (2 parallel runs, 2,000,000 generations), in which the original 25% of sampled data were thrown away as burn-in [51]. Two constructed trees were visualized in the Interactive Tree Of Life [52].

2.7. Selective Analysis

We calculated the Ka/Ks (Ka for nonsynonymous substitution ratio and Ks for synonymous substitution ratio) ratio among seven Saxifraga species. Seventy-nine shared PCGs were extracted separately by PhyloSuite and simultaneously translated to the amino acid sequence. ParaAT was used to prepare intermediate files automatically and calculate Ka/Ks value by calling KaKs_calculator 2.0 [53,54]. Some dispersed values greater than 45 or some rows (atpH, petN, psaJ, psbF, psbI, psbJ, psbK, psbL, and psbM) and columns (S. saginoides-S. sinomontana) with too much Na were discarded [55]. This could be caused by the exceedingly low synonymous substitution ratio. Ka/Ks ratio >1 indicated that the gene pair was considered under accelerated selection, while Ka/Ks ratio <1 was regarded as under purifying selection.

Branch-site model in the codeml program of PAML was applied to estimate the selective pressure caused by environmental adaptation among Saxifraga [25]. Selective pressure was quantified by the ratio(ω) of the nonsynonymous substitution rate(dN) to the synonymous substitution rate (dS). Through the likelihood ratio test (LRT), the alternative model (“model = 2, NSsites = 2, omega = 0.5|1.5, fix_omega = 0”) was compared with the null model (“model = 2, NSsites = 2, omega = 1, fix_omega = 1”). The P-value of LRT was acquired by the Chi-squared test. Moreover, the BEB method was implemented to test and select amino acid sites that were potentially under positive selection. A gene with a p-value < 0.05 and ω > 1 was supposed to be under positive selection, and an amino acid site with posterior probabilities > 0.95 was considered under significantly positive selection.

3. Results and Discussion

3.1. Characteristics of the CP Genome for Saxifraga Species

After filtering, two newly sequenced species generated more than three gigabases (Gb) of clean data. Both data were assembled to high-quality contigs without any gap when mapped with clean short reads. Contigs were cyclized, well annotated, and manually checked with some other cp genomes within the Saxifragaceae family (Figure 1; Table 1). By and large, cp genomes among seven Saxifraga species exhibited similar signatures (Table 2). All the seven cp genomes exhibited typical quadripartite structures, which consisted of two isometric IR regions (IRb/IRa, ranging from 25,412 to 25,651 bp), one LSC region (79,310–82,738 bp), and one SSC region (16,390–17,504 bp). S. umbellulata var. pectinata had the smallest cp genome size (146,549 bp), while S. stolonifera had the largest (151,066 bp). The overall GC (guanine–cytosine) content was nearly identical (37.8–38.1%), and the GC content of two IR regions (42.8–43%) was higher than that of the LSC (35.8–36.2%) and SSC (31.9–32.4%) regions. It was apparent that cp genomes showed AT preference, and such a preference was most prominent in the SSC region. Additionally, all the cp genomes also had a highly similar gene content. Most of them comprised 79 unique PCGs, 30 unique tRNA genes, and 4 unique rRNA genes, among which 6 PCGs, 7 tRNA genes, and 4 rRNA genes located at IR regions were duplicated (Table 1 and Table 2). S. stolonifera contained two copies of the rps19 gene, while the second copy of rps19 in the other six cp genomes was under pseudogenization. For all cp genomes, 17 PCGs and tRNA genes were detected to contain introns, and 3 PCGs, rps12, clpP, and ycf3, comprised two introns (Table 1). Compared to cp genomes from other genera of Saxifragaceae, there were no significant differences in genome size, GC content, gene content, and gene order, which was congruent with other higher plants [56,57].

Figure 1.

Figure 1

cp genome map of two newly sequenced Saxifraga species, S. saginoides and S. sessiliflora. The outer circle shows the transcription direction; genes outside are transcribed clockwise while genes inside are transcribed counterclockwise. LSC/SSC/IR regions are exhibited in the inner circle. Genes of different functions are presented in different colors.

Table 1.

Gene annotation of the S. saginoides and S. sessiliflora chloroplast genome.

Category Group Genes
Photosynthesis related genes Rubisco rbcL
Photosystem I psaA, psaB, psaC, psaI, psaJ
Photosystem II psbA, psbB, psbT, psbK, psbI, psbH, psbM, psbN, psbD, psbC, psbZ, psbJ, psbL, psbE, psbF
ATP synthase atpA, atpB, atpE, atpF a, atpH, atpI
Cytochrome b/f complex petA, petB a, petD, petN, petL, petG
Cytochrome C synthesis ccsA
NADPH dehydrogenase NdhA a, ndhB a,c (×2), ndhC, ndhD, ndhE, ndhF, ndhH, ndhG, ndhJ, ndhK, ndhI
Transcription and translation related genes Transcription rpoA, rpoB, rpoC2, rpoC1 a
Ribosomal proteins rps2, rps3, rps4, rps7 c (×2), rps8, rps11, rps12 b,c (×2), rps14, rps15, rps16 a, rps18, rps19, rpl2 a,c (×2), rpl14, rpl16 a, rpl20, rpl22, rpl23 c (×2), rpl32, rpl33, rpl36
Translation initiation factor infA
RNA genes Ribosomal RNA rrn16S c (×2), rrn23S c (×2), rrn4.5 c (×2), rrn5 c (×2)
Transfer RNA trnH-GUG, trnK-UUU a, trnQ-UUG, trnS-GCU, trnS-UGA, trnS-GGA, trnG-GCC a, trnR-UCU, trnR-ACG c (×2), trnC-GCA, trnD-GUC, trnY-GUA, trnE-UUC, trnT-UGU, trnG-UCC, trnfM-CAU, trnL-CAA c (×2), trnL-UAA a, trnL-UAG, trnF-GAA, trnV-GAC c (×2), trnV-UAC a, trnM-CAU, trnT-GGU, trnW-CCA, trnP-UGG, trnI-CAU c (×2), trnI-GAU a,c (×2), trnA-UGC a,c (×2), trnN-GUU c (×2)
Other genes RNA processing matK
Carbon metabolism cemA
Fatty acid synthesis accD
Proteolysis ClpP b
Conserved ORFs ycf1, ycf2 c (×2), ycf3 b, ycf4

a genes with one intron, b genes with two introns, c two gene copies in IRs.

Table 2.

Summary statistics of chloroplast genomes for Saxifraga species.

Genome Feature S. saginoides S. sessiliflora S. sinomontana S. umbellulata var. pectinata S. umbellulata var. umbellulata S. granulata S. stolonifera
Genome size (bp) 147,716 149,569 147,240 146,549 147,442 149,775 151,066
LSC size (bp) 79,912 81,306 79,310 79,317 79,985 81,998 82,738
SSC size (bp) 16,830 16,961 16,874 16,390 16,537 16,895 17,504
IR size (bp) 25,487 25,651 25,528 25,421 25,460 25,441 25,412
Number of genes 130 130 130 130 130 130 131
Protein genes 85 85 85 85 85 85 86
tRNA genes 37 37 37 37 37 37 37
rRNA genes 8 8 8 8 8 8 8
Duplicated genes in IRs 17 17 17 17 17 17 18
GC content (%) 38% 37.9% 38% 38.1% 38.1% 37.8% 37.8%
GC content in LSC (%) 36.2% 36% 36.2% 36.2% 36.2% 35.8% 35.9%
GC content in SSC (%) 32.1% 32% 32% 32.4% 32.4% 31.9% 32.2%
GC content in IRs (%) 42.9% 42.8% 42.9% 42.8% 42.8% 42.9% 43%

3.2. IR Boundary Analysis

The IR/SC boundary shift was widely reported as a general evolutionary phenomenon, which reflected the expansion and contraction of the cp genome and made genes near borders pseudogenization [58,59,60]. We investigated the border of IR, LSC, and SSC regions among Saxifraga species, and some variations are displayed in Figure 2 Through comparison, the boundaries showed no significant differences. In most Saxifraga species, the ndhF gene was mainly on the SSC region, and its right end had a little on the IRb region; in the S. stolonifera, ndhF was sited on the SSC region, without any part on the IRb. The ycf1 gene took both sides of the SSC and Ira region, and its length ranged from 5300 to 5540. Due to the special location, another copy of ycf1, located at the boundary of the IRb and SSC region, became a truncated pseudogene. The rps19 gene was largely located at the junction of LSC and IRb and, similar to ycf1, most of the second copy at the boundary of the IRa and LSC region was also under pseudogenization. In contrast, the rps19 genes of S. stolonifera were totally embedded in the IR regions so that pseudogenization of the second copy was avoided. The result also reflected from the side the expansion event experienced on the IR boundary of the S. stolonifera.

Figure 2.

Figure 2

A plot displaying the boundaries of the LSC, SSC, and IR regions and genes around the junction sites for each Saxifraga cp genome. JLB, JSB, JSA, and JLA denote the junction sites of LSC and IRb, IRb and SSC, SSC and IRa, and IRa and LSC, respectively.

3.3. Genomic Sequence Divergence

Similar to SSR markers, barcoding is a useful molecular tool for identifying organisms [61,62]. For a long time, rbcL, matK, and trnH-psbA in the cp genome, and ITS in nuclear sequence, constituted universal markers due to their relatively high specificity and amplification efficiency [63]. Nevertheless, they were not specific enough to distinguish closely related species in many cases. Therefore, it is necessary to search specific markers for precise identification.

First, mVISTA was used to visualize overall changes in the cp genomes among Saxifraga species. The complete cp genome of S. saginoides was used as a reference to compare with the other six cp genomes (Figure 3). As expected, most PCGs showed high consistency. Several obvious variations were shown in noncoding regions, such as intergenic regions such as trnK-rps16, rps16-trnQ, and the intron of the ndhA. Remarkably, S. stolonifera and S. granulata exhibited more and larger variance than other Saxifraga species, which was consistent with the phylogenetic relationship that these two species belonged to different sections from the other five species inferred by the previous study [22].

Figure 3.

Figure 3

The alignment and comparative analysis of the whole cp genome for seven Saxifraga species. Among them, S. saginoides was set as reference. The horizontal axis represents the coordinates of cp genomes in the alignment result. Exons, introns, and conserved noncoding sequences (CNSs) were marked as different colors.

The Pi value was then calculated among these Saxifraga cp genomes to confirm the visual result acquired from mVISTA and further detect the hypervariable regions. From Figure 4, both results suggested that IR regions were much more conserved than the LSC and SSC regions. When conducting analyses among all seven species, the Pi value ranged from 0 to 0.1 among the whole cp genome (Figure 4A; Table S1). Six intergenic regions (trnK-rps16, trnS-trnG, trnT-trnL, ycf4-cemA, petA-psbJ, and ndhF-rpl32-trnL) and part of gene ycf1 presented higher variability (0.07–0.1) than other regions, which was in accordance with what was exhibited in the mVISTA analysis. trnT-trnL was the most volatile region with a Pi value of 0.1. Additionally, Pi of five Sect. Ciliatae cp genomes (S. saginoides, S. sessiliflora, S. sinomontana, S. umbellulata var. pectinata, and S. umbellulata var. umbellulata) were calculated, and the value (0–0.06) was much lower than the Pi among seven Saxifraga species (Figure 4B; Table S1) [22]. Combining the twice-time Pi calculation with the mVISTA result shown above, we found that the whole cp genome exhibited characteristics of regularity associated with the phylogenetic relationship of seven Saxifraga species. Notably, once the non-Sect. Ciliatae species, S. stolonifera and S. granulata, were included, the regions with larger variation have changed. This indicated that different sections of a genus might provide some specific variations and searching hypervariable loci at the section level might offer the possibility for more precise species identification.

Figure 4.

Figure 4

Comparative analysis of the nucleotide polymorphism (Pi) values among the cp genomes of Saxifraga. (A) Pi values calculated as all the seven Saxifraga species were taken into account. (B) Pi values calculated as five species in Sect. Ciliatae was selected separately.

3.4. Condon Usage Analysis

Codon usage bias is vital for the reflection of the cp genome evolution. Generally, mutation, natural selection, phylogenetic relationship, and other factors may lead to diverse codon usage preferences [64,65]. In this study, codon usage bias and relative synonymous codon usage (RSCU) of the shared PCGs were analyzed among seven Saxifraga species. The Saxifraga genus exhibited highly similar codon usage preference and amino acid frequency (Figure 5A,B and Figure S1; Table S2). A total of 25,913 to 26,154 codons were identified in shared PCGs. Leucine (10.5–10.6%), Isoleucine (8.4–8.5%), and serine (7.7–7.8%) were widely used, while Cysteine (1.2%), tryptophan (1.7–1.8%), and methionine (2.3–2.4%) were less used. Most amino acids were coded with more than one synonymous codon because of codon degeneracy, such as Leucine with six codons and Isoleucine with four codons. However, only tryptophan and methionine had no alternative codon [66]. Similar to the other advanced plant, for those that applied more than one codon to code, the third nucleotide of the codon was frequently occupied by A/T instead of C/G [67].

Figure 5.

Figure 5

Usage preference of amino acids (AAs) and codons for PCGs. (A) AA usage of all the PCGs in each Saxifraga species. (B) RSCU for every AA in S. saginoides. For each amino acid, a color represents a unique codon. (C) ENC-GC3 plot for S. saginoides; every gene was displayed as a dot, and different colors mean genes in distinct functional groups.

Another analysis for ENC and GC3 was conducted on each PCG. The result suggested that PCGs of Saxifraga species shared consistent codon bias patterns (Figure 5C and Figure S2; Table S3). The calculated ENC of most genes appeared to range from 30 to 60. The majority of PCGswere in the vicinity of the expected ENC, suggesting that random mutation dominated these genes. A few photosynthesis-related genes and translation-related ribosomal proteins are distributed far below the standard curve, suggesting that natural selection or other factors might take effect.

3.5. Repeat Sequence Analysis

SSRs have been described as a robust tool for species identification, population genetics, and phylogenetic studies [68,69,70,71]. Fifty-two to eighty-nine SSRs were identified in seven Saxifraga species (Figure 6B; Table S4). S. granulata contained the largest number of SSRs. Among these repeats, the mononucleotide SSRs were the most abundant (36–64) and mainly constituted by A/T. Some regular repeats, such as A/T/C/G, AG/CT/AT, and AAT, were shared by all cp genomes, whereas other repeat units with more than four nucleotides, such as AATG/ATTC, AATAT/ATATT, and AATCCT/AGGATT, were much more distinct in the specific cp genome (Figure 6A). For all cp genomes, LSC consisted of the greatest number of SSRs and two IRs included the least, which was congruent with the regularity of Pi analysis mentioned earlier (Figure 6C). Repeats longer than 30 bp were also found in seven cp genomes. Only forward and palindromic repeats appeared in all Saxifraga cp genomes (Figure 6D; Table S5). Some large, dispersed repeats were thought to be associated with the rearrangement and played an important role in genome evolution [72,73,74]. In summary, these repeat sequences will be helpful for subsequent population genetics studies.

Figure 6.

Figure 6

Repeats analysis among cp genomes of Saxifraga. (A) Distribution of all repeat units for SSRs in each species. (B) The number of different types for SSRs in each species. (C) Distribution of SSRs, respectively, in LSC, SSC, and IR regions. (D) The number of different types for long repeats.

3.6. Phylogenetic Analysis

To investigate the phylogenetic relationship of seven Saxifraga species in this study and the phylogenetic position of the Saxifraga genus in the Saxifragaceae family, 79 unique genes shared by the cp genomes of 50 species were applied to reconstruct the phylogenetic tree by both the ML and BI methods. All 50 species belonged to Saxifragales, within which 47 species were in the Saxifragaceae family, and 3 non-Saxifragaceae species, Myriophyllum spicatum (Haloragaceae), Ribes nevadense (Grossulariaceae), and Ribes roezlii, were set as outgroups. Both methods generated nearly identical topology, and all nodes were well supported with exceedingly high ML bootstrap and Bayesian posterior probability (Figure 7 and Figure S3). At the intrageneric level, S. stolonifera, S. granulata, and the rest of the Saxifraga species were divided into three clades. This result shared the same opinion with the previous phylogenetic study that S. stolonifera and S. granulate, respectively, belong to Sect. Irregulares and Sect. Saxifraga, and the rest of the species are located at Sect. Ciliatae [22]. In addition, Saxifraga species formed a monophyletic clade and were separate from all other species and groups in the Saxifragaceae family, which confirmed another previous study at the family level [75]. This suggested that the whole cp genomes could provide a high resolution of genetic information for an accurate phylogenetic relationship. Some researchers also pointed out that the cp genome could be utilized to accurately parse monophyletic phylogeny, while more complex phylogeny events, such as hybridization and plastid capture, needed nuclear sequence information to be united [76].

Figure 7.

Figure 7

Maximum likelihood (ML) phylogenetic tree of 47 species in the Saxifragaceae family, reconstructed with 79 PCGs. Three non-Saxifragaceae species, Myriophyllum spicatum, Ribes nevadense, and Ribes roezlii, were set as outgroups.

3.7. Selection and Adaption Analysis

First of all, we calculated the Ka/Ks ratio of 79 PCGs between any two Saxifraga species. Most genes were under purifying selection (ratio lower than 1), and the majority ranged from 0.001 to 0.3, indicating that most PCGs were extremely conserved among Saxifraga cp genomes at the amino acid level (Table S6). According to the heatmap, the most conserved genes framed in the red line were suggested to be photosynthesis-related genes (Figure 8). There was only a minority of genes such as ycf1 genes in the comparison of S. umbellulata var. pectinata and S. granulata, S. sinomontana and S. granulata, S. granulata and S. stolonifera, and S. saginoides and S. granulata, and rpl33 in the comparison of S. umbellulata var. umbellulata and S. granulata being under accelerated selection (ratio higher than 1).

Figure 8.

Figure 8

Heatmap representing pairwise Ka/Ks ratios of PCGs among Saxifraga species. The color bias toward red indicates that there is a higher Ka/Ks ratio between genes.

Seventy-nine unique PCGs were used to detect the natural selection pressure among Saxifraga species. Compared to other Saxifraga species mainly distributed in high-altitude areas, S. stolonifera inhabited regions with relatively lower altitudes, which means a moderate climate and weaker light radiation [10]. Therefore, it was worth investigating whether species inhabiting higher elevations with harsh environments underwent adaptive evolution. Selection pressure was estimated with the branch-site model by setting five species (S. sinomontana, S. umbellulata var. pectinata, S. umbellulata var. umbellulata, S. sessiliflora, and S. saginoides) living in the relatively high-altitude niche as the foreground. Genes ndhI and ycf1 were found under positive selection (Table S7). As mentioned earlier, the ndh and ycf families were often involved in the adaptation to highland environments, which now has also been confirmed in the saxifraga species [29,30]. ndhI coded one of the components of the NAD(P)H dehydrogenase (NDH) complex. The NDH complex acted on the photosystem I cyclic electron transfer [77]. It was crucial for carbon accumulation and plant responses to environmental stresses, such as oxidation and fluctuations of light and temperature [78,79]. Given the natural selection of intense light and low air density on the QTP, the alteration of ndhI’s molecular sequence might participate in the adaptive response of Saxifraga species to environmental stress. The ycf1 gene encoded the protein TIC214 that acted as a protein-conducting channel at the inner envelope for importing protein precursors into chloroplasts [80,81]. As one of the longest genes in the cp genome, ycf1 presented substantial variation among various cp genomes, and whether it was essential was controversial [82]. Therefore, the reason why the ycf1 gene was positively selected was unclear. Additionally, a few positively selected sites were also found in another 26 genes, which might also go through faster evolution due to the greatly stressful environment. 31T in the psbT gene and 159L in the ndhI gene were significantly proved to be positively selected by BEB posterior probability greater than 0.95.

4. Conclusions

In this study, seven Saxifraga species were selected for comparative, phylogenetic, and adaptive analysis of Saxifraga cp genomes. Overall, it was revealed that all the Saxifraga species shared similar cp genome size, structure, GC content, gene content, and genome components, and no rearrangements occurred in gene order, either. In the sequence divergence analysis, we detected seven hypervariable regions (trnK-rps16, trnS-trnG, trnT-trnL, ycf4-cemA, petA-psbJ, ndhF-rpl32-trnL, and gene ycf1) in seven Saxifraga species, which provided favorable materials for the further precise identification of Saxifraga species. In this part, IR regions were also found to be more conserved than the LSC and SSC regions. The phylogenetic tree reconstructed with the PCGs of cp genomes provided further evidence that the seven Saxifraga species in this study belong to three different sections. Meanwhile, Saxifraga species formed a monophyletic group in the Saxifragaceae family. The phylogenetic relationship was also reflected in several other features of the cp genome, such as the variation of the whole cp genome exhibited in the mVISTA analysis, and the Pi value calculated among different sections. Regarding altitude adaptation, at the branch and site level, for species inhabiting relatively high altitudes, the ndhI and ycf1 genes were suggested to be under positive selection, which implied the adaptive contribution of the two genes to the extreme environment. The findings above provided some knowledge of the conservation and divergence of the Saxifraga cp genomes and laid the groundwork for precise species identification. Certainly, sampling on a larger scale is needed to learn more about the evolutionary features of Saxifraga cp genomes and the rules of environmental adaptation.

Acknowledgments

We are grateful to those who collected the samples. We would also like to thank Novogene for their NGS service.

Supplementary Materials

Supplementary materials can be found at https://www.mdpi.com/article/10.3390/genes13091673/s1. Figure S1. RSCU for other six Saxifraga species. (A) S. sinomontana. (B) S. umbellulata var. pectinata. (C) S. umbellulata var. umbellulata. (D) S. sessiliflora. (E) S. granulata. (F) S. stolonifera.; Figure S2. ENC-GC3 plot for other six Saxifraga species. (A) S. sinomontana. (B) S. umbellulata var. pectinata. (C) S. umbellulata var. umbellulata. (D) S. sessiliflora. (E) S. granulata. (F) S. stolonifera; Figure S3. Bayesian inference (BI) phylogenetic tree of 47 species in Saxifragaceae family reconstructed with 79 protein-coding genes. Three non-Saxifragaceae species were set as outgroups; Supplementary table file: Tables S1–S7.

Author Contributions

Conceptualization, Z.C. and X.L. (Xing Liu); data curation, Z.C.; formal analysis, Z.C.; funding acquisition, X.L. (Xiaoyan Li) and X.L. (Xing Liu); methodology, Z.C., X.Y., and Y.Y.; resources, W.Z., X.L. (Xinzhong Li), and X.L. (Xing Liu); software, Z.C., X.Y., and Y.Y.; supervision, X.L. (Xiaoyan Li) and X.L. (Xing Liu); visualization, Z.C.; Writing—original draft preparation, Z.C.; writing—review and editing, P.W., W.Z., X.L. (Xinzhong Li), C.L., and S.Z. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The chloroplast sequences of S. sessiliflora and S. saginoides have been uploaded to GenBank and the accession numbers are ON458148 and ON458149. The genbank format files of two cp genomes are also available in the figshare database (DOI: https://doi.org/10.6084/m9.figshare.20488218.v2) (accessed on 19 August 2022).

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

This research was funded by the Local Development Funds of Science and Technology Department of Tibet (XZ202001YD0028C).

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Moreira D., Le Guyader H., Philippe H. The origin of red algae and the evolution of chloroplasts. Nature. 2000;405:69–72. doi: 10.1038/35011054. [DOI] [PubMed] [Google Scholar]
  • 2.Daniell H., Lin C.-S., Yu M., Chang W.-J. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Genome Biol. 2016;17:134. doi: 10.1186/s13059-016-1004-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ravi V., Khurana J., Tyagi A., Khurana P. An update on chloroplast genomes. Plant Syst. Evol. 2008;271:101–122. doi: 10.1007/s00606-007-0608-0. [DOI] [Google Scholar]
  • 4.Jansen R.K., Ruhlman T.A. Genomics of Chloroplasts and Mitochondria. Springer; Berlin/Heidelberg, Germany: 2012. Plastid genomes of seed plants; pp. 103–126. [Google Scholar]
  • 5.Yang Y., Zhou T., Duan D., Yang J., Feng L., Zhao G. Comparative analysis of the complete chloroplast genomes of five Quercus species. Front. Plant Sci. 2016;7:959. doi: 10.3389/fpls.2016.00959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sloan D.B., Triant D.A., Forrester N.J., Bergner L.M., Wu M., Taylor D.R. A recurring syndrome of accelerated plastid genome evolution in the angiosperm tribe Sileneae (Caryophyllaceae) Mol. Phylogenetics Evol. 2014;72:82–89. doi: 10.1016/j.ympev.2013.12.004. [DOI] [PubMed] [Google Scholar]
  • 7.Dong W.L., Wang R.N., Zhang N.Y., Fan W.B., Fang M.F., Li Z.H. Molecular Evolution of Chloroplast Genomes of Orchid Species: Insights into Phylogenetic Relationship and Adaptive Evolution. Int. J. Mol. Sci. 2018;19:716. doi: 10.3390/ijms19030716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ma P.-F., Zhang Y.-X., Zeng C.-X., Guo Z.-H., Li D.-Z. Chloroplast phylogenomic analyses resolve deep-level relationships of an intractable bamboo tribe Arundinarieae (Poaceae) Syst. Biol. 2014;63:933–950. doi: 10.1093/sysbio/syu054. [DOI] [PubMed] [Google Scholar]
  • 9.Mehmood F., Ubaid Z., Bao Y., Poczai P., Mirza B. Comparative plastomics of Ashwagandha (Withania, Solanaceae) and identification of mutational hotspots for barcoding medicinal plants. Plants. 2020;9:752. doi: 10.3390/plants9060752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Flora of China. [(accessed on 10 March 2020)]. Available online: https://www.iplant.cn/foc.
  • 11.Zhmylev P. Ph.D. Thesis. Moscow State University; Moscow, Russia: 2004. Rod Saxifraga L. (Saxifragaceae): Biomorfologia, Sistematika i Evolutsia Zhiznennykh Form [Genus Saxifraga L. (Saxifragaceae): Biomorphology, Systematics and Evolution of the Life Forms] [Google Scholar]
  • 12.Mas De Xaxars G., García-Fernández A., Barnola P., Martín J., Mercadé A., Vallès J., Vargas P., Vigo J., Garnatje T. Phylogenetic and cytogenetic studies reveal hybrid speciation in Saxifraga subsect. Triplinervium (Saxifragaceae). J. Syst. Evol. 2015;53:53–62. doi: 10.1111/jse.12105. [DOI] [Google Scholar]
  • 13.Badral D., Odonbayar B., Murata T., Munkhjargal T., Tuvshintulga B., Igarashi I., Suganuma K., Inoue N., Brantner A.H., Odontuya G. Flavonoid and galloyl glycosides isolated from Saxifraga spinulosa and their antioxidative and inhibitory activities against species that cause piroplasmosis. J. Nat. Prod. 2017;80:2416–2423. doi: 10.1021/acs.jnatprod.7b00142. [DOI] [PubMed] [Google Scholar]
  • 14.Dang J., Tao Y., Shao Y., Mei L., Zhang L., Wang Q. Antioxidative extracts and phenols isolated from Qinghai–Tibet Plateau medicinal plant Saxifraga tangutica Engl. Ind. Crops Prod. 2015;78:13–18. doi: 10.1016/j.indcrop.2015.10.023. [DOI] [Google Scholar]
  • 15.Chen Z., Liu Y.-M., Yang S., Song B.-A., Xu G.-F., Bhadury P.S., Jin L.-H., Hu D.-Y., Liu F., Xue W. Studies on the chemical constituents and anticancer activity of Saxifraga stolonifera (L) Meeb. Bioorg Med. Chem. 2008;16:1337–1344. doi: 10.1016/j.bmc.2007.10.072. [DOI] [PubMed] [Google Scholar]
  • 16.Ebersbach J., Muellner-Riehl A., Michalak I., Tkach N., Hoffmann M., Röser M., Sun H., Favre A. In and out of the Qinghai-Tibet Plateau: Divergence time estimation and historical biogeography of the large arctic-alpine genus Saxifraga L. J. Biogeogr. 2017;44:900–910. doi: 10.1111/jbi.12899. [DOI] [Google Scholar]
  • 17.Ebersbach J., Schnitzler J., Favre A., Muellner-Riehl A. Evolutionary radiations in the species-rich mountain genus Saxifraga L. BMC Evol. Biol. 2017;17:119. doi: 10.1186/s12862-017-0967-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Conti E., Soltis D.E., Hardig T.M., Schneider J. Phylogenetic relationships of the silver saxifrages (Saxifraga, sect. Ligulatae Haworth): Implications for the evolution of substrate specificity, life histories, and biogeography. Mol. Phylogenetics Evol. 1999;13:536–555. doi: 10.1006/mpev.1999.0673. [DOI] [PubMed] [Google Scholar]
  • 19.Guldahl A.S., Gabrielsen T.M., Scheen A.-C., Borgen L., Steen S.W., Spjelkavik S., Brochmann C. The Saxifraga rivularis complex in Svalbard: Molecules, ploidy and morphology. Flora-Morphol. Distrib. Funct. Ecol. Plants. 2005;200:207–221. doi: 10.1016/j.flora.2005.01.003. [DOI] [Google Scholar]
  • 20.Gao Q.-B., Li Y.-H., Gornall R.J., Zhang Z.-X., Zhang F.-Q., Xing R., Fu P.-C., Wang J.-L., Liu H.-R., Tian Z.-Z. Phylogeny and speciation in Saxifraga sect. Ciliatae (Saxifragaceae): Evidence from psbA-trnH, trnL-F and ITS sequences. Taxon. 2015;64:703–713. doi: 10.12705/644.3. [DOI] [Google Scholar]
  • 21.Vargas P. A phylogenetic study of Saxifraga sect. Saxifraga (Saxifragaceae) based on nrDNA ITS sequences. Plant Syst. Evol. 2000;223:59–70. doi: 10.1007/BF00985326. [DOI] [Google Scholar]
  • 22.Tkach N., Röser M., Miehe G., Muellner-Riehl A.N., Ebersbach J., Favre A., Hoffmann M.H. Molecular phylogenetics, morphology and a revised classification of the complex genus Saxifraga (Saxifragaceae) Taxon. 2015;64:1159–1187. doi: 10.12705/646.4. [DOI] [Google Scholar]
  • 23.Wang J., Wang M., Zhang X., Sun S., Chen N., Zhang A., Zhao C. Picea purpurea has a physiological advantage over its progenitors in alpine ecosystems due to transgressive segregation. J. For. Res. 2018;23:363–371. doi: 10.1080/13416979.2018.1521905. [DOI] [Google Scholar]
  • 24.Peng Y., Yang Z., Zhang H., Cui C., Qi X., Luo X., Tao X., Wu T., Ouzhuluobu, Basang, et al. Genetic variations in Tibetan populations and high-altitude adaptation at the Himalayas. Mol. Biol. Evol. 2011;28:1075–1081. doi: 10.1093/molbev/msq290. [DOI] [PubMed] [Google Scholar]
  • 25.Yang Z. PAML 4: Phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
  • 26.Casati P., Stapleton A.E., Blum J.E., Walbot V. Genome-wide analysis of high-altitude maize and gene knockdown stocks implicates chromatin remodeling proteins in response to UV-B. Plant J. 2006;46:613–627. doi: 10.1111/j.1365-313X.2006.02721.x. [DOI] [PubMed] [Google Scholar]
  • 27.Guo X., Hu Q., Hao G., Wang X., Zhang D., Ma T., Liu J. The genomes of two Eutrema species provide insight into plant adaptation to high altitudes. DNA Res. 2018;25:307–315. doi: 10.1093/dnares/dsy003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zhang J., Tian Y., Yan L., Zhang G., Wang X., Zeng Y., Zhang J., Ma X., Tan Y., Long N., et al. Genome of Plant Maca (Lepidium meyenii) Illuminates Genomic Basis for High-Altitude Adaptation in the Central Andes. Mol. Plant. 2016;9:1066–1077. doi: 10.1016/j.molp.2016.04.016. [DOI] [PubMed] [Google Scholar]
  • 29.Shen J., Zhang X., Landis J.B., Zhang H., Deng T., Sun H., Wang H. Plastome evolution in Dolomiaea (Asteraceae, Cardueae) using phylogenomic and comparative analyses. Front. Plant Sci. 2020;11:376. doi: 10.3389/fpls.2020.00376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Zhao D.-N., Ren Y., Zhang J.-Q. Conservation and innovation: Plastome evolution during rapid radiation of Rhodiola on the Qinghai-Tibetan Plateau. Mol. Phylogenetics Evol. 2020;144:106713. doi: 10.1016/j.ympev.2019.106713. [DOI] [PubMed] [Google Scholar]
  • 31.Chen S., Zhou Y., Chen Y., Gu J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Jin J.-J., Yu W.-B., Yang J.-B., Song Y., DePamphilis C.W., Yi T.-S., Li D.-Z. GetOrganelle: A fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 2020;21:1–31. doi: 10.1186/s13059-020-02154-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Walker B.J., Abeel T., Shea T., Priest M., Abouelliel A., Sakthikumar S., Cuomo C.A., Zeng Q., Wortman J., Young S.K. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Tillich M., Lehwark P., Pellizzer T., Ulbricht-Jones E.S., Fischer A., Bock R., Greiner S. GeSeq–versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017;45:W6–W11. doi: 10.1093/nar/gkx391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Chan P.P., Lowe T.M. tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences. Methods Mol. Biol. 2019;1962:1–14. doi: 10.1007/978-1-4939-9173-0_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chen Z.-Y.-F. figshare. [(accessed on 19 August 2022)]. Available online: [DOI]
  • 38.Lohse M., Drechsel O., Kahlau S., Bock R. OrganellarGenomeDRAW—a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res. 2013;41:W575–W581. doi: 10.1093/nar/gkt289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Frazer K.A., Pachter L., Poliakov A., Rubin E.M., Dubchak I. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004;32:W273–W279. doi: 10.1093/nar/gkh458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Brudno M., Do C.B., Cooper G.M., Kim M.F., Davydov E., Green E.D., Sidow A., Batzoglou S., Program N.C.S. LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA. Genome Res. 2003;13:721–731. doi: 10.1101/gr.926603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Amiryousefi A., Hyvönen J., Poczai P. IRscope: An online program to visualize the junction sites of chloroplast genomes. Bioinformatics. 2018;34:3030–3031. doi: 10.1093/bioinformatics/bty220. [DOI] [PubMed] [Google Scholar]
  • 42.Rozas J., Ferrer-Mata A., Sánchez-DelBarrio J.C., Guirao-Rico S., Librado P., Ramos-Onsins S.E., Sánchez-Gracia A. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol. Biol. Evol. 2017;34:3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
  • 43.Kumar S., Stecher G., Li M., Knyaz C., Tamura K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018;35:1547. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Wright F. The ‘effective number of codons’ used in a gene. Gene. 1990;87:23–29. doi: 10.1016/0378-1119(90)90491-9. [DOI] [PubMed] [Google Scholar]
  • 45.Wei L., He J., Jia X., Qi Q., Liang Z., Zheng H., Ping Y., Liu S., Sun J. Analysis of codon usage bias of mitochondrial genome in Bombyx moriand its relation to evolution. BMC Evol. Biol. 2014;14:1–12. doi: 10.1186/s12862-014-0262-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Hussain S., Rasool S.T. Analysis of synonymous codon usage in Zika virus. Acta Trop. 2017;173:136–146. doi: 10.1016/j.actatropica.2017.06.006. [DOI] [PubMed] [Google Scholar]
  • 47.Beier S., Thiel T., Münch T., Scholz U., Mascher M. MISA-web: A web server for microsatellite prediction. Bioinformatics. 2017;33:2583–2585. doi: 10.1093/bioinformatics/btx198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Kurtz S., Choudhuri J.V., Ohlebusch E., Schleiermacher C., Stoye J., Giegerich R. REPuter: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001;29:4633–4642. doi: 10.1093/nar/29.22.4633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Zhang D., Gao F., Jakovlić I., Zou H., Zhang J., Li W.X., Wang G.T. PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Resour. 2020;20:348–355. doi: 10.1111/1755-0998.13096. [DOI] [PubMed] [Google Scholar]
  • 50.Nguyen L.-T., Schmidt H.A., Von Haeseler A., Minh B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Ronquist F., Teslenko M., Van Der Mark P., Ayres D.L., Darling A., Höhna S., Larget B., Liu L., Suchard M.A., Huelsenbeck J.P. MrBayes 3.2: Efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 2012;61:539–542. doi: 10.1093/sysbio/sys029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Letunic I., Bork P. Interactive Tree Of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49:W293–W296. doi: 10.1093/nar/gkab301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Zhang Z., Xiao J., Wu J., Zhang H., Liu G., Wang X., Dai L. ParaAT: A parallel tool for constructing multiple protein-coding DNA alignments. Biochem. Biophys. Res. Commun. 2012;419:779–781. doi: 10.1016/j.bbrc.2012.02.101. [DOI] [PubMed] [Google Scholar]
  • 54.Wang D., Zhang Y., Zhang Z., Zhu J., Yu J. KaKs_Calculator 2.0: A toolkit incorporating γ-series methods and sliding window strategies. Genom Proteom. Bioinform. 2010;8:77–80. doi: 10.1016/S1672-0229(10)60008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Wu Z., Liao R., Yang T., Dong X., Lan D., Qin R., Liu H. Analysis of six chloroplast genomes provides insight into the evolution of Chrysosplenium (Saxifragaceae) BMC Genom. 2020;21:621. doi: 10.1186/s12864-020-07045-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Liu S., Xu Q., Liu K., Zhao Y., Chen N. Chloroplast genomes for five Skeletonema species: Comparative and phylogenetic analysis. Front. Plant Sci. 2021;12:774617. doi: 10.3389/fpls.2021.774617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Zhai Y., Yu X., Zhou J., Li J., Tian Z., Wang P., Meng Y., Zhao Q., Lou Q., Du S. Complete chloroplast genome sequencing and comparative analysis reveals changes to the chloroplast genome after allopolyploidization in Cucumis. Genome. 2021;64:627–638. doi: 10.1139/gen-2020-0134. [DOI] [PubMed] [Google Scholar]
  • 58.Lee H.-L., Jansen R.K., Chumley T.W., Kim K.-J. Gene relocations within chloroplast genomes of Jasminum and Menodora (Oleaceae) are due to multiple, overlapping inversions. Mol. Biol. Evol. 2007;24:1161–1180. doi: 10.1093/molbev/msm036. [DOI] [PubMed] [Google Scholar]
  • 59.Wang R.J., Cheng C.L., Chang C.C., Wu C.L., Su T.M., Chaw S.M. Dynamics and evolution of the inverted repeat-large single copy junctions in the chloroplast genomes of monocots. BMC Evol. Biol. 2008;8:36. doi: 10.1186/1471-2148-8-36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Menezes A.P.A., Resende-Moreira L.C., Buzatti R.S.O., Nazareno A.G., Carlsen M., Lobo F.P., Kalapothakis E., Lovato M.B. Chloroplast genomes of Byrsonima species (Malpighiaceae): Comparative analysis and screening of high divergence sequences. Sci. Rep. 2018;8:2210. doi: 10.1038/s41598-018-20189-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Nock C.J., Waters D.L., Edwards M.A., Bowen S.G., Rice N., Cordeiro G.M., Henry R.J. Chloroplast genome sequences from total DNA for plant identification. Plant Biotechnol. J. 2011;9:328–333. doi: 10.1111/j.1467-7652.2010.00558.x. [DOI] [PubMed] [Google Scholar]
  • 62.Dong W., Liu H., Xu C., Zuo Y., Chen Z., Zhou S. A chloroplast genomic strategy for designing taxon specific DNA mini-barcodes: A case study on ginsengs. BMC Genet. 2014;15:138. doi: 10.1186/s12863-014-0138-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.A DNA barcode for land plants. Proc. Natl. Acad. Sci. USA. 2009;106:12794–12797. doi: 10.1073/pnas.0905845106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Xu C., Cai X., Chen Q., Zhou H., Cai Y., Ben A. Factors affecting synonymous codon usage bias in chloroplast genome of oncidium gower ramsey. Evol. Bioinform. 2011;7:271–278. doi: 10.4137/EBO.S8092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Das S., Paul S., Dutta C. Synonymous codon usage in adenoviruses: Influence of mutation, selection and protein hydropathy. Virus Res. 2006;117:227–236. doi: 10.1016/j.virusres.2005.10.007. [DOI] [PubMed] [Google Scholar]
  • 66.McClellan D.A. The codon-degeneracy model of molecular evolution. J. Mol. Evol. 2000;50:131–140. doi: 10.1007/s002399910015. [DOI] [PubMed] [Google Scholar]
  • 67.Zhang P., Xu W., Lu X., Wang L. Analysis of codon usage bias of chloroplast genomes in Gynostemma species. Physiol. Mol. Biol. Plants. 2021;27:2727–2737. doi: 10.1007/s12298-021-01105-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Tuler A.C., Carrijo T.T., Nóia L.R., Ferreira A., Peixoto A.L., da Silva Ferreira M.F. SSR markers: A tool for species identification in Psidium (Myrtaceae) Mol. Biol. Rep. 2015;42:1501–1513. doi: 10.1007/s11033-015-3927-1. [DOI] [PubMed] [Google Scholar]
  • 69.Yang A.H., Zhang J.J., Yao X.H., Huang H.W. Chloroplast microsatellite markers in Liriodendron tulipifera (Magnoliaceae) and cross-species amplification in L. chinense. Am. J. Bot. 2011;98:e123–e126. doi: 10.3732/ajb.1000532. [DOI] [PubMed] [Google Scholar]
  • 70.Korkovelos A., Mavromatis A., Huang W., Hagidimitriou M., Giakoundis A., Goulas C. Effectiveness of SSR molecular markers in evaluating the phylogenetic relationships among eight Actinidia species. Sci. Hortic. 2008;116:305–310. doi: 10.1016/j.scienta.2008.01.011. [DOI] [Google Scholar]
  • 71.Tabbasam N., Zafar Y. Pros and cons of using genomic SSRs and EST-SSRs for resolving phylogeny of the genus Gossypium. Plant Syst. Evol. 2014;300:559–575. doi: 10.1007/s00606-013-0891-x. [DOI] [Google Scholar]
  • 72.Cavalier-Smith T. Chloroplast evolution: Secondary symbiogenesis and multiple losses. Curr. Biol. 2002;12:R62–R64. doi: 10.1016/S0960-9822(01)00675-3. [DOI] [PubMed] [Google Scholar]
  • 73.Powell W., Morgante M., McDevitt R., Vendramin G.G., Rafalski J.A. Polymorphic simple sequence repeat regions in chloroplast genomes: Applications to the population genetics of pines. Proc. Natl. Acad. Sci. USA. 1995;92:7759–7763. doi: 10.1073/pnas.92.17.7759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Liu W., Kong H., Zhou J., Fritsch P.W., Hao G., Gong W. Complete Chloroplast Genome of Cercis chuniana (Fabaceae) with Structural and Genetic Comparison to Six Species in Caesalpinioideae. Int. J. Mol. Sci. 2018;19:1286. doi: 10.3390/ijms19051286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Deng J.B., Drew B.T., Mavrodiev E.V., Gitzendanner M.A., Soltis P.S., Soltis D.E. Phylogeny, divergence times, and historical biogeography of the angiosperm family Saxifragaceae. Mol. Phylogenetics Evol. 2015;83:86–98. doi: 10.1016/j.ympev.2014.11.011. [DOI] [PubMed] [Google Scholar]
  • 76.Liu L.X., Du Y.X., Folk R.A., Wang S.Y., Soltis D.E., Shang F.D., Li P. Plastome Evolution in Saxifragaceae and Multiple Plastid Capture Events Involving Heuchera and Tiarella. Front. Plant Sci. 2020;11:361. doi: 10.3389/fpls.2020.00361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Sirpiö S., Allahverdiyeva Y., Holmström M., Khrouchtchova A., Haldrup A., Battchikova N., Aro E.M. Novel nuclear-encoded subunits of the chloroplast NAD(P)H dehydrogenase complex. J. Biol. Chem. 2009;284:905–912. doi: 10.1074/jbc.M805404200. [DOI] [PubMed] [Google Scholar]
  • 78.Kamruzzaman Munshi M., Kobayashi Y., Shikanai T. Identification of a novel protein, CRR7, required for the stabilization of the chloroplast NAD(P)H dehydrogenase complex in Arabidopsis. Plant J. 2005;44:1036–1044. doi: 10.1111/j.1365-313X.2005.02604.x. [DOI] [PubMed] [Google Scholar]
  • 79.Wang P., Duan W., Takabayashi A., Endo T., Shikanai T., Ye J.Y., Mi H. Chloroplastic NAD(P)H dehydrogenase in tobacco leaves functions in alleviation of oxidative damage caused by temperature stress. Plant Physiol. 2006;141:465–474. doi: 10.1104/pp.105.070490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.De Vries J., Sousa F.L., Bölter B., Soll J., Gould S.B. YCF1: A green TIC? Plant Cell. 2015;27:1827–1833. doi: 10.1105/tpc.114.135541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Rochaix J.D. Chloroplast protein import machinery and quality control. FEBS J. 2022 doi: 10.1111/febs.16464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Bölter B., Soll J. Ycf1/Tic214 Is Not Essential for the Accumulation of Plastid Proteins. Mol. Plant. 2017;10:219–221. doi: 10.1016/j.molp.2016.10.012. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The chloroplast sequences of S. sessiliflora and S. saginoides have been uploaded to GenBank and the accession numbers are ON458148 and ON458149. The genbank format files of two cp genomes are also available in the figshare database (DOI: https://doi.org/10.6084/m9.figshare.20488218.v2) (accessed on 19 August 2022).


Articles from Genes are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES