Skip to main content
Genes logoLink to Genes
. 2024 Jul 18;15(7):940. doi: 10.3390/genes15070940

Intraspecific Differentiation of Styrax japonicus (Styracaceae) as Revealed by Comparative Chloroplast and Evolutionary Analyses

Hao-Zhi Zheng 1,2, Wei Dai 1,2, Meng-Han Xu 1,2, Yu-Ye Lin 1,2, Xing-Li Zhu 1,2, Hui Long 1,2, Li-Li Tong 3, Xiao-Gang Xu 1,2,*
Editors: Qinghu Ma, Xinjie Shen
PMCID: PMC11275416  PMID: 39062719

Abstract

Styrax japonicus is a medicinal and ornamental shrub belonging to the Styracaceae family. To explore the diversity and characteristics of the chloroplast genome of S. japonicus, we conducted sequencing and comparison of the chloroplast genomes of four naturally distributed S. japonicus. The results demonstrated that the four chloroplast genomes (157,914–157,962 bp) exhibited a typical quadripartite structure consisting of a large single copy (LSC) region, a small single copy (SSC) region, and a pair of reverse repeats (IRa and IRb), and the structure was highly conserved. DNA polymorphism analysis revealed that three coding genes (infA, psbK, and rpl33) and five intergene regions (petA-psbJ, trnC-petN, trnD-trnY, trnE-trnT, and trnY-trnE) were identified as mutation hotspots. These genetic fragments have the potential to be utilized as DNA barcodes for future identification purposes. When comparing the boundary genes, a small contraction was observed in the IR region of four S. japonicus. Selection pressure analysis indicated positive selection for ycf1 and ndhD. These findings collectively suggest the adaptive evolution of S. japonicus. The phylogenetic structure revealed conflicting relationships among several S. japonicus, indicating divergent evolutionary paths within this species. Our study concludes by uncovering the genetic traits of the chloroplast genome in the differentiation of S. japonicus variety, offering fresh perspectives on the evolutionary lineage of this species.

Keywords: Styrax japonicus, chloroplast, adaptive evolution, morphological variation, selection pressure

1. Introduction

Chloroplasts in higher plants play a crucial role in the biosynthesis of amino acids, fatty acids, vitamins, and pigments [1], as well as being actively involved in photosynthesis [2]. The chloroplast genome structure is characterized by a circular quadripartite organization consisting of two inverted repeat regions (IRs), a small single copy (SSC) region, and a large single copy (LSC) region that are highly conserved [3]. Furthermore, chloroplast genes demonstrate a relatively high level of conservation and play a crucial role in fundamental biological processes such as photosynthesis, transcription, and translation [4]. The size of the chloroplast genome varies among different species, typically ranging from 120 to 160 kb, and contains two sets of four ribosomal RNA genes and 30 tRNA genes [5,6]. The majority of angiosperm species rely on matrilineal inheritance to ensure the stability of species evolution. Simultaneously, genetic mutation events provide valuable information for evolutionary studies, population classification, and species identification [7,8,9,10]. Consequently, chloroplast genes present an ideal research subject for investigating species evolution [11].

The diversity of chloroplast genomes offers valuable resources for uncovering phylogenetic relationships across various dimensions [12,13,14]. Furthermore, due to the maternal inheritance of chloroplasts in most angiosperms [15], variations in their association with nuclear phylogeny could offer valuable insights into the processes of speciation [16,17]. The modification of gene content within the chloroplast genome may contribute to the adaptation of species to specific habitats and living strategies [18,19]. Changes in the environment and habitat can impose selective genetic pressure, influencing the genes involved. Genes undergoing positive selection typically confer advantages in terms of individual fitness and reproductive capability [20]. Therefore, the investigation of gene selection pressure and adaptive evolution has emerged as a prominent area of focus in molecular research, providing a solid groundwork for the study of germplasm resources.

S. japonicus Siebold & Zucc is a member of the Styracaceae family, which is a low-branched landscape tree with aromatic flowers possessing significant ornamental and medicinal properties [21,22,23]. S. japonicus has been utilized in the treatment of various conditions, including cough, bronchitis, and laryngitis [24,25] and is also employed in traditional Chinese herbal medicine for relieving toothaches and sore throats [26]. Moreover, some scholars suggest that the distinctive properties of its nectar make it a potential additive in honey processing [27]. It exhibits wide distribution across eastern Asia, including Korea, Japan, and southern China [28]. Due to its wide distribution and morphological characteristics, there has been some confusion in the taxonomy of S. japonicus, making it one of the most heterogeneous species in Styrax [29,30,31]. However, there is limited research on S. japonicus variety; Ju et al. [32] conducted an analysis of the mechanism of calyx color variation in two S. japonicus varieties using integrated metabolomics and transcriptomics. The extent of genomic diversity in chloroplasts remains entirely unexplored.

Although the nuclear genome predominantly dictates the morphological and physiological traits of plants, the chloroplast genome indirectly influences these attributes to a notable extent. For instance, Lee and colleagues discovered variations in the chloroplast genome of Cynanchum wilfordii, proposing a potential link between these genetic differences and the species’ calyx variations [33]. Similarly, Tian et al., in their investigation of Lindera obtusiloba, highlighted variations in chloroplast genome size and intraspecific diversity, which might be connected to the plant’s adaptive responses to environmental pressures. Specifically, genes under positive selection could drive morphological adaptations in plants facing environmental stress [34]. Sarwar et al. conducted a comprehensive analysis of Olea europaea cultivars, revealing significant disparities in plant height, blade size and shape, phyllotaxy, venation, and chlorophyll content among varieties from different origins. These variations might originate from the diversity within the chloroplast genome [35]. In essence, the chloroplast genome’s diversity significantly shapes plant morphology, with these morphological traits not only mirroring genetic variability but also potentially indicating adaptability to specific environmental conditions. Moreover, the interplay between the chloroplast and nuclear genomes could be crucial in plant speciation and evolutionary processes. Understanding these interactions offers valuable insights into the complex genetic architecture underlying plant diversity and adaptation.

To explore the diversity and evolution of the chloroplast genome of S. japonicus, we collected three samples with typical leaf variation and one wild-type sample with standard morphology from the natural population of S. japonicus in Shandong, China, and conducted chloroplast genome sequencing. The objective is to analyze the evolutionary mechanisms, intraspecific differentiation, and chloroplast genome diversity of S. japonicus from the perspective of the chloroplast genome. Specifically, (1) comparative analysis of chloroplast genome variations in S. japonicus; (2) examination of wild type and genetic variants associated with adaptive evolution; (3) reconstruction and comparison of phylogenetic relationships.

2. Result

2.1. Chloroplast Genome Characterization and Structure

After sequencing, the chloroplast genomes of the four samples (157,914–157,962 bp, 36.96–36.98% GC content) displayed a typical quadripartite structure, comprising of the LSC region (87,549–87,637 bp, 34.79–34.82% GC content), SSC region (18,245–18,269 bp, 30.26–30.28% GC content), and a pair of IR regions (26,040–26,048 bp, 42.94–42.95% GC content) (Appendix B Figure A1 and Table A1). Each of the three chloroplasts contains a total of 133 genes, including 87 protein-coding genes, 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and a pseudogene (ycf1). All genes can be categorized into three groups: (1) photosynthesis, (2) self-replication, and (3) others (Appendix B, Table A2).

2.2. Comparative Analysis

Based on the previously published S. japonicus chloroplast genome from NCBI (Accession number: MT178456; MZ285743; NC047429), the results indicate that four samples exhibit consistency with other S. japonicus boundary genes. However, there were varying degrees of expansion or contraction in the IR region (Figure 1), although such variations were minimal, with only a few bp. The chloroplast genome sequences of three varieties were analyzed using the reference chloroplast genome of wild type in mVISTA (Appendix B Figure A2), revealing a significant level of similarity and conservation among the chloroplast genomes within the three varieties. Importantly, coding regions exhibited higher levels of conservation compared to non-coding regions, while the IR regions demonstrated lower variability than the LSC and SSC regions. Through the analysis of the CDS sequence and complete chloroplast genome nucleotide polymorphisms, it was found that infA, psbK, and rpl33 exhibited high Pi values (Figure 2A). Furthermore, the intergenic regions petA-psbJ, trnC-petN, trnD-trnY, trnE-trnT, and trnY-trnE also exhibited elevated Pi values (Figure 2B).

Figure 1.

Figure 1

Comparison of chloroplast genome structure in four S. japonicus samples. IR (inverted repeat), LSC (large single copy), and SSC (small single copy) regions and border genes are indicated. Note: JLA: junction between LSC and IRa; JLB: junction between LSC and IRb; JSA: junction between SSC and IRa; JSB: junction between SSC and IRb.

Figure 2.

Figure 2

Nucleotide diversity of chloroplast genomes in S. japonicus. (A): Pi in CDS; (B): chloroplast genome Pi value. Note: window length: 600 bp; step length: 50 bp; x axis: position of the midpoint of each window; y axis: Pi of each window. The blue line represents the trajectory of the value of Pi.

2.3. Repetitive and Codon Usage Bias Analysis

Through SSR analysis, the four chloroplast genomes were found to contain a total of 58–61 SSRs, with the majority (77.59–79.66%) located in the LSC region. The IR regions contained between 3.39 and 6.90% of SSR loci, while the SSC region included between 14.75 and 16.95% (Figure 3A). In contrast, only WT exhibited dinucleotide repeat units (Figure 3B). Furthermore, the majority of repeats in all four species consist of mononucleotide repeats (Figure 3C).

Figure 3.

Figure 3

Analysis of SSR sites and repetitive sequences in five chloroplast genomes. (A): Distribution of SSRs in the four S. japonicus samples; (B): Number of different SSRs loci types; (C): number of different repeat types. Note: In (A), symbol (+) represented the position of SSRs and the proportion of text displayed; in (C), C: complementary repeats, F: forward repeats, P: palindromic repeats, R: reverse repeats.

The examination of chloroplast genomes demonstrated that the GC and GC3s composition within the codons was consistently lower than 0.5, suggesting a predilection towards A/T bases and A/T-ending codons in three chloroplast genomes.

2.4. RSCU Analysis and Selection in Evolution

The relative synonymous codon usage (RSCU) values showed similarity across the four genomes (Figure 4A). A total of 33 codons exhibited an RSCU value greater than 1 (Figure 4B), with only two of these codons ending in G (AUG and UUG). Among the codons with an RSCU value less than 1, except for AUA, CUA, and UGA, which terminate in A, the remaining codons conclude with either C or G. Due to the majority of genes having Ks values of 0, this leads to an ineffective Ka/Ks ratio (Figure 4C). Out of the 79 common genes, only 8 had valid values, with ycf1 and ndhD exhibiting values greater than 1.

Figure 4.

Figure 4

Synonymous codon usage and selective pressures in the evolution. (A): Codon content of 20 amino acids and stop codons in all protein-coding genes of four S. japonicus chloroplast genome; (B): Distribution of codon preference in S. japonicus; (C): Ka/Ks values of protein-coding genes of the five comparative combinations. Note: in (A), the top panel shows the RSCU for the corresponding amino acids, and the colored blocks, which are shown below, represent different codons. In (C), Ka: nonsynonymous; Ks: synonymous.

2.5. Phylogenetic Analysis

The best-fit model GTR+G determined by ModelFinder was utilized for maximum likelihood analysis. Based on the Styrax topology, the interbranches within the genus are notably reduced, suggesting a lower level of divergence between Styrax species (Figure 5A). Phylogenetic analysis of the complete chloroplast genome and CDS sequences revealed that S. japonicus formed three clades within Styrax. The four samples sequenced in this study grouped together, with YZ and WT forming one branch, and LY and JR forming another. Three additional reference sequences reveal that S. japonicus (MZ285743) forms a branch with S. calvescens and as a sister clade to two other S. japonicus in the complete chloroplast genome tree (Figure 5B). However, in the CDS tree, S. japonicus (MZ285743) appears as a single branch, while the remaining two S.japonicus (NC047429 and MT178456) consistently form a branch in both trees. In general, the phylogenetic analysis revealed that seven samples of S. japonicus formed three distinct clusters, indicating a high level of diversity in the chloroplast genome and potential phylogenetic conflict within this species.

Figure 5.

Figure 5

Phylogenetic tree analysis using the maximum likelihood (ML). (A) Phylogenetic topologies of 33 Styrax species; (B) phylogenetic tree constructed from complete chloroplast genome sequence (left) and CDS sequence (right).

3. Discussion

In this study, we performed sequencing of three S. japonicus varieties and conducted a comparative analysis with the wild type and other S. japonicus to explore the correlation between leaf morphological variation and evolution in the chloroplast genome dimension. The results revealed a high degree of similarity among their chloroplast genomes; however, sliding window analysis identified potential mutation hotspots that could serve as DNA barcodes for the identification and germplasm resource of S. japonicus taxa. Additionally, (1) the boundary genes detection revealed varying degrees of expansion and contraction in the IR region of the three varieties; (2) ndhD and ycf1 were found to be under positive selection; (3) phylogenetic analysis revealed the formation of three clades by seven S. japonicus. In the following, we will discuss and explore these three points.

3.1. IR Region Contraction within S. japonicus

The findings of this study demonstrate that the boundary genes in all S. japonicus are consistent; however, the IR regions of the three varieties exhibit varying degrees of contraction compared to different S. japonicus samples. Despite being minor, only a few bp, it is noteworthy that such changes occur within the same species.

The IR region of the chloroplast genome is generally considered to be highly conserved [36], but variations in its boundary genes can lead to changes in gene loss or the formation of pseudogenes [37]. This phenomenon is common in chloroplast genome evolution and is a major contributor to its length variation [38]. However, the expansion or contraction of IR regions can be attributed to various factors, including genome rearrangement [39], random mutations [40], evolutionary pressure [41], or gene transfer [42].

In conjunction with the current study, we hypothesize that the contraction of the IR region of S. japonicus in this study is primarily propelled by adaptive evolution due to specific environmental stress adaptation. This variation directly contributes to changes in genetic diversity, leading to the generation of new genetic variants. Moreover, it has been demonstrated that the expansion of the IR region can enhance the stability of the chloroplast genome [43]. Conversely, the contraction of the IR region in this study indicates a reduction in the structural stability of the chloroplast genome, which may be a primary factor contributing to phenotypic change among varieties.

3.2. Positive Selection of ndhD and ycf1

The results of the selection pressure analysis in this study revealed a positive selection of the ndhD and ycf1 genes, directly indicating the adaptive evolution of the four S. japonicus. Positive selection may represent the optimization of gene function in response to environmental stress [44,45,46].

The ndhD-encoded protein is a constituent subunit of the chloroplast NDH complex [47], which plays a crucial role in cyclic electron transport and respiration around photosystem I (PSI), generating a transthylakoid membrane proton gradient and ATP for CO2 assimilation. The expression of the ndhD gene is subject to regulation through RNA editing. RNA editing can create start codons to control the efficiency of translation and promote gene expression. For instance, in the chloroplast genome of Arabidopsis thaliana, the ACG sequence is altered to AUG, thereby generating the start codon for ndhD [48]. Hirose and Sugiura also observed a correlation between the editing efficiency of tobacco ndhD and the stage of leaf development, with the highest efficiency occurring in the early stages and gradually decreasing [49]. Additionally, previous research has indicated that environmental factors such as temperature and light can impact the expression of the ndhD gene [50]. In this study, we speculate that four S. japonicus ndhD genes were positively selected, leading to alterations in the photosynthetic process and subsequent changes in the morphology of S. japonicus. Although research on this process is limited, these findings directly indicate the ongoing evolution of S. japonicus.

The ycf1 gene is typically the largest in chloroplasts and is a component of the chloroplast inner membrane translocon [51]. However, this study found that ycf1 was a pseudogene, a phenomenon also observed in rice, maize, palm, and some Geraniaceae species [52,53,54]. Additionally, the ycf1 gene has been actively selected in other lineages, such as Oenothera, Citrus, and Cardamine [55,56,57]. If the genes encoded by chloroplasts become pseudogenes, it typically indicates their inability to encode functional proteins. In this study, ycf1 was identified as a pseudogene due to the presence of a premature stop codon. However, we hypothesize that it may still possess certain functionalities, such as (1) influencing the expression of nearby genes due to the presence of enhancers or promoters in the sequence [58]; (2) transcribing into non-coding RNA with regulatory functions [59]; (3) becoming part of the transposable element, affecting the dynamic changes of the genome [60]. Of course, pseudogenes may also acquire new functions through evolutionary processes. The positive selection of ycf1 as a pseudogene in this study provides robust support for the ongoing evolutionary event in the S. japonicus chloroplast genome, a phenomenon not previously observed in studies of this species.

In this section, we endeavor to speculate on the underlying factors contributing to chloroplast diversity in S. japonicus by conducting an analysis of ndhD and ycf1. Despite our limited understanding, these findings provide novel insights and establish a groundwork for future genetic manipulation and breeding initiatives.

3.3. Phylogenetic Analysis

The phylogenetic structure reconstructed based on the chloroplast genome in this study reveals conflicting relationships within seven S. japonicus samples. This discrepancy, not previously observed in Styrax studies, prompts a discussion of the reasons for these conflicts in the species’ phylogeny.

In previous studies, phylogenetic analysis based on chloroplast genomes revealed a close clustering of S. japonicus with S. grandiflorus, S. confuses, and S. calvescens [61], which is similar to the findings of the phylogenetic study of Styrax by Song et al. [62]. However, these analyses did not capture the full extent of chloroplast genome diversity in individual species. Our study, which combined analysis of published S. japonicus chloroplast genomes with four newly sequenced samples, revealed that S. japonicus forms at least three distinct clusters within Styrax; it shares sister branches with S. calvescens, S. dasyanthus, and S. faberi, respectively.

Generally, samples of the same species may diverge into different branches for several reasons. Firstly, genetic diversity can lead to significant variations within the species [63]. Secondly, geographic isolation or differences in flora distribution between populations may restrict gene flow [64]. Thirdly, niche differentiation can promote the development of various adaptive evolutionary paths within a species [65,66]. Fourth, mutations and gene recombination in sexual reproduction can introduce new genetic variations [67,68]. Additionally, population bottleneck events or founder effects of small populations can result in a reduction in genetic diversity, reflected in the phylogenetic tree where some samples may be markedly different from others [69,70,71]. Furthermore, hybridization and incomplete lineage sorting with closely related species may also contribute to the formation of distinct clades in phylogenetic analysis [16,17,72].

In combination with this study, we inferred that the broad distribution of S. japonicus and its overlapping habit with other Styrax species may increase the likelihood of hybridization and gene introgression, resulting in direct phylogenetic conflict and diversity of chloroplast genomes [73,74,75]. For example, Wang et al. conducted a comparison between domesticated soybean and wild-type soybean, revealing that gene introgression events contributed to the genetic diversity of domesticated soybean, as well as the asymmetric evolution of the nuclear genome and chloroplast genome [76].

Hybridization and gene introgression can result in recombination and mutation of the chloroplast genome, leading to the emergence of new chloroplast genotypes with potentially distinct biological functions and adaptations [77,78]. For instance, variations in chloroplast genes may impact photosynthesis efficiency, subsequently influencing plant growth and survival [79,80]. Gene introgression could also alter the selection pressure on certain genes within the chloroplast genome, thereby facilitating positive or negative selection processes and impacting the adaptive evolution of the chloroplast genome [81].

The complexity of relationships among the seven S. japonicus samples in this study indicates that the chloroplast genome of S. japonicus exhibits diversity and may be undergoing evolutionary divergence. Although the evolutionary mechanism has not been fully elucidated, the findings of this study provide compelling evidence for the evolution of the chloroplast genome of S. japonicus and establish a solid foundation for future germplasm resources.

4. Materials and Methods

4.1. Plant Materials

In this study, fresh blades of four samples were collected from the natural population of S. japonicus in Weide Mountain, Rongcheng, Shandong Province, China (122.422° E, 37.273° N).

We designated them as Jian Ren (JR), Lian Yi (LY), Yan Zi (YZ), and the wild type (WT) for differentiation. To be specific, three varieties exhibited morphological variations in leaf color, shape, and texture when compared to the WT (Figure 6A). The leaves of JR are thick and leathery (Figure 6B), those of LY exhibit leaf folds (Figure 6C), and the leaves of YZ are purplish red in color (Figure 6D). The voucher specimens were deposited in the herbarium of Nanjing Forestry University.

Figure 6.

Figure 6

Leaf comparison of three S.japonicus varieties and wild type. (A): Wild type. (B): Jian Ren. (C): Lian Yi. (D): Yan Zi.

The methods are exclusively conducted on Styrax plants for experimental purposes, strictly adhering to relevant institutional, national, and international guidelines and regulations.

4.2. DNA Extraction and Sequencing

The fresh and healthy leaf tissues of four samples were frozen in liquid nitrogen and stored in an Ultra-low temperature freezer for DNA extraction. The DNA extraction was carried out using the Plant Genomic DNA Kit (Nanjing Genepioneer Biotechnologies Inc., Nanjing, China) according to the manufacturer’s instructions. The concentration and purity of the DNA were evaluated using a Nandrop 2000 instrument (Thermo Fisher Scientific, Waltham, MA, USA). The nucleic acids utilized for sequencing demonstrated a purity range of 1.8–2.0, as indicated by A260/A280 ratios and sample concentrations ≥10 ng/μL. After the quality inspection of the genomic DNA was performed, the specific process included DNA sample detection, fragmentation, end repair, 3′ end plus A, adapter ligation, fragment selection by agarose gel electrophoresis, polymerase chain reaction (PCR) amplification for sequencing library formation, and quality verification. Subsequently, it underwent sequencing on the Illumina NovaSeq 6000 (Illumina, Cambridge, MA, USA) platform with a read length of 150 bp. The total DNA extracting and sequencing were conducted by Nanjing Genepioneer Biotechnologies Inc. (Nanjing, China).

4.3. Chloroplast Assembly and Annotation

Fastp v0.20.0 was used to trim the raw reads, and the high-quality clean data were obtained by removing the connector sequences and low-quality reads (the Filtering Criteria are in Appendix A). Bowtie2 v2.2.4 (http://bowtie-bio.sourceforge.net/bowtie2/index.shtml, accessed on 1 June 2024) [82] was used to align the clean data with the chloroplast genome database built by Genepioneer Biotechnologies in a very sensitive local mode. SPAdes v3.10.1 [83] https://microbialgenomicslab-spring2022.readthedocs.io/en/latest/GenomeAssemblies.html, accessed on 1 June 2024) was used to acquire seed sequences, and the contigs were obtained using the kmer iterative extend seed. The contig sequences were linked into scaffolds using SSPACE v2.0 [67,84] and then used in Gapfiller v2.1.1 [85] to fill the gaps [86] (the Assembly Process is in Appendix A). Two methods were used to annotate the chloroplast genomes to improve the accuracy of the annotation. First, protein-coding genes were annotated using Prodigal v2.6.3 (https://www.github.com/hyattpd/Prodigal, accessed on 1 June 2024). rRNA was predicted using Hmmer [87] (http://www.hmmer.org/, accessed on 1 June 2024), and tRNA was predicted using Aragorn v1.2.38 [88] (https://chlorobox.mpimp-golm.mpg.de/geseq.html, accessed on 1 June 2024). Second, the assembled sequences were compared using Blast v2.6 [89] (https://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 1 June 2024) against related species published in the NCBI database. Subsequently, the two sets of annotations were compared and manually corrected. Finally, the chloroplast genome was mapped using the OGDRAW [90] (https://chlorobox.mpimp-golm.mpg.de/OGDraw.html, accessed on 1 June 2024). The chloroplast genome sequences were deposited in GenBank (GenBank accessions: PP853567-PP853569; OQ473820).

4.4. Chloroplast Genome Comparative Analysis

The boundaries of the plastome sequences collected and generated in this study were visualized in IRscope [91] (http://genocat.tools/tools/irscope.html, accessed on 1 June 2024). The Shufe-LAGAN model in the mVISTA program was utilized to compare the chloroplast genome structure of three varieties, with WT serving as a reference. DnaSP version 6 [92] was utilized to compute the nucleotide diversity (Pi) of coding sequences (CDS) and complete chloroplast genomes. The analysis employed a step size of 50 bp with a window length of 600 bp.

4.5. Repeat and Analysis of Codon Usage Bias

REPuter [93] (https://bibiserv.cebitec.uni-bielefeld.de/reputer, accessed on 1 June 2024) was employed to identify the dimensions and positions of forward, reverse, palindromic, and complementary repeats. Simple sequence repeats (SSRs) were determined using a Perl script MISA [94], encompassing mono-, di-, tri-, tetra-, penta-, and hexa-nucleotides with minimum thresholds of 10, 6, 5, 5, 5, and 5, respectively. For the analysis of codon bias, full-length CDS sequences were extracted for the three variants using ATG as the start codon and TGA, TAG, or TAA as the stop codon to determine codon usage preferences. CodonW was utilized to calculate the nucleotide composition at the third position (A3s, U3s, and G3s), codon adaptation index (CAI), codon bias index (CBI), and effective number of codons (ENC).

4.6. RSCU Analysis and Selective Pressures in the Evolution

Relative synonymous codon usage (RSCU) was analyzed using BioPython 1.84 [95]. The synonymous (Ks) and non-synonymous (Ka) substitution, as well as their ratios (Ka/Ks), were calculated by extracting and comparing the common coding sequences (CDS) of the four samples using Mafft 7.463 [96]. The YN algorithm was chosen in the KaKs calculator 3.0 [97] to consider the evolutionary characters of the sequence [14,98] for the purpose of elucidating the Ka/Ks value and conducting selection pressure analysis.

4.7. Phylogenetic Analysis

For the purpose of investigating the phylogenetic positions of the three Styrax varieties, a total of 31 Styrax chloroplast genomes were utilized for phylogenetic analysis, with two additional Sinkjackia species (Sinojackia xylocarpa and Sinojackia sarcocarpa) serving as outgroups.

In order to explore the phylogenetic relationships of S. japonicus, 31 Styrax contained 3 additional complete chloroplast genomes of S. japonicus (obtained from NCBI, accession number: MZ287543; NC047429; MT187456). Two datasets (complete chloroplast genomes and CDS sequences) were generated from chloroplast genomes, and after alignment, the optimal model was determined using ModelFinder [99]. Maximum-likelihood (ML) phylogenetic analysis was conducted using IQ-TREE 2.2.0 [100] with 1000 bootstrap replicates.

5. Conclusions

To explore the diversity of the chloroplast genome of S. japonicus and its effects on its morphology, we conducted the complete chloroplast genome sequencing of four samples from a natural population in Shandong, China, comprising three variants and one wild-type sample. Comparison with the previously published chloroplast genome of S. japonicus revealed a slight contraction in the IR region of four samples in this study, which may be indicative of adaptive evolution and indirectly suggests clues to genetic diversity. Both ycf1 and ndhD underwent positive selection, with ycf1 being identified as a pseudogene, indicating the adaptive evolution of S. japonicus to environmental stress. Phylogenetic analysis revealed that S. japonicus did not exhibit clustering or sister clades, suggesting that genetic diversity, geographic isolation, interbreeding, and other factors may have contributed to this phenomenon. In conclusion, a comparative analysis of the chloroplast genome suggests that S. japonicus may be undergoing divergent evolutionary paths, primarily driven by adaptive evolution. Furthermore, the observed variations in leaf morphology provide evidence for this phenomenon. While morphological changes are primarily regulated by the nuclear genome, the evolution of the chloroplast genome indirectly contributes to the development of these changes to some extent. Although the specific mechanisms underlying leaf deformation remain unknown, the diversity present in the chloroplast genome lays a foundation for future breeding and gene editing endeavors.

Acknowledgments

We would like to thank Zhi Yang, College of Life Sciences, Nanjing Forestry University, for her guidance and advice on the construction of phylogenetic relationships in this study.

Appendix A

  • Filtering Criteria

Fastp v0.20.0 (https://github.com/OpenGene/fastp, accessed on 1 June 2024) was used to filter the original data. The filtering criteria were as follows:

  • (1)

    Truncate the sequencing linker and primer sequence in the reads;

  • (2)

    Filter out the reads whose average quality value is <Q5;

  • (3)

    Filter out n reads whose number is >5;

The high-quality reads obtained after the above series of quality control steps were the clean data.

  • Assembly Process

SPAdes v3.10.1 [83] was used for genome assembly; kmer used 55, 87, and 121, respectively; and the assembly did not depend on the reference genome. However, due to the characteristics of second-generation sequencing, genomic repeats, genome-specific structure, and other reasons, the complete circular genome sequence could not be directly obtained by one-time splicing. Other strategies were used to obtain the complete circular genome sequence. The assembly process was divided into the following seven steps:

Step 1: The seed sequence of the chloroplast genome was obtained by assembling the cpDNA sequences with SPAdes v3.10.1.

Step 2: kmer iterative extend seed if the result of step 2 was a contig, the result was determined to be a pseudogenome sequence, and step 6 was performed directly.

Step 3: Connect the contig sequence obtained in step 2 with SSPACE v2.0 [84] to obtain the scaffolds.

Step 4: Use Gapfiller v2.1.1 [85] to make up the gaps for the scaffold sequences obtained in step 3.

Step 5: If a gap still exists after these operations, design primers, conduct PCR sequencing, and reassemble until the complete pseudogenome sequence is obtained.

Step 6: The sequencing reads were aligned to the pseudogenome to correct the genome.

Step 7: According to the structure of the chloroplast, the corrected pseudogenome was rearranged to obtain the complete chloroplast circular genome sequence.

Appendix B

Figure A1.

Figure A1

Gene map of the four S. japonicus chloroplast genomes.

Figure A2.

Figure A2

Variation level of three varieties of chloroplast genome sequences. The y-axis indicates the level of variation (between 50 and 100%), and the x-axis represents the coordinate in the chloroplast genome.

Table A1.

Characteristics of assembled chloroplast genome basic structure.

Species Total Size GC Content (%) LSC Size IR Size SSC Size Protein Coding Genes rRNA tRNA NCBI Accession Number
Lian Yi 157,962 bp 36.96% 87,637 bp 26,040 bp 18,245 bp 87 8 37 PP853569
Jian Ren 157,962 bp 36.96% 87,637 bp 26,040 bp 18,245 bp 87 8 37 PP853567
Yan Zi 157,914 bp 36.98% 87,549 bp 26,048 bp 18,269 bp 87 8 37 PP853568
Wild type 157,914 bp 36.97% 87,549 bp 26,048 bp 18,269 bp 87 8 37 OQ473820

Table A2.

Lists of genomic genes for Jian Ren, Lian Yi, Yan Zi, and wild type. Notes: Gene*: Gene with one introns; Gene**: Gene with two introns; #Gene: Pseudogene; Gene(2): Number of copies of multi-copy genes.

Category Function Jian Ren Lian Yi Yan Zi Wild Type
Photosynthesis Subunits of photosystem I psaA, psaB, psaC, psaI, psaJ
Subunits of photosystem II psbA, psbB, psbC, psbD, psbE, psbF, psbH,
psbI, psbJ, psbK, psbL, psbM, psbN, psbT
Subunits of NADH
dehydrogenase
ndhA*, ndhB*(2), ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
Subunits of cytochrome
b/f complex
petA, petB*, petD*, petG, petL, petN
Subunits of ATP synthase atpA, atpB, atpE, atpF*, atpH, atpI
Large subunit of rubisco rbcL
Subunits photochlorophyllide reductase -
Self-replication Proteins of large
ribosomal subunit
rpl14, rpl16*, rpl2*(2), rpl20, rpl22, rpl23(2), rpl32, rpl33, rpl36
Proteins of small
ribosomal subunit
rps11, rps12**(2), rps14, rps15, rps16*,
rps18, rps19, rps2, rps3, rps4, rps7(2), rps8
Subunits of RNA polymerase rpoA, rpoB, rpoC1*, rpoC2
Ribosomal RNAs rrn16(2), rrn23(2), rrn4.5(2), rrn5(2)
Transfer RNAs trnA-UGC*(2), trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA,
trnG-GCC*, trnG-UCC, trnH-GUG, trnI-CAU(2), trnI-GAU*(2),
trnK-UUU*, trnL-CAA(2), trnL-UAA*, trnL-UAG, trnM-CAU,
trnN-GUU(2), trnP-UGG, trnQ-UUG, trnR-ACG(2), trnR-UCU,
trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU,
trnV-GAC(2), trnV-UAC*, trnW-CCA, trnY-GUA, trnfM-CAU
Other genes Maturase matK
Protease clpP**
Envelope membrane protein cemA
Acetyl–CoA carboxylase accD
c-type cytochrome
synthesis gene
ccsA
Translation initiation factor infA
other -
Genes of unknown function Conserved hypothetical
chloroplast ORF
#ycf1, lhbA, ycf1, ycf15(2), ycf2(2), ycf3**, ycf4

Author Contributions

Conceptualization, H.-Z.Z. and X.-G.X.; methodology, H.-Z.Z.; software, H.-Z.Z.; validation, W.D., M.-H.X. and Y.-Y.L.; formal analysis, H.L.; investigation, X.-L.Z.; resources, L.-L.T.; data curation, H.-Z.Z.; writing—original draft preparation, H.-Z.Z.; writing—review and editing, H.-Z.Z.; visualization, W.D.; supervision, X.-G.X.; project administration, H.-Z.Z. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are publicly available. These data can be found here: NCBI (GenBank accessions: PP853567-PP853569; OQ473820).

Conflicts of Interest

The authors declare no conflicts of interest.

Funding Statement

This research received no external funding.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Prabhudas S.K., Prayaga S., Madasamy P., Natarajan P. Shallow Whole Genome Sequencing for the Assembly of Complete Chloroplast Genome Sequence of Arachis hypogaea L. Front. Plant Sci. 2016;7:1106. doi: 10.3389/fpls.2016.01106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Douglas S.E. Plastid Evolution: Origins, Diversity, Trends. Curr. Opin. Genet. Dev. 1998;8:655–661. doi: 10.1016/S0959-437X(98)80033-6. [DOI] [PubMed] [Google Scholar]
  • 3.Shinozaki K., Ohme M., Tanaka M., Wakasugi T., Hayashida N., Matsubayashi T., Zaita N., Chunwongse J., Obokata J., Yamaguchi-Shinozaki K., et al. The Complete Nucleotide Sequence of the Tobacco Chloroplast Genome: Its Gene Organization and Expression. EMBO J. 1986;5:2043–2049. doi: 10.1002/j.1460-2075.1986.tb04464.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sassenrath-Cole G.F. Photosynthesis, a Comprehensive Treatise. Crop Sci. 1999;39:282–283. doi: 10.2135/cropsci1999.0011183X003900010046x. [DOI] [Google Scholar]
  • 5.Alkatib S., Fleischmann T.T., Scharff L.B., Bock R. Evolutionary Constraints on the Plastid Trna Set Decoding Methionine and Isoleucine. Nucleic Acids Res. 2012;40:6713–6724. doi: 10.1093/nar/gks350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Rogalski M., Karcher D., Bock R. Superwobbling Facilitates Translation with Reduced Trna Sets. Nat. Struct. Mol. Biol. 2008;15:192–198. doi: 10.1038/nsmb.1370. [DOI] [PubMed] [Google Scholar]
  • 7.Dong W.P., Xu C., Wen J., Zhou S.L. Evolutionary Directions of Single Nucleotide Substitutions and Structural Mutations in the Chloroplast Genomes of the Family Calycanthaceae. BMC Evol. Biol. 2020;20:96. doi: 10.1186/s12862-020-01661-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhang J., Wang Y., Chen T., Chen Q., Wang L., Liu Z.S., Wang H., Xie R., He W., Li M., et al. Evolution of Rosaceae Plastomes Highlights Unique Cerasus Diversification and Independent Origins of Fruiting Cherry. Front. Plant Sci. 2021;12:736053. doi: 10.3389/fpls.2021.736053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Cao Z.Y., Yang L.Y., Xin Y.X., Xu W.B., Li Q.S., Zhang H.R., Tu Y.X., Song Y., Xin P.Y. Comparative and Phylogenetic Analysis of Complete Chloroplast Genomes from Seven Neocinnamomum Taxa (Lauraceae) Front. Plant Sci. 2023;14:1205051. doi: 10.3389/fpls.2023.1205051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Shi W.B., Song W.C., Chen Z.M., Cai H.H., Gong Q., Liu J., Shi C., Wang S. Comparative Chloroplast Genome Analyses of Diverse Phoebe (Lauraceae) Species Endemic to China Provide Insight into Their Phylogeographical Origin. PeerJ. 2023;11:e14573. doi: 10.7717/peerj.14573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Dong W.P., Xu C., Cheng T., Lin K., Zhou S.L. Sequencing Angiosperm Plastid Genomes Made Easy: A Complete Set of Universal Primers and a Case Study on the Phylogeny of Saxifragales. Genome Biol. Evol. 2013;5:989–997. doi: 10.1093/gbe/evt063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wu Z.Q., Ge S. The Phylogeny of the Bep Clade in Grasses Revisited: Evidence from the Whole-Genome Sequences of Chloroplasts. Mol. Phylogenet. Evol. 2012;62:573–578. doi: 10.1016/j.ympev.2011.10.019. [DOI] [PubMed] [Google Scholar]
  • 13.Li H.T., Yi T.S., Gao L.M., Ma P.F., Zhang T., Yang J.B., Gitzendanner M.A., Fritsch P.W., Cai J., Luo Y., et al. Origin of Angiosperms and the Puzzle of the Jurassic Gap. Nat. Plants. 2019;5:461–470. doi: 10.1038/s41477-019-0421-0. [DOI] [PubMed] [Google Scholar]
  • 14.Zhang R., Wang Y.H., Jin J.J., Stull G.W., Bruneau A., Cardoso D., De Queiroz L.P., Moore M.J., Zhang S.D., Chen S.Y., et al. Exploration of Plastid Phylogenomic Conflict Yields New Insights into the Deep Relationships of Leguminosae. Syst. Biol. 2020;69:613–622. doi: 10.1093/sysbio/syaa013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Corriveau J.L., Coleman A.W. Rapid Screening Method to Detect Potential Biparental Inheritance of Plastid DNA and Results for over 200 Angiosperm Species. Am. J. Bot. 1988;75:1443–1458. doi: 10.1002/j.1537-2197.1988.tb11219.x. [DOI] [Google Scholar]
  • 16.Joly S., McLenachan P.A., Lockhart P.J. A Statistical Approach for Distinguishing Hybridization and Incomplete Lineage Sorting. Am. Nat. 2009;174:E54–E70. doi: 10.1086/600082. [DOI] [PubMed] [Google Scholar]
  • 17.Petit R.J., Excoffier L. Gene Flow and Species Delimitation. Trends Ecol. Evol. 2009;24:386–393. doi: 10.1016/j.tree.2009.02.011. [DOI] [PubMed] [Google Scholar]
  • 18.Song Y., Yu W.-B., Tan Y.H., Liu B., Yao X., Jin J.J., Padmanaba M., Yang J.B., Corlett R.T. Evolutionary Comparisons of the Chloroplast Genome in Lauraceae and Insights into Loss Events in the Magnoliids. Genome Biol. Evol. 2017;9:2354–2364. doi: 10.1093/gbe/evx180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Chen J.F., Yu R.X., Dai J.H., Liu Y., Zhou R.C. The Loss of Photosynthesis Pathway and Genomic Locations of the Lost Plastid Genes in a Holoparasitic Plant Aeginetia indica. BMC Plant Biol. 2020;20:199. doi: 10.1186/s12870-020-02415-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wu Z.H., Liao R., Yang T.G., Dong X., Lan D.Q., Qin R., Liu H. Analysis of Six Chloroplast Genomes Provides Insight into the Evolution of Chrysosplenium (Saxifragaceae) BMC Genom. 2020;21:621. doi: 10.1186/s12864-020-07045-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Park K.R., Kim S., Cho M., Kang S.W., Yun H.M. Effects of Pin on Osteoblast Differentiation and Matrix Mineralization through Runt-Related Transcription Factor. Int. J. Mol. Sci. 2020;21:9579. doi: 10.3390/ijms21249579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Horimoto T., Koshioka M., Kubota S., Mander L.N., Hirai N., Ishida N., Suh J.K., Lee A.K., Roh M.S. Effect of Warm and Cold Stratification on 1h-Nmr Profiles, Endogenous Gibberellins and Abscisic Acid in Styrax japonicus Seeds. Hortic. Environ. Biotechnol. 2011;52:233–239. doi: 10.1007/s13580-011-0207-4. [DOI] [Google Scholar]
  • 23.Chen C., Chen H., Ni M., Yu F.Y. A Study on Petal Morphological and Physiological Characteristics of Styrax japonicus During the Flowering Period. Agronomy. 2021;11:1498. doi: 10.3390/agronomy11081498. [DOI] [Google Scholar]
  • 24.Kitagawa I., Imaukura Y., Hayashi T., Yosioka I. Structure of Desacyl-Jegosaponin, a Common Desacyl Derivative of Jegosaponin Isolated from Pericarps of Styrax japonica Sieb. Et Zucc. Chem. Pharm. Bull. 1974;22:1675–1677. doi: 10.1248/cpb.22.1675. [DOI] [PubMed] [Google Scholar]
  • 25.Min B.S., Na M.K., Oh S.R., Ahn K.S., Jeong G.S., Li G., Lee S.K., Joung H., Lee H.K. New Furofuran and Butyrolactone Lignans with Antioxidant Activity from the Stem Bark of Styrax japonica. J. Nat. Prod. 2004;67:1980–1984. doi: 10.1021/np040113m. [DOI] [PubMed] [Google Scholar]
  • 26.He L., Zhou Y., Wan G., Wang W., Zhang N., Yao L. Antinociceptive Effects of Flower Extracts and the Active Fraction from Styrax japonicus. J. Ethnopharmacol. 2022;284:114779. doi: 10.1016/j.jep.2021.114779. [DOI] [PubMed] [Google Scholar]
  • 27.Ren J., Fang A., Jiao S., Li R., Huang Y., Ni X., Zhang Y., Ma Y., Li S., Li J. Lignans from the Leaves of Styrax japonicus and Their Anti-Inflammatory Activity. Fitoterapia. 2024;172:105774. doi: 10.1016/j.fitote.2023.105774. [DOI] [PubMed] [Google Scholar]
  • 28.Li W., Zhang C.P., Jiang X.Q., Liu Q., Liu Q.H., Wang K.L. De Novo Transcriptomic Analysis and Development of Est–Ssrs for Styrax japonicus. Forests. 2018;9:748. doi: 10.3390/f9120748. [DOI] [Google Scholar]
  • 29.Fritsch P.W. Phylogeny of Styrax Based on Morphological Characters, with Implications for Biogeography and Infrageneric Classification. Syst. Bot. 1999;24:356–378. doi: 10.2307/2419695. [DOI] [Google Scholar]
  • 30.Huang S.M. Systematic Position and Geographical Distribution of Styracaceae. J. Trop. Subtrop. Bot. 1994;2:15–30. [Google Scholar]
  • 31.Huang Y.L., Fritsch P., Shi S.H. A Revision of the Imbricate Group of Styrax Series Cyrta (Styracaceae) in Asia. Ann. Mo. Bot. Gard. 2003;90:491. doi: 10.2307/3298541. [DOI] [Google Scholar]
  • 32.Ju Y.Q., Zhang C.P., Li W., Qian C., Qu Y.M., Zou Z.X., Zhao H., Li L.L. Variation in the Calyx Color in Two Styrax japonicus Varieties Is Attributed to Varied Anthocyanin Levels as Revealed by Integrated Metabolomic and Transcriptomic Analyses. Forests. 2023;14:1413. doi: 10.3390/f14071413. [DOI] [Google Scholar]
  • 33.Lee S.H., Kim J., Park H.S., Koo H., Waminal N.E., Pellerin R.J., Shim H., Lee H.O., Kim E., Park J.Y., et al. Genome Structure and Diversity among Cynanchum wilfordii Accessions. BMC Plant Biol. 2022;22:4. doi: 10.1186/s12870-021-03390-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Tian X.Y., Guo J., Song Y., Yu Q.F., Liu C., Fu Z.X., Shi Y.H., Shao Y.Z., Yuan Z.L. Intraspecific Differentiation of Lindera obtusiloba as Revealed by Comparative Plastomic and Evolutionary Analyses. Ecol. Evol. 2024;14:e11119. doi: 10.1002/ece3.11119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Sarwar G., Anwar T., Chaudhary M.S., Jamil M., Kamal A., Mustafa A.M.A., Al-Ghamdi A.A., Ullah F., Zaman W. Study of Comparative Morphology of Eight Cultivated Genotypes of Olea europaea L. Horticulturae. 2023;9:696. doi: 10.3390/horticulturae9060696. [DOI] [Google Scholar]
  • 36.Zhu A., Guo W., Gupta S., Fan W., Mower J.P. Evolutionary Dynamics of the Plastid Inverted Repeat: The Effects of Expansion, Contraction, and Loss on Substitution Rates. New Phytol. 2016;209:1747–1756. doi: 10.1111/nph.13743. [DOI] [PubMed] [Google Scholar]
  • 37.Guo Y.Y., Yang J.X., Bai M.Z., Zhang G.Q., Liu Z.J. The Chloroplast Genome Evolution of Venus Slipper (Paphiopedilum): Ir Expansion, Ssc Contraction, and Highly Rearranged SSC Regions. BMC Plant Biol. 2021;21:248. doi: 10.1186/s12870-021-03053-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Guo S., Guo L., Zhao W., Xu J., Li Y., Zhang X., Shen X., Wu M., Hou X. Complete Chloroplast Genome Sequence and Phylogenetic Analysis of Paeonia ostii. Molecules. 2018;23:246. doi: 10.3390/molecules23020246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zhang X.F., Landis J.B., Wang H.X., Zhu Z.X., Wang H.F. Comparative Analysis of Chloroplast Genome Structure and Molecular Dating in Myrtales. BMC Plant Biol. 2021;21:219. doi: 10.1186/s12870-021-02985-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Lynch M., Koskella B., Schaack S. Mutation Pressure and the Evolution of Organelle Genomic Architecture. Science. 2006;311:1727–1730. doi: 10.1126/science.1118884. [DOI] [PubMed] [Google Scholar]
  • 41.Bungard R.A. Photosynthetic Evolution in Parasitic Plants: Insight from the Chloroplast Genome. Bioessays. 2004;26:235–247. doi: 10.1002/bies.10405. [DOI] [PubMed] [Google Scholar]
  • 42.Xiong A.S., Peng R.H., Zhuang J., Gao F., Zhu B., Fu X.Y., Xue Y., Jin X.F., Tian Y.S., Zhao W., et al. Gene Duplication, Transfer, and Evolution in the Chloroplast Genome. Biotechnol. Adv. 2009;27:340–347. doi: 10.1016/j.biotechadv.2009.01.012. [DOI] [PubMed] [Google Scholar]
  • 43.Sun M., Zhang J., Huang T., Yang M., Ma L., Duan L. Genome Structure and Variation of Reynoutria japonica Houtt. Chloroplast Genome. Sheng Wu Gong Cheng Xue Bao. 2022;38:1953–1964. doi: 10.13345/j.cjb.210843. [DOI] [PubMed] [Google Scholar]
  • 44.Hasegawa M., Zhong B., Zhong Y. Adaptive Evolution of Chloroplast Genomes in Ancestral Grasses. Plant Signal Behav. 2009;4:623–624. doi: 10.4161/psb.4.7.8914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Wang Y., Wen F., Hong X., Li Z., Mi Y., Zhao B. Comparative Chloroplast Genome Analyses of Paraboea (Gesneriaceae): Insights into Adaptive Evolution and Phylogenetic Analysis. Front. Plant Sci. 2022;13:1019831. doi: 10.3389/fpls.2022.1019831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Yang Q., Fu G.F., Wu Z.Q., Li L., Zhao J.L., Li Q.J. Chloroplast Genome Evolution in Four Montane Zingiberaceae Taxa in China. Front. Plant Sci. 2021;12:774482. doi: 10.3389/fpls.2021.774482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Li Q.H., He Z.H., Mi H.L. The Research Progress of Chloroplast Nad(P)H Dehydrogenase (Ndh) Complex. Zhiwu Shengli Xuebao/Plant Physiol. J. 2013;49:401–409. [Google Scholar]
  • 48.Kotera E., Tasaka M., Shikanai T. A Pentatricopeptide Repeat Protein Is Essential for Rna Editing in Chloroplasts. Nature. 2005;433:326–330. doi: 10.1038/nature03229. [DOI] [PubMed] [Google Scholar]
  • 49.Hirose T., Sugiura M. Both Rna Editing and Rna Cleavage Are Required for Translation of Tobacco Chloroplast Ndhd Mrna: A Possible Regulatory Mechanism for the Expression of a Chloroplast Operon Consisting of Functionally Unrelated Genes. EMBO J. 1997;16:6804–6811. doi: 10.1093/emboj/16.22.6804. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Karcher D., Bock R. The Amino Acid Sequence of a Plastid Protein Is Developmentally Regulated by Rna Editing. J. Biol. Chem. 2002;277:5570–5574. doi: 10.1074/jbc.M107074200. [DOI] [PubMed] [Google Scholar]
  • 51.Kikuchi S., Bédard J., Hirano M., Hirabayashi Y., Oishi M., Imai M., Takase M., Ide T., Nakai M. Uncovering the Protein Translocon at the Chloroplast Inner Envelope Membrane. Science. 2013;339:571–574. doi: 10.1126/science.1229262. [DOI] [PubMed] [Google Scholar]
  • 52.Maier R.M., Neckermann K., Igloi G.L., Kössel H. Complete Sequence of the Maize Chloroplast Genome: Gene Content, Hotspots of Divergence and Fine Tuning of Genetic Information by Transcript Editing. J. Mol. Biol. 1995;251:614–628. doi: 10.1006/jmbi.1995.0460. [DOI] [PubMed] [Google Scholar]
  • 53.Weng M.L., Blazier J.C., Govindu M., Jansen R.K. Reconstruction of the Ancestral Plastid Genome in Geraniaceae Reveals a Correlation between Genome Rearrangements, Repeats, and Nucleotide Substitution Rates. Mol. Biol. Evol. 2014;31:645–659. doi: 10.1093/molbev/mst257. [DOI] [PubMed] [Google Scholar]
  • 54.Yang M., Zhang X., Liu G., Yin Y., Chen K., Yun Q., Zhao D., Al-Mssallem I.S., Yu J. The Complete Chloroplast Genome Sequence of Date Palm (Phoenix dactylifera L.) PLoS ONE. 2010;5:e12762. doi: 10.1371/journal.pone.0012762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Carbonell-Caballero J., Alonso R., Ibañez V., Terol J., Talon M., Dopazo J. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus. Mol. Biol. Evol. 2015;32:2015–2035. doi: 10.1093/molbev/msv082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Greiner S., Wang X., Herrmann R.G., Rauwolf U., Mayer K., Haberer G., Meurer J. The Complete Nucleotide Sequences of the 5 Genetically Distinct Plastid Genomes of Oenothera, Subsection Oenothera: II. A Microevolutionary View Using Bioinformatics and Formal Genetic Data. Mol. Biol. Evol. 2008;25:2019–2030. doi: 10.1093/molbev/msn149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Hu S., Sablok G., Wang B., Qu D., Barbaro E., Viola R., Li M., Varotto C. Plastome Organization and Evolution of Chloroplast Genes in Cardamine Species Adapted to Contrasting Habitats. BMC Genom. 2015;16:306. doi: 10.1186/s12864-015-1498-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Xie J.B., Chen S.S., Xu W.J., Zhao Y.Y., Zhang D.Q. Origination and Function of Plant Pseudogenes. Plant Signal. Behav. 2019;14:1625698. doi: 10.1080/15592324.2019.1625698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Zhang J.M., Pontoppidan B., Xue J.P., Rask L., Meijer J. The Third Myrosinase Gene Tgg3 in Arabidopsis thaliana Is a Pseudogene Specifically Expressed in Stamen and Petal. Physiol. Plant. 2002;115:25–34. doi: 10.1034/j.1399-3054.2002.1150103.x. [DOI] [PubMed] [Google Scholar]
  • 60.Yadav S., Kalwan G., Meena S., Gill S.S., Yadava Y.K., Gaikwad K., Jain P.K. Unravelling the Due Importance of Pseudogenes and Their Resurrection in Plants. Plant Physiol. Biochem. 2023;203:108062. doi: 10.1016/j.plaphy.2023.108062. [DOI] [PubMed] [Google Scholar]
  • 61.Tong T.T., Shao L.L., Peng Z.H. The Complete Chloroplast Genome of Styrax japonicus (Styracaceae), a Deciduous Tree Distributed in East Asia. Mitochondrial DNA Part B Resour. 2020;5:1863–1864. doi: 10.1080/23802359.2020.1751004. [DOI] [Google Scholar]
  • 62.Song Y., Zhao W.J., Xu J., Li M.F., Zhang Y.J. Chloroplast Genome Evolution and Species Identification of Styrax (Styracaceae) Biomed. Res. Int. 2022;2022:5364094. doi: 10.1155/2022/5364094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Daniell H., Lin C.S., Yu M., Chang W.J. Chloroplast Genomes: Diversity, Evolution, and Applications in Genetic Engineering. Genome Biol. 2016;17:134. doi: 10.1186/s13059-016-1004-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Lo E.Y., Stefanović S., Dickinson T.A. Population Genetic Structure of Diploid Sexual and Polyploid Apomictic Hawthorns (Crataegus; Rosaceae) in the Pacific Northwest. Mol. Ecol. 2009;18:1145–1160. doi: 10.1111/j.1365-294X.2009.04091.x. [DOI] [PubMed] [Google Scholar]
  • 65.Moen D., Morlon H. Why Does Diversification Slow Down? Trends Ecol. Evol. 2014;29:190–197. doi: 10.1016/j.tree.2014.01.010. [DOI] [PubMed] [Google Scholar]
  • 66.Rabosky D.L. Ecological Limits and Diversification Rate: Alternative Paradigms to Explain the Variation in Species Richness among Clades and Regions. Ecol. Lett. 2009;12:735–743. doi: 10.1111/j.1461-0248.2009.01333.x. [DOI] [PubMed] [Google Scholar]
  • 67.Bernstein H., Byerly H.C., Hopf F.A., Michod R.E. Genetic Damage, Mutation, and the Evolution of Sex. Science. 1985;229:1277–1281. doi: 10.1126/science.3898363. [DOI] [PubMed] [Google Scholar]
  • 68.Charlesworth B. Mutation-Selection Balance and the Evolutionary Advantage of Sex and Recombination. Genet. Res. 1990;55:199–221. doi: 10.1017/S0016672300025532. [DOI] [PubMed] [Google Scholar]
  • 69.Alsos I.G., Ehrich D., Eidesen P.B., Solstad H., Westergaard K.B., Schönswetter P., Tribsch A., Birkeland S., Elven R., Brochmann C. Long-Distance Plant Dispersal to North Atlantic Islands: Colonization Routes and Founder Effect. AoB Plants. 2015;7:plv036. doi: 10.1093/aobpla/plv036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Zhang R., Guo Z., Fang L., Zhong C., Duke N.C., Shi S. Population Subdivision Promoted by a Sea-Level-Change-Driven Bottleneck: A Glimpse from the Evolutionary History of the Mangrove Plant Aegiceras corniculatum. Mol. Ecol. 2022;31:780–797. doi: 10.1111/mec.16290. [DOI] [PubMed] [Google Scholar]
  • 71.Major T., Gindele R., Balogh G., Bárdossy P., Bereczky Z. Founder Effects in Hereditary Hemorrhagic Telangiectasia. J. Clin. Med. 2021;10:1682. doi: 10.3390/jcm10081682. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Yang L., Su D., Chang X., Foster C.S.P., Sun L., Huang C.H., Zhou X., Zeng L., Ma H., Zhong B. Phylogenomic Insights into Deep Phylogeny of Angiosperms Based on Broad Nuclear Gene Sampling. Plant Commun. 2020;1:100027. doi: 10.1016/j.xplc.2020.100027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Wang G., Zhang X., Herre E.A., McKey D., Machado C.A., Yu W.B., Cannon C.H., Arnold M.L., Pereira R.A.S., Ming R., et al. Genomic Evidence of Prevalent Hybridization Throughout the Evolutionary History of the Fig-Wasp Pollination Mutualism. Nat. Commun. 2021;12:718. doi: 10.1038/s41467-021-20957-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Li X., Wei G., El-Kassaby Y.A., Fang Y. Hybridization and Introgression in Sympatric and Allopatric Populations of Four Oak Species. BMC Plant Biol. 2021;21:266. doi: 10.1186/s12870-021-03007-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Vargas O.M., Ortiz E.M., Simpson B.B. Conflicting Phylogenomic Signals Reveal a Pattern of Reticulate Evolution in a Recent High-Andean Diversification (Asteraceae: Astereae: Diplostephium) New Phytol. 2017;214:1736–1750. doi: 10.1111/nph.14530. [DOI] [PubMed] [Google Scholar]
  • 76.Wang X., Chen L., Ma J. Genomic Introgression through Interspecific Hybridization Counteracts Genetic Bottleneck During Soybean Domestication. Genome Biol. 2019;20:22. doi: 10.1186/s13059-019-1631-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Kurata S., Sakaguchi S., Kurashima O., Ogawa R., Suyama Y., Nishida S., Ito M. Refugia within Refugium of Geranium yesoense Varieties: A Follow-up Study Using Chloroplast Genome Sequencing Data of Specimens from Mt. Asama, Japan. Biol. J. Linn. Soc. 2024;142:1–7. doi: 10.1093/biolinnean/blad121. [DOI] [Google Scholar]
  • 78.Lange C.B.A., Hauser T.P., Deichmann V., Orgaard M. Hybridization and Complex Evolution of Barbarea vulgaris and Related Species (Brassicaceae) Mol. Phylogenet. Evol. 2022;169:107425. doi: 10.1016/j.ympev.2022.107425. [DOI] [PubMed] [Google Scholar]
  • 79.You J.L., Lougheed S.C., Zhao Y., Zhang G.J., Liu W.S., Lu F., Wang Y.G., Zhang W.J., Yang J., Qiong L., et al. Comparative Phylogeography Study Reveals Introgression and Incomplete Lineage Sorting During Rapid Diversification of Rhodiola. Ann. Bot. 2022;129:185–200. doi: 10.1093/aob/mcab133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Zhang H.X., Zhang X.F., Zhang J. Genetic Divergence and Evolutionary Adaption of Four Wild Almond Species (Prunus Spp. L.) Forests. 2024;15:834. doi: 10.3390/f15050834. [DOI] [Google Scholar]
  • 81.Stubbs R.L., Theodoridis S., Mora-Carrera E., Keller B., Potente G., Yousefi N., Jay P., Léveillé-Bourret É., Choudhury R.R., Celep F., et al. The Genomes of Darwin’s Primroses Reveal Chromosome-Scale Adaptive Introgression and Differential Permeability of Species Boundaries. New Phytol. 2024;241:911–925. doi: 10.1111/nph.19361. [DOI] [PubMed] [Google Scholar]
  • 82.Langmead B., Salzberg S.L. Fast Gapped-Read Alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Bankevich A., Nurk S., Antipov D., Gurevich A.A., Dvorkin M., Kulikov A.S., Lesin V.M., Nikolenko S.I., Pham S., Prjibelski A.D., et al. Spades: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. J. Comput. Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Boetzer M., Pirovano W. Sspace-Longread: Scaffolding Bacterial Draft Genomes Using Long Read Sequence Information. BMC Bioinform. 2014;15:211. doi: 10.1186/1471-2105-15-211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Boetzer M., Pirovano W. Toward Almost Closed Genomes with Gapfiller. Genome Biol. 2012;13:R56. doi: 10.1186/gb-2012-13-6-r56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Xiong Y., Xiong Y., He J., Yu Q., Zhao J., Lei X., Dong Z., Yang J., Peng Y., Zhang X., et al. The Complete Chloroplast Genome of Two Important Annual Clover Species, Trifolium alexandrinum and T. resupinatum: Genome Structure, Comparative Analyses and Phylogenetic Relationships with Relatives in Leguminosae. Plants. 2020;9:478. doi: 10.3390/plants9040478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Potter S.C., Luciani A., Eddy S.R., Park Y., Lopez R., Finn R.D. Hmmer Web Server: 2018 Update. Nucleic Acids Res. 2018;46:W200–W204. doi: 10.1093/nar/gky448. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Laslett D., Canback B. Aragorn, a Program to Detect Trna Genes and Tmrna Genes in Nucleotide Sequences. Nucleic Acids Res. 2004;32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.McGinnis S., Madden T.L. Blast: At the Core of a Powerful and Diverse Set of Sequence Analysis Tools. Nucleic Acids Res. 2004;32:W20–W25. doi: 10.1093/nar/gkh435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Greiner S., Lehwark P., Bock R. Organellargenomedraw (Ogdraw) Version 1.3.1: Expanded Toolkit for the Graphical Visualization of Organellar Genomes. Nucleic Acids Res. 2019;47:W59–W64. doi: 10.1093/nar/gkz238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Amiryousefi A., Hyvönen J., Poczai P. Irscope: An Online Program to Visualize the Junction Sites of Chloroplast Genomes. Bioinformatics. 2018;34:3030–3031. doi: 10.1093/bioinformatics/bty220. [DOI] [PubMed] [Google Scholar]
  • 92.Rozas J., Ferrer-Mata A., Sánchez-DelBarrio J.C., Guirao-Rico S., Librado P., Ramos-Onsins S.E., Sánchez-Gracia A. Dnasp 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol. Biol. Evol. 2017;34:3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
  • 93.Kurtz S., Choudhuri J.V., Ohlebusch E., Schleiermacher C., Stoye J., Giegerich R. Reputer: The Manifold Applications of Repeat Analysis on a Genomic Scale. Nucleic Acids Res. 2001;29:4633–4642. doi: 10.1093/nar/29.22.4633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Beier S., Thiel T., Münch T., Scholz U., Mascher M. Misa-Web: A Web Server for Microsatellite Prediction. Bioinformatics. 2017;33:2583–2585. doi: 10.1093/bioinformatics/btx198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Cock P.J., Antao T., Chang J.T., Chapman B.A., Cox C.J., Dalke A., Friedberg I., Hamelryck T., Kauff F., Wilczynski B., et al. Biopython: Freely Available Python Tools for Computational Molecular Biology and Bioinformatics. Bioinformatics. 2009;25:1422–1423. doi: 10.1093/bioinformatics/btp163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Katoh K., Standley D.M. Mafft Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Zhang Z. Kaks_Calculator 3.0: Calculating Selective Pressure on Coding and Non-Coding Sequences. Genom. Proteom. Bioinform. 2022;20:536–540. doi: 10.1016/j.gpb.2021.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Zeng S., Zhou T., Han K., Yang Y., Zhao J., Liu Z.L. The Complete Chloroplast Genome Sequences of Six Rehmannia Species. Genes. 2017;8:103. doi: 10.3390/genes8030103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Kalyaanamoorthy S., Minh B.Q., Wong T.K.F., von Haeseler A., Jermiin L.S. Modelfinder: Fast Model Selection for Accurate Phylogenetic Estimates. Nat. Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Nguyen L.T., Schmidt H.A., von Haeseler A., Minh B.Q. Iq-Tree: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The original contributions presented in this study are publicly available. These data can be found here: NCBI (GenBank accessions: PP853567-PP853569; OQ473820).


Articles from Genes are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES