Skip to main content
BioMed Research International logoLink to BioMed Research International
. 2020 May 5;2020:8731857. doi: 10.1155/2020/8731857

Complete Chloroplast Genomes of 14 Mangroves: Phylogenetic and Comparative Genomic Analyses

Chengcheng Shi 1,2, Kai Han 2, Liangwei Li 2, Inge Seim 3,4, Simon Ming-Yuen Lee 5, Xun Xu 2,6,7, Huanming Yang 1,6, Guangyi Fan 2,5,6,, Xin Liu 2,6,7,8,
PMCID: PMC7225854  PMID: 32462024

Abstract

Mangroves are a group of plant species that occupy the coastal intertidal zone and are major components of this ecologically important ecosystem. Mangroves belong to about twenty diverse families. Here, we sequenced and assembled chloroplast genomes of 14 mangrove species from eight families spanning five rosid orders and one asterid order: Fabales (Pongamia pinnata), Lamiales (Avicennia marina), Malpighiales (Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, and Ceriops tagal), Malvales (Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea), Myrtales (Laguncularia racemosa, Sonneratia ovata, and Pemphis acidula), and Sapindales (Xylocarpus moluccensis). These chloroplast genomes range from 149 kb to 168 kb in length. A conserved structure of two inverted repeats (IRa and IRb, ~25.8 kb), one large single-copy region (LSC, ~89.0 kb), and one short single-copy region (SSC, ~18.9 kb) as well as ~130 genes (85 protein-coding, 37 tRNAs, and 8 rRNAs) was observed. We found the lowest divergence in the IR regions among the four regions. We also identified simple sequence repeats (SSRs), which were found to be variable in numbers. Most chloroplast genes are highly conserved, with only four genes under positive selection or relaxed pressure. Combined with publicly available chloroplast genomes, we carried out phylogenetic analysis and confirmed the previously reported phylogeny within rosids, including the positioning of obscure families in Malpighiales. Our study reports 14 mangrove chloroplast genomes and illustrates their genome features and evolution.

1. Introduction

Mangroves grow on the intertidal zone of the ocean, the transition zone connecting the land and ocean. Mangrove ecosystems provide essential habitats for marine creatures and benthic organisms and play important roles in regulating energy cycle and maintaining biodiversity [1, 2]. According to their habitats, root morphology, and salt metabolism patterns, mangroves are generally categorized into true mangroves and mangrove associates (or semi-mangroves) [3]. The true mangroves exclusively live in mangrove ecosystems and usually have distinct marine environment adaptations, including the ability to grow in seawater, complex root structures (allowing enhanced nutrient absorption and respiratory metabolism), and viviparous reproduction (seeds germinating on trees) [4]. Semi-mangroves are amphibious, and many can inhabit both terrestrial and aquatic environments (for instance, Pongamia pinnata (L.) Pierre). In mangrove ecosystems, they may grow at the edge of the true mangroves and are often dominant species on degraded beaches.

There are more than 80 mangrove species, covering approximately twenty families [5, 6]. Due to their ecological importance, wide distribution, and unique biological features for adaptation, the genome features and genome evolution of these species would be of considerable interest yet remain largely unexplored. As an essential organelle of plants, the chloroplast has an independent genome with stable sequence structure and a relatively conserved number of genes associated with energy production and metabolism. Chloroplast genes such as rbcL and psbA were once evidenced to be resultful in inferring the evolutionary origins and phylogenetic relationship of mangroves species from different clades or geographical regions [79]. DNA barcodes of rbcL, matK, and trnH-psbA genes have also been used to identify unknown mangrove species [6]. However, whole chloroplast genomes of mangrove species were limited until now [10]. Detailed whole chloroplast genome comparison and phylogenetic analysis has to date been lacking. In order to acquire more mangrove genetic resources and determine the evolutionary location of mangroves in rosids, we sequenced and assembled the complete chloroplast genomes of 14 mangrove species, including Pongamia pinnata (L.) Pierre, Avicennia marina (Forssk.) Vierh., Excoecaria agallocha L., Bruguiera sexangula (Lour.) Poir., Kandelia obovata Sheue, Liu & Yong, Rhizophora stylosa Griff., Ceriops tagal (Perr.) C.B.Rob., Hibiscus tiliaceus L., Heritiera littoralis Dryand., Thespesia populnea (L.) Sol. ex Correa, Laguncularia racemosa (L.) C.F. Gaertn., Sonneratia ovata Backer, Pemphis acidula Forst., and Xylocarpus moluccensis (Lamk.) Roem.. They represent mangroves of eight families, five rosid orders, and one asterid order (as an outgroup for the phylogenetic analysis). We examined their genome structures and gene contents. Comparative genomics and molecular evolution analyses were performed to illustrate mangrove chloroplast genome features further and reveal relationships among mangrove species.

2. Materials and Methods

2.1. Sequencing, Chloroplast Genome Assembly, and Annotation

Fresh leaves of mangroves were provided by collaborators in Guangzhou, China. DNA were extracted according to a CTAB method and then sequenced on a BGISEQ-500 platform. After sequencing, we randomly extracted five million pair-end reads. We used the MITObim v1.9 [11] for the initial assembly, following a closest reference-based strategy. The size of each chloroplast genome was estimated by SPAdes v3.13.0 [12]. With the initial assembly and the estimated genome size, we applied NOVOPlasty v2.7.2 [13] to assemble the complete chloroplast genome. Finally, we carried out manual curation to obtain circular sequences.

Chloroplast genes (including protein coding genes, rRNA genes, and tRNA genes) were predicted and annotated by GeSeq [14] with the MPI-MP chloroplast reference option. The identity cutoffs for protein and rRNA searching were set as 60 and 85, respectively. ARAGORN v1.2.38 [15] was used to annotate tRNAs. Genes were visualized using OGDRAW [16]. IR (inverted repeat) boundaries were identified by chloroplast genome self-alignment using BLAST v2.2.6 [17] (-p blastn -m 8 -F F -e 1). Regions aligned reversely and of the same length were manually curated as inverted repeat regions. Simple sequence repeats (SSRs) with 1-6 bp repeat units were detected using MISA v2.0 [18]. The minimum repeat times were set to be 10 for mononucleotides, 5 for dinucleotide, 4 for trinucleotide, 3 for tetranucleotide, 3 for pentanucleotide, and 3 for hexanucleotide.

2.2. Phylogenetic Analysis

A total of 44 conserved genes (atpA, atpB, atpE, atpH, atpI, ccsA, cemA, matK, ndhA, ndhC, ndhG, ndhI, ndhJ, petA, petN, psaA, psaB, psaC, psaJ, psbA, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbT, rbcL, rpoC1, rpoC2, rpl14, rpl2, rpoA, rpoB, rps11, rps14, rps15, rps19, rps2, rps3, rps4, rps8, and ycf3) found in all 71 plant chloroplast genomes were used to construct robust phylogenetic trees (species listed in Table S2). The coding sequences were aligned by MAFFT v7.407 [19] with the “--auto --adjustdirection” setting. Based on the global alignments, the phylogenetic trees of the 71 representative species were constructed using several methods. Both partitioned and nonpartitioned strategies were implemented with different phylogenetic inference tools based on the concatenated aligned sequences from the 44 conserved genes, including (1) a BI tree constructed using MrBayes v3.2.7 [20] with a single priori GTR+GAMMA model, (2) a ML tree constructed using RAxML v8.2.12 [21] with a GTRGAMMA model, (3) a BI tree constructed using MrBayes with the best partition schemed models estimated by PartitionFinder v2.1.1 [22], and (4) a ML tree constructed using RAxML with the best partition schemed models estimated by PartitionFinder, as well as two trees using methods in (1) and (2) with four commonly used genes (ndhF, matK, rbcL, and atpB). For the Bayesian inference, MCMC analysis was run for 1,000,000 generations with four chains and sampling every 1,000 generations. The first 25% trees were discarded, and the final consensus tree was summarized using the remaining trees. For ML trees, the bootstrap number was set to 1,000. The trees were assessed using CONSEL v1.20 [23].

2.3. Ka/Ks Calculation

The nonsynonymous (Ka) and synonymous (Ks) substitution ratios (Ka/Ks) of genes in 14 mangroves, as well as in species from Lamiales, Fabales, Malpighiales, Malvales, Myrtales, Sapindales, Oxalidales, Celastrales, Fagales, Cucurbitales, Rosales, Brassicales, Huerteales, Geraniales, and Saxifragales (Table S2), were calculated. For all the species in an order, we selected one species outside the order to be used for comparison and Ka/Ks calculation. As these orders cover Asterid I, Rosid I, and Rosid II of core eudicots, we further chose one species from orders Asterales (Helianthus divaricatus: NC_023109.1), Zygophyllales (Larrea tridentata: NC_028023.1), Geraniales (Hypseocharis bilobata: NC_023260.1), and Vitales (Vitis rotundifolia: NC_023790.1) as outgroups of Asterid I, Rosid I, Rosid II, and the remaining rosid species. The two Lamiales species from asterids were compared to Helianthus divaricatus (NC_023109.1). Pairwise alignments were processed using MAFFT v7.407 [19], and Ka/Ks values were calculated using the KaKs Calculator [24]. Genes with low Ks values (cutoff 0.1, determined by considering the Ks distribution) were excluded as genes with unreliable omega values.

2.4. Synteny and Divergence Analyses

Genomic comparison and similarity calculations were performed using mVISTA [25]. The most closely related species used for synteny and divergence comparison for 14 mangroves were Euphorbia tirucalli (NC_042193.1), Erythroxylum novogranatense (NC_030601.1), Wisteria floribunda (NC_027677.1), Gossypium lobatum (NC_039569.1), Hibiscus rosa-sinensis (NC_042239.1), Heritiera angustata (NC_037784.1), Xylocarpus granatum (NC_039925.1), Trapa natans (NC_042895.1), Punica granatum (NC_035240.1), Lumnitzera littorea (NC_039752.1), and Aphelandra knappiae (NC_041424.1). For comparison within orders, the whole chloroplast genome sequences of Lamiales (Sesamum indicum: NC_016433.2, Lindenbergia philippensis: NC_022859.1, Ajuga reptans: NC_023102.1, Hesperelaea palmeri: NC_025787.1, Scrophularia takesimensis: NC_026202.1, Tanaecium tetragonolobum: NC_027955.1, Erythranthe lutea: NC_030212.1, Paulownia coreana: NC_031435.1, Haberlea rhodopensis: NC_031852.1, Aloysia citrodora: NC_034695.1, Echinacanthus lofouensis: NC_035876.1, and one mangrove Avicennia marina), Malpighiales (Byrsonima coccolobifolia: NC_037191.1, Erythroxylum novogranatense: NC_030601.1, Garcinia mangostana: NC_036341.1, Hirtella racemosa: NC_024060.1, Ricinus communis: NC_016736.1, Salix interior: NC_024681.1, Viola seoulensis: NC_026986.1, and mangroves Bruguiera sexangula, Ceriops tagal, Excoecaria agallocha, Kandelia obovata, and Rhizophora stylosa), Myrtales (Allomaieta villosa: NC_031875.1, Eucalyptus obliqua: NC_022378.1, Lagerstroemia fauriei: NC_029808.1, and mangroves Laguncularia racemosa, Pemphis acidula, and Sonneratia ovata), Sapindales (Azadirachta indica: NC_023792.1, Boswellia sacra: NC_029420.1, Citrus aurantiifolia: NC_024929.1, Leitneria floridana: NC_030482.1, Sapindus mukorossi: NC_025554.1, Spondias bahiensis: NC_030526.1, and mangrove Xylocarpus moluccensis), and Malvales species (Gossypium arboreum: NC_016712.1, Daphne kiusiana: NC_035896.1, and mangroves Hibiscus tiliaceus and Thespesia populnea) were aligned with MAFFT (v7.407) [19], and single nucleotide polymorphisms (SNPs) and insertion and deletions (InDels) were identified and counted in 200 bp windows with an in-house python script.

3. Results and Discussions

3.1. Chloroplast Genome Features

Using a reference genome-based strategy (see Materials and Methods), a total of 483 Mb chloroplast data were generated for 14 mangrove species in six orders: Fabales (Pongamia pinnata), Lamiales (Avicennia marina), Malpighiales (Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, and Ceriops tagal), Malvales (Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea), Myrtales (Laguncularia racemosa, Sonneratia ovata, and Pemphis acidula), and Sapindales (Xylocarpus moluccensis). The coverage of chloroplasts ranges from 28X to 526X (Table 1), which might be related with different chloroplast DNA content in the total DNA. The chloroplast genomes were assembled into single circular sequences, ranging from 149 kb (Pongamia pinnata) to 168 kb (Kandelia obovata) (Table 1 and Figure S1). We observed typical quadripartite structures in these chloroplast genomes, with two inverted repeat regions (IRa and IRb), a short single-copy (SSC) region, and a long single-copy (LSC) region. The lengths of these four regions are similar among species from the same order but slightly different between orders. The average lengths of IRs of mangrove species in Fabales, Lamiales, Malpighiales, Malvales, Myrtales, and Sapindales are approximately 23.6 kb, 25.6 kb, 26.3 kb, 26.0 kb, 25.3 kb, and 27.0 kb, respectively. The sizes of the SSC range from 17.9 kb to 20.0 kb and the LSC range from 83 kb to 91 kb (Table 1). We found a GC content between 35% and 39% in the 14 chloroplast genomes. The GC content of IR regions (~43%) is higher than those of the SSC (30%) and LSC (34%) regions (Table S1).

Table 1.

Chloroplast genome features of the 14 mangroves species.

Species Family Extracted chloroplast data (Mb) Mean coverage (×) Total length (bp) IR length (bp) SSC length (bp) LSC length (bp)
Fabales
Pongamia pinnata Fabaceae 4.30 28.72 149,635 23,653 18,534 83,795
Lamiales
Avicennia marina Acanthaceae 32.54 213.66 152,288 25,638 17,924 83,088
Malpighiales
Excoecaria agallocha Euphorbiaceae 11.82 73.10 161,667 26,525 19,336 89,282
Bruguiera sexangula Rhizophoraceae 32.86 202.52 162,282 26,403 18,144 91,332
Kandelia obovata Rhizophoraceae 88.42 526.82 168,008 26,320 19,955 95,413
Rhizophora stylosa Rhizophoraceae 71.24 436.76 163,101 26,325 19,243 91,208
Ceriops tagal Rhizophoraceae 42.89 260.80 164,476 26,313 19,153 92,697
Malvales
Hibiscus tiliaceus Malvaceae 18.75 116.25 161,318 26,159 19,717 89,283
Heritiera littoralis Malvaceae 16.40 102.87 159,401 26,261 19,002 87,877
Thespesia populnea Malvaceae 6.42 40.04 160,451 25,582 20,306 88,981
Myrtales
Pemphis acidula Lythraceae 35.84 223.93 160,051 25,695 18,886 89,775
Sonneratia ovata Lythraceae 68.08 444.79 153,057 23,906 18,007 87,238
Laguncularia racemosa Combretaceae 27.43 170.77 160,672 26,353 18,886 89,071
Sapindales
Xylocarpus moluccensis Meliaceae 25.98 163.08 159,317 27,000 17,998 87,319

The number of chloroplast genes is usually conserved [26], with subtle differences between different species [27, 28]. The mangrove chloroplasts contain ~85 protein-coding genes, ~37 tRNA genes, and eight rRNA genes (Table 2). The gene components of photosystem I (five genes), cytochrome b/f complex (six genes), ATP synthase (six genes), NADH dehydrogenase (12 genes), Rubisco large subunit (rbcL), RNA polymerase (four genes), assembly/stability of photosystem I (ycf3 and ycf4), RNA processing (matK), chloroplast envelope membrane protein (cemA), cytochrome c synthesis (ccsA), ATP-dependent protease (clpP), fatty acid biosynthetic (accD), and proteasome subunit beta type-1 (pbf1) are the same in all the mangrove chloroplasts. We only found infA, a chloroplast genome translation initiation factor gene, in Avicennia marina and Heritiera littoralis (Table 2). This agrees with the fact that infA is commonly lost in angiosperms, especially in rosid species [29].

Table 2.

Gene content of the 14 mangrove chloroplast genomes.

Pongamia pinnata Avicennia marina Excoecaria agallocha Bruguiera sexangula Kandelia obovata Rhizophora stylosa Hibiscus tiliaceus Heritiera littoralis Thespesia populnea Ceriops tagal Laguncularia racemosa Sonneratia ovata Pemphis acidula Xylocarpus moluccensis
Total gene number 127 133 132 128 129 129 130 129 129 130 130 130 131 131
Transfer RNAs (tRNA) 35 37 37 37 37 37 37 36 37 37 37 37 37 37
Ribosomal RNAs (rRNAs) 8 8 8 8 8 8 8 8 8 8 8 8 8 8
Coding genes 84 88 87 83 84 84 85 85 84 85 85 85 86 86
 Photosystems I 5 5 5 5 5 5 5 5 5 5 5 5 5 5
 Photosystems II 15 14 14 14 14 14 14 14 14 14 14 14 14 14
 Cytochrome b/f complex 6 6 6 6 6 6 6 6 6 6 6 6 6 6
 ATP synthase 6 6 6 6 6 6 6 6 6 6 6 6 6 6
 NADH dehydrogenase 12 12 12 12 12 12 12 12 12 12 12 12 12 12
 Rubisco large subunit 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 RNA polymerase 4 4 4 4 4 4 4 4 4 4 4 4 4 4
 Ribosomal proteins (small subunit) 14 14 14 13 13 13 14 14 14 14 14 14 15 15
 Ribosomal proteins (large subunit) 10 11 11 10 11 11 11 11 11 11 11 11 11 11
 Assembly/stability of photosystem I 2 2 2 2 2 2 2 2 2 2 2 2 2 2
 RNA processing 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 Chloroplast envelope membrane protein 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 Cytochrome c synthesis 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 Proteins of unknown function 3 4 4 4 4 4 4 3 3 4 4 4 4 4
 ATP-dependent protease 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 Fatty acid biosynthetic 1 1 1 1 1 1 1 1 1 1 1 1 1 1
 Translation initiation factor 0 1 0 0 0 0 0 1 0 0 0 0 0 0
 Putative uncharacterized protein 0 2 2 0 0 0 0 0 0 0 0 0 0 0
 Proteasome subunit beta type-1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

3.2. Simple Sequence Repeat Content

Simple sequence repeats (SSRs) are tandem repeats (1~6 bp units repeated multiple times) in the genome which have been widely applied as markers for population studies and crop improvements [3033]. In this study, we detected and compared SSRs in the mangrove and 57 terrestrial plant chloroplast genomes (Table S2). The SSR contents are highly variable among species (Figure S2). Of the 14 mangroves, Kandelia obovate has the highest number of SSRs (194), while Avicennia marina has the fewest (61) (Table 3). Comparing between orders, the Malpighiales (number of SSRs ranges from 133 to 194; Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Ceriops tagal, and Rhizophora stylosa) has more SSRs than species of orders Malvales (number of SSRs ranges from 80 to 110; Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea) and Myrtales (number of SSRs ranges from 88 to 118; Laguncularia racemosa, Sonneratia ovata, and Pemphis acidula) (Table 3). Assessing SSR categories in the 14 mangrove chloroplast genomes, we found that the mononucleotide type accounted for at least half of the total SSRs (in Laguncularia racemosa and Avicennia marina up to 80%). A/T tandem repeats are most frequent, followed by dinucleotide, tetranucleotide, trinucleotide, pentanucleotide, and hexanucleotide repeats. Similar patterns of SSR variability and constitution were also observed in the 57 terrestrial plant chloroplasts (Figure S2). We propose that the SSRs identified here can serve as useful genetic resources for future population and evolution studies.

Table 3.

Number of SSRs in the 14 mangrove chloroplast genomes. Mononucleotide, dinucleotide, trinucleotide, tetranucleotide, pentanucleotide, and hexanucleotide repeat units are abbreviated as u1 to u6.

Species Total u1 u1 : A/T u1 : C/G u2 u3 u4 u5 u6
Pongamia pinnata 130 84 82 2 31 6 7 1 1
Avicennia marina 61 49 45 4 1 4 7 0 0
Excoecaria agallocha 133 92 87 5 19 7 13 1 1
Bruguiera sexangula 175 106 103 3 31 14 18 5 1
Kandelia obovata 194 106 106 0 36 20 20 10 2
Rhizophora stylosa 169 115 111 4 22 13 8 9 2
Ceriops tagal 142 78 75 3 16 20 23 4 1
Hibiscus tiliaceus 86 55 53 2 15 4 10 2 0
Heritiera littoralis 110 76 74 2 7 6 13 6 2
Thespesia populnea 80 47 44 3 17 3 7 5 1
Sonneratia ovata 112 77 72 5 15 5 10 4 1
Pemphis acidula 88 64 64 0 7 7 9 1 0
Laguncularia racemosa 118 98 96 2 6 5 8 1 0
Xylocarpus moluccensis 98 72 71 1 6 6 11 3 0

3.3. Phylogenetic Relationships of Mangroves

Similar to mitochondrial genomes used in vertebrate genetics, chloroplast genomes are widely used to settle phylogenetic and evolutionary disputes [28]. In order to reveal the phylogenetic relationships of mangroves, we constructed phylogenetic trees from chloroplast genomes from the 14 mangroves species and 57 terrestrial plant families in 16 rosid orders and one asterid order (a data set of 71 plant species) (Table S2). Based on these complete chloroplast genomes, we identified 44 highly conserved genes in the 71 species and constructed three Bayesian inference (BI) trees and three maximum-likelihood (ML) trees (see Materials and Methods; Figure 1 and Figures S3-S7). Three trees constructed using BI and ML strategies exhibited the same topology (Figure 1, Figure S3, Figure S5, and Table S3). Thus, we have produced a well-supported phylogenetic tree of mangrove and terrestrial plants.

Figure 1.

Figure 1

The Bayesian-inferred phylogenetic tree based on all chloroplast genes of 14 mangroves and 57 land plant species. Mangroves are indicated in red, and the background colors highlight the orders of mangroves and their sister species. Posterior probabilities are indicated near the branches.

Based on our phylogenetic tree, we found species from the same order to be in one group, and Rosid I (Malpighiales, Oxalidales, Celastrales, Fagales, Cucurbitales, Fabales, Rosales, and Zygophyllales), Rosid II (Malvales, Brassicales, Huerteales, Sapindales, Myrtales, and Geraniales), and a clade of rosids (Saxifragales and Vitales) were classified. For mangrove species, Avicennia marina is close to the other asterid terrestrial plant Echinacanthus lofouensis. Using these two species as outgroups, we obtained a clear phylogenetic relationship of the rest 13 rosid mangrove species. Myrtales is close to Geraniales in this tree and contains five families, of which Myrtaceae and Melastomataceae are in one clade, while Onagraceae and Lythraceae (including two mangroves Sonneratia ovata and Pemphis acidula) are in another clade. The relationships indicated here are consistent with a reported ML tree of Myrtales species [34]. Furthermore, we found that Combretaceae (Laguncularia racemosa) is a separate node close to Onagraceae and Lythraceae, supported by 1.00 posterior probability. For Sapindales, there is one mangrove Xylocarpus moluccensis, a member of Meliaceae whose position coincides with a previous study [35]. For Malvales, three mangroves (Hibiscus tiliaceus, Heritiera littoralis, and Thespesia populnea) together with Huerteales and Brassicales species are clustered as neighboring orders. The relationship of genera within the family of Malvaceae is in agreement with trees in Aquilaria sinensis [36] and Heritiera angustata [37] chloroplast studies, and we further confirmed that the semi-mangrove Thespesia populnea is close to Gossypium species. However, for Malpighiales, an order of high morphological and ecological diversities, the phylogenetic relationship of different families especially Linaceae was less resolved. Other than the grouping of families Linaceae and Euphorbiaceae, our phylogenetic tree is consistent with a previous study which employed 82 plastid genes of 58 species from Malpighiales [38]. We found that Euphorbiaceae constitutes a single branch, while Rhizophoraceae (including four mangroves Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, and Ceriops tagal) is a neighboring branch to Erythroxylaceae and Clusiaceae. Also, Linaceae forms a sister lineage with Chrysobalanaceae and Malpighiaceae. Again, these relationships are largely accordant with a study of Linum plastome [39]. Finally, our phylogenetic tree supports a sister relationship between the mangrove Pongamia pinnata with the other orders of Rosales, Cucurbitales, and Fagales.

3.4. Synteny and Divergence of the Chloroplast Genomes

We next analyzed the synteny and divergence between the mangrove and related chloroplast genomes. For mangrove species, the genomes have a conserved gene order similar to sister clades, except Heritiera littoralis which probably had been subjected to segmental rearrangements (Figures 2 and 3). Compared to its closely related species Hibiscus tiliaceus and Thespesia populnea, we found a notable rearrangement at position 8,109 bp to 33,498 bp in the Heritiera littoralis chloroplast genome. This region encodes 16 genes, including trnC-GCA, petN, psbM, trnD-GUC, trnY-GUA, trnE-UUC, rpoB, rpoC1, rpoC2, rps2, atpI, atpH, atpF, atpA, trnR-UCU, and trnS-CGA. Assessing the genetic divergence of mangrove chloroplast genomes by the most closely related species and within the orders (see Materials and Methods), we found the lowest divergence in the genera Heritiera and Xylocarpus (Figure 3). Compared to Heritiera and Xylocarpus, there is a relatively higher divergence between Hibiscus rosa-sinensis and Hibiscus tiliaceus, reflecting a higher level of genetic polymorphism of chloroplast genomes within genus Hibiscus. We also observed a more distinct divergence between species from one family/order in most other comparisons (Euphorbiaceae: Euphorbia tirucalli vs. Excoecaria agallocha, Malpighiales: Erythroxylum novogranatense vs. Kandelia obovata/Ceriops tagal/Rhizophora stylosa/Bruguiera sexangula, Fabaceae: Wisteria floribunda vs. Pongamia pinnata, Malvaceae: Gossypium lobatum vs. Thespesia populnea, Lythraceae: Trapa natans vs. Sonneratia ovata, Lythraceae: Punica granatum vs. Pemphis acidula, Combretaceae: Lumnitzera littorea vs. Laguncularia racemosa, Acanthaceae: Aphelandra knappiae vs. Avicennia marina) (Figure 3). Furthermore, according to the whole genomic comparison among multiple chloroplasts within orders, we found that the similarity of coding regions is generally higher than that of the intergenic regions, and the tRNA and rRNA genes are almost identical in all species (Figure S8). For the four regions, variations in SSC and LSC regions are more frequent comparing to the IR regions, indicating the IRs to be more conserved than the single-copy sequences (Figure 4). This is consistent with reports on other plant chloroplasts [4042].

Figure 2.

Figure 2

Synteny between the chloroplast genomes. The species are shown in the phylogenetic tree with red fonts to indicate the mangrove species, and the synteny is shown by linking the homologous genes. Colors at the bottom indicate genes with different functions.

Figure 3.

Figure 3

Pairwise comparisons of mangrove species and their most closely related species. Whole chloroplast genome comparisons are shown on the left, and the corresponding divergences are shown on the right.

Figure 4.

Figure 4

Divergence within orders of Malpighiales, Myrtales, Malvales, Lamiales, and Sapindales. The IR regions are shown in grey background, and the average divergences of each region are indicated above the lines in the corresponding regions. Two bars at the bottom show the SNPs and InDels in both the gene regions (red) and the intergenic regions (blue).

3.5. Genes under Selective Pressures

Genes in the chloroplasts are functionally important and might have been under selection during evolution. To analyze the selective pressures in mangrove chloroplast genomes, we calculated the nonsynonymous substitutions and synonymous substitution ratio (Ka/Ks) of coding genes in 14 mangrove species and 57 terrestrial plants (see Materials and Methods). Genes with Ka/Ks values above 1.0 should be under positive selection and might be candidate genes responsible for functional adaptations, while genes with values lower than 1.0 should be under negative (purifying) selection [43]. We found the Ka/Ks values of most gene pairs (within and between orders) to be lower than 1.0 (Figure 5), reflecting selection pressures to maintain the gene functions. For instance, genes involved in photosystems (psaA, psaB, psaC, psbA, psbC, psbD, psbE, psbI, and psbM), the cytochrome b/f complex (petB, petD, and petG), and some ATP synthases (atpB and atpH) in all species have Ka/Ks values close to 0. Genes encoding other ATP synthases (atpA, atpE, and atpF), NADH dehydrogenases (ndhA, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, and ndhK), ribosomal proteins (rps2, rps3, rps4, rps8, rps11, rps14, rps15, and rps19), and RNA polymerases (rpoA, rpoB, rpoC1, and rpoC2) also have low Ka/Ks values (mostly between 0 and 0.5). Similar to other plants [39, 44], chloroplast genes involved in photosynthesis and energy metabolism are conserved (very low Ka/Ks ratios) in mangroves.

Figure 5.

Figure 5

Ka/Ks values of protein-coding genes in chloroplasts. Ka/Ks values (horizontal axis) of genes in the mangrove species and their related species in six orders (Lamiales, Fabales, Malpighiales, Malvales, Myrtales, and Sapindales) are indicated. Red dots indicate Ka/Ks values of genes in mangrove species, while blue dots indicate Ka/Ks values in terrestrial species.

We further investigated the genes under positive selection. We found that the Ka/Ks values of four genes (petL, psaI, rpl36, and ycf1) were greater than 1.0 in the mangrove species Excoecaria agallocha, Laguncularia racemosa, Pemphis acidula, and Sonneratia ovata. These genes are from four different functional groups, including subunits of cytochrome (petL), subunits of photosystems (psaI), subunits of ribosomes (rpl36), and unclassified genes (ycf1). The gene petL is a component of the cytochrome b6/f complex required for photosynthesis. The Ka/Ks of petL is ~1.5 in mangrove species Sonneratia ovata and Pemphis acidula, and positive selection on this gene was found only in these two species (Figure 5 and Figures S9 and S10). For psaI, a member of photosystem I (PSI), we observed a Ka/Ks value of 1.22 in Laguncularia racemosa (family Combretaceae, order Myrtales), suggesting potential positive selection on this gene in this mangrove. Among all species, Ka/Ks values of psaI range widely (from 0.2 to 1.5, especially in Malpighiales, Myrtales, and Sapindales), which may reflect the different selection pressures and adaptations in the diverse clades examined. Although a report on tobacco showed a role for psaI in stabilizing PSI during leaf senescence [45], the function of the protein remains unknown in most plant species. For rpl36 (LSU), the Ka/Ks values are greater than 1.0 in Pemphis acidula (1.41) and Sonneratia ovata (1.04). The loss of rpl36 might result in severe morphological aberrations, low translational efficiency, and poor photoautotrophic growth [46]. We speculate that relaxed selection on this gene might be associated with adaptations of plants to highly diverse environments. Gene ycf1 is one of the largest genes in the chloroplast genome, and there are usually two [39, 41] or one functional gene copies (i.e., one copy has become a pseudogene) [36, 44, 47, 48]. In this study, one functional copy and one fragment of the ycf1 gene were annotated in 11 mangroves (Avicennia marina, Xylocarpus moluccensis, Hibiscus tiliaceus, Excoecaria agallocha, Bruguiera sexangula, Kandelia obovata, Rhizophora stylosa, Ceriops tagal, Sonneratia ovata, Pemphis acidula, and Laguncularia racemosa). Furthermore, we found that ycf1 genes in many species from Rosid I (such as Malpighiales including mangrove Excoecaria agallocha) have Ka/Ks values around or higher than 1.0, while ycf1 genes in species from Rosid II plants have lower Ka/Ks values (~0.4 in Malvales, Myrtales, Sapindales, Huerteales, and Brassicales). Together with the fact that the ycf1 gene showed relatively lower sequence similarity among different species (Figure S8), we found different selection pressures on this gene in Rosid I and Rosid II here (Figure 5 and Figures S9 and S10). The observed potential positive selection on petL, psaI, and rpl36 in mangroves (especially in the three mangroves Laguncularia racemosa, Pemphis acidula, and Sonneratia ovata of order Myrtales) and a relaxed selection pressure on ycf1 possibly reflect the functional importance of those genes during adaptation.

4. Conclusions

Our study reports 14 complete mangrove chloroplast genomes, as well as a comprehensive comparative chloroplast genome analysis of mangrove and related plant species. The sequenced mangroves span six orders (five rosids and one asterid), making it the first large-scale study on mangrove chloroplast genomes. We found that mangrove chloroplast genomes are similar in structure and gene content. Notable exceptions include the retainment of the translation initiation factor gene infA in two mangrove species (the asterid Avicennia marina and the rosid Heritiera littoralis) and an inversion in the LSC region of mangrove Heritiera littoralis. We used our new mangrove genomes to create a well-supported phylogeny. Protein-coding genes of mangroves were found to be under pressure to maintain gene function, with only a small number of genes in a handful of species showing evidence of positive or relaxed selection. In conclusion, we report 14 complete chloroplast genomes from diverse mangrove species and analyzed their phylogeny and genome features. This study provides a useful resource for future studies on the evolution of mangroves and environmental adaptation.

Acknowledgments

This study was supported by the National Key Research and Development Program of China (No. 2016YFE0122000). We sincerely thank Professor Suhua Shi and Dr. Sen Li from Sun Yat-sen University for their help in providing samples.

Contributor Information

Guangyi Fan, Email: fanguangyi@genomics.cn.

Xin Liu, Email: liuxin@genomics.cn.

Data Availability

The 14 assembled chloroplast genome sequences along with the annotation can be found in CNSA (https://db.cngb.org/cnsa/) of CNGBdb under the accession number CNP0000567.

Disclosure

This manuscript has been released as a preprint in bioRxiv [49].

Conflicts of Interest

The authors declare no conflict of interest.

Authors' Contributions

Guangyi Fan, Xin Liu, Huanming Yang, and Simon Ming-Yuen Lee conceived this project. Guangyi Fan, Xin Liu, and Xun Xu oversaw execution of this study. Chengcheng Shi, Kai Han, and Liangwei Li performed the data analysis. Chengcheng Shi, Inge Seim, and Xin Liu wrote the manuscript. Chengcheng Shi and Kai Han contributed equally to this work.

Supplementary Materials

Supplementary Materials

Figure S1: the whole chloroplast genomes of 14 mangroves. The inner circle marks the LSC, SSC, and IR regions. Genes' position and orientations are shown along the outer circle. Genes with different functions are colored. Figure S2: the SSR distribution in the 14 mangroves and 57 terrestrial plant chloroplast genomes. (A) Number of different SSR types in each species. Mangroves are marked in red. (B) The SSR numbers in 17 orders (dots for each species and bars for average number). Figure S3: the ML phylogenetic tree based on whole chloroplast genes of 14 mangroves and 57 land species. Figure S4: the BI phylogenetic tree based on whole chloroplast genes of 14 mangroves and 57 land species using partition model. Figure S5: the ML phylogenetic tree based on whole chloroplast genes of 14 mangroves and 57 land species using partition model. Figure S6: the BI phylogenetic tree based on four conserved genes (ndhF, matK, rbcL, and atpB) of 14 mangroves and 57 land species. Figure S7: the ML phylogenetic tree based on four conserved genes (ndhF, matK, rbcL, and atpB) of 14 mangroves and 57 land species. Figure S8: the genomic comparison and similarities of whole chloroplast sequences among mangroves and their related species within orders Lamiales, Fabales, Malpighiales, Malvales, Myrtales, and Sapindales. CNS: conserved noncoding sequence. Figure S9: the Ka/Ks values of chloroplast protein-coding genes in species from Oxalidales, Celastrales, Fagales, Cucurbitales, and Rosales (Rosid I clades). Figure S10: the Ka/Ks values of chloroplast protein-coding genes in species from Brassicales (Rosid II), Huerteales (Rosid II), Geraniales, and Saxifragales. Table S1: GC content in the 14 mangrove chloroplast genomes. Table S2: the 57 public terrestrial species used for genomic comparison analyses (SSR, phylogeny, and evolution). Table S3: the assessment and comparison of phylogenetic trees.

References

  • 1.Serafy J. E., Shideler G. S., Araújo R. J., Nagelkerken I. Mangroves enhance reef fish abundance at the Caribbean regional scale. PLoS One. 2015;10(11) doi: 10.1371/journal.pone.0142022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Mumby P. J., Edwards A. J., Arias-González J. E., et al. Mangroves enhance the biomass of coral reef fish communities in the Caribbean. Nature. 2004;427(6974):533–536. doi: 10.1038/nature02286. [DOI] [PubMed] [Google Scholar]
  • 3.Tomlinson P. B. The Botany of Mangroves. Cambridge University Press; 1986. [Google Scholar]
  • 4.Ball M. C. Ecophysiology of mangroves. Trees. 1988;2(3):129–142. doi: 10.1007/bf00196018. [DOI] [Google Scholar]
  • 5.Polidoro B. A., Carpenter K. E., Collins L., et al. The loss of species: mangrove extinction risk and geographic areas of global concern. PLoS One. 2010;5(4) doi: 10.1371/journal.pone.0010095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wu F., Li M., Liao B., Shi X., Xu Y. DNA barcoding analysis and phylogenetic relation of mangroves in Guangdong Province China. Forests. 2019;10(1):p. 56. doi: 10.3390/f10010056. [DOI] [Google Scholar]
  • 7.Shi S., Huang Y., Zeng K., et al. Molecular phylogenetic analysis of mangroves: independent evolutionary origins of vivipary and salt secretion. Molecular Phylogenetics and Evolution. 2005;34(1):159–166. doi: 10.1016/j.ympev.2004.09.002. [DOI] [PubMed] [Google Scholar]
  • 8.Li X., Duke N. C., Yang Y., et al. Re-evaluation of phylogenetic relationships among species of the mangrove genus Avicennia from Indo-West Pacific based on multilocus analyses. PLoS One. 2016;11(10, article e0164453) doi: 10.1371/journal.pone.0164453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lo E. Y. Y., Duke N. C., Sun M. Phylogeographic pattern of Rhizophora (Rhizophoraceae) reveals the importance of both vicariance and long-distance oceanic dispersal to modern mangrove distribution. BMC Evolutionary Biology. 2014;14(1):p. 83. doi: 10.1186/1471-2148-14-83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Yang Y., Zhang Y., Chen Y., et al. Complete chloroplast genome sequence of the mangrove speciesKandelia obovata and comparative analyses with related species. PeerJ. 2019;7:e7713–e7713. doi: 10.7717/peerj.7713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hahn C., Bachmann L., Chevreux B. Reconstructing mitochondrial genomes directly from genomic next-generation sequencing reads--a baiting and iterative mapping approach. Nucleic Acids Research. 2013;41(13) doi: 10.1093/nar/gkt371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bankevich A., Nurk S., Antipov D., et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology. 2012;19(5):455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Dierckxsens N., Mardulyn P., Smits G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Research. 2017;45(4) doi: 10.1093/nar/gkw955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Tillich M., Lehwark P., Pellizzer T., et al. GeSeq - versatile and accurate annotation of organelle genomes. Nucleic Acids Research. 2017;45(W1):W6–W11. doi: 10.1093/nar/gkx391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Laslett D., Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Research. 2004;32(1):11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Greiner S., Lehwark P., Bock R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Research. 2019;47(W1):W59–W64. doi: 10.1093/nar/gkz238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Camacho C., Coulouris G., Avagyan V., et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10(1):421–421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Beier S., et al. Thiel T., Münch T., Scholz U., Mascher M. MISA-web: a web server for microsatellite prediction. Bioinformatics. 2017;33(16):2583–2585. doi: 10.1093/bioinformatics/btx198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Katoh K., Standley D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution. 2013;30(4):772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Ronquist F., Teslenko M., van der Mark P., et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Systematic Biology. 2012;61(3):539–542. doi: 10.1093/sysbio/sys029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lanfear R., Frandsen P. B., Wright A. M., Senfeld T., Calcott B. PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Molecular Biology and Evolution. 2017;34(3):772–773. doi: 10.1093/molbev/msw260. [DOI] [PubMed] [Google Scholar]
  • 23.Shimodaira H., Hasegawa M. CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics. 2001;17(12):1246–1247. doi: 10.1093/bioinformatics/17.12.1246. [DOI] [PubMed] [Google Scholar]
  • 24.Wang D., Zhang Y., Zhang Z., Zhu J., Yu J. KaKs_Calculator 2.0: a toolkit incorporating gamma-series methods and sliding window strategies. Genomics, Proteomics & Bioinformatics. 2010;8(1):77–80. doi: 10.1016/S1672-0229(10)60008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Frazer K. A., Pachter L., Poliakov A., Rubin E. M., Dubchak I. VISTA: computational tools for comparative genomics. Nucleic Acids Research. 2004;32:W273–W279. doi: 10.1093/nar/gkh458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tiller N., Bock R. The translational apparatus of plastids and its role in plant development. Molecular Plant. 2014;7(7):1105–1120. doi: 10.1093/mp/ssu022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Jansen R. K., Cai Z., Raubeson L. A., et al. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proceedings of the National Academy of Sciences of the United States of America. 2007;104(49):19369–19374. doi: 10.1073/pnas.0709121104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Daniell H., Lin C. S., Yu M., Chang W. J. Chloroplast genomes: diversity, evolution, and applications in genetic engineering. Genome Biology. 2016;17(1):p. 134. doi: 10.1186/s13059-016-1004-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Millen R. S., Olmstead R. G., Adams K. L., et al. Many parallel losses of infA from chloroplast DNA during angiosperm evolution with multiple independent transfers to the nucleus. The Plant Cell. 2001;13(3):645–658. doi: 10.1105/tpc.13.3.645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gandhi S. G., Awasthi P., Bedi Y. S. Analysis of SSR dynamics in chloroplast genomes of Brassicaceae family. Bioinformation. 2010;5(1):16–20. doi: 10.6026/97320630005016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Song S.-L., Lim P.-E., Phang S.-M., Lee W.-W., Hong D., Prathep A. Development of chloroplast simple sequence repeats (cpSSRs) for the intraspecific study of Gracilaria tenuistipitata (Gracilariales, Rhodophyta) from different populations. BMC Research Notes. 2014;7(1):p. 77. doi: 10.1186/1756-0500-7-77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Diekmann K., Hodkinson T. R., Barth S. New chloroplast microsatellite markers suitable for assessing genetic diversity of Lolium perenne and other related grass species. Annals of Botany. 2012;110(6):1327–1339. doi: 10.1093/aob/mcs044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wheeler G. L., Dorman H. E., Buchanan A., Challagundla L., Wallace L. E. A review of the prevalence, utility, and caveats of using chloroplast simple sequence repeats for studies of plant biology. Applications in Plant Sciences. 2014;2(12) doi: 10.3732/apps.1400059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Berger B. A., Kriebel R., Spalink D., Sytsma K. J. Divergence times, historical biogeography, and shifts in speciation rates of Myrtales. Molecular Phylogenetics and Evolution. 2016;95:116–136. doi: 10.1016/j.ympev.2015.10.001. [DOI] [PubMed] [Google Scholar]
  • 35.Lin N., Moore M. J., Deng T., et al. Complete plastome sequencing from Toona (Meliaceae) and phylogenomic analyses within Sapindales. Applications in Plant Sciences. 2018;6(4):e1040–e1040. doi: 10.1002/aps3.1040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Wang Y., Zhan D.-F., Jia X., et al. Complete chloroplast genome sequence of Aquilaria sinensis (Lour.) Gilg and evolution analysis within the Malvales order. Frontiers in Plant Science. 2016;7 doi: 10.3389/fpls.2016.00280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Zhao K. K., Wang J. H., Cai Y.-C., Zhu Z.-X., López-Pujol J., Wang H.-F. Complete chloroplast genome sequence of Heritiera angustata(Malvaceae): an endangered plant species. Mitochondrial DNA Part B. 2018;3(1):141–142. doi: 10.1080/23802359.2017.1422398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Xi Z., Ruhfel B. R., Schaefer H., et al. Phylogenomics and a posteriori data partitioning resolve the Cretaceous angiosperm radiation Malpighiales. Proceedings of the National Academy of Sciences of the United States of America. 2012;109(43):17519–17524. doi: 10.1073/pnas.1205818109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.de Santana Lopes A., Pacheco T. G., Santos K. G. D., et al. The Linum usitatissimum L. plastome reveals atypical structural evolution, new editing sites, and the phylogenetic position of Linaceae within Malpighiales. Plant Cell Reports. 2018;37(2):307–328. doi: 10.1007/s00299-017-2231-z. [DOI] [PubMed] [Google Scholar]
  • 40.Du Y.-p., Bi Y., Yang F.-p., et al. Complete chloroplast genome sequences of Lilium : insights into evolutionary dynamics and phylogenetic analyses. Scientific Reports. 2017;7(1):p. 5751. doi: 10.1038/s41598-017-06210-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wang X., Zhou T., Bai G., Zhao Y. Complete chloroplast genome sequence of Fagopyrum dibotrys: genome features, comparative analysis and phylogenetic relationships. Scientific Reports. 2018;8(1):p. 12379. doi: 10.1038/s41598-018-30398-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Jeon J. H., Kim S.-C. Comparative analysis of the complete chloroplast genome sequences of three closely related East-Asian wild roses (Rosa sect. Synstylae; Rosaceae) Genes. 2019;10(1):p. 23. doi: 10.3390/genes10010023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kimura M. The neutral theory of molecular evolution and the world view of the neutralists. Genome. 1989;31(1):24–31. doi: 10.1139/g89-009. [DOI] [PubMed] [Google Scholar]
  • 44.Menezes A. P. A., Resende-Moreira L. C., Buzatti R. S. O., et al. Chloroplast genomes of Byrsonima species (Malpighiaceae): comparative analysis and screening of high divergence sequences. Scientific Reports. 2018;8(1):p. 2210. doi: 10.1038/s41598-018-20189-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Schöttler M. A., Thiele W., Belkius K., et al. The plastid-encoded PsaI subunit stabilizes photosystem I during leaf senescence in tobacco. Journal of Experimental Botany. 2017;68(5):1137–1155. doi: 10.1093/jxb/erx009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Fleischmann T. T., Scharff L. B., Alkatib S., Hasdorf S., Schöttler M. A., Bock R. Nonessential plastid-encoded ribosomal proteins in tobacco: a developmental role for plastid translation and implications for reductive genome evolution. Plant Cell. 2011;23(9):3137–3155. doi: 10.1105/tpc.111.088906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Ivanova Z., Sablok G., Daskalova E., et al. Chloroplast genome analysis of resurrection tertiary relict Haberlea rhodopensis highlights genes important for desiccation stress response. Frontiers in Plant Science. 2017;8:p. 204. doi: 10.3389/fpls.2017.00204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Wang W., Yu H., Wang J., et al. The complete chloroplast genome sequences of the medicinal plant Forsythia suspensa (Oleaceae) International Journal of Molecular Sciences. 2017;18(11):p. 2288. doi: 10.3390/ijms18112288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Shi C., Han K., Li L., et al. Complete chloroplast genomes of 14 mangroves: phylogenetic and genomic comparative analyses. bioRxiv. 2019 doi: 10.1101/787556. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials

Figure S1: the whole chloroplast genomes of 14 mangroves. The inner circle marks the LSC, SSC, and IR regions. Genes' position and orientations are shown along the outer circle. Genes with different functions are colored. Figure S2: the SSR distribution in the 14 mangroves and 57 terrestrial plant chloroplast genomes. (A) Number of different SSR types in each species. Mangroves are marked in red. (B) The SSR numbers in 17 orders (dots for each species and bars for average number). Figure S3: the ML phylogenetic tree based on whole chloroplast genes of 14 mangroves and 57 land species. Figure S4: the BI phylogenetic tree based on whole chloroplast genes of 14 mangroves and 57 land species using partition model. Figure S5: the ML phylogenetic tree based on whole chloroplast genes of 14 mangroves and 57 land species using partition model. Figure S6: the BI phylogenetic tree based on four conserved genes (ndhF, matK, rbcL, and atpB) of 14 mangroves and 57 land species. Figure S7: the ML phylogenetic tree based on four conserved genes (ndhF, matK, rbcL, and atpB) of 14 mangroves and 57 land species. Figure S8: the genomic comparison and similarities of whole chloroplast sequences among mangroves and their related species within orders Lamiales, Fabales, Malpighiales, Malvales, Myrtales, and Sapindales. CNS: conserved noncoding sequence. Figure S9: the Ka/Ks values of chloroplast protein-coding genes in species from Oxalidales, Celastrales, Fagales, Cucurbitales, and Rosales (Rosid I clades). Figure S10: the Ka/Ks values of chloroplast protein-coding genes in species from Brassicales (Rosid II), Huerteales (Rosid II), Geraniales, and Saxifragales. Table S1: GC content in the 14 mangrove chloroplast genomes. Table S2: the 57 public terrestrial species used for genomic comparison analyses (SSR, phylogeny, and evolution). Table S3: the assessment and comparison of phylogenetic trees.

Data Availability Statement

The 14 assembled chloroplast genome sequences along with the annotation can be found in CNSA (https://db.cngb.org/cnsa/) of CNGBdb under the accession number CNP0000567.


Articles from BioMed Research International are provided here courtesy of Wiley

RESOURCES