Abstract
Symplocarpus renifolius is a member of Araceae family that is extraordinarily diverse in appearance. Previous studies on chloroplast genomes in Araceae were focused on duckweeds (Lemnoideae) and root crops (Colocasia, commonly known as taro). Here, we determined the chloroplast genome of Symplocarpus renifolius and compared the factors, such as genes and inverted repeat (IR) junctions and performed phylogenetic analysis using other Araceae species. The chloroplast genome of S. renifolius is 158,521 bp and includes 113 genes. A comparison among the Araceae chloroplast genomes showed that infA in Lemna, Spirodela, Wolffiella, Wolffia, Dieffenbachia and Colocasia has been lost or has become a pseudogene and has only been retained in Symplocarpus. In the Araceae chloroplast DNA (cpDNA), psbZ is retained. However, psbZ duplication occurred in Wolffia species and tandem repeats were noted around the duplication regions. A comparison of the IR junction in Araceae species revealed the presence of ycf1 and rps15 in the small single copy region, whereas duckweed species contained ycf1 and rps15 in the IR region. The phylogenetic analyses of the chloroplast genomes revealed that Symplocarpus are a basal group and are sister to the other Araceae species. Consequently, infA deletion or pseudogene events in Araceae occurred after the divergence of Symplocarpus and aquatic plants (duckweeds) in Araceae and duplication events of rps15 and ycf1 occurred in the IR region.
Keywords: Araceae, Symplocarpus, infA, psbZ, inverted repeat region, phylogenetic relationship
1. Introduction
The family Araceae consists of approximately 102 genera and 3300 species and is one of most structurally diverse groups of monocots. This family is widely distributed in all the continents, except Antarctica. The Lemnoideae subfamily (duckweeds) within Araceae displays particularly diverse morphological traits, habits and life forms [1,2,3]. The earliest modern classification of Araceae was given in 1860 by Schott [4] and Engler [5,6]. They classified Araceae according to the vegetable morphology and anatomy. Many molecular phylogenetic analyses support this classification. However, the taxonomic relationships among some species have not been resolved, for example, among the members of Aroideae subfamily [2,3,7,8].
The genus Symplocarpus belongs to the subfamily Orontioideae, which also contains the genera Orontium and Lysichiton [9]. Five Symplocarpus species (S. renifolius Schott ex Miq., S. foetidus (L.) Nutt., S. nipponicus Makino, S. nabekuraensis Otsika and K.Inoue and S. egorovii N.S. Pavlova and V.A. Nechaev) are distributed in eastern Asia and eastern North America [9,10,11].
The plant chloroplast (cp) genomes are divided into four major segments. Two of these contain only single copies of genes: the large single-copy (LSC) region and small single-copy (SSC) region. The last two segments are nearly identical inverted copies, termed inverted repeats (IRA and IRB). The cp genomes are approximately 120–170 kb and include 100–120 genes. Most angiosperm cp genomes are highly conserved [12,13]. However, there have been observations of gene loss [14], inversion [15], rearrangement [16], IR loss [17], IR contraction and IR extension [18] in some lineages. In particular, four junctions (LSC/IRB, IRB/SSC, SSC/IRA and IRA/LSC) of cp genome were found at various sites. Wang et al. [18] suggested two possible mechanisms, (i) gene conversion to account for the small IR expansion or movements in most species of the genus; (ii) a DNA double-strand break (DSB) to explain the extensive incorporation of the LSC region. We observed that the IR region has expanded in three species of Lemnoideae.
Previous studies on the cp genomes in the Araceae family have focused on five species. Mardanov et al. [19] studied the cp genome of Duckweed (Lemna minor) and compared it with those of the other angiosperm species. Wang and Messing [20] generated the cp genomes of three other duckweed species (Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana) and Ahmed et al. [21] determined the cp genome sequences for two taro species (subfamily Aroideae) and compared the Aroid cp genomes.
In this study, we determined the complete cp genome sequence of the Asian skunk cabbage Symplocarpus renifolius (subfamily Orontioideae, family Araceae) and compared it with those of other Araceae species. We also analyzed the gene contents (infA and psbZ), IR region extensions (ycf1 and rps15 duplication) and phylogenetic relationships in Araceae.
2. Materials and Methods
2.1. Plants Materials and DNA Sequencing
This study was approved by the Korean National Arboretum (KNA 1-2-13, 14-2). The S. renifolius sample was obtained from a living collection in the greenhouse of Yeungnam University, in Gyeongsan, South Korea. Total DNA was extracted using a DNeasy Plant Mini Kit (Qiagen Inc., Valencia, CA, USA) and was quantified using a HiGen Gel and PCR Purification system (Biofact Inc., Daejeon, Korea). The genomic DNA was sequenced using Illumina Miseq (Illumina Inc., San Diego, CA, USA). A total number of 4,491,905,669 raw reads were obtained from the Illumina sequencing. These raw reads were trimmed and filtered using Genious v.10.1 (Biomatters, Auckland, New Zealand). The filtered 1,516,987 reads were mapped to reference genome, Dieffenbachia sequine (NC_027272). The coverage of S. renifolia was 1002 X. The genome coverage was estimated using the CLC Genomics Workbench, v. 7.0.4 (CLC Bio, Aarchus, Denmark) at LabGenomics Co. (Seongnam, Korea).
2.2. Genome Annotation, Genome Mapping and Sequence Analysis
The complete cp genome sequence was annotated using Dual Organellar GenoMe Annotator (DOGMA) [22] and was compared with the published complete cp genome sequences (Table S1) available for Colocasia esculenta (JN105689), Dieffenbachia seguine (NC_0272725), Spirodela polyrhiza (JN160603), Wolffia australiana (JN160605) and Wolffiella lingulata (JN160604). All the identified tRNA genes were verified using the corresponding structures predicted by tRNAscan-SE [23]. A circular cp genome map was drawn using ORDRAW [24]. To compare the structure and genes present in Araceae family, sequences from the different plants were aligned using MAFFT [25] and Geneious v.6.1.7 bioinformatics software platform (Biomatters).
2.3. Repeat Structure
Tandem repeats (forward, palindrome, reverse and complement repeats) were detected using REPtuer program [26]. The minimum repeat size was set at 30 bp and at a sequence identity greater than 90%. The simple sequence repeats (SSRs) were detected using Phobos v.3.3.12 (http://www.ruhr-uni-bochum.de/ecoevo/cm/com_phobos.htm). We applied a threshold 10 to mononucleotide repeats, five to dinucleotide repeats, four to trinucleotide repeats, three to tetranucleotide repeats and two to penta-, hexa-, hepta- and deca- nucleotide repeats.
2.4. Phylogenetic Analysis
A total of 77 coding genes from 17 species were compiled into a single file of 67,392 bp and aligned using MAFFT [25]. Sixteen species (including six Araceae species) were selected as the in-groups and Amborella trichopoda was included as the out-group (Table S1). Maximum likelihood (ML) analyses were performed using RAxmL v.7.4.2 with 1000 bootstrap replications and using the GTR+I+G model [27]. The best-fit models of substitutions selected by ModelTest 3.7 [28].
3. Results
3.1. Characteristics of the Symplocarpus renifolius and Araceae cp Genomes
The cp genome of S. renifolius (GenBank accession number KY039276) is 158,521 bp in length (Figure 1). It includes two inverted repeat regions (IRA and IRB) of 25,801 bp separated by LSC and SSC regions of 86,620 bp and 20,299 bp, respectively. The overall AT content of the S. renifolius cp genome is 62.7% (Table 1). The S. renifolius cp genome contained 113 genes, of which four were ribosomal RNA (rRNA) genes, 30 were transfer RNA (tRNA) genes, 78 genes were protein coding genes and 17 genes located in the IR regions were duplicated (rpl2, rpl23, trnI-CAU, ycf2, trnL-CAA, ndhB, rps7, rps12, trnV-GAC, rrn16, trnI-GAU, trnA-UGC, rrn23, rrn4.5, rrn5, trnR-ACG and trnN-UGA). Of the 113 single-copy genes, 13 contained a single intron (eight protein-coding genes and five tRNA genes) and three protein-coding genes (ycf3, clpP and rps12) contained two introns.
Figure 1.
Complete chloroplast genome of Symplocarpus renifolius. Genes drawn inside the circle are transcribed clockwise, whereas those outside are transcribed counterclockwise. The gray plot in the inner circle corresponds to the GC content. The colored bars indicate known protein-coding genes, transfer RNA (tRNA) genes and ribosomal RNA (rRNA) genes.
Table 1.
General features of Symplocarpus and comparison to other Araceae species.
| Symplocarpus | Colocasia | Dieffenbachia | Lemna | Spirodela | Wolffiella | Wolffia | |
|---|---|---|---|---|---|---|---|
| Total length | 158,521 bp | 162,424 bp | 163,699 bp | 165,955 bp | 168,788 bp | 169,337 bp | 168,704 bp |
| LSC | 86,620 bp | 86,670 bp | 90,780 bp | 89,906 bp | 91,222 bp | 92,015 bp | 91,454 bp |
| SSC | 20,299 bp | 22,208 bp | 22,440 bp | 13,603 bp | 14,056 bp | 13,956 bp | 13,394 bp |
| IR | 25,801 bp | 25,273 bp | 25,235 bp | 31,223 bp | 31,755 bp | 31,683 bp | 31,930 bp |
| % AT content | 62.7% | 63.8% | 63.6% | 64.3% | 64.3% | 64.2% | 64.1% |
| Genes | |||||||
| Coding genes | 80 | 79 | 79 | 79 | 79 | 79 | 79 |
| tRNA | 30 | 30 | 30 | 30 | 30 | 30 | 30 |
| rRNA | 4 | 4 | 4 | 4 | 4 | 4 | 4 |
| IR duplication genes | 17 | 17 | 17 | 17 | 19 | 19 | 19 |
LSC: Large Single Copy, SSC: Small Single Copy, IR: Inverted Repeat, tRNA: transfer RNA, rRNA: ribosomal RNA.
The seven complete cp genomes of Araceae species (Lemna, Spirodela, Wolffiella, Wolffia, Dieffenbachia, Colocasia and Symplocarpus) consist of a pair of IRs (25,273–31,930 bp) separated by one LSC (86,670–92,015 bp) and one SSC (13,394–22,208 bp) region, each. S. renifolius has one of the smallest cp genomes and has the lowest AT content among the Araceae species (Table 1).
3.2. infA and psbZ Genes in Araceae
In the present study, we compared each individual genes, rearrangement and IR boundaries in the cp genomes of Symplocarpus and other species of Araceae. The genes are conserved, except for infA and psbZ, within Araceae. Previous study suggested that the functional gene, infA is highly variable in Araceae species such as pseudogene or missing gene [21]. However, infA of S. renifolius is an intact gene (Figure 2).
Figure 2.
Sequence alignment of the rpl36-rps8 region (including infA) in seven Araceae species. (A) rpl36-rps8 region; (B) infA sequence and its translation.
psbZ in six Araceae species (Lemna, Spirodela, Dieffenbachia, Wolffiella, Colocasia and Symplocarpus) was located between trnS-UGA and trnG-UCC. However, psbZ of Wolffia had a duplication event because of which two psbZ genes became one pseudogene and one intact gene. The psbZ pseudogene was located between trnS-UGA and trnG-UCC and the intact psbZ was located between ycf3 and trnS-GGA (Figure 3).
Figure 3.
Comparison of psbZ in Araceae species. (A) Sequence of psbC-rps4 region; (B) Location of psbZ between trnS-UGA and trnG-UCC (C) Location of psbZ between ycf3 and trnS-GGA. Arrow indicates the tandem repeats.
3.3. Simple-Sequence Repeats Loci of Araceae cp Genomes
Four classes of tandem repeats (forward repeats, reverse repeats, complement repeats and palindromic repeats) were found in six Araceae species (Lemna, Spirodela, Dieffenbachia, Wolffiella, Colocasia and Symplocarpus). The number of tandem repeats ranged from 24 to 39 in the six Araceae species, with Wolffia showing the highest number of tandem repeats. The tandem repeats in Symplocarpus, Wolffia, Spirodela and Lemna were located in the LSC region more often than in the SSC or IR regions. However, larger number of tandem repeats in Wolffiella and Colocasia were located in the IR region (Figure 4A). Symplocarpus contained five complement repeats, seven forward repeats, nine palindromic repeats and six reverse repeats. For the type of SSRs, forward repeats had the highest numbers in Wolffiella, Spirodela, Lemna and Colocasia (Table S2). The SSRs in Wolffia and Symplocarpus had the highest number of palindromic repeats, especially those in Wolffia, with 29 palindromic repeats (Figure 4B). The tandem repeats ranged from 30 to 1485 bp and the sizes of most of the tandem repeats were between 30 and 40 bp (Figure 4C).
Figure 4.
Analyses of tandem repeated sequences in the Araceae chloroplast genome. (A) Distribution of tandem repeats in Araceae species. F, P, R and C indicate the repeats matching in forward, palindrome, reverse and complement orientation, respectively (B) Type of tandem repeats in Araceae species. (C) Frequency of tandem repeats sequences in Araceae species.
Simple sequence repeats are effective markers for population genetics. A total of 121 SSRs were present in the Symplocarpus cp genome, in addition to 83 mononucleotides, 40 dinucleotides, nine trinucleotides, 13 tetranucleotides, one pentanucleotide and one decanucleotide repeat (Table S3). A total of 85 SSRs were distributed in the LSC region (six in the coding regions, 60 in the noncoding regions and 19 in the introns), whereas the SSC and IR regions had 30 (six in the coding regions, 20 in the noncoding regions and four in the introns) and six (four in non-coding regions and two introns) SSRs, respectively (Figure 5A,B). The total number of SSRs in the genomes of other Araceae species was 66 in Wolffia, 77 in Wolffiella, 85 in Spirodela, 71 in Lemna and 148 in Colocasia (Figure 5A and Table S3). Most of the SSRs were located in the non-coding regions in the LSC region (Figure 5B).
Figure 5.
Analyses of simple-sequence repeats (SSRs) in the Araceae chloroplast genome. (A) Location of SSRs in Araceae species; (B) Distribution and region of SSRs in Araceae species.
3.4. Two Types of Inverted Repeat Regions in Araceae
The LSC/IRB/SSC/IRA boundary regions of the S. renifolius cp genome were compared to those of the other Araceae genomes and two types of such regions were found (Figure 6). Type A was present in S. renifolius, Dieffenbachia and Colocasia and was found at the border between LSC and IRB, between rpl2 and rps19. The IRB and SSC border occurs between trnN-GUU and ndhF. The IRB/SSC border was located between the trnN-GUU and ycf1 and the IRB/LSC border was located between rpl2 and trnH-GUG. The sizes of the IRs in S. renifolius, Dieffenbachia and Colocasia were 25,801, 25,273 and 25,235 bp, respectively (Table 1). Type B occurred in Lemna, Spirodela, Wolffia and Wolffiella. These four plants showed expansion of the IRs, such as duplication of ycf1 and rps15. The IR lengths in Spirodela, Lemna, Wolffiella and Wolffia were longer than in the four Type A species (31,223, 31,755, 31,683 and 31,930 bp, respectively, Table 1). The borders between the LSC/IRB and IRA/LSC were located in the same place as in Type A. However, IRB/SSC and SSC/IRA borders were located between rps15/ndhF and ndhH/rps15, respectively.
Figure 6.
Comparison of LSC, SSC and IR junction positions among Araceae species.
3.5. Phylogenetic Analysis of Araceae
The 77 genes comprising 67,982 bp were used for ML analysis (Figure 7). The ML analysis resulted in a tree with ML values of –lnL = 303,581.50476. Araceae was well supported as monophyletic (100% bootstrap values, BS) and is shown to be a sister family to other monocots with 100% BS. The clades of Symplocarpus (Orontioideae), Colocasia + Dieffenbachia (Aroideae) and Spirodela + Lemna + Wolffiella + Wolffia (Lemnoideae) were supported with 100% bootstrap values, respectively. Lemnoideae (Spirodela, Lemna, Wolffiella and Wolffia) and other Aroideae species (Colocasia and Dieffenbachia) formed after Symplocarpus in the ML tree.
Figure 7.
Phylogenetic tree reconstruction of 17 taxa using maximum likelihood method based on concatenated sequences of 77 protein-coding genes. Bootstrap support values > 50% are given at the nodes.
4. Discussion
4.1. Gene Loss of infA and Gene Duplication of psbZ in Araceae
The translation initiation factor 1, the product of infA, has been lost from many angiosperms and infA genes of some plants were transferred to the nucleus [29,30,31]. Studies on infA in angiosperms have indicated that it has been independently lost multiple times [29]. The data collected by Ahmed et al. [21] indicate that infA is completely missing in duckweed (Spirodela, Lemna, Wolffiella and Wolffia), taro (Colocasia) and Dieffenbachia (NC_27272), with internal stop codons in infA. However, Symplocarpus contains infA (Figure 2). infA was lost from Aroideae (Colocasia and Dieffenbachia) and Lemnoideae (Spirodela, Lemna, Wolffiella and Wolffia) and from the completely sequenced Orontioideae subfamily (Symplocarpus). Consequently, this result suggests the loss or pseudogenization of infA after the divergence of Aroideae and Lemnoideae (Figure 7).
The psb genes (psbA, B, C, D, E, F, H, I, J, K, L, M, N, T and Z) of the cp genomes encode a subunit of photosystems II [13,32]. Swiatek et al. [33] proposed that the psbZ protein controls the interaction of PSII cores with the light-harvesting antenna. In particular, they suggested that the PSII-LHCII super complexes could no longer be isolated from PsbZ-deficient tobacco plants. In addition, Nelson and Yocum [34] reported that the products of psbN and psbZ interact with chlorophyll-bound subunits of psbC that reach into the thylakoid lumen. Previous studies on Cuscuta [35], Aneura [36] and Epifagus [37] showed the loss of some psb genes.
psbZ of Araceae plants is located between trnS-UGA and trnG-UCC in the LSC region (Figure 3). However, psbZ in Wolffia (JN160605) had a duplication event and was identified at two locations in the LSC region (trnS-UGA/trnG-UCC and ycf3/trnS-GGA). Interestingly, seven tandem repeats of Wolffia were located in the trnS-UGA/trnG-UCC and ycf3/trnS-GGA regions (Figure 3, Table S2) and all the tandem repeats were palindromic. The tandem repeats were not found in other Araceae plants, except in Wolffia in the trnS-UGA/trnG-UCC and ycf3/trnS-GGA regions. Our results suggest that the palindromic repeats are effective for psbZ duplication.
4.2. Phylogenetic Analysis and cp Structure in Araceae
The Araceae family is divided into two groups: the Proto-Araceae (Orontioideae) and the Spirodela clade (Figure 8). The Spirodela clade is composed of Lemnoideae (Duckweeds) and the True Araceae, as determined by molecular phylogenetic analysis (rbcL, matK, trnK intron, trnL intron and trnL-trnF IGS) and morphological data analysis [3,38]. Our results support those of previous studies showing that Araceae are a well-defined group with three groups within it (Figure 8A): Orontioideae (Symplocarpus), Aroideae (Colocasia and Dieffenbachia) and Lemnoideae (Spirodela, Lemna, Wolffiella and Wolffia).
Figure 8.
Gene duplication, gene loss and IR expansion in Araceae. (A) Phylogenetic position and gene contents in Araceae species; (B) IR junction expansion and gene duplication in Lemnoideae; (C) IR junction in Aroideae and Orontioideae.
Inverted repeat regions are variable sites and useful features for plant [18,39]. The expansion of the IR occurred within a few angiosperm families and groups, such as Eleagnaceae [31], Geraniaceae [16], Ipomoea [35], Oryza [40] and maize [41].
The species within the Lemnoideae (Spirodela, Lemna, Wolffiella and Wolffia) subfamily are different from the other Araceae species. The Lemnoideae species are miniscule in size and are aquatic monocotyledons. In our results (Figure 8), it was interesting to note that Lemnoideae were an independent clade and had different IR gene contents in their cp genomes. In the four species of Lemnoideae, the IR region was ca. 6 kb larger than that of the other Araceae species (Table 1). In Lemnoideae, the IRB/SSC and SSC/IRA regions have expanded to include ycf1 and rps15 gene duplications (Figure 8B), unlike that in other Araceae species (Figure 8C). This structure supported the theory of independent evolution of the IR regions in the Araceae family.
Acknowledgments
This work was supported by grants from Scientific Research (KNA1-2-13, 14-2) of the Korea National Arboretum.
Supplementary Materials
The following are available online at www.mdpi.com/2073-4425/8/11/324/s1. Supplementary Table S1. Taxa used in the phylogenetic analysis and GenBank accession numbers of the references, Supplementary Table S2. Tandem repeat sequence of Araceae, Supplementary Table S3. Simple sequence repeats of Araceae.
Author Contributions
S.P., K.S.C. and K.T.P. conceived and designed the experiments; K.S.C. and K.T.P. performed the experiments; K.S.C. analyzed the data; K.S.C. wrote the paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Grayum M.H. Evolution and phylogeny of the Araceae. Ann. Mo. Bot. Gard. 1990;77:628–697. doi: 10.2307/2399668. [DOI] [Google Scholar]
- 2.Les D.H., Crawford D.J., Landolt E., Gabel J.D., Kimball R.T. Phylogeny and systematics of Lemnaceae, the duckweed family. Syst. Bot. 2002;27:221–240. [Google Scholar]
- 3.Cusimano N., Bogner J., Mayo S.J., Boyce P.C., Wong S.Y., Hesse M., Hetterscheid W.L., Keating R.C., French J.C. Relationships within the Araceae: Comparison of morphological patterns with molecular phylogenies. Am. J. Bot. 2011;98:654–668. doi: 10.3732/ajb.1000158. [DOI] [PubMed] [Google Scholar]
- 4.Schott H.W. Prodromus Systematis Aroidearum. Congretationis Mechitharisticae; Vienna, Austria: 1860. [Google Scholar]
- 5.Engler A. Nova Acta Academiae Caesareae Leopino-Carolinae Germanicae Naturae Curiosorum. Volume 39. E. Blochmann & Sohn; Dresden, Germany: 1876. Vergleichende Untersuchungen über die morphologischen Verhältnisse der Araceae. I. Natürliches System der Araceae; pp. 133–155. [Google Scholar]
- 6.Engler A. Araceae-Pars generalis et index familiae generalis. In: Engler A., editor. Das Pflanzenreich. Volume 74. Verlag von Wilhelm Engelmann; Leipzig, Germany: 1920. pp. 1–71. (IV.23A) [Google Scholar]
- 7.Barabé D., Lacroix C., Bruneau A., Forest F., Lacroix C. The correlation between development of atypical bisexual flowers and phylogeny in the Aroideae (Araceae) Plant Syst. Evol. 2002;232:1–19. doi: 10.1007/s006060200023. [DOI] [Google Scholar]
- 8.Rothwell G.W., Van Atta M.R., Ballard H.W., Jr., Stockey R.A. Molecular phylogenetic relationships among Lemnaceae and Araceae using the chloroplast trnL-trnF spacer. Mol. Phylogenet. Evol. 2004;30:378–385. doi: 10.1016/S1055-7903(03)00205-7. [DOI] [PubMed] [Google Scholar]
- 9.Mayo S.J., Bogner J., Bouce P.C. The Genera of Araceae. Royal Botanic Gardens, Kew; London, UK: 1997. [Google Scholar]
- 10.Wen J., Jansen R.K., Kilgore K. Evolution of the eastern Asian and eastern North American disjunct genus Symplocarpus (Araceae): Insights from chloroplast DNA restriction site data. Biochem. Syst. Ecol. 1996;24:735–747. doi: 10.1016/S0305-1978(96)00070-1. [DOI] [Google Scholar]
- 11.Nie Z.-L., Sun H., Li H., Wen J. Intercontinental biogeography of subfamily Orontioideae (Symplocarpus, Lysichiton and Orontium) of Araceae in eastern Asia and North America. Mol. Phylogenet. Evol. 2006;40:155–165. doi: 10.1016/j.ympev.2006.03.012. [DOI] [PubMed] [Google Scholar]
- 12.Palmer J.D. Contrasting modes and tempos of genome evolution in land plant organelles. Trends Genet. 1990;6:115–120. doi: 10.1016/0168-9525(90)90125-P. [DOI] [PubMed] [Google Scholar]
- 13.Wicke S., Schneeweiss G.M., de Pamphilis C.W., Müller K.F., Quandt D. The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Mol. Biol. 2011;76:273–297. doi: 10.1007/s11103-011-9762-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Jansen R.K., Cai Z., Raubeson L.A., Daniell H., dePamphilis C.W., Leebens-Mack J., Müller K.F., Gusinger-Bellian M., Haberte R.C., Hansen A.K., et al. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proc. Natl. Acad. Sci. USA. 2007;104:19369–19374. doi: 10.1073/pnas.0709121104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Jansen R.K., Palmer J.D. A chloroplast DNA inversion markers and ancient evolutionary split in the sunflower family (Asteraceae) Proc. Natl. Acad. Sci. USA. 1987;84:5818–5822. doi: 10.1073/pnas.84.16.5818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Chumley T.W., Palmer J.D., Mower J.P., Fourcade H.M., Calie P.J., Boore J.L., Jansen R.K. The complete chloroplast genome sequence of Pelargonium X hortorum: Organization and evolution of the largest and most highly rearranged chloroplast genome of land plants. Mol. Biol. Evol. 2006;23:2175–2190. doi: 10.1093/molbev/msl089. [DOI] [PubMed] [Google Scholar]
- 17.Palmer J.D., Osorio B., Aldrich J., Thompson W.F. Chloroplast DNA evolution among legumes: Loss of a large inverted repeat occurred prior to other sequence rearrangements. Curr. Genet. 1987;11:275–286. doi: 10.1007/BF00355401. [DOI] [Google Scholar]
- 18.Wang R.J., Cheng C.L., Chang C.C., Wu C.L., Su T.M., Chaw S.M. Dynamics and evolution of the inverted repeat-large single copy junctions in the chloroplast genomes of monocots. BMC Evol. Biol. 2008;8:36. doi: 10.1186/1471-2148-8-36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mardanov A.V., Ranin N.V., Kuznestsov B.B., Samigullin T.H., Antonov A.S., Kolganova T.V., Skyabin K.G. Complete sequence of the Duckweed (Lemna minor) chloroplast genome: Structural organization and phylogenetic relationships to other Angiosperms. J. Mol. Evol. 2008;66:555–564. doi: 10.1007/s00239-008-9091-7. [DOI] [PubMed] [Google Scholar]
- 20.Wang W., Messing J. High-throughput sequencing of three Lemnoideae (Duckweeds) chloroplast genomes from total DNA. PLoS ONE. 2011;6:e24670. doi: 10.1371/journal.pone.0024670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ahmed I., Biggs P.G., Mattews P.J., Collins L.J., Hendy M.D., Lockhart P.J. Mutational dynamics of Aroid chloroplast genomes. Genome Biol. Evol. 2012;4:1316–1323. doi: 10.1093/gbe/evs110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Wyman S.K., Jansen R.K., Boore J.L. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3252–3255. doi: 10.1093/bioinformatics/bth352. [DOI] [PubMed] [Google Scholar]
- 23.Schattner P., Brooks A.N., Lowe T.M. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005;33:W686–W689. doi: 10.1093/nar/gki366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lohse M., Drechsel O., Bock R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 2009;25:1451–1452. doi: 10.1007/s00294-007-0161-y. [DOI] [PubMed] [Google Scholar]
- 25.Katoh K., Misawa K., Kuma K., Miyata T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30:3059–3066. doi: 10.1093/nar/gkf436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kurtz S., Schleiermacher C. REPuter: Fast computation of maximal repeats in complete genomes. Bioinformatics. 1999;15:426–427. doi: 10.1093/bioinformatics/15.5.426. [DOI] [PubMed] [Google Scholar]
- 27.Stamatakis A. RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]
- 28.Posada D., Crandall K.A. MODELTEST: Testing the model of DNA substitution. Bioinformatics. 1998;14:817–818. doi: 10.1093/bioinformatics/14.9.817. [DOI] [PubMed] [Google Scholar]
- 29.Millen R.S., Olmstead R.G., Adams K.L., Palmer J.D., Lao N.T., Heggie L., Kavanagh T.A., Hibberd J.M., Grey J.C., Morden C.W., et al. Many parallel losses of infA from chloroplast DNA during angiosperm evolution with multiple independent transfers to the nucleus. Plant Cell. 2001;13:645–658. doi: 10.1105/tpc.13.3.645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Park S., Jansen R.K., Park S. Complete plastome sequence of Thalictrum coreanum (Ranunculaceae) and transfer of the rpl32 gene to the nucleus in the ancestor of the subfamily Thalictroideae. BMC Plant Biol. 2015;15:40. doi: 10.1186/s12870-015-0432-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Choi K.S., Son O.G., Park S.J. The chloroplast genome of Elaeagnus macrophylla and trnH duplication event in Elaeagnaceae. PLoS ONE. 2015;10:e0138727. doi: 10.1371/journal.pone.0138727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ruhlman T.A., Jansen R.K. The plastid genomes of flowering plants. In: Maliga P., editor. Chloroplast Biotechnology: Methods and Protocols. Volume 1132. Springer Science and Business Media, LLC; New York, NY, USA: 2014. pp. 3–38. [DOI] [PubMed] [Google Scholar]
- 33.Swiatek M., Kuras R., Sokolenko A., Higgs D., Olive J., Cinque G., Müller B., Eichacker L.A., Stem D.B., Bassi R., et al. The chloroplast gene ycf9 encodes a photosystem II (PSII) core subunit, PsbZ, that participates in PSII supramolecular architecture. Plant Cell. 2001;13:1347–1367. doi: 10.1105/tpc.13.6.1347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Nelson N., Yocum C.F. Structure and function of photosystems I and II. Annu. Rev. Plant Biol. 2006;57:521–565. doi: 10.1146/annurev.arplant.57.032905.105350. [DOI] [PubMed] [Google Scholar]
- 35.McNeal J.R., Kuehl J., Boore J., Leebens-Mack J.H., de Pamphilis C.W. Complete plastid genome sequences suggest strong selection for retention of photosynthetic genes in the parasitic plant genus Cuscuta. BMC Plant Biol. 2007;7:57. doi: 10.1186/1471-2229-7-57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Wickett N.J., Fan Y., Lewis P., Goffinet B. Distribution and evolution of pseudogenes, gene losses and a gene rearrangement in the plastid genome of the nonphotosynthetic liverwort, Aneura mirabilis (Metzgeriales, Jungermanniopsida) J. Mol. Evol. 2008;67:111–122. doi: 10.1007/s00239-008-9133-1. [DOI] [PubMed] [Google Scholar]
- 37.Wolfe K.H., Morden C.W., Palmer J.D. Function and evolution of a minimal plastid genome from a nonphotosynthetic parasitic plant. Proc. Natl. Acad. Sci. USA. 1992;89:10648–10652. doi: 10.1073/pnas.89.22.10648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cabrera L.I., Salazar G.A., Chase M.W., Mayo S.J., Bogner J., Dávila P. Phylogenetic relationships of aroids and duckweeds (Araceae) inferred from coding and noncoding plastid DNA. Am. J. Bot. 2008;95:1153–1165. doi: 10.3732/ajb.0800073. [DOI] [PubMed] [Google Scholar]
- 39.Plunkett G.M., Dawnie S.R. Expansion and contraction of the chloroplast inverted repeat in Apiaceae subfamily Apioideae. Syst. Bot. 2000;25:648–667. doi: 10.2307/2666726. [DOI] [Google Scholar]
- 40.Hiratsuka J., Shimada H., Whittier R., Ishibashi T., Sakamoto M., Mori M., Kondo C., Honji Y., Sun C.R., Meng B.Y., et al. The complete sequence of the rice (Oryza sativa) chloroplast genome: Intermolecular recombination between distint tRNA genes accounts for a major plastid DNA inversion during the evolution of the cereals. Mol. Gen. Genet. 1989;217:185–194. doi: 10.1007/BF02464880. [DOI] [PubMed] [Google Scholar]
- 41.Maier R.M., Neckermann K., Igloi G.L., Kössel H. Complete sequence of the Maize chloroplast genome: Gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. J. Mol. Biol. 1995;241:614–628. doi: 10.1006/jmbi.1995.0460. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








