Abstract
Plastid genomes (plastomes) represent rich sources of information for phylogenomics, from higher-level studies to below the species level. The genus Rhus (sumac) has received a significant amount of study from phylogenetic and biogeographic perspectives, but genomic studies in this genus are lacking. Rhus integrifolia and R. ovata are two shrubby species of high ecological importance in the southwestern USA and Mexico, where they occupy coastal scrub and chaparral habitats. They hybridize frequently, representing a fascinating system in which to investigate the opposing effects of hybridization and divergent selection, yet are poorly characterized from a genomic perspective. In this study, complete plastid genomes were sequenced for one accession of R. integrifolia and one each of R. ovata from California and Arizona. Sequence variation among these three accessions was characterized, and PCR primers potentially useful in phylogeographic studies were designed. Phylogenomic analyses were conducted based on a robustly supported phylogenetic framework based on 52 complete plastomes across the order Sapindales. Repeat content, rather than the size of the inverted repeat, had a stronger relative association with total plastome length across Sapindales when analyzed with phylogenetic least squares regression. Variation at the inverted repeat boundary within Rhus was striking, resulting in major shifts and independent gene losses. Specifically, rps19 was lost independently in the R. integrifolia-ovata complex and in R. chinensis, with a further loss of rps22 and a major contraction of the inverted repeat in two accessions of the latter. Rhus represents a promising novel system to study plastome structural variation of photosynthetic angiosperms at and below the species level.
Keywords: Sumac, Chloroplast genome, Phylogeny, SNP, Anacardiaceae, Sapindales
Introduction
The plastid genomes (plastomes) of green plants have been used as phylogenetic tools and studied from an evolutionary standpoint for decades (Soltis, Soltis & Doyle, 1992; Chase et al., 1993; Moore et al., 2007; Ruhfel et al., 2014; Gitzendanner et al., 2018; Li et al., 2019a). Across angiosperms, these genomes are thought to be highly conserved in structure, with two single copy regions (the large single copy, or LSC; and small single copy region, or SSC) and a large inverted repeat with two copies (IRb and IRa) (Palmer & Stein, 1986; Shinozaki et al., 1986; Sugiura, 1992). High throughput sequencing technologies have spurred an explosion of plastid genome sequencing, which is changing the view that these genomes are evolutionarily static, instead revealing the dynamic nature of their evolution (Wicke et al., 2011; Chaw & Jansen, 2018). This increase in plastome sequencing allows investigation of evolutionary dynamics at increasingly finer-scale taxonomic levels, for example, at or below the species level (Barrett, Wicke & Sass, 2018).
The angiosperm order Sapindales Juss. ex Bercht. & J. Presl is composed of nine families and approximately 6,570 species, including many important crops (e.g., citrus, cashew, mango) and numerous ecologically and economically important trees and shrubs. Within the family Anacardiaceae R. Br. (with ~700 accepted species) the genus Rhus L. (sumacs, with ~35 species) has received significant attention in phylogenetic and biogeographic studies (Miller, Young & Wen, 2001; Yi, Miller & Wen, 2004, 2007; Andrés-Hernández & Terrazas, 2009; Andrés-Hernández et al., 2014). Despite these previous studies, Rhus are poorly studied from a genomic perspective. To date, plastid genomes are available for only two representatives of Rhus: R. chinensis Mill., and R. typhina L. (Lee et al., 2016; Kim et al., 2017; Zuo et al., 2019). Interestingly, plastomes of the Asian R. chinensis range in size from 149,011–158,809 bp, while the North American R. typhina has a plastome of 160,204 bp, revealing striking variation in plastome size within Rhus. This genus, though largely uninvestigated from a genomic perspective, holds the potential as a powerful comparative model for the investigation of plastid genome evolutionary dynamics at and below the species level.
Rhus integrifolia (Nutt.) Benth. & Hook. f. ex W.H. Brewer & S. Watson and R. ovata S. Watson are two shrubby tree species native to the southwestern USA, Baja California, and outlying islands (Barkley, 1937; Young, 1972, 1974). These two species are ecologically important components of coastal scrub and chaparral habitats, respectively. Both form ligno-tuberous root crowns as an adaptation to frequent fires in their native habitat (Sawyer, Keeler-Wolf & Evens, 2009; Montalvo, Riordan & Beyers, 2017). Further, both serve as important ecological resources, providing habitat for wildlife and protecting against erosion (Horton, 1949). The fruits of both species were used as food and a source of tea for indigenous peoples (hence the names “lemonadeberry” for R. integrifolia and “sugarbush” for R. ovata) (Timbrook, 2007). Further, both are of great horticultural interest as native ornamental shrubs and pollinator resources and are high-value structural entities used in ecological restoration projects (Montalvo, Riordan & Beyers, 2017).
Within Rhus, R. integrifolia and R. ovata are sister species (Miller, Young & Wen, 2001; Yi, Miller & Wen, 2007) and are known to hybridize frequently in sympatry (Fig. 1), especially in areas of mountainous shoreline where lower-montane chaparral and low-lying coastal scrub habitats abruptly meet (Young, 1974). Both are gynodioecious, yielding hermaphroditic and male-sterile individuals (Young, 1972), yet they widely introgress as evidenced by intermediate morphologies in sympatry. Furthermore, populations of R. ovata from central Arizonan and marine Californian chaparral habitats are separated from each other by the Sonoran and Mojave Deserts, which sets up a disjunct distribution of the species (Young, 1974). The extent and frequency of hybrid introgression, patterns of genomic and morphological variation, and ecological niche space of these two species and their hybrid introgressants remain poorly studied despite their ecological and anthropogenic importance. Thus, there is need for future phylogeographic study in this complex, especially in the context of the disjunct distribution of R. ovata.
Figure 1. Leaf exemplars of the R. integrifolia-ovata complex.
(A) Rhus integrifolia (Los Angeles County, California, USA). (B) Rhus ovata (Gila County, Arizona, USA). (C) Putative hybrid-introgressant between R. integrifolia and R. ovata showing intermediate leaf morphology. Note the flat, wavy leaf folding and toothed leaf margins. (D) Shrubby habit of Rhus ovata (Gila County, Arizona, USA). Photo credits: Craig F. Barrett.
In this study, complete plastid genomes for three accessions from the R. integrifolia-ovata complex were sequenced and annotated in order to generate genomic resources for phylogeographic study and to quantify the degree of plastid genomic variation among them. Further, a phylogenomic dataset was constructed and analyzed, consisting of 52 complete, annotated plastid genomes of order Sapindales, emphasizing the family Anacardiaceae and genus Rhus. Analysis of these plastomes revealed the extent of sequence variation within the R. integrifolia-ovata complex, support for phylogenetic relationships among major clades of Sapindales, dynamic structural changes at the IR-LSC boundary, and independent instances of gene loss within the genus Rhus.
Materials and Methods
Sampling, DNA extraction, Illumina sequencing, and read quality control
Voucher specimens were collected for one individual of Rhus integrifolia (accession CFB 320c CA), and two individuals of R. ovata (accessions 290b CA and 371a AZ; Table 1). Permission to collect samples was provided by California State Parks (DRP065), and the USDA Forest Service (FS-2400-8). Because R. integrifolia and R. ovata are known to introgress frequently in California, individuals were chosen from localities showing no visible evidence of intermediate morphologies. DNA was extracted from leaf material using a CTAB extraction protocol (Doyle & Doyle, 1987), modified to 1/5 volume. Total genomic DNA was visualized on a 1% agarose gel stained with ApexSafe loading Dye (Genesee Scientific, San Diego, CA, USA), and quality/quantity of total DNA was verified via NanoDrop spectrophotometry (ThermoFisher, Waltham, MA, USA) and Qubit fluorometry (dsDNA Broad Range Assay, ThermoFisher, Waltham, MA, USA). Library preparation (Nextera DNA Flex Library Prep Kit) and Illumina sequencing on a NextSeq500 were conducted at Global Biologics, LLC (Columbia, MO, USA). Accessions were pooled at equimolar concentrations and sequenced with nine samples from another project. Paired-end reads (2 × 150 bp) were trimmed from 3′ and 5′ ends via a 3-bp sliding window specifying a minimum PHRED score of 20, and Illumina adapters were removed with Trimmomatic v.0.32 (Bolger, Lohse & Usadel, 2014). After processing, a total of 21,723,472 paired-end reads remained for R. ovata accession 371a AZ, 10,979,554 reads remained for R. ovata accession 320c CA, and 19,342,934 reads remained for R. integrifolia accession 320c CA (Table 1). FASTQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was used to assess read quality before and after trimming and filtering.
Table 1. Newly sequenced accessions of Rhus integrifolia and Rhus ovata.
Species | Acc # | # Reads | L (bp), x-cov | Locality | Lat, Lon, Elev (m) |
---|---|---|---|---|---|
Rhus ovata | 290b | 21,723,472 | 160,262, 1,266.5× | Banner Grade, San Diego County, California, USA | 33.084277, −116.563568, 306.9 |
R. ovata | 371a | 10,979,554 | 160,173, 2,490.9× | Prescott National Forest, Yavapai County, Arizona, USA | 34.432816, −112.553975, 1679.7 |
R. integrifolia | 320c | 19,342,934 | 160,141, 6,063.9× | El Capitan State Beach, Santa Barbara County, California, USA | 33.743931, −118.411408, 46.9 |
Note:
Acc#, accession number for voucher specimens deposited at the West Virginia University Herbarium (WVA); # reads, the number of paired-end Illumina reads remaining after processing; L (bp), length of the complete plastome in base-pairs; x-cov, coverage depth of the plastome; Lat/Lon/Elev (ft), GPS latitude, longitude, and elevation of collection localities in feet above sea level.
Plastome assembly and annotation
Plastomes were assembled into complete circular chromosomes with the Perl script ‘NOVOPlasty v.3.8’ (Dierckxsens, Mardulyn & Smits, 2016) specifying the following parameters: genome size range = 100,000–200,000 bp; kmer length = 55; insert size = 350 bp; insert range = 2.0; and using Rhus ovata plastid rbcL (GenBank MF963245) as a starting seed. Initial assemblies revealed coverage depth estimates > 1,000×, so each read pool was down-sampled to approximately 200× using a custom UNIX command before assembly. Total read pools were mapped back to assembled circular chromosomes under stringent parameters in Geneious v.10 (http://www.geneious.com/), allowing no gaps and no mismatches, in order to check the accuracy of assemblies and calculate coverage depth. The inverted repeat boundaries were verified via continuous paired-end read coverage and the self-dotplot function in Geneious. FASTA files of each chromosome were then rearranged into the same syntenic orientation, with the Large Single Copy (LSC), Inverted Repeat B (IRb), Small Single Copy (SSC), and Inverted Repeat A (IRa), respectively.
Complete, finished plastomes were annotated with the Perl script “PGA” (Qu et al., 2019), which uses BLAST+ (Camacho et al., 2009) and a set of previously annotated plastomes (here, Anacardium occidentale L. GenBank # KY635877; Mangifera indica L. KX871231; Pistacia weinmaniifolia J. Poiss. ex Franch. MF630953; Spondias mombin L. KY828469; Toxicodendron vernicifluum (Stokes) F.A. Barkley MK419151; Rhus chinensis MF351625 and Citrus aurantiifolia (Christm.) Swingle KJ865401). Annotations were examined in Geneious to identify mis-annotated regions or problems with predicted translations, and then were adjusted in the NCBI Sequin software (https://www.ncbi.nlm.nih.gov/Sequin). Annotated plastomes were submitted to NCBI GenBank under accession numbers MT024991–MT024993 (File S1).
Plastome alignment, sequence variation, and repeat content
The three complete plastomes were then aligned via the progressiveMAUVE (Darling, Mau & Perna, 2010) plugin for Geneious (using MAFFT v.7 as an alignment software; Katoh & Standley, 2013), relaxing the assumption of collinearity, with gap opening penalty = 3.0 and offset value = 0.5, in order to check for genomic rearrangements. Dispersed repeat content was calculated with REPuter (Kurtz, 2001) via the Bielefeld University Bioinformatics Service (https://bibiserv.cebitec.uni-bielefeld.de/), quantifying the numbers of forward-forward, forward-compliment, forward-reverse, and palindromic repeats with motif lengths >20 bp. A Hamming distance of 3 and e-value cutoff of 10−3 were specified during the search. Perfect tandem repeat content for motifs from 2 to 50 bp were quantified with the Phobos plugin for Geneious 10 (Mayer C., Phobos 3.3.11, 2006–2010, http://www.rub.de/ecoevo/cm/cm_phobos.htm). Information content per locus (here termed “IC”) for each spacer, intron, or protein-coding region yielding variation was calculated as: IC = ((# SNP + # Indels)/locus length) × 100. These were ranked by locus length, and length vs. IC were plotted for each locus to identify highly variable regions within the typical target range for PCR amplification and Sanger sequencing with a single primer pair (e.g., 1,000–500 bp). Primers were designed using the Primer3 (Untergasser et al., 2012) plugin for Geneious. Additional information on sequence variation among Rhus accessions was calculated in DnaSP v.6. (Rozas et al., 2017).
Plastid phylogenomic analysis
A representative sample of species was chosen from complete, annotated plastid genomes of Sapindales to investigate plastid genomic structural variation across the order. This included 52 completely sequenced and annotated plastomes (49 from NCBI GenBank plus the three newly generated Rhus plastomes), and two outgroup taxa: Bretschneidera sinensis Hemsl. GenBank # NC_037753 (Brassicales Bromhead, Akaniaceae Stapf) and Shorea pachyphylla Ridl. ex Symington MK841940 (Malvales Juss., Dipterocarpaceae Blume). In order to make the analyses computationally tractable, a single member of each available genus was included representing seven of the nine families of Sapindales (Biebersteiniaceae and Kirkiaceae had no publicly available complete plastomes). Multiple species of some genera were included due to notable plastome length variation, including: Commiphora Jacq. (Burseraceae Kunth); Rhus (Anacardiaceae); Acer L. and Dipteronia Oliv. (Sapindaceae Juss.); and Entandrophragma C. DC. (Meliaceae Juss.). IR annotations were added to plastomes missing this information in GENEIOUS via the self-dotplot function, and the second IR copy was removed from all accessions (IRa). The fifty-four complete plastomes (minus IRa) were aligned with progressiveMAUVE as above. The number of syntenic, locally collinear blocks (LCB) was calculated and subjected to a pairwise “double cut and join” (DCJ) analysis in MAUVE to estimate and visualize the number of genomic rearrangements and major inversions.
In order to interpret plastome evolution in a comparative context, a phylogenetic tree was constructed for a concatenated matrix of the seven largest LCB identified in MAUVE (ranging from 11,663 to 52,267 bp aligned length) for a total of 209,579 bp in aligned length, including both coding and non-coding regions. RAxML (Stamatakis, 2014) was used to generate a tree under the GTR-Γ model, with the default number of rate categories (25), and base frequencies estimated empirically. Ten replicate tree searches were conducted, starting from parsimony trees, to check for convergence in topology. Then, 1,000 standard bootstrap replicates were conducted in RAxML under the same search parameters and displayed on the maximum likelihood tree estimate. All analyses were run across 30 cores, each using of 16 GB RAM (specifying RAxMLHPC-PTHREADS) on a Thinkmate VSX R5 760V3 server-class workstation (Thinkmate, Waltham, MA, USA).
To investigate the relationship between and total plastome length, IR length, and total tandem repeat content, all accessions were compared via phylogenetic least-squares regression (PGLS), which accounts for phylogenetic signal in trait covariance among species (Felsenstein, 1985; Garland, Harvey & Ives, 1992; Grafen, 1992), using the R packages “ape” (Paradis & Schliep, 2019), “phytools” (Revell, 2012), and “caper” (Orme et al., 2013). Tandem repeat content for all 54 complete plastomes (excluding IRa) was calculated in TandemRepeatsFinder (Benson, 1999) specifying a minimum alignment score of 50, and alignment parameters of 2, 7, and 7 for matches, mismatches, and indels, respectively. The consensus size of each repeat (bp) was multiplied by the number of each repeat to calculate the total tandem repeat content in bp for each plastome. Phylogenetic signal for IR length, repeat content, and total plastome length was calculated as Pagel’s λ and its significance compared to a model of λ = 0 (Pagel, 1999). PGLS analysis was run under a Brownian Motion model, with λ estimated. Lastly, a closer investigation of expansion and contraction of IR boundaries was conducted using IRscope (Amiryousefi, Hyvönen & Poczai, 2018), an R ‘Shiny’ application that allows analysis of up to ten annotated plastomes simultaneously (https://irscope.shinyapps.io/irapp/), and outputs a JPEG illustration of IR boundary dynamics.
Results
Plastome structure, repeat content, and sequence variation
Assembled plastome sequences for Rhus ranged from 160,141 bp (R. integrifolia) to 160,262 bp (R. ovata from CA) (Table 1; File S1). All three plastomes contained a total of 110 genes: 76 protein coding genes (CDS), 30 transfer RNA genes, and four ribosomal RNA genes. The LSC region ranged from 87,980 bp in R. integrifolia to 88,086 in Arizonan R. ovata; the IR from 26,602 in R. integrifolia to 26,635 bp in Californian R. ovata; and the SSC from 18,880 bp in Arizonan R. ovata to 18,957 bp in R. integrifolia. There was a total of 111 segregating sites (excluding sites with gaps), and an average pairwise nucleotide diversity (π) of 0.00055 ± 0.00016. The number of nucleotide substitutions in pairwise comparisons ranged from 127 between Californian and Arizonan R. ovata to 208 between Californian R. ovata and R. integrifolia (Table 2). All three accessions had similar numbers of dispersed and tandem repeats (Table 3; File S2). Forward–Forward dispersed repeats were the most common, followed by palindromic, forward–reverse, and forward–compliment based on our search criteria. Tandem repeats were abundant, with pentanucleotide repeats being most common, while dinucleotide repeats were the rarest, considering repeat motifs from 2 to 20 bp (Table 3). Of particular note was a 22 bp minisatellite repeat within the ndhC-trnVUAC spacer that varied in copy number between the three Rhus accessions (ATT TTT TTT ATT ATT AAT TAT T), with one unit in R. integrifolia, two units in the Arizonan accession of R. ovata, and three in the Californian accession of R. ovata.
Table 2. The number of pairwise nucleotide differences among three Rhus plastomes, excluding Inverted Repeat A.
R. ovata CA | R. ovata AZ | |
---|---|---|
R. ovata CA | ||
R. ovata AZ | 127 | |
R. integrifolia | 208 | 196 |
Table 3. Numbers of dispersed and tandem repeats in three Rhus plastomes.
R. integrifola | R. ovata CA | R. ovata AZ | |
---|---|---|---|
Dispersed repeats | |||
Forward–Compliment | 1 | 1 | 2 |
Forward–Forward | 32 | 33 | 33 |
Forward–Reverse | 13 | 15 | 16 |
Palindromic | 19 | 18 | 18 |
Total | 65 | 67 | 69 |
Tandem repeat motif length (bp) |
|||
2 | 5 | 5 | 5 |
3 | 16 | 17 | 16 |
4 | 76 | 77 | 77 |
5 | 253 | 257 | 256 |
6 | 154 | 157 | 158 |
7 | 47 | 45 | 45 |
8 | 21 | 21 | 21 |
9 | 11 | 11 | 10 |
>9 | 30 | 31 | 28 |
Total | 613 | 621 | 616 |
Overall plastome structure and synteny did not differ among all three newly sequenced Rhus accessions (File S1). Interestingly, the plastomes of two accessions of Rhus chinensis (GenBank accessions MF351625 and KX447140) displayed a major contraction of the IR relative to R. integrifolia, R. ovata, R. typhina, and another accession of R. chinensis (MG267385), with the IR being some 10 KB smaller in the former R. chinensis accessions. In the two accessions of R. chinensis with a reduced IR, four genes normally found in the IR are instead found in the LSC region: rpl2, rpl23, trnICAU, and ycf2.
The majority of SNPs, single-base indels, and multi-base indels occur in intergenic spacers within R. integrifolia and R. ovata, and these are predominantly found in the LSC region (Fig. 2). SNPs and indels within introns are present mainly in the LSC, followed by the SSC, with none in the IR. Exonic SNPs and multi-base indels are slightly more prevalent in the SSC than in the LSC. Considering information content (IC), several regions display relatively high variation within the range of commonly used PCR and sequencing capabilities with two primers for a single amplicon (i.e., 500–1,000 bp; Fig. 3). These include: the trnFGAA-ndhJ spacer, matK-trnKUUU spacer, accD-psaI spacer, clpP intron #2, trnHGUG-psbA spacer, psaA-ycf3 spacer, trnEUUC-trnTGGU spacer, and ndhCUAC-trnV spacer. Oligonucleotides for amplification of these eight regions are listed in File S3.
Figure 2. Plastome variation by region.
Numbers of single nucleotide polymorphisms (SNPs, dark gray), single-base insertion/deletions (light gray), and insertions/deletions > 1 bp (black) across coding gene regions (i.e., exons, RNA genes), intergenic spacers, and introns of all three aligned plastomes of Rhus integrifolia and R. ovata.
Figure 3. Informative characters by plastid locus.
Comparison of informative character content (IC) for all loci containing variation among three Rhus accessions. Here, the black columns indicate IC = ((# SNP + # Indels)/locus length) × 100. Gray columns = length of each locus (kilobases; numbers above first three loci are lengths in bp). Vertical dashed lines represent the range for a typical PCR amplicon for Sanger sequencing with two primers (1,000–500 bp).
Plastid phylogeonomics of Rhus and order Sapindales
The concatenated matrix of 54 taxa and 209,573 aligned bp had 26,989 variable positions, excluding gaps, with 16,421 parsimony informative sites, 10,568 singleton sites, and 87,070 sites including at least one inferred gap (File S4). Fig. 4 shows the estimated phylogenetic relationships among the 52 complete plastomes of Sapindales, plus two non-sapindalean outgroups. The Maximum likelihood analysis recovered 100% bootstrap support for all “deep” relationships among families within Sapindales (excluding Biebersteiniaceae Schnitzlein and Kirkiaceae Takhtajan, for which no complete plastomes were available). Rutaceae Juss. were placed as sister to Simaroubaceae DC., followed by Meliaceae. This clade was placed as sister to Sapindaceae, and collectively these four families were placed as sister to a clade of Anacardiaceae + Burseraceae. Nitrariaceae Lindl. grouped as sister to all other families of Sapindales included in this analysis. Within Anacardiaceae, three accessions of the Asian Rhus chinensis grouped as sister to an accession of the North American R. typhina L. The two accessions of Rhus ovata (CA and AZ) were weakly supported as sister to one another (BS = 72), and together sister to R. integrifolia. The Rhus integrifolia-ovata clade grouped as sister to all other Rhus accessions. Rhus was grouped as sister to Pistacia, Toxicodendron, Anacardium L. + Mangifera L., Sclerocarya Hochst., and Spondias L., respectively.
Figure 4. Plastid phylogenetic tree of Sapindales.
Maximum likelihood tree of a 209,753 bp alignment of seven concatenated locally collinear plastome blocks across 52 representatives of order Sapindales, representatives from seven of nine constituent families. Black square indicates order Sapindales relative to the two outgroup taxa from Brassicales and Malvales. Values adjacent to branches indicate RAxML bootstrap support; all are 100% unless otherwise noted. Inset: Unrooted Maximum likelihood tree displaying branch length differences based on a whole-plastome alignment of the three Rhus accessions sequenced in this study, excluding IRa. Dashed line shows the position of the Rhus integrifolia-ovata complex in the larger tree. Scalebar represents substitutions per site under a GTR+Γ model in RAxML.
There was evidence for phylogenetic signal in total plastome length (Pagel’s λ = 0.802, p = 0.0017), while there was no signal in IR length (Pagel’s λ = 0.017, p = 0.8885) or repeat content (Pagel’s λ = 6.68 × 10−5, p = 1.0) based on analysis of 54 plastomes. Phylogenetic least squares regression revealed no association between IR length and total plastome length (adjusted R2 = 0.0079, F = 1.421, p = 0.2387), but a significant, albeit weak association between repeat content and total plastome length (adjusted R2 = 0.2175, F = 15.73, p = 0.0002). A model including both IR length and repeat content as explanatory variables yielded similar results (adjusted R2 = 0.2256, F = 9.722, p = 0.0006), wherein repeat content was significantly associated with total plastome length (p = 0.0002) and IR length was not (p = 0.2189). Many members of Sapindaceae display relatively smaller total plastome lengths (e.g., Acer, Dipteronia, Aesculus L.; Fig. 5), but this is not reflected by smaller IR lengths for those species. The plastome of Anacardium occidentale was the longest in the dataset, and had the longest IR region, but otherwise there were no clear patterns of variation in IR length driving total length of the plastome.
Figure 5. Ancestral state reconstructions of total plastome length, length of the inverted repeat (IR), and tandem repeat content.
Analyses were conducted under a Brownian Motion model of trait covariance in the R packages “ape” and “phytools”. Red-orange values in the legend indicate smaller values, while blue values represent larger values.
Synteny analysis in MAUVE of the 54 complete plastomes revealed a total of 11 locally collinear, syntenic blocks (LCB; File S5). There were a few notable genomic rearrangements among members of Sapindales, but these were mostly specific to individual species or genera and were not phylogenetically informative at “higher” taxonomic levels, for example, at the family level or above (File S5). Mangifera indica has a large inversion in the LSC region, and Anacardium occidentale contains a ~6,700 insertion of mitochondrial DNA within the IR, as observed previously (Rabah et al., 2017). The other striking example of plastome modification comes from two of three accessions of the Asian species Rhus chinensis: GenBank accessions KX447140 and MF351625 have IR lengths of 16,741 bp and 16,602 bp, respectively, relative to accession MG267385, which has an IR length of 27,375 bp (Fig. 5; File S5).
A closer analysis of IR boundaries within Rhus reveals dynamic expansions and contractions of the IR, primarily at the IR-LSC junctions (Fig. 6). The most notable finding is the independent loss of the rps19 gene from the plastomes of R. chinensis and the R. integrifolia-ovata complex, with further loss of rpl22 in R. chinensis (Fig. 6). The IR is expanded at the LSC-IR boundary in one accession of R. chinensis relative to R. integrifolia, R. ovata, and R. typhina, to include a portion of psbA, which is typically found at the start of the LSC region (Fig. 6). This is followed by an approximately 9.8 KB contraction of the IR boundary in two accessions of R. chinensis. The rps19 gene spans the LSC-IRb boundary in Rhus typhina, while in R. ovata and R. integrifolia, the gene is missing. Based on the phylogenetic pattern of (R.integrifolia-ovata, (R. typhina, R. chinensis)), it can be inferred that rps19 was lost at least twice within Rhus, based on the current sampling.
Figure 6. Inverted Repeat (IR) boundaries in Rhus.
The tree figure (left) corresponds to relationships from Fig. 4, which are all supported with bootstrap values of 100. Red arrows indicate losses of genes or regions of the plastome. LSC, Large Single Copy region; IRb, Inverted Repeat copy “b”; ‘SSC, Small Single Copy Region; IRa, Inverted Repeat copy “a”; JLB, LSC-IRb junction; LSB, IRb-SSC junction; JSA, SSC-IRa junction; JLA, IRa-LSC junction. Small bars and arrows indicate the distance to the nearest IR junction, and diamonds/double-slashes indicate truncations for the purpose of visualization.
Discussion
Three complete plastid genomes of the Rhus integrifolia-R. ovata species complex were sequenced, annotated, and analyzed in a comparative context with several other complete plastomes from the order Sapindales. Patterns of nucleotide polymorphism and indel variation were characterized among the three newly generated Rhus plastomes, providing genomic resources for population-level and phylogeographic study of these ecologically and economically important species. Additionally, phylogenetic analysis of 52 complete plastomes in order Sapindales provides resolution and robust branch support among most families within the order, allowing a well-supported basis for phylogenomic comparison of plastid genome evolution. The most striking finding was the dynamic evolution of the inverted repeat boundary in the genus Rhus, which was evident based on limited sampling of only seven accessions representing four species of this genus. This included independent losses of the rps19 gene in R. ovata-R. integrifolia complex and in R. chinensis, with an additional 9.8 KB contraction of the IR within two of three individuals of R. chinensis. These findings suggest that the genus Rhus may have experienced several such events, providing a potentially novel, dynamic model system in which to study plastome structural changes in angiosperms at and below the species level.
Plastome structure, repeat content, and sequence variation in R. integrifolia and R. ovata
Plastome structure and synteny in the three newly sequenced Rhus accessions are somewhat typical for angiosperms (Wicke et al., 2011; Ruhlman & Jansen, 2014). The degree of overall plastome sequence variation among the three Rhus accessions is similar to what is observed in other intraspecific studies of plastid genomic variation among photosynthetic angiosperms, for example, in: Silene vulgaris (Moench) Garcke (Krüger et al., 2019), Macadamia integrifolia Maiden & Betche (Nock et al., 2019), Quercus acutissima Carruth. (Zhang et al., 2020), Utricularia amethystina Salzm. ex A. St.-Hil. & Girard (Silva et al., 2019), Capsicum annuum L. (Magdy et al., 2019), Holcoglossum Schltr. spp. (Li et al., 2019b). Repetitive DNA is common in the three Rhus accessions, with 65–69 dispersed repeats >20 bp in length) and 613–621 tandem repeats of motif length 2–20 bp (File S2). These regions are candidates for primer design and the study of length variation in a population-genetic or phylogeographic context. In particular, a 22 bp minisatellite repeat within the ndhC-trnVUAC spacer varied between all three accessions and holds potential as an informative marker. The vast majority of SNP and indel variation is found in the LSC region, particularly within spacer or intron regions. Some of these intron and spacer regions correspond to those shown in previous studies to contain relatively high levels of variation (e.g., ndhC-trnVUAC; Shaw et al., 2005, 2007), and are likely to be useful both above and below the species level as variable markers. Rhus-specific rimers for PCR amplification were designed for eight of these regions (File S3). Furthermore, the complete plastid genome sequences for R. integrifolia and R. ovata can be used to design specific target-capture or long-range PCR probes (Cronn et al., 2012; Mariac et al., 2014; Peñalba et al., 2014; Bethune et al., 2019).
Plastid phylogeonomics of Rhus and order Sapindales
Several previous studies have addressed phylogenetic relationships among families of the order Sapindales, or have included representatives of Sapindales in larger studies of angiosperm relationships (Muellner, Vassiliades & Renner, 2007; Weeks et al., 2014; Magallón et al., 2015; Muellner-Riehl et al., 2016; Li et al. 2019b). However, these studies differ in the relative placement of some of the nine families of Sapindales, with many lacking support for key relationships due to the reliance upon either limited genomic information (e.g., few-gene analyses), or by limited taxon sampling. Order Sapindales contains nine families, 479 genera, and approximately 6,570 species, or approximately 2.15% of angiosperm species diversity (Angiosperm Phylogeny Group, 2016; The Plant List, 2010). While the sampling of 52 plastomes in the current study does not capture the species richness of this diverse order, it does provide support for higher-level relationships based on the plastomes included, and further enables a robust framework for comparative plastid genomics in the order. In fact, the relationships among sapindalean families recovered in the current study are corroborated by another recent study based on complete plastome sequences in Wang et al. (2020), here with the addition of two plastomes from Nitrariaceae (Nitraria L. and Peganum L.), which were not included in the aforementioned study.
Missing from the current study are representatives of two sapindalean families, Biebersteiniaceae and Kirkiaceae. Muellner-Riehl et al. (2016) sampled 207 species representing all families of Sapindales, and conducted a phylogenetic analysis based on three plastid loci: rbcL, atpB, and the trnL-trnF spacer. While most of the deep branch support values in that study were moderate to high, the placement of Biebersteiniaceae and Nitrariaceae among other members of Sapindales received no support. Additionally, there was moderate support for Meliaceae as sister to Simaroubaceae, whereas in the current study and in that of Wang et al. (2020), both based on complete plastome data, Meliaceae is supported as sister to Simaroubaceae + Rutaceae. The topology from the current study among families is also identical to that in (Li et al., 2019b) based on coding regions of the plastome. However, in that study Biebersteiniaceae (not included in the current study) are sister to Nitrariaceae with <50% bootstrap support, collectively sister to the rest of Sapindales. Thus, future phylogenomic studies of Sapindales should focus on dense, strategic plastome sampling representative of the species richness of this clade, as well as on broad representation of nuclear genomic information (Collins, Gostel & Weeks, 2016; Eiserhardt et al., 2018; Johnson et al., 2019; Dodsworth et al., 2019).
Boundary shifts of the IR in plastid genomes are hypothesized to be one of the primary drivers of overall plastid genome size variation (Wicke et al., 2011; Ruhlman & Jansen, 2014). However, this hypothesis was not supported across 52 representative plastomes of Sapindales in a comparative framework via phylogenetic least squares regression. While a few shifts in plastome size were visibly associated with expansions or contractions of the IR (e.g., Anacardium occidentale, Rhus chinensis; Fig. 5), shifts in the IR boundary alone are not sufficient to explain overall plastome size variation in Sapindales among phylogenetically independent lineages, based on the current sampling. Variation in repeat content appears to be a stronger determinant of plastome length variation in Sapindales (e.g., as in Asarum; Sinn et al., 2018).
Perhaps the most striking finding in the current study is the dynamic nature of evolution of the IR boundary in Rhus, based on comparison of complete plastomes across the order Sapindales.
The inverted repeat (IR) region of the plastome is the most conserved in terms of sequence variation (Palmer & Thompson, 1982) relative to the LSC and SSC. However, numerous studies have demonstrated the dynamic nature of boundary shifts in the IR, which often has implications for the evolution of gene content, structural stability, and substitution rates (Zhu et al., 2016). By the inclusion of only seven available accessions of four species in Rhus, it is evident that IR boundaries are variable at and below the species levels (Fig. 6). Thus, increased representation of plastomes in this genus and related genera will likely reveal additional variation in plastome structure, making Rhus a promising system in which to test hypotheses associated with IR expansion/contraction and the overall structural/substitutional dynamics of the plastome in a phylogenetic, comparative context.
IR boundary shifts often lead to gene duplication, loss, and large-scale syntenic rearrangement, all of which have implications for substitution rates of the genes captured or “released” from the IR (Zhu et al., 2016). In some cases, genes can be lost from the plastome as a result of IR boundary shifts (Wicke et al., 2014; Downie & Jansen, 2015), as is the case in R. integrifolia-ovata (rps19) and R. chinensis (rpl22 and rps19; Fig. 6). It is possible that one or both of these genes were transferred to another genomic compartment, either to the mitochondrial genome or the nuclear genome, where they may still be expressed (Martin & Herrmann, 1998; Selosse, Albert & Godelle, 2001). Alternatively, they may have been lost from the organism altogether, with little to no fitness implications for plastid function. Numerous examples of the loss of such “housekeeping” genes are evident in mycoheterotrophic or parasitic plants which contain plastids that presumably retain function at some level (Wicke et al., 2011, 2013, 2016).
The IR is hypothesized to play a major role in the structural stability of the plastome (Palmer & Thompson, 1982). The IR has been lost several times independently among plant lineages, for example in: conifers (Raubeson & Jansen, 1992; Wu et al., 2011; Wu & Chaw, 2014), legumes (Palmer, Osorio & Thompson, 1988; Lavin, Doyle & Palmer, 1990; but also regained de novo, Choi, Jansen & Ruhlman, 2019), parasitic plants (Wicke et al., 2016), mycoheterotrophic plants (Schelkunov et al., 2015; Yuan et al., 2018; Barrett, Sinn & Kennedy, 2019), members of family Geraniaceae Juss. (Guisinger et al., 2011; Blazier et al., 2016), cacti (Sanderson et al., 2015), and a palm (Tahina spectabilis J. Dransf. & Rakotoarinivo; Barrett et al., 2016). However, there is mixed evidence that IR loss leads to syntenic instability, including a lack of evidence from Erodium L’Hér. (Geraniaceae; Blazier et al., 2016) and weak evidence within the “IR-lacking clade” of papilionoid legumes that plastome rearrangements are prevalent following the loss of the IR (Sabir et al., 2014).
Conclusion
Three complete plastid genomes were generated for two ecologically important North American shrub species in the genus Rhus. Information from these plastomes was used to design PCR primers for amplifying variable regions and regions containing phylogeographically informative repeats in dense population-level samples (e.g., ndhC-trnV). Plastome evolution was investigated in a robustly supported phylogenomic framework, revealing striking variation within Rhus in terms of dynamics of the LSC-IR boundary, with multiple, independent instances of gene loss. Rhus presents a promising novel system in which to study the dynamics of plastome structural variation in photosynthetic angiosperms at or below the species level, and continued representation of plastomes in this genus is likely to reveal additional changes.
Supplemental Information
Acknowledgments
The author thanks Victoria Sosa, Anna Victoria Silvério Righetto Mauad, Brandon Sinn, Mathilda Santee, Yamuna Somaratne, and an anonymous reviewer for helpful comments on the manuscript.
Funding Statement
Funding was provided by The California State University Program to Support Education and Biotechnology, the West Virginia University Department of Biology, and a WVU Program to Stimulate Competitive Research (PSCoR) award to Craig F. Barrett. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare that they have no competing interests.
Author Contributions
Craig F. Barrett conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.
Field Study Permissions
The following information was supplied relating to field study approvals (i.e., approving body and any reference numbers):
Permission to collect samples was provided by California State Parks (DRP065), and the USDA Forest Service (FS-2400-8).
DNA Deposition
Data Availability
The following information was supplied regarding data availability:
Complete plastid genomes of Rhus accessions (GenBank format) are available in File S1. Repeat locations are available in File S2. Primer sequences are available in File S3. The plastid coding gene alignment for Sapindales is available in File S4. MAUVE alignment of 54 plastomes is available in File S5.
References
- Amiryousefi, Hyvönen & Poczai (2018).Amiryousefi A, Hyvönen J, Poczai P. IRscope: an online program to visualize the junction sites of chloroplast genomes. Bioinformatics. 2018;34(17):3030–3031. doi: 10.1093/bioinformatics/bty220. [DOI] [PubMed] [Google Scholar]
- Angiosperm Phylogeny Group (2016).Angiosperm Phylogeny Group An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Botanical Journal of the Linnean Society. 2016;181(1):1–20. doi: 10.1111/boj.12385. [DOI] [Google Scholar]
- Andrés-Hernández & Terrazas (2009).Andrés-Hernández AR, Terrazas T. Leaf architecture of Rhus s.str. (Anacardiaceae) Feddes Repertorium. 2009;120(5–6):293–306. doi: 10.1002/fedr.200911109. [DOI] [Google Scholar]
- Andrés-Hernández et al. (2014).Andrés-Hernández AR, Terrazas T, Salazar G, Ochoterena H. Phylogenetic analysis based on structural and combined analyses of Rhus s.s. (Anacardiaceae) Botanical Journal of the Linnean Society. 2014;176(4):452–468. doi: 10.1111/boj.12222. [DOI] [Google Scholar]
- Barkley (1937).Barkley FA. A monographic study of Rhus and its immediate allies in North and Central America, including the West Indies. Annals of the Missouri Botanical Garden. 1937;24(3):265. doi: 10.2307/2394183. [DOI] [Google Scholar]
- Barrett et al. (2016).Barrett CF, Baker WJ, Comer JR, Conran JG, Lahmeyer SC, Leebens-Mack JH, Li J, Lim GS, Mayfield-Jones DR, Perez L, Medina J, Pires JC, Santos C, Stevenson DW, Zomlefer WB, Davis JI. Plastid genomes reveal support for deep phylogenetic relationships and extensive rate variation among palms and other commelinid monocots. New Phytologist. 2016;209(2):855–870. doi: 10.1111/nph.13617. [DOI] [PubMed] [Google Scholar]
- Barrett, Sinn & Kennedy (2019).Barrett CF, Sinn BT, Kennedy AH. Unprecedented parallel photosynthetic losses in a heterotrophic orchid genus. Molecular Biology and Evolution. 2019;36(9):1884–1901. doi: 10.1093/molbev/msz111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barrett, Wicke & Sass (2018).Barrett CF, Wicke S, Sass C. Dense infraspecific sampling reveals rapid and independent trajectories of plastome degradation in a heterotrophic orchid complex. New Phytologist. 2018;218(3):1192–1204. doi: 10.1111/nph.15072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benson (1999).Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Research. 1999;27(2):573–580. doi: 10.1093/nar/27.2.573. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bethune et al. (2019).Bethune K, Mariac C, Couderc M, Scarcelli N, Santoni S, Ardisson M, Martin J, Montúfar R, Klein V, Sabot F, Vigouroux Y, Couvreur TLP. Long-fragment targeted capture for long-read sequencing of plastomes. Applications in Plant Sciences. 2019;7(5):e1243. doi: 10.1002/aps3.1243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blazier et al. (2016).Blazier JC, Jansen RK, Mower JP, Govindu M, Zhang J, Weng M-L, Ruhlman TA. Variable presence of the inverted repeat and plastome stability in Erodium. Annals of Botany. 2016;117:1209–1220. doi: 10.1093/aob/mcw065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolger, Lohse & Usadel (2014).Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camacho et al. (2009).Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10(1):421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chase et al. (1993).Chase MW, Soltis DE, Olmstead RG, Morgan D, Les DH, Mishler BD, Duvall MR, Price RA, Hills HG, Qiu Y-L, Kron KA, Rettig JH, Conti E, Palmer JD, Manhart JR, Sytsma KJ, Michaels HJ, Kress WJ, Karol KG, Clark WD, Hedren M, Gaut BS, Jansen RK, Kim K-J, Wimpee CF, Smith JF, Furnier GR, Strauss SH, Xiang Q-Y, Plunkett GM, Soltis PS, Swensen SM, Williams SE, Gadek PA, Quinn CJ, Eguiarte LE, Golenberg E, Learn GH, Graham SW, Barrett SCH, Dayanandan S, Albert VA. Phylogenetics of seed plants: an analysis of nucleotide sequences from the plastid gene rbcL. Annals of the Missouri Botanical Garden. 1993;80(3):528–580. doi: 10.2307/2399846. [DOI] [Google Scholar]
- Chaw & Jansen (2018).Chaw S-M, Jansen RK. Advances in Botanical Research. Vol. 85. Cambridge: Academic Press; 2018. Plastid genome evolution; pp. 263–292. [Google Scholar]
- Choi, Jansen & Ruhlman (2019).Choi I-S, Jansen R, Ruhlman T. Lost and found: return of the inverted repeat in the legume clade defined by its absence. Genome Biology and Evolution. 2019;11(4):1321–1333. doi: 10.1093/gbe/evz076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Collins, Gostel & Weeks (2016).Collins ES, Gostel MR, Weeks A. An expanded nuclear phylogenomic PCR toolkit for Sapindales. Applications in Plant Sciences. 2016;4(12):1600078. doi: 10.3732/apps.1600078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cronn et al. (2012).Cronn R, Knaus BJ, Liston A, Maughan PJ, Parks M, Syring JV, Udall J. Targeted enrichment strategies for next-generation plant biology. American Journal of Botany. 2012;99(2):291–311. doi: 10.3732/ajb.1100356. [DOI] [PubMed] [Google Scholar]
- Darling, Mau & Perna (2010).Darling AE, Mau B, Perna NT. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLOS ONE. 2010;5(6):e11147. doi: 10.1371/journal.pone.0011147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dierckxsens, Mardulyn & Smits (2016).Dierckxsens N, Mardulyn P, Smits G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Research. 2016;53:gkw955. doi: 10.1093/nar/gkw955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dodsworth et al. (2019).Dodsworth S, Pokorny L, Johnson MG, Kim JT, Maurin O, Wickett NJ, Forest F, Baker WJ. Hyb-Seq for flowering plant systematics. Trends in Plant Science. 2019;24(10):887–891. doi: 10.1016/j.tplants.2019.07.011. [DOI] [PubMed] [Google Scholar]
- Downie & Jansen (2015).Downie SR, Jansen RK. A comparative analysis of whole plastid genomes from the Apiales: expansion and contraction of the inverted repeat, mitochondrial to plastid transfer of DNA, and identification of highly divergent noncoding regions. Systematic Botany. 2015;40(1):336–351. doi: 10.1600/036364415X686620. [DOI] [Google Scholar]
- Doyle & Doyle (1987).Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bulletin. 1987;19:11–15. [Google Scholar]
- Eiserhardt et al. (2018).Eiserhardt WL, Antonelli A, Bennett DJ, Botigué LR, Burleigh JG, Dodsworth S, Enquist BJ, Forest F, Kim JT, Kozlov AM, Leitch IJ, Maitner BS, Mirarab S, Piel WH, Pérez-Escobar OA, Pokorny L, Rahbek C, Sandel B, Smith SA, Stamatakis A, Vos RA, Warnow T, Baker WJ. A roadmap for global synthesis of the plant tree of life. American Journal of Botany. 2018;105(3):614–622. doi: 10.1002/ajb2.1041. [DOI] [PubMed] [Google Scholar]
- Felsenstein (1985).Felsenstein J. Phylogenies and the comparative method. American Naturalist. 1985;125(1):1–15. doi: 10.1086/284325. [DOI] [PubMed] [Google Scholar]
- Garland, Harvey & Ives (1992).Garland T, Harvey PH, Ives AR. Procedures for the analysis of comparative data using phylogenetically independent contrasts. Systematic Biology. 1992;41(1):18–32. doi: 10.1093/sysbio/41.1.1816. [DOI] [Google Scholar]
- Gitzendanner et al. (2018).Gitzendanner MA, Soltis PS, Wong GK-S, Ruhfel BR, Soltis DE. Plastid phylogenomic analysis of green plants: a billion years of evolutionary history. American Journal of Botany. 2018;105(3):291–301. doi: 10.1002/ajb2.1048. [DOI] [PubMed] [Google Scholar]
- Grafen (1992).Grafen A. The uniqueness of the phylogenetic regression. Journal of Theoretical Biology. 1992;156(4):405–423. doi: 10.1016/S0022-5193(05)80635-6. [DOI] [Google Scholar]
- Guisinger et al. (2011).Guisinger MM, Kuehl JV, Boore JL, Jansen RK. Extreme reconfiguration of plastid genomes in the angiosperm family Geraniaceae: rearrangements, repeats, and codon usage. Molecular Biology and Evolution. 2011;28(1):583–600. doi: 10.1093/molbev/msq229. [DOI] [PubMed] [Google Scholar]
- Horton (1949).Horton JS. Trees and shrubs for erosion control of southern California mountains. Berkeley: U.S. Department of Agriculture, Forest Service, California Forest and Range Experiment Station; California Department of Natural Resources, Division of Forestry; 1949. [Google Scholar]
- Johnson et al. (2019).Johnson MG, Pokorny L, Dodsworth S, Botigué LR, Cowan RS, Devault A, Eiserhardt WL, Epitawalage N, Forest F, Kim JT, Leebens-Mack JH, Leitch IJ, Maurin O, Soltis DE, Soltis PS, Wong GK, Baker WJ, Wickett NJ. A universal probe set for targeted sequencing of 353 nuclear genes from any flowering plant designed using k-medoids clustering. Systematic Biology. 2019;68(4):594–606. doi: 10.1093/sysbio/syy086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh & Standley (2013).Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim et al. (2017).Kim I, Park JY, Lee YS, Joh HJ, Kang SJ, Murukarthick J, Lee HO, Hur Y-J, Kim Y, Kim KH, Lee S-C, Yang T-J. The complete chloroplast genome sequence and intra-species diversity of Rhus chinensis. Plant Breeding and Biotechnology. 2017;5(3):243–251. doi: 10.9787/PBB.2017.5.3.243. [DOI] [Google Scholar]
- Krüger et al. (2019).Krüger M, Abeyawardana OAJ, Juříček M, Krüger C, Štorchová H. Variation in plastid genomes in the gynodioecious species Silene vulgaris. BMC Plant Biology. 2019;19(1):568. doi: 10.1186/s12870-019-2193-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kurtz (2001).Kurtz S. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Research. 2001;29(22):4633–4642. doi: 10.1093/nar/29.22.4633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lavin, Doyle & Palmer (1990).Lavin M, Doyle JJ, Palmer JD. Evolutionary significance of the loss of the chloroplast-DNA inverted repeat in the Leguminosae subfamily Papilionoideae. Evolution. 1990;44(2):390–402. doi: 10.1111/j.1558-5646.1990.tb05207.x. [DOI] [PubMed] [Google Scholar]
- Lee et al. (2016).Lee YS, Kim I, Kim J-K, Park JY, Joh HJ, Park H-S, Lee HO, Lee S-C, Hur Y-J, Yang T-J. The complete chloroplast genome sequence of Rhus chinensis Mill (Anacardiaceae) Mitochondrial DNA Part B. 2016;1(1):696–697. doi: 10.1080/23802359.2016.1222250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li et al. (2019a).Li Z-H, Ma X, Wang D-Y, Li Y-X, Wang C-W, Jin X-H. Evolution of plastid genomes of Holcoglossum (Orchidaceae) with recent radiation. BMC Evolutionary Biology. 2019a;19(1):63. doi: 10.1186/s12862-019-1384-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li et al. (2019b).Li H-T, Yi T-S, Gao L-M, Ma P-F, Zhang T, Yang J-B, Gitzendanner MA, Fritsch PW, Cai J, Luo Y, Wang H, Van der Bank M, Zhang S-D, Wang Q-F, Wang J, Zhang Z-R, Fu C-N, Yang J, Hollingsworth PM, Chase MW, Soltis DE, Soltis PS, Li D-Z. Origin of angiosperms and the puzzle of the Jurassic gap. Nature Plants. 2019b;5(5):461–470. doi: 10.1038/s41477-019-0421-0. [DOI] [PubMed] [Google Scholar]
- Magallón et al. (2015).Magallón S, Gómez-Acevedo S, Sánchez-Reyes LL, Hernández-Hernández T. A metacalibrated time-tree documents the early rise of flowering plant phylogenetic diversity. New Phytologist. 2015;207(2):437–453. doi: 10.1111/nph.13264. [DOI] [PubMed] [Google Scholar]
- Magdy et al. (2019).Magdy M, Ou L, Yu H, Chen R, Zhou Y, Hassan H, Feng B, Taitano N, Van der Knaap E, Zou X, Li F, Ouyang B. Pan-plastome approach empowers the assessment of genetic variation in cultivated Capsicum species. Horticulture Research. 2019;6(1):108. doi: 10.1038/s41438-019-0191-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mariac et al. (2014).Mariac C, Scarcelli N, Pouzadou J, Barnaud A, Billot C, Faye A, Kougbeadjo A, Maillol V, Martin G, Sabot F, Santoni S, Vigouroux Y, Couvreur TLP. Cost-effective enrichment hybridization capture of chloroplast genomes at deep multiplexing levels for population genetics and phylogeography studies. Molecular Ecology Resources. 2014;14(6):1103–1113. doi: 10.1111/1755-0998.12258. [DOI] [PubMed] [Google Scholar]
- Martin & Herrmann (1998).Martin W, Herrmann RG. Gene transfer from organelles to the nucleus: how much, what happens, and why? Plant Physiology. 1998;118(1):9–17. doi: 10.1104/pp.118.1.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller, Young & Wen (2001).Miller AJ, Young DA, Wen J. Phylogeny and biogeography of Rhus (Anacardiaceae) based on ITS sequence data. International Journal of Plant Sciences. 2001;162(6):1401–1407. doi: 10.1086/322948. [DOI] [Google Scholar]
- Montalvo, Riordan & Beyers (2017).Montalvo AM, Riordan EC, Beyers JL. Plant profile for Rhus ovata: native plant recommendations for southern California ecoregions. Riverside: Riverside-Corona Resource Conservation District and U.S. Department of Agriculture, Forest Service, Pacific Southwest Research Station; 2017. [Google Scholar]
- Moore et al. (2007).Moore MJ, Bell CD, Soltis PS, Soltis DE. Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms. Proceedings of the National Academy of Sciences of the United States of America. 2007;104(49):19363–19368. doi: 10.1073/pnas.0708072104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Muellner, Vassiliades & Renner (2007).Muellner AN, Vassiliades DD, Renner SS. Placing Biebersteiniaceae, an herbaceous clade of Sapindales, in a temporal and geographic context. Plant Systematics and Evolution. 2007;266(3–4):233–252. doi: 10.1007/s00606-007-0546-x. [DOI] [Google Scholar]
- Muellner-Riehl et al. (2016).Muellner-Riehl AN, Weeks A, Clayton JW, Buerki S, Nauheimer L, Chiang Y-C, Cody S, Pell SK. Molecular phylogenetics and molecular clock dating of Sapindales based on plastid rbcL, atpB and trnL-trnF DNA sequences. TAXON. 2016;65(5):1019–1036. doi: 10.12705/655.5. [DOI] [Google Scholar]
- Nock et al. (2019).Nock CJ, Hardner CM, Montenegro JD, Ahmad Termizi AA, Hayashi S, Playford J, Edwards D, Batley J. Wild origins of macadamia domestication identified through intraspecific chloroplast genome sequencing. Frontiers in Plant Science. 2019;10:19. doi: 10.3389/fpls.2019.00334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orme et al. (2013).Orme D, Freckleton RP, Thomas GH, Petzoldt T, Fritz SA, Isaac N. CAPER: comparative analyses of phylogenetics and evolution in R. Methods in Ecology and Evolution. 2013;3(1):145–151. [Google Scholar]
- Pagel (1999).Pagel M. Inferring the historical patterns of biological evolution. Nature. 1999;401(6756):877–884. doi: 10.1038/44766. [DOI] [PubMed] [Google Scholar]
- Palmer, Osorio & Thompson (1988).Palmer JD, Osorio B, Thompson WF. Evolutionary significance of inversions in legume chloroplast DNAs. Current Genetics. 1988;14(1):65–74. doi: 10.1007/BF00405856. [DOI] [Google Scholar]
- Palmer & Stein (1986).Palmer JD, Stein DB. Conservation of chloroplast genome structure among vascular plants. Current Genetics. 1986;10(11):823–833. doi: 10.1007/BF00418529. [DOI] [Google Scholar]
- Palmer & Thompson (1982).Palmer JD, Thompson WF. Chloroplast DNA rearrangements are more frequent when a large inverted repeat sequence is lost. Cell. 1982;29(2):537–550. doi: 10.1016/0092-8674(82)90170-2. [DOI] [PubMed] [Google Scholar]
- Paradis & Schliep (2019).Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019;35(3):526–528. doi: 10.1093/bioinformatics/bty633. [DOI] [PubMed] [Google Scholar]
- Peñalba et al. (2014).Peñalba JV, Smith LL, Tonione MA, Sass C, Hykin SM, Skipwith PL, McGuire JA, Bowie RCK, Moritz C. Sequence capture using PCR-generated probes: a cost-effective method of targeted high-throughput sequencing for nonmodel organisms. Molecular Ecology Resources. 2014;14:1000–1010. doi: 10.1111/1755-0998.12249. [DOI] [PubMed] [Google Scholar]
- Qu et al. (2019).Qu X-J, Moore MJ, Li D-Z, Yi T-S. PGA: a software package for rapid, accurate, and flexible batch annotation of plastomes. Plant Methods. 2019;15(1):50. doi: 10.1186/s13007-019-0435-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rabah et al. (2017).Rabah SO, Lee C, Hajrah NH, Makki RM, Alharby HF, Alhebshi AM, Sabir JSM, Jansen RK, Ruhlman TA. Plastome sequencing of ten nonmodel crop species uncovers a large insertion of mitochondrial DNA in cashew. Plant Genome. 2017;10(3):1–14. doi: 10.3835/plantgenome2017.03.0020. [DOI] [PubMed] [Google Scholar]
- Raubeson & Jansen (1992).Raubeson LA, Jansen RK. Chloroplast DNA evidence on the ancient evolutionary split in vascular land plants. Science. 1992;255(5052):1697–1699. doi: 10.1126/science.255.5052.1697. [DOI] [PubMed] [Google Scholar]
- Revell (2012).Revell LJ. phytools: an R package for phylogenetic comparative biology (and other things) Methods in Ecology and Evolution. 2012;3(2):217–223. doi: 10.1111/j.2041-210X.2011.00169.x. [DOI] [Google Scholar]
- Rozas et al. (2017).Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, Sánchez-Gracia A. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Molecular Biology and Evolution. 2017;34(12):3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
- Ruhfel et al. (2014).Ruhfel BR, Gitzendanner MA, Soltis PS, Soltis DE, Burleigh J. From algae to angiosperms-inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes. BMC Evolutionary Biology. 2014;14(1):23. doi: 10.1186/1471-2148-14-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruhlman & Jansen (2014).Ruhlman TA, Jansen RK. The plastid genomes of flowering plants. In: Maliga P, editor. Chloroplast Biotechnology: Methods and Protocols—Methods in Molecular Biology. Totowa: Humana Press; 2014. pp. 3–38. [DOI] [PubMed] [Google Scholar]
- Sabir et al. (2014).Sabir J, Schwarz E, Ellison N, Zhang J, Baeshen NA, Mutwakil M, Jansen R, Ruhlman T. Evolutionary and biotechnology implications of plastid genome variation in the inverted-repeat-lacking clade of legumes. Plant Biotechnology Journal. 2014;12(6):743–754. doi: 10.1111/pbi.12179. [DOI] [PubMed] [Google Scholar]
- Sanderson et al. (2015).Sanderson MJ, Copetti D, Búrquez A, Bustamante E, Charboneau JLM, Eguiarte LE, Kumar S, Lee HO, Lee J, McMahon M, Steele K, Wing R, Yang T-J, Zwickl D, Wojciechowski MF. Exceptional reduction of the plastid genome of saguaro cactus (Carnegiea gigantea): Loss of the ndh gene suite and inverted repeat. American Journal of Botany. 2015;102(7):1115–1127. doi: 10.3732/ajb.1500184. [DOI] [PubMed] [Google Scholar]
- Sawyer, Keeler-Wolf & Evens (2009).Sawyer JO, Keeler-Wolf T, Evens JM. A manual of California vegetation. Second Edition. Sacramento: California Native Plant Society Press; 2009. [Google Scholar]
- Schelkunov et al. (2015).Schelkunov MI, Shtratnikova VY, Nuraliev MS, Selosse M-A, Penin AA, Logacheva MD. Exploring the limits for reduction of plastid genomes: a case study of the mycoheterotrophic orchids Epipogium aphyllum and Epipogium roseum. Genome Biology and Evolution. 2015;7(4):1179–1191. doi: 10.1093/gbe/evv019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Selosse, Albert & Godelle (2001).Selosse M-A, Albert B, Godelle B. Reducing the genome size of organelles favours gene transfer to the nucleus. Trends in Ecology and Evolution. 2001;16(3):135–141. doi: 10.1016/S0169-5347(00)02084-X. [DOI] [PubMed] [Google Scholar]
- Shaw et al. (2005).Shaw J, Lickey EB, Beck JT, Farmer SB, Liu W, Miller J, Siripun KC, Winder CT, Schilling EE, Small RL. The tortoise and the hare II: relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis. American Journal of Botany. 2005;92(1):142–166. doi: 10.3732/ajb.92.1.142. [DOI] [PubMed] [Google Scholar]
- Shaw et al. (2007).Shaw J, Lickey EB, Schilling EE, Small RL. Comparison of whole chloroplast genome sequences to choose noncoding regions for phylogenetic studies in angiosperms: the tortoise and the hare III. American Journal of Botany. 2007;94(3):275–288. doi: 10.3732/ajb.94.3.275. [DOI] [PubMed] [Google Scholar]
- Shinozaki et al. (1986).Shinozaki K, Ohme M, Tanaka M, Wakasugi T, Hayashida N, Matsubayashi T, Zaita N, Chunwongse J, Obokata J, Yamaguchi-Shinozaki K, Ohto C, Torazawa K, Meng BY, Sugita M, Deno H, Kamogashira T, Yamada K, Kusuda J, Takaiwa F, Kato A, Tohdoh N, Shimada H, Sugiura M. The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression. EMBO Journal. 1986;5(9):2043–2049. doi: 10.1002/j.1460-2075.1986.tb04464.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silva et al. (2019).Silva SR, Pinheiro DG, Penha HA, Płachno BJ, Michael TP, Meer EJ, Miranda VFO, Varani AM. Intraspecific variation within the Utricularia amethystina Species morphotypes based on chloroplast genomes. International Journal of Molecular Sciences. 2019;20(24):6130. doi: 10.3390/ijms20246130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sinn et al. (2018).Sinn BT, Sedmak DD, Kelly LM, Freudenstein JV. Total duplication of the small single copy region in the angiosperm plastome: Rearrangement and inverted repeat instability in Asarum. American Journal of Botany. 2018;105(1):71–84. doi: 10.1002/ajb2.1001. [DOI] [PubMed] [Google Scholar]
- Soltis, Soltis & Doyle (1992).Soltis PS, Soltis DE, Doyle JJ. Molecular systematics of plants. New York: Springer US; 1992. [Google Scholar]
- Stamatakis (2014).Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sugiura (1992).Sugiura M. The chloroplast genome. Plant Molecular Biology. 1992;19(1):149–168. doi: 10.1007/BF00015612. [DOI] [PubMed] [Google Scholar]
- Timbrook (2007).Timbrook J. Chumash Ethnobotany: plant knowledge among the Chumash people of southern California. Berkeley: Heyday Books; 2007. [Google Scholar]
- The Plant List (2010).The Plant List Version 1. Published on the internet. 2010. http://www.theplantlist.org/ [1 January 2010]. http://www.theplantlist.org/
- Yuan et al. (2018).Yuan Y, Jin X, Liu J, Zhao X, Zhou J, Wang X, Wang D, Lai C, Xu W, Huang J, Zha L, Liu D, Ma X, Wang L, Zhou M, Jiang Z, Meng H, Peng H, Liang Y, Li R, Jiang C, Zhao Y, Nan T, Jin Y, Zhan Z, Yang J, Jiang W, Huang L. The Gastrodia elata genome provides insights into plant adaptation to heterotrophy. Nature Communications. 2018;9(1):1615. doi: 10.1038/s41467-018-03423-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang et al. (2020).Wang L, He N, Li Y, Fang Y, Zhang F. Complete chloroplast genome sequence of Chinese lacquer tree (Toxicodendron vernicifluum, Anacardiaceae) and its phylogenetic significance. BioMed Research International. 2020;2020(1):1–13. doi: 10.1155/2020/9014873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weeks et al. (2014).Weeks A, Zapata F, Pell SK, Daly DC, Mitchell JD, Fine PVA. To move or to evolve: contrasting patterns of intercontinental connectivity and climatic niche evolution in Terebinthaceae (Anacardiaceae and Burseraceae) Frontiers in Genetics. 2014;5:7843. doi: 10.3389/fgene.2014.00409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wicke et al. (2016).Wicke S, Müller KF, DePamphilis CW, Quandt D, Bellot S, Schneeweiss GM. Mechanistic model of evolutionary rate variation en route to a nonphotosynthetic lifestyle in plants. Proceedings of the National Academy of Sciences of the United States of America. 2016;113(32):9045–9050. doi: 10.1073/pnas.1607576113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wicke et al. (2013).Wicke S, Müller KF, De Pamphilis CW, Quandt D, Wickett NJ, Zhang Y, Renner SS, Schneeweiss GM. Mechanisms of functional and physical genome reduction in photosynthetic and nonphotosynthetic parasitic plants of the broomrape family. Plant Cell. 2013;25(10):3711–3725. doi: 10.1105/tpc.113.113373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wicke et al. (2014).Wicke S, Schäferhoff B, DePamphilis CW, Müller KF. Disproportional plastome-wide increase of substitution rates and relaxed purifying selection in genes of carnivorous Lentibulariaceae. Molecular Biology and Evolution. 2014;31(3):529–545. doi: 10.1093/molbev/mst261. [DOI] [PubMed] [Google Scholar]
- Wicke et al. (2011).Wicke S, Schneeweiss GM, DePamphilis CW, Müller KF, Quandt D. The evolution of the plastid chromosome in land plants: gene content, gene order, gene function. Plant Molecular Biology. 2011;76(3–5):273–297. doi: 10.1007/s11103-011-9762-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu & Chaw (2014).Wu C-S, Chaw S-M. Highly rearranged and size-variable chloroplast genomes in conifers II clade (cupressophytes): evolution towards shorter intergenic spacers. Plant Biotechnology Journal. 2014;12(3):344–353. doi: 10.1111/pbi.12141. [DOI] [PubMed] [Google Scholar]
- Wu et al. (2011).Wu C-S, Lin C-P, Hsu C-Y, Wang R-J, Chaw S-M. Comparative chloroplast genomes of pinaceae: insights into the mechanism of diversified genomic organizations. Genome Biology and Evolution. 2011;3:309–319. doi: 10.1093/gbe/evr026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yi, Miller & Wen (2004).Yi T, Miller AJ, Wen J. Phylogenetic and biogeographic diversification of Rhus (Anacardiaceae) in the Northern Hemisphere. Molecular Phylogenetics and Evolution. 2004;33(3):861–879. doi: 10.1016/j.ympev.2004.07.006. [DOI] [PubMed] [Google Scholar]
- Yi, Miller & Wen (2007).Yi T, Miller AJ, Wen J. Phylogeny of Rhus (Anacardiaceae) Based on sequences of nuclear Nia-i3 intron and chloroplast trnC-trnD. Systematic Botany. 2007;32(2):379–391. doi: 10.1600/036364407781179635. [DOI] [Google Scholar]
- Young (1972).Young DA. The reproductive biology of Rhus Integrifolia and Rhus ovata (Anacardiaceae) Evolution. 1972;26(3):406–414. doi: 10.1111/j.1558-5646.1972.tb01945.x. [DOI] [PubMed] [Google Scholar]
- Young (1974).Young DA. Introgressive hybridization in two southern California species of Rhus (Anacardiaceae) Brittonia. 1974;26(3):241. doi: 10.2307/2805727. [DOI] [Google Scholar]
- Zhang et al. (2020).Zhang R-S, Yang J, Hu H-L, Xia R-X, Li Y-P, Su J-F, Li Q, Liu Y-Q, Qin L. A high level of chloroplast genome sequence variability in the sawtooth oak Quercus acutissima. International Journal of Biological Macromolecules. 2020;152:340–348. doi: 10.1016/j.ijbiomac.2020.02.201. [DOI] [PubMed] [Google Scholar]
- Zhu et al. (2016).Zhu A, Guo W, Gupta S, Fan W, Mower JP. Evolutionary dynamics of the plastid inverted repeat: the effects of expansion, contraction, and loss on substitution rates. New Phytologist. 2016;209(4):1747–1756. doi: 10.1111/nph.13743. [DOI] [PubMed] [Google Scholar]
- Zuo et al. (2019).Zuo R, Sun C, Chen C, Jiang P. Complete chloroplast genome of Rhus chinensis by de novo sequencing. Mitochondrial DNA Part B. 2019;4(1):1672–1673. doi: 10.1080/23802359.2019.1604183. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The following information was supplied regarding data availability:
Complete plastid genomes of Rhus accessions (GenBank format) are available in File S1. Repeat locations are available in File S2. Primer sequences are available in File S3. The plastid coding gene alignment for Sapindales is available in File S4. MAUVE alignment of 54 plastomes is available in File S5.