Abstract
Plastid sequences are a cornerstone in plant systematic studies and key aspects of their evolution, such as uniparental inheritance and absent recombination, are often treated as axioms. While exceptions to these assumptions can profoundly influence evolutionary inference, detecting them can require extensive sampling, abundant sequence data, and detailed testing. Using advancements in high-throughput sequencing, we analyzed the whole plastomes of 65 accessions of Picea, a genus of ∼35 coniferous forest tree species, to test for deviations from canonical plastome evolution. Using complementary hypothesis and data-driven tests, we found evidence for chimeric plastomes generated by interspecific hybridization and recombination in the clade comprising Norway spruce (P. abies) and 10 other species. Support for interspecific recombination remained after controlling for sequence saturation, positive selection, and potential alignment artifacts. These results reconcile previous conflicting plastid-based phylogenies and strengthen the mounting evidence of reticulate evolution in Picea. Given the relatively high frequency of hybridization and biparental plastid inheritance in plants, we suggest interspecific plastome recombination may be more widespread than currently appreciated and could underlie reported cases of discordant plastid phylogenies.
Keywords: hybridization, chloroplast genomes, phylogenetic incongruence, recombination, reticulate evolution
Introduction
Technical feasibility coupled with an apparently simple mode of evolution have made plastid genomes (plastomes) a workhorse of plant systematics. While nuclear genomes vary in size by orders of magnitude, a typical land plant plastome contains around 115 unique genes distributed over 120–160 kb (Bock 2007). Plastid genes are generally conserved and encode the two photosystems, the large subunit of the carbon fixing enzyme RuBisCO, and ∼30 other housekeeping proteins. Compared with the nuclear genome, plastome mutation rates are characteristically low and coding sequences may be subject to stronger functional constraints (Wolfe et al. 1987). In addition, the uniparental plastid inheritance found in most species is expected to result in clonal lineages, a smaller effective population size (Ne) than the nuclear genome, and the absence of sexual recombination.
Exceptions to nearly every assumption of this simple model of plastome evolution are known (reviewed in Wolfe and Randle 2004). Of these, biparental plastid inheritance is particularly widespread phenomenon. Plastids are most often transmitted by the seed parent in angiosperms but ∼20% of species have plastids present in mature pollen, which indicates the potential for biparental inheritance (Corriveau and Coleman 1988; Zhang et al. 2003). In coniferous gymnosperms, where plastid inheritance is typically paternal, realized biparental inheritance has been documented in Pinus (White 1990), Picea (Sutton et al. 1991), and Larix (Szmidt et al. 1987). Heteroplasmy resulting from biparental inheritance can be expected to be transient due to the vegetative sorting of organelles at every cell division (Wolfe and Randle 2004). However, a recent study of whole plastomes in Citrus revealed heteroplasmy consistent with hybridization in germplasm from mature trees (Carbonell-Caballero et al. 2015). Biparental plastid inheritance and stable heteroplasmy allow for violation of another axiom of plastome evolution: sexual recombination.
Robust evidence for recombination between biparentally inherited plastomes in natural populations is lacking. However, population studies of Pinus (Marshall et al. 2001) and Cycas (Huang et al. 2001) have invoked recombination to explain unusual patterns of plastome polymorphism and linkage disequilibirum. Interspecific recombination has been hypothesized as the source of the markedly different phylognies inferred from different plastid loci in Silene (Erixon and Oxelman 2008), Picea (Bouillé et al. 2011), and the diatom genus Pseudo-nitzschia (D’Alelio and Ruggiero 2015). Given that all sites in the plastome should share the same topology under clonal inheritance, recombination may more broadly underlie cases of discordance among plastid loci in studies where its presence has not been inferred (e.g. polygrammoid ferns, Schneider et al. 2004; Asteraceae, Doorduin et al. 2011). As recombination can severely bias population and phylogenetic analyses (e.g. Schierup and Hein 2000), new detection methods are warranted. In particular, approaches using complete chromosomes or organelles can exploit the expected spatial mosaic of evolutionary histories generated by recombination (e.g. Maynard Smith 1992) and use structural features to identify probable breakpoints or hotspots.
Repetitive sequences contribute to intra- and intermolecular recombination during plastome replication and repair (Day and Madesis 2007; Maréchal and Brisson 2010). During intermolecular recombination, repeats can act as recognition or cleavage sites through primary and secondary structures (Kawata et al. 1997). In addition, repeats may help maintain plastome size and content during recombinational DNA repair. For example, double-stranded breaks in Chlamydomonas plastomes are repaired through homologous recombination, which can result in an unstable plastome dimer that is reduced through intramolecular recombination between repeats (Odom et al. 2008). This process could explain why sexual recombination hotspots in the Chlamydomonas plastome are correlated with dispersed and inverted repeats (Lemieux et al. 1990; Odom et al. 2008). Based on this evidence, we reasoned rare sexual recombination in land plant plastomes should also be associated with repeats.
We developed and applied phylogenetic tests for recombination to 65 whole plastomes representing all ∼35 extant species of Picea, a genus of coniferous trees broadly distributed in the northern hemisphere (Farjon 1991; Eckenwalder 2009). In contrast to the canonical quadripartite plastome structure found in most plants, the large inverted repeat (IR) region is highly reduced in Picea and other conifers (Strauss et al. 1988; Wu et al. 2011). Instead, the Picea plastome contains three single copy regions (the F1, F2, and F3 regions), which are intervened by three large repeats (Wu et al. 2011, see fig. 1). Large structural rearrangements between these repeats have been reported (Wu et al. 2011; Nystedt et al 2013), which supports their role as recombination substrates during intra- and intermolecular recombination. We predicted sexual recombination in Picea would generate different evolutionary histories among the F1, F2, and F3 regions—regardless of genomic context, mutation rates, or natural selection—but not within them. Combining this hypothesis-based approach with purely data-driven tests, we identified at least one interspecific recombination event that produced a chimeric plastome inherited by over one-third of Picea species. This result remained after controlling for process that may generate patterns similar to recombination, such as process heterogeneity, saturation, and positive selection. Given that hybridization occurs in ∼15% of plant genera (e.g. Ellstrand et al. 1996; Whitney et al. 2010), such recombinant plastomes may not be unique to Picea.
Fig. 1.
Consensus map of the Picea plastome. The outermost ring displays genes colored by functional group: genes on the inside and outside of the circle are transcribed in the clockwise and counterclockwise directions, respectively. The second circle displays the boundaries of the three major structural regions (F1, F2, and F3) of the plastome, which are intervened by large repeats represented by white arrows (not to scale). The innermost circle depicts average pairwise nucleotide diversity (π) estimated in 350 bp moving windows.
Results
Plastome Assembly and Divergence
We used whole-genome shotgun sequencing to generate 61 draft plastomes, which together with existing assemblies represent all 35 commonly recognized Picea species. Total genomic DNA was sequenced on three different Illumina platforms at differing multiplexing levels and read lengths (see “Materials and Methods” section), but all assemblies had high coverage and large scaffolds (table 1). De novo plastomes averaged 121,885 bp (± 2,861) or ∼98% of reported reference-quality assemblies (Wu et al. 2011; Nystedt et al. 2013; Jackman et al. 2015; Yang et al. 2016). Assemblies generally comprised five large nonoverlapping scaffolds, which correspond to the intervening regions between the two copies of the highly reduced inverted repeat (IRa, IRb) and the three large Pinaceae-specific dispersed repeats (fig. 1). Plastomes assembled from two paired-end libraries were more contiguous despite lower sequencing depth and shorter reads (table 1), possibly because of improved resolution of repetitive sequences (Prjibelski et al. 2014) or differences in Illumina chemistry. Assembly statistics, detailed locality information, and Genbank and Short Read Archive (SRA) accession numbers for each individual are reported in supplementary table S1, Supplementary Material online.
Table 1.
Sequencing and Assembly Metrics for 61 Picea Draft Plastomes.
| Sequencing system | Mean number of scaffolds | N50 scaffold length (bp) | Average largest scaffold (bp) | Mean read depth |
|---|---|---|---|---|
| Illumina HiSeq 2000 and 2500 2×100 bp, 2×125 bp | 5.77 (1.76) | 56,475 (2,528) | 66,112 (13,562) | 46.83 (23.32) |
| Illumina HiSeq 4000 2×150 bp | 6.75 (1.35) | 22,939 (1,162) | 55,847 (6,949) | 413.51 (239.18) |
| Illumina MiSeq 2×100 bp | 7 | 23,241 | 55,799 | 47.17 |
Note.—Standard errors are reported in parentheses.
Subsequent analyses included four additional published Picea plastomes (Wu et al. 2011; Nystedt et al. 2013; Jackman et al. 2015; Yang et al. 2016), yielding a total of 65 assembled genomes. Spruce plastomes were highly similar, averaging 97.9% sequence identity in pair-wise comparisons. Gene content was conserved and comprised 74 protein-coding genes, 36 tRNA genes, and four rRNAs. No evidence of rearrangements within scaffolds was found, but we were unable to evaluate structural changes spanning multiple scaffolds, such as the inversion of the F2 region between P. morrisonicola and P. abies (Nystedt et al. 2013). Indels were found in only three genes: accD, chlN, and ycf1.
Nucleotide diversity (π) across the plastome was relatively low (π = 0.0030; table 2). Sliding-scale estimates of π in nonoverlapping 350-bp windows revealed three “hotspots” of diversity occurring near the boundaries of the structural regions (fig. 1). Five genes (psbH, psbT, rpl23, ycf1, ycf2) were more polymorphic than intergenic spacers on average. Notably, ycf1 and ycf2 together contain 57% of all coding sequence polymorphisms (supplementary table S2, Supplementary Material online). Mean synonymous and nonsynonymous divergence was low (dN = 0.017 ± 0.024, dS = 0.040 ± 0.031) but varied markedly among genes (fig. 2). Mean substitution rates were significantly elevated in the ycf genes (GLM, t = 2.828, P < 0.001)—a pattern driven entirely by ycf1 and ycf2—and in the photosystem II complex psb (GLM, t = 2.103, P < 0.05; fig. 2). Six genes comprised solely nonsynonymous substitutions while eight contained only synonymous substitutions (fig. 2 and supplementary table S2, Supplementary Material online).
Table 2.
Alignment Length, Number of Polymorphic Sites, and Nucleotide Diversity (π) by Genomic Context Estimated from 65 Picea Plastomes.
| Context | Alignment length (bp) | Bi-allelic sites | π |
|---|---|---|---|
| Protein-coding exons | 61,256 | 430 | 0.0020 (1.6×10−4) |
| Four-fold synonymous sites | 7,720 | 180 | 0.0025 (2.9×10−4) |
| tRNAs | 2,874 | 12 | 0.0015 (1.6×10−4) |
| rRNAs | 4,522 | 13 | 0.0002 (7.6×10−5) |
| Introns | 9,929 | 197 | 0.0023 (3.2×10−4) |
| Intergenic spacers | 49,373 | 1,681 | 0.0048 (1.1×10−4) |
| Whole plastome | 127,954 | 2,333 | 0.0030 (6.6×10−5) |
Note.—Standard errors are reported in parentheses.
Fig. 2.
Synonymous (dS) and nonsynonymous (dN) substitution rates for Picea plastome genes grouped by complex and function. Rates were estimated using the maximum likelihood method implemented in PAML (Yang 2007). Substitution rates were significantly (P < 0.05) elevated in the photosystem II (psb) and hypothetical open reading frame (ycf) genes. Per-gene estimates are reported in supplementary table S2, Supplementary Material online.
The variation in substitution rates among genes prompted us to test for selection acting on them. Codon-based models of nucleotide substitution strongly supported (P ≤ 0.001) positive evolution (ω > 1) in 12 out of 70 genes (table 3). Five additional genes were significant at 0.001 < P ≤ 0.05 but we did not consider them further (supplementary table S2, Supplementary Material online). Most genes had one or two positively selected codons, but the massive ycf1 gene contained 73 sites (3.5% of the predicted protein) potentially under selection. However, many of these sites were located within repetitive sequences and poor alignment may have generated false positives. After excluding sites in tandem repeats, positive selection was still supported at 21 codons (presented in table 3).
Table 3.
Positive Selection Inferred from the M1a and M2a Random Site Models Implemented in PAML (P ≤ 0.001).
| Complex | Gene | −ln(M1a) | −ln(M2a) | Sites |
|---|---|---|---|---|
| ATP synthase | atpA | −2210.70 | −2182.54 | 185(P,L), 232(P,S), 253(K,N), 299(S,L), 314(V,L), 427(A,T) |
| atpE | −617.32 | −578.11 | 119(P,L) | |
| Photosystem I | psaC | −352.21 | −342.13 | 35(K,R) |
| Photosystem II | psbA | −1517.02 | −1481.65 | 155(T,S,A) |
| psbD | −1538.38 | −1525.49 | 156(P,S) | |
| psbH | −346.92 | −337.02 | 16(S,A), 21(R,T) | |
| psbT | −198.05 | −168.65 | 30(P,L,Q,S) | |
| Large ribosomal subunit | rpl22 | −683.02 | −664.74 | 7(N,Y,K) |
| RNA polymerase | rpoC2 | −5556.69 | −5540.32 | 326(N,K), 870(R,I) |
| Small ribosomal subunit | rps19 | −399.85 | −388.39 | 20(K,Q) |
| Hypothetical open reading frame | ycf1 | −13094.20 | −12574.89 | 68(A,T), 176(V,A), 502(S,Y), 612(L,I), 619(H,Y), 739(R,K), 959(F,L), 1054(A,S), 1083(L,F), 1139(N,K), 1149(S,A), 1223(F,L), F1306(F,L), 1378(L,F), 1383(S,L), 1507(S,L), 1633(L,F), 1708(D,E), 1709(K,Q), 1790(V,T), 1912(R,E) |
| ycf4 | −815.00 | −799.87 | 118(L,P) |
Note.—Positively selected sites have ω > 1 with posterior probability ≥ 0.95 according to the Bayes empirical Bayes method. Selected site are listed in parenthesis. Site number refers to the position in the Picea alignment for that protein.
Phylogenetic Discordance in the Picea Plastome
Phylogenetic analysis of the 65 whole plastomes using maximum likelihood resolved five major clades comprising: (1) the North American P. engelmannii, P. glauca, and P. mexicana (hereinafter “glauca”), (2) P. abies and predominately northeast Asian taxa (“abies”), (3) P. mariana and four geographically disparate species (“mariana”), (4) taxa from the Qinghai-Tibetan Plateau and surrounding regions (“likiangensis”), and (5) a diverse clade containing species representing the entire geographic range of genus (“chihuahuana”; fig. 3). Relationships within most clades were strongly supported with the notable exception of the “abies” clade, in which several nodes had <50% support. The uncertain topology is likely due to the rapid and recent radiation of this clade during the last two million years (Lockwood et al. 2013; Ran et al. 2015). In contrast, the backbone topology of the genus was poorly resolved, with 52% of trees supporting a “glauca”-“abies” sister relationship and 39% supporting a most recent common ancestor of the “glauca”, “abies”, and “mariana” clades.
Fig. 3.

Maximum likelihood analysis of the whole plastome alignment of 65 Picea accessions provides strong support for many shallow nodes but the relationships between the “glauca”, “abies”, and “mariana” clades are unresolved. The majority rules extended consensus tree is presented and node labels denote the proportion of bootstrap replicate trees supporting the bipartition. Nodes with ≥ 90% support are indicated by an asterisk. Colored squares denote the geographic distribution of the species. Numbers after species names correspond to the accession information in supplementary table S1, Supplementary Material online.
We analyzed bipartition frequencies in bootstrapped trees to assess the presence of alternate topologies at these two weakly supported backbone nodes. Out of 1,000 replicates, the “glauca” clade occurred in a bipartition with P. sitchensis and P. breweriana at even greater frequency than the relationship included on the consensus tree in figure 3 (46% vs. 39% of bootstrap replicates). Both the “mariana” and “glauca” clades were grouped with the “abies” clade to the exclusion of all other species with almost equal frequency (52% vs. 48%). This result is consistent with prior plastid-based studies, which discovered conflicting sister clades for “abies” taxa depending on the locus analyzed (cf. Bouillé et al. 2011; Ran et al. 2015). These results suggest the presence of highly supported yet conflicting topologies in the whole-plastome alignment.
Saturation, selection, or recombination could explain the near-equal occurrence of conflicting bipartitions in the whole-plastome data set and the disagreement among earlier studies. To assess the causes of phylogenetic discord across the plastome, we estimated phylogenies from five different subsets: (1) protein-coding exons; (2) concatenated intergenic spacers; and (3–5) subsets corresponding to the three major structural regions (F1, F2, and F3) of Pinaceae plastomes, which are subtended by large repeats that likely serve as recombination substrates (Wu et al. 2011; see fig. 1). We reason these repeats could facilitate recombination between divergent plastome copies if biparental inheritance and plastid fusion occurred. Discordance among structural fragments would be consistent with our hypothesis of recombination, but discordance between genic and intergenic regions would be more consistent with parallel mutations from convergence or saturation.
We implemented randomization tests of Robinson–Foulds (RF) tree distances, quartet distances (QD), and the parsimony-based incongruence length difference (ILD) metric to test the significance of phylogenetic incongruence between a priori data subsets. Incongruence between intergenic spacers and protein-coding exons was only supported by the ILD test (table 4), which potentially has elevated Type I and II error rates under a variety of circumstances (Planet 2006). The lack of strongly supported discordance between intergenic and coding sites indicates the absence of pervasive differences in phylogenetic signal, even if some sites are genuinely discordant. In contrast, all three tests strongly supported incongruence between the F3 region versus the F1 and F2 regions (table 4 and fig. 4). While the placement of “mariana” is not highly supported in the F3 region topology, it is excluded from the bipartition comprising the “glauca” and “abies” clades in 99% of bootstrap replicates. In the F1 and F2 phylogenies, “mariana” is resolved as the sister clade of “abies” with high support (fig. 4 and supplementary fig. S1, Supplementary Material online). Together, these results suggest different evolutionary histories among the plastome structural regions.
Table 4.
Significance of Phylogenetic Incongruence between Plastome Subsets According to the Incongruence Length Difference (ILD), Quartet Distance (QD), and Robinson–Foulds (RF) Metric Permutation Tests.
| ILD | QD | RF | |
|---|---|---|---|
| Whole genome and structural region comparisons | |||
| PCEa vs. intergenic | 0.05 | 0.40 | 0.46 |
| F1 vs. F3 region | 0.01 | <0.05 | <0.01 |
| F2 vs. F3 region | 0.01 | <0.001 | <0.01 |
| F2 vs. F1 region | 0.20 | 0.17 | 0.23 |
| Comparisons by genomic context and structural region | |||
| F1 vs. F2 PCE | 0.01 | <0.001 | <0.001 |
| F1 vs. F2 ω < 1 sitesb | 0.16 | 0.26 | 0.11 |
| F2 vs. F2 intergenic | 0.85 | 1.00 | 1.00 |
| F3 vs. F1+F2 intergenic | 0.01 | <0.001 | <0.001 |
| F3 vs. F1+F2 PCE | 0.01 | <0.001 | <0.001 |
| F3 vs. F1+F2 4Dc | 0.86 | <0.001 | <0.001 |
Note.—F1, F2 and F3 regions refer to the three major single copy units of the Picea plastome (see fig. 1). The F1 and F2 regions are concatenated for comparisons by genomic context against the F3 region.
Protein-coding exons.
Protein coding exons with no evidence of positive selection.
Four-fold synonymous sites.
Fig. 4.
Phylogenies inferred from the structural regions of the Picea plastome are strongly discordant according to topology and character based incongruence tests. While the F3 region supports a sister relationship between the “glauca” and “abies” clades, the F2 and F1 regions suggest “abies” and “mariana” share a more recent common ancestor. Full phylogenies are presented in supplementary figure S1, Supplementary Material online.
Differences in gene content, nucleotide diversity, and substitution rates across the three regions could potentially explain the variation in topologies. We tested if phylogenetic discordance between regions could be explained by differences in sequence evolution by comparing only the following: (1) intergenic sites, (2) protein-coding exons, and (3) 4-fold synonymous sites (4D) on each region. Intergenic sites in the F1 and F2 regions yielded statistically indistinguishable topologies but protein-coding exons were significantly discordant (table 4 and supplementary fig. S2, Supplementary Material online). Given the paucity of polymorphic 4D sites on these two regions, we instead compared protein-coding exons without significant evidence of selection. A disproportionate number of codons on the F1 region are putatively under positive selection (see table 3), and their removal yielded topologies no more different than expected by chance (table 4). As discordance between the F1 and F2 regions can be explained by just eight polymorphisms, we concatenated these two regions for tests against the larger F3 region because using a similar number of sites each phylogeny should improve the accuracy of the permutation tests.
Protein-coding exons, intergenic sites, and 4D sites on the concatenated F1 and F2 regions were strongly discordant with those on the F3 region according to all three tests, with the exception of the ILD test on 4D sites (table 4). As there are only ∼100 parsimony informative 4D sites across the entire plastome, the ILD test may have insufficient power to detect discordance (Planet 2006). Each subset of sites (e.g., 4D sites on the F3 region) yielded a similar topology as the entire region (fig. 4), albeit with poorer resolution (supplementary fig. S2, Supplementary Material online). Therefore, phylogenetic discordance between the F3 versus the F1 and F2 regions encompasses intergenic, synonymous, and nonsynonymous sites, and topological differences among the three fragments cannot be adequately explained by saturation or selection.
Recombination Tests
Phylogenetic discordance among structural regions is consistent with our a priori hypothesis of interspecific recombination in the plastome. To further support this inference, we employed RDP4 (Martin et al. 2015) to implement unguided tests for recombination on the whole plastome alignment. This two-step approach helps safeguard against potentially spurious results from exploratory tests yet allows the data to refute our hypothesis that recombination should involve the repeats subtending the three structural regions.
The seven tests implemented in RDP4 identified two putative recombination events, both with breakpoints estimated near region junctions (fig. 5 and table 5). All tests supported a recombinant plastome shared by the “abies” clade originating from parents from the “glauca” and “mariana” clades (table 5). While the majority of the “abies”-type plastome suggested a close relationship with the “glauca” clade, a ∼30 kbp segment was highly similar to P. jezoensis, a species from the “mariana” clade distributed from central Japan to northeastern Siberia. This relationship between the “abies” clade and P. jezoensis is also evident in our a priori phylogenetic analysis, which supports a shared ancestor of the F2 region (fig. 4). Maximum likelihood recombination breakpoints were placed near the repeats bounding the F2 region, but broad confidence intervals also encompassed most of the F1 region (fig. 5). Although the phylogenetic discordance and recombination tests differ somewhat in the extent of the exchange, both support a recombinant plastome in the “abies” clade with at least the F2 region inherited from a “mariana”-like parent.
Fig. 5.
Unguided tests for recombination in the Picea whole plastome alignment, as illustrated by the MaxChi method (Maynard Smith 1992). (A) All seven recombination tests supported a recombinant plastome shared by the entire “abies” clade originating from interspecific recombination with a P. jezoensis-like species. On the y-axis, lines represent the 2×2 contingency χ2 value of the number of polymorphic sites on either side of a given window. Breakpoints are inferred to occur where this value maximized and the recombinant region is shaded in grey. A schematic representing the structural components of the plastome is superimposed on the x-axis. (B) Five tests supported a recombinant plastome shared by the “likiangensis” clade and estimated breakpoints occurred near the F2 and F1 region junctions.
Table 5.
Bonferonni Corrected P Values for the Two Recombination Events Detected in the “abies” and “likiangensis” Clades by Unguided Tests as Implemented in RDP4.
| Recombination test | “abies” | “likiangensis” |
|---|---|---|
| RDP | 6.71×10−09 | 2.36×10−5 |
| GENECOV | 1.89×10−11 | n.s. |
| BootScan | 6.71×10−5 | n.s. |
| MaxChi | 1.81×10−7 | 6.11×10−6 |
| Chimaera | 1.83×10−4 | 6.87×10−8 |
| SiScan | 9.46×10−11 | 1.55×10−3 |
| 3Seq | 4.67×10−4 | 2.57×10−9 |
n.s., not significant.
A second putative recombination event spanning the F1 and F2 fragments involved the “likiangensis” clade of Qinghai-Tibetan Plateau taxa and an unknown species similar to P. pungenş a species from the “mariana” clade (table 5 and fig. 5). The F3 region topology supported a sister relationship between the “likiangensis” and “mariana” clades with 68% bootstrap support (fig. 4), whereas the F1 and F2 regions intercalated taxa from the “likiangensis” and “chihuahuana” clades (fig. 4 and supplementary fig. 2, Supplementary Material online). As the RF, QD, and ILD randomization tests are global measures of discordance, this more equivocal recombination event likely contributed to the highly significant results in table 4.
Discussion
Uniparental inheritance, strong functional constraints, and absent recombination are key but often untested assumptions in plant evolutionary studies (Wolfe and Randle 2004). Using whole genome assemblies and comprehensive taxon sampling, we found that Picea plastomes are highly conserved in sequence and structure but shaped by interspecific recombination and punctuated by putative positive selection. Our results add to the growing body of evidence of complex plastid evolution and demonstrate how overly simple models can lead to misleading inferences.
Sequence Divergence
Plastome substitution rates are typically low in land plants (Wolfe et al. 1987). Consequently, interspecific distances in the Picea plastome are of the same magnitude as intraspecific variation in Pinaceae nuclear genes (e.g. Mosca et al. 2012). Genome-wide substitution rates are not commonly reported in genus-level whole plastome studies, however, substitution rates in Picea are similar to those reported from more limited sets of loci in Pinus (Willyard et al. 2007; Wang and Wang 2014). Both genera have ancient origins in the Cretaceous (Wang et al. 2000; Klymiuk and Stockey 2012), but the diversification of extant pine and spruce species occurred within the last 20 My, perhaps facilitated by a cooling climate after the Miocene optimum (Willyard et al. 2007; Lockwood et al. 2013; Wang and Wang 2014; Ran et al. 2015). Although we did not estimate absolute substitution rates explicitly, the similarity of the Pinus and Picea relative rates and divergence times together suggests a similar per-generation mutation rate in these two Pinaceae genera.
Selection on Picea Plastid Genes
Positive selection on plastid genes has been widely inferred in angiosperms (e.g. Carbonell-Caballero et al. 2015; Hu et al. 2015) and in some cases potentially adaptive changes have been demonstrated at the protein or phenotype level (Mengistu et al. 2000). Gymnosperm plastomes are less studied. An analysis of Pinus plastomes found more limited evidence for positive selection (6/71 genes; Parks et al. 2009), in contrast to the 20% of Picea genes apparently affected. Only two genes, psbH and ycf1, were inferred to have codons with ω > 1 in both Pinus and Picea. To our knowledge, psbH has not been widely implicated in positive evolution in angiosperms, and high ω at this gene may represent a unique attribute of Pinaceae. In contrast, ycf1 is exceptionally divergent across land plants (Dong et al. 2015) and shows elevated ω in some angiosperm lineages (e.g. Hu et al. 2015). Surprisingly, ycf1 encodes an essential component of plastid protein import apparatus (Kikuchi et al. 2013), the hypervariable repeat motifs of which may serve an adaptive function (de Vries et al. 2015).
Consistent with an adaptive explanation, positively selected sites in Picea were disproportionately inferred in genes involved in photosynthesis (table 3). However, nonsynonymous mutations may become fixed through nonadaptive processes. Small effective population sizes, rare recombination, and strong mutational bias are all characteristic of plastomes and can facilitate the fixation of slightly deleterious alleles (Felsenstein 1974; Kimura 1983; Moran 1996). On the other hand, plastids have highly efficient DNA repair systems, which could be expected to purge deleterious mutations (Day and Madesis 2007; Maréchal and Brisson 2010). Site-directed mutagenesis and in vitro characterization of mutant proteins (e.g. Lan et al. 2013) could help clarify the role of adaptive and nonadaptive processes in shaping plastid substitution rates.
Reticulate Evolution and Chimeric Plastomes
Recombination between divergent haplotypes has been invoked to explain phylogenetic discordance among plastid loci (Erixon and Oxelman 2008; Bouillé et al. 2011; D’Alelio and Ruggiero 2015). Although recombination has been demonstrated in somatic hybrids (e.g. Medgyesy et al. 1985), evidence from natural populations is rare and based on limited data (Huang et al. 2001; Marshall et al. 2001; Erixon and Oxelman 2008; Bouillé et al. 2011; D’Alelio and Ruggiero 2015). Using hypothesis and data-driven analyses, we found evidence for at least one interspecific recombination event in Picea plastomes. Although a robust coalescent species tree is not yet available for Picea, topologies inferred for the F3 plastome region closely match those from mitochondrial loci (Ran et al. 2015) and nuclear loci (Zou et al. 2016; Liu J-Q, unpublished data), which suggests the “glauca” and “abies” clades may be related by vertical descent. In contrast, the F2 and potentially the F1 region of the “abies” plastome was inherited horizontally from a member of the “mariana” clade, most likely P. jezoensis or an extinct relative. Formation of chimeric plastomes via interspecific recombination would require at a minimum: (1) hybridization, (2) temporary breakdown of uniparental inheritance, and (3) plastid fusion and intermolecular recombination.
Like many plant taxa, extant Picea are widely interfertile and many “abies” species successfully hybridize with the “mariana” clade in controlled crosses (Wright 1955). In natural populations, many parapatric spruces hybridize readily (e.g. Jaramillo-Correa and Bousquet 2005; Du et al. 2011), including distantly related species (Hamilton and Aitken 2013). Comparison of phylogenies inferred from mitochondrial and plastid loci, which are differentially inherited in conifers, also support widespread introgression throughout the genus (Bouillé et al. 2011; Ran et al. 2015). In particular, P. jezoensis has a “mariana”-type plastome but more closely resembles the “abies” clade at mitochondrial and some nuclear loci (Ran et al. 2015). Fossil and genetic evidence indicate the continuous presence of P. jezoensis and its extinct relative P. hondoensis in northeastern Russia throughout the Pleistocene (Aizawa et al. 2007). The most recent common ancestor of the “abies” clade likely emerged in eastern Asia during this same time period (Ran et al. 2015), and introgression between a P. jezoensis-like species and the “abies” ancestor may have occurred here, prior to the clade’s diversification and colonization of Eurasia.
Mechanisms maintaining uniparental inheritance are varied (Bock 2007), but hybridization or interpopulation crosses have been linked to a breakdown of these processes (Hansen et al. 2007; Ellis et al. 2008; Barnard-Kubow et al. 2017). In Helianthus and Passiflora, rates of plastid leakage vary by family and species but realized biparental inheritance occurs in ∼2–4% of offspring overall, respectively (Hansen et al. 2007; Ellis et al. 2008). As biparental inheritance is not uncommon on an evolutionary timescale, fusion of the two parental plastids is probably the limiting step preventing more widespread recombination (Day and Madesis 2007). To our knowledge, plastid fusion has not been observed directly in natural populations but has been demonstrated experimentally in somatic hybrids (e.g. Medgyesy et al. 1985). Once fused, recombination could occur between highly similar genomes through normal DNA replication and repair mechanisms (Day and Madesis 2007; Maréchal and Brisson 2010). Spread of the chimeric plastome through the ancestor of the “abies” clade may have then proceeded through entirely neutral processes, such as allele surfing during range expansion (Excoffier and Ray 2008).
The prevalence of sexual recombination in plastomes is unknown. Taxa with higher rates of biparental plastid inheritance, including groups prone to hybridization (Hansen et al. 2007; Ellis et al. 2008; Barnard-Kubow et al. 2017), are presumably the most susceptible. In addition, three genera with predominately paternal plastid inheritance, including Picea, have been suggested to form chimeric plastomes (Huang et al. 2001; Marshall et al. 2001), which may indicate a link between this inheritance mode and a propensity for recombination. However, until further research clarifies the factors promoting sexual plastome recombination, adding recombination tests to phylo- and population genetic pipelines is prudent.
A Word of Caution for “Chloroplast Phylogenomics”
Plastome phylogenies effectively estimate the history of a single gene and do not necessarily represent species histories. Incomplete lineage sorting can cause the plastome to vary markedly from the species tree, and introgression can further exacerbate this discordance (Doyle 1992). While whole Picea plastomes yielded high resolution within most clades, they also revealed conflicting histories generated by vertical descent and introgression. Furthermore, the three genomes in Picea suggest incongruent histories (Bouillé et al. 2011; Ran et al. 2015), and substantial gene-tree discordance is also evidence among nuclear loci (Ran et al. 2015). Reconstructing a useful approximation of the species tree for Picea—and potentially many other plant genera—will likely necessitate methods explicitly allowing for gene tree heterogeneity arising from both incomplete lineage sorting and introgression.
Plastome haplotypes for many species were not reciprocally monophyletic, an issue which further confounds the use of single-gene phylogenies as species tree proxies. Haplotype sharing can arise from some combination of incomplete lineage sorting, hybridization, cryptic and poorly delimited species, or misidentified material. For example, Norway spruce (P. abies) was polyphyletic in our data set, with the two Scandinavian plastomes more closely related to P. obovata than to a Norway spruce originating from Serbia. A polyphyletic Norway spruce was noted by Lockwood et al. (2013) in their predominately mitochondrial data set, which indicates a need to clarify species boundaries in this taxon. In the absence of a robust species tree estimate, we urge caution when interpreting plastome phylogenies and note results from exemplar accessions may not generalize to an entire species.
Conclusions
We used whole plastome sequences and comprehensive taxon sampling to characterize sequence divergence, positive selection, and evolutionary histories in Picea. Most significantly, we found chimeric plastomes formed by ancient introgression and recombination between divergent clades. Chimeric plastomes explain the conflicting results of earlier plastid-based studies in Picea: phylogenies inferred from different loci alternately reflect vertical descent and reticulate evolution. As whole plastome sequences become increasingly available, we suggest future studies utilize structural features, such as repeats, to develop phylogenetic hypotheses to test for recombination. Given that hybridization (Ellstrand et al. 1996; Whitney et al. 2010) and the potential for biparental plastid are common in plants (Corriveau and Coleman 1988; Zhang et al. 2003), interspecific plastome recombination may not be unique to Picea.
Materials and Methods
Plant Material
A total of 61 representatives from the 35 commonly recognized Picea species (Farjon 1991; Eckenwalder 2009) were sourced from natural populations and cultivated collections. Multiple accessions were sequenced for broadly distributed species or those with spatially disjunct or fragmented ranges. For cultivated trees, we preferred individuals obtained as seed or seedlings from natural populations and selected wild accessions outside of known hybrid zones whenever possible (see supplementary table S1, Supplementary Material online for provenances and voucher numbers). Additional complete chloroplast genomes for P. abies (Nystedt et al. 2013), P. glauca (Jackman et al. 2015), P. morrisonicola (Wu et al. 2011), and P. jezoensis (Yang et al. 2016) were downloaded from Genbank (supplementary table S1, Supplementary Material online).
Genome Sequencing and Assembly
Total genomic DNA was isolated from leaf material using the Macherey-Nagel NucleoSpin® Plant II kit according to the manufacturer’s instructions (Düren, Germany). Paired-end libraries with ∼350 bp inserts were constructed for 77 accessions for this and another study and sequenced on Illumina HiSeq 2000 and 2500 lanes using 2 × 100 bp and 2 × 125 bp chemistry, respectively, at the National Genomics Infrastructure hosted at Science for Life Laboratory (SciLifeLab, Stockholm, Sweden). Paired-end libraries with ∼350 bp inserts for 17 accessions for this and another study were multiplexed on a single Illumina HiSeq 4000 lane at the University of Texas at Austin (Austin, TX) using 2 × 150 bp chemistry. Finally, an entire lane of 2 × 100 bp reads from the Illumina MiSeq platform was available for one accession. We developed a de novo assembly strategy by first comparing assemblies produced by SPAdes v. 3.7.0 (Bankevich et al. 2012; Prjibelski et al. 2014) using either total genomic reads or reads pre-filtered for similarity to the plastome. For read filtering, we used local alignment in bowtie2 v. 2.2.7 (Langmead and Salzberg 2012) using the P. abies plastome as a reference and tested both the “very fast” and “very sensitive” presets. The most contiguous assemblies with the highest depth of coverage were obtained by using pre-filtered reads and results were nearly identical for both local alignment presets (results not shown). Therefore, we used “very fast” local alignment followed by assembly using the “careful” parameter, default error correction and kmer selection, and a minimum contig size of 500 bp. Sequencing methods, assembly metrics, and Genbank accession numbers for each accession reported in supplementary table S1, Supplementary Material online.
Annotation and Alignment
We tested for structural rearrangements in de novo scaffolds of each accession versus the P. abies reference genome using nucmer, which were visualized as dot plots with mummerplot as contained within the software package MUMmer v. 3.0 (Kurtz et al. 2004). The scaffolds of each accession were aligned against the P. abies reference and pseudomolecules were generated in Geneious v. 9.0.4 for further analysis. Multiple sequence alignment of pseudomolecules was conducted in MAFFT v. 7.245 (Katoh and Standley 2013) using the fast Fourier transformation approximation option, a partition size of 1,000 and three iterative refinements. Sequences were annotated according to the P. abies reference genome using the BLAST-like “transfer annotations” tool in Geneious using an identity cut-off of 60%. All annotations were manually curated to ensure accurate identification of genes, pseudogenes, and reading frames. Plastome maps were created with CIRCOS (Krzywinski et al. 2009).
Sequence Polymorphism and Divergence
Nucleotide diversity (π) was estimated using the R package PopGenome v. 2.1.6 (Pfeifer et al. 2014) by genomic context and in 350-bp sliding windows. Standard errors were estimated using 500 bootstrap replicates. Average synonymous (dS) and nonsynonymous (dN) substitution rates were estimated per-gene using the maximum likelihood method (Goldman and Yang 1994) implemented in the codeml program in PAML v. 4.9 (Yang 2007). We inferred natural selection at the codon-level from the ratio of dN to dS (ω) using nested random site models (M1a and M2a; Wong et al. 2004; Yang et al. 2005) as implemented in codeml. The maximum-likelihood tree inferred from concatenated protein-coding exons was used as the input tree with relationships between the five major clades and any branches with < 70% bootstrap support constrained to polytomies. Both copies of ycf12 and psbI were excluded because they are located within large repeats (white arrows in fig. 1) and were not uniformly recovered across all accessions.
Phylogenetic Analyses
Sequence alignments were prepared for phylogenetic analysis by stripping sites with >20% missing data and removing tandem repeats using Phobos v. 3.3.12 (http://www.ruhr-uni-bochum.de/spezzoo/cm/cm_phobos.htm; last accessed March 25, 2017). We used PartitionFinder v. 1.1.1 to identify best-fit partitioning schemes of nucleotide evolution (Lanfear et al. 2012, 2014). We defined an initial model giving each protein-coding gene and codon position a partition, and tRNAs, rRNAs, introns were each assigned to a single partition, respectively. We constrained the model of nucleotide evolution to GTR + Γ and used the linked branch lengths model. Partition searches were performed using the “rcluster” clustering algorithm with the default percentage. Optimal partitioning schemes identified using the whole genome data were subsequently applied to each of the five data subsets: (1) protein-coding exons, (2) concatenated intergenic spaces, and (3–5) the three structural regions (F1, F2, and F3) of the Picea plastome. Phylogenies were inferred from the partitioned data sets in RAxML v. 8.2.4 (Stamatakis 2014) using rapid bootstrapping under the autoMRE convergence criterion and were summarized on an extended majority rule consensus tree (Aberer et al. 2010). Trees are rooted by P. breweriana for display, which has been inferred to be the sister lineage to all other extant spruces in previous studies (Bouillé et al. 2011; Lockwood et al. 2013; Ran et al. 2015).
We tested for phylogenetic discordance using character and tree-based methods. The character-based ILD test was performed using PAUP* v. 4.0a149 (Swofford 2002) using 100 replicates with a heuristic starting seed, random-stepwise additions, and nearest-neighbor interchange branch swapping with the search limited to 100,000 re-arrangements for computational feasibility. Permutation tests based on tree QD (Bryant et al. 2000) and RF (Robinson and Foulds 1981) distances tested if phylogenies estimated from different a priori subsets are more different than expected by chance. To simulate tree distances under the null hypothesis, we calculated QD and RF values between 10,000 trees estimated from randomly sampled plastome sites. Sets of random sites were selected to be comparable in size to the subsets of interest. Then, we calculated RF and QD distances between 10,000 randomly selected pairs of trees sampled from the bootstrap replicates of the a priori subsets. We implemented the permutation test using the R package “perm” (Fay and Shaw 2010) using 2,000 replications and estimated P values using the default algorithm. QDs were calculated using the program rqtdist v. 1.0 (Sand et al. 2013) and RF distances in the R packages “phangorn” (Schliep 2011) and “ape” (Paradis et al. 2004).
Recombination Tests
Unguided tests for recombination were conducted in the RDP4 package (Martin et al. 2015) using the RDP (Martin and Rybicki 2000), GENECOV, BootScan (Salminen et al. 1995), MaxChi (Maynard Smith 1992), Chimaera (Posada and Crandall 2001), SiScan (Padidam et al. 1999), and 3seq (Boni et al. 2007) methods. After identifying a recombination event, RDP4 implements a hidden Markov model to estimate breakpoint positions. Then, RDP4 identifies the probable recombinant sequence using the PHYLPRO (Weiller 1998), VISRD (Lemey et al. 2009), and EEEP (Beiko and Hamilton 2006) methods. Bonferrroni corrections were applied to set the family-wise error rate to 0.05. We tested the whole-plastome alignment for recombination and an alignment stripped of gaps and poorly aligned regions but the results did not differ. In addition, altering window sizes and indel processing options did not alter the results so default values were used for the final runs.
Supplementary Material
Supplementary data are available at Molecular Biology and Evolution online.
Supplementary Material
Acknowledgments
We are grateful to Daniel Luscombe (Bedgebury National Pinetum), Peter Brownless (Royal Botanic Garden Edinburgh) Charles Keith (Charles Keith Arboretum), Erik Dahl Kjær and Albin Lobo (University of Copenhagen, Denmark), Piotr Banaszczak (Rogów Arboretum), Johnny Schimmel (Arboretum Norr), Dan Crowley (Westonbirt, The National Arboretum), László Csiba and Rhinaixa Duque-Thüs (Royal Botanic Gardens, Kew), Jaakko Saarinen (Mustila Arboretum), and Jianquan Liu (Lanzhou University and Sichuan University) for contributing samples to this study. We acknowledge support from Science for Life Laboratory and the National Genomics Infrastructure, Sweden for providing assistance in massive parallel sequencing. Computations were performed on resources provided by SNIC through Uppsala Multidisciplinary Center for Advanced Computational Science (UPPMAX) under Project SNIC 2015-6-67. Douglas Scofield at UPPMAX and Nicolas Delhomme at the Umeå Plant Science Center (UPSC) Bioinformatics Facility provided bioinformatics support. This study was supported by the Trees and Crops for the Future (TC4F) program. ARS was supported in part by Bergvik Skog AB. Constructive criticism from anonymous reviewers improved the manuscript.
References
- Aberer AJ, Pattengale ND, Stamatakis A.. 2010. Parallelized phylogenetic post-analysis on multi-core architectures. J Comput Sci. 1:107–114. [Google Scholar]
- Aizawa M, Yoshimaru H, Saito H, Katsuki T, Kawahara T, Kitamura K, Shi F, Kaji M.. 2007. Phylogeography of a northeast Asian spruce, Picea jezoensis, inferred from genetic variation observed in organelle DNA markers. Mol Ecol. 16:3393–3405. [DOI] [PubMed] [Google Scholar]
- Bankevich A, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19:455–477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barnard-Kubow KB, McCoy MA, Galloway LF.. 2017. Biparental chloroplast inheritance leads to rescue from cytonuclear incompatibility. New Phytol. 213:1466–1476. [DOI] [PubMed] [Google Scholar]
- Beiko RG, Hamilton N.. 2006. Phylogenetic identification of lateral genetic transfer events. BMC Evol Biol. 6:15–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bock R. 2007. Structure, function, and inheritance of plastid genomes In: Bock R, editor. Cell and molecular biology of plastids. Berlin, Heidelberg: Springer Berlin Heidelberg; p. 29–63. [Google Scholar]
- Boni MF, Posada D, Feldman MW.. 2007. An exact nonparametric method for inferring mosaic structure in sequence triplets. Genetics 176:1035–1047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bouillé M, Senneville S, Bousquet J.. 2011. Discordant mtDNA and cpDNA phylogenies indicate geographic speciation and reticulation as driving factors for the diversification of the genus Picea. Tree Gene Genomes 7:469–484. [Google Scholar]
- Bryant D, Tsang J, Jearney PE, Li M.. 2000. Computing the quartet distance between evolutionary trees In: Proceedings of the Eleventh Annual ACM-SIAM Symposium on Discrete Algorithms. New York City: ACM Press; p. 285–286. [Google Scholar]
- Carbonell-Caballero J, Alonso R, Ibañez V, Terol J, Talon M, Dopazo J.. 2015. A phylogenetic analysis of 34 chloroplast genomes elucidates the relationships between wild and domestic species within the genus Citrus. Mol Biol Evol. 32:2015–2035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corriveau JL, Coleman AW.. 1988. Rapid screening method to detect potential biparental inheritance of plastid DNA and results for over 200 angiosperms. Am J Bot. 75:1443–1458. [Google Scholar]
- D’Alelio D, Ruggiero MV.. 2015. Interspecific plastidial recombination in the diatom genus Pseudo-nitzschia. J Phycol. 51:1024–1028. [DOI] [PubMed] [Google Scholar]
- Day A, Madesis P.. 2007. DNA replication, recombination, and repair in plastids In: Bock R, editor. Cell and molecular biology of plastids. Berlin, Heidelberg: Springer Berlin Heidelberg; p. 65–119. [Google Scholar]
- de Vries J, Sousa FL, Bölter B, Soll J, Gould SB.. 2015. YCF1: a green TIC?. Plant Cell 27:1827–1833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dong W, Xu C, Li C, Sun J, Zuo Y, Shi S, Cheng T, Guo J, Zhou S.. 2015. ycf1, the most promising plastid DNA barcode of land plants. Sci Rep. 5:8348.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doorduin L, Gravendeel B, Lammers Y, Ariyurek Y, Chin-A-Woeng T, Vrieling K.. 2011. The complete chloroplast genome of 17 individuals of pest species Jacobaea vulgaris: SNPs, microsatellites and barcoding markers for population and phylogenetic studies. DNA Res. 28:93–105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doyle JJ. 1992. Gene trees and species trees: molecular systematics as one-character taxonomy. Syst Bot. 17:144–163. [Google Scholar]
- Du FK, Peng XL, Liu JQ, Lascoux M, Hu FS, Petit RJ.. 2011. Direction and extent of organelle DNA introgression between two spruce species in the Qinghai-Tibetan Plateau. New Phytol. 192:1024–1033. [DOI] [PubMed] [Google Scholar]
- Eckenwalder JE. 2009. Conifers of the world: the complete reference. Portland (OR: ): Timber Press, Inc. [Google Scholar]
- Ellis JR, Bentley KE, McCauley DE.. 2008. Detection of rare paternal chloroplast inheritance in controlled crosses of the endangered sunflower Helianthus verticillatus. Heredity 100:574–580. [DOI] [PubMed] [Google Scholar]
- Ellstrand NC, Whitkus R, Rieseberg LH.. 1996. Distribution of spontaneous plant hybrids. Proc Natl Acad Sci U S A. 93:5090–5093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Erixon P, Oxelman B.. 2008. Reticulate or tree-like chloroplast DNA evolution in Sileneae (Caryophyllaceae)?. Mol Phylogenet Evol. 48:313–325. [DOI] [PubMed] [Google Scholar]
- Excoffier L, Ray N.. 2008. Surfing during population expansions promotes genetic revolutions and structuration. Trends Ecol Evol. 23:347–351. [DOI] [PubMed] [Google Scholar]
- Farjon A. 1991. Pinaceae: drawings and descriptions of the genera: Abies, Cedrus, Pseudolarix, Keteleeria, Nothotsuga, Tsuga, Cathaya, Pseudotsuga, Larix and Picea. Königstein: Koeltz Scientific Books. [Google Scholar]
- Fay MP, Shaw PA.. 2010. Exact and asymptotic weighted logrank tests for interval censored data: the interval R package. J Stat Softw. 36:1–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Felsenstein J. 1974. The evolutionary advantage of recombination. Genetics 78:737–765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldman N, Yang Z.. 1994. A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol Biol Evol. 11:725–736. [DOI] [PubMed] [Google Scholar]
- Hamilton JA, Aitken SN.. 2013. Genetic and morphological structure of a spruce hybrid (Picea sitchensis x P. glauca) zone along a climatic gradient. Am J Bot. 100:1651–1662. [DOI] [PubMed] [Google Scholar]
- Hansen AK, Escobar LK, Gilbert LE, Jansen RK.. 2007. Paternal, maternal, and biparental inheritance of the chloroplast genome in Passiflora (Passifloraceae): implications for phylogenetic studies. Am J Bot. 94:42–46. [DOI] [PubMed] [Google Scholar]
- Hu S, Sablok G, Wang B, Qu D, Barbaro E, Viola R, Li M, Varotto C.. 2015. Plastome organization and evolution of chloroplast genes in Cardamine species adapted to contrasting habitats. BMC Genomics 16:306.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang S, Chiang YC, Schaal BA, Chou CH, Chiang TY.. 2001. Organelle DNA phylogeography of Cycas taitungensis, a relict species in Taiwan. Mol Ecol. 10:2669–2681. [DOI] [PubMed] [Google Scholar]
- Jackman SD, et al. 2015. Organellar genomes of white spruce (Picea glauca): assembly and annotation. Genome Biol Evol. 8:29–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jaramillo-Correa JP, Bousquet J.. 2005. Mitochondrial genome recombination in the zone of contact between two hybridizing conifers. Genetics 171:1951–1962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM.. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kawata M, Harada T, Shimamoto Y, Oono K, Takaiwa F.. 1997. Short inverted repeats function as hotspots of intermolecular recombination giving rise to oligomers of deleted plastid DNAs (ptDNAs). Curr Genet. 31:179–184. [DOI] [PubMed] [Google Scholar]
- Kikuchi S, Bédard J, Hirano M, Hirabayashi Y, Oishi M, Imai M, Takase M, Ide T, Nakai M.. 2013. Uncovering the protein translocon at the chloroplast inner envelope membrane. Science 339:571–574. [DOI] [PubMed] [Google Scholar]
- Kimura M. 1983. The neutral theory of molecular evolution. Cambridge: Cambridge University Press. [Google Scholar]
- Klymiuk AA, Stockey RA.. 2012. A lower Cretaceous (Valanginian) seed cone provides the earliest fossil record for Picea (Pinaceae). Am J Bot. 99:1069–1082. [DOI] [PubMed] [Google Scholar]
- Krzywinski MI, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA.. 2009. Circos: an information aesthetic for comparative genomics. Genome Res. 19:1639–1645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL.. 2004. Versatile and open software for comparing large genomes. Genome Biol. 5:R12.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lan T, Wang XR, Zeng QY.. 2013. Structural and functional evolution of positively selected sites in pine glutathione S-transferase enzyme family. J Biol Chem. 288:24441–24451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lanfear R, Calcott B, Ho SYW, Guindon S.. 2012. PartitionFinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. Mol Biol Evol. 29:1695–1701. [DOI] [PubMed] [Google Scholar]
- Lanfear R, Calcott B, Kainer D, Mayer C, Stamatakis A.. 2014. Selecting optimal partitioning schemes for phylogenomic datasets. BMC Evol Biol. 14:82.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langmead B, Salzberg SL.. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lemey P, Lott M, Martin DP, Moulton V.. 2009. Identifying recombinants in human and primate immunodeficiency virus sequence alignments using quartet scanning. BMC Bioinformatics 10:1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lemieux B, Turmel M, Lemieux C.. 1990. Recombination of Chlamydomonas chloroplast DNA occurs more frequently in the large inverted repeat sequence than in the single-copy regions. Theor Appl Genet. 79:17–27. [DOI] [PubMed] [Google Scholar]
- Lockwood JD, Aleksić JM, Zou J, Wang J, Liu J, Renner SS.. 2013. A new phylogeny for the genus Picea from plastid, mitochondrial, and nuclear sequences. Mol Phylogenet Evol. 69:717–727. [DOI] [PubMed] [Google Scholar]
- Maréchal A, Brisson N.. 2010. Recombination and the maintenance of plant organelle genome stability. New Phytol. 186:299–317. [DOI] [PubMed] [Google Scholar]
- Marshall HD, Newton C, Ritland K.. 2001. Sequence-repeat polymorphisms exhibit the signature of recombination in lodgepole pine chloroplast DNA. Mol Biol Evol. 18:2136–2138. [DOI] [PubMed] [Google Scholar]
- Martin DP, Rybicki E.. 2000. RDP: detection of recombination amongst aligned sequences. Bioinformatics 16:562–563. [DOI] [PubMed] [Google Scholar]
- Martin DP, Murrell B, Golden M, Khoosal A, Muhire B.. 2015. RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. 1:1–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maynard Smith J. 1992. Analyzing the mosaic structure of genes. J Mol Evol. 34:126–129. [DOI] [PubMed] [Google Scholar]
- Medgyesy P, Fejes E, Maliga P.. 1985. Interspecific chloroplast recombination in a Nicotiana somatic hybrid. Proc Natl Acad Sci U S A. 82:6960–6964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mengistu LW, Mueller-Warrant GW, Liston A, Barker RE.. 2000. psbA Mutation (valine 219 to isoleucine) in Poa annua resistant to metribuzin and diuron. Pest Manag Sci. 56:209–217. [Google Scholar]
- Moran NA. 1996. Accelerated evolution and Muller’s rachet in endosymbiotic bacteria. Proc Natl Acad Sci U S A. 93:2873–2878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mosca E, Eckert AJ, Liechty JD, Wegrzyn JL, La Porta N, Vendramin GG, Neale DB.. 2012. Contrasting patterns of nucleotide diversity for four conifers of Alpine European forests. Evol Appl. 5:762–775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nystedt B, et al. 2013. The Norway spruce genome sequence and conifer genome evolution. Nature 497:579–584. [DOI] [PubMed] [Google Scholar]
- Odom OW, Baek KH, Dani RN, Herrin DL.. 2008. Chlamydomonas chloroplasts can use short dispersed repeats and multiple pathways to repair a double‐strand break in the genome. Plant J. 53:842–853. [DOI] [PubMed] [Google Scholar]
- Padidam M, Sawyer S, Fauquet CM.. 1999. Possible emergence of new geminiviruses by frequent recombination. Virology 265:218–225. [DOI] [PubMed] [Google Scholar]
- Paradis E, Claude J, Strimmer K.. 2004. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20:289–290. [DOI] [PubMed] [Google Scholar]
- Parks M, Cronn R, Liston A.. 2009. Increasing phylogenetic resolution at low taxonomic levels using massively parallel sequencing of chloroplast genomes. BMC Biol. 7:84.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfeifer B, Wittelsbürger U, Ramos-Onsins SE, Lercher MJ.. 2014. PopGenome: an efficient swiss army knife for population genomic analyses in R. Mol Biol Evol. 31:1929–1936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Planet PJ. 2006. Tree disagreement: measuring and testing incongruence in phylogenies. J Biomed Inform. 39:86–102. [DOI] [PubMed] [Google Scholar]
- Posada D, Crandall KA.. 2001. Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proc Natl Acad Sci U S A. 98:13757–13762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prjibelski AD, et al. 2014. ExSPAnder: a universal repeat resolver for DNA region assembly. Bioinformatics 30:293–301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ran JH, Shen TT, Liu WJ, Wang PP, Wang XQ.. 2015. Mitochondrial introgression and complex biogeographic history of the genus Picea. Mol Phylogenet Evol. 93:63–76. [DOI] [PubMed] [Google Scholar]
- Robinson DF, Foulds LR.. 1981. Comparison of phylogenetic trees. Math Biosci. 53:131–147. [Google Scholar]
- Salminen MO, Carr JK, Burke DS, McCutchan FE.. 1995. Identification of breakpoints in intergenotypic recombinants of HIV type 1 by bootscanning. AIDS Res Hum Retroviruses 11:1423–1425. [DOI] [PubMed] [Google Scholar]
- Sand A, Holt MK, Johansen J, Fagerberg R, Brodal GS, Pedersen CN, Mailund T.. 2013. Algorithms for computing the triplet and quartet distances for binary and general trees. Biology 2:1189–1209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schierup MH, Hein J.. 2000. Consequences of recombination on traditional phylogenetic analysis. Genetics 156:879–891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schliep KP. 2011. phangorn: phylogenetic analysis in R. Bioinformatics 27:592–593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schneider H, Smith AR, Cranfill R, Hildebrand TJ, Haufler CH, Ranker TA.. 2004. Unraveling the phylogeny of polygrammoid ferns (Polypodiaceae and Grammitidaceae): exploring aspects of the diversification of epiphytic plants. Mol Phylogenet Evol. 31:1041–1063. [DOI] [PubMed] [Google Scholar]
- Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strauss SH, Palmer JD, Howe GT, Doerksen AH.. 1988. Chloroplast genomes of two conifers lack a large inverted repeat and are extensively rearranged. Proc Natl Acad Sci U S A. 85:3898–3902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sutton BCS, Flanagan D, El-Kassaby YA.. 1991. A simple and rapid method for estimating representation of species in spruce seedlots using chloroplast DNA restriction fragment length polymorphism. Silvae Genet. 40:119–123. [Google Scholar]
- Swofford DL. 2002. PAUP*. Phylogenetic analysis using parsimony (*and other methods). Version 4. Sunderland (MA: ): Sinauer Associates. [Google Scholar]
- Szmidt AE, Aldén T, Hällgren JE.. 1987. Paternal inheritance of chloroplast DNA in Larix. Plant Mol Biol. 9:59–64. [DOI] [PubMed] [Google Scholar]
- Wang B, Wang X-R.. 2014. Mitochondrial DNA capture and divergence in Pinus provide new insights into the evolution of the genus. Mol Phylogenet Evol. 80:20–30. [DOI] [PubMed] [Google Scholar]
- Wang X-Q, Tank DC, Sang T.. 2000. Phylogeny and divergence times in Pinaceae: evidence from three genomes. Mol Biol Evol. 17:773–781. [DOI] [PubMed] [Google Scholar]
- Weiller GF. 1998. Phylogenetic profiles: a graphical method for detecting genetic recombinations in homologous sequences. Mol Biol Evol. 15:326–335. [DOI] [PubMed] [Google Scholar]
- White EE. 1990. Chloroplast DNA in Pinus monticola. 2. Survey of within-species variability and detection of heteroplasmic individuals. Theor Appl Genet. 79:251–255. [DOI] [PubMed] [Google Scholar]
- Whitney KD, Ahern JR, Campbell LG, Albert LP, King MS.. 2010. Patterns of hybridization in plants. Plant Ecol Evol Syst. 12:175–182. [Google Scholar]
- Willyard A, Syring J, Gernandt DS, Liston A, Cronn R.. 2007. Fossil calibration of molecular divergence infers a moderate mutation rate and recent radiations for Pinus. Mol Biol Evol. 24:90–101. [DOI] [PubMed] [Google Scholar]
- Wolfe AD, Randle CP.. 2004. Recombination, heteroplasmy, haplotype polymorphism, and paralogy in plastid genes: implications for plant molecular systematics. Syst Bot. 29:1011–1020. [Google Scholar]
- Wolfe KH, Li W-H, Sharp PM.. 1987. Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc Natl Acad Sci U S A. 84:9054–9058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wong WSW, Yang Z, Goldman N, Nielsen R.. 2004. Accuracy and power of statistical methods for detecting adaptive evolution in protein coding sequences and for identifying positively selected sites. Genetics 168:1041–1051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wright JW. 1955. Species crossability in spruce in relation to distribution and taxonomy. For Sci. 1:319–349. [Google Scholar]
- Wu CS, Lin CP, Hsu CY, Wang RJ, Chaw SM.. 2011. Comparative chloroplast genomes of Pinaceae: insights into the mechanism of diversified genomic organizations. Genome Biol Evol/ 3:309–319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang JC, Joo M, So S, Yi DK, Shin CH, Lee YM, Choi K.. 2016. The complete plastid genome sequence of Picea jezoensis (Pinaceae: Piceoideae). Mitochondrial DNA A 27:3761–3763. [DOI] [PubMed] [Google Scholar]
- Yang Z. 2007. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 24:1586–1591. [DOI] [PubMed] [Google Scholar]
- Yang Z, Wong WSW, Nielsen R.. 2005. Bayes empirical Bayes inference of amino acid sites under positive selection. Mol Biol Evol. 22:1107–1118. [DOI] [PubMed] [Google Scholar]
- Zhang Q, Sodmergen Liu Y.. 2003. Examination of the cytoplasmic DNA in male reproductive cells to determine the potential for cytoplasmic inheritance in 295 angiosperm species. Plant Cell Physiol. 44:941–951. [DOI] [PubMed] [Google Scholar]
- Zou J, Yue W, Li L, Wang X, Lu J, Duan B, Liu J.. 2016. DNA barcoding of recently diversified tree species: a case study on spruces based on 20 DNA fragments from three different genomes. Trees 30:959–969. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




