Abstract
Endosymbiosis has been common all along eukaryotic evolution, providing opportunities for genomic and organellar innovation. Plastids are a prominent example. After the primary endosymbiosis of the cyanobacterial plastid ancestor, photosynthesis spread in many eukaryotic lineages via secondary endosymbioses involving red or green algal endosymbionts and diverse heterotrophic hosts. However, the number of secondary endosymbioses and how they occurred remain poorly understood. In particular, contrasting patterns of endosymbiotic gene transfer (EGT) have been detected and subjected to various interpretations. In this context, accurate detection of EGTs is essential to avoid wrong evolutionary conclusions. We have assembled a strictly selected set of markers that provides robust phylogenomic evidence suggesting that nuclear genes involved in the function and maintenance of green secondary plastids in chlorarachniophytes and euglenids have unexpected mixed red and green algal origins. This mixed ancestry contrasts with the clear red algal origin of most nuclear genes carrying similar functions in secondary algae with red plastids.
Keywords: Chlorarachniophyta, Euglenida, endosymbiotic gene transfer, phylogenomics, plastids
Photosynthesis in eukaryotes takes place in a specialized compartment: the plastid. This organelle first evolved in a common ancestor of Archaeplastida (i.e., Viridiplantae + Rhodophyta + Glaucophyta) through the endosymbiosis of a cyanobacterium inside a eukaryotic host (Moreira and Philippe 2001; Archibald 2009; Keeling 2013). This primary endosymbiotic event entailed massive endosymbiotic gene transfer (EGT) from the cyanobacterial genome to the host nucleus (Weeden 1981; Kleine et al. 2009). Consequently, most proteins required for the proper functioning of primary plastids are encoded in the nuclear genome and addressed to the plastid lumen via specialized signal sequences and a translocation apparatus (Gutensohn et al. 2006). Other photosynthetic eukaryotic phyla obtained their plastids through secondary endosymbiosis, i.e. the symbiosis of either green or red algae within another eukaryotic cell, or even through tertiary endosymbiosis (symbiosis of secondary photosynthetic eukaryotes within eukaryotic hosts) (Delwiche 1999; Archibald 2009; Keeling 2013). Euglenida (Excavata) and Chlorarachniophyta (Rhizaria) carry green algal secondary plastids ('green plastids') acquired through two independent endosymbioses involving Prasinophyceae and Ulvophyceae green algae, respectively (Rogers et al. 2007; Hrdá et al. 2012; Suzuki et al. 2016). Photosynthetic species in the Cryptophyta, Alveolata, Stramenopiles and Haptophyta (CASH) lineages have plastids derived from red algae ('red plastids') but so far it has been impossible to retrace a consensual evolutionary history (Lane and Archibald 2008; Archibald 2009; Keeling 2013). Whereas phylogenomic analyses of plastid-encoded genes support the monophyly of all CASH plastids, arguing for a single red algal secondary endosymbiosis (Yoon et al. 2002; Muñoz-Gómez et al. 2017), most of the phylogenies based on host nuclear genes do not retrieve their monophyly (Baurain et al. 2010; Burki et al. 2016). To reconcile these incongruent results, some authors have proposed the hypothesis that a unique phylum (which may have gone extinct or evolved into one of the extant CASH phyla) acquired a red alga through secondary endosymbiosis and originated the first lineage of red secondary algae. Subsequently, this lineage would have transmitted the secondary red plastid to other CASH phyla via serial tertiary endosymbioses involving different hosts (Larkum et al. 2007; Sanchez-Puerta and Delwiche 2008; Bodył et al. 2009; Baurain et al. 2010; Petersen et al. 2014).
As for the primary endosymbiosis, each secondary or tertiary endosymbiosis was accompanied by numerous EGTs from the nucleus of the endosymbiotic red or green alga to the host nucleus. Consequently, secondary photosynthetic eukaryotes possess two types of genes that can inform about the phylogenetic identity of their plastids: plastid-encoded genes and nucleus-encoded genes acquired via EGT. Genes encoded in primary plastid genomes and the EGTs found in the genomes of Archaeplastida are related to cyanobacteria and have helped to identify the cyanobacterial lineage at the origin of the first plastid (Ponce-Toledo et al. 2017). Similarly, plastid-encoded genes and EGTs found in nuclear genomes of secondary photosynthetic eukaryotes are expected to be useful to determine the red or green algal origin of their plastids. Compared to plastid-encoded genes, EGTs have the additional advantage that they can inform about the presence and identity of past plastids in lineages where plastids have been lost or replaced (cryptic plastid endosymbioses). However, if EGTs are valuable to track contemporary and cryptic endosymbioses, their detection within whole nuclear genome sequences remains a complex task (Stiller 2011). In the case of primary endosymbiosis, EGT detection is rather straightforward because cyanobacterial-type genes are easily distinguishable from typical eukaryotic nuclear genes. The situation is more difficult in the case of secondary endosymbioses. Indeed, detection of EGT genes transferred from the nucleus of green or red algal endosymbionts can be ambiguous due to the poor resolution often found in single gene phylogenies that hampers distinguishing EGTs from vertically inherited nuclear genes, especially considering the short phylogenetic distance between Archaeplastida and several groups of secondary algae. Two studies on red-plastid-bearing algae, the chromerids (Alveolata) and the diatoms (Stramenopiles), illustrate this issue. Both reported an unexpected high number of genes phylogenetically related to green algal homologs. Whereas in the case of the chromerids the green signal was attributed to probable phylogenetic artifacts and the reduced sampling of red algal genome sequences (Woehle et al. 2011), it was interpreted in diatoms as evidence for a cryptic green algal endosymbiont (Moustafa et al. 2009). However, the subsequent reanalyses of the same genes using richer taxonomic sampling and more robust phylogenetic methods largely erased the evidence for cryptic green endosymbioses in these CASH phyla (Burki et al. 2012; Deschamps and Moreira 2012; Moreira and Deschamps 2014).
The extent and impact of horizontal gene transfer (HGT) on eukaryotic evolution remain controversial topics (Leger et al. 2018). HGTs might be valuable to infer the history of genomes and lineages (Abby et al. 2012) but they can also introduce inconvenient noise in phylogenomic analyses, in particular for the study of EGTs (Stiller 2011). Through time, secondary photosynthetic eukaryotes may have accumulated HGTs in their nuclear genomes from various sources, perhaps even including non-endosymbiotic red or green algae. Unfortunately, gene phylogenies of such HGTs may display topologies comparable to those of EGTs, making them difficult to set apart. In this context, anomalous phylogenetic signal in certain secondary photosynthetic groups has been interpreted as HGT rather than EGT from cryptic endosymbionts. This is the case of the nuclear genome sequence of the green-plastid-containing chlorarachniophyte alga Bigellowiella natans, in which 22% of the genes potentially acquired via HGT appeared to have a red algal origin (Curtis et al. 2012). Because of the phagotrophic ability of chlorarachniophytes, the presence of these genes was considered to be the result of progressive accumulation of HGTs from red algae or from red-plastid-containing CASH lineages, some eventually substituting original 'green' EGTs (Archibald et al. 2003; Yang et al. 2011; Yang et al. 2014). Analogous studies on euglenid species suggested a similar trend for several genes involved in central metabolic pathways (Maruyama et al. 2011; Yang et al. 2011; Markunas and Triemer 2016). The unexpected presence of those 'red' genes in chlorarachniophytes and euglenids was first considered as the result of multiple HGTs (e.g., Archibald et al. 2003; Maruyama et al. 2011) but the increasing number of reported cases has prompted some authors to speculate on putative cryptic red algal endosymbioses in both lineages (Maruyama et al. 2011; Markunas and Triemer 2016). A systematic investigation of HGT/EGT is still missing in euglenids and chlorarachniophytes but, as mentioned above, in the context of secondary endosymbioses it can be difficult to distinguish among HGT, EGT, and just unresolved trees on the basis of single-gene phylogenies (Deschamps and Moreira 2012).
In this work, we have focused on a particular group of genes to reduce this uncertainty: genes transferred from the original cyanobacterial plastid endosymbiont into the nuclear genome of Archaeplastida and subsequently transferred from Archaeplastida into the genomes of complex secondary algae. In Archaeplastida, these genes are known to be involved in essential plastid functions and tend to be highly conserved (Reyes-Prieto et al. 2006; Deschamps and Moreira 2009), so we expected that they can provide strong phylogenetic signal. To identify them, we queried by BLAST the whole predicted proteomes of Guillardia theta and Bigelowiella natans against a local genome database containing representatives of the three domains of life, in particular a comprehensive collection of genomes and transcriptomes of photosynthetic protists (supplementary table S1, Supplementary Material online). Guillardia and Bigelowiella proteins with hits in other photosynthetic eukaryotes and in cyanobacteria were selected for phylogenetic analysis. Maximum likelihood (ML) phylogenetic trees for these proteins were constructed and manually filtered to retain those fulfilling two criteria: i) trees have to support a clear separation of Viridiplantae and Rhodophyta (with secondary lineages branching within them), and ii) proteins have to be shared by at least three secondary photosynthetic lineages. We identified in this way 82 genes most likely acquired by secondary photosynthetic eukaryotes from Archaeplastida. 70 were cyanobacterial genes likely transferred sequentially through primary and secondary endosymbioses, and 12 were derived from diverse bacterial groups likely transferred to a common ancestor of Archaeplastida and subsequently transferred to secondary photosynthetic groups (supplementary table S3 and figs. S1-S82, Supplementary Material online). Interestingly, most of these genes were absent in non-photosynthetic eukaryotes, supporting that they were not misinterpreted vertically-inherited ones.
Most of the 82 ML phylogenies were well resolved and enabled us to unambiguously determine, for each secondary lineage, whether the source of the gene was a green or a red alga. As expected, in the great majority of our trees (between 84 and 90%, fig. 1A ) the genes of red-plastid-endowed CASH lineages derived from red algae (e.g., fig. 2A and 2B ). Because of their secondary green plastids, we expected the opposite situation in chlorarachniophytes and euglenids, namely a majority of 'green' genes. However, 42 of the 78 trees where chlorarachniophytes were present (54%, fig. 1A ) supported a 'red' origin of the corresponding genes (e.g., fig. 2A ). Similarly, 22 of the 61 trees containing euglenids (36%, fig. 1A ) also supported a 'red' ancestry (e.g., fig. 2B ). These surprisingly high values were in sharp contrast with the small number of trees (<10%, fig. 1A ) showing CASH phyla embedded within green algae. Interestingly, the CASH phyla were monophyletic in 7 of these trees, arguing for a common evolutionary origin of the corresponding 'green' genes. Almost all of the 82 genes identified here encode plastid-targeted proteins involved in essential plastid functions (fig. 1B and supplementary table S4, Supplementary Material online). For instance, in both chlorarachniophytes and euglenids, these nuclear-encoded 'red' genes participate in plastid genome expression (e.g., elongation factors and aminoacyl-tRNA synthetases), light harvesting, chlorophyll biosynthesis, and photosystem II assembly. Keeping these important genes implies a plastid-related selective pressure, which excludes that they could have accumulated in the heterotrophic ancestors of green secondary photosynthetic eukaryotes prior to plastid acquisition.
Fig 1.
Genes of red and green algal ancestry in secondary photosynthetic eukaryotes. (A) Number of red or green algal-like genes in each lineage among the 82 genes analyzed classified according to their origin and statistical support in phylogenetic trees (supplementary figs. S1-S82, Supplementary Material online). (B) Gene functions of the 'red' and 'green' genes detected in transcriptomes and nuclear genomes of chlorarachniophytes and euglenids.
Fig 2.
Examples of maximum likelihood phylogenetic trees of nucleus-encoded genes of red and green algal origin in secondary photosynthetic eukaryotes. (A) Protein involved in photosystem II assembly (inherited from green algae in euglenids and from a red lineage in chlorarachniophytes). (B) Protein required for thylakoid membrane formation (inherited from green algae in chlorarachniophytes and from a red lineage in euglenids). Bootstrap support values are indicated by black (100%), dark grey (95-99%), and light grey (85-95%) circles. Scale bars indicate the number of substitutions per site. Complete trees can be seen, respectively, in supplementary figs. S74 and S62, Supplementary Material online.
The marked disproportion of unexpected gene sources in green versus red secondary photosynthetic lineages is intriguing and may be interpreted in different ways. First, the green algal ancestors of chlorarachniophyte and euglenid plastids may have had a high proportion of red algal HGT genes in their genomes. However, such a high HGT proportion involving essential genes has not been reported so far in any green alga. Second, these 'red' genes may have accumulated in chlorarachniophyte and euglenid nuclear genomes by numerous HGTs, for example from food sources. This would imply that, for unknown reasons, HGT is much more frequent in secondary green lineages than in red ones, as well as a long-lasting feeding preference towards red prey in both secondary green lineages. Moreover, the 'red' genes are shared by all the species of the relatively rich taxon sampling available for chlorarachniophytes (fig. 2A ), indicating that their acquisition predated the diversification of this group and stopped afterwards (we did not retrieve any tree supporting a recent HGT involving only a subgroup of chlorarachniophytes). Our data therefore argue for an ancient timing of 'red' gene acquisition. These observations may support a third interpretation: the 'red' genes are shared by all SAR lineages (Stramenopiles, Alveolata, and Rhizaria) because they were acquired from a single common secondary red algal endosymbiosis ancestral to the whole SAR supergroup. This original red plastid would have been lost in many phyla and replaced by a green alga in chlorarachniophytes. However, this scenario poses several problems. On the one hand, traces of past presence of red algal plastids, in the form of EGTs, in non-photosynthetic SAR lineages are very often controversial (Elias and Archibald 2009; Stiller et al. 2009; Stiller 2011). On the other hand, plastid-bearing chlorarachniophytes constitute a relatively late-emerging branch within SAR (Sierra et al. 2016), implying that if their present-day green plastid replaced a former red one, this red plastid would have had to be present until recently and been lost in all other rhizarian lineages, which may seem unparsimonious. The case of euglenids is even more difficult to interpret as this group of excavates has no close phylogenetic relationship with any other photosynthetic lineage. In addition, massive sequence data remain much more limited for euglenids than for chlorarachniophytes (only a few transcriptomes available, see supplementary table S1, Supplementary Material online), making it difficult to infer the relative age of possible gene transfers. Nonetheless, 'red' genes were often shared by several euglenids in our trees, suggesting a similar pattern of ancient acquisition as in chlorarachniophytes (supplementary figs. S1-S82, Supplementary Material online).
Our results show the presence of an unexpectedly high number of genes of red algal affinity in the two groups of eukaryotic algae with secondary green plastids, the euglenids and chlorarachniophytes, which is significantly higher than the frequency of 'green' genes in algae with secondary red plastids, the CASH lineages. To address this question, we have focused on a subset of genes selected because of their strong phylogenetic signal and their implication in plastid-related activities. It is therefore uncertain whether this conclusion can be applied to the rest of HGTs/EGTs potentially present in the genomes of all these algae. In fact, in addition to the problems inherent to the accurate detection of EGTs, our focus on these specific genes may explain, at least partly, the different results obtained in recent analyses of all potential EGTs in some CASH lineages, not only those of ultimate cyanobacterial origin (e.g., Dorrell et al. 2017).
However, we could not identify any particular bias in our gene selection process that could have artificially enriched the observed 'red' gene frequency in euglenids and chlorarachniophytes. Despite the methodological problems inherent to global genome analyses cited above, including a highly unbalanced representation of red and green algal genomes in sequence databases (Deschamps and Moreira 2012), the study of the chlorarachniophyte B. natans genome already pointed in that direction, with 22% of EGT genes of apparent red algal ancestry (Curtis et al. 2012). The origin of the 'red' genes in euglenids and chlorarachniophytes, either by cumulative HGT or by EGT from cryptic red algal endosymbionts, remains mysterious but our work indicates that they were acquired early in both groups and that they fulfill essential functions for plastid activity and maintenance. Interestingly, indisputable evidence supports that in a third group of complex algae with green plastids, the dinoflagellate genus Lepidodinium, a former red plastid was replaced by the current green one, leading to a mosaic plastid proteome encoded by a mix of red and green algal genes (Minge et al. 2010), reminiscent of those found in euglenids and chlorarachniophytes. It has been proposed that, since they retain more gene-rich genomes than green ones, red plastids have increased capacity for autonomous metabolism that could explain why they are more widespread across the diversity of eukaryotes as secondary plastids (the "portable plastid" hypothesis (Grzebyk et al. 2003)). It is thus tempting to speculate for euglenids and chlorarachniophytes a similar case as for Lepidodinium, with initial red plastids subsequently replaced by green ones. Even if this hypothesis turns out to be wrong and these cryptic red endosymbioses did not exist, the ancient acquisition by another mechanism of a significant number of red algal genes in both groups before their diversification and, especially, their maintenance in the contemporary species through millions of years of evolution, suggest that the 'red' genes were instrumental in the establishment and maintenance of the secondary green plastids. Sequencing and analysis of additional genomes of euglenids, chlorarachniophytes, and their non-photosynthetic relatives will help to refine the inventory of 'red' genes in these lineages and their timing and, eventually, mechanism of acquisition.
Materials and Methods
Sequence Analysis
A local database was constructed to host the predicted proteomes from various nuclear genomes and transcriptomes as well as plastid genomes (for the complete list, see supplementary table S1, Supplementary Material online). All proteins of the Bigelowiella natans (Chlorarachniophyta) and Guillardia theta (Cryptophyta) predicted proteomes were used as queries for BLASTp sequence similarity searches (Camacho et al. 2009) against the local database. We retained up to 350 top hits with an e-value threshold of 1e-05. BLASTp outputs were parsed with a custom Python script to identify the proteins having hits in diverse photosynthetic eukaryotes and that were more similar to cyanobacteria or other bacteria than to non-photosynthetic eukaryotes.
For these proteins, reciprocal BLASTp searches were done against the database to collect up to 600 similar sequences. We then aligned each set using Mafft v7.123b (Katoh and Standley 2013) with default parameters. Non-conserved alignment regions were trimmed with BMGE v1.0 (Criscuolo and Gribaldo 2010) with the BLOSUM62 matrix and allowing less than 50% gaps per position. Preliminary phylogenetic trees were inferred from trimmed alignments using FastTree v2.1.7 (Price et al. 2010) with default parameters. These trees were then manually inspected to identify those compatible with an EGT/HGT scenario. For all positive cases, only the sequences corresponding to the portion of interest of each phylogenetic tree (the part showing the photosynthetic eukaryotes and the closest outgroup) were retained for the remaining steps. We then removed very short partial sequences and, to speed up subsequent calculations, several outgroup sequences from all alignments (see supplementary table S2, Supplementary Material online). The final sequence datasets were realigned and trimmed using TrimAL v1.4.rev15 with “gappy-out” parameter (Capella-Gutierrez et al. 2009). ML phylogenetic trees were inferred using IQtree v1.5.1 with the PMSF model of sequence evolution (Wang et al. 2018) parameterized using guided trees constructed with the LG+G+I model. Statistical support was calculated with 1000 ultrafast bootstrap replicates (Minh et al. 2013; Nguyen et al. 2015; Hoang et al. 2018).
Final selection of trees was done by manual inspection to keep those fulfilling the following two requirements: i) the protein had to be shared by Cyanobacteria (or other bacteria), Archaeplastida and at least three secondary photosynthetic lineages, and ii) the corresponding phylogenetic trees had to support the clear separation of Viridiplantae and Rhodophyta (plus the lineages with secondary green and red plastids nested within them). Finally, the 82 trees passing this final filter (supplementary figs. S1-S82, Supplementary Material online) were inspected to infer the phylogenetic origin of the corresponding genes in the secondary photosynthetic lineages (supplementary table S3, Supplementary Material online).
Gene Functional Annotation
We annotated the functions of the 82 proteins from the final selection (see above) through the EggNOG 4.5 (Huerta-Cepas et al. 2016) web portal (http://eggnogdb.embl.de). For each protein we used as queries the ortholog sequences of Guillardia theta and Bigelowiella natans. Functional annotations are shown in supplementary table S4, Supplementary Material online.
Supplementary Material
Supplementary figures S1–S82 and tables S1-S4 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
Acknowledgments
This study was supported by European Research Council grant ProtistWorld (P.L.-G., agreement no. 322669), the Université Paris-Sud program “Attractivité” (P.D.) and the Agence Nationale de la Recherche (D.M., project ANR-15-CE32-0003 "ANCESSTRAM"). We thank the Associated Editor and two anonymous reviewers for constructive comments.
Data Availability
Protein sequence datasets used in this work are available for download at http://www.ese.u-psud.fr/article950.html?lang=en. They include nonaligned sequences and trimmed alignments.
References
- Abby SS, Tannier E, Gouy M, Daubin V. Lateral gene transfer as a support for the tree of life. Proc Natl Acad Sci U S A. 2012;109:4962–4967. doi: 10.1073/pnas.1116871109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Archibald JM. The puzzle of plastid evolution. Curr Biol. 2009;19:R81–88. doi: 10.1016/j.cub.2008.11.067. [DOI] [PubMed] [Google Scholar]
- Archibald JM, Rogers MB, Toop M, Ishida KI, Keeling PJ. Lateral gene transfer and the evolution of plastid-targeted proteins in the secondary plastid-containing alga Bigelowiella natans . Proc Natl Acad Sci U S A. 2003;100:7678–7683. doi: 10.1073/pnas.1230951100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baurain D, Brinkmann H, Petersen J, Rodriguez-Ezpeleta N, Stechmann A, Demoulin V, Roger AJ, Burger G, Lang BF, Philippe H. Phylogenomic evidence for separate acquisition of plastids in cryptophytes, haptophytes, and stramenopiles. Mol Biol Evol. 2010;27:1698–1709. doi: 10.1093/molbev/msq059. [DOI] [PubMed] [Google Scholar]
- Bodył A, Stiller JW, Mackiewicz P. Chromalveolate plastids: direct descent or multiple endosymbioses? Trends Ecol Evol. 2009;24:119–121. doi: 10.1016/j.tree.2008.11.003. [DOI] [PubMed] [Google Scholar]
- Burki F, Flegontov P, Obornik M, Cihlar J, Pain A, Lukes J, Keeling PJ. Re-evaluating the green versus red signal in eukaryotes with secondary plastid of red algal origin. Genome Biol Evol. 2012;4:626–635. doi: 10.1093/gbe/evs049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burki F, Kaplan M, Tikhonenkov DV, Zlatogursky V, Minh BQ, Radaykina LV, Smirnov A, Mylnikov AP, Keeling PJ. Untangling the early diversification of eukaryotes: a phylogenomic study of the evolutionary origins of Centrohelida, Haptophyta and Cryptista. Proc Biol Sci. 2016;283:1823. doi: 10.1098/rspb.2015.2802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Curtis BA, Tanifuji G, Burki F, Gruber A, Irimia M, Maruyama S, Arias MC, Ball SG, Gile GH, Hirakawa Y, et al. Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs. Nature. 2012;492:59–65. doi: 10.1038/nature11681. [DOI] [PubMed] [Google Scholar]
- Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Criscuolo A, Gribaldo S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol Biol. 2010;10:1471–2148. doi: 10.1186/1471-2148-10-210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delwiche CF. Tracing the thread of plastid diversity through the tapestry of life. Am Nat. 1999;154:S164–S177. doi: 10.1086/303291. [DOI] [PubMed] [Google Scholar]
- Deschamps P, Moreira D. Reevaluating the green contribution to diatom genomes. Genome Biol Evol. 2012;4:683–688. doi: 10.1093/gbe/evs053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deschamps P, Moreira D. Signal conflicts in the phylogeny of the primary photosynthetic eukaryotes. Mol Biol Evol. 2009;26:2745–2753. doi: 10.1093/molbev/msp189. [DOI] [PubMed] [Google Scholar]
- Dorrell RG, Gile G, McCallum G, Méheust R, Bapteste EP, Klinger CM, Brillet-Guéguen L, Freeman KD, Richter DJ, Bowler C. Chimeric origins of ochrophytes and haptophytes revealed through an ancient plastid proteome. Elife. 2017;6 doi: 10.7554/eLife.23717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elias M, Archibald JM. Sizing up the genomic footprint of endosymbiosis. Bioessays. 2009;31:1273–1279. doi: 10.1002/bies.200900117. [DOI] [PubMed] [Google Scholar]
- Grzebyk D, Schofield O, Vetriani C, Falkowski PG. The mesozoic radiation of eukaryotic algae: the portable plastid hypothesis. J Phycol. 2003;39:259–267. [Google Scholar]
- Gutensohn M, Fan E, Frielingsdorf S, Hanner P, Hou B, Hust B, Klosgen RB. Toc, Tic, Tat et al.: structure and function of protein transport machineries in chloroplasts. J Plant Physiol. 2006;163:333–347. doi: 10.1016/j.jplph.2005.11.009. [DOI] [PubMed] [Google Scholar]
- Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. UFBoot2: Improving the ultrafast bootstrap approximation. Mol Biol Evol. 2018;35:518–522. doi: 10.1093/molbev/msx281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hrdá Š, Fousek J, Szabová J, Hampl V, Vlček Č. The plastid genome of Eutreptiella provides a window into the process of secondary endosymbiosis of plastid in euglenids. PLoS One. 2012;7:e33746. doi: 10.1371/journal.pone.0033746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huerta-Cepas J, Szklarczyk D, Forslund K, Cook H, Heller D, Walter MC, Rattei T, Mende DR, Sunagawa S, Kuhn M, Jensen LJ, et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 2016;44:D286–293. doi: 10.1093/nar/gkv1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keeling PJ. The number, speed, and impact of plastid endosymbioses in eukaryotic evolution. Annu Rev Plant Biol. 2013;64:583–607. doi: 10.1146/annurev-arplant-050312-120144. [DOI] [PubMed] [Google Scholar]
- Kleine T, Maier UG, Leister D. DNA transfer from organelles to the nucleus: the idiosyncratic genetics of endosymbiosis. Annu Rev Plant Biol. 2009;60:115–138. doi: 10.1146/annurev.arplant.043008.092119. [DOI] [PubMed] [Google Scholar]
- Lane CE, Archibald JM. The eukaryotic tree of life: endosymbiosis takes its TOL. Trends Ecol Evol. 2008;23:268–275. doi: 10.1016/j.tree.2008.02.004. [DOI] [PubMed] [Google Scholar]
- Larkum AW, Lockhart PJ, Howe CJ. Shopping for plastids. Trends Plant Sci. 2007;12:189–195. doi: 10.1016/j.tplants.2007.03.011. [DOI] [PubMed] [Google Scholar]
- Leger MM, Eme L, Stairs CW, Roger AJ. Demystifying eukaryote lateral gene transfer. Bioessays. 2018;40:e1700242. doi: 10.1002/bies.201700242. [DOI] [PubMed] [Google Scholar]
- Markunas CM, Triemer RE. Evolutionary history of the enzymes involved in the Calvin-Benson cycle in euglenids. J Eukaryot Microbiol. 2016;63:326–339. doi: 10.1111/jeu.12282. [DOI] [PubMed] [Google Scholar]
- Maruyama S, Suzaki T, Weber AP, Archibald JM, Nozaki H. Eukaryote-to-eukaryote gene transfer gives rise to genome mosaicism in euglenids. BMC Evol Biol. 2011;11:1471–2148. doi: 10.1186/1471-2148-11-105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Minge MA, Shalchian-Tabrizi K, Torresen OK, Takishita K, Probert I, Inagaki Y, Klaveness D, Jakobsen KS. A phylogenetic mosaic plastid proteome and unusual plastid-targeting signals in the green-colored dinoflagellate Lepidodinium chlorophorum . BMC Evol Biol. 2010;10:191. doi: 10.1186/1471-2148-10-191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Minh BQ, Nguyen MA, von Haeseler A. Ultrafast approximation for phylogenetic bootstrap. Mol Biol Evol. 2013;30:1188–1195. doi: 10.1093/molbev/mst024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moreira D, Deschamps P. What was the real contribution of endosymbionts to the eukaryotic nucleus? Insights from photosynthetic eukaryotes. Cold Spring Harb Perspect Biol. 2014;6:a016014. doi: 10.1101/cshperspect.a016014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moreira D, Philippe H. Sure facts and open questions about the origin and evolution of photosynthetic plastids. Res Microbiol. 2001;152:771–780. doi: 10.1016/s0923-2508(01)01260-8. [DOI] [PubMed] [Google Scholar]
- Moustafa A, Beszteri B, Maier UG, Bowler C, Valentin K, Bhattacharya D. Genomic footprints of a cryptic plastid endosymbiosis in diatoms. Science. 2009;324:1724–1726. doi: 10.1126/science.1172983. [DOI] [PubMed] [Google Scholar]
- Muñoz-Gómez SA, Mejía-Franco FG, Durnin K, Colp M, Grisdale CJ, Archibald JM, Slamovits CH. The new red algal subphylum Proteorhodophytina comprises the largest and most divergent plastid genomes known. Curr Biol. 2017;27:1677–1684. doi: 10.1016/j.cub.2017.04.054. [DOI] [PubMed] [Google Scholar]
- Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Petersen J, Ludewig AK, Michael V, Bunk B, Jarek M, Baurain D, Brinkmann H. Chromera velia, endosymbioses and the rhodoplex hypothesis--plastid evolution in cryptophytes, alveolates, stramenopiles, and haptophytes (CASH lineages) Genome Biol Evol. 2014;6:666–684. doi: 10.1093/gbe/evu043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ponce-Toledo RI, Deschamps P, López-García P, Zivanovic Y, Benzerara K, Moreira D. An early-branching freshwater cyanobacterium at the origin of plastids. Curr Biol. 2017;27:386–391. doi: 10.1016/j.cub.2016.11.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Price MN, Dehal PS, Arkin AP. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5 doi: 10.1371/journal.pone.0009490. 0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reyes-Prieto A, Hackett JD, Soares MB, Bonaldo MF, Bhattacharya D. Cyanobacterial contribution to algal nuclear genomes is primarily limited to plastid functions. Curr Biol. 2006;16:2320–2325. doi: 10.1016/j.cub.2006.09.063. [DOI] [PubMed] [Google Scholar]
- Rogers MB, Gilson PR, Su V, McFadden GI, Keeling PJ. The complete chloroplast genome of the chlorarachniophyte Bigelowiella natans: evidence for independent origins of chlorarachniophyte and euglenid secondary endosymbionts. Mol Biol Evol. 2007;24:54–62. doi: 10.1093/molbev/msl129. [DOI] [PubMed] [Google Scholar]
- Sanchez-Puerta MV, Delwiche CF. A hypothesis for plastid evolution in chromalveolates. J Phycol. 2008;44:1097–1107. doi: 10.1111/j.1529-8817.2008.00559.x. [DOI] [PubMed] [Google Scholar]
- Sierra R, Canas-Duarte SJ, Burki F, Schwelm A, Fogelqvist J, Dixelius C, Gonzalez-Garcia LN, Gile GH, Slamovits CH, Klopp C, et al. Evolutionary origins of rhizarian parasites. Mol Biol Evol. 2016;33:980–983. doi: 10.1093/molbev/msv340. [DOI] [PubMed] [Google Scholar]
- Stiller JW. Experimental design and statistical rigor in phylogenomics of horizontal and endosymbiotic gene transfer. BMC Evol Biol. 2011;11:1471–2148. doi: 10.1186/1471-2148-11-259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stiller JW, Huang J, Ding Q, Tian J, Goodwillie C. Are algal genes in nonphotosynthetic protists evidence of historical plastid endosymbioses? BMC Genomics. 2009;10:484. doi: 10.1186/1471-2164-10-484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suzuki S, Hirakawa Y, Kofuji R, Sugita M, Ishida K. Plastid genome sequences of Gymnochlora stellata, Lotharella vacuolata, and Partenskyella glossopodia reveal remarkable structural conservation among chlorarachniophyte species. J Plant Res. 2016;129:581–590. doi: 10.1007/s10265-016-0804-5. [DOI] [PubMed] [Google Scholar]
- Wang HC, Minh BQ, Susko E, Roger AJ. Modeling site heterogeneity with posterior mean site frequency profiles accelerates accurate phylogenomic estimation. Syst Biol. 2018;67:216–235. doi: 10.1093/sysbio/syx068. [DOI] [PubMed] [Google Scholar]
- Weeden NF. Genetic and biochemical implications of the endosymbiotic origin of the chloroplast. J Mol Evol. 1981;17:133–139. doi: 10.1007/BF01733906. [DOI] [PubMed] [Google Scholar]
- Woehle C, Dagan T, Martin WF, Gould SB. Red and problematic green phylogenetic signals among thousands of nuclear genes from the photosynthetic and apicomplexa-related Chromera velia . Genome Biol Evol. 2011;3:1220–1230. doi: 10.1093/gbe/evr100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Y, Maruyama S, Sekimoto H, Sakayama H, Nozaki H. An extended phylogenetic analysis reveals ancient origin of "non-green" phosphoribulokinase genes from two lineages of "green" secondary photosynthetic eukaryotes: Euglenophyta and Chlorarachniophyta. BMC Res Notes. 2011;4:330. doi: 10.1186/1756-0500-4-330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Y, Matsuzaki M, Takahashi F, Qu L, Nozaki H. Phylogenomic analysis of "red" genes from two divergent species of the "green" secondary phototrophs, the chlorarachniophytes, suggests multiple horizontal gene transfers from the red lineage before the divergence of extant chlorarachniophytes. PLoS One. 2014;9:e101158. doi: 10.1371/journal.pone.0101158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yoon HS, Hackett JD, Pinto G, Bhattacharya D. The single, ancient origin of chromist plastids. Proc Natl Acad Sci U S A. 2002;99:15507–15512. doi: 10.1073/pnas.242379899. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Protein sequence datasets used in this work are available for download at http://www.ese.u-psud.fr/article950.html?lang=en. They include nonaligned sequences and trimmed alignments.


