Abstract
Secondary loss of photosynthesis is observed across almost all plastid-bearing branches of the eukaryotic tree of life. However, genome-based insights into the transition from a phototroph into a secondary heterotroph have so far only been revealed for parasitic species. Free-living organisms can yield unique insights into the evolutionary consequence of the loss of photosynthesis, as the parasitic lifestyle requires specific adaptations to host environments. Here, we report on the diploid genome of the free-living diatom Nitzschia putrida (35 Mbp), a nonphotosynthetic osmotroph whose photosynthetic relatives contribute ca. 40% of net oceanic primary production. Comparative analyses with photosynthetic diatoms and heterotrophic algae with parasitic lifestyle revealed that a combination of gene loss, the accumulation of genes involved in organic carbon degradation, a unique secretome, and the rapid divergence of conserved gene families involved in cell wall and extracellular metabolism appear to have facilitated the lifestyle of a free-living secondary heterotroph.
Genomic adaptation to loss of photosynthesis and an osmotrophic lifestyle requires unique evolutionary processes.
INTRODUCTION
The loss of photosynthesis in photoautotrophs is successful if compensated by a competitive advantage arising from the availability of an extracellular energy source. Hence, many secondary heterotrophs evolve as parasites (1–3), relying on sufficient resources provided by their hosts. Well-studied examples are the Apicomplexa [e.g., (4)], which have lost photosynthesis secondarily. However, the loss of photosynthesis can also lead to free-living secondary heterotrophs, which are as common as parasites (2, 5–8). Despite their significance, our knowledge about the evolution of free-living secondary heterotrophs is very limited, and we therefore lack insights into evolutionary processes required for them to thrive without photosynthesis and independently of a resource-providing host. Given that a parasitic lifestyle accelerates the rate of evolution (cf. Red Queen hypothesis) (9) and of loss of conserved orthologous genes [e.g., (10)], the genome analysis of a nonparasitic secondary heterotroph can provide insights uncompromised by parasite-specific adaptations. Hence, the diatom Nitzschia putrida, isolated from mangrove estuaries, is the ideal model to test these hypotheses because it is an example of a free-living secondary heterotroph (5, 11) within the diverse group of largely photoautotrophic diatoms (12, 13). As several genomes of the latter have recently become available including close phylogenetic relatives (14–16), a genome-based comparative metabolic reconstruction of N. putrida promises to reveal fresh insights into what is required to thrive as a free-living secondary heterotroph. Thus, here, we have analyzed the draft genome sequence of N. putrida, which provides insight into evolutionary processes underpinning lifestyle shifts from photoautotrophy to free-living heterotrophy in the context of a coastal surface ocean ecosystem.
RESULTS
Genome assembly
K-mer–based GenomeScope analysis (17) with 150–base pair (bp)–long Illumina short reads suggested the genome of N. putrida (Fig. 1A) to be diploid (fig. S1A). To provide a high-quality genome with long-range contiguity, PacBio sequencing (RSII platform) was performed, resulting in ≥40-fold coverage. Because of the confirmed diploid nature of the N. putrida genome, we have applied the Falcon assembler and Falcon_unzip version 0.5 (18) to provide a first draft genome of this species. On the basis of this assembly, we estimated a genome size of 35 Mbp, including 87 scaffolds with an N50 of 860.9 kbp. The longest scaffold was 3.8 Mbp. The heterozygous regions of the genome (alternate contigs) estimated by the Falcon assembler resulted in 12 Mbp, with an N50 of 121 kbp (table S1). The Falcon assembly was error-corrected and polished by approximately 150-fold coverage of Illumina short reads, which were subsequently used for generating the final assembly with Pilon 1.2.2 (19) including manual curation.
Fig. 1. The heterotrophic diatom N. putrida and its plastid proteome.
(A) The frustule view of N. putrida. Bar, 10 μm. (B) Estimated plastid proteome size in three diatoms. Light and dark gray bars show low and high confident plastid-targeted proteins identified by ASAFind (22), respectively. Data of two photosynthetic diatoms P. tricornutum and T. pseudonana are derived from the previous study (22). (C) Unique and shared plastid-targeted orthogroups. Highlighted in red is the orthogroup exclusively shared by the two photosynthetic diatoms. (D) Predicted metabolic map of the nonphotosynthetic plastid. Representative pathways found in photosynthetic diatom species are shown. Green and light gray arrows show the presence and absence of the responsible protein sequences for the reactions in the genome, respectively. Amino acids are highlighted in red. Abbreviations are described in the Supplementary Materials.
According to the k-mer–assessed diploid nature of the N. putrida genome, the read coverage of the homozygous regions is approximately twofold higher than the read coverage for the heterozygous regions, suggesting the presence of diverged alleles as previously identified in the genome of the photoautotroph diatom Fragilariopsis cylindrus (fig. S1, A and B). Thus, most of the diverged allelic variants can be found in the heterozygous regions characterized by the presence of alternate contigs, whereas the regions with no corresponding alternate contigs are homozygous (fig. S1B). On the basis of the analysis with Braker2 version 2.0.3 (20), the Nitzschia genome comprises 15,003 and 5767 inferred protein-coding loci on the primary and alternate contigs, respectively (table S1). Almost 40% of loci in the genome of N. putrida appear to be characterized by diverged alleles. A BUSCOv3 analysis (21) revealed the genome to be complete at a level of 90.1% based on the haploid set of genes.
The loss of photosynthesis
The haploid set of genes was used to reconstruct the nuclear-encoded plastid proteome of N. putrida and therefore to reveal the extent of gene loss including key genes of photosynthesis. A comparative analysis of the N. putrida plastome (22) with its photosynthetic counterparts revealed that more than 50% of nuclear encoded plastid proteins have been lost (Fig. 1B). More than 500 orthogroups (OrthoFinder) (23) of nuclear-encoded plastid proteins, which are usually shared between photosynthetic diatoms (22), are missing in the predicted plastid proteome of N. putrida (Fig. 1C). The missing part of the plastid proteome included genes encoding for proteins of light-harvesting antenna including fucoxanthin-chlorophyll a/c protein (fcp), photosystem II and I (e.g., psbA, psbC, psbO, psaA, psaB, and psaD), the cytochrome b6/f complex (e.g., petA), and carbon fixation (e.g., rbcS and rbcL) in addition to genes of the Calvin cycle [e.g., phosphoribulokinase (prk)]. Furthermore, a substantial number of key genes were missing for the biosynthesis of chlorophyll, carotenoids, and plastoquinones (fig. S1C).
Despite the loss of some of these key photosynthesis genes, there is still a substantial number of genes left encoding common plastid metabolic pathways as known from photosynthetic diatoms, including the generation of adenosine 5′-triphosphate (ATP) by adenosine triphosphatase (ATPase) subunits, which are encoded both in the nuclear and plastid genomes (24). Almost all genes encoding for plastid enzymes to synthesize essential amino acids are still encoded in the nuclear genome of N. putrida. Furthermore, all genes of the heme pathway have been found, and N. putrida appears to be able to synthesize riboflavin. The presence of plastid-targeted transporters (25) enables the transport of phosphoenolpyruvate, 3-phosphoglycerate, and dihydroxyacetone-phosphate across the plastid membranes. In addition, our genome-based reconstruction of plastid metabolism identified the biosynthesis pathway for lipids and the ornithine cycle in N. putrida (Fig. 1D and fig. S2). The latter has been reported neither in previous transcriptome-based studies with this species (26) nor in any other secondary heterotroph ([.g., (6, 7)]. As N. putrida has an osmotrophic lifestyle, it relies on dissolved organic matter and nutrients. Thus, the biosynthesis of a variety of metabolic compounds supports the osmotrophic lifestyle, lacking abilities to prey or to parasitize on other organisms.
Communication between organelles and light-dependent gene expression
The lack of CO2 fixation in plastids of N. putrida—which reduces the amount of amino acids, lipids, and other metabolites to be synthesized—appears to be partially compensated for by the remodeling of metabolic interactions with mitochondria and peroxisomes and by retaining active recycling of nitrogen (Figs. 1 and 2 and figs. S3 and S4). It appears that the nonphotosynthetic plastid of N. putrida still exchanges glutamine and ornithine, both of which are important intermediates of the ornithine cycle. All genes for the ornithine-urea cycle have been retained in the N. putrida genome. The ornithine-urea cycle is indispensable for nitrogen recycling in photosynthetic diatoms (27, 28), and even after the loss of photosynthesis, nitrogen recycling appears to be essential in N. putrida (Fig. 2A) due to its osmotrophic lifestyle. Usually, the ornithine-urea cycle is tightly linked with tricarboxylic acid cycle and/or photorespiration in photosynthetic diatoms (27, 28). However, N. putrida is not likely to perform photorespiration (Fig. 2A). The metabolic exchange with the peroxisome through glycolate likely has ceased as phosphoglycolate phosphatase and peroxisomal glycolate oxidase are missing. Thus, photorespiration is unlikely to take place in nonphotosynthetic plastids of N. putrid due to the lack of ribulose 1,5-bisphosphate carboxylase/oxygenase and other key enzymes of the Calvin cycle (Fig. 1). Nevertheless, peroxisomes still appear to play a role in N. putrida for the production of malate or glyoxylate, which feed into respiratory pathways of the mitochondria to support ATP and NADPH (reduced form of nicotinamide adenine dinucleotide phosphate) production (Fig. 2A).
Fig. 2. Loss of genes for the plastid-peroxisome metabolic flow and photoreceptors.
(A) Metabolic interactions between a mitochondrion and a nonphotosynthetic plastid and between a mitochondrion and a peroxisome. Black, orange, and blue arrows show presence of responsible protein sequences for the reactions in a plastid, a mitochondrion, and a peroxisome, respectively, while light gray arrows show absence of responsible protein sequences. Dashed arrows show possible interorganellar metabolic flows. Abbreviations are described in the Supplementary Materials. (B) Photoreceptor and cell-cycle genes in the N. putrida genome. The other genes are shown in fig. S5. Light green and light gray boxes show the presence and absence of corresponding genes, respectively. (C) Growth of the heterotrophic diatom under the different light conditions. Closed boxes show growth in the continuous dark condition, while open boxes show growth in the light-dark condition. Shaded in gray are the dark periods in the light-dark cultivation conditions. (D) Left: Heatmap showing the reproducible expression patterns of genes (Pearson’s correlation coefficient < 0.9). k-means clustering was calculated for each gene based on reads per kilobase of transcript per million mapped reads (RPKM) + 1 values, which were transformed to log2 and centered by median values. Yellow and blue indicate up-regulation and down-regulation of the gene, respectively. Right: The line graphs showing expression pattern of genes in each cluster. The colored line indicates the average value of the expression patterns. LogFC, log fold change.
Light in photosynthetic organisms not only plays a substantial role for photosynthesis generating ATP and NADPH but also regulates cell division, diel cycles, and different signaling processes unlike in many heterotrophic organisms (29–31). As a consequence, we identified the remaining photoreceptors and cell-cycle regulators such as cyclins and cyclin-dependent kinases (32). Although they were still encoded and expressed in the genome of N. putrida (Fig. 2B and fig. S5, A and B), we were unable to identify a diel cycle in cell division (Fig. 2C). This suggests that these cel-cycle regulators potentially have neo/subfunctionalized and therefore have a different regulatory role in N. putrida unrelated to the diel cycle. The loss of the transcription factor bHLH-1a (RITMO1), which has been identified as a master regulator of diel periodicity (33), corroborates our finding that N. putrida has lost the ability to perform diel cycles. In addition, most of the other photoreceptors known from photosynthetic diatoms have also been lost (Fig. 2B) such as the blue light sensing aureochromes 1a/b, both of which are transcription factors responsible for photoacclimation (34). Despite the lack of light-dependent cell-cycle regulation, a few remaining photoreceptors were identified including bHLH1b_PAS, aureochrome 1c, and cryptochrome-DASH/CPF2 (Fig. 2B) (29, 35). Basic ZIP [basic leucine zipper proteins (bZIP)] transcription factors having potentially light-sensitive Per-Arnt-Sim (PAS) domains (bZIP-PAS) (36) were also identified in the N. putrida genome such as homologs to bZIP6 and bZIP7 of Phaeodactylum tricornutum (37). The latter homolog has been duplicated and diversified in N. putrida (fig. S5C). The presence of bZIP-PAS protein in a heterotrophic eukaryote is not unprecedented as some oomycetes, nonphotosynthetic parasites, have been reported to also encode them in their genomes [e.g., (38)]. Although their role in regulating gene expression remains to be investigated in N. putrida, light still appears to influence the expression of some genes in this heterotrophic species. Comparative transcriptome analyses every 4 hours during a shift from a light phase to darkness (Fig. 2D) revealed eight clusters characterized by different expression patterns. Furthermore, there was no cluster explicitly representing the light-dependent gene expression patterns as seen in photosynthetic algae [e.g., (29, 39)]. However, one of the clusters contained genes only expressed in the mid-light phase: cluster 7 containing 90 genes (0.6% total). Forty four of them were genes with known functional domains based on a KOG (EuKaryotic Orthologous Groups) analysis, and 21 of them were encoding proteins for substrate import and carbon metabolism (fig. S5D). However, the photoreceptor homologs above—bHLH1b_PAS, aureochrome 1c, and cryptochrome-DASH/CPF2—were not part of this cluster, and there was no explicit trend in their gene expression patterns with respect to changes between light and dark conditions.
The genetic toolkit for the evolution of nonparasitic secondary heterotrophy
Despite the loss of many nuclear genes and their families, the genome size of N. putrida is not significantly different to photosynthetic relatives such as F. cylindrus and P. tricornutum and the more distantly related diatom Thalassiosira pseudonana (table S1). This is distinct from evolutionary trends observed in parasitic eukaryotes that have lost photosynthesis as they have smaller genomes encoding smaller gene families compared to their photosynthetic relatives (fig. S6). By comparing KOGs of paralogous proteins, there was no significant difference in the number of unique KOG IDs between these four diatom species (fig. S7, A and B). However, when we compared the number of paralogous proteins assigned to each KOG ID, there were several KOG categories for which N. putrida had a higher number of paralogous proteins compared to the other diatom species: nucleotide transport (F), transcription (K), signal transduction (T), intracellular trafficking, secretion, vesicular transport (U), and cytoskeleton (Z) (fig. S7C). Even after normalization by total gene numbers, nucleotide transport (F), signal transduction (T), and cytoskeleton (Z) genes were more abundant in the N. putrida genome (fig. S8A). This observation was corroborated by N. putrida–specific enrichment of Pfam domains such as adenylate/guanylate cyclase and cyclic nucleotide esterase, leucine-rich repeat (LRR), and glycosyl/galactosyl transferase domains (fig. S8B).
A microbial heterotroph acquires nutrients either by phagotrophy, the preferred nutrition of many parasites, or by osmotrophy. The latter requires uptake of dissolved organic compounds by osmosis as realized by bacteria and fungi, for instance (40, 41). As N. putrida grows well under axenic conditions (5, 42), it is likely an osmotroph, dependent on the uptake of dissolved organic compounds across the silicified cell wall and the plasma membrane. As realized by osmotrophic fungi, N. putrida may even be able to degrade higher–molecular weight compounds extracellularly to be subsequently taken up as individual molecules by specific transporters or even osmosis (40, 41). Thus, it is likely that cell wall, membrane, and secreted proteins were diversified in N. putrida compared to photosynthetic diatoms to facilitate osmotrophy.
To address this hypothesis, we analyzed the enrichment of paralog proteins and differences in nutrient transporters involved in the uptake of dissolved organic compounds such as solute carriers. A comparison to photosynthetic diatoms and parasitic nonphotosynthetic algal species [Prototheca and Helicosporidium (green algae), and the apicomplexans Plasmodium and Toxoplasma] has revealed that N. putrida has a unique composition of genes encoding transporters, which is therefore different to photosynthetic algae and parasitic nonphotosynthetic algal species (Fig. 3, A to C). For instance, the number of genes encoding silicon transporters (SITs), solute symporters, and the resistance-nodulation–cell division superfamily was more than twice as abundant in N. putrida compared to photosynthetic diatom species (Fig. 3D and fig. S9A). However, in contrast to the difference between N. putrida and photosynthetic diatom species, there is no enrichment of particular transporters in parasitic algal species when compared to their photosynthetic relatives (fig. S9, B and C).
Fig. 3. Diversity of transporters and carbohydrate active enzymes in N. putrida.
(A) Distribution of the number of transporters in each transporter family of diatoms. Differences in the distributions among species were tested by the Wilcoxon signed-rank test corrected with the Benjamini-Hochberg procedure (P < 0.05), but there is no significant difference. Outliers were omitted in the boxplot. Nonphotosynthetic species are highlighted in gray. (B) Distribution of the number of transporters in each transporter family of Alveolata. Details are described in (A). (C) Distribution of the number of transporters in each transporter family of green algae (Trebouxiophyceae). Details are described in (A). (D) The gene number of transporters in the 12 most abundant transporter families of N. putrida. (E) Silicon transporter (SIT) genes tandemly located in the contig 000000F. SIT genes are highlighted in light red with the gene IDs. (F) Glycoside hydrolase (GH) families from the Carbohydrate Active enZyme (CAZy) database focused on diatoms. The diagram shows a heatmap of CAZyme prevalence in each taxon (number of a particular CAZyme family divided by the total number of CAZyme genes in the organism); the white to blue color scheme indicates low to high prevalence, respectively. Dendrograms (left and top) show respectively the relative taxa proximity with respect co-occurrence of CAZyme families and the co-occurrence of CAZyme families with one another within genomes. (G) GH114 genes tandemly located in the contig 000022F. GH114 genes are highlighted in light green with the gene IDs.
Expansion of those gene families may, at least partly, have been achieved by recent tandem duplications (Fig. 3E). To gain insight into when the expansion had occurred, we performed a coalescence analysis, which revealed that SITs in N. putrida began to expand around 3.3 million years (Ma) ago [1.2 to 6.6, 95% confidence interval (CI)], while divergence from another nonphotosynthetic diatom N. alba is estimated to have occurred around 6.67 Ma ago (2.5 to 11.5, 95% CI; fig. S10). The split between F. cylindrus and P. multiseries, which was used to date the tree, was estimated at 9.7 Ma ago (7.6 to 11.6, 95% CI).
Thus, the recent expansion of SITs suggests neo/subfunctionalization of the gene family in response to the change in lifestyle. The divergence rate of SIT genes was much larger than that of control genes (e.g., myosin), indicating that SIT diversification might have contributed to the adaptation of the heterotrophic lifestyle. In support of this hypothesis, we detected several sites under positive selection in different members of the SIT family (table S2), which implies that the evolution of those genes may have been driven by diversifying selection.
The solute sodium symporters are estimated to have diverged around 7.5 Ma ago (3.8 to 11.1, 95% CI), markedly earlier than the SIT gene family. Although the divergence rate is also larger than that of control genes (fig. S10), we did not find evidence of diversifying selection in this gene family. The differences between these two families of transporters suggest that their expansion might have occurred in a stepwise manner and driven by different evolutionary forces.
Furthermore, although the overall carbohydrate-active enzyme (CAZyme) family composition of Nitzschia was not different from that of photosynthetic diatoms (fig. S11), families encoding β-glycoside hydrolase (GH8), laminarinase (GH16_3), pectinase (GH28), β-glucanase (GH72), α-mannan hydrolyzing enzymes (GH99), and β-1,2-glucan hydrolytic enzymes (GH114) were enriched in N. putrida compared to photosynthetic species (Fig. 3F). Expansion of these families might, at least partly, have been achieved by recent tandem duplications (Fig. 3G), suggesting an important role of these genes for the heterotrophic lifestyle of N. putrida. Notably, more than one-third of proteins assigned to the above six CAZyme families are predicted to be secreted in N. putrida (see below). The CAZyme compositions suggest that N. putrida might be able to degrade extracellular polysaccharides such as ß-1,3 glucans (e.g., lichenin, paramylon, callose, and laminarin), starches, β-1,2-glucans, pectin, and α-mannan. As N. putrida has been isolated from disintegrating mangrove leaves in a paddle (5, 42), this species might play a role in degrading dead leaves and therefore facilitating carbon recycling in mangroves. To gain first insight into how transcription of CAZyme genes is regulated by different carbon sources, we performed comparative transcriptome analyses with starved N. putrida cells in comparison to cells growing on glucose and starch. However, we found that only a limited number of genes encoding CAZymes were differentially expressed (table S3). About half of these genes were up-regulated in response specifically to starch as a carbon source, while only one CAZyme gene was up-regulated in response to glucose (table S3). This observation suggests that most of the CAZymes in N. putrida are not for the utilization of glucose and only very few for starch utilization. Arguably, providing a very limited set of organic substrates does not reflect the complexity of organic carbon provided by disintegrating leaves in a mangrove ecosystem. Hence, this might be the main reason for the limited transcriptome response observed in our experiments.
The predicted secretome of the nonparasitic, free-living secondary heterotroph N. putrida
Given that the secretome plays an important role for substrate degradation and subsequent uptake of low–molecular weight compounds in osmotrophs (40), we conducted a comparative analysis to predict secreted proteins of N. putrida in silico by identifying proteins with N-terminal signal peptides and a lack of transmembrane domains. The resulting proteins were clustered using TribeMCL (43), and plastid- and lysosome-localized proteins were subsequently removed using ASAFind according to their characteristic targeting motifs (22) and Pfam domains. The number of putatively secreted proteins is 978, 998, 596, and 718 in N. putrida, F. cylindrus, P. tricornutum, and T. pseudonana, respectively, which corresponds to between 5 and 7% of the total number of genes in their genomes (fig. S12A). Nevertheless, there were significant differences when we compared the diversity of proteins between these four diatom species (Fig. 4, A and B); N. putrida, on average, had a significantly higher number of proteins per tribe than any of the other diatom species (two-sided Wilcoxon signed-rank test; P < 0.01; Fig. 4C). In particular, proteins involved in heterotrophy such as organic matter degradation/modification including CAZymes and peptidases were more abundant in N. putrida than in the photosynthetic diatom genomes (188 in N. putrida, 142 in F. cylindrus, 118 in P. tricornutum, and 101 in T. pseudonana; fig. S12A). This is in contrast to parasitic green algae because their predicted secretomes are smaller than those of their photosynthetic relatives and show no explicit enrichment of secretome proteins per tribe (Fig. 4, D to F).
Fig. 4. Secretome of nonphotosynthetic algae.
(A) The number of secretome tribes of diatoms, including at least four sequences, clustered by TribeMCL (45). Different colors represent tribe categories as follows: 1, species specific tribes; 2 to 4, tribes shared by two to four species, respectively. OTUs, operational taxonomic unit. (B) Proportion of each tribe category in diatoms. Details are described in (A). (C) Distribution of the number of protein sequences in each secretome tribe in diatoms. Outliers were omitted in the boxplot. The Wilcoxon signed-rank test corrected with the Benjamini-Hochberg procedure was used for tests of statistical significance. (D) The number of secretome tribes in green algae (trebouxiophytes), including at least four sequences, clustered by TribeMCL (45). Different colors represent tribe categories as follows: 1, species specific tribes; 2 to 5, tribes shared by two to five species, respectively. (E) Proportion of each tribe category in green algae. Details are described in (D). (F) Distribution of the number of protein sequences in each secretome tribe in green algae. Details are described in (C). (G) Expression of the 10 largest tribes in N. putrida during the 25 hours of cultivation. Genes in the tribes could be divided into four clusters. Details are described in Fig. 2D.
The most common secreted proteins in N. putrida are LRR-containing proteins (fig. S12B), many of which contain additional domains such as tegument and glycoprotein domains, suggesting an increased functional diversity (fig. S13). Only very few LRR-containing proteins were identified in the predicted secretomes of the photosynthetic diatoms, indicating that signal peptide–dependent secretion of abundant and diverse LRR-containing proteins may be an essential requirement in this secondary heterotroph, such as for environmental signaling (44). In addition to LRR-containing proteins, the top 10 most enriched proteins in N. putrida were von Willebrand factor type D (VWFD) proteins involved in adhesion or clotting, two types of endopeptidases, trypsin and leishmanolysin (cell-surface peptidase of the human parasite Leishmania), intradiol ring-cleavage dioxygenase protein for degradation of aromatic compounds, methyltransferase, and four proteins with unknown function (fig. S12B). LRR-containing proteins and VWFDs might play important roles in N. putrida for attaching to disintegrating mangrove leaves (5, 42, 45). Endopeptidases and aromatic compound degradation may facilitate the utilization of their complex carbon compounds.
Furthermore, transcriptional dynamics of the predicted secretome over a diel cycle (Fig. 2) revealed the presence of four different clusters. Genes in cluster 1 were transcribed at the beginning of the first light phase and genes in cluster 2 at the end of the dark phase and into the second light phase (Fig. 4G). Genes of cluster 3 were most strongly expressed in the middle and end of the first light phase, whereas genes in cluster 4 were relatively weakly expressed throughout day and night. These results suggest that stimuli including light and/or nutrients play a role in the regulation of these genes, which might either be a relict from the photosynthetic ancestor or a response to diel cycles of organic substances in the aquatic system occupied by N. putrida. There is only weak evidence of lateral transfer of secretome genes in N. putrida (figs. S14 and S15) with five genes of potential lateral origin (figs. S14 and S15). Thus, the origin of most secretome proteins in N. putrida likely was derived vertically from homologs of a photosynthetic ancestor.
DISCUSSION
N. putrida experienced a series of genetic adaptations toward a heterotrophic lifestyle. This diatom species took a step backward in one of the major evolutionary transitions, from photoautotrophs to heterotrophs, potentially relaxing selection on some of the now redundant gene networks and their functions. As expected, more than 50% of nuclear encoded plastid proteins have been lost in the N. putrida plastid proteome in comparison to its photosynthetic counterparts (22). However, the total number of genes (~15,000) fell within the range of photosynthetic microalgae, and we found no evidence of pseudogene formation, genome streamlining [e.g., (46)], gene family contraction (cf. birth-and-death hypothesis) (47), or reductive genome evolution (Black Queen hypothesis) (48). The relatively large genome size is not unexpected given that N. putrida is a free-living osmotroph. This free-living lifestyle in a complex and highly variable coastal marine environment likely is the reason why a substantial number of genes including some photoreceptors, cell cycle regulators, and common plastid metabolic pathways usually present in photosynthetic diatoms have remained. Although some of the latter genes were still expressed, N. putrida appears to lack a diel growth cycle, which suggests that these cell-cycle regulators have neo/subfunctionalized. However, as a certain number of genes still appear to be regulated by light, osmotrophy potentially benefits from diel fluctuations of resources such as dissolved organic carbon in aquatic environments (49–51). For photoautotrophs, it is important to regulate the cell cycle in accordance with diel cycles for optimizing photosynthesis and therefore cell proliferation (29–31, 52, 53). Without being reliant on light as its primary energy source, the osmotroph N. putrida no longer requires coordinating its cell cycle with diel cycles. Thus, after the loss of photosynthesis, the strict light-dependent regulation of gene expression might have become less important and gene expression therefore may have become predominantly regulated by other stimuli. Many photoreceptors are missing, but duplication of genes for bZIP transcription factors with PAS domains and genes for signal transduction and cellular regulatory roles such as adenyl/guanyl cyclase and cyclic nucleotide esterase domains was enriched in the N. putrida genome. Furthermore, the peroxisome-plastid interaction is no longer required after the loss of photosynthesis, giving rise to loss of carbon fixation in the context of glycolate recycling. In contrast, the ornithine-urea cycle likely remains to be functional to facilitate nitrogen recycling.
Gene family expansions and neo/subfunctionalizations appear to have played a prominent role in the adaptation to its different lifestyle given that many proteins predicted to be secreted have diversified in N. putrida, possibly to facilitate osmotrophy. Together, the marked change of lifestyle associated with the “devolution” did not result in reductive genome evolution as known from nonphotosynthetic plastid-bearing parasites.
METHODS
Cultivation, DNA and RNA extraction, and sequencing
N. putrida NIES-4239 was cultivated in the Daigo’s IMK medium (Wako) including 1% Luria-Bertani medium based on the artificial seawater made with MARINE ART SF-1 (Osaka Yakken Co.) at 20°C under the 12-hour light and 12-hour dark conditions: 50 μmol photons/m2 per second with plant cultivation light-emitting diode light (BC-BML3, Biomedical Science). DNA was extracted with the Extrap Soil DNA Kit Plus version 2 (Nippon Steel). Total DNA was subjected to library construction with TruSeq DNA PCR-Free (350; Illumina) and to 151-bp paired-end sequencing by HiSeqX, resulting in 660 million paired-end reads, and to PacBio RSII, with SMRT cell 8Pac v3 and DNA Polymerase Binding Kit P6 v2, in Macrogen, resulting in 1.3-Gb subreads. Total RNA was extracted with TRIzol (Sigma-Aldrich) according to the manufacturer’s instruction and was subjected to library construction with TruSeq RNA Sample Prep Kit v2 (Illumina) and 101-bp paired-end sequencing by HiSeq 2500, resulting in 107.5 million paired-end reads.
Genome assembly and construction of gene models
PacBio reads were assembled into contigs using Falcon (version 0.7.0) (18) with a length cutoff of 7000 bp for seed reads and an estimated genome size of 33 Mbp. Genome size estimation was performed on the GenomeScope web server (http://qb.cshl.edu/genomescope/) based on the k-mer frequency distribution of Illumina reads calculated by JellyFish version 2.2.6 with a k-mer size of 21. The resultant primary and associate contigs were then subjected to Falcon_unzip (version 0.5.0) (18), generating partially haplotype-phased contigs (primary contigs) and fully phased contigs (haplotigs). The assembly was polished using PacBio reads and Quiver program, followed by single-nucleotide polymorphism (SNP) and short insertion-deletion (indel) error correction using Pilon (version 1.2.2) with Illumina reads mapped by the Burrows-Wheeler Aligner (version 0.7.15) (19). Indel errors in the vicinity of hetero-SNPs were further fixed manually, as they were difficult to be automatically corrected. Contigs derived from plastid and mitochondrial genomes were identified using BLASTN and separated from contigs derived from the nuclear genome.
RNA sequencing reads were trimmed under the parameters of ILLUMINACLIP:TruSeq3-PE.fa:2:30:10, LEADING:20 TRAILING:20, SLIDINGWINDOW:4:15, and MINLEN:75 using Trimmomatic (version 0.36) (54). The trimmed reads aligned to the assembled contigs using HISAT2 (version 2.0.4) (55). They were provided to BRAKER2 gene annotation pipeline (version 2.0.3) (20) as training data to be used for ab initio prediction of protein-coding genes. In addition, PASA (version 2.3.3) (56) was used to generate transcript-based gene models by integrating de novo transcriptome assembly and genome-guided assembly using Trinity (version 2.5.0) (57). The genome-guided assembly used the mapping result from HISAT2 with --dta option. TransDecoder (version 5.0.2) (58) was used to extract protein coding regions from PASA result with the alignment files from BlastP (version 2.7.1) (59) against UniRef90 with -evalue 1e-5 option and hmmscan (http://hmmer.org/, version 3.1b2) against Pfam (60) database.
The gene models that overlapped with the results from BRAKER were removed using BlastP with evalue 1e-5 option, and the remaining gene models were merged with the BRAKER gene models to generate the final gene annotation. Transposable elements in the NIES-4239 genome were searched by RepeatMasker (version 4.9.0) using Dfam3.1 and RepBase-20170127 as reference repeat libraries (61). The predicted gene set was available in Dryad (https://doi.org/10.5061/dryad.j3tx95xft).
The integrity of gene annotation was assessed by BUSCO (version 3.0.2) (21) and the Eukaryota odb9 (version 2) dataset. The manipulation of SAM/BAM file was used by SAMtools (version 1.9). The sequence files of gene region from gff file were used by GffRead (version 0.9.11) (62).
Organellar genome annotation was performed by comparison with previously sequenced organellar genomes of nonphotosynthetic diatoms (24). Gene sets and their arrangements of the plastid and mitochondrial genomes sequenced in this study were found to be identical to previously sequenced nonphotosynthetic diatoms (24). Assembled genomes were deposited to DNA Data Bank of Japan (http://getentry.ddbj.nig.ac.jp/) under the accession numbers BLYE01000001 to BLYE01000234 for the nuclear genome, LC600866 for the mitochondrial genome, and LC600867 for the plastid genome.
Functional annotation
The predicted protein coding genes were annotated using InterProScan, and RPS-BLAST search was performed against KEGG orthology database (63, 64). KO identifiers for Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways were assigned using KEGG Automatic Annotation Server (65). Transporter proteins were annotated with TransportTP (66) followed by manual curation. Reference proteome datasets for three photosynthetic diatom species were obtained from the JGI Genome Portal: P. tricornutum CCAP 1055/1 v2.0 (Phatr2_bd_unmapped_GeneModels_FilteredModels1_aa.fasta and Phatr2_chromosomes_geneModels_FilteredModels2_aa.fasta, 10,402 protein sequences in total), T. pseudonana CCMP 1335 (Thaps3_bd_unmapped_GeneModels_FilteredModels1_aa.fasta and Thaps3_chromosomes_geneModels_FilteredModels2_aa.fasta, 11,776 sequences), and F. cylindrus CCMP 1102 (Fracy1_GeneModels_FilteredModels3_aa.fasta, 21,066 sequences). KEGG and KOG annotation was performed with them in the same manner as NIES-4239. Other details for annotation of CAZyme, cyclins, cyclin-dependent kinases, bZIP transcription factors, photoreceptor proteins, mitochondrial proteins, plastid proteins, and secretome proteins are described in the Supplementary Materials. Evolutionary analyses, comparative transcriptome analyses under the 12-hour light and 12-hour dark condition, those in different carbon sources, and biochemical experiments for lipids, fatty acids, and quinones are also described in the Supplementary Materials. Transcriptome data obtained in this study were deposited to DNA Data Bank of Japan (https://ddbj.nig.ac.jp/resource/bioproject/PRJDB11016 and https://ddbj.nig.ac.jp/resource/bioproject/PRJDB12553).
Acknowledgments
We thank F. Hayashi for the manual correction of the genome assembly and the gene annotation for the secretome. Computational work was, in part, performed at the SuperComputer System, Institute for Chemical Research, Kyoto University, and the NIG supercomputer at the National Institute of Genetics.
Funding: This work was supported by JSPS Grants-in-Aid for Scientific Research (B) (19H03274 to R.K., 20H03305 to T.N., and 17H03723 to G.T.); JSPS Grants-in-Aid for Early-Career Scientists (20K15783 to Y.T.); JSPS Grant-in-Aid for Challenging Research (Exploratory) (21K19303 to G.T.); JSPS Grants-in-Aid for Scientific Research (A) (18H03743 to Y.K.); JSPS Grants-in-Aid for Scientific Research on Innovative Areas, Platform for Advanced Genome Science (16H06279 to R.K., T.Moch, M.S., Y.T., and Y.N.); NERC (NE/R000883/1 to T. Mock, C.v.O., A.T., K.S., and S.S.); School of Environmental Sciences at the University of East Anglia, UK (to T. Mock, C.v.O., A.T., K.S., and S.S.); The Gatsby Charitable Foundation and Biotechnology and Biological Sciences Research Council (BBSRC) (to S.K. and J.W.); and NIG-JOINT (7A2017, 6A2018, and 30A2019 to R.K.).
Author contributions: Conceptualization: R.K., G.T., T. Mock, and Y.N. Methodology: R.K., T. Mock, and Y.N. Investigation: R.K., Y.T., T.Moch., M.S., G.T., Y.N., U.C., B.H., V.L., D.M., S.S., K.S., A.T., C.v.O., T. Mock, K.O., K.-i.I., M.Kay., T.A., Y.K., and J.W. Visualization: R.K., R.O., K.K., T.N., U.C., S.S., K.S., A.T., and C.v.O. Supervision: S.-y.M., M.Kat., H.M., S.M., and S.K. Writing—original draft: R.K., C.v.O., and T. Mock. Writing—review and editing: All the authors.
Competing interests: The authors declare that they have no competing interests.
Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. Additional data related to this paper are available in DDBJ and Dryad as described in Methods.
Supplementary Materials
This PDF file includes:
Supplementary Text
Figs. S1 to S15
Tables S1 to S3
References
REFERENCES AND NOTES
- 1.Freesec J. M., Lane C. E., Parasitism finds many solutions to the same problems in red algae (Florideophyceae, Rhodophyta). Mol. Biochem. Parasitol. 214, 105–111 (2017). [DOI] [PubMed] [Google Scholar]
- 2.Hadariová L., Vesteg M., Hampl V., Krajčovič J., Reductive evolution of chloroplasts in non-photosynthetic plants, algae and protists. Curr. Genet. 64, 365–387 (2018). [DOI] [PubMed] [Google Scholar]
- 3.Janouškovec J., Paskerova G. G., Miroliubova T. S., Mikhailov K. V., Birley T., Aleoshin V. V., Simdyanov T. G., Apicomplexan-like parasites are polyphyletic and widely but selectively dependent on cryptic plastid organelles. eLife 8, e49662 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kissinger J. C., Brunk B. P., Crabtree J., Fraunholz M. J., Gajria B., Milgram A. J., Pearson D. S., Schug J., Bahl A., Diskin S. J., Ginsburg H., Grant G. R., Gupta D., Labo P., Li L., Mailman M. D., McWeeney S. K., Whetzel P., Stoeckert C. J. Jr., Roos D. S., The Plasmodium genome database. Nature 419, 490–492 (2002). [DOI] [PubMed] [Google Scholar]
- 5.Kamikawa R., Yubuki N., Yoshida M., Taira M., Nakamura N., Ishida K., Leander B. S., Miyashita H., Hashimoto T., Mayama S., Inagaki Y., Multiple losses of photosynthesis in Nitzschia (Bacillariophyceae). Phycol. Res. 63, 19–28 (2015). [Google Scholar]
- 6.Dorrell R. G., Azuma T., Nomura M., de Kerdrel G. A., Paoli L., Yang S., Bowler C., Ishii K., Miyashita H., Gile G. H., Kamikawa R., Principles of plastid reductive evolution illuminated by nonphotosynthetic chrysophytes. Proc. Natl. Acad. Sci. U.S.A. 116, 6914–6923 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kayama M., Chen J. F., Nakada T., Nishimura Y., Shikanai T., Azuma T., Miyashita H., Takaichi S., Kashiyama Y., Kamikawa R., A non-photosynthetic green alga illuminates the reductive evolution of plastid electron transport systems. BMC Biol. 18, 126 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Kayama M., Maciszewski K., Yabuki A., Miyashita H., Karnkowska A., Kamikawa R., Highly reduced plastid genomes of the non-photosynthetic dictyochophyceans Pteridomonas spp. (Ochrophyta, SAR) are retained for tRNA-Glu-based organellar heme biosynthesis. Front. Plant Sci. 11, 602455 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Van Valen L., Molecular evolution as predicted by natural selection. J. Mol. Evol. 3, 89–101 (1974). [DOI] [PubMed] [Google Scholar]
- 10.Sun G., Xu Y., Liu H., Sun T., Zhang J., Hettenhausen C., Shen G., Qi J., Qin Y., Li J., Wang L., Chang W., Guo Z., Baldwin I. T., Wu J., Large-scale gene losses underlie the genome evolution of parasitic plant Cuscuta australis. Nat. Commun. 9, 2683 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Li C. W., Volcani B. E., Four new apochlorotic diatoms. Br. Phycol. J. 22, 375–382 (1987). [Google Scholar]
- 12.Field C. B., Behrenfeld M. J., Randerson J. T., Falkowski P., Primary production of the biosphere: Integrating terrestrial and oceanic components. Science 281, 237–240 (1998). [DOI] [PubMed] [Google Scholar]
- 13.Mann D. G., The species concept in diatoms. Phycologia 38, 437–495 (1999). [Google Scholar]
- 14.Armbrust E. V., Berges J. A., Bowler C., Green B. R., Martinez D., Putnam N. H., Zhou S., Allen A. E., Apt K. E., Bechner M., Brzezinski M. A., Chaal B. K., Chiovitti A., Davis A. K., Demarest M. S., Detter J. C., Glavina T., Goodstein D., Hadi M. Z., Hellsten U., Hildebrand M., Jenkins B. D., Jurka J., Kapitonov V. V., Kröger N., Lau W. W. Y., Lane T. W., Larimer F. W., Lippmeier J. C., Lucas S., Medina M., Montsant A., Obornik M., Parker M. S., Palenik B., Pazour G. J., Richardson P. M., Rynearson T. A., Saito M. A., Schwartz D. C., Thamatrakoln K., Valentin K., Vardi A., Wilkerson F. P., Rokhsar D. S., The genome of the diatom Thalassiosira pseudonana: Ecology, evolution, and metabolism. Science 306, 79–86 (2004). [DOI] [PubMed] [Google Scholar]
- 15.Bowler C., Allen A. E., Badger J. H., Grimwood J., Jabbari K., Kuo A., Maheswari U., Martens C., Maumus F., Otillar R. P., Rayko E., Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T., Valentin K., Verret F., Berges J. A., Brownlee C., Cadoret J.-P., Chiovitti A., Choi C. J., Coesel S., De Martino A., Detter J. C., Durkin C., Falciatore A., Fournet J., Haruta M., Huysman M. J. J., Jenkins B. D., Jiroutova K., Jorgensen R. E., Joubert Y., Kaplan A., Kröger N., Kroth P. G., La Roche J., Lindquist E., Lommer M., Martin-Jézéquel V., Lopez P. J., Lucas S., Mangogna M., McGinnis K., Medlin L. K., Montsant A., Secq M.-P. O.-L., Napoli C., Obornik M., Parker M. S., Petit J.-L., Porcel B. M., Poulsen N., Robison M., Rychlewski L., Rynearson T. A., Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M. R., Taylor A. R., Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L. S., Rokhsar D. S., Weissenbach J., Armbrust E. V., Green B. R., Van de Peer Y., Grigoriev I. V., The Phaeodactylum genome reveals the evolutionary history of diatom genomes. Nature 456, 239–244 (2008). [DOI] [PubMed] [Google Scholar]
- 16.Mock T., Otillar R. P., Strauss J., McMullan M., Paajanen P., Schmutz J., Salamov A., Sanges R., Toseland A., Ward B. J., Allen A. E., Dupont C. L., Frickenhaus S., Maumus F., Veluchamy A., Wu T., Barry K. W., Falciatore A., Ferrante M. I., Fortunato A. E., Glöckner G., Gruber A., Hipkin R., Janech M. G., Kroth P. G., Leese F., Lindquist E. A., Lyon B. R., Martin J., Mayer C., Parker M., Quesneville H., Raymond J. A., Uhlig C., Valas R. E., Valentin K. U., Worden A. Z., Armbrust E. V., Clark M. D., Bowler C., Green B. R., Moulton V., van Oosterhout C., Grigoriev I. V., Evolutionary genomics of the cold-adapted diatom Fragilariopsis cylindrus. Nature 541, 536–540 (2017). [DOI] [PubMed] [Google Scholar]
- 17.Vurture G. W., Sedlazeck F. J., Nattestad M., Underwood C. J., Fang H., Gurtowski J., Schatz M. C., GenomeScope: Fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chin C. S., Peluso P., Sedlazeck F. J., Nattestad M., Concepcion G. T., Clum A., Dunn C., O’Malley R., Figueroa-Balderas R., Morales-Cruz A., Cramer G. R., Delledonne M., Luo C., Ecker J. R., Cantu D., Rank D. R., Schatz M. C., Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13, 1050–1054 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Walker B. J., Abeel T., Shea T., Priest M., Abouelliel A., Sakthikumar S., Cuomo C. A., Zeng Q., Wortman J., Young S. K., Earl A. M., Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLOS ONE 9, e112963 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hoff K. J., Lange S., Lomsadze A., Borodovsky M., Stanke M., BRAKER1: Unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics 32, 767–769 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Waterhouse R. M., Seppey M., Simão F. A., Manni M., Ioannidis P., Klioutchnikov G., Kriventseva E. V., Zdobnov E. M., BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 35, 543–548 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Gruber A., Rocap G., Kroth P. G., Armbrust E. V., Mock E. V. T., Plastid proteome prediction for diatoms and other algae with secondary plastids of the red lineage. Plant J. 81, 519–528 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Emms D. M., Kelly S., OrthoFinder: Solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 16, 157 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kamikawa R., Tanifuji G., Ishikawa S. A., Ishii K., Matsuno Y., Onodera N. T., Ishida K., Hashimoto T., Miyashita H., Mayama S., Inagaki Y., Proposal of a twin arginine translocator system-mediated constraint against loss of ATP synthase genes from nonphotosynthetic plastid genomes. Mol. Biol. Evol. 32, 2598–2604 (2015). [DOI] [PubMed] [Google Scholar]
- 25.Moog D., Nozawa A., Tozawa Y., Kamikawa R., Substrate specificity of plastid phosphate transporters in a non-photosynthetic diatom and its implication in evolution of red alga-derived complex plastids. Sci. Rep. 10, 1167 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kamikawa R., Moog D., Zauner S., Tanifuji G., Ishida K., Miyashita H., Mayama H., Hashimoto T., Maier U. G., Archibald J. M., Inagaki Y., A non-photosynthetic diatom reveals early steps of reductive evolution in plastids. Mol. Biol. Evol. 34, 2355–2366 (2017). [DOI] [PubMed] [Google Scholar]
- 27.Allen A. E., Dupont C. L., Oborník M., Horák A., Nunes-Nesi A., McCrow J. P., Zheng H., Johnson D. A., Hu H., Fernie A. R., Bowler C., Evolution and metabolic significance of the urea cycle in photosynthetic diatoms. Nature 473, 203–207 (2011). [DOI] [PubMed] [Google Scholar]
- 28.Smith S. R., Dupont C. L., McCarthy J. K., Broddrick J. T., Oborník M., Horák A., Füssy Z., Cihlář J., Kleessen S., Zheng H., McCrow J. P., Hixson K. K., Araújo W. L., Nunes-Nesi A., Fernie A., Nikoloski Z., Palsson B. O., Allen A. E., Evolution and regulation of nitrogen flux through compartmentalized metabolic networks in a marine diatom. Nat. Commun. 10, 4552 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ashworth J., Coesel S., Lee A., Armbrust E. V., Orellana M. V., Baliga N. S., Genome-wide diel growth state transitions in the diatom Thalassiosira pseudonana. Proc. Natl. Acad. Sci. U.S.A. 110, 7518–7523 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Chauton M. S., Winge P., Brembu T., Vadstein O., Bones A. M., Gene regulation of carbon fixation, storage, and utilization in the diatom Phaeodactylum tricornutum acclimated to light/dark cycles. Plant Physiol. 161, 1034–1048 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Smith S. R., Gillard J. T. F., Kustka A. B., McCrow J. P., Badger J. H., Zheng H., New A. M., Dupont C. L., Obata T., Fernie A. R., Allen A. E., Correction: Transcriptional orchestration of the global cellular response of a model pennate diatom to diel light cycling under iron limitation. PLOS Genet. 13, e1006688 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Huysman M. J. J., Martens C., Vandepoele K., Gillard J., Rayko E., Heijde M., Bowler C., Inzé D., Van de Peer Y., De Veylder L., Vyverman W., Genome-wide analysis of the diatom cell cycle unveils a novel type of cyclins involved in environmental signaling. Genome Biol. 11, R17 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Annunziata R., Ritter A., Emidio Fortunato A., Manzotti A., Cheminant-Navarro S., Agier N., Huysman M. J. J., Winge P., Bones A. M., Bouget F., Cosentino Lagomarsino M., Bouly J., Falciatore A., bHLH-PAS protein RITMO1 regulates diel biological rhythms in the marine diatom Phaeodactylum tricornutum. Proc. Natl. Acad. Sci. U.S.A. 116, 13137–13142 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kroth P. G., Wilhelm C., Kottke T., An update on aureochromes: Phylogeny–mechanism–function. J. Plant Physiol. 217, 20–26 (2017). [DOI] [PubMed] [Google Scholar]
- 35.Coesel S., Mangogna M., Ishikawa T., Heijde M., Rogato A., Finazzi G., Todo T., Bowler C., Falciatore A., Diatom PtCPF1 is a new cryptochrome/photolyase family member with DNA repair and transcription regulation activity. EMBO Rep. 10, 655–661 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Fortunato A. E., Jaubert M., Enomoto G., Bouly J., Raniello R., Thaler M., Malviya M., Bernardes J. S., Rappaport F., Gentili B., Huysman M. J. J., Carbone A., Bowler C., Ribera d’Alcalà M., Ikeuchi M., Falciatore A., Diatom phytochromes reveal the existence of far-red-light-based sensing in the ocean. Plant Cell 28, 616–628 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Rayko E., Maumus F., Maheswari U., Jabbari K., Bowler C., Transcription factor families inferred from genome sequences of photosynthetic stramenopiles. New Phytol. 188, 52–66 (2010). [DOI] [PubMed] [Google Scholar]
- 38.Kong G., Chen Y., Deng Y., Feng D., Jiang L., Wan L., Li M., Jiang Z., Xi P., The basic leucine zipper transcription factor PlBZP32 associated with the oxidative stress response is critical for pathogenicity of the lychee downy blight oomycete Peronophythora litchi. mSphere 5, e00261-20 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Fujiwara T., Hirooka S., Ohbayashi O., Onuma R., Miyagishima S., Relationship between cell cycle and diel transcriptomic changes in metabolism in a unicellular red alga. Plant Physiol. 183, 1484–1501 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Richards T. A., Talbot N. J., Horizontal gene transfer in osmotrophs: Playing with public goods. Nat. Rev. Microbiol. 11, 720–727 (2013). [DOI] [PubMed] [Google Scholar]
- 41.Richards T. A., Talbot N. J., Osmotrophy. Curr. Biol. 28, R1179–R1180 (2018). [DOI] [PubMed] [Google Scholar]
- 42.Ishii K., Kamikawa R., Growth characterization of non-photosynthetic diatoms, Nitzschia spp., inhabiting estuarine mangrove forests of Ishigaki Island, Japan. Plankton Benthos Res. 12, 164–170 (2017). [Google Scholar]
- 43.Enright A. J., Van Dongen S., Ouzounis C. A., An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30, 1575–1584 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Belkhadir Y., Yang L., Hetzel J., Dangl J. L., Chory J., The growth–defense pivot: Crisis management in plants mediated by LRR-RK surface receptors. Trends Biochem. Sci. 39, 447–456 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Karsten U., Mostaert A. S., King R. J., Kamiya M., Hara Y., Osmoprotectors in some species of Japanese mangrove macroalgae. Phycol. Res. 44, 109–112 (1996). [Google Scholar]
- 46.Wolf Y. I., Koonin E. V., Genome reduction as the dominant mode of evolution. Bioessays 35, 829–837 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.M. Nei, S. Kumar S, Molecular Evolution and Phylogenetics (Oxford Univ. Press, New York, 2000), pp. 17–203 [Google Scholar]
- 48.Morris J. J., Lenski R. E., Zinser E. R., The Black Queen hypothesis: Evolution of dependencies through adaptive gene loss. MBio 3, e00036-12 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Ottesen E. A., Young C. R., Gifford S. M., Eppley J. M., Marin R. III, Schuster S. C., Scholin C. A., DeLong E. F., Multispecies diel transcriptional oscillations in open ocean heterotrophic bacterial assemblages. Science 345, 207–212 (2014). [DOI] [PubMed] [Google Scholar]
- 50.Aylward F. O., Eppley J. M., Smith J. M., Chavez F. P., Scholin C. A., DeLong E. F., Microbial community transcriptional networks are conserved in three domains at ocean basin scales. Proc. Natl. Acad. Sci. U.S.A. 112, 5443–5448 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Frischkorn K. R., Haley S. T., Dyhrman S. T., Coordinated gene expression between Trichodesmium and its microbiome over day–night cycles in the North Pacific Subtropical Gyre. ISME J. 12, 997–1007 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ottesen E. A., Young C. R., Eppley J. M., Ryan J. P., Chavez F. P., Scholin C. A., DeLong E. F., Pattern and synchrony of gene expression among sympatric marine microbial populations. Proc. Natl. Acad. Sci. U.S.A. 110, E488–E497 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Hernández Limón M. D., Hennon G. M. M., Harke M. J., Frischkorn K. R., Haley S. T., Dyhrman S. T., Transcriptional patterns of Emiliania huxleyi in the North Pacific Subtropical Gyre reveal the daily rhythms of its metabolic potential. Environ. Microbiol. 22, 381–396 (2020). [DOI] [PubMed] [Google Scholar]
- 54.Bolger A. M., Lohse M. A., Usadel B., Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Kim D., Langmead B., Salzberg S. L., HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Haas B. J., Delcher A. L., Mount S. M., Wortman J. R., Smith R. K. Jr., Hannick L. I., Maiti R., Ronning C. M., Rusch D. B., Town C. D., Salzberg S. L., White O., Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Grabherr M. G., Haas B. J., Yassour M., Levin J. Z., Thompson D. A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q., Chen Z., Mauceli E., Hacohen N., Gnirke A., Rhind N., di Palma F., Birren B. W., Nusbaum C., Lindblad-Toh K., Friedman N., Regev A., Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Haas B. J., Papanicolaou A., Yassour M., Grabherr M., Blood P. D., Bowden J., Brian Couger M., Eccles D., Li B., Lieber M., MacManes M. D., Ott M., Orvis J., Pochet N., Strozzi F., Weeks N., Westerman R., William T., Dewey C. N., Henschel R., LeDuc R. D., Friedman N., Regev A., De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protocol 8, 1494–1512 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., Madden T. L., BLAST+: Architecture and applications. BMC Bioinformatics 10, 421 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.El-Gebali S., Mistry J., Bateman A., Eddy S. R., Luciani A., Potter S. C., Qureshi M., Richardson L. J., Salazar G. A., Smart A., Sonnhammer E. L. L., Hirsh L., Paladin L., Piovesan D., Tosatto S. C. E., Finn R. D., The Pfam protein families database in 2019. Nucleic Acids Res. 47, D427–D432 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Tarailo-Graovac M., Chen N., Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 25, 4.10.1–4.10.14 (2009). [DOI] [PubMed] [Google Scholar]
- 62.Pertea G., Pertea M., GFF utilities: GffRead and GffCompare. F1000Research 9, 304 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Jones P., Binns D., Chang H.-Y., Fraser M., Li W., McAnulla C., McWilliam H., Maslen J., Mitchell A., Nuka G., Pesseat S., Quinn A. F., Sangrador-Vegas A., Scheremetjew M., Yong S.-Y., Lopez R., Hunter S., InterProScan 5: Genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Marchler-Bauer A., Bryant S. H., CD-Search: Protein domain annotations on the fly. Nucleic Acids Res. 32, W327–W331 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Moriya Y., Itoh M., Okuda S., Yoshizawa A. C., Kanehisa M., KAAS: An automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 35, W182–W185 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Li H., Benedito V. A., Udvardi M. K., Zhao P. X., TransportTP: A two-phase classification approach for membrane transporter prediction and characterization. BMC Bioinformatics 10, 418 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Lombard V., Golaconda Ramulu H., Drula E., Coutinho P. M., Henrissat B., The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 42, D490–D495 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Altschul S. F., Madden T. L., Schäffer A. A., Zhang J., Zhang Z., Miller W., Lipman D. J., Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Mistry J., Finn R. D., Eddy S. R., Bateman A., Punta M., Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 41, e121 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Curtis B. A., Tanifuji G., Burki F., Gruber A., Irimia M., Maruyama S., Arias M. C., Ball S. G., Gile G. H., Hirakawa Y., Hopkins J. F., Kuo A., Rensing S. A., Schmutz J., Symeonidi A., Elias M., Eveleigh R. J. M., Herman E. K., Klute M. J., Nakayama T., Oborník M., Reyes-Prieto A., Armbrust E. V., Aves S. J., Beiko R. G., Coutinho P., Dacks J. B., Durnford D. G., Fast N. M., Green B. R., Grisdale C. J., Hempel F., Henrissat B., Höppner M. P., Ishida K.-I., Kim E., Kořený L., Kroth P. G., Liu Y., Malik S.-B., Maier U. G., McRose D., Mock T., Neilson J. A. D., Onodera N. T., Poole A. M., Pritham E. J., Richards T. A., Rocap G., Roy S. W., Sarai C., Schaack S., Shirato S., Slamovits C. H., Spencer D. F., Suzuki S., Worden A. Z., Zauner S., Barry K., Bell C., Bharti A. K., Crow J. A., Grimwood J., Kramer R., Lindquist E., Lucas S., Salamov A., McFadden G. I., Lane C. E., Keeling P. J., Gray M. W., Grigoriev I. V., Archibald J. M., Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs. Nature 492, 59–65 (2012). [DOI] [PubMed] [Google Scholar]
- 71.Cenci U., Sibbald S. J., Curtis B. A., Kamikawa R., Eme L., Moog D., Henrissat B., Maréchal E., Chabi M., Djemiel C., Roger A. J., Kim E., Archibald J. M., Nuclear genome sequence of the plastid-lacking cryptomonad Goniomonas avonlea provides insights into the evolution of secondary plastids. BMC Biol. 16, 137 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Bray J. R., Curtis J. T., An ordination of the upland forest communities of Southern Wisconsin. Ecol. Monogr. 27, 325–349 (1957). [Google Scholar]
- 73.Ward J. H. Jr., Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58, 236–244 (1963). [Google Scholar]
- 74.J. Oksanen, Multivariate Analysis of Ecological Communities in R: Vegan tutorial (2015); https://www.mooreecology.com/uploads/2/4/2/1/24213970/vegantutor.pdf.
- 75.Katoh K., Standley D. M., MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Hall T. A., BioEdit: A user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucleic Acids Symp. Ser. 41, 95–98 (1999). [Google Scholar]
- 77.Nguyen L. T., Schmidt H. A., Von Haeseler A., Minh B. Q., IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Davis A., Abbriano R., Smith S. R., Hildebrand M., Clarification of photorespiratory processes and the role of malic enzyme in diatoms. Protist 168, 134–153 (2017). [DOI] [PubMed] [Google Scholar]
- 79.Gonzalez N. H., Felsner G., Schramm F. D., Klingl A., Maier U. G., Bolte K., A single peroxisomal targeting signal mediates matrix protein import in diatoms. PLOS ONE 6, e25316 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Mix A. K., Cenci U., Heimerl T., Marter P., Wirkner M. L., Moog D., Identification and localization of peroxisomal biogenesis proteins indicates the presence of peroxisomes in the cryptophyte Guillardia theta and other “Chromalveolates”. Genome Biol. Evol. 10, 2834–2852 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Sidiropoulos K., Viteri G., Sevilla C., Jupe S., Webber M., Orlic-Milacic M., Jassal B., May B., Shamovsky V., Duenas C., Rothfels K., Matthews L., Song H., Stein L., Haw R., D’Eustachio P., Ping P., Hermjakob H., Fabregat A., Reactome enhanced pathway visualization. Bioinformatics 33, 3461–3467 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Fabregat A., Korninger F., Viteri G., Sidiropoulos K., Marin-Garcia P., Ping P., Wu G., Stein L., D’Eustachio P., Hermjakob H., Reactome graph database: Efficient access to complex pathway data. PLOS Comput. Biol. 14, e1005968 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Fabregat A., Sidiropoulos K., Viteri G., Marin-Garcia P., Ping P., Stein L., D’Eustachio P., Hermjakob H., Reactome diagram viewer: Data structures and strategies to boost performance. Bioinformatics 34, 1208–1214 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Jassal B., Matthews L., Viteri G., Gong C., Lorente P., Fabregat A., Sidiropoulos K., Cook J., Gillespie M., Haw R., Loney F., May B., Milacic M., Rothfels K., Sevilla C., Shamovsky V., Shorser S., Varusai T., Weiser J., Wu G., Stein L., Hermjakob H., D’Eustachio P., The reactome pathway knowledgebase. Nucleic Acids Res. 48, D498–D503 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.UniProt Consortium , UniProt: A worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J., Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990). [DOI] [PubMed] [Google Scholar]
- 87.Aramaki T., Blanc-Mathieu R., Endo H., Ohkubo K., Kanehisa M., Goto S., Ogata H., KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 36, 2251–2252 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Fukasawa Y., Tsuji J., Fu S. C., Tomii K., Horton P., Imai K., MitoFates: Improved prediction of mitochondrial targeting sequences and their cleavage sites. Mol. Cell. Proteomics 14, 1113–1126 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Kume K., Amagasa T., Hashimoto T., Kitagawa H., NommPred: Prediction of mitochondrial and mitochondrion-related organelle proteins of nonmodel organisms. Evol. Bioinform. Online 14, 1176934318819835 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Petersen T. N., Brunak S., von Heijne G., Nielsen H., SignalP 4.0: Discriminating signal peptides from transmembrane regions. Nat. Methods 8, 785–786 (2011). [DOI] [PubMed] [Google Scholar]
- 91.Kanehisa M., Goto S., Sato Y., Furumichi M., Tanabe M., KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–D114 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Sonnhammer E. L., von Heijne G., Krogh A., A hidden Markov model for predicting transmembrane helices in protein sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 6, 175–182 (1998). [PubMed] [Google Scholar]
- 93.Krogh A., Larsson B., von Heijne G., Sonnhammer E. L., Predicting transmembrane protein topology with a hidden markov model: Application to complete genomes. J. Mol. Biol. 305, 567–580 (2001). [DOI] [PubMed] [Google Scholar]
- 94.Bruckner C. G., Rehm C., Grossart H. P., Kroth P. G., Growth and release of extracellular organic compounds by benthic diatoms depend on interactions with bacteria. Environ. Microbiol. 13, 1052–1063 (2011). [DOI] [PubMed] [Google Scholar]
- 95.Buhmann M. T., Schulze B., Förderer A., Schleheck D., Kroth P. G., Bacteria may induce the scretion of mucin-like proteins by the diatom Phaeodactylum tricornutum. J. Phycol. 52, 463–474 (2016). [DOI] [PubMed] [Google Scholar]
- 96.Lachnit M., Buhmann M. T., Klemm J., Kröger N., Poulsen N., Identification of proteins in the adhesive trails of the diatom Amphora coffeaeformis. Philos. Trans. R. Soc. B 374, 20190196 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Dell’Aquila G., Zauner S., Heimerl T., Kahnt J., Samel-Gondesen V., Runge S., Hempel F., Maier U. G., Mobilization and cellular distribution of phosphate in the diatom Phaeodactylum tricornutum. Front. Plant Sci. 11, 579 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Bligh E. G., Dyer W. J., A rapid method of total lipid extraction and purification. Can. J. Biochem. Physiol. 37, 911–917 (1959). [DOI] [PubMed] [Google Scholar]
- 99.Mitani E., Nakayama F., Matsuwaki I., Ichi I., Kawabata A., Kawachi M., Kato M., Fatty acid composition profiles of 235 strains of three microalgal divisions within the NIES microbial culture collection. Microb. Resour. Syst. 33, 19–29 (2017). [Google Scholar]
- 100.Wright R. S., A reagent for the non-destructive location of steroids and some other lipophilic materials on silica gel thin-layer chromatograms. J. Chromatogr. 59, 220–221 (1971). [DOI] [PubMed] [Google Scholar]
- 101.Sato N., Lipids in Cryptomonas CR-1. I. Occurrence of betaine lipids. Plant Cell. Physiol. 32, 819–825 (1991). [Google Scholar]
- 102.Allen C. F., Good P., [48] Acyl lipids in photosynthetic systems. Methods Enzymol. 23, 523–547 (1971). [Google Scholar]
- 103.Johnson L. K., Alexander H., Brown C. T., Re-assembly, quality evaluation, and annotation of 678 microbial eukaryotic reference transcriptomes. GigaScience 8, giy158 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Criscuolo A., Gribaldo S., BMGE (Block Mapping and Gathering with Entropy): A new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 10, 210 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Langmead B., Salzberg S. L., Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R.; 1000 Genome Project Data Processing Subgroup , The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Quinlan A. R., Hall I. M., BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Ihaka R., Gentleman R., R: A language for data analysis and graphics. J. Comp. Graph. Stat. 5, 299–314 (1996). [Google Scholar]
- 109.de Hoon M. J. L., Imoto S., Nolan J., Miyano S., Open source clustering software. Bioinformatics 20, 1453–1454 (2004). [DOI] [PubMed] [Google Scholar]
- 110.Saldanha A. J., Java Treeview - Extensible visualization of microarray data. Bioinformatics 20, 3246–3248 (2004). [DOI] [PubMed] [Google Scholar]
- 111.Löytynoja A., Phylogeny-aware alignment with PRANK. Methods Mol. Biol. 1079, 155–170 (2014). [DOI] [PubMed] [Google Scholar]
- 112.Capella-Gutiérrez S., Silla-Martínez J. M., Gabaldón T., trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Talavera G., Castresana J., Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst. Biol. 56, 564–577 (2007). [DOI] [PubMed] [Google Scholar]
- 114.Bouckaert R., Vaughan T. G., Barido-Sottani J., Duchêne S., Fourment M., Gavryushkina A., Heled J., Jones G., Kühnert D., De Maio N., Matschiner M., BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLOS Comput. Biol. 15, e1006650 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Höhna S., May M. R., Moore B. R., TESS: An R package for efficiently simulating phylogenetic trees and performing Bayesian inference of lineage diversification rates. Bioinformatics 32, 789–791 (2016). [DOI] [PubMed] [Google Scholar]
- 116.Yang Z., Wong W. S. W., Nielsen R., Bayes empirical bayes inference of amino acid sites under positive selection. Mol. Biol. Evol. 22, 1107–1118 (2005). [DOI] [PubMed] [Google Scholar]
- 117.Yang Z., PAML 4: Phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007). [DOI] [PubMed] [Google Scholar]
- 118.Pombert J.-F., Blouin N. A., Lane C., Boucias D., Keeling P. J., A lack of parasitic reduction in the obligate parasitic green alga Helicosporidium. PLOS Genet. 10, e1004355 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Suzuki S., Endoh R., Manabe R., Ohkuma M., Hirakawa Y., Multiple losses of photosynthesis and convergent reductive genome evolution in the colourless green algae Prototheca. Sci. Rep. 8, 940 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Durkin C. A., Koester J. A., Bender S. J., Armbrust E. V., The evolution of silicon transporters in diatoms. J. Phycol. 52, 716–731 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary Text
Figs. S1 to S15
Tables S1 to S3
References