Abstract
Marine microeukaryotes have evolved diverse cellular features that link their life histories to surrounding environments. How those dynamic life histories intersect with the ecological functions of microeukaryotes remains a frontier to understand their roles in essential biogeochemical cycles1,2. Choanoflagellates, phagotrophs that cycle nutrients through filter feeding, provide models to explore this intersection, for many choanoflagellate species transition between life history stages by differentiating into distinct cell types3–6. Here we report that cell differentiation in the marine choanoflagellate Salpingoeca rosetta endows one of its cell types with the ability to utilize insoluble ferric colloids for improved growth through the expression of a cytochrome b561 iron reductase (cytb561a). This gene is an ortholog of the mammalian duodenal cytochrome b561 (DCYTB) that reduces ferric cations prior to their uptake in gut epithelia7 and is part of an iron utilization toolkit that choanoflagellates and their closest living relatives, the animals, inherited from a last common eukaryotic ancestor. In a database of oceanic metagenomes8,9, the abundance of cytb561a transcripts from choanoflagellates positively correlates with upwellings, which are a major source of ferric colloids in marine environments10. As this predominant form of iron11,12 is largely inaccessible to cell-walled microbes13,14, choanoflagellates and other phagotrophic eukaryotes may serve critical ecological roles by first acquiring ferric colloids through phagocytosis and then cycling this essential nutrient through iron utilization pathways13–15. These findings provide insight into the ecological roles choanoflagellates perform and inform reconstructions of early animal evolution where functionally distinct cell types became an integrated whole at the origin of animal multicellularity16–22.
Thecates display a different transcriptome profile than other cell types.
Choanoflagellates transition between different stages of their life history by differentiating into distinct cell types3–5. In the emerging model choanoflagellate S. rosetta, diverse environmental cues promote the transitions between different types of cells6. Under nutrient replete conditions, sessile thecate cells differentiate into motile slow swimmers that can proliferate as single cells or chains of cells connected through intercellular bridges6,23,24. In response to specific bacterial25–27 and algal28 cues, slow swimmers develop into rosettes, which are multicellular colonies that form through serial cell division29. As rosettes or slow swimmers starve, they become fast swimmers, and under prolonged nutrient limitation, fast swimmers differentiate into thecates that can proliferate under similarly harsh conditions6. The recapitulation of these cell type transitions in the laboratory make S. rosetta a promising model to investigate how a marine microeukaryote integrates environmental signals to produce distinct cell types. However, the environmental functions that S. rosetta cell types could perform remain to be uncovered, as choanoflagellate cell types have mostly been defined through their morphological features3,6,29,30 rather than molecular functions.
We refined previous S. rosetta transcriptomes31 to uncover functional differences between cell types reflected in their gene expression profiles. To improve those transcriptomes, we took advantage of methods to stably culture thecate, slow swimmer, rosette, and fast swimmer cell types in monoxenic strains of S. rosetta that feed on the bacterium Echinicola pacifica23,24,32. We also developed a method to preferentially lyse S. rosetta while discarding their feeder bacteria to harvest mRNA from S. rosetta with a single round of poly-A selection (Fig. S1) rather than previous methods that relied on multiple rounds of selection to deplete contaminating RNAs from bacteria1,10. A principal component analysis of these improved transcriptomes emphasized the difference between the thecate cell type compared to slow swimmers, rosettes, and fast swimmers (Fig. 1B) with the first principal component accounting for 86% of the variance and clearly separating thecates from all other cell types. While these results are consistent with previous gene expression profiles that found thecates separated from other types of cells31,34, the replicate measures in our data do not show a clear difference between slow swimmers, rosettes, and fast swimmers, suggesting that these three cell types are phenotypically plastic states.
The contrast between thecates and other cell types prompted us to investigate what unique functions thecates may perform. By comparing thecates to slow swimmers, we identified a set of genes that reproducibly (q < 0.01, note that q is a P-value adjusted for multiple comparisons) displayed a greater than two-fold change in gene expression in thecates. We analyzed this set of genes for common gene ontologies (GO)35–39 that indicated functional categories enriched in thecates, keeping in mind that GO analysis may underestimate functional differences between cell types due to incomplete genome annotations in this emerging model system40. Genes upregulated in thecates were clearly enriched for functional modules (Fig. S2) implicated in cell signaling (e.g. nucleotide binding, protein kinase, protein phosphatase, calcium binding, metal ion transporter, guanylate cyclase and G-protein coupled receptor), gene regulation (e.g. transcription factor and ribosome component), and metabolism (e.g. nucleotide binding, oxidoreductase, glycosyl hydrolase, metal ion transporter, carbohydrate binding, sulfuric ester hydrolase, and ascorbic acid binding). In contrast, swimmers expressed modules for cell division, RNA processing, and oxidative metabolism (Fig. S3). This comparative analysis highlighted a gene (PTSG_09715) with 479-fold higher expression in thecates compared to slow swimmers (q < 10−40) that was annotated as an oxidoreductase (GO:0016491) composed of a single cytochrome b561 iron reductase domain (Pfam 03188). Given the role that oxidoreductases perform in diverse eukaryotes to regulate iron acquisition41–44, the strong induction of PTSG_09715 (henceforth called cytb561a) led us to hypothesize that thecates may have a different capacity to acquire iron from the environment compared to other cell types.
cytb561a expression improves thecate cell growth with ferric colloids.
Of the 2,820 genes that reproducibly displayed a two-fold or greater change in mRNA expression in thecates compared to slow swimmers, the change in cytb561a expression is in the ninety-eighth percentile (Fig. 2A). This large change in cytb561a expression may have been coupled to the thecate cell type program, may have been due to differences in strains to generate thecate and slow swimmer transcriptomes, or may have been an independent response to external nutrients that differed between the media for culturing slow swimmers and thecates for the transcriptomes (see methods). To distinguish among these possibilities, we measured cytb561a mRNA expression in slow swimmers and thecates from the same strain of S. rosetta after standardizing a culturing regime (Fig. 2B and Table S1). In this protocol, slow swimmers and thecates were grown in the same media formulation and then depleted of iron. Afterwards, we reintroduced iron either as soluble ferric EDTA (Fe3+•EDTA) or as insoluble ferric colloids (Fe3+(oxyhydr)oxides)40. Both forms of iron are ecologically important, for models based on environmental sampling predict that 40% of the world’s surface ocean waters rely on colloidal sources of iron and 18% on soluble iron chelates11. With either source of iron, the expression of cytb561a mRNA increased in thecates compared to slow swimmers (Fig. 2C and S4 A–C): 89-fold with ferric EDTA and 66.2-fold with ferric colloids (two-way ANOVA, P < 0.005). This result supports the conclusion that thecates are a stable cell type that expresses cytb561a as part of its differentiation regulon.
To determine the functional consequence of increased cytb561a expression in thecates, we compared the growth of thecates and slow swimmers in the presence of 100 μM ferric EDTA or ferric colloids (Fig. 2D). Thecates grew to a two-fold higher cell density in the presence of ferric colloids (two-way ANOVA, P < 10−4), yet slow swimmers showed no change (two-way ANOVA, P > 0.1). This effect was observed at lower concentrations of iron (50 μM), but higher concentrations of iron (≥200 μM), favored greater cell proliferation in ferric EDTA (Fig. S5A). Thecates also grew better than slow swimmers with other ferric chelates, such as ferric EGTA and ferric pyoverdine (Fig. S5B). Importantly, the differences in S. rosetta growth were unlikely due to indirect effects from the prey bacteria, for E. pacifica grew similarly in all iron conditions, likely due to the small concentration of iron (3.43 ± 1.01 μM) that dissolved from the ferric colloids (Fig. S5 C–E).
The increased proliferation of thecates in the presence of ferric colloids did depend on the expression of cytb561a. With an improved genome editing pipeline (see methods and Fig. S6), we altered cytb561a expression by introducing a premature termination sequence46 at position 151 of cytb561a to incorporate missense mutations and a polyadenylation signal (Fig. S7), which diminished mRNA expression in thecates by 3.6-fold compared to wild-type (Fig. S4D, one-way ANOVA, P = 0.004). This mutant allele, cytb561aPTS, eliminated the difference in cell proliferation between slow swimmers and thecates (Fig. 2D, two-way ANOVA P > 0.1). A comparison of cytb561aPTS and wild-type growth dynamics (Fig. S8) showed that the enhancement of thecate growth with ferric colloids was primarily due to a two-fold larger carrying capacity and a shorter doubling time (17.9 h vs 13.2 h, two-way ANOVA, P = 0.035). To derive iron from ferric colloids for enhanced cell growth, thecates can directly ingest iron particles, as seen in time-lapse images of thecates feeding on ferric colloids embedded with a fluorescent polysaccharide (Figs. 2E & S9). Taken together, these results suggest that the increased expression of cytb561a in thecates confers a cell-type-specific function to assimilate iron from insoluble ferric colloids for faster growth to higher cell densities.
Eukaryotes have broadly conserved a core iron acquisition toolkit.
Although cytb561a expression was necessary for the increased proliferation of thecates when grown with ferric colloids, we were uncertain how this iron reductase factored into the pathway(s) for iron acquisition in S. rosetta, as iron transport pathways have not previously been examined in choanoflagellates. Therefore, we compiled an inventory of iron import and export proteins that have been characterized in diverse model eukaryotes (e.g. humans44, S. cerevisiae47, P. tricornutum41, C. reinhardtii43, Fig. S10) to survey the distribution of those proteins across Holozoa (the taxonomic group that includes Animalia, Choanoflagellata, Filasterea, and Teretosporea) and a curated set of eukaryotes48. We found that nearly all of the proteins for iron import, export, reduction, and oxidation are widely distributed among eukaryotes, suggesting a deep ancestry of iron acquisition pathways from the last common eukaryotic ancestor (Fig. 3A and S11).
Our searches also revealed notable evolutionary patterns that correspond to the unique biology of certain eukaryotic lineages. For example, we found significant losses of iron toolkit genes in some lineages. Apicomplexa, a taxon that includes the intracellular parasites Cryptosporidium parvum, Toxoplasma gondii, and Plasmodium falciparum has a reduced set of iron acquisition genes, consistent with the origin of de novo iron acquisition pathways for siphoning iron from their hosts49. Moreover, we observed that some iron utilization genes are more taxonomically restricted than others: Genes first characterized in ascomycete yeasts were either primarily found in Fungi and Archaeplastida (e.g. iron transporter50) or only present in a few eukaryotes (e.g. iron permease51). Similarly, transferrin and phytotransferrin were primarily found in Holozoa and Archaeplastida, respectively, as previously shown52. Notably, the presence of transferrin in Animalia and Teretosporea indicated that transferrin was present in the last common ancestor of Holozoa, yet the transferrin receptor that facilitates the cellular internalization of iron-bound transferrin appears to have originated in Deuterostomes (Fig. S12), highlighting the origin of a mechanism to share iron between cells in the multicellular bodies of animals. Finally, we also observed that the presence of genes encoding vacuolar iron transporters and ferritin, which are both involved in iron storage, was slightly anticorrelated (r = −0.380, P < 10−8). Overall, this catalog of iron utilization proteins indicates that eukaryotes have variably retained, amplified, or lost genes from an ancestral toolkit for the acquisition, storage, and export of iron.
Choanoflagellates and animals appeared to share a core pathway for acquiring iron from their external environment. Iron acquisition in animals has most extensively been studied in vertebrate gut epithelia53. The apical surface of gut epithelial cells absorbs dietary iron by reducing ferric cations to ferrous cations through the activity of duodenal cytochrome B561 (DCYTB) before the divalent metal ion transporter (DMT1) brings ferrous cations inside of cells. On the basal surface of gut epithelia, ferroportin (FPN) secretes ferrous cations into the bloodstream, where they are oxidized to ferric cations near the membrane by hephaestin (HEPH). In our catalog of iron utilization genes, all choanoflagellates possessed homologs of DCYTB, DMT1, and FPN, but HEPH (Fig. 3A, row 8–ferroxidase type1) was only present in S. rosetta and 9 other species (of 22 surveyed54) – perhaps due to the lack of complete genomes for many of those species. Differences between choanoflagellates and animals were evident in iron storage mechanisms, as choanoflagellates mostly possessed vacuolar iron transport proteins, which have been studied in S. cerevisiae55,56; whereas, animals broadly conserved ferritin. As we searched for homologs of iron acquisition genes in S. rosetta, we found single copies of dmt1, fpn, and heph and identified two vacuolar iron transporters (vit1 and vit2). Three cytochrome b561 genes were present, yet among all the conserved iron utilization genes, only cytb561a exhibited strong differential expression between cell types (Fig. 3B), emphasizing its cell-type-specific role in thecates.
Because the apico-basal localization of iron acquisition genes facilitates the unidirectional import of iron in animal epithelia53, we tested if iron transport in S. rosetta shared a similar architecture (Fig. 3C). To do so, we used genome editing at endogenous loci to introduce an epitope tag (ALFA-tag57) in all the iron utilization genes (Fig. S7). Only Cytb561a, Fpn, and Heph were visible in western blots of thecates (Fig. S13A), so we focused on those proteins to visualize by immunofluorescence. In thecates, Cytb561a localized intracellularly toward the apical half of the cell but not in any pattern resembling organelles53. Cytb561a was undetectable in slow swimmers (Fig. S13B), in contrast to thecates that stably expressed Cytb561a independent of iron availability (Fig. S14), both of which are consistent with mRNA expression levels (Fig. 2C). In both slow swimmers and thecates, Fpn and Heph localized near the apical membrane inside of the collar (Fig. S15). The similar co-localization of Fpn and Heph would be consistent with their coordinated function as an efflux pump and oxidase as described in animals53. Importantly, the basal localization of these proteins appears to be a feature of animal epithelia that is not shared with choanoflagellates and may have evolved alongside the mechanisms to establish and maintain cell polarity59.
Based on the functions of homologous genes and the localization of Cytb561a, Fpn, and Heph, we propose a model for iron acquisition in S. rosetta thecates that may extend to other choanoflagellates (Fig. 3D). After internalizing ferric colloids by phagocytosis, Cytb561a reduces ferric cations in endosomes where we surmise that Dmt1 transports ferrous cations into cells – although Dmt1 was undetectable by immunohistochemistry (Fig. S14). As slow swimmers do not express Cytb561a, S. rosetta may acquire iron directly from bacterial prey digested in food vacuoles or rely on Cytb561b or Cytb561c to reduce ferric cations. In both slow swimmers and thecates, we hypothesize Fpn exports ferrous cations at the apical surface where Heph can oxidize it.
Cytb561 paralogs display distinct biochemical properties and global distributions.
The variation in the number of cytochrome b561 paralogs across eukaryotes led us to examine the evolution and function of the three cytochrome b561 paralogs that we identified in S. rosetta. While the expression of cytb561a in thecates was clearly necessary for improved growth in the presence of colloidal iron, slow swimmers still grew in those conditions as did cytb561aPTS thecates (Fig. 2D). These observations and the steady mRNA expression of cytb561b and cytb561c across cell types (Fig. 3C) would be consistent with these paralogs sustaining a baseline ability to acquire iron, even if we could not detect their protein products by immunohistochemistry (Fig. S13). The potential for functional redundancy among these paralogs also led us to consider if choanoflagellates and other eukaryotes display lineage-specific expansions of this gene family that may have resulted in differing capacities to acquire iron.
To better compare the cytochrome b561 paralogs in S. rosetta, we first built a phylogenetic tree60 from an alignment61 of Cytb561 proteins from Amorphea (Fig. 4A), a eukaryotic supergroup that includes Holozoa and Amoebae48,62. The unrooted tree of cytochrome b561 proteins robustly supported (ultrafast bootstrap63 > 88) three distinct groups. Groups B and C contained protein sequences from each major clade in Amorphea and its sister group CRuMs (Collodictyonida, Rigifilida and Mantamonadida). Strikingly, sequences from Holozoa dominated Group A, as a single protein from Amoebozoa was the only sequence from outside of Holozoa that fell into this group. Notably, Group A not only contained mammalian DCYTB proteins but also Cytb561a from S. rosetta, supporting the annotation of this gene as an DCYTB ortholog. The Cytb561b and Cytb561c proteins from S. rosetta fell into Groups B and C, respectively.
Because each of the S. rosetta paralogs was part of a different group of cytochrome b561 proteins, we examined64 multiple sequence alignments (Figs. S16,17,19) and their predicted structures65 (Fig. S18) to evaluate if the deep evolutionary divergence was reflected in their biochemical characteristics (Fig. 4B), using the crystal structure of human DCYTB as a reference66. DCYTB forms a homodimeric complex with each monomer folding into six transmembrane helices that sandwich two heme groups in the interior of the protein. These heme groups mediate the transfer of an electron from an ascorbate molecule on the cytosolic side of the protein to ferric cations on the lumenal side. Thus, in our comparisons of multiple sequence alignments, we focused on the residues in DCYTB that bind ascorbate, mediate homodimerization, and attract iron. First, all homologs appear to have conserved positively charged residues that form the pocket to bind negatively charged ascorbate66 (Fig. S16). Second, only proteins from Group A conserve residues that mediate dimerization (Fig. S17), which was also reflected in predicted dimeric structures of paralogs from S. rosetta (Fig. S18). Third, among the Cytochrome b561 groups, we noticed significant differences on the lumenal surface that forms the binding site for ferric cations (Figs. 4B, arrow and S19). Homologs within Group A, which contains Cytb561a and DCYTB, possess a mix of positive and negative charges, indicating Group A paralogs can bind iron and coordinated ligands66. Group C homologs have largely anionic surface charges at the putative iron binding site, which could plausibly bind iron as well. However, the positively-charged binding pockets conserved across Group B homologs may repel rather than bind iron cations. Notably, the differences in charge distribution among Cytochrome b561 groups appear to be general features of those groups (Fig. S19C), yet electrostatic properties can diverge within those lineages (Fig. S19D). With each paralog from S. rosetta possessing distinct biochemical properties near the iron binding site, we hypothesize that the improvement of thecate growth with ferric colloids may not only be due to the mass action from more Cytb561 proteins but also through the distinct biochemical activity of Cytb561a.
If the biochemical activity and cell-type-specific expression of Cytb561 paralogs enable choanoflagellates to acquire iron from different environmental sources, then we expect that the expression of each paralog may be enriched in different ecological settings. To test this hypothesis, we searched for each paralog in an open repository (Ocean Gene Atlas8,9) of metagenomes and metatranscriptomes from marine environments. When we visualized the distribution of each paralog, we found that they varied in both location and transcript abundance but were similarly present in metagenomic reads (Fig. 4C) The most abundant and cosmopolitan paralog was cytb561c. Although present at most of the sites, the transcript abundance of cytb561b was ~10-fold lower than cytb561c. The abundance of cytb561a had a similarly low abundance as cytb561b, yet its distinct presence near coastlines and the equator resembled the locations of marine upwellings67 (Fig. S20A).
Importantly, upwellings are a major source of remineralized iron colloids rising from ocean depths68,69, and upwelling velocities may serve as a better proxy for the presence of colloidal iron than predictions of soluble iron concentrations10. Therefore, we correlated the abundance of the genomic and transcript reads for each paralog to average vertical velocities at a depth of 100 meters for each sample location (Fig. 4C, S20B). Of all the comparisons, the only robust correlation was between cytb561a transcript abundance and vertical velocity (r = 0.64, P < 10−4), as the transcript abundances of the other paralogs exhibited only weak positive correlations that were probably random (P > 0.3). The positive correlation between cytb561a transcripts and upwelling velocities indicate ecological contexts where choanoflagellates may be acquiring insoluble iron colloids for their growth. We cannot, however, determine if the choanoflagellates in those regions are thecates expressing cytb561a as in S. rosetta, or choanoflagellates with a different regulatory mechanism that evolved uncoupled cytb561a expression from thecate development. Nonetheless, these observations connect cytb561a expression to an environmental feature and places choanoflagellate biology in a global ecological framework.
Iron is an essential micronutrient affecting the growth and productivity of phytoplankton70. Field measurements and ocean scale models have found that iron particulates are the major source of iron for 40% of the world’s surface ocean waters11. In addition to photolytic reduction14,71 and bacterial siderophores72,73, the ingestion of iron particles by phagotrophic protists is a major mechanism to solubilize iron and can alleviate iron limitation for cell-walled organisms that rely on obtaining iron in soluble forms13–15. As efficient phagotrophs74,75,4,76, choanoflagellates ingest iron particulates (Figs. 2E and S9), yet iron assimilation from ferric colloids depends on the expression of a specific Cytb561 paralog in only one cell type of S. rosetta. This dependency emphasizes the importance of integratively factoring protein evolution77 and gene expression during life history transitions1,2 in models for determining78 the ecological roles of microeukaryotes in the acquisition and distribution of limiting nutrients throughout their surrounding environment. Moreover, the synthesis of cytological, genomic, and ecological data may help to interpret how the rise in oxygen that resulted in the depletion of soluble ferrous cations from the oceans influenced eukaryotic evolution77,79,80, including the emergence of animals81.
The cell-type-specific expression of cytb561a not only illustrates how cellular differentiation impacts the environmental function of a choanoflagellate but also informs how cellular differentiation evolved as a defining feature of animal multicellularity16,19,17,82,20,21. Choanoflagellate cell types have been primarily defined by their morphology and motility4,6. In addition to those cell biological characteristics, we show here that thecates have a distinct ability to utilize ferric colloids. The coupling of this function to cell differentiation stands in contrast to the regulation of iron utilization in other microeukaryotes that transiently regulate iron utilization genes in response to external iron concentrations56,83,84. The functional difference between slow swimmers and thecates more closely resembles the stable differentiation of cell types during animal development85–87. Taken together, these observations support models for the origin of animal cell differentiation in which the last common holozoan ancestor differentiated into unique cell types at each life history stage88,16,17,20,22. Evolution along the animal stem lineage coupled cell differentiation to multicellular development, resulting in multicellular bodies composed of functionally distinct cells89. Importantly, mechanisms to distribute resources from cells that acquire nutrients from the external environment (e.g. epithelial cells90) to cells that reside internally helped to transform the functions of various cell types into a common organismal physiology18. As the evolution of transferrin and its receptor illuminates (Figs. 3A and S12), the multicellular physiology of animals has likely been cobbled together through the co-option of mechanisms for nutrient acquisition and cellular trafficking.
Materials and Methods:
Culturing S. rosetta
To grow cell types of S. rosetta for the generation of transcriptomes, slow swimmer and rosettes were grown in 5% Sea Water Complete (SWC) Media23,91 (Table S1), seeded at 104 cells/ml and grown for 48 hours at 22°C. Rosettes were induced with outer membrane vesicles (OMVs) harvested from the bacterium Algoriphagus machipongonensis26. Fast swimmers were prepared the same way as slow swimmers, but were grown for 72 hours at 22°C and transitioned to 30°C for 2 hours and 45 minutes. Thecates were grown from the strain HD1, a strain which is constitutively in the thecate cell type, in 10% Cereal Grass Media 3 (CGM3)23,91 (Table S1), seeded at 104 cells/ml and grown for 48 hours at 22°C.
For iron acquisition experiments and standard culturing of S. rosetta, all cell types were grown in 25% Red Algae (RA) Media92 (Table S1) with the feeder bacteria Echinicola pacifica. Cultures were grown at 18°C for maintaining cultures and shifted to 27°C one day prior to conducting iron acquisition experiments.
RNA extraction
Protocol Link:
dx.doi.org/10.17504/protocols.io.261ge4bojv47/v1
In a previous paper91, we developed a lysis buffer to extract RNA from S. rosetta that has subsequently been used in other work for RNA30,93,94 and protein95 extraction. Here we show that this buffer was developed to preferentially lyse S. rosetta in cultures that have an abundance of the prey bacterium E. pacifica. To preferentially lyse S. rosetta, we reasoned that sterol-based detergents, such as digitonin, would more effectively disrupt membranes with sterols, like those of S. rosetta and other eukaryotes96 rather than bacterial membranes that largely lack the sterols97. Additionally, the lysis buffer contains RNase inhibitors (RNaseIN and heparin) and translation inhibitors to decrease the degradation of mRNA. We evaluated the efficacy of this buffer by tracking ribosomal RNA from S. rosetta and E. pacifica in lysed samples that were centrifuged to separate the supernatant and pellet, in which we found that S. rosetta ribosomal RNAs were enriched in the supernatant and bacterial ribosomal RNAs were enriched in the pellet (Fig. S1). The optimization of this procedure resulted in the following method to obtain RNA samples from each cell type of S. rosetta:
After counting the cell concentration in a culture of S. rosetta feeding on E. pacifica, a volume yielding 107 cells was centrifuged at 2600×g for 5 min at 4°C. The pellet from this sample was resuspended in 100 μl of preferential lysis buffer (20 mM Tris-HCl, pH 8.0; 150 mM KCl; 5 mM MgCl2; 250 mM Sucrose; 1 mM DTT; 10 mM Digitonin; 1 mg/ml Sodium Heparin; 1 mM Pefabloc SC; 100 μg/ml Cycloheximide; 1 tablet/ 5 ml buffer EDTA-free Protease Inhibitor Tablet [Sigma-Aldrich Cat. No. 11836170001] ; 0.5 U/μl Turbo DNase [Thermo Fisher Scientific, Cat. No. AM2239]; 1 U/μl SUPERaseIN [Thermo Fisher Scientific, Cat. No. AM2696]) and incubated on ice for 10 min. Cells were triturated through a 30G needle ten times. Afterwards, the insoluble debris was pelleted by centrifugation at 6000×g for 10 min at 4°C. The supernatant was removed and adjusted to a volume of 100 μl with RNase-free water. Total RNA was purified on a silica membrane with the RNA clean-up protocol from the RNAeasy kit (Qiagen, Cat. No. 74104).
RNA library preparation and sequencing
In preparation for sequencing, the quality of total RNA from S. rosetta samples was assessed by Agilent Bioanalyzer 2100 (Fig. S1C) with the Agilent RNA 6000 Nano Kit (Agilent, Cat. No. 5067–1511). Then total RNA (500 ng per sample) was purified by one round of polyA mRNA selection with oligo-dT magnetic beads (NEB, Cat. No. S1550S), converted to cDNA using KAPA mRNA HyperPrep kit (KAPA biosystems, Cat. No. KK8580), and indexed-adapter ligated with KAPA single-indexed adapter kit (KAPA biosystems, Cat. No. KK8701). Final library quality was determined by Bioanalyzer 2100 using the Agilent High Sensitivity DNA Kit (Agilent, Cat. No. 5067–4626) and pooled together after normalizing samples based on their quantity from qPCR. Sequencing was carried out by the QB3-Berkeley Genomics core (QB3 Genomics, UC Berkeley, Berkeley, CA, RRID:SCR_022170), on the Illumina HiSeq 4000. All samples were pooled and run on a single lane with 12.4 to 61.3 million reads per sample. Reads were demultiplexed, and checked for quality by fastqc (Babraham Bioinformatics). Metadata for sequencing can be found in Table S2.
RNA-seq quantification and gene enrichment analysis
RNA-seq reads from each sample were mapped to a reference S. rosetta transcriptome (https://ftp.ensemblgenomes.ebi.ac.uk/pub/protists/release-58/fasta/protists_choanoflagellida1_collection/salpingoeca_rosetta_gca_000188695/cdna/) using kallisto98. Transcript abundance and differential expression were quantified with sleuth99. Gene enrichment for thecates was quantified by selecting genes with q < 0.01 and log2 (fold-change + 0.5) > 1. The resulting genes were assigned GO terms using DAVID Bioinformatics37,38 where S. rosetta gene IDs could be identified by “ENSEMBLE_GENE_ID”. For ease of interpretation, assigned GO terms were collapsed down using REVIGO100 and further assigned to functional categories in S. rosetta (TaxID: 946362) by GO Slim39 (Table S3).
S. rosetta growth with different sources of iron
Media Preparation.
To test the ability of S. rosetta cell types to differentially utilize iron sources, we developed a method to culture S. rosetta under iron limitation. We created an iron depleted media formulation named 4% Peptone Glycerol (PG) Media (Table S1). We added iron to this media from two different sources: Ferric EDTA was prepared by dissolving FeCl3 and EDTA to a final concentration of 1.46 mM in artificial seawater (ASW [Ricca Chemical Company, Cat. No. R8363000–20F]) and sterile filtering through a 0.22 μm polyethersulfone (PES) filter. Ferric colloids were prepared by adding a small volume of 70% (v/v) ethanol to FeCl3(s) to sterilize the solid; after evaporating the liquid in a biological safety cabinet, 1.46 mM FeCl3 was prepared in ASW. The solution was heated to 50°C for 10 minutes to precipitate ferric (oxyhydr)oxides, or ferric colloids45.
Fluorescent ferric colloid preparation.
Ferric colloids were prepared as above, except during the 50°C incubation, a fluorescent dextran (10 kD anionic dextran conjugated to tetramethylrhodamin [Thermo Fisher, Cat. No. D1868]) was added to a final concentration of 20 μg/ml. This dextran co-precipitated with the ferric colloids, thereby embedding fluorophores within the colloids. The precipitated ferric colloids were pelleted by centrifugation at 4600 × g for 10 min at room temperature. The pellet was resuspended in ASW supplemented with 2% (v/v) dextranase (Sigma aldrich, Cat. No. D0443) to digest any excess dextran during a 10 min incubation at 60°C. After pelleting the ferric colloids again, they were resuspended to ~1.46 mM in ASW. Finally, the resuspended colloids were ultrasonicated for two, 1 min cycles (pulsed for 1 sec on, 1 sec off) to create smaller particles for easier feeding and imaging, as larger particles broke down slowly and often obscured cells during imaging.
Determining iron concentration in media.
To determine the background concentration of soluble iron in our newly formulated media and the amount of labile iron liberated from the ferric colloids, we used a modified protocol13 to measured soluble iron concentrations in 4% PG and 4% PG with 100 μM ferric colloids. To measure these iron concentrations, 50 ml of 4% PG media was incubated at 27°C for 6 hours with and without 100 μM ferric colloids. The media was then centrifuged at 4600×g for 15 minutes at room temperature. The top 45 ml of supernatant was taken for the following steps. To capture and concentrate the soluble iron in the supernatant, 125 μM 8-Hydroxyquinoline (oxine) was added to chelate labile iron, and 5 ml of 1M MES, pH 6 was added to aid in oxine’s solubility in the media and its binding to iron, which is facilitated by lower pH. These solutions were incubated for 24 hours protected from light, and then filtered through a Sep-Pak C18 column (Waters, Cat. No. WAT036800) to bind oxine-iron complexes. The column was then washed with 5 ml water and eluted with 1.4 ml methanol. To measure iron concentrations in the eluate, 50 μl of eluate was measured with an colorimetric iron detection kit (Sigma-Aldrich, Cat. No. MAK025–1KT) following the provided protocol. Iron concentration was determined by measuring absorbance at 595 nm on a plate reader.
Iron-limited
E. pacifica pellets were prepared by growing E. pacifica in 4% PG for 72 hours at 225 rpm and 30°C, then pelleted, aliquoted, and frozen into 10 mg pellets, and stored at −80°C. E. pacifica pellets were resuspended in 1ml of media to achieve a 10mg/ml liquid bacteria stock to add to cultures.
Culturing with different iron sources.
To set up the iron utilization assay, cultures grown in 25% RA were passaged at a 1:60 dilution into 25% RA and grown for 72 hours at 18°C. This culture was then passaged at a 1:60 dilution into 25% RA and grown for 24 hours at 27°C. Finally, this culture was passaged at a 1:30 dilution into 4% PG (which had no iron added) and grown at 27°C for 24 hours. Cells from this final passage were twice washed by first centrifuging the culture at 2600×g for 4 minutes and then resuspending the pellet in 30 ml sterile ASW. After the final wash, the pellet was resuspended in 100 μl ASW and counted. These washed cells were used to seed cultures at 104 cells/ml in 4% PG with the additions of 100 μM iron (ferric EDTA or ferric colloids) and 50 μg/ml of iron-limited E. pacifica pellets (see above). The resulting culture was grown at 27°C for 48 hours and harvested for each experiment.
qPCR
Protocol Links:
dx.doi.org/10.17504/protocols.io.ewov1nnzogr2/v1
dx.doi.org/10.17504/protocols.io.n2bvjnjongk5/v1
Primer design.
Primers were designed against cytb561a, the experimental gene of interest and cofilin, a highly expressed eukaryotic specific housekeeping gene with a consistent expression between S. rosetta cell types, as shown by the RNAseq analysis. Primers for quantitative polymerase chain reaction (qPCR) were designed by selecting 19–20 bp oligos spanning exon-exon junctions from cDNA sequences, (downloaded from: https://protists.ensembl.org/Salpingoeca_rosetta_gca_000188695/Info/Index). Primer sequences as follows: cytb561a forward primer 5’-CATGGAAGAAGCGCATGGTG-3’, reverse primer 5’-GGGTCCGCAAGATCATTGAG-3’, cofilin forward primer 5’-CAAGCTCCCCACGGACAAG-3’, reverse primer 5’-GGGTCCATGCGAAGAAGAC-3’. In the case of cytb561a, primers were selected downstream of the premature termination sequence edited in by CRISPR/Cas9.
Primer validation.
Target amplicons were amplified by polymerase chain reaction (PCR) from the cDNA of slow swimmer and thecate cultures (Fig. S4A) with the Luna Universal qPCR Master Mix (NEB, Cat, No. M3003). The amplicons were run on a 1% (w/v) agarose gel in Tris-Borate-EDTA (TBE) buffer to verify that the primer sets amplified only one amplicon. The amplification efficiency of primer sets was characterized across a serial dilution of ssDNA standards for each target (Fig. S4B–C). ssDNA standards were generated from PCR products amplified with a forward primer that was 5’ phosphorylated to promote Lambda exonuclease digestion of that strand (NEB, Cat. No. M0262S) and a reverse primer with phosphorothioate bonds between the first four 5’ nucleotides to block digestion. Those PCR products were then digested with Lambda exonuclease following NEB’s protocol. ssDNA concentrations were determined by Qubit ssDNA Assay Kit (Thermo Fisher, Cat. No. Q10212) by creating a standard curve with the provided standards and then reading fluorescence (Ex = 500 nm / Em=540 nm) on a plate reader (Molecular Devices, SpectraMax iD5). The ssDNA standards were serially diluted from 105 to 100 copies/μl in a solution with 10 ng/μl of S. rosetta RNA (Total RNA after removing mRNAs) to account for any matrix effects.
cDNA preparation.
Samples from slow swimmer or thecate cultures were pelleted and lysed using the preferential lysis protocol, and total RNA was purified by RNeasy MinElute Cleanup Kit (Qiagen, Cat. No. 74204). cDNA was prepared by SuperScript IV Reverse Transcriptase (Thermo FIsher, Cat. No. 18090010) following the provided protocol using oligo d(T)20 for synthesis. However, the incubation temperature for cDNA synthesis was increased to 60°C to account for the high GC content of the S. rosetta genome.
qPCR.
3 μl of cDNA samples or ssDNA standards were added to a 20μl reaction with Luna Universal qPCR Master Mix (NEB, Cat, No. M3003) according to the provided protocol. The samples were amplified in a single run on the QuantStudio 3 Real-Time PCR System (Thermo Fisher, Cat. No. A28567).
Genome editing
Protocol Link:
dx.doi.org/10.17504/protocols.io.j8nlk86o5l5r/v1
Culturing cells for transfection.
Mutant strains were generated with a modified protocol from a previously published method46. Cultures of S. rosetta slow swimmers were maintained in 15% RA/2% PYG (called 15/2, Table S1) at 22°C, and 48 hours prior to transfection, cultures were seeded at 104 cells/ml in 80 ml of 15/2 Media in 300 cm2 vented tissue culture flasks (VWR, Cat. No. 10062–884) and grown at 22°C.
Cas9 RNP preparation.
On the day of transfection, Cas9 bound to a guide RNA (Cas9 RNP) was prepared by combining 2 μl of 100 μM guide RNA (gRNA) with 2 μl of 20 μM EnGen SpyCas9 NLS (NEB, Cat. No. M0646T), and incubated at room temperature for 2–4 hours. (Note: Our gRNA was ordered as synthetic RNA oligonucleotides from Integrated DNA Technologies that came as an Alt-R CRISPR-Cas9 crRNA and an Alt-R CRISPR-Cas9 tracrRNA. These two oligonucleotides were annealed together to form gRNA). Guide sequences can be found in table S4.
Transfection.
During the Cas9 RNP incubation, cells were harvested for transfection by centrifuging the cultured cells at 2400×g for 3 minutes at 4°C. The supernatant was discarded, and cells were resuspended in 50 ml of Cell Wash Buffer (420 mM NaCl; 50 mM MgCl2; 30 mM Na2SO4; 10 mM KCl; titrated to pH 8.0 with ~2.4 mM NaHCO3). The cells were centrifuged again at 2400×g and 4°C for 3 minutes. The supernatant was removed, and the pellet was resuspended in 100 μl of Cell Wash Buffer. Cells were counted and diluted to 5×107 cells/ml, split into 100 μl aliquots, and centrifuged for 2 min at 800×g and room temperature. The supernatant was removed and replaced with 100 μl priming buffer (40 mM HEPES-KOH, pH 7.5; 34 mM Lithium Citrate; 15% (w/v) PEG 8000; 50 mM Cysteine; 1.5 μM Papain). Cells were primed for 45 minutes at room temperature. Afterwards, the priming was quenched with 10 μl of 50 mg/ml bovine serum albumin. The primed cells were centrifuged at 1200×g for 4 min at room temperature. After removing the supernatant, the cells were resuspended in 25 μl ice cold SF buffer (Lonza Cat. No. V4SC-2096). Nucelofection reactions were prepared by combining 16 μl ice cold SF buffer, 4 μl Cas9 RNP, 2 μl of repair oligo, and 2 μl of the resuspended and primed cells. This reaction mix was loaded into a 96-well nucleofection plate (Lonza Cat. No. AAF-1003S, AAF-1003B), and pulsed with CU 154 (Fig. S4). 100 μl of ice cold recovery buffer (10 mM HEPES-KOH, pH 7.5; 900 mM Sorbitol; and 8% (w/v) PEG 8000) was added immediately to each pulsed well and incubated for 5 minutes at room temperature. Afterwards the entire content from each nucleofection well was transferred into 2 ml of 15/2 Media with the addition of 50 μg/ml E. pacifica pellet.
Strain isolation and Screening
Protocol Link:
dx.doi.org/10.17504/protocols.io.14egn6n2pl5d/v1
We adapted the Cas12a DETECTR genotyping assay101 to screen for cells with desired mutations from Cas9 genome editing. Following overnight recovery post-nucleofection, cells were counted and diluted to 45 cells/ml in 10% RA (Table S1) with 50 μg/ml E. pacifica pellet and aliquoted into 96-well plates. Plates were then grown at 27°C for 72 hours to propagate cells. Afterwards 12 μl from each well was added into 36 μl DNAzol Direct (20 mM potassium hydroxide; 60% (w/v) PEG 200, pH 13.3–13.7– Note: It is important to test a range of pH values to establish the optimal pH for your own use)46 that was pre-aliquoted into 96-well PCR plates, which we called the lysed sample. The lysed samples were then incubated at 80°C for 10 minutes. The target DNA was amplified (primer sequences in Table S4) in PCR reactions prepared with GoTaq Clear Master Mix (Promega, Cat. No. M7123) using 2 μl of the lysed samples per 25 μl PCR reaction. Cycling conditions followed Promega guidelines and melt temperatures for oligos were calculated using (https://www.promega.com/resources/tools/biomath/tm-calculator/). During the PCR, a Cas12a mastermix was assembled in two parts: First, Cas12a RNP: (13 μl water; 2 μl r2.1 buffer [NEB, Cat. No. B6002S]; 2.5 μl 100 μM gRNA [ordered as a synthesized oligo]; 2 μl 100 μM LbaCas12a [NEB, Cat. No. M0653T]) was incubated for 5 minutes at room temperature. Second, the Cas12a RNP was combined with the rest of the components (for one 96-well plate: 486 μl water; 60 μl r2.1 buffer; 18 μl Cas12a RNP; 36 μl 5 μM ssDNA probe [IDT, Cat. No. 11–04-02–04]) and then incubated for 5 minutes at room temperature. 5 μl of the mastermix was added to each 25 μl PCR and incubated at 37°C for 1 hour. The fluorescent signal was then measured in the VIC channel on a QuantStudio 3 Real-Time PCR System. Wells with high signal (≥10-fold above background) were recovered in 25% RA overnight at 27°C, counted, diluted to 3 cells/ml in 10% RA with 50 μg/ml E. pacifica pellet, and plated in a 96-well plate with 100 μl/well, which corresponds to 0.3 cells/well. Plates were grown at 27°C for 72–96 hours. Finally, these clonal isolates were screened by digesting amplicons with a restriction enzyme (RE). 12 μl per isolate was added to 36 μl DNAzol, incubated at 80°C to make a lysed sample. 2 μl of the lysed samples were added to 25 μl PCR reactions with GoTaq Green mastermix (Promega, Cat. No. M7123). After amplification, 1 μl of RE was added to each 25 μl PCR reaction, and incubated following manufacturers’ protocols. RE digestions were run on 1% agarose gels with wild-type or undigested controls. For positive RE digestion hits, PCR products were Sanger sequenced to confirm the sequence of the edited site (Table S5).
Growth curves
S. rosetta.
We monitored cell density at times points over 48 hours to assess the population dynamics of wild type and cytb561PTS thecates in iron utilization assay conditions. Cells were plated in 24-well plates with 100 μM ferric EDTA or ferric colloids sources in 4% PG or in nutrient replete media (25% RA) as a control. At each time point, the entire contents of one well was harvested by scraping the bottom and then transferring the liquid to a microfuge tube. Cells were fixed with 5 μl of 37.5% paraformaldehyde (PFA) and then counted on a Reichert Bright-Line Metallized Hemacytometer (Hausser Scientific, Cat. No. 1483). Growth curves were fit to a logistic growth equation that explicitly models the lag time with a heaviside step function28.
E. pacifica growth curves were set up in 4% PG media inoculated with 50 μg/ml iron-limited E. pacifica pellet and a ferric EDTA titration in clear bottom 96-well plates, and grown for 48 hours at 27°C. Growth was assessed by monitoring the optical density at 600 nm on a plate reader with orbital shaking before every reading. Ferric colloids were not included in the growth curves as iron particulates would shift optical density readings. To assess E. pacifica growth with ferric colloids, 4% PG media was inoculated with 50 μg/ml iron-limited E. pacifica pellet and 100 μM ferric EDTA or ferric colloids in 6-well plates and grown for 48 hours at 27°C. Wells were scraped and 1 ml aliquots were centrifuged at 500×g for 2 minutes to settle iron particulates. The top 500 μl of supernatant was gathered for measuring the optical density at 600 nm.
Immunofluorescent Staining
Protocol Link:
dx.doi.org/10.17504/protocols.io.5jyl85k38l2w/v1
Cultures of wild-type and ALFA-tagged strains were maintained in 25% RA at 18°C for thecates and 22°C for slow swimmers, as higher temperatures helped to maintain slow swimmers. 24 hours prior to imaging, cultures were seeded at 104 cells/ml in 6 ml of 25% RA in 25cm2 vented tissue culture flasks (VWR Cat. No. 10062–874) and grown at 27°C. On the day of imaging, prior to preparing cells, 50 μl of 10 mg/ml Poly D-lysine hydrobromide (MP Biomedicals, Cat. No. 102694) was added to each well of a glass bottom 96-well plate (Thermo Scientific, Cat. No. 165305) and incubated for ≥15 min at room temperature. To prepare cells, the supernatant of thecate cultures was decanted, and replaced with 6 ml filtered ASW, then scraped with a cell scraper, and centrifuged at 500×g for 5 minutes at room temperature. For swimmer cultures, the cells were centrifuged 500×g for 5 minutes at room temperature. The supernatant removed leaving 0.5 ml remaining, and resuspended in 5 ml filtered ASW and centrifuged 500×g for 5 min at room temperature. Both thecate and swimmer cultures follow the same protocol henceforth. The supernatant was removed, leaving 0.5 ml, and the tubes gently swirled to resuspend cells. The poly D-lysine coated wells were washed 3 times with 50 μl ASW, and then 50–100 μl of cells were added to each well using wide-bore pipette tips. Cells settled on the well surface for 15 minutes. All subsequent steps use gel loading tips. After settlement, the supernatant was removed leaving 25 μl behind to avoid drying and damaging delicate cell structures. 70 μl of Fix buffer (10 mM MES-KOH, pH 6.1; 138 mM KCl; 3 mM MgCl2; 2 mM EGTA; 3% (v/v) PFA; 15% (w/v) sucrose) was slowly added to the top of each well and incubated for 5 min at room temperature. 70 μl of supernatant was removed from the opposite side from which solution was added. This will be referred to as a “wash”. Next, wells were washed with Fix/tween buffer (Fix buffer with the addition of 0.07% (v/v) Tween-20) and incubated for 5 min at room temperature. Wells were then washed with a quench buffer (10 mM MES-KOH, pH 6.1; 138 mM KCl; 3 mM MgCl2; 2 mM EGTA; 15% (w/v) sucrose; 0.3M glycine) and washed immediately with Permeabilization-MeOH buffer (8% (v/v) MeOH; 1% (v/v) Tween-20; in LICOR Intercept Blocking Buffer [LI-COR Biosciences Cat. No. 927–70001]) and incubated for 15 minutes. (Note: LICOR Intercept Blocking buffer increased detection of AFLA epitopes during testing.) After, wells were washed with Permeabilization buffer (1% (v/v) Tween-20; LICOR Intercept Blocking Buffer) and then washed twice in antibody mix (1:200 FluoTag-X2 anti-ALFA Alexa647 [Nanotag, Cat No. N1502-AF647-L]; 1:200 phalloidin Alexa546 [Invitrogen, Cat. No. A22283]; 1:500 alpha Tubulin Monoclonal Antibody, DM1A [Invitrogen, Cat. No. 62204]; 1:500 alpaca anti-mouse IgG1, recombinant VHH, Alexa488 [Chromotek, Cat. No. sms1AF488–1]) diluted in the Permeabilization buffer and incubated for 1 hour protected from light. Afterwards, wells were washed twice with PEM (100 mM PIPES-KOH, pH 6.9; 1 mM EGTA; 1 mM MgSO4) and immediately imaged.
Microscopy
Time-lapse of ferric colloid ingestion.
100 μl of 1.46 μM fluorescent ferric colloids were added to 2 ml of wild-type thecate cultures, previously grown in glass bottomed dishes (World Precision Instruments, Cat. No. FD35–100) for 24 hours at 22°C. Samples were imaged by widefield microscopy. The microscope was a Nikon Eclipse Ti2-E inverted microscope outfitted with a D-LEDI light source, Chroma 89401 Quad Filter Cube, and 60x CFI Plan Apo VC NA1.2 water immersion objective. Time-lapse images were acquired every 4 sec over 2 hours with 50 msec exposure time for brightfield and fluorescent (TRITC channel) images. In three independent experiments, a total of 4 feeding events were found in a field of 200 cells, although this was not an exhaustive search for ferric colloid intake frequency. One cell that was oriented on its side to best visualize feeding was selected for further processing. Images were processed in FIJI by cropping individual cells in 100×100 pixel bounds, and image contrast was manually adjusted. Ferric colloid particles were automatically tracked, first by setting a threshold to produce a binarized image (image > adjust > manual threshold) and then by finding particles (analyze > analyze particles > size = 10–300 pixels, circularity = 0.2–1.0).
Immunofluorescence Imaging.
Samples were imaged by widefield microscopy. The microscope was a Nikon Eclipse Ti2-E inverted microscope outfitted with a D-LEDI light source, Chroma 89401 Quad Filter Cube, and 60x CFI Plan Apo VC NA1.2 water immersion objective. Single focal plane and Z-stacks of samples were imaged with two-fold binning (120 μm/pixel) on a Nikon Digital Sight 50 M Camera with 100–400 msec exposure times for each channel and variable illumination intensities to extend the dynamic range of signal without photobleaching samples. Single focal-plane images were processed within the Nikon imaging software. Images were background subtracted (120 pixel, rolling ball radius), and deconvolved using the Landweber deconvolution algorithim with 12–16 iterations (autostopping engaged), and automatic background subtraction. Deconvolved images were cropped in 150×150 pixel bounds in FIJI and merged images were combined using maximum intensity composite. Supplementary images were processed in FIJI by cropping individual cells in 150×200 pixel bounds, projecting the maximal intensity, subtracting the background (75 pixel, sliding paraboloid radius), enhancing the contrast (saturation 0.02% of the pixels), and applying a minimum filter of 0.5 pixels. Merged images were combined using maximum intensity composite.
Western blots
Protocol Link:
dx.doi.org/10.17504/protocols.io.3byl4987zgo5/v1
Cultures were grown in 25% RA or 4% PG (without supplemented iron) for 24 hours at 27°C to test whether external iron conditions (replete or deplete) resulted in any post-translational regulation of protein expression. Cultures were scraped and then lysed according to the preferential lysis protocol, with the following modifications: TurboDNase and SUPERasein were replaced with 15 μl of Pierce universal nuclease (Thermo Scientific Cat. No. 88702) per 1 ml buffer, and heparin and cycloheximide were removed. After clearing the lysate, Tween-20 was added to a final concentration of 1% (v/v), and the sample was incubated for 10 minutes at room temperature to aid in liberating membrane-bound proteins. Protein concentration was determined by Bradford assay (Thermo Fisher Scientific Cat. No. 23236) and samples were normalized to the lowest concentration. To prepare for SDS-PAGE, approximately 20 μg or 40 μg of total protein for thecates or slow swimmers, respectively, was denatured with 4x Laemmli SDS loading buffer and incubated for 10 minutes at room temperature (Note: 80°C caused membrane proteins to crash out, so room temperature incubation was essential). Samples were loaded into a Novex wedgewell 10% Tris-Glycine Mini Gel (Thermo Fisher Scientific Cat. No. XP00105BOX) along with a 1:10 diluted PageRuler Plus Prestained Protein Ladder (Thermo Fisher Scientific Cat. No. 26619). Gels were run at 225 V for 30 minutes, and afterwards the gel was transferred to a 0.45 μM nitrocellulose membrane (Bio-rad Cat. No. 1620115) at 50 V for 30 minutes, then 100 V for 60 minutes in transfer buffer (25 mM tris base; 192mM glycine; 10% methanol) at 4°C. Afterwards the membrane was stained with total protein stain (LI-COR Biosciences Cat. No. 926–11010) and then imaged in the 700 nm channel with a 30 second exposure on a LI-COR Odysey Fc. Afterwards, the membrane was blocked in LICOR Intercept Blocking Buffer (LI-COR Biosciences Cat. No. 927–70001) for 1 hour at 37°C and then overnight at 4°C. The following day the membrane was immunostained with FluoTag-X2 anti-ALFA LI-COR IRDye 800CW (NanoTag Biotechnologies Cat. No. N1502-Li800-L) diluted 1:10,000 in LI-COR PBS block plus 0.2% Tween-20 for 2 hours at room temperature. (Note: Both the LI-COR PBS block and anti-ALFA IRDye 800CW were essential for visualization, other blocks and fluorophores lead to significantly lower signal.) The membrane was washed 3x in PBST (PBS with 0.2% Tween-20) and then imaged in the 800 nm channel with a 10 minute exposure. This western protocol was adapted from a previous study95.
Protein database searches and tree construction
We compiled a list of iron acquisition proteins (Table S6) from these model organisms: Homo sapiens44, Saccharomyces cerevisiae47, Chlamydomonas reinhardtii43, Arabadopsis thaliana102, and Phaeodactylum tricornutum41. For each protein, we retrieved the corresponding Pfam hidden markov model (HMM) from Interpro (https://www.ebi.ac.uk/interpro/)103. Using HMMER v3.4 (hmmer.org104), we searched proteomes from EukProt v3105, focusing on proteomes from “The Comparative Set” (TCS), which consisted of 196 species of phylogenetic relevance with high BUSCO completion. We supplemented the TCS with the transcriptomes of 18 choanoflagellate species54 and other Holozoa and Fungi. To count the number of unique proteins that conformed to the HMM profile from ‘hmmsearch’, we aligned hits with ‘hmmalign’ and retained only one sequence from hits with >99% identity (Note: this was important for removing multiple isoforms and reference sequences from highly annotated genomes). We additionally filtered hits based on length with ‘esl-alimanip’ to retain protein sequences that only possessed the HMM profile, which eliminated multidomain proteins that only had the HMM profile as a one of its domains (Table S3 for length filter thresholds). (Note: The number of hits prior to filtering can be found in Table S3) For HMM profiles that yielded a high number of protein hits that were not well-filtered (e.g. Transferrin Receptor), we aligned sequences with MAFFT-DASH61, removed sites that were uninformative or contained 90% gaps with ClipKIT106 (kpic-gappy 0.9), and generated phylogenetic trees with IQ-TREE107 using ModelFinder108 to choose the appropriate substitution model for each set of protein alignments. This procedure was also used to determine the phylogenetic relationships of Cytb561 paralogs (Fig. 4A), and the chosen substitution model was LG+F+R7. Trees were visualized with iTOL109. Multiple sequence alignments were visually inspected and curated for supplemental figures using Jalview v2110. Cytb561 structure models were predicted with AlphaFold v365 and visualized using UCSF ChimeraX64.
Gene distribution maps and correlations with upwellings.
cytb561 paralog search.
The abundance and location of S. rosetta cytb561 paralogs was searched for using the Ocean Gene Atlas8,9. Protein sequences of each paralog from S. rosetta were searched for in the MATOUv1+G (for metagenomes) and MATOUv1+T (for metatranscriptomes) databases, using tblastn with the following search parameters: expected threshold E-values < 1−10, and abundance as percent of total reads. The Ocean Gene Atlas automatically assigns protein sequences to phyla and classifies them on expected organism size, such that the results could be filtered first by size (0.8–5 μm) and then by phyla to identify only choanoflagellate specific hits. Therefore to identify choanoflagellate specific hits and the location they were gathered from, three tables were gathered from each search: the distribution of E-values (E-values per hit and their assigned phyla), abundance matrix for the 0.8–5 μm fraction (the abundance as a fraction of total reads and the sample site ID for each hit), and environmental parameters for the 0.8–5 μm fraction (all measured and inferred environmental measurements for each hits sample location). From these three tables, choanoflagellate assigned hits, abundances, and sample locations were compiled.
Upwelling velocities.
To identify average vertical velocities at the sample sites from the Tara Oceans Expedition111, from which the Ocean Gene Atlas acquires its data, upward seawater velocities were gathered from the Copernicus Marine Services’ Global Ocean Physics Analysis and Forecast model’s (https://doi.org/10.48670/moi-00016) dataset (cmems_mod_glo_phy-wcur_anfc_0.083deg_P1M-m). The coordinates for each sample site were compiled111 and vertical velocities were gathered from the dates 10/31/2020 – 5/20/2024. Then the search was done using a modified script offered by the Copernicus Marine Services (https://help.marine.copernicus.eu/en/articles/7970637-how-to-download-data-for-multiple-points-from-a-csv). The vertical velocity at each sample site was then averaged across the 3.5 year range to account for temporal variation (Fig. S8B).
Correlations and maps.
Choanoflagellate paralog abundances were tabulated with their sample site ID, sample coordinates, and average upwelling velocity (from above). First, the abundances and average vertical velocities were correlated by the Pearson function in Microsoft Excel. To calculate the associated P value for each Pearson correlation, a T statistic was calculated using the formula: t = (r•[n-2]½)•(1–r2)−½ , where r is the calculated Pearson correlation, n is the sample number, t is the T statistic. This statistic was then input into the t distribution function (TDIST) in Excel, along with the degrees of freedom (n-2), to calculate the P values. Next, the abundances and average vertical velocities were plotted in a scatter plot to manually inspect the data for outliers. If a datum was suspected of being an outlier, a Dixon’s Q test was conducted for that datum with the formula: Q = |gap|/range, where gap = the difference between the suspected outlier and the next nearest datum, and range = the range of the data set. The value is considered an outlier and removed if Q > Qtable, where Qtable is a reference value determined by the sample size and confidence level. Of all tested outliers, only one datum was removed, belonging to cytb561a transcript abundance at TARA site 52 (−16.96°, 53.99°), with a Q = 0.8718 and where Qtable = 0.3323 for 99% confidence level for a sample size of 33. Without this outlier removed the statistics for cytb561a’s correlation would be r = 0.09 and P = 0.58. Maps were plotted with the compiled table of paralog abundances and coordinates, using matplotlib with modified scripts generated by ChatGPT (https://chat.openai.com).
Supplementary Material
Acknowledgements:
We thank Nicole King for supporting an early phase in this project and providing feedback. Alain Garcia de las Bayonas, Flora Rutaganira, Ben Larson, members of the Booth lab, and the students at the Marine Biological Laboratories (MBL) 2023 Physiology Course provided essential experimental advice. We also appreciate helpful discussions from Elizabeth Blackburn, Jeremy Reiter, Wallace Marshall, Sandy Johnson, Carol Gross, and Dyche Mullins. FL was supported by a National Science Foundation Graduate Research Fellowship, a UCSF Discovery Fellowship, and the UCSF Tetrad Graduate Program Training Grant T32GM139786. This work was supported by awards to DSB from the Chan Zuckerberg Biohub, the UCSF Sandler Program for Breakthrough Biomedical Research, a David and Lucile Packard Foundation Fellowship in Science and Engineering, and a Maximizing Investigators’ Research Award for Early Stage Investigators from the National Institute of General Medical Sciences R35GM147404. Sequencing was additionally supported by an instrumentation grant from the National Institutes of Health S10OD018174 to the QB3 Vincent J. Coates Genomics Sequencing Laboratory at University of California, Berkeley.
Data Availability:
RNA sequencing data is available in the public repository NCBI Gene Expression Omnibus (GEO) database under requisition ID: GSE267344
References:
- 1.Worden A. Z. et al. Rethinking the marine carbon cycle: Factoring in the multifarious lifestyles of microbes. Science 347, (2015). [DOI] [PubMed] [Google Scholar]
- 2.Keeling P. J. Combining morphology, behaviour and genomics to understand the evolution and ecology of microbial eukaryotes. Philos. Trans. R. Soc. B Biol. Sci. 374, 20190085 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Leadbeater B. S. C. Observations on the life-history and ultrastructure of the marine choanoflagellate Choanoeca perplexa Ellis. J. Mar. Biol. Assoc. U. K. 57, 285–301 (1977). [Google Scholar]
- 4.Leadbeater B. S. C. The Choanoflagellates: Evolution, Biology, and Ecology. (Cambridge University Press, Cambridge, United Kingdom, 2015). [Google Scholar]
- 5.Reyes-Rivera J. et al. Nitric oxide signaling controls collective contractions in a colonial choanoflagellate. Curr. Biol. 32, 2539–2547.e5 (2022). [DOI] [PubMed] [Google Scholar]
- 6.Dayel M. J. et al. Cell differentiation and morphogenesis in the colony-forming choanoflagellate Salpingoeca rosetta. Dev. Biol. 357, 73–82 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.McKie A. T. et al. An Iron-Regulated Ferric Reductase Associated with the Absorption of Dietary Iron. Science 291, 1755–1759 (2001). [DOI] [PubMed] [Google Scholar]
- 8.Carradec Q. et al. A global ocean atlas of eukaryotic genes. Nat. Commun. 9, 373 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Vernette C. et al. The Ocean Gene Atlas v2.0: online exploration of the biogeography and phylogeny of plankton genes. Nucleic Acids Res. 50, W516–W526 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Capone D. G. & Hutchins D. A. Microbial biogeochemistry of coastal upwelling regimes in a changing ocean. Nat. Geosci. 6, 711–717 (2013). [Google Scholar]
- 11.Tagliabue A. et al. Authigenic mineral phases as a driver of the upper-ocean iron cycle. Nature 620, 104–109 (2023). [DOI] [PubMed] [Google Scholar]
- 12.Sofen L. E. et al. Authigenic Iron Is a Significant Component of Oceanic Labile Particulate Iron Inventories. Glob. Biogeochem. Cycles 37, e2023GB007837 (2023). [Google Scholar]
- 13.Barbeau K., Moffett J. W., Caron D. A., Croot P. L. & Erdner D. L. Role of protozoan grazing in relieving iron limitation of phytoplankton. Nature 380, 61–64 (1996). [Google Scholar]
- 14.Barbeau K. & Moffett J. W. Laboratory and field studies of colloidal iron oxide dissolution as mediated by phagotrophy and photolysis. Limnol. Oceanogr. 45, 827–835 (2000). [Google Scholar]
- 15.Barbeau K., Kujawinski E. B. & Moffett J. W. Remineralization and recycling of iron, thorium and organic carbon by heterotrophic marine protists in culture. Aquat. Microb. Ecol. 24, 69–81 (2001). [Google Scholar]
- 16.Mikhailov K. V. et al. The origin of Metazoa: a transition from temporal to spatial cell differentiation. BioEssays 31, 758–768 (2009). [DOI] [PubMed] [Google Scholar]
- 17.Sebé-Pedrós A., Degnan B. M. & Ruiz-Trillo I. The origin of Metazoa: a unicellular perspective. Nat. Rev. Genet. 18, 498–512 (2017). [DOI] [PubMed] [Google Scholar]
- 18.Brunet T. & King N. The Origin of Animal Multicellularity and Cell Differentiation. Dev. Cell 43, 124–140 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Arendt D. et al. The origin and evolution of cell types. Nat. Rev. Genet. 17, 744–757 (2016). [DOI] [PubMed] [Google Scholar]
- 20.Najle S. R. & Ruiz-Trillo I. The Protistan Origins of Animal Cell Differentiation. in Origin and Evolution of Metazoan Cell Types (CRC Press, 2021). [Google Scholar]
- 21.Márquez-Zacarías P. et al. Evolution of Cellular Differentiation: From Hypotheses to Models. Trends Ecol. Evol. 36, 49–60 (2021). [DOI] [PubMed] [Google Scholar]
- 22.Brunet T. & King N. The Single-Celled Ancestors of Animals: A History of Hypotheses. in The Evolution of Multicellularity (CRC Press, 2022). [Google Scholar]
- 23.Levin T. C. & King N. Evidence for Sex and Recombination in the Choanoflagellate Salpingoeca rosetta. Curr. Biol. 23, 2176–2180 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Levin T. C., Greaney A. J., Wetzel L. & King N. The Rosetteless gene controls development in the choanoflagellate S. rosetta. eLife 3, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Alegado R. A. et al. A bacterial sulfonolipid triggers multicellular development in the closest living relatives of animals. eLife 1, e00013 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Woznica A. et al. Bacterial lipids activate, synergize, and inhibit a developmental switch in choanoflagellates. Proc. Natl. Acad. Sci. U. S. A. 113, 7894–7899 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Peng C.-C., Dormanns N., Regestein L. & Beemelmanns C. Isolation of sulfonosphingolipids from the rosette-inducing bacterium Zobellia uliginosa and evaluation of their rosette-inducing activity. RSC Adv. 13, 27520–27524 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Perotti O., Esparza G. V. & Booth D. S. A red algal polysaccharide influences the multicellular development of the choanoflagellate Salpingoeca rosetta. 2024.05.14.594265 Preprint at 10.1101/2024.05.14.594265 (2024). [DOI] [Google Scholar]
- 29.Fairclough S. R., Dayel M. J. & King N. Multicellular development in a choanoflagellate. Curr. Biol. 20, R875–R876 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Brunet T. et al. A flagellate-to-amoeboid switch in the closest living relatives of animals. eLife 10, e61037 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Fairclough S. R. et al. Premetazoan genome evolution and the regulation of cell differentiation in the choanoflagellate Salpingoeca rosetta. Genome Biol. 14, R15 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Nedashkovskaya O. I. et al. Echinicola pacifica gen. nov., sp. nov., a novel flexibacterium isolated from the sea urchin Strongylocentrotus intermedius. Int. J. Syst. Evol. Microbiol. 56, 953–958 (2006). [DOI] [PubMed] [Google Scholar]
- 33.Richter D. J., Fozouni P., Eisen M. B. & King N. Gene family innovation, conservation and loss on the animal stem lineage. eLife 7, e34226 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Fumagalli M. R., Zapperi S. & Porta C. A. L. Gene regulatory programs in the life history of Salpingoeca rosetta. 2023.06.12.544615 Preprint at 10.1101/2023.06.12.544615 (2023). [DOI] [Google Scholar]
- 35.Binns D. et al. QuickGO: a web-based tool for Gene Ontology searching. Bioinformatics 25, 3045–3046 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Huntley R. P. et al. The GOA database: Gene Ontology annotation updates for 2015. Nucleic Acids Res. 43, D1057–D1063 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Huang D. W., Sherman B. T. & Lempicki R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009). [DOI] [PubMed] [Google Scholar]
- 38.Sherman B. T. et al. DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res. 50, W216–W221 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Thomas P. D. et al. PANTHER: Making genome-scale phylogenetics accessible to all. Protein Sci. 31, 8–22 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gaudet P. & Dessimoz C. Gene Ontology: Pitfalls, Biases, and Remedies. in The Gene Ontology Handbook (eds. Dessimoz C. & Škunca N.) 189–205 (Springer, New York, NY, 2017). doi: 10.1007/978-1-4939-3743-1_14. [DOI] [PubMed] [Google Scholar]
- 41.Sutak R., Camadro J.-M. & Lesuisse E. Iron Uptake Mechanisms in Marine Phytoplankton. Front. Microbiol. 11, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Caza M. & Kronstad J. W. Shared and distinct mechanisms of iron acquisition by bacterial and fungal pathogens of humans. Front. Cell. Infect. Microbiol. 3, 80 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Allen M. D., del Campo J. A., Kropat J. & Merchant S. S. FEA1, FEA2, and FRE1, Encoding Two Homologous Secreted Proteins and a Candidate Ferrireductase, Are Expressed Coordinately with FOX1 and FTR1 in Iron-Deficient Chlamydomonas reinhardtii. Eukaryot. Cell 6, 1841–1852 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kaplan J. Mechanisms of Cellular Iron Acquisition: Another Iron in the Fire. Cell 111, 603–606 (2002). [DOI] [PubMed] [Google Scholar]
- 45.Wells M. L., Mayer L. M. & Guillard R. R. L. A chemical method for estimating the availability of iron to phytoplankton in seawater. Mar. Chem. 33, 23–40 (1991). [Google Scholar]
- 46.Booth D. S. & King N. Genome editing enables reverse genetics of multicellular development in the choanoflagellate Salpingoeca rosetta. eLife 9, e56193 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ramos-Alonso L., Romero A. M., Martínez-Pastor M. T. & Puig S. Iron Regulatory Mechanisms in Saccharomyces cerevisiae. Front. Microbiol. 11, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Richter D. J. et al. EukProt: A database of genome-scale predicted proteins across the diversity of eukaryotes. Peer Community J. 2, (2022). [Google Scholar]
- 49.Wunderlich J. et al. Iron transport pathways in the human malaria parasite Plasmodium falciparum revealed by RNA-sequencing. 2024.04.18.590068 Preprint at 10.1101/2024.04.18.590068 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Stearman R., Yuan D. S., Yamaguchi-Iwai Y., Klausner R. D. & Dancis A. A Permease-Oxidase Complex Involved in High-Affinity Iron Uptake in Yeast. Science 271, 1552–1557 (1996). [DOI] [PubMed] [Google Scholar]
- 51.Dix D. R., Bridgham J. T., Broderius M. A., Byersdorfer C. A. & Eide D. J. The FET4 gene encodes the low affinity Fe(II) transport protein of Saccharomyces cerevisiae. J. Biol. Chem. 269, 26092–26099 (1994). [PubMed] [Google Scholar]
- 52.McQuaid J. B. et al. Carbonate-sensitive phytotransferrin controls high-affinity iron uptake in diatoms. Nature 555, 534–537 (2018). [DOI] [PubMed] [Google Scholar]
- 53.Pantopoulos K., Porwal S. K., Tartakoff A. & Devireddy L. Mechanisms of Mammalian Iron Homeostasis. Biochemistry 51, 5705–5724 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Richter D. J., Fozouni P., Eisen M. B. & King N. Gene family innovation, conservation and loss on the animal stem lineage. eLife 7, e34226 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Bode H.-P., Dumschat M., Garotti S. & Fuhrmann G. F. Iron Sequestration by the Yeast Vacuole. Eur. J. Biochem. 228, 337–342 (1995). [PubMed] [Google Scholar]
- 56.Philpott C. C. & Protchenko O. Response to Iron Deprivation in Saccharomyces cerevisiae. Eukaryot. Cell 7, 20–27 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Götzke H. et al. The ALFA-tag is a highly versatile tool for nanobody-based bioscience applications. Nat. Commun. 10, 4403 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Laundon D., Larson B. T., McDonald K., King N. & Burkhardt P. The architecture of cell differentiation in choanoflagellates and sponge choanocytes. PLOS Biol. 17, e3000226 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Brunet T. & Booth D. S. Chapter One - Cell polarity in the protist-to-animal transition. in Current Topics in Developmental Biology (ed. Tepass U.) vol. 154 1–36 (Academic Press, 2023). [DOI] [PubMed] [Google Scholar]
- 60.Minh B. Q. et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 37, 1530–1534 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Rozewicki J., Li S., Amada K. M., Standley D. M. & Katoh K. MAFFT-DASH: integrated protein sequence and structural alignment. Nucleic Acids Res. 47, W5–W10 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Torruella G., Galindo L. J., Moreira D. & López-García P. Phylogenomics of neglected flagellated protists supports a revised eukaryotic tree of life. 2024.05.15.594285 Preprint at 10.1101/2024.05.15.594285 (2024). [DOI] [PubMed] [Google Scholar]
- 63.Hoang D. T., Chernomor O., von Haeseler A., Minh B. Q. & Vinh L. S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol. Biol. Evol. 35, 518–522 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Pettersen E. F. et al. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci. Publ. Protein Soc. 30, 70–82 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Abramson J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Ganasen M. et al. Structural basis for promotion of duodenal iron absorption by enteric ferric reductase with ascorbate. Commun. Biol. 1, 1–12 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Marsh R. & van Sebille E. Chapter 3 - Surface drift, gyres, and the fate of plastic. in Currents Ocean (eds. Marsh R. & van Sebille E.) 63–102 (Elsevier, 2021). doi: 10.1016/B978-0-12-816059-6.00012-7. [DOI] [Google Scholar]
- 68.Fitzwater S. E. et al. Iron, nutrient and phytoplankton biomass relationships in upwelled waters of the California coastal system. Cont. Shelf Res. 23, 1523–1544 (2003). [Google Scholar]
- 69.von der Heyden B. P. & Roychoudhury A. N. A review of colloidal iron partitioning and distribution in the open ocean. Mar. Chem. 177, 9–19 (2015). [Google Scholar]
- 70.Behrenfeld M. J., Bale A. J., Kolber Z. S., Aiken J. & Falkowski P. G. Confirmation of iron limitation of phytoplankton photosynthesis in the equatorial Pacific Ocean. Nature 383, 508–511 (1996). [Google Scholar]
- 71.Wells M. L., Mayer L. M., Donard O. F. X., de Souza Sierra M. M. & Ackelson S. G. The photolysis of colloidal iron in the oceans. Nature 353, 248–250 (1991). [Google Scholar]
- 72.Bundy R. M., Jiang M., Carter M. & Barbeau K. A. Iron-Binding Ligands in the Southern California Current System: Mechanistic Studies. Front. Mar. Sci. 3, (2016). [Google Scholar]
- 73.Manck L. E. et al. Petrobactin, a siderophore produced by Alteromonas, mediates community iron acquisition in the global ocean. ISME J. 16, 358–369 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Tranvik L. J., Sherr E. B. & Sherr B. F. Uptake and utilization of ‘colloidal DOM’ by heterotrophic flagellates in seawater. Mar. Ecol. Prog. Ser. 92, 301–309 (1993). [Google Scholar]
- 75.Gonzalez J. M. & Suttle C. A. Grazing by marine nanoflagellates on viruses and virus-sized particles: ingestion and digestion. Mar. Ecol. Prog. Ser. 94, 1–10 (1993). [Google Scholar]
- 76.Wilken S. et al. Choanoflagellates alongside diverse uncultured predatory protists consume the abundant open-ocean cyanobacterium Prochlorococcus. Proc. Natl. Acad. Sci. U. S. A. 120, e2302388120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Dupont C. L., Butcher A., Valas R. E., Bourne P. E. & Caetano-Anollés G. History of biological metal utilization inferred through phylogenomic analysis of protein structures. Proc. Natl. Acad. Sci. 107, 10567–10572 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Kiørboe T., Visser A. & Andersen K. H. A trait-based approach to ocean ecology. ICES J. Mar. Sci. 75, 1849–1863 (2018). [Google Scholar]
- 79.Knoll A. H. & Nowak M. A. The timetable of evolution. Sci. Adv. 3, e1603076 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Wade J., Byrne D. J., Ballentine C. J. & Drakesmith H. Temporal variation of planetary iron as a driver of evolution. Proc. Natl. Acad. Sci. 118, e2109865118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Sperling E. A., Knoll A. H. & Girguis P. R. The Ecological Physiology of Earth’s Second Oxygen Revolution. Annu. Rev. Ecol. Evol. Syst. 46, 215–235 (2015). [Google Scholar]
- 82.Sebé-Pedrós A. et al. Early metazoan cell type diversity and the evolution of multicellular gene regulation. Nat. Ecol. Evol. 2, 1176–1188 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Allen A. E. et al. Whole-cell response of the pennate diatom Phaeodactylum tricornutum to iron starvation. Proc. Natl. Acad. Sci. 105, 10438–10443 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Glaesener A. G., Merchant S. S. & Blaby-Haas C. E. Iron economy in Chlamydomonas reinhardtii. Front. Plant Sci. 4, (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Arendt D. The evolution of cell types in animals: emerging principles from molecular studies. Nat. Rev. Genet. 9, 868–882 (2008). [DOI] [PubMed] [Google Scholar]
- 86.Tanay A. & Sebé-Pedrós A. Evolutionary cell type mapping with single-cell genomics. Trends Genet. 37, 919–932 (2021). [DOI] [PubMed] [Google Scholar]
- 87.Zeng H. What is a cell type and how to define it? Cell 185, 2739–2755 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Brunet T. & King N. The Single-Celled Ancestors of Animals: A History of Hypotheses. (2020) doi: 10.20944/preprints202011.0302.v1. [DOI] [Google Scholar]
- 89.Knoll A. H. The Multiple Origins of Complex Multicellularity. Annu. Rev. Earth Planet. Sci. 39, 217–239 (2011). [Google Scholar]
- 90.Richter D. J. & King N. The Genomic and Cellular Foundations of Animal Origins. Annu. Rev. Genet. 47, 509–537 (2013). [DOI] [PubMed] [Google Scholar]
- 91.Booth D. S., Szmidt-Middleton H. & King N. Transfection of choanoflagellates illuminates their cell biology and the ancestry of animal septins. Mol. Biol. Cell 29, 3026–3038 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Perotti O., Viramontes G. & Booth D. A red algal polysaccharide influences the multicellular development of the choanoflagellate S. rosetta. (2024).
- 93.Woznica A. et al. STING mediates immune responses in the closest living relatives of animals. eLife 10, e70436 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Coyle M. C. et al. An RFX transcription factor regulates ciliogenesis in the closest living relatives of animals. Curr. Biol. 33, 3747–3758.e9 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Rutaganira F. U. N., Scopton A. P., Dar A. C. & King N. A small molecule screen reveals the essentiality of p38 kinase for choanoflagellate cell proliferation. 2022.08.26.505350 Preprint at 10.1101/2022.08.26.505350 (2022). [DOI] [Google Scholar]
- 96.Harayama T. & Riezman H. Understanding the diversity of membrane lipid composition. Nat. Rev. Mol. Cell Biol. 19, 281–296 (2018). [DOI] [PubMed] [Google Scholar]
- 97.Sohlenkamp C. & Geiger O. Bacterial membrane lipids: diversity in structures and pathways. FEMS Microbiol. Rev. 40, 133–159 (2016). [DOI] [PubMed] [Google Scholar]
- 98.Bray N. L., Pimentel H., Melsted P. & Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016). [DOI] [PubMed] [Google Scholar]
- 99.Pimentel H., Bray N. L., Puente S., Melsted P. & Pachter L. Differential analysis of RNA-seq incorporating quantification uncertainty. Nat. Methods 14, 687–690 (2017). [DOI] [PubMed] [Google Scholar]
- 100.Supek F., Bošnjak M., Škunca N. & Šmuc T. REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms. PLOS ONE 6, e21800 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Chen J. S. et al. CRISPR-Cas12a target binding unleashes indiscriminate single-stranded DNase activity. Science 360, 436–439 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Tabata R. et al. Systemic Regulation of Iron Acquisition by Arabidopsis in Environments with Heterogeneous Iron Distributions. Plant Cell Physiol. 63, 842–854 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Paysan-Lafosse T. et al. InterPro in 2022. Nucleic Acids Res. 51, D418–D427 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Eddy S. R. Accelerated Profile HMM Searches. PLOS Comput. Biol. 7, e1002195 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Richter D. J. et al. EukProt: A database of genome-scale predicted proteins across the diversity of eukaryotes. 2020.06.30.180687 Preprint at 10.1101/2020.06.30.180687 (2022). [DOI] [Google Scholar]
- 106.Steenwyk J. L., Iii T. J. B., Li Y., Shen X.-X. & Rokas A. ClipKIT: A multiple sequence alignment trimming software for accurate phylogenomic inference. PLOS Biol. 18, e3001007 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Nguyen L.-T., Schmidt H. A., von Haeseler A. & Minh B. Q. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 32, 268–274 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Kalyaanamoorthy S., Minh B. Q., Wong T. K. F., von Haeseler A. & Jermiin L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Letunic I. & Bork P. Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. gkae268 (2024) doi: 10.1093/nar/gkae268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Waterhouse A. M., Procter J. B., Martin D. M. A., Clamp M. & Barton G. J. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Pesant S. et al. Open science resources for the discovery and analysis of Tara Oceans data. Sci. Data 2, 150023 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Bloh K. et al. Deconvolution of Complex DNA Repair (DECODR): Establishing a Novel Deconvolution Algorithm for Comprehensive Analysis of CRISPR-Edited Sanger Sequencing Data. CRISPR J. 4, 120–131 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
RNA sequencing data is available in the public repository NCBI Gene Expression Omnibus (GEO) database under requisition ID: GSE267344