Skip to main content
eLife logoLink to eLife
. 2020 Jun 4;9:e56193. doi: 10.7554/eLife.56193

Genome editing enables reverse genetics of multicellular development in the choanoflagellate Salpingoeca rosetta

David S Booth 1,†,, Nicole King 1
Editors: Alejandro Sánchez Alvarado2, Patricia J Wittkopp3
PMCID: PMC7314544  PMID: 32496191

Abstract

In a previous study, we established a forward genetic screen to identify genes required for multicellular development in the choanoflagellate, Salpingoeca rosetta (Levin et al., 2014). Yet, the paucity of reverse genetic tools for choanoflagellates has hampered direct tests of gene function and impeded the establishment of choanoflagellates as a model for reconstructing the origin of their closest living relatives, the animals. Here we establish CRISPR/Cas9-mediated genome editing in S. rosetta by engineering a selectable marker to enrich for edited cells. We then use genome editing to disrupt the coding sequence of a S. rosetta C-type lectin gene, rosetteless, and thereby demonstrate its necessity for multicellular rosette development. This work advances S. rosetta as a model system in which to investigate how genes identified from genetic screens and genomic surveys function in choanoflagellates and evolved as critical regulators of animal biology.

Research organism: Other

Introduction

As the sister group of animals (Figure 1A), choanoflagellates have great potential for revealing the origins of animal development and the cell biology of multicellularity (Lang et al., 2002; Burger et al., 2003; Carr et al., 2008; Ruiz-Trillo et al., 2008; Grau-Bové et al., 2017). Comparative genomic studies have demonstrated that choanoflagellates express genes that are necessary for animal development (King et al., 2008; Fairclough et al., 2013; Richter et al., 2018; López-Escardó et al., 2019), including genes for intercellular adhesion (e.g. cadherins: Abedin and King, 2008; Nichols et al., 2012), signaling (e.g. receptor tyrosine kinases and CamKII: Manning et al., 2008; Pincus et al., 2008; Bhattacharyya et al., 2016; Amacher et al., 2018), and cellular differentiation (e.g. myc, STAT, and p53: Young et al., 2011; de Mendoza et al., 2013). Moreover, choanoflagellates and animals are the only clades that have cells with a collar complex (Leadbeater, 2015; Brunet and King, 2017), a unique cellular module in which a collar (choano in Greek) of actin-filled microvilli surrounds an apical flagellum (Figure 1B; Sebé-Pedrós et al., 2013; Peña et al., 2016; Colgren and Nichols, 2020). Together, these observations have motivated the development of choanoflagellates as models for researching the function and evolution of core developmental regulators (King, 2004; Hoffmeyer and Burkhardt, 2016; Sebé-Pedrós et al., 2017; Brunet and King, 2017).

Figure 1. Introduction to Salpingoeca rosetta as a simple model for multicellularity and the ancestry of animal cell biology.

Figure 1.

(A) Choanoflagellates (blue) are the closest living relatives of animals (red) and last shared a common ancestor (purple) ~800 million years ago (Parfrey et al., 2011). (B) The collar complex, an apical flagellum (f) surrounded by a collar (c) of actin-filled microvilli, typifies choanoflagellates and is uniquely shared between choanoflagellates and animals (Brunet and King, 2017). (C) Wild-type S. rosetta forms multicellular rosette colonies in response to rosette inducing factors (RIFs) secreted by environmental bacteria. In the absence of RIFs (C’), S. rosetta grows as single cells or as a linear chain of cells (star). Upon the addition of RIFs (C’; Alegado et al., 2012; Woznica et al., 2016), S. rosetta develops into spheroidal, multicellular rosettes (arrowhead) through serial cell divisions (Fairclough et al., 2010). (D) The rosetteless C-type lectin gene is necessary for rosette development. A mutation in rosetteless allows normal cell growth as single cells and linear chains in the absence of RIFs (D’) but prevents rosette development in the presence of RIFs (D’; Levin et al., 2014). (E) Wild-type S. rosetta secretes Rosetteless protein from the basal ends of cells into the interior of rosettes. Shown is a representative rosette stained with an antibody to alpha-tubulin to mark cortical microtubules and the apical flagellum of each cell (E’, grey) phalloidin to mark actin-filled microvilli (E’, magenta), and an antibody to Rosetteless protein (E’’’, green). A merge of alpha-tubulin, phalloidin, and Rosetteless staining shows that Rosetteless protein localizes to the interior of rosettes (arrow) where cells meet at their basal ends (E''''; Levin et al., 2014).

The choanoflagellate Salpingocea rosetta has received the greatest investment in tool development (Hoffmeyer and Burkhardt, 2016). Its 55.44 megabase genome encodes ~11,629 genes, some of which are homologs of integral regulators for animal development (Fairclough et al., 2013). Moreover, the life history of S. rosetta provides a rich biological context for investigating the functions of intriguing genes (King et al., 2003; Fairclough et al., 2010; Dayel et al., 2011; Levin and King, 2013; Woznica et al., 2017). For example, S. rosetta develops into multicellular, spheroidal colonies called rosettes through serial cell divisions from a single founding cell (Fairclough et al., 2010; Laundon et al., 2019; Larson et al., 2020), a process induced by environmental bacteria that can also serve as a food source (Figure 1C; Alegado et al., 2012; Woznica et al., 2016). Thus, rosette development can provide a phylogenetically relevant model for discovering genes that mediate multicellular development and bacterial recognition in choanoflagellates and animals.

A forward genetic screen was established to hunt for mutants that were unable to develop into rosettes and resulted in the identification of genes required for rosette development (Levin et al., 2014; Wetzel et al., 2018). The first of these (Levin et al., 2014), rosetteless encodes a C-type lectin protein that localizes to the interior of rosettes (Figure 1D–E). As C-type lectins are important for mediating intercellular adhesion in animals (Drickamer and Fadden, 2002; Cummings and McEver, 2015), this discovery highlighted the conserved role of an adhesion protein family for animal and choanoflagellate development. However, the screen also underscored the necessity for targeted genetics in S. rosetta. Because of inefficient mutagenesis in S. rosetta, forward genetics has been laborious: out of 37,269 clones screened, only 16 rosette-defect mutants were isolated and only three of these have been mapped to genes (Levin et al., 2014; Wetzel et al., 2018). Establishing genome editing would accelerate direct testing of gene candidates identified through forward genetic screens, differential gene expression, and/or genomic comparisons.

Therefore, for the present study, we sought to establish CRISPR/Cas9 genome editing in S. rosetta. Cas9-mediated genome editing (Jinek et al., 2012; Jinek et al., 2013; Cong et al., 2013) has been crucial for advancing genetics in emerging models (Gilles and Averof, 2014; Harrison et al., 2014; Momose and Concordet, 2016). Depending on the DNA repair pathways expressed in a given organism (Yeh et al., 2019), the delivery of the Cas9 endonuclease bound to a programmable guide RNA (gRNA) can direct DNA cleavage at a target site to introduce mutations from co-delivered DNA templates or from untemplated repair errors that cause insertions or deletions (Rouet et al., 1994; Choulika et al., 1995; Bibikova et al., 2001; Jinek et al., 2013; Cong et al., 2013). While the delivery of macromolecules into choanoflagellate cells has been a longstanding barrier for establishing reverse genetic tools, we recently established a robust method to transfect S. rosetta with DNA plasmids for expressing transgenes (Booth et al., 2018), which allowed us to perform genetic complementation (Wetzel et al., 2018). Despite having established a method for gene delivery in S. rosetta, the lack of knowledge about DNA repair mechanisms in choanoflagellates and low-transfection efficiency (~1%) presented challenges for establishing genome editing, particularly without a proven selectable marker in an endogenous gene to enrich for editing events.

Here we report a reliable method for genome editing to perform reverse genetics in S. rosetta that we have developed into a publicly-accessible protocol (https://dx.doi.org/10.17504/protocols.io.89fhz3n). First, we engineered a selectable marker for cycloheximide resistance as an initial demonstration of genome editing with CRISPR/Cas9 (Figure 2). We then inserted a foreign sequence into rosetteless that eliminates its function, confirming the importance of this gene for multicellular rosette development (Figure 3). Finally, we found that, even in the absence of selection, S. rosetta preferentially uses DNA templates to introduce mutations for genome editing (Figure 4). This work establishes genome editing in S. rosetta and provides a path for testing the function of choanoflagellate genes that are implicated in the early evolution of animals.

Figure 2. Engineered cycloheximide resistance in S. rosetta provides a proof-of-principle for Cas9-mediated genome editing.

(A) Schematic of Cas9-mediated genome editing to engineer cycloheximide resistance in S. rosetta. Nucleofection was used to deliver SpCas9 (gray) bound to gRNA (cyan), which together form the SpCas9 RNP, and repair oligonucleotides (Repair Oligo; Figure 2—figure supplement 2) to engineer cycloheximide resistance. After recovering cells for one day, successfully edited cells were selected by growth in media supplemented with cycloheximide (cyh), which inhibits the growth of wild-type cells (Figure 2—figure supplement 1) and selects for cycloheximide-resistant cells (purple). (B) A designer cycloheximide-resistant allele (Figure 2—figure supplement 2) allows cell proliferation in the presence of cycloheximide. Wild-type (WT, black dots and line) and rpl36aP56Q (purple dots and line) strains were placed into media supplemented with a range of cycloheximide concentrations (x-axis) at a cell density of 104 cells/ml and then were grown for two days. rpl36aP56Q grew to higher cell densities than the wild-type strain at cycloheximide concentrations < 10 ng/ml. At higher concentrations, cycloheximide inhibited growth of both strains. The dots show cell densities from three independent replicates. The lines show the average from independently fitting a dose inhibition curve to the cell densities from three independent experiments.

Figure 2.

Figure 2—figure supplement 1. An approach for selecting cycloheximide resistance in S. rosetta.

Figure 2—figure supplement 1.

(A) Cycloheximide inhibits S. rosetta growth. We seeded each well of a 24-well plate with 0.5 ml of cells at 2 × 104 cells/ml in a 3-fold serial dilution of cycloheximide, including a condition without cycloheximide. Three independent wells were set up for each concentration of cycloheximide for replicate measures. The cell density at each concentration was determined after 48 hr by counting with a hemocytometer. To establish a relationship between growth rate and cycloheximide concentration, the cell density was transformed into relative growth rates (S) using a rearranged form of the logistic equation: S=tt+T•lnPcyh/P0 , in which the cell density in a given cycloheximide concentration (Pcyh) at time (t = 48 hr) was normalized by the cell density without cyloheximide (P0) and the doubling time without cycloheximide (T = 10 hr) was taken from growth curves of wild-type strains (Figure 3—figure supplement 2). After performing a linear fit of the data, we determined that 5 ng/ml of cycloheximide retards the growth rate two-fold. (B) The cycloheximide resistant mutation rpl36aP56Q disrupts cycloheximde binding to the large ribosomal subunit of yeast (left). A crystal structure of cycloheximide bound to the yeast 80S ribosome (Garreau de Loubresse et al., 2014; PDB 4U3U) shows that the yeast ortholog of Rpl36a (L42A; black) and ribosomal RNA (gray) form the cycloheximide binding pocket. The most critical residues in L42A for cycloheximide binding are Lys55, Pro56, and Phe58 (right). In silico modeling (Goddard et al., 2005) of cycloheximide resistance mutations (Bae et al., 2018) shows that some rotamers of the Pro56Gln substitution (purple) disrupt van der Waals interactions and cause steric clashes. (C) The S. rosetta ortholog of Rpl36a conserves residues that bind cycloheximide in yeast and Chlamydomonas reinhardtii. A sequence alignment (Sievers et al., 2011) of Rpl36a orthologs from S. rosetta, S. cerevisiae (L42A), and C. reinhardtii (RPL41) shows that S. rosetta conserves residues for cycloheximide binding, the most critical of which is Pro56 (purple arrow). (D) The efficiency of genome editing alters the selection of edited alleles. Using growth parameters determined from S. rosetta growth curves (Figure 3—figure supplement 2) and cycloheximide inhibition (Figure 2—figure supplement 1A), we modeled the selection for cycloheximide resistance using the following equation: Fcyh=ε[Mt/ST+(1ε)]ε[Met/ST+(1ε)]+(1ε)[Met/T+ε] , where Fcyh is the frequency of the cycloheximide resistant allele, ε is the genome editing efficiency (which also corresponds to the initial frequency of the edited allele), M is the ratio of the carrying capacity to the cell density (which we set as an arbitrarily large number because continuous passaging in the laboratory can keep the cell population far from the carrying capacity), t is the time of growth after starting selection, T is the doubling time in the absence of selection (which is 10 hr, see Fig. S4), and S is the relative growth rate in the presence of selection (which we set to 2, based on the relative growth rate upon adding 5 ng/ml of cycloheximide to cells (panel A). Notably, this model only captures the relative changes in growth upon selection and assumes that the edited allele is insensitive to the drug; the model does not include a term for the rate of cells dying, which we observed to happen after 3 days of selection. Nonetheless, this model helped us determine that after 5 days of growth in a selective media, we could expect to observe genome editing events occurring at frequencies > 0.01%.

Figure 2—figure supplement 2. Engineered cycloheximide resistance establishes genome editing conditions.

Figure 2—figure supplement 2.

(A) The design of a cycloheximide resistant allele, rpl36aP56Q, in S. rosetta. The protospacer adjacent motif (PAM, orange) next to the 56th codon of rpl36a (Target, cyan), which is located on the second exon (thick black line labeled 2), provides a suitable site to design a gRNA that targets SpCas9 cleavage (sequence is shown underneath the locus schematic, and carets indicate the target cleavage site). A repair oligonucleotide (black line with knob) introduces a cycloheximide resistant allele, rpl36aP56Q (Mutation, purple), flanked by 100 bases of homologous sequence. The sequence of the edited allele is shown below. (B–D) A comparison of genotypes from populations of unedited cells (B), edited cells (C), and a strain established from a clonal isolate of edited cells (D) shows that cycloheximide selection enriches for rpl36aP56Q. The genotype for each population was determined by amplifying the locus with primers surrounding the editing site (black arrows in panel A) that did not overlap in sequence with the repair oligonucleotide. One of the primers had a T3 primer binding site for Sanger sequencing of amplicons (black arrow with flap). Remarkably, after selection, the wild-type allele was not detected (B). (E) S. rosetta uses repair oligonucleotides with >20 nt homology arms for genome editing. Truncations of repair oligonucleotides encoding the rpl36aP56Q allele were designed in the same orientation as gRNAs (sense, black dots and lines) or the opposite orientation (antisense, gray dots and lines). 24 hr after S. rosetta recovered from transfections with repair templates and SpCas9 RNPs, cycloheximide was added to grow cells in selective media for five days, at which time the cells were harvested for counting cell density and for genotyping. Closed circles indicate that the consensus genotype of the cell population had the rpl36aP56Q allele in Sanger sequencing; whereas, open circles indicate that the cell population had the wild-type allele. E’ and E’’ show two independent trials. Notably, we observed a slight bias for repair oligonucleotides in the sense direction, particularly with shorter homology arms of 20-30 bases. Because repair templates in the sense orientation with 40–80 bases of homologous sequence resulted in the best editing, we performed subsequent optimization with a sense repair oligonucleotide that 50-base homology arms on each side of the double-stranded break. (F) Small quantities of SpCas9 RNPs are sufficient to initiate genome editing. Decreasing concentrations of SpCas9 RNP (SpCas9 was the limiting factor) and a constant amount of repair template were transfected into S. rosetta. After characterizing genome editing outcomes by counting cell density and sequencing the consensus genotype (described in panel E), we found that low concentrations of SpCas9 (20 pmol) were sufficient to introduce the rpl36aP56Q allele. F’ and F’’ show two independent trials. (G) High concentrations of repair oligonucleotides increase genome editing efficiency. A serial dilution of a repair template was delivered into S. rosetta. The cell density and consensus genotypes from these experiments show that all concentrations of repair template can introduce the rpl36aP56Q allele, but the higher cell densities recovered after transfection with increasing concentrations of repair templates indicate more efficient editing. G’ and G’’ show two independent trials. (H) The addition of gRNAs stimulates genome editing. Genome editing was performed by delivering a repair oligonucleotide with SpCas9 without the addition of any gRNA or with a gRNA that was prepared from in vitro transcriptions (noted as gRNA in figure) or with a synthetic crRNA that was annealed to a synthetic tracrRNA (noted as crRNA). The consensus genotype and cell densities from these experiments show that gRNAs are necessary for editing and that gRNAs from either source were sufficient for editing. The dots show two independent experiments and lines show their average result.

Figure 3. Genome editing of rosetteless enables targeted disruption of multicellular development in S. rosetta.

(A) An engineered mutation in rosetteless introduces a premature termination sequence (PTS) to knockout the expression of rosetteless. The rosetteless gene (exons shown as numbered black boxes, connected by introns) encodes a secreted protein (SS denotes the signal sequence for secretion) with two C-type lectin domains (CTL1 and CTL2) and two carboxy-terminal repeats (RP1 and RP2). A forward genetic screen (Levin et al., 2014) identified a mutation, rtlstl1, in which a T to C transition in the seventh intron disrupts splicing and knocks out rosetteless expression. To increase the likelihood of disrupting rosetteless function with genome editing, we designed the rtlsPTS1 mutation that introduces a PTS (green), with a poly-adenylation sequence and stop codons in each reading frame, into the fourth exon of the gene. (B) The genotypes of strains established from genome-editing confirm that rosetteless and rpl36a incorporated the designed mutations. To enrich for genome-edited cells, SpCas9 RNPs and repair templates for introducing rpl36aP56Q (Figure 2—figure supplement 2) and rtlsPTS1 were simultaneously delivered into S. rosetta. Afterward, cycloheximide resistant cells were clonally isolated and screened for cells that did not develop into rosettes in the presence of RIFs. The genotypes of rtlsPTS1 rpl36aP56Q, and rpl36P56Q confirmed that strains established from genome editing had the rpl36P56Q allele and the strain with the rosetteless phenotype also had the rtlsPTS1 allele. In addition, the wild-type and genome edited strains lacked the T to C transition in the 5’-splice site of intron seven that defined the rtlstl1 allele. (C–F) Phenotypes of genome-edited strains correspond to their respective genotypes. In the absence of RIFs, all strains (C’, D’, E’, and F’) grew as chains (stars) or single cells. Upon the addition of RIFs, the wild-type (C”) and rpl36P56Q strains (D”) formed rosettes (arrowheads). In contrast, rtlsPTS1 rpl36P56Q (E”) and rtlstl1 (F”) did not form rosettes. (G–J) Mutations in rosetteless prevent the secretion of Rosetteless protein at the basal end of cells and into the interior or rosettes. Immunofluorescent staining for Rosetteless (green), alpha tubulin (gray), and actin (magenta) in wild-type (G), rpl36aP56Q (H), rtlsPTS1 rpl36aP56Q (I), and rtlstl1 (J) strains with (G”–J”) and without (G’–J’) rosette induction. Rosetteless localizes in the interior of rosettes (arrow) in wild-type and rpl36aP56Q but not rtlsPTS1 rpl36aP56Q and rtlstl1.

Figure 3.

Figure 3—figure supplement 1. Phenotypes of rosetteless mutants correspond to their genotypes.

Figure 3—figure supplement 1.

(A) The consensus genotype at the site of rosetteless editing in cell populations selected for cycloheximide resistance indicates the presence of the rtlsPTS1 allele. In a wild-type strain (top) and a clonal isolate of rltsPTS1 rpl36aP56Q (bottom), Sanger sequencing at the rtls exon four reveals no heterogeneity in sequence. In populations of cells that had been co-edited to simultaneously engineer rpl36aP56Q and rtlsPTS1 alleles and then selected for cycloheximide resistance, sequence heterogeneity was detected at rtls exon 4 (indicated by N at positions were the base cannot automatically be assigned) and revealed that the rtlsPTS1 allele was present at 16.5% (Population 1) or 10.4% (Population 2) in populations of cells from two independent experiments in which selection for cycloheximide resistant cells was performed after co-editing rpl36aP56Q and rtlsPTS1 alleles. Allele frequency was estimated by unmixing wild-type and rtlsPTS1 alleles in electropherograms from Sanger sequencing (Brinkman et al., 2018). Carets indicate site targeted cleavage by SpCas9. (B) Mutations in rosetteless eliminate rosette development. Rosette development in cell populations (N = 500 cells for each of three independent replicates) shows that wild-type and rpl36aP56Q develop into rosettes in the presence of RIFs while rtlsPTS1 rpl36aP56Q and rtlstl1 do not.

Figure 3—figure supplement 2. Wild-type and mutant strains proliferate similarly.

Figure 3—figure supplement 2.

Growth curves for wild-type (A), rpl36aP56Q (B), rtlsPTS1 rpl36aP56Q (C), and rtlstl1 (D) show similar rates of proliferation. The growth for each strain was characterized by seeding cells at a density of 1 × 104 cells per ml and determination the cell concentration every ~12 hr. For each time point, triplicate measures were taken. Each replicate growth trajectory was fit with the logistic equation to calculate the doubling time (E). An analysis of variance (ANOVA) between samples showed growth rates were probably the same (p<0.08).

Figure 4. S. rosetta preferentially introduces genome-edited mutations from DNA templates.

(A) Schematic of a gRNA targeting SpCas9 to a genomic locus of interest. A gRNA (cyan, knobs indicate 5’ ends) that encodes a 20 nt targeting sequence from the sense strand of a genomic locus (black) hybridizes with the antisense strand (dark gray). SpCas9 (light gray) introduces a double-stranded break at the genomic locus (carets), 3 bp upstream of a protospacer adjacent motif (PAM, orange). (B) We designed a panel of repair oligonucleotides to test the preferred substrates for repairing double-stranded breaks introduced by SpCas9 at rosetteles exon 4. Oligonucleotide repair templates containing the PTS sequence (green) were delivered as single-stranded DNA in the sense (S) or anti-sense (A) orientations and as a double-stranded template (D) to test which most efficiently templated DNA repair at the SpCas9 cleavage site. (C) SpCas9 stimulated the introduction of PTS mutations from DNA templates. Repair templates with a PTS (from panel B) were delivered in the presence and absence of SpCas9 (+/–). A ~ 450 bp fragment surrounding the rtlsPTS1 cleavage site was amplified from cells that had been transfected the previous day to prepare deep sequencing libraries for quantifying the frequency of PTS insertions (green) or insertions/deletions from error prone editing (black). Each experiment was performed three independent times (dots; mean and standard deviations are shown with lines). The dotted line indicates the limit of detection of the sequencing, based on a 6-base, randomized barcode. Upon transfection with the SpCas9 RNP, 10x more mutations from repair templates (1–2%, green dots) were detected than untemplated insertions or deletions (black dots). In the absence of SpCas9, mutations generated from a double-stranded template, but not single-stranded templates, were rarely (<0.1%) and unreliably (2 of 3 trials) found.

Figure 4.

Figure 4—figure supplement 1. Characterization of editing outcomes at the rosetteless locus with different types of repair templates.

Figure 4—figure supplement 1.

(A) Double-stranded DNA repair templates (black indicates homology arms from the sense strand, gray indicates homology arms from the antisense strand and green is the PTS as in Figure 4) were designed with phosphorylated 5’ ends (indicated with open circles at the 5’ end and a ‘p’ in template names; closed circles indicate unphosphorylated ends) or three phosphorothioate bonds between bases at the 5’ end (indicated with asterisks in diagrams and a ‘t’ in template names). We hypothesized that phosphorylated templates would be more susceptible to nucleases and phosphorothioate bonds less susceptible (Renaud et al., 2016; Yu et al., 2020), altering their utility as repair substrates in vivo, yet S. rosetta used all double stranded templates with similar efficiency (see panel C). (B) We also designed a panel of single-stranded repair templates (colors as in panel A) that lacked 5’ or 3’ arms (Paix et al., 2017) and found that both arms of homology are required for efficient template-mediated genome editing in S. rosetta (see panel C). (C) A comparison of DNA repair templates revealed that S. rosetta efficiently uses double stranded DNA templates during DNA repair and requires both arms of homology for single-stranded DNA templates. Frequencies of alleles containing either the PTS (green) or insertion/deletion mutations (black) are shown for genome editing experiments based on each of the templates described in panels A and B. Genome editing in the presence of double-stranded DNA templates favored template-directed DNA repair. The use of phosphorylated double-stranded DNA templates or double-stranded DNA templates with phosphorothioate bonds (see panel A) did not increase editing efficiency over unmodified double-stranded DNA templates. We also found that removing 5’ or 3’ homology arms from single stranded templates (see panel B) almost completely eliminated efficient editing as compared to single-stranded templates with both homology arms (Figure 4C). Each editing experiment was performed three independent times. (D–E) An aggregate analysis of insertion/deletion mutations identified in deep sequencing of genome editing experiments. (D) A histogram shows the length and frequency of insertion and deletion mutations. (E) A sequence alignment or representative insertion and deletion mutations from each size of insertion/deletion mutations. Notably, the most frequent deletions (8–10 bases) occur at dinucleotide repeats, suggesting that microhomologies may promote deletions after double-stranded breaks. (F) An extreme example of templated repair suggests that S. rosetta may incorporate larger insertions. One mutation identified in deep sequencing shows an 88-base insertion, with the insertion featuring two PTS sequences with an intervening region that has some homology to sequences to the left (thick line) and right (dotted line) of the double stranded break. Although we are unsure of the mechanism that led to this mutation, its presence suggests that large mutations could be incorporated into S. rosetta via genome editing.

Results

A marker to select for cycloheximide resistance facilitates genome editing in S. rosetta

Our initial attempts to target rosetteless for genome editing in S. rosetta were either unsuccessful or resulted in editing events that were below the limits of detection. Therefore, suspecting that genome editing in S. rosetta might prove to be challenging to establish, we first aimed to introduce a mutation in an endogenous gene that would confer antibiotic resistance and allow selection for rare genome editing events.

In Chlamydomonas (Stevens et al., 2001) and Fungi (Kawai et al., 1992; Dehoux et al., 1993; Kondo et al., 1995; Kim et al., 1998), specific mutations in the ribosomal protein gene rpl36a confer resistance to the antibiotic cycloheximide by disrupting cycloheximide binding to the large subunit of eukaryotic ribosomes (Stöcklein and Piepersberg, 1980; Schneider-Poetsch et al., 2010; Garreau de Loubresse et al., 2014). After finding that S. rosetta cell proliferation was inhibited by cycloheximide (Figure 2—figure supplement 1A), we sought to establish a cycloheximide-resistant strain through genome editing. By combining prior genetic findings (Bae et al., 2018) with our own structural modeling (Figure 2—figure supplement 1B) and bioinformatic analyses (Figure 2—figure supplement 1C) of the S. rosetta rpl36a homolog (PTSG_02763), we predicted that converting the 56th codon of rpl36a from a proline to a glutamine codon (rpl36aP56Q) would render S. rosetta resistant to cycloheximide (Figure 2—figure supplement 1D). Insertion or deletion mutations that could arise as errors from repairing the double-stranded break without a template would likely kill cells by disrupting the essential function of rpl36a for protein synthesis (Bae et al., 2018).

To edit the rpl36a gene in S. rosetta, we first designed a gRNA with a 20 nt sequence from rpl36a to direct Cas9 from Streptomyces pyogenes (SpCas9) to cut at S. rosetta supercontig 6: 948,122 nt (Fairclough et al., 2013). Then we made a DNA repair template as a single-stranded DNA oligonucleotide with a sequence encoding the Pro56Gln mutation and 200 bases of flanking homologous sequence from rpl36a centered on the cleavage site (Figure 2—figure supplement 2A). To deliver the SpCas9/gRNA ribonucleoprotein complex (SpCas9 RNP) and the repair template encoding the Pro56Gln mutation into S. rosetta cells, we used a nucleofection protocol adapted from our recently developed method for transfecting S. rosetta (Figure 2A; Booth et al., 2018). We favored delivering the SpCas9 RNP rather than expressing SpCas9 and gRNAs from plasmids, as RNA polymerase III promoters for driving gRNA expression have not yet been characterized in S. rosetta and the overexpression of SpCas9 from plasmids can be cytotoxic for other organisms (Jacobs et al., 2014; Jiang et al., 2014; Shin et al., 2016; Foster et al., 2018) as well as increase the likelihood of introducing off-target mutations (Kim et al., 2014b; Liang et al., 2015; Han et al., 2020). After growing transfected cells in the presence of cycloheximide for five days, Sanger sequencing of PCR-amplified rpl36a showed that rpl36aP56Q was the major allele in the population (Figure 2—figure supplement 2, compare B and C). Sequencing a clonal strain established from this population confirmed the rpl36aP56Q genotype (Figure 2—figure supplement 2D), and growth assays showed that the rpl36aP56Q strain proliferated better than the wild-type strain in increasing concentrations of cycloheximide (Figure 2B; Two-Factor ANOVA: p<10−20).

The ability to engineer cycloheximide resistance additionally offered a simple assay to optimize essential parameters for genome editing in S. rosetta. Therefore, we tested how varying delivery conditions would impact the frequency of template-mediated mutagenesis and, ultimately, the cell density and consensus genotype of a cell population after genome editing and cycloheximide treatment (Figure 2—figure supplement 2E–H). Through this optimization process (Figure 2—figure supplement 2), we found that efficient genome editing required transfection with at least 20 pmol of SpCas9 RNP and more than 200 nmol of a single-stranded DNA repair template that had 50 bases of homology flanking a designed mutation. Henceforth, these parameters established baseline conditions for designing and executing genome editing experiments.

Targeted disruption of rosetteless demonstrates its essentiality for multicellular rosette development

We next sought to use genome editing as a general tool for reverse genetics in choanoflagellates. To this end, we targeted rosetteless (rtls), one of only three genes known to be required for rosette development in S. rosetta (Levin et al., 2014; Wetzel et al., 2018). A prior forward genetic screen linked the first rosette defect mutant to an allele, rtlstl1, in which a T to C transition in the 5’-splice site of intron 7 (S. rosetta supercontig 8: 427,804 nt; Fairclough et al., 2013) was associated with the disruption of rtls expression and rosette development (Figures 3A and 1C–E; Levin et al., 2014). We therefore sought to generate a new rtls knockout allele, whose phenotype we predicted would be the loss of rosette development.

To increase the likelihood of generating a rtls knockout through genome editing, we aimed to introduce sequences that would prematurely terminate transcription and translation near the 5’ end of the gene. First, we designed a gRNA that would target SpCas9 to the 5’ end of rtls. Next, we designed a general-purpose premature termination sequence (PTS), an 18-base, palindromic sequence (5’-TTTATTTAATTAAATAAA-3’) that encodes polyadenylation sequences and stop codons on both strands and in each possible reading frame. This sequence should prematurely terminate transcription and translation to either create a gene truncation or fully knockout target gene expression. We then designed a DNA oligonucleotide repair template in which the PTS was inserted into 100 bp of rtls sequence centered around the SpCas9 cleavage site (supercontig 8: 429,354 nt).

The low efficiency of transfection (~1%; Booth et al., 2018), the inability to select for cells with the Rosetteless phenotype, and the unknown but potentially low efficiency of genome editing meant that it might be difficult to recover cells in which rosetteless had been edited. To overcome this challenge, we sought to simultaneously edit rosetteless and rpl36a by transfecting cells with RNPs complexed with gRNAs and DNA repair templates for both knocking out rosetteless and engineering cycloheximide resistance. In other organisms, this approach has allowed for co-selection by using a selectable marker to improve the recovery of cells that contain a second mutation in a different locus. In S. rosetta, we found that 10.4–16.5% of cycloheximide resistant cells contained the rtlsPTS1 allele when rosetteless and rpl36a were co-edited (Figure 3—figure supplement 1A).

By first selecting for cycloheximide resistance and then performing clonal isolation by limiting dilution, we were able to isolate multiple clonal lines that were resistant to cycloheximide. We focused on one strain that correctly formed rosettes in response to bacterial rosette inducing factors (RIFs; Figure 3D; Alegado et al., 2012; Woznica et al., 2016) and two cycloheximide-resistant strains that failed to form rosettes in the presence of RIFs (representative strain shown in Figure 3E). Genotyping of these strains at exon 4 of rosetteless and at rpl36a (Figure 3B) showed that: (1) all three cycloheximide resistant strains established from the same genome-edited population had the cycloheximide resistance allele, (2) the strains that developed into rosettes only had the cycloheximide resistant allele, rpl36aP56Q, and (3) the two strains that did not develop into rosettes also had the PTS in rosetteless exon 4, meaning their genotype was rtlsPTS1 rpl36aP56Q (Figure 3B). For comparison, we also genotyped wild-type, rpl36aP56Q, rtlsPTS1 rpl36aP56Q, and rtlstl1 strains at intron 7 of rosetteless, where the rtlstl1 mutation was mapped, to underscore that rtlsPTS1 is an independent mutation that prevents the development of rosettes (Figure 3B).

To further validate the genotype-to-phenotype relationship of the rosetteless knockouts (Figure 3C–F), we analyzed the percentage of cells that developed into rosettes (Figure 3—figure supplement 1B), the localization of the Rosetteless protein (Figure 3G-J), and the rates of proliferation (Figure 3—figure supplement 2) in the wild-type, rpl36aP56Q, rtlsPTS1 rpl36aP56Q, and rtlstl1 strains of S. rosetta. In each of these assays, the rtlsPTS1 rpl36aP56Q strains exhibited the same phenotype as rtlstl1 (Figure 3, compare E to F): no cells developed into rosettes (Figure 3—figure supplement 1B), an anti-Rosetteless antibody did not detect Rosetteless protein at the basal end of cells (Figure 3, compare I-J to G), and the mutant and wild-type strains proliferated comparably well (Figure 3—figure supplement 2). Furthermore, rpl36aP56Q developed into wild-type rosettes (Figure 3, compare D to C and Figure 3—figure supplement 1B) localized Rosetteless protein to the basal end of cells (Figure 3, compare H to G), and proliferated as rapidly as the wild-type strain (Figure 3—figure supplement 2A–B,E), demonstrating that the act of genome editing alone does not yield non-specific defects in rosette development. Our ability to engineer a new rosetteless allele, rtlsPTS1, that mimics the rosette-defect phenotype of rtlstl1 demonstrates the potential of genome editing as a general tool for generating targeted gene knockouts in choanoflagellates.

S. rosetta preferentially introduces genome-edited mutations from DNA templates

Thus far, we had only detected mutations from repair templates with homology arms spanning both sides of the double-strand break (Figures 2 and 3, and Figure 2—figure supplement 2). However, selecting for cycloheximide resistance may have favored those repair outcomes, as insertion or deletion (INDEL) mutations arising from untemplated repair are likely to be deleterious for the function of rpl36a. Therefore, to investigate the frequency of template-mediated repair in the absence of selection, we sought to edit rosetteless, which is not required for viability (Figure 3—figure supplement 2).

As prior work has shown that editing outcomes in different cell types (Harrison et al., 2014; Yeh et al., 2019) can depend on the length and orientation (anti-sense or sense) of homology arms (Lin et al., 2014; Kaulich et al., 2015; Richardson et al., 2016; Paix et al., 2017; Li et al., 2019; Okamoto et al., 2019) and chemical modifications of DNA repair templates (Tsai et al., 2015; Renaud et al., 2016; Yu et al., 2020), we designed a panel of diverse double- and single-stranded DNA repair templates that all contained the PTS (Figure 4B and Figure 4—figure supplement 1). The double-stranded templates contained phosphorylation or phosphorothioate bonds at their 5’ ends (Figure 4—figure supplement 1A); whereas, the single-stranded templates varied in their orientation and presence of 5’ or 3’ homology arms (Figure 4—figure supplement 1B). We transfected cells with these repair templates with or without the SpCas9 RNP. After the cells recovered for one day, we amplified a ~ 450 bp fragment around the SpCas9 cut site for deep sequencing (Yang et al., 2013; Lin et al., 2014) and quantified the frequency and type of mutation after genome editing.

We found that the S. rosetta genome could be edited in a SpCas9-dependent manner using a variety of templates (Figure 4C, Figure 4—figure supplement 1C). In the presence of the SpCas9, INDEL mutations occurred at a frequency of <0.1%, either in the presence or absence of DNA repair templates (Figure 4C, Figure 4—figure supplement 1D–E). In contrast, the addition of SpCas9 with DNA repair templates that encoded the PTS resulted in PTS mutations at a frequency of 0.79–1.57%, which is at a ten-fold higher frequency than the INDEL mutational frequency (Two-Factor ANOVA: p<10−13). Notably, the total frequency of genome editing (~1%) is on the same order of magnitude as transfection efficiency (~1%; Booth et al., 2018), suggesting that the delivery of SpCas9 and repair templates is the biggest factor limiting genome editing efficiency.

In the absence of SpCas9, we observed two types of SpCas9-independent genome edits. The first was a single INDEL mutation detected in a population of cells transfected with the antisense repair template, and the second was the detection of PTS mutations at an average frequency of ~0.02% upon the delivery of a double-stranded repair template. Although the frequency of these mutations occurred at a rate less than or equal to the detection threshold (~0.02%), meaning that we could not confidently conclude that differences exist between any of the samples (Two-Factor, ANOVA: p<0.27), the presence of these mutations is consistent with a low frequency of endogenous DNA repair. These results also emphasize that the addition of SpCas9 was essential for efficient, targeted mutagenesis (Two-Factor ANOVA: p<0.02 for INDEL mutations and p<10−13 for PTS mutations).

Altogether, our optimization efforts revealed that the delivery of the SpCas9 with a DNA template spanning both sides of the SpCas9 cleavage site introduced PTS mutations at a frequency of ~1%. We recommend using a sense-oriented, single-stranded template for genome editing, as this template led to the highest frequency of PTS mutations (Single-Factor ANOVA, Tukey Multiple Comparison Test: p<0.04) and costs less to synthesize than a double-stranded template.

Discussion

The establishment of Cas9-mediated genome editing advances S. rosetta as a model for illuminating the evolution of development in choanoflagellates and their closest living relatives, animals. We were able to overcome initial failed efforts to establish genome editing in S. rosetta by engineering cycloheximide resistance in rpl36a as a selectable marker, similar to the use of selectable markers during the establishment of genome editing in other eukaryotes, including Fungi (Foster et al., 2018), green algae (Ferenczi et al., 2017), and nematodes (Arribere et al., 2014; Kim et al., 2014a; Ward, 2015). Single-copy ribosomal protein genes like rpl36a offer certain advantages for engineering drug resistance markers with genome editing. First, resistance mutations in ribosomal protein genes have been genetically and biochemically characterized for a variety of drugs in diverse eukaryotes (Sutton et al., 1978; Ares and Bruns, 1978; Kawai et al., 1992; Dehoux et al., 1993; Kondo et al., 1995; Kim et al., 1998; Stevens et al., 2001; Garreau de Loubresse et al., 2014 and references therein). In our case, interpreting alignments among Rpl36a sequences from S. rosetta and organisms in the context of structures of eukaryotic ribosomes provided a starting point for customizing cycloheximide resistant alleles, a strategy that can also extend to other organisms. Second, the specificity of antibiotics that inhibit eukaryotic or prokaryotic translation can be leveraged to tailor genetic tools for particular organisms in complex communities. For example, cycloheximide binds selectively to eukaryotic ribosomes, resulting in the inhibition of S. rosetta growth and not that of its food source: live prey bacteria. Combining these advantages to establish genome editing in S. rosetta provided the first proof-of-principle for genome editing and allowed us to characterize the essential parameters before targeting other genes.

With the newfound potential for reverse genetics, we revisited the genetic basis of multicellular rosette development in S. rosetta. A previous forward genetic screen followed by mapping crosses implicated the C-type lectin gene rosetteless in the regulation of rosette development (Levin et al., 2014). At the time, however, it was not possible to independently corroborate rosetteless function with targeted mutations. In this study, we used genome editing to introduce a premature termination sequence in rosetteless and found that strains with the engineered rosetteless mutation have the same rosette defect phenotype as cells with the original rtlstl1 mutation, demonstrating that rosetteless is necessary for rosette development.

Moving forward, the approach established here promises to accelerate future research on choanoflagellates. It will now be possible for choanoflagellate researchers to introduce candidate mutations into a wild-type strain or correct the causative mutations in the original mutant strain to cleanly test the connection between genotype and phenotype. Similarly, for reverse genetics, the use of different guide RNAs and repair templates through CRISPR/SpCas9 genome editing will allow researchers to engineer multiple independent alleles to better understand the phenotype of targeted gene knockouts. Tools that have been previously used in forward genetic approaches, such as genetic crosses (Levin and King, 2013; Levin et al., 2014; Woznica et al., 2017) and stable transgenesis (Wetzel et al., 2018), may also provide the means to rapidly generate strains with different genetic backgrounds to complement mutants, to reveal epistasis, or simply to eliminate off-target mutations that may arise during genome editing.

Importantly, the establishment of genome editing in S. rosetta offers the first model choanoflagellate to investigate the ancestral and core functions of genes that evolved as integral regulators of animal biology. The S. rosetta genome (Fairclough et al., 2013) encodes receptors for immunity (e.g. Toll-like receptors), intercellular communication (e.g. receptor tyrosine kinases), and adhesion (e.g. cadherins, C-type lectins, and immunoglobulins) as well as master regulators of cell differentiation (e.g. forkhead, homeodomain, p53 and sox transcription factors). As a simple microbial model, S. rosetta now may serve as an accessible system for uncovering the conserved functions of genes that are not as readily studied in the more complex context of multicellular animals. Moreover, S. rosetta is just one tip on the choanoflagellate branch. Recent surveys of 21 choanoflagellate transcriptomes and genomes from uncultured species have revealed that choanoflagellates are at least as genetically diverse as animals (Richter et al., 2018; López-Escardó et al., 2019), with other species retaining genetic pathways or exhibiting behaviors that are not found in S. rosetta (e.g., Marron et al., 2013; Leadbeater, 2015; Brunet et al., 2019). Together with future findings from S. rosetta, we anticipate that the establishment of genome editing in other choanoflagellates will provide an increasingly complete portrait of the last common ancestor of choanoflagellates and animals.

Materials and methods

Culturing choanoflagellates

Strains of S. rosetta were co-cultured with Echinicola pacifica bacteria (Levin and King, 2013); American Type Culture Collection [ATCC], Manassas, VA; Cat. No. PRA-390) in seawater-based media enriched with glycerol, yeast extract, and peptone to promote the growth of E. pacifica that serve as the choanoflagellate prey (Levin and King, 2013; Booth et al., 2018). We further supplemented this media with cereal grass (King et al., 2009; Fairclough et al., 2010; Carolina Biological Supply Company, Burlington, NC; Cat. No. 132375), which we call high nutrient media (Supplementary file 1-Table A), as we noticed that this addition promoted S. rosetta growth to a higher cell density (~107 cells/ml [Figure 3—figure supplement 2A] versus ~106 cells/ml (Booth et al., 2018). To maintain rapidly proliferating cells in an abundance of nutrients, cultures were diluted 1 in 30 daily or 1 in 60 every two days into 6 ml of high nutrient media in 25 cm2 vented culture flasks (Corning, Oneonta, NY, USA; Cat. No. 430639) and incubated at 22°C and 60% relative humidity. To prevent an overgrowth of bacteria when S. rosetta experienced stress, such as after transfections or during clonal isolation, we cultured S. rosetta in low nutrient media, which is 0.375x high nutrient media (Supplementary file 1-Table A).

Purification of outer membrane vesicles that contain RIFs

Rosette inducing factors (RIFs) contained in outer membrane vesicles (OMVs) from Algoriphagus machipongonensis (Alegado et al., 2013; ATCC; Cat. No. BAA-2233) can be used to induce rosette development in S. rosetta (Alegado et al., 2012; Woznica et al., 2016). A machipongonensis OMVs were purified using the protocol in Woznica et al., 2016. In summary, a 200 ml culture of 25x high nutrient media without cereal grass was inoculated from a single colony of A. machipongonensis and grown in a 1 l, baffled flask by shaking at 200 rpm for 3 days at 30°C. Afterwards, the bacteria were pelleted in 50 ml conical tubes by centrifugation at 4500 g and 4°C for 30 min. The pellet was discarded and the supernatant was filtered through a 0.22 µm vacuum filter. Outer membrane vesicles were pelleted from the filtered supernatant by ultracentrifugation at 36,000 g and 4°C in a fixed-angle, Type 45 Ti rotor (Beckman Coulter Life Sciences, Indianapolis, IN; Cat. No. 339160) for 3 hr. After discarding the supernatant, the pellet of outer membrane vesicles, which has an orange hue, was resuspended in a minimal volume of 50 mM HEPES-KOH, pH 7.5 and then incubated at 4°C overnight to fully dissolve the pellet. Last, the pellet was sterile filtered through a 0.45 µm polyvinylidene fluoride syringe filter (Thermo Fisher Scientific, Waltham, MA; Cat. No. 09-720-4) into a sterile tube.

The rosette-inducing activity of the OMVs was tested by serially diluting the purified OMVs in a 24-well plate, with each well containing 0.5 ml of S. rosetta at a concentration of 104 cells/ml and E. pacifica. The cells were incubated with OMVs at 22°C for 48 hr and then fixed with formaldehyde before counting the fraction of cells (n = 100) in rosettes. The dilution of lipids in which half of S. rosetta cells formed rosettes was defined as two unit/ml. All subsequent rosette inductions were performed with OMVs at a final concentration of 10 units/ml.

Genome editing

Below we describe the considerations for the design and preparation of gRNAs and repair oligonucleotides for genome editing. The particular gRNAs and DNA repair template sequences for each given experiment are provided in Supplementary file 1-Table B.

Design and preparation of gRNAs

Upon inspecting the structure of the SpCas9 RNP poised to cleave a DNA target (Jiang et al., 2016), we concluded that sequences adjacent to and upstream of the PAM sequence (5’-NGG-3’), which have been reported to bias SpCas9 activity in vivo (Doench et al., 2014; Wu et al., 2014; Xu et al., 2015; Moreno-Mateos et al., 2015; Horlbeck et al., 2016; Liu et al., 2016; Kaur et al., 2016; Gandhi et al., 2017), likely influence SpCas9 recognition by stabilizing the conformation of the DNA target for cleavage. Therefore, we accounted for biases in SpCas9 recognition by choosing gRNAs sequences that conformed, as much as possible, to the motif 5’-HNNGRSGGH-3’, in which the PAM is underlined, N stands for any base, R stands for purine, S stands for G or C, and H stands for any base except G. This motif was first used to search for suitable targets (Peng and Tarleton, 2015) in cDNA sequences. We reasoned that initially searching for putative targets in cDNA sequences would ensure that gRNAs direct SpCas9 to cleave in protein coding regions of genes, and we later verified that putative gRNAs recognized genomic sequences instead of exon-exon junctions. Finally, we filtered out putative gRNA sequences with potential secondary sequences that can impede gRNA hybridization with DNA targets (Thyme et al., 2016) by evaluating their predicted secondary structures (Lorenz et al., 2011) and keeping gRNAs with predicted folding free energies greater than −1.5 kcal/mol.

gRNAs were prepared by annealing synthetic CRISPR RNA (crRNA) with a synthetic trans-activating CRISPR RNA (tracrRNA). The synthetic crRNA contains the 20 nt sequence for gene targeting and an additional sequence to anneal to the tracrRNA that binds to SpCas9. Alternatively, we also performed genome editing (Figure 2 and Figure 2—figure supplement 2H) with in vitro transcribed gRNAs (see below) that link the crRNA and tracrRNA into one continuous strand (Jinek et al., 2012; Chen et al., 2013), but we found that genome editing with crRNA/tracrRNA was the most time- and cost-effective. To prepare a functional gRNA complex from synthetic RNAs, crRNA and tracrRNA (Integrated DNA Technologies [IDT], Coralville, IA, USA) were resuspended to a final concentration of 200 µM in duplex buffer (30 mM HEPES-KOH, pH 7.5; 100 mM potassium acetate; IDT, Cat. No. 11-01-03-01). Equal volumes of crRNA and tracrRNA stocks were mixed together, incubated at 95°C for 5 min in an aluminum heat block, and then the entire heat block was placed at room temperature to slowly cool the RNA to 25°C. The RNA was stored at −20°C.

In vitro transcription of gRNAs

DNA templates for in vitro transcription of gRNAs were amplified by PCR (Q5 DNA Polymerase; New England Biolabs [NEB], Ipswich, MA, USA, Cat. No. M0491L) from synthetic DNA templates (IDT; Supplementary file 1-Table B) that had a T7 promoter sequence appended to the 5’ end of the guide sequence and a trans-activating CRISPR RNA (tracrRNA) sequence (Chen et al., 2013) at the 3’ end. The purified DNA templates (PCR cleanup kit; Qiagen, Venlo, NLD; Cat. No. 28006) were used to synthesize gRNAs with T7 RNA polymerase (Milligan and Uhlenbeck, 1989) in reactions set up with these components: 40 mM Tris-HCl, pH 8.0; 2.5 mM spermidine; 0.01% (v/v) Triton X-100; 5 mM GTP; 5 mM UTP; 5 mM ATP; 5 mM CTP; 80 mg/ml PEG 8000; 32 mM magnesium chloride; 5 mM dithiothreitol; 10 ng/µl template DNA; 0.5 U/µl SUPERase•In (Thermo Fisher Scientific, Waltham, MA; Cat. No. AM2696); 2 U/µl T7 RNA polymerase (Thermo Fisher Scientific, Cat. No. EP0113); 0.025 mg/ml pyrophosphatase (Thermo Fisher Scientific, Cat. No. EF0221). After incubating the transcription reaction at 37°C for >4 hr, the DNA template was digested with the addition of 0.1 U/µl TURBO DNase (Thermo Fisher Scientific, Cat. No. AM2239). After assessing the transcription products on denaturing, urea-polyacrylamide gel electrophoresis (PAGE), we found that the in vitro transcriptions yielded high amounts of gRNA with few byproducts. Therefore, we used a simplified protocol to purify gRNAs by first removing contaminating nucleotides with a desalting column (GE Healthcare Lifesciences, Pittsburgh, PA; Cat. No. 17085302) to exchange gRNA into 1 mM sodium citrate, pH 6.4. The gRNAs were then precipitated from the solution by adding 0.25 volumes of RNA precipitation buffer (1.2 M sodium acetate, pH 5.2; 4 mM EDTA-NaOH, pH 8.0; 0.04% sodium dodecyl sulfate [SDS]) and 2.5 volumes of ethanol. The precipitated RNA was centrifuged for 60 min at 4°C, washed once with 70% ethanol/water, and finally resuspended in 1 mM sodium citrate, pH 6.4.

After determining the concentration of gRNA, which has a 260 nm extinction coefficient of 1.41 × 106 M−1cm−1, by UV-vis spectroscopy, the gRNA was diluted to a final concentration of 50 µM with 1 mM sodium citrate, pH 6.4. To ensure that the gRNA was properly folded, the gRNA was placed at 95°C for 5 min in an aluminum heat block and then slowly cooled to 25°C by placing the aluminum block on a room temperature bench top. Finally, gRNAs were stored at −20°C.

Design and preparation of repair oligonucleotides

Repair oligonucleotides for generating knockouts were designed by copying the sequence 50 bases upstream and downstream of the SpCas9 cleavage site, which itself is 3 bp upstream of the PAM sequence (for example, 5’-N-cleave-NNNNGG-3’; PAM sequence underlined). A PTS (5’-TTTATTTAATTAAATAAA-3’) was inserted at the cleavage site. Importantly, this sequence has a stop codon (TAA) in each possible reading frame to terminate translate, a polyadenylation sequence (AATAAA) to terminate transcription, and a PacI sequence (5’-TTAATTAA-3’) that can be used to genotype with restriction digests. Moreover, the knockout sequence is palindromic, so it can be inserted in the sense or antisense direction of a gene and still generate a knockout.

Dried oligonucleotides (IDT) were resuspended to a concentration of 250 µM in a buffer of 10 mM HEPES-KOH, pH 7.5, incubated at 55°C for 1 hr, and mixed well by pipetting up and down. The oligonucleotides were stored at −20°C.

SpCas9 expression and purification

For efficient genome editing, we purified or purchased (NEB, Cat. No. M0646M) an engineered version of Streptomyces pyogenes Cas9 that has SV40 nuclear localization sequences (NLS) at the amino- and carobxy- termini of SpCas9. Below we describe a simplified purification procedure based on the previously published work (Jinek et al., 2012).

Vector construction

Using a variation of Gibson cloning (Gibson et al., 2009; NEB, Cat. No. E2621L), we modified a vector (Jinek et al., 2012); Addgene, Watertown, MA; Cat. No. 69090) for expressing SpCas9 in Escherichia coli by inserting tandem SV40 NLSs at the amino terminus of SpCas9. A similar construct (Addgene, Cat. No. 88916) has been shown to increase nuclear localization in mammalian cells (Cong et al., 2013; Staahl et al., 2017). The expression vector has a hexahistidine (His6) tag and maltose binding protein (MBP) fused to the amino terminus of SpCas9. A tobacco etch virus (TEV) protease cleavage site between MBP and the amino terminal nuclear localization sequence on Cas9 facilitates the removal of the His6-MBP tag from SpCas9.

Protein expression

The SpCas9 expression vector was transformed into the BL21 Star (DE3) strain of E. coli (Thermo Fisher Scientific, Cat. No. C601003), and a single colony of the transformants was inoculated into Miller’s LB broth (Atlas, 2010) for growing a starter culture overnight at 37°C with shaking at 200 rpm. 20 ml of the starter culture was diluted into 1 L of M9 medium (Atlas, 2010) and the culture was grown at 37°C with shaking at 250 rpm until the OD600 = 0.60. At that cell density, the culture was shifted to 16°C for 15 min and SpCas9 expression was induced by addition isopropyl β-D-1-thiogalactopyranoside (IPTG) to a final concentration of 0.5 mM. The culture was grown at 16°C overnight and cells were harvested by centrifugation at 4900 g and 4° for 15 min in a swinging bucket centrifuge. The supernatant was discarded and the bacterial pellets were flash frozen in liquid nitrogen and stored at −80°C.

Protein purification

The bacterial pellet was lysed by resuspending 1 g of bacterial pellet in 9 ml of lysis buffer (150 mM potassium phosphate, pH 7.5; 500 mM sodium chloride; 5 mM imidazole, pH 8.0; 1 mM Pefabloc SC; 2 mM 2-mercaptoethanol; 10% [v/v] glycerol; one protease inhibitor tablet [cOmplete, EDTA-free; Roche; Cat. No, 04693132001] per 20 ml of lysate) and lysing with a microfluidizer. The lysate was centrifuged at 30,000 g and 4°C for 30 min to remove insoluble debris.

The supernatant was passed through an Ni-NTA Agarose column (Qiagen, Cat. No. 30210) equilibrated in elution buffer (150 mM potassium phosphate, pH 7.5; 500 mM sodium chloride; 5 mM imidazole, pH 8.0; 2 mM 2-mercaptoethanol; 10% [v/v] glycerol), using 1 ml of resin per 10 grams of bacterial pellet. The column was washed with 10 column volumes (CV) of lysis buffer, 5 CV of elution buffer supplemented with 10 mM imidazole, and 3 CV of wash elution buffer supplemented with 20 mM imidazole. The protein was eluted from the column with 4 CV of elution buffer supplemented with 240 mM imidazole. After determining the protein concentration by UV-vis spectroscopy (using an extinction coefficient of 0.18829 µM−1cm−1 for SpCas9), TEV protease was added at 1:20 molar ratio of TEV protease to SpCas9. SpCas9 supplemented with TEV protease was placed in a dialysis bag with a 3500 dalton molecular weight cut-off (MWCO) and dialyzed against dialysis buffer (100 mM potassium phosphate, pH 7.5; 2 mM 2-mercaptoethanol; 10 mM imidazole; 10% [v/v] glycerol) overnight at 4°C. Afterwards, the dialyzed protein was passed over the Ni-NTA column that had been equilibrated in dialysis buffer to remove His6-MBP tag from SpCas9, which is in the flow through. The flow through was loaded onto HiTrap SP High-Performance (GE Healthcare Lifesciences, Cat. No. 17-1152-01) column that had been equilibrated in dialysis buffer. The column was extensively washed with dialysis buffer prior to eluting the protein in S-elution buffer (500 mM potassium phosphate, pH 7.5; 3 mM dithiothreitol; 0.3 mM EDTA-KOH, pH 8.0; 10% [v/v] glycerol). The purity was evaluated by SDS-PAGE, and the concentration was measured using UV-vis spectroscopy. Afterwards, the purified protein was concentrated with a 100,000 MWCO centrifugal filter to a final concentration of 20–25 µM. The concentrated protein was flash-frozen in liquid nitrogen and stored at −80°C.

Delivery of gene editing cargoes with nucleofection

SpCas9 RNPs and DNA repair templates were delivered into S. rosetta using a modified method for nucleofection (Booth et al., 2018). Here we describe the complete transfection procedure and provide a publicly-accessible protocol specific for genome editing (https://dx.doi.org/10.17504/protocols.io.89fhz3n):

Cell culture

Two days prior to transfection, 120 ml of high nutrient media was inoculated with a culture of S. rosetta/E. pacifica to a final concentration of S. rosetta at 8000 cells/ml. The culture was grown in a 3-layer flask (Corning; Cat. No. 353143), which has a surface area of 525 cm2, at 22°C and 60% humidity.

Assembly of Cas9/gRNA RNP

Before starting transfections, the SpCas9 RNP was assembled. For one reaction, 2 µl of 20 µM SpCas9 (NEB, Cat. No. M0646M or purified as described above) was placed in the bottom of a 0.25 ml PCR tube, and then 2 µl of 100 µM gRNA was slowly pipetted up and down with SpCas9 to gently mix the solutions. The mixed solution was incubated at room temperature for 1 hr, which is roughly the time to complete the preparation of S. rosetta for priming (see below).

Thaw DNA oligonucleotides

Before using oligonucleotides in nucleofections, the oligonucleotides (prepared as above) were incubated at 55°C for 1 hr during the assembly of the SpCas9 RNP to ensure that they were fully dissolved.

Cell washing

S. rosetta cells were first prepared for nucleofection by washing away feeder bacteria. The 120 ml culture started two days previously was homogenized by vigorous shaking and then split into 40 ml aliquots in 50 ml conical tubes. The aliquots were vigorously shaken before centrifuging the cells for 5 min at 2000 g and 22°C in a swinging bucket rotor. All but 2 ml of the supernatant, which remains cloudy with E. pacifica bacteria, was gently pipetted off of the pellet with a serological pipette; a fine tip transfer pipette gently removed the remaining liquid near the pellet. The three cell pellets were resuspended in artificial seawater (ASW; see Supplementary file 1-Table A) for a total volume of 50 ml, combined into one conical tube, and vigorously shaken to homogenize the cells. For a second time, the resuspended cells were centrifuged for 5 min at 2000 g and 22°C. The supernatant was removed as before, the pellet was resuspended in 50 ml of artificial seawater, and the cells were homogenized by vigorous shaking. The cells were centrifuged for a third time for 5 min at 2200 g and 22°C. After removing the supernatant as described above, the cell pellet was resuspended in 400 µl of ASW. The concentration of cells was determined by diluting 2 µl of cells into 196 µl of ASW. The diluted cells were fixed with 2 µl of 37.5% formaldehyde, vortexed, and then pipetted into a fixed chamber slide for counting with Luna-FL automated cell counter (Logos Biosystems, Anyang, KOR; Cat. No. L20001). After determining the cell concentration, the washed S. rosetta cells were diluted to a final concentration of 5 × 107 cell/ml and split into 100 µl aliquots.

Priming

To prime S. rosetta cells for nucleofection, we treated them with a cocktail that removes the extracellular matrix as follows. Aliquots of washed cells were pelleted at 800 g and 22°C for 5 min. The supernatant was gently removed with gel-loading tips and each pellet was resuspended in 100 µl of priming buffer (40 mM HEPES-KOH, pH 7.5; 34 mM lithium citrate; 50 mM l-cysteine; 15% [wt/vol] PEG 8000; and 1 μM papain [Millipore Sigma, St. Louis, MO; Cat. No. P3125-100MG]). After incubating cells for 30–40 min, 10 µl of 50 mg/ml bovine serum albumin was added to each aliquot of primed cells to quench proteolysis from the priming buffer. Finally, the cells were centrifuged at 1250 g and 22°C for 5 min, the supernatant was removed, and the pellet was resuspended in 25 µl of SF Buffer (Lonza, Basel, Switzerland; Cat. No. V4SC-2960). The resuspended cells were stored on ice while preparing nucleofection reagents.

Nucleofection

Each nucleofection reaction was prepared by adding 16 µl of ice-cold SF Buffer to 4 µl of the SpCas9 RNP that was assembled as described above. (For reactions that used two different gRNAs, each gRNA was assembled with SpCas9 separately and 4 µl of each RNP solution was added to SF buffer at this step). 2 µl of the repair oligonucleotide template was added to the SpCas9 RNP diluted in SF buffer. Finally, 2 µl of primed cells were added to the solution with SpCas9 RNP and the repair template. The whole solution, which has a total volume of 24 µl (30 µl for two different SpCas9 RNPs and repair templates), was placed in one well of a 96-well nucleofection plate. The well was pulsed in a Lonza shuttle nucleofector (Lonza, Cat. No. AAF-1002B and AAM-1001S) with the CM156 pulse.

Recovery

Immediately after transfection, 100 µl of ice-cold recovery buffer (10 mM HEPES-KOH, pH 7.5; 0.9 M sorbitol; 8% [wt/vol] PEG 8000) was added to each transfection and gently mixed by firmly tapping the side of the plate or cuvette. After the cells rested in recovery buffer at room-temperature for 5 min, the whole volume of a nucleofection well was transferred to 2 ml of low nutrient media in one well of a six well plate. After 30 min, 10 µl of 10 mg/ml E. pacifica (prepared by resuspending a frozen 10 mg pellet of E. pacifica in ASW) was added to each well and the six well plate was incubated at 22°C and 60% relative humidity for downstream experiments.

Establishing clonal strains

Here we describe how to isolate clones to establish strains. For a complete list of strains used in this study, see Supplementary file 1-Table C.

Cycloheximide selection

One day after transfecting S. rosetta with SpCas9 RNPs repair oligonucleotides for rpl36aP56Q (Figure 2), 10 µl of 1 µg/ml cycloheximide was added to a 2 ml culture of transfected cells. The cells were incubated with cycloheximide for 5 days prior to genotyping and clonal isolation.

Clonal isolation

To prepare cells for clonal isolation by limiting dilution, the initial cell density was determined by fixing a 200 µl sample of cells with 5 µl of 37.5% (w/v) formaldehyde and then by counting the fixed cells with a hemocytometer (Hausser Scientific, Horsham, PA; Cat. No. 1475) or Luna-FL automated cell counter. The cells were by diluted to a final concentration of 3 cells/ml in low nutrient sea water and then distributed in a 96 well plate with 100 µl/well. Thus, the mean frequency of finding a cell in each well is 0.3, which, according to a Poisson distribution, corresponds to a > 99% probability that a given well with S. rosetta was founded from a single cell. Cells were grown in a 96 well plate for 5–7 days at 22°C and 60% relative humidity. Lastly, the plate was screened using phase contrast microscopy to identify wells with S. rosetta. Finally, larger cultures of high nutrient media were inoculated with clonal isolates to establish strains.

Genotyping by Sanger sequencing (Figures 23Figure 2—figure supplement 2; Figure 3—figure supplement 1)

Cells were harvested for genotyping by centrifuging 1 ml of cells at 4250 g and 22°C for 5 min. The supernatant was removed with a fine tip transfer pipette. (Optional: To remove lingering DNA from cells that die in the course of cycloheximide selection, the pellet was resuspended in 50 µl DNase buffer [10 mM Tris-HCl, pH 7.6; 1 M sorbitol; 2.5 mM magnesium chloride; 0.5 mM calcium chloride; 0.1 U/µl Turbo DNase (Thermo Fisher Scientific; Cat. No. AM2238)] and incubated at room temperature for 30 min. Afterwards, the cells were centrifuged as before, discarding the supernatant.) The cell pellet was dissolved in 100 µl of DNAzol Direct (20 mM potassium hydroxide, 60% [w/v] PEG 200, pH 13.3–13.5; Molecular Research Center, Inc, Cincinnati, OH; Cat. No. DN131). 5 µl of the dissolved cells were added to a 50 µl PCR reaction (Q5 DNA polymerase, NEB; see Supplementary file 1-Table B for primer sequences) and amplified with 36 rounds of thermal cycling. Samples dissolved in DNAzol direct can be directly added to PCR reactions because the pH of DNAzol Direct dramatically drops upon a ten-fold or greater dilution (Chomczynski and Rymaszewski, 2006). The PCR product was purified using magnetic beads (Oberacker et al., 2019) and then submitted for Sanger sequencing (UC Berkeley DNA Sequencing facility).

Cell proliferation assays (Figure 3—figure supplement 2)

We characterized the cell proliferation rates of S. rosetta strains by monitoring the concentration of cells over time to fit logistic growth curves and determine the doubling time. Cell proliferation assays started by diluting cultures to a concentration of 104 cells/ml in high nutrient media and then distributing 0.5 ml of culture into each well of a 24 well plate. Every ~12 hr, the entire contents of one well were thoroughly homogenized by pipetting up and down and then transferred to a 1.5 ml conical tube. Three independent wells were taken for triplicate measures of cell concentration at every time point. The cells were fixed with 20 µl of 37.5% formaldehyde and mixed by vortexing. The fixed cells were stored at 4°C until the sample was used for determining the cell concentration after the full growth course.

The cell concentration was determined by counting the number of cells in a fixed-volume imaging chamber. In detail, the fixed cells were thoroughly homogenized by vortexing for 10 s and then pipetted up and down before transfer into a chamber of a Smart Slide (ibidi USA, Inc, Firchburg, WI; Cat. No. 80816) that has a fixed height of 200 µm. After allowing cells to settle to the bottom for 5 min, each chamber was imaged on an Axio Observer.Z1/7 Widefield microscope (Carl Zeiss AG, Oberkochen, Germany) and recorded with a Hamamatsu Orca-Flash 4.0 LT CMOS Digital Camera (Hamamatsu Photonics, Hamamatsu City, Japan) using either phase contrast for 10x (objective), in-focus images or a 20x brightfield image with a 1 µm overfocus to make the cells appear dark on a light gray background. The volume for each image was calculated from the image area, which was calibrated on the microscope, and the fixed height of the imaging chamber: 3.54 × 10−4 ml when imaged at 10x and 8.86 × 10−5 ml when imaged at 20x. Using automated particle detection in Fiji (Schindelin et al., 2012), cells were counted in each 20x image by thresholding the image to make cells appear as black spots on a white background and then each circular spot was counted with the ‘Analyze Particles’ function. For early time points with fewer numbers of cells, we manually counted cells in 10x images to include more cells in a greater area for a more accurate count.

Each time course was fit by least absolute deviation curve fitting to the logistic equation:

Pt=KP0((K–P0)e-tT)+P0,

where Pt is the cell density at time (t), K is the carrying capacity, P0 is the initial cell density, and T is the doubling time.

Live-cell microscopy (Figures 1 and 3)

Glass-bottomed dishes (World Precision Instruments, Sarasota, FL; Cat. No. FD35-100) were prepared for imaging by covering the bottom with 500 µl of 0.1 mg/ml poly-D-lysine (Millipore Sigma; Cat. No. P6407-5MG) and incubating for 15 min. The poly-D-lysine was removed and then the dish was washed three times with 500 µl of ASW. Cells were placed into the dish by gently pipetting 500 µl of cells with a wide pipette tip.

Differential interference contrast (DIC) microscopy images were captured with a Zeiss Axio Observer.Z1/7 Widefield microscope with a Hamamatsu Orca-Flash 4.0 LT CMOS Digital Camera (Hamamatsu Photonics, Hamamatsu City, Japan) and 40×/NA 1.1 LD C-Apochromatic water immersion, 63×/NA1.40 Plan-Apochromatic oil immersion, or 100 × NA 1.40 Plan-Apochromatic oil immersion objectives (Zeiss).

Immunofluorescent staining and imaging (Figure 1 and Figure 3—figure supplement 1)

200 µl of S. rosetta cells were gently pipetted into chamber slides (ibidi; Cat. No.80826) coated with poly-D-lysine (see live cell imaging for coating procedure). Importantly, cells were pipetted using a tip that had been trimmed to create a larger bore for reducing shear forces. The cells were incubated on the coverslip for 30 min to allow the cells to adsorb to the surface.

Cells were fixed by adding 200 µl of 6% acetone in cytoskeleton buffer (10 mM MES, pH 6.1; 138 KCl, 3 mM MgCl2; 2 mM ethylene glycol-bis(β-aminoethylether)-N,N,N’,N’-tetraacetic acid [EGTA]; 600 mM sucrose) and then incubated for 10 min at room temperature. After removing 200 µl from the chamber, 200 µl of 4% formaldehyde diluted in cytoskeleton buffer was added to the chamber and then incubated for 15 min at room temperature. Last, the coverslip was gently washed three times with 200 µl of cytoskeleton buffer.

Cells were permeabilized by washing the coverslip once with 200 µl of permeabilization buffer (PEM [100 mM PIPES, pH 6.95; 2 mM EGTA; 1 mM MgCl2] with 1% [wt/vol] bovine serum albumin (BSA)-fraction V and 0.3% [vol/vol] Triton X-100) and then incubated for 60 min upon a second addition of permeabilization buffer. Afterwards, 200 µl of the permeabilization buffer was replaced with primary antibodies diluted in permeabilization buffer, 1 µg/ml mouse DM1A anti-α-tubulin antibody (Thermo Fisher Scientific; Cat. No. 62204) and 1:200 rabbit anti-Rosetteless (Levin et al., 2014). After the samples were incubated in primary antibody for 2 hr, the chamber was gently washed three times with 200 µl permeabilization buffer. Next, 200 µl of permeabilization buffer with 10 µg/ml donkey anti-mouse immunoglobulin G–AlexaFluor568 (Thermo Fisher Scientific; Cat. No. A10037), donkey anti-rabbit immunoglobulin G–AlexaFluor647 (Thermo Fisher Scientific; Cat. No. A32795), 10 μg/ml Hoechst 33342 (Thermo Fisher Scientific; Cat. No. H3570), and 4 U/ml Phalloidin-AlexaFluor488 (Thermo Fisher Scientific; Cat. No. A12379) was added to the chamber and then incubated for 40 min. Afterwards, the chamber was washed five times with PEM.

Immunostained samples were imaged on a Zeiss Axio Observer LSM 880 with an Fast Airyscan detector and a 40x/NA1.1 Plan-Apochromatic water immersion objective (Zeiss) by frame scanning in the superresolution mode with the following settings: 50 × 50 nm pixel size; 220 nm z-step; 0.73 μs/pixel dwell time; 750 gain; 488/561/633 nm multiple beam splitter; 633 nm laser operating at 16% power with a 570-620/645 bandpass/longpass filter; 561 nm laser operating at 16% power with a 570-620/645 bandpass/longpass filter; 488 nm laser operating at 14% power with a 420-580/495-550 bandpass filters.

Next-generation sequencing (Figure 4 and Figure 4—figure supplement 1)

We performed deep sequencing of edited cells to quantify the efficiency of genome editing (Figure 3 and Figure 4—figure supplement 1). The transfections were performed as above with the following modifications: Two transfections were conducted for each condition and combined into 1 ml of low nutrient media (see Supplementary file 1-Table A for recipe). One day after transfection, the cells were harvested and dissolved in 50 µl of DNAzol direct (Molecular Research Center, Inc). Three independent transfections performed on different days provided replicate measures for each condition (Figure 3C and Figure 4—figure supplement 1B).

To preserve the diversity of sequences during PCR, six parallel PCR reactions (Q5 DNA polymerase, NEB) were set up with 30 µl of sample. The target locus was amplified in 15 thermal cycles, purified using magnetic beads (UC Berkeley DNA sequencing facility), and pooled together in a total volume of 180 µl. Importantly, the primers for this first round of PCR had a randomized 6-nucleotide sequence in the forward primer to distinguish PCR duplicates (primer sequences in Supplementary file 1-Table D), which allowed us to identify 4096 unique sequences. Extending this randomized sequence would result in higher sensitivity.

A second round of PCR was performed to attach adapters for Illumina sequencing. For these reactions, four replicate PCR reactions were set up with 25 µl of the purified products from the first round of PCR and primers with sequencing adapters and unique sample barcodes were attached in five thermal cycles. Afterward, the PCR products were purified using magnetic beads (UC Berkeley DNA sequencing facility) and their quality was assessed on a Bioanalyzer (UC Berkeley Functional Genomic Laboratory). The bioanalyzer traces showed that the amplicons were the proper size, yet a similar concentration of residual PCR primers remained in each sample. After quantifying DNA (Qubit; Thermo Fisher Scientific) and pooling equimolar amounts of sample, the amplicons were further purified with magnetic beads (UC Berkeley Functional Genomics Lab) and the concentration was verified using qPCR. The library was sequenced on a miSeq sequencer (Illumina, San Diego, CA) using the V3 chemistry (Illumina) for 300 rounds of paired-end sequencing, which gives up to 600 bases of sequence per sample. After sequencing the samples were separated based on their unique barcodes for further analysis of individual samples.

The editing efficiency for each sample was calculated from high-quality, unique reads. First, we used tools from the Galaxy Project (Afgan et al., 2018) to join paired-end reads into one read (fastq-join) and then retain high quality sequences (Galaxy–Filter by quality: 100% of bases with quality scores ≥ 30) with 50 bp of expected sequence from the rosetteless locus on the 5’ and 3’ ends of the amplicon (Galaxy–Cutadapt: 50 base overlap with 0.1 maximum error rate). Next, the reads were filtered for unique instances of the randomized barcode sequence from the first round PCR primers (Galaxy–Unique). We then combined matching amplicon sequences into unique bins, while counting the number of sequences in each bin (Galaxy–Collapse). The FASTA file of aligned sequences (Galaxy–ClustalW) from this initial processing was further analyzed using a custom script (Source code 1). To quantify the instances of template-mediated repair, we counted the number of sequences that had the PTS. Untemplated mutations were counted from insertions and deletions larger than 1 bp. The remaining sequences, those that were the same length as the rosetteless locus but did not have the exact amplicon sequence, were counted as single-nucleotide polymorphisms (SNPs). The outputs from each category were also visually inspected to reclassify incorrect calls, such as a few instances of template-directed repair in the insertions and deletion category due to mutations in the PTS. The SNP data revealed that conditions with or without the addition of SpCas9 or repair templates had same SNP frequency. Therefore, we only compared reads categorized as template-mediated repair or untemplated insertions and deletions. Importantly, this analysis may overlook some instances where DNA repair resulted in sequences that maintained the original sequence or introduced SNPs, thereby underestimating the efficiency of non-templated repair.

Statistical analyses

Differences between experimental conditions were evaluated by comparing the variance between experimental groups, each with triplicate and biologically independent measures of the response variable, with Single- or Multi-Factor Analysis of Variance (ANOVA) (Zar, 1999). If the groups were very unlikely to have the same response (p<0.05), the source of variation from Multi-Factor ANOVA was identified by performing Single-Factor ANOVA on subsets of the original experimental groups. Finally, a Tukey Multiple Comparison Test was performed after Single-Factor ANOVA showed that experimental groups were unlikely to have the same response. The specific analyses for each experiment are detailed in the following text:

Figure 2: A Two-Factor ANOVA was performed to test the effect of genotype on cell density in a serial dilution of cycloheximide, which was measured in three biological replicates. This test showed that the growth of the rpl36aP56Q strain in cycloheximide was very unlikely to be the same as the wild-type strain (p<10−20).

Figure 4: Four sets of Two-Factor ANOVA tested the effect of repair template and SpCas9 cleavage on the percentage of PTS or INDEL mutations, as measured by deep sequencing of the rosetteless surrounding the position supercontig 8: 429,354 nt in three biological replicates of each condition. In the first and second sets of Two-Factor ANOVA, the outcomes of PTS mutations were separately analyzed from INDEL mutations, and within each set, we tested the effect of adding SpCas9 and repair templates on the mutation frequency, which showed that the frequencies of PTS or INDEL mutations were very unlikely to be the same in the presence of SpCas9 versus its absence (p<10−15 for PTS mutations and p<0.02 for INDEL mutations). In the third and fourth sets of Two-Factor ANOVA, the editing outcomes in the absence of SpCas9 were separately analyzed from the outcomes in the presence of SpCas9 as we tested the effects of mutation type (PTS versus INDEL) and the addition of repair templates on the mutation frequency. Importantly, the editing outcomes in the absence of SpCas9 were the same across all conditions (p<0.34), indicating that there is virtually no mutagenesis in the absence of SpCas9. Furthermore, in the presence of SpCas9, the editing outcomes across all repair templates were very improbably the same (p<10−15). A follow-up, Single-Factor ANOVA that tested the effect of adding repair templates on the frequency of INDEL mutations in the presence of SpCas9 showed no differences between repair template groups (p<0.60). Whereas, the same analysis for PTS mutations in the presence of SpCas9 showed that editing frequencies were very unlikely to be the same (p<10−4) across different repair templates. Therefore, multiple comparisons with the Tukey method were performed on all of the pairwise combinations of the repair template groups in the presence of SpCas9. These multiple comparisons showed that none of the repair templates resulted in the same frequency of PTS mutations. For example, the comparison that was most likely to have the same repair outcome was the single-stranded, sense oligonucleotide versus the duplex repair template (p<0.04); whereas, all other comparisons between repair templates were much less likely to be the same (0.0001 < p < 0.004). Among these repair templates, the single-stranded, sense oligonucleotide resulted in the highest average frequency of PTS mutations (1.57%).

Figure 3—figure supplement 1: After finding that both genotype and RIF induction affected the percentage of cells in rosettes with a Two-Factor ANOVA, in which three biological replicates were used for each group, a Single-Factor ANOVA followed by a Tukey Multiple Comparison Test was performed on groups that had been induced with RIFs to test the effect of genotype on rosette development. These tests showed that wild-type and rpl36P56Q strains were similarly induced (p<0.84), which was also the case for rtlsPTS1 rpl36P56Q and rtlstl1 (p<0.99), and rosette development was very unlikely to be the same between wild-type or rpl36P56Q strains versus rtlsPTS1 rpl36P56Q or rtlstl1 strains (p<10−10). Note that such tests were not performed for the uninduced samples because the fraction of cells in rosettes was zero for all three measurements of each genotype.

Figure 3—figure supplement 2: The effect of genotype on growth rate was tested with a SIngle-Factor ANOVA using three biological replicate measures of the doubling time (T) for each strain, which showed that the doubling-times were probably the same among all strains (p<0.08).

Figure 4—figure supplement 1: The outcomes of PTS mutations were separately analyzed from INDEL mutations in two sets of Single-Factor ANOVA to test the effect of repair template in the presence of SpCas9 on mutation frequency. The frequency of INDEL mutations was most likely the same between all repair templates (p<0.17). In contrast, the frequency of PTS mutations was improbably the same between repair templates (p<10−6), yet multiple comparisons with the Tukey test showed that there was no difference between the duplex templates (0.50 < p < 0.99) just as there were no differences between the single-stranded templates with truncated homology arms (0.99 < p < 1.00). Thus, the major effect on PTS mutation frequency was between the duplex templates versus the single-stranded, truncated templates (p<0.02).

Acknowledgements

We thank the following people for insights and support that helped advance this work: Heather Szmidt-Middleton, Laura Wetzel, Monika Sigg, Lily Helfrich, Arielle Woznica, Sabrina Sun, Tara DeBoer, Jorge Santiago-Ortiz, and Kayley Hake helped with and provided feedback on early experiments. Through the Gordon and Betty Moore Foundation Marine Microbiology Initiative (GBMF MMI), Manny Ares first brought cycloheximide resistance alleles to our attention. The following people stimulated helpful discussions: Fyodor Urnov, Stephen Floor, James Gagnon, Chris Richardson, Jacob Corn, David Schaffer, Niren Murthy, Craig Miller, and members of the King Lab. Brett Stahl, Shana McDevitt, the UC Berkeley Vincent Coates Sequencing Center, and the UC Berkeley Functional Genomics Laboratory provided help with sequencing. This work was supported, in part, by a GBMF MMI Experimental Model Systems grant. DSB was supported through a Simons Foundation Fellowship from the Jane Coffin Childs Memorial Fund for Medical Research.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

David S Booth, Email: David.Booth@ucsf.edu.

Alejandro Sánchez Alvarado, Stowers Institute for Medical Research, United States.

Patricia J Wittkopp, University of Michigan, United States.

Funding Information

This paper was supported by the following grants:

  • Gordon and Betty Moore Foundation MMI Experimental Model Systems grant to David S Booth, Nicole King.

  • Howard Hughes Medical Institute to Nicole King.

  • Jane Coffin Childs Memorial Fund for Medical Research Simons Foundation Fellowship to David S Booth.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Resources, Data curation, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing - original draft, Project administration, Writing - review and editing.

Conceptualization, Supervision, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Source code 1. Quantification of DNA repair outcomes.

BASH script for quantifying the frequency of repair outcomes from deep sequencing data that were preprocessed and aligned in a Galaxy server (Afgan et al., 2018).

elife-56193-code1.sc (4.5KB, sc)
Supplementary file 1. Tables of critical resources.

Table A: Media recipes for making artificial seawater (Hallegraeff et al., 2004; Skelton et al., 2009), high nutrient media (modified from King et al., 2009; Levin and King, 2013; Booth et al., 2018), and low nutrient media. Table B: Oligonucleotide sequences for gRNAs, repair oligonucleotides, and primers that were used to construct and to validate genome edited strains. Table C: S. rosetta strains Genotypes and sources of S. rosetta strains used in this study. Table D: Deep sequencing library primers Sequences for primers (adapted from Lin et al., 2014 used to generate libraries for deep sequencing (Figure 4 and S5)

elife-56193-supp1.xlsx (24.8KB, xlsx)
Transparent reporting form

Data availability

All data generated are included in the manuscript. Additionally, we have posted a protocol at https://www.protocols.io: https://doi.org/10.17504/protocols.io.89fhz3n.

References

  1. Abedin M, King N. The premetazoan ancestry of cadherins. Science. 2008;319:946–948. doi: 10.1126/science.1151084. [DOI] [PubMed] [Google Scholar]
  2. Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Cech M, Chilton J, Clements D, Coraor N, Grüning BA, Guerler A, Hillman-Jackson J, Hiltemann S, Jalili V, Rasche H, Soranzo N, Goecks J, Taylor J, Nekrutenko A, Blankenberg D. The galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Research. 2018;46:W537–W544. doi: 10.1093/nar/gky379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Alegado RA, Brown LW, Cao S, Dermenjian RK, Zuzow R, Fairclough SR, Clardy J, King N. A bacterial sulfonolipid triggers multicellular development in the closest living relatives of animals. eLife. 2012;1:e00013. doi: 10.7554/eLife.00013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Alegado RA, Grabenstatter JD, Zuzow R, Morris A, Huang SY, Summons RE, King N. Algoriphagus machipongonensis sp. nov., co-isolated with a colonial choanoflagellate. International Journal of Systematic and Evolutionary Microbiology. 2013;63:163–168. doi: 10.1099/ijs.0.038646-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Amacher JF, Hobbs HT, Cantor AC, Shah L, Rivero MJ, Mulchand SA, Kuriyan J. Phosphorylation control of the ubiquitin ligase cbl is conserved in choanoflagellates. Protein Science. 2018;27:923–932. doi: 10.1002/pro.3397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ares M, Bruns PJ. Isolation and genetic characterization of a mutation affecting ribosomal resistance to cycloheximide in Tetrahymena. Genetics. 1978;90:463–474. doi: 10.1093/genetics/90.3.463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Arribere JA, Bell RT, Fu BX, Artiles KL, Hartman PS, Fire AZ. Efficient marker-free recovery of custom genetic modifications with CRISPR/Cas9 in Caenorhabditis elegans. Genetics. 2014;198:837–846. doi: 10.1534/genetics.114.169730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Atlas RM. Handbook of Microbiological Media. CRC Press; 2010. [Google Scholar]
  9. Bae J-H, Sung BH, Sohn J-H. Site saturation mutagenesis of ribosomal protein L42 at 56th residue and application as a consecutive selection marker for cycloheximide resistance in yeast. FEMS Microbiology Letters. 2018;365:fny066. doi: 10.1093/femsle/fny066. [DOI] [PubMed] [Google Scholar]
  10. Bhattacharyya M, Stratton MM, Going CC, McSpadden ED, Huang Y, Susa AC, Elleman A, Cao YM, Pappireddi N, Burkhardt P, Gee CL, Barros T, Schulman H, Williams ER, Kuriyan J. Molecular mechanism of activation-triggered subunit exchange in Ca(2+)/calmodulin-dependent protein kinase II. eLife. 2016;5:e13405. doi: 10.7554/eLife.13405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bibikova M, Carroll D, Segal DJ, Trautman JK, Smith J, Kim YG, Chandrasegaran S. Stimulation of homologous recombination through targeted cleavage by chimeric nucleases. Molecular and Cellular Biology. 2001;21:289–297. doi: 10.1128/MCB.21.1.289-297.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Booth DS, Szmidt-Middleton H, King N. Transfection of choanoflagellates illuminates their cell biology and the ancestry of animal septins. Molecular Biology of the Cell. 2018;29:3026–3038. doi: 10.1091/mbc.E18-08-0514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Brinkman EK, Kousholt AN, Harmsen T, Leemans C, Chen T, Jonkers J, van Steensel B. Easy quantification of template-directed CRISPR/Cas9 editing. Nucleic Acids Research. 2018;46:e58. doi: 10.1093/nar/gky164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Brunet T, Larson BT, Linden TA, Vermeij MJA, McDonald K, King N. Light-regulated collective contractility in a multicellular choanoflagellate. Science. 2019;366:326–334. doi: 10.1126/science.aay2346. [DOI] [PubMed] [Google Scholar]
  15. Brunet T, King N. The origin of animal multicellularity and cell differentiation. Developmental Cell. 2017;43:124–140. doi: 10.1016/j.devcel.2017.09.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Burger G, Forget L, Zhu Y, Gray MW, Lang BF. Unique mitochondrial genome architecture in unicellular relatives of animals. PNAS. 2003;100:892–897. doi: 10.1073/pnas.0336115100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Carr M, Leadbeater BS, Hassan R, Nelson M, Baldauf SL. Molecular phylogeny of choanoflagellates, the sister group to metazoa. PNAS. 2008;105:16641–16646. doi: 10.1073/pnas.0801667105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Chen B, Gilbert LA, Cimini BA, Schnitzbauer J, Zhang W, Li GW, Park J, Blackburn EH, Weissman JS, Qi LS, Huang B. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell. 2013;155:1479–1491. doi: 10.1016/j.cell.2013.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chomczynski P, Rymaszewski M. Alkaline polyethylene glycol-based method for direct PCR from Bacteria, eukaryotic tissue samples, and whole blood. BioTechniques. 2006;40:454–458. doi: 10.2144/000112149. [DOI] [PubMed] [Google Scholar]
  20. Choulika A, Perrin A, Dujon B, Nicolas JF. Induction of homologous recombination in mammalian chromosomes by using the I-SceI system of Saccharomyces cerevisiae. Molecular and Cellular Biology. 1995;15:1968–1973. doi: 10.1128/MCB.15.4.1968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Colgren J, Nichols SA. The significance of sponges for comparative studies of developmental evolution. Wiley Interdisciplinary Reviews. Developmental Biology. 2020;9:e359. doi: 10.1002/wdev.359. [DOI] [PubMed] [Google Scholar]
  22. Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, Hsu PD, Wu X, Jiang W, Marraffini LA, Zhang F. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Cummings RD, McEver RP. Essentials of Glycobiology. Cold Spring Harbor: Cold Spring Harbor Laboratory Press; 2015. [Google Scholar]
  24. Dayel MJ, Alegado RA, Fairclough SR, Levin TC, Nichols SA, McDonald K, King N. Cell differentiation and morphogenesis in the colony-forming choanoflagellate Salpingoeca rosetta. Developmental Biology. 2011;357:73–82. doi: 10.1016/j.ydbio.2011.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. de Mendoza A, Sebé-Pedrós A, Šestak MS, Matejcic M, Torruella G, Domazet-Loso T, Ruiz-Trillo I. Transcription factor evolution in eukaryotes and the assembly of the regulatory toolkit in multicellular lineages. PNAS. 2013;110:E4858–E4866. doi: 10.1073/pnas.1311818110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Dehoux P, Davies J, Cannon M. Natural cycloheximide resistance in yeast the role of ribosomal protein L41. European Journal of Biochemistry. 1993;213:841–848. doi: 10.1111/j.1432-1033.1993.tb17827.x. [DOI] [PubMed] [Google Scholar]
  27. Doench JG, Hartenian E, Graham DB, Tothova Z, Hegde M, Smith I, Sullender M, Ebert BL, Xavier RJ, Root DE. Rational design of highly active sgRNAs for CRISPR-Cas9–mediated gene inactivation. Nature Biotechnology. 2014;32:1262–1267. doi: 10.1038/nbt.3026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Drickamer K, Fadden AJ. Genomic analysis of C-type lectins. Biochemical Society Symposia; 2002. pp. 59–72. [DOI] [PubMed] [Google Scholar]
  29. Fairclough SR, Dayel MJ, King N. Multicellular development in a choanoflagellate. Current Biology. 2010;20:R875–R876. doi: 10.1016/j.cub.2010.09.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Fairclough SR, Chen Z, Kramer E, Zeng Q, Young S, Robertson HM, Begovic E, Richter DJ, Russ C, Westbrook MJ, Manning G, Lang BF, Haas B, Nusbaum C, King N. Premetazoan genome evolution and the regulation of cell differentiation in the choanoflagellate Salpingoeca rosetta. Genome Biology. 2013;14:R15. doi: 10.1186/gb-2013-14-2-r15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ferenczi A, Pyott DE, Xipnitou A, Molnar A. Efficient targeted DNA editing and replacement in Chlamydomonas reinhardtii using Cpf1 ribonucleoproteins and single-stranded DNA. PNAS. 2017;114:13567–13572. doi: 10.1073/pnas.1710597114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Foster AJ, Martin-Urdiroz M, Yan X, Wright HS, Soanes DM, Talbot NJ. CRISPR-Cas9 ribonucleoprotein-mediated co-editing and counterselection in the rice blast fungus. Scientific Reports. 2018;8:14355. doi: 10.1038/s41598-018-32702-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Gandhi S, Haeussler M, Razy-Krajka F, Christiaen L, Stolfi A. Evaluation and rational design of guide RNAs for efficient CRISPR/Cas9-mediated mutagenesis in Ciona. Developmental Biology. 2017;425:8–20. doi: 10.1016/j.ydbio.2017.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Garreau de Loubresse N, Prokhorova I, Holtkamp W, Rodnina MV, Yusupova G, Yusupov M. Structural basis for the inhibition of the eukaryotic ribosome. Nature. 2014;513:517–522. doi: 10.1038/nature13737. [DOI] [PubMed] [Google Scholar]
  35. Gibson DG, Young L, Chuang RY, Venter JC, Hutchison CA, Smith HO. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nature Methods. 2009;6:343–345. doi: 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
  36. Gilles AF, Averof M. Functional genetics for all: engineered nucleases, CRISPR and the gene editing revolution. EvoDevo. 2014;5:43. doi: 10.1186/2041-9139-5-43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Goddard TD, Huang CC, Ferrin TE. Software extensions to UCSF chimera for interactive visualization of large molecular assemblies. Structure. 2005;13:473–482. doi: 10.1016/j.str.2005.01.006. [DOI] [PubMed] [Google Scholar]
  38. Grau-Bové X, Torruella G, Donachie S, Suga H, Leonard G, Richards TA, Ruiz-Trillo I. Dynamics of genomic innovation in the unicellular ancestry of animals. eLife. 2017;6:e26036. doi: 10.7554/eLife.26036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hallegraeff GM, Anderson DM, Cembella AD, Enevoldsen HO. Manual on Harmful Marine Microalgae. UNESCO; 2004. [Google Scholar]
  40. Han HA, Pang JKS, Soh BS. Mitigating off-target effects in CRISPR/Cas9-mediated in vivo gene editing. Journal of Molecular Medicine. 2020;98:615–632. doi: 10.1007/s00109-020-01893-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Harrison MM, Jenkins BV, O'Connor-Giles KM, Wildonger J. A CRISPR view of development. Genes & Development. 2014;28:1859–1872. doi: 10.1101/gad.248252.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Hoffmeyer TT, Burkhardt P. Choanoflagellate models - Monosiga brevicollis and Salpingoeca rosetta. Current Opinion in Genetics & Development. 2016;39:42–47. doi: 10.1016/j.gde.2016.05.016. [DOI] [PubMed] [Google Scholar]
  43. Horlbeck MA, Gilbert LA, Villalta JE, Adamson B, Pak RA, Chen Y, Fields AP, Park CY, Corn JE, Kampmann M, Weissman JS. Compact and highly active next-generation libraries for CRISPR-mediated gene repression and activation. eLife. 2016;5:e19760. doi: 10.7554/eLife.19760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Jacobs JZ, Ciccaglione KM, Tournier V, Zaratiegui M. Implementation of the CRISPR-Cas9 system in fission yeast. Nature Communications. 2014;5:1–5. doi: 10.1038/ncomms6344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Jiang W, Brueggeman AJ, Horken KM, Plucinak TM, Weeks DP. Successful transient expression of Cas9 and single guide RNA genes in Chlamydomonas reinhardtii. Eukaryotic Cell. 2014;13:1465–1469. doi: 10.1128/EC.00213-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Jiang F, Taylor DW, Chen JS, Kornfeld JE, Zhou K, Thompson AJ, Nogales E, Doudna JA. Structures of a CRISPR-Cas9 R-loop complex primed for DNA cleavage. Science. 2016;351:867–871. doi: 10.1126/science.aad8282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Jinek M, East A, Cheng A, Lin S, Ma E, Doudna J. RNA-programmed genome editing in human cells. eLife. 2013;2:e00471. doi: 10.7554/eLife.00471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kaulich M, Lee YJ, Lönn P, Springer AD, Meade BR, Dowdy SF. Efficient CRISPR-rAAV engineering of endogenous genes to study protein function by allele-specific RNAi. Nucleic Acids Research. 2015;43:e45. doi: 10.1093/nar/gku1403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kaur K, Gupta AK, Rajput A, Kumar M. ge-CRISPR - An integrated pipeline for the prediction and analysis of sgRNAs genome editing efficiency for CRISPR/Cas system. Scientific Reports. 2016;6:30870. doi: 10.1038/srep30870. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Kawai S, Murao S, Mochizuki M, Shibuya I, Yano K, Takagi M. Drastic alteration of cycloheximide sensitivity by substitution of one amino acid in the L41 ribosomal protein of yeasts. Journal of Bacteriology. 1992;174:254–262. doi: 10.1128/JB.174.1.254-262.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Kim IG, Nam SK, Sohn JH, Rhee SK, Lee SH, Choi ES. Cloning of the ribosomal protein L41 gene of Phaffia rhodozyma and its use a drug resistance marker for transformation. Applied and Environmental Microbiology. 1998;64:1947–1949. doi: 10.1128/AEM.64.5.1947-1949.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kim H, Ishidate T, Ghanta KS, Seth M, Conte D, Shirayama M, Mello CC. A co-CRISPR strategy for efficient genome editing in Caenorhabditis elegans. Genetics. 2014a;197:1069–1080. doi: 10.1534/genetics.114.166389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Kim S, Kim D, Cho SW, Kim J, Kim JS. Highly efficient RNA-guided genome editing in human cells via delivery of purified Cas9 ribonucleoproteins. Genome Research. 2014b;24:1012–1019. doi: 10.1101/gr.171322.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. King N, Hittinger CT, Carroll SB. Evolution of key cell signaling and adhesion protein families predates animal origins. Science. 2003;301:361–363. doi: 10.1126/science.1083853. [DOI] [PubMed] [Google Scholar]
  56. King N. The unicellular ancestry of animal development. Developmental Cell. 2004;7:313–325. doi: 10.1016/j.devcel.2004.08.010. [DOI] [PubMed] [Google Scholar]
  57. King N, Westbrook MJ, Young SL, Kuo A, Abedin M, Chapman J, Fairclough S, Hellsten U, Isogai Y, Letunic I, Marr M, Pincus D, Putnam N, Rokas A, Wright KJ, Zuzow R, Dirks W, Good M, Goodstein D, Lemons D, Li W, Lyons JB, Morris A, Nichols S, Richter DJ, Salamov A, Sequencing JG, Bork P, Lim WA, Manning G, Miller WT, McGinnis W, Shapiro H, Tjian R, Grigoriev IV, Rokhsar D. The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans. Nature. 2008;451:783–788. doi: 10.1038/nature06617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. King N, Young SL, Abedin M, Carr M, Leadbeater BS. Starting and maintaining Monosiga brevicollis cultures. Cold Spring Harbor Protocols. 2009;2009:pdb.prot5148. doi: 10.1101/pdb.prot5148. [DOI] [PubMed] [Google Scholar]
  59. Kondo K, Saito T, Kajiwara S, Takagi M, Misawa N. A transformation system for the yeast candida utilis: use of a modified endogenous ribosomal protein gene as a drug-resistant marker and ribosomal DNA as an integration target for vector DNA. Journal of Bacteriology. 1995;177:7171–7177. doi: 10.1128/JB.177.24.7171-7177.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Lang BF, O'Kelly C, Nerad T, Gray MW, Burger G. The closest unicellular relatives of animals. Current Biology. 2002;12:1773–1778. doi: 10.1016/S0960-9822(02)01187-9. [DOI] [PubMed] [Google Scholar]
  61. Larson BT, Ruiz-Herrero T, Lee S, Kumar S, Mahadevan L, King N. Biophysical principles of choanoflagellate self-organization. PNAS. 2020;117:1303–1311. doi: 10.1073/pnas.1909447117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Laundon D, Larson BT, McDonald K, King N, Burkhardt P. The architecture of cell differentiation in choanoflagellates and sponge choanocytes. PLOS Biology. 2019;17:e3000226. doi: 10.1371/journal.pbio.3000226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Leadbeater BSC. The Choanoflagellates: Evolution, Biology, and Ecology. Cambridge: Cambridge University Press; 2015. [DOI] [Google Scholar]
  64. Levin TC, Greaney AJ, Wetzel L, King N. The rosetteless gene controls development in the choanoflagellate S. rosetta. eLife. 2014;3:e04070. doi: 10.7554/eLife.04070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Levin TC, King N. Evidence for sex and recombination in the choanoflagellate Salpingoeca rosetta. Current Biology. 2013;23:2176–2180. doi: 10.1016/j.cub.2013.08.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Li H, Beckman KA, Pessino V, Huang B, Weissman JS, Leonetti MD. Design and specificity of long ssDNA donors for CRISPR-based knock-in. bioRxiv. 2019 doi: 10.1101/178905. [DOI]
  67. Liang X, Potter J, Kumar S, Zou Y, Quintanilla R, Sridharan M, Carte J, Chen W, Roark N, Ranganathan S, Ravinder N, Chesnut JD. Rapid and highly efficient mammalian cell engineering via Cas9 protein transfection. Journal of Biotechnology. 2015;208:44–53. doi: 10.1016/j.jbiotec.2015.04.024. [DOI] [PubMed] [Google Scholar]
  68. Lin S, Staahl BT, Alla RK, Doudna JA. Enhanced homology-directed human genome engineering by controlled timing of CRISPR/Cas9 delivery. eLife. 2014;3:e04766. doi: 10.7554/eLife.04766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Liu X, Homma A, Sayadi J, Yang S, Ohashi J, Takumi T. Sequence features associated with the cleavage efficiency of CRISPR/Cas9 system. Scientific Reports. 2016;6:19675. doi: 10.1038/srep19675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. López-Escardó D, Grau-Bové X, Guillaumet-Adkins A, Gut M, Sieracki ME, Ruiz-Trillo I. Reconstruction of protein domain evolution using single-cell amplified genomes of uncultured choanoflagellates sheds light on the origin of animals. Philosophical Transactions of the Royal Society B: Biological Sciences. 2019;374:20190088. doi: 10.1098/rstb.2019.0088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Lorenz R, Bernhart SH, Höner Zu Siederdissen C, Tafer H, Flamm C, Stadler PF, Hofacker IL. ViennaRNA package 2.0. Algorithms for Molecular Biology. 2011;6:26. doi: 10.1186/1748-7188-6-26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Manning G, Young SL, Miller WT, Zhai Y. The protist, Monosiga brevicollis, has a tyrosine kinase signaling network more elaborate and diverse than found in any known metazoan. PNAS. 2008;105:9674–9679. doi: 10.1073/pnas.0801314105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Marron AO, Alston MJ, Heavens D, Akam M, Caccamo M, Holland PWH, Walker G. A family of diatom-like silicon transporters in the siliceous loricate choanoflagellates. Proceedings of the Royal Society B: Biological Sciences. 2013;280:20122543. doi: 10.1098/rspb.2012.2543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Milligan JF, Uhlenbeck OC. Synthesis of small RNAs using T7 RNA polymerase. Methods in Enzymology. 1989;180:51–62. doi: 10.1016/0076-6879(89)80091-6. [DOI] [PubMed] [Google Scholar]
  75. Momose T, Concordet JP. Diving into marine genomics with CRISPR/Cas9 systems. Marine Genomics. 2016;30:55–65. doi: 10.1016/j.margen.2016.10.003. [DOI] [PubMed] [Google Scholar]
  76. Moreno-Mateos MA, Vejnar CE, Beaudoin JD, Fernandez JP, Mis EK, Khokha MK, Giraldez AJ. CRISPRscan: designing highly efficient sgRNAs for CRISPR-Cas9 targeting in vivo. Nature Methods. 2015;12:982–988. doi: 10.1038/nmeth.3543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Nichols SA, Roberts BW, Richter DJ, Fairclough SR, King N. Origin of metazoan cadherin diversity and the antiquity of the classical cadherin/β-catenin complex. PNAS. 2012;109:13046–13051. doi: 10.1073/pnas.1120685109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Oberacker P, Stepper P, Bond DM, Höhn S, Focken J, Meyer V, Schelle L, Sugrue VJ, Jeunen GJ, Moser T, Hore SR, von Meyenn F, Hipp K, Hore TA, Jurkowski TP. Bio-On-Magnetic-Beads (BOMB): Open platform for high-throughput nucleic acid extraction and manipulation. PLOS Biology. 2019;17:e3000107. doi: 10.1371/journal.pbio.3000107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Okamoto S, Amaishi Y, Maki I, Enoki T, Mineno J. Highly efficient genome editing for single-base substitutions using optimized ssODNs with Cas9-RNPs. Scientific Reports. 2019;9:1–11. doi: 10.1038/s41598-019-41121-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Paix A, Folkmann A, Goldman DH, Kulaga H, Grzelak MJ, Rasoloson D, Paidemarry S, Green R, Reed RR, Seydoux G. Precision genome editing using synthesis-dependent repair of Cas9-induced DNA breaks. PNAS. 2017;114:E10745–E10754. doi: 10.1073/pnas.1711979114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Parfrey LW, Lahr DJ, Knoll AH, Katz LA. Estimating the timing of early eukaryotic diversification with multigene molecular clocks. PNAS. 2011;108:13624–13629. doi: 10.1073/pnas.1110633108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Peña JF, Alié A, Richter DJ, Wang L, Funayama N, Nichols SA. Conserved expression of vertebrate microvillar gene homologs in choanocytes of freshwater sponges. EvoDevo. 2016;7:13. doi: 10.1186/s13227-016-0050-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Peng D, Tarleton R. EuPaGDT: a web tool tailored to design CRISPR guide RNAs for eukaryotic pathogens. Microbial Genomics. 2015;1:e000033. doi: 10.1099/mgen.0.000033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Pincus D, Letunic I, Bork P, Lim WA. Evolution of the phospho-tyrosine signaling machinery in premetazoan lineages. PNAS. 2008;105:9680–9684. doi: 10.1073/pnas.0803161105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Renaud JB, Boix C, Charpentier M, De Cian A, Cochennec J, Duvernois-Berthet E, Perrouault L, Tesson L, Edouard J, Thinard R, Cherifi Y, Menoret S, Fontanière S, de Crozé N, Fraichard A, Sohm F, Anegon I, Concordet JP, Giovannangeli C. Improved genome editing efficiency and flexibility using modified oligonucleotides with TALEN and CRISPR-Cas9 nucleases. Cell Reports. 2016;14:2263–2272. doi: 10.1016/j.celrep.2016.02.018. [DOI] [PubMed] [Google Scholar]
  86. Richardson CD, Ray GJ, DeWitt MA, Curie GL, Corn JE. Enhancing homology-directed genome editing by catalytically active and inactive CRISPR-Cas9 using asymmetric donor DNA. Nature Biotechnology. 2016;34:339–344. doi: 10.1038/nbt.3481. [DOI] [PubMed] [Google Scholar]
  87. Richter DJ, Fozouni P, Eisen MB, King N. Gene family innovation, conservation and loss on the animal stem lineage. eLife. 2018;7:e34226. doi: 10.7554/eLife.34226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Rouet P, Smih F, Jasin M. Introduction of double-strand breaks into the genome of mouse cells by expression of a rare-cutting endonuclease. Molecular and Cellular Biology. 1994;14:8096–8106. doi: 10.1128/MCB.14.12.8096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Ruiz-Trillo I, Roger AJ, Burger G, Gray MW, Lang BF. A phylogenomic investigation into the origin of metazoa. Molecular Biology and Evolution. 2008;25:664–672. doi: 10.1093/molbev/msn006. [DOI] [PubMed] [Google Scholar]
  90. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Schneider-Poetsch T, Ju J, Eyler DE, Dang Y, Bhat S, Merrick WC, Green R, Shen B, Liu JO. Inhibition of eukaryotic translation elongation by cycloheximide and lactimidomycin. Nature Chemical Biology. 2010;6:209–217. doi: 10.1038/nchembio.304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Sebé-Pedrós A, Burkhardt P, Sánchez-Pons N, Fairclough SR, Lang BF, King N, Ruiz-Trillo I. Insights into the origin of metazoan filopodia and microvilli. Molecular Biology and Evolution. 2013;30:2013–2023. doi: 10.1093/molbev/mst110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Sebé-Pedrós A, Degnan BM, Ruiz-Trillo I. The origin of metazoa: a unicellular perspective. Nature Reviews Genetics. 2017;18:498–512. doi: 10.1038/nrg.2017.21. [DOI] [PubMed] [Google Scholar]
  94. Shin SE, Lim JM, Koh HG, Kim EK, Kang NK, Jeon S, Kwon S, Shin WS, Lee B, Hwangbo K, Kim J, Ye SH, Yun JY, Seo H, Oh HM, Kim KJ, Kim JS, Jeong WJ, Chang YK, Jeong BR. CRISPR/Cas9-induced knockout and knock-in mutations in Chlamydomonas reinhardtii. Scientific Reports. 2016;6:27810. doi: 10.1038/srep27810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG. Fast, scalable generation of high-quality protein multiple sequence alignments using clustal omega. Molecular Systems Biology. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Skelton HM, Burkholder JM, Parrow MW. Axenic culture of the heterotrophic dinoflagellate Pfiesteria shumwayae in a semi-defined medium. Journal of Eukaryotic Microbiology. 2009;56:73–82. doi: 10.1111/j.1550-7408.2008.00368.x. [DOI] [PubMed] [Google Scholar]
  97. Staahl BT, Benekareddy M, Coulon-Bainier C, Banfal AA, Floor SN, Sabo JK, Urnes C, Munares GA, Ghosh A, Doudna JA. Efficient genome editing in the mouse brain by local delivery of engineered Cas9 ribonucleoprotein complexes. Nature Biotechnology. 2017;35:431–434. doi: 10.1038/nbt.3806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Stevens DR, Atteia A, Franzén LG, Purton S. Cycloheximide resistance conferred by novel mutations in ribosomal protein L41 of Chlamydomonas reinhardtii. Molecular and General Genetics MGG. 2001;264:790–795. doi: 10.1007/s004380000368. [DOI] [PubMed] [Google Scholar]
  99. Stöcklein W, Piepersberg W. Binding of cycloheximide to ribosomes from wild-type and mutant strains of Saccharomyces cerevisiae. Antimicrobial Agents and Chemotherapy. 1980;18:863–867. doi: 10.1128/AAC.18.6.863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Sutton CA, Ares M, Hallberg RL. Cycloheximide resistance can be mediated through either ribosomal subunit. PNAS. 1978;75:3158–3162. doi: 10.1073/pnas.75.7.3158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Thyme SB, Akhmetova L, Montague TG, Valen E, Schier AF. Internal guide RNA interactions interfere with Cas9-mediated cleavage. Nature Communications. 2016;7:11750. doi: 10.1038/ncomms11750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Tsai SQ, Zheng Z, Nguyen NT, Liebers M, Topkar VV, Thapar V, Wyvekens N, Khayter C, Iafrate AJ, Le LP, Aryee MJ, Joung JK. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nature Biotechnology. 2015;33:187–197. doi: 10.1038/nbt.3117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Ward JD. Rapid and precise engineering of the Caenorhabditis elegans genome with lethal mutation co-conversion and inactivation of NHEJ repair. Genetics. 2015;199:363–377. doi: 10.1534/genetics.114.172361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Wetzel LA, Levin TC, Hulett RE, Chan D, King GA, Aldayafleh R, Booth DS, Sigg MA, King N. Predicted glycosyltransferases promote development and prevent spurious cell clumping in the choanoflagellate S. rosetta. eLife. 2018;7:e41482. doi: 10.7554/eLife.41482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Woznica A, Cantley AM, Beemelmanns C, Freinkman E, Clardy J, King N. Bacterial lipids activate, synergize, and inhibit a developmental switch in choanoflagellates. PNAS. 2016;113:7894–7899. doi: 10.1073/pnas.1605015113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Woznica A, Gerdt JP, Hulett RE, Clardy J, King N. Mating in the closest living relatives of animals is induced by a bacterial chondroitinase. Cell. 2017;170:1175–1183. doi: 10.1016/j.cell.2017.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Wu X, Scott DA, Kriz AJ, Chiu AC, Hsu PD, Dadon DB, Cheng AW, Trevino AE, Konermann S, Chen S, Jaenisch R, Zhang F, Sharp PA. Genome-wide binding of the CRISPR endonuclease Cas9 in mammalian cells. Nature Biotechnology. 2014;32:670–676. doi: 10.1038/nbt.2889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Xu H, Xiao T, Chen CH, Li W, Meyer CA, Wu Q, Wu D, Cong L, Zhang F, Liu JS, Brown M, Liu XS. Sequence determinants of improved CRISPR sgRNA design. Genome Research. 2015;25:1147–1157. doi: 10.1101/gr.191452.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Yang L, Guell M, Byrne S, Yang JL, De Los Angeles A, Mali P, Aach J, Kim-Kiselak C, Briggs AW, Rios X, Huang P-Y, Daley G, Church G. Optimization of scarless human stem cell genome editing. Nucleic Acids Research. 2013;41:9049–9061. doi: 10.1093/nar/gkt555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Yeh CD, Richardson CD, Corn JE. Advances in genome editing through control of DNA repair pathways. Nature Cell Biology. 2019;21:1468–1478. doi: 10.1038/s41556-019-0425-z. [DOI] [PubMed] [Google Scholar]
  111. Young SL, Diolaiti D, Conacci-Sorrell M, Ruiz-Trillo I, Eisenman RN, King N. Premetazoan Ancestry of the Myc–Max Network. Molecular Biology and Evolution. 2011;28:2961–2971. doi: 10.1093/molbev/msr132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Yu Y, Guo Y, Tian Q, Lan Y, Yeh H, Zhang M, Tasan I, Jain S, Zhao H. An efficient gene knock-in strategy using 5'-modified double-stranded DNA donors with short homology arms. Nature Chemical Biology. 2020;16:387–390. doi: 10.1038/s41589-019-0432-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Zar JH. Biostatistical Analysis. Prentice Hall; 1999. [Google Scholar]

Decision letter

Editor: Alejandro Sánchez Alvarado1
Reviewed by: Margaret A Titus2, Iñaki Ruiz-Trillo3, Matthew C Gibson4

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Thank you for submitting your article "Genome editing enables reverse genetics of multicellular development in the choanoflagellate Salpingoeca rosetta" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by Alejandro Sánchez Alvarado as the Reviewing Editor and Patricia Wittkopp as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Margaret A Titus (Reviewer #1); Iñaki Ruiz-Trillo (Reviewer #2); Matthew C Gibson (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.

Summary:

This work represents an important step forward for the study of choanoflagellates. It reports the first evidence of a reverse genetics approach based on CRISPR/Cas-9 mediated genome editing in Salpingoeca rosetta, an emerging model to study the origin and evolution of animal multicellularity. In this work, authors engineered a cycloheximide-resistance selectable marker to demonstrate genome editing and to enrich for edited cells. Later they disrupted rosetteless, an S. rosetta C-type lectin gene essential for rosette colonies development, the absence of which impairs the formation of rosette colonies. This work represents an important advance to investigate in vivo the function of genes using a genome editing-based approach in S. rosetta, which will greatly expand its value as an experimental model to address animal origins.

Essential revisions:

1) The authors point out that the method here could also be used to edit the genome of related choanoflagellates. This would assume that the transfection method would work as well on these (or some of them) as well as it does for S. rosetta. Has this been tested or is there a strong reason to think that this will be the case, especially for the more divergent members of the species?

2) One desired use of gene editing would be to introduce a tag to a gene of interest for expression of an in-frame fusion at endogenous levels. The insertions made are rather small in size. Have the authors tried to fuse a GFP, for example, to a gene of interest and found this to work? Or do the data the authors have in hand support the possibility to insert a larger DNA to generate such a fusion?

3) One consideration with gene editing is the introduction of off-target mutations. Do the authors have any information about the possibility of generating additional, background mutations in the genome when using the co-CRISPR approach?

4) One nagging question in CRISPR/Cas9 reverse genetic approaches is the possibility of off-target mutagenesis, which is most effectively ruled out by 1)independent gRNAs; 2) evidence for Mendelian segregation and 3) non-complementation between independent alleles in a diploid organism. All experiments shown here are performed in haploid cells. However, Salpingoeca rosetta can undergo sexual reproduction through a diploid state (Levin and King, 2013) with genetic analysis suggesting that inheritance patterns follow Mendel's law of segregation and independent assortment (Levin et al., 2014). Can CRISPR/Cas9 mutagenized choanoflagellates undergo sexual reproduction? If so, is allele frequency in line with the rules of classical Mendelian genetics? How could this be utilized in future experiments to improve genetic resolution? A discussion of this information, if possible, would help to fully expand on the potential uses for targeted mutagenesis in S. rosetta.

5) In the manuscript version for review, the protocol availability on protocol.io appears only once – towards the middle of the Materials and methods (subsection “Delivery of gene editing cargoes with nucleofection”). I would suggest this information additionally be listed in the Abstract and/or somewhere in the main body of the paper.

6) By directly showing the loss of Rosettless protein in the mutants, Supplementary Figure 3C offers striking visual corroboration of the genetic and genomic evidence of targeted mutagenesis. I found this to be particularly strong data and would suggest moving these panels to the main text. Within the figures, it would also be helpful to readers to clearly indicate the color key for the markers used somewhere in the figure itself (Rosetteless (green), α-tubulin (gray), and actin (magenta).

7) The main Figure 1 is essentially review information and the main Figure 2 is important but graphically and conceptually quite simple. The authors should consider fusing these figures by putting Figure 1 in a horizontal format with the Figure 2 panels below. I think this would be more engaging for readers and also open up room to move Supplementary Figure 3 wholesale into the main presentation.

8) The authors provide detailed analysis to show that the inclusion of a donor template significantly favors DNA repair over indel creation (Figure 4, Figure 4—figure supplement 1). However, there appears to be no significant difference in editing outcomes with different types of repair templates as long as the template has both left and right homology arms. Maybe I'm missing something, but this seems at odds with the final sentence of the Results section.

eLife. 2020 Jun 4;9:e56193. doi: 10.7554/eLife.56193.sa2

Author response


Essential revisions:

1) The authors point out that the method here could also be used to edit the genome of related choanoflagellates. This would assume that the transfection method would work as well on these (or some of them) as well as it does for S. rosetta. Has this been tested or is there a strong reason to think that this will be the case, especially for the more divergent members of the species?

This point highlights that a limiting step for performing genome editing and reverse genetics in S. rosetta and other choanoflagellates has been the establishment of an efficient method for transfection. In our previous paper that focused on transfecting S. rosetta (Booth et al., 2018), we described how transfection could be applied to other systems. In this paper, we raise the possibility of establishing genome editing in other choanoflagellates (Discussion) as a hypothetical (“…we anticipate…”) based on lessons that we’ve learned from S. rosetta.

In fact, we have helped advise another group that has adapted the transfection procedure from S. rosetta to successfully transfect another choanoflagellate species. As these data are still unpublished and not ours to report, we think it is most appropriate to simply raise the possibility that our methods might extend to other choanoflagellates.

2) One desired use of gene editing would be to introduce a tag to a gene of interest for expression of an in-frame fusion at endogenous levels. The insertions made are rather small in size. Have the authors tried to fuse a GFP, for example, to a gene of interest and found this to work? Or do the data the authors have in hand support the possibility to insert a larger DNA to generate such a fusion?

We agree that one of the desired uses for gene editing is the insertion of fluorescent protein tags at endogenous loci. Although we do not currently have data that demonstrate the incorporation of such tags, we are actively pursuing these experiments (when we can return to lab, that is). It is worth noting that even in large research communities, such as researchers using human cell lines and/or established model organisms, there has often been a long wait between establishing genome editing and inserting large tags (Hsu et al., 2014) with more and more clever insights continually being published (e.g. Li et al., 2019). Nonetheless, with the tools in hand, we are optimistic that small epitope tags could be incorporated at endogenous loci to accelerate protein localization experiments. As we have not yet been able to pursue those experiments, we have not commented on that particular use in the submitted manuscript.

3) One consideration with gene editing is the introduction of off-target mutations. Do the authors have any information about the possibility of generating additional, background mutations in the genome when using the co-CRISPR approach?

We agree that an important concern for any CRISPR-based approach is the possibility of introducing off-target mutations. In fact, this potential problem was a key motivation behind our decision to perform genome editing with the SpCas9 RNP (as opposed to a cas9 plasmid), as previous studies comparing off-target mutations with different Cas9-delivery methods showed fewer off-target mutations with the Cas9 RNP (Kim et al., 2014; Liang et al., 2015; Han et al., 2020). We have now added this point and associated references to our list of reasons for using the SpCas9 RNP:

“We favored delivering the SpCas9 RNP rather than expressing SpCas9 and gRNAs from plasmids, as RNA polymerase III promoters for driving gRNA expression have not yet been characterized in S. rosetta and the overexpression of SpCas9 from plasmids can be cytotoxic for other organisms (Jacobs et al., 2014; Jiang et al., 2014; Shin et al., 2016; Foster et al., 2018) as well as increase the likelihood of introducing off-target mutations (Kim et al., 2014; Liang et al., 2015; Han et al., 2020).”

Although we did not directly measure the frequency of off-target mutagenesis during genome editing in S. rosetta, genetic crosses have been performed in S. rosetta (Levin et al., 2014; Wetzel et al., 2018) and offer a means to eliminate background mutations for future studies (also see our response to comment 4). Moreover, circumstantial evidence suggests that off-target mutagenesis may be infrequent in S. rosetta. Off-target mutagenesis in other organisms typically occurs through untemplated insertions and/or deletions (Kim et al., 2014; Liang et al., 2015; Han et al., 2020), but these are rare according to our next generation sequencing at the rosetteless locus (Figure 4).

4) One nagging question in CRISPR/Cas9 reverse genetic approaches is the possibility of off-target mutagenesis, which is most effectively ruled out by 1)independent gRNAs; 2) evidence for Mendelian segregation and 3) non-complementation between independent alleles in a diploid organism. All experiments shown here are performed in haploid cells. However, Salpingoeca rosetta can undergo sexual reproduction through a diploid state (Levin and King, 2013) with genetic analysis suggesting that inheritance patterns follow Mendel's law of segregation and independent assortment (Levin et al., 2014). Can CRISPR/Cas9 mutagenized choanoflagellates undergo sexual reproduction? If so, is allele frequency in line with the rules of classical Mendelian genetics? How could this be utilized in future experiments to improve genetic resolution? A discussion of this information, if possible, would help to fully expand on the potential uses for targeted mutagenesis in S. rosetta.

Thank you for encouraging us to include these points for discussion. Based on this comment, we have added the following paragraph to the Discussion to better highlight how independent alleles can work together to establish genotype to phenotype relationships, just as we show in this manuscript with two rosetteless alleles (Figure 3). We also highlight how genetic crosses and transgenesis could further aid reverse genetics.

“Moving forward, the approach established here promises to accelerate future research on choanoflagellates. […] Tools that have been previously used in forward genetic approaches, such as genetic crosses (Levin and King, 2013; Levin et al., 2014; Woznica et al., 2017) and stable transgenesis (Wetzel et al., 2018), may also provide the means to rapidly generate strains with different genetic backgrounds to complement mutants, to reveal epistasis, or simply to eliminate off-target mutations that may arise during genome editing.”

To respond to the specific question about whether genome edited strains can mate, we do expect that genome edited strains can undergo sexual reproduction, and we have some preliminary evidence, but we feel this goes beyond the scope of the current study and the relevant data do not yet meet our standards for publication. As previous forward genetic studies have shown, the frequency of alleles in haploid progeny from genetic crosses do obey classical Mendelian genetics. Except for tightly linked loci and potential epistasis between alleles, we would expect that these results should be generalizable to genome edited strains.

5) In the manuscript version for review, the protocol availability on protocol.io appears only once – towards the middle of the Materials and methods (subsection “Delivery of gene editing cargoes with nucleofection”). I would suggest this information additionally be listed in the Abstract and/or somewhere in the main body of the paper.

Thank you for this suggestion. We have added a mention of the protocol in the Introduction:

“Here we report a reliable method for genome editing to perform reverse genetics in S. rosetta that we have developed into a publicly-accessible protocol (https://dx.doi.org/10.17504/protocols.io.89fhz3n).”

6) By directly showing the loss of Rosettless protein in the mutants, Supplementary Figure 3C offers striking visual corroboration of the genetic and genomic evidence of targeted mutagenesis. I found this to be particularly strong data and would suggest moving these panels to the main text. Within the figures, it would also be helpful to readers to clearly indicate the color key for the markers used somewhere in the figure itself ( Rosetteless (green), α-tubulin (gray), and actin (magenta).

Done.

7) The main Figure 1 is essentially review information and the main Figure 2 is important but graphically and conceptually quite simple. The authors should consider fusing these figures by putting Figure 1 in a horizontal format with the Figure 2 panels below. I think this would be more engaging for readers and also open up room to move Supplementary Figure 3 wholesale into the main presentation.

Thank you for these suggestions. Following these recommendations, we have moved the immunofluorescence images from the supplement to the main text Figure 3G-J. Although simple, we think Figures 1 and 2 make distinct points for readers to take away and would like to keep them as separate figures.

8) The authors provide detailed analysis to show that the inclusion of a donor template significantly favors DNA repair over indel creation (Figure 4, Figure 4—figure supplement 1). However, there appears to be no significant difference in editing outcomes with different types of repair templates as long as the template has both left and right homology arms. Maybe I'm missing something, but this seems at odds with the final sentence of the Results section.

Thank you for this question, as it emphasizes that we need to clarify our data regarding relative rates of repair with different templates. Although the frequencies of genome editing appear largely comparable among the repair templates (Figure 4C; subsection “S. rosetta preferentially introduces genome-edited mutations from DNA templates”), there are meaningful differences that may be relevant for experimental design. In Figure 4C, it may have been difficult to discern those differences because the logarithmic y-axis compresses values and deemphasizes the variance and differences between experiments with different repair templates, so we have revised this panel to extend the y-axis. We have also underscored the differences between each repair template by reporting in the text the P-values from ANOVA and multiple comparison tests. To the reviewers’ point, we also emphasize in the main text that the editing frequencies are similar for all practical purposes.

“We found that the S. rosetta genome could be edited in a SpCas9-dependent manner using a variety of templates (Figure 4C, Figure 4—figure supplement 1C). In the presence of the SpCas9, INDEL mutations occurred at a frequency of < 0.1%, either in the presence or absence of DNA repair templates (Figure 4C, Figure 4—figure supplement 1D-E). In contrast, the addition of SpCas9 with DNA repair templates that encoded the PTS resulted in PTS mutations at a frequency of 0.79-1.57%, which is at a ten-fold higher frequency than the INDEL mutational frequency (Two-Factor ANOVA: P < 10-13). Notably, the total frequency of genome editing (~ 1%) is on the same order of magnitude as transfection efficiency (~1%; Booth et al., 2018), suggesting that the delivery of SpCas9 and repair templates is the biggest factor limiting genome editing efficiency.

In the absence of SpCas9, we observed two types of SpCas9-independent genome edits. The first was a single INDEL mutation detected in a population of cells transfected with the antisense repair template, and the second was the detection of PTS mutations at an average frequency of ~ 0.02% upon the delivery of a double-stranded repair template. Although the frequency of these mutations occurred at a rate less than or equal to the detection threshold (~ 0.02%), meaning that we could not confidently conclude that differences exist between any of the samples (Two-Factor, ANOVA: P < 0.27), the presence of these mutations is consistent with a low frequency of endogenous DNA repair. These results also emphasize that the addition of SpCas9 was essential for efficient, targeted mutagenesis (Two-Factor ANOVA: P < 0.02 for INDEL mutations and P < 10-13 for PTS mutations).

Altogether, our optimization efforts revealed that the delivery of the SpCas9 with a DNA template spanning both sides of the SpCas9 cleavage site introduced PTS mutations at a frequency of ~ 1%. We recommend using a sense-oriented, single-stranded template for genome editing, as this template led to the highest frequency of PTS mutations (Single-Factor ANOVA, Tukey Multiple Comparison Test: P < 0.04) and costs less to synthesize than a double-stranded template.”

We also point out other considerations that have led us to favor the sense repair template. First, single-stranded oligonucleotides for the sizes of repair templates that were used in this paper are cheaper and faster to obtain compared to duplex templates. Second, the single-stranded oligo in the sense direction leads to higher editing frequencies (Figure 4 and Figure 2—figure supplement 2E, compare templates with 20-30 base homology arms). Given these collective considerations, we argue that the single-stranded, sense template represents the all-around best guide template to be used by other researchers who wish to implement these methods for their own research.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Source code 1. Quantification of DNA repair outcomes.

    BASH script for quantifying the frequency of repair outcomes from deep sequencing data that were preprocessed and aligned in a Galaxy server (Afgan et al., 2018).

    elife-56193-code1.sc (4.5KB, sc)
    Supplementary file 1. Tables of critical resources.

    Table A: Media recipes for making artificial seawater (Hallegraeff et al., 2004; Skelton et al., 2009), high nutrient media (modified from King et al., 2009; Levin and King, 2013; Booth et al., 2018), and low nutrient media. Table B: Oligonucleotide sequences for gRNAs, repair oligonucleotides, and primers that were used to construct and to validate genome edited strains. Table C: S. rosetta strains Genotypes and sources of S. rosetta strains used in this study. Table D: Deep sequencing library primers Sequences for primers (adapted from Lin et al., 2014 used to generate libraries for deep sequencing (Figure 4 and S5)

    elife-56193-supp1.xlsx (24.8KB, xlsx)
    Transparent reporting form

    Data Availability Statement

    All data generated are included in the manuscript. Additionally, we have posted a protocol at https://www.protocols.io: https://doi.org/10.17504/protocols.io.89fhz3n.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES