Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Mar 1.
Published in final edited form as: Curr Protoc Mol Biol. 2020 Mar;130(1):e112. doi: 10.1002/cpmb.112

Use of the CRISPR-Cas9 system in Drosophila cultured cells to introduce fluorescent tags into endogenous genes

Justin A Bosch 1, Shannon Knight 1,2, Oguz Kanca 3,4, Jonathan Zirin 1,2, Donghui Yang-Zhou 1,2, Yanhui Hu 1,2, Jonathan Rodiger 1,2, Gabriel Amador 1,2, Hugo J Bellen 3,4,5,6, Norbert Perrimon 1,2,7, Stephanie E Mohr 1,2
PMCID: PMC7213786  NIHMSID: NIHMS1062933  PMID: 31869524

Abstract

The CRISPR-Cas9 system makes it possible to cause double-strand breaks in specific regions, inducing repair. In the presence of a donor construct, repair can involve insertion or ‘knock-in’ of an exogenous cassette. One common application of knock-in technology is to generate cell lines expressing fluorescently tagged endogenous proteins. The standard approach relies on production of a donor plasmid with ~500–1000 bp of homology on either side of an insertion cassette that contains the fluorescent protein open reading frame (ORF). We present two alternative methods for knock-in of fluorescent protein ORFs into Cas9-expressing Drosophila S2R+ cultured cells, the single-stranded DNA (ssDNA) Drop-In method and CRISPaint universal donor method. Both methods eliminate the need to clone a large plasmid donor for each target. We discuss the advantages and limitations of the standard, ssDNA Drop-In, and CRISPaint methods for fluorescent protein tagging in Drosophila cultured cells.

Keywords: CRISPR, Drosophila, cell culture, knock-in, fluorescent protein tagging, ssDNA Drop-In, CRISPaint, GFP fusion, gene tagging

INTRODUCTION

Tagging endogenous proteins by insertion of a fluorescent protein open reading frame (ORF) into a gene is a common application of CRISPR knock-in technology, as it facilitates visualization of the cellular and subcellular distribution of the resulting fusion protein in live or fixed cells. In Drosophila cell lines, for example, introduction of a fluorescent protein ORF has been applied to generate a resource of Drosophila cell lines in which various organelles and sub-cellular compartments have been tagged with mCherry (Neumuller et al., 2012). The efficiency of introduction of tags into Drosophila cells is dramatically improved by introduction of the CRISPR-Cas9 system as a tool to facilitate insertion or ‘knock in’ of an insertion cassette into a specific locus. Indeed, CRISPR approaches have been successfully used to generate Drosophila cells in which endogenous loci are tagged by GFP fusion, e.g. (Bosch, Colbeth, Zirin, & Perrimon, 2019; Bottcher et al., 2014; Kanca et al., 2019; Kunzelmann, Bottcher, Schmidts, & Forstemann, 2016; Wang et al., 2016).

The standard protocol involves production of a plasmid donor with ~500–1000 bp homology arms (Housden & Perrimon, 2016). Alternative approaches, as presented here, can accelerate the CRISPR knock-in workflow, for example by making it easier to obtain or make donor constructs (Bosch et al., 2019; Kanca et al., 2019). Specifically, we present as alternatives to the standard approach a single-stranded DNA (ssDNA) “Drop-In” method based on in vitro synthesis of a ssDNA donor (Kanca et al., 2019)(Basic Protocol 1) and a “CRISPaint”-based approach that relies on universal donors (Bosch et al., 2019; Schmid-Burgk, Honing, Ebert, & Hornung, 2016)(Basic Protocol 2). For all three approaches, starting with a Cas9-positive cell line increases efficiency. The protocols described here are both based on use of the S2R+-MT::Cas9 cell line, which is described in (Viswanatha, Li, Hu, & Perrimon, 2018) and available from the Drosophila Genomics Resource Center (DGRC catalog #268; https://dgrc.bio.indiana.edu).

The standard plasmid-based donor, ssDNA Drop-In, and CRISPaint methods have different strengths and limitations (Fig. 1). A standard donor plasmid provides the most flexibility, allowing for insertion of GFP or another sequence into any region of the Drosophila genome with an sgRNA target site in close proximity (i.e. effectively, any genomic region). With the ssDNA Drop-In method (Basic Protocol 1), the donor construct is built using PCR followed by an in vitro digestion reaction to remove one of the two strands. Gene-specific regions are included in the design of the synthetic oligos used as primers in the PCR step, and there is no need for sub-cloning or propagation of donor plasmids in bacteria. These improve donor production efficiency; however, the size of the ssDNA insertion cassette is limited compared to the standard approach due to the way in which the ssDNA is generated (Support Protocol 2). The specific ssDNA Drop-In protocol described here corresponds to the research report by (Kanca et al., 2019) and is based on insertion of sfGFP as an artificial exon (Basic Protocol 1). With the CRISPaint method (Basic Protocol 2), there are no gene-specific regions in the donor; however, because the donor plasmid is linearized and integrated in full into the target locus, the CRISPaint method is only useful for C-terminal tagging. The specific CRISPaint protocol described here is based on the research report by (Bosch et al., 2019) for insertion of mNeonGreen and a puromycin selection marker that contributes to efficient isolation of insertion events (Basic Protocol 2). The nature of the gene target(s) and scale of the project are among the considerations that go into choosing an optimal method for a given project (see Strategic Planning).

Figure 1: Comparison of CRISPR knock-in methods for introduction of fluorescent protein tags into Drosophila cultured cells.

Figure 1:

(A) Diagram of a theoretical gene target and results of knock-in using ssDNA Drop-In method (Basic Protocol 1) and CRISPaint method (Basic Protocol 2). FP, fluorescent protein open reading frame (ORF); T2A, self-cleaving peptide ORF; PuroR, puromycin resistance ORF. (B) Comparison of standard plasmid-based donor method for tagging with a fluorescent protein ORF with ssDNA Drop-In and CRISPaint methods.

A workflow for both protocols is shown in Fig. 2. For either protocol, transfection with the donor and single guide RNA (sgRNA) constructs can be performed by chemical transfection, such as with Qiagen Effectene following the manufacturer’s protocol, or by electroporation, such as with the Lonza Nucleofect system (Support Protocol 1). Moreover, for both approaches, fluorescence-activated cell sorting (FACS) is used to identify and single-cell isolate putative fluorescent protein tagged cells, and this can be followed by image analysis to observe GFP or mNeonGreen in the cells, for example using Cell Profiler (Carpenter et al., 2006) and taking advantage of the fact that the S2R+-MT::Cas9 cell line expresses mCherry (Neumuller et al., 2012; Viswanatha et al., 2018). The mCherry signal can be used to identify cells and compared with the signal from the knock-in tag.

Figure 2: Workflow for ssDNA Drop-In and CRISPaint approaches to knock-in of fluorescent protein tags into Drosophila cultured cells.

Figure 2:

With both methods, production and validation of single-cell clones positive for the knock-in takes about two months.

STRATEGIC PLANNING

Which method is best for my target?

When deciding among the standard plasmid donor, ssDNA Drop-In, and CRISPaint methods to fluorescently tag an endogenous protein, important planning considerations include the following: (a) the size of the insertion cassette, (b) the intron-exon structure of the target gene, and (c) the desired position of the insertion relative to the coding sequence, as these will determine which strategy or strategies match with the project goals (Fig. 1). Insertion cassette size is limited for the ssDNA Drop-In method but not for the standard or CRISPaint methods. The ssDNA Drop-In method provides the fluorescence protein ORF as an artificial exon, such that an appropriate intron must be present in the target gene. The standard method allows for insertion of a cassette anywhere and the CRISPaint method is useful for C-terminal tagging. For all approaches, the expression level of the gene in Drosophila cells must be sufficient for detection of the fluorescent tagged protein. Expression levels of a target gene(s) in any of several Drosophila cell lines can be queried based on modENCODE Drosophila cell line transcriptomics data sets (Cherbas et al., 2011), for example using the Drosophila Gene Expression Tool (DGET) (https://www.flyrnai.org/dget) (Hu, Comjean, Perrimon, & Mohr, 2017). We note that (Kanca et al., 2019) report isolation of GFP-tagged cell lines using the ssDNA Drop-In approach for some targets expressed at moderate or low levels.

Rationale—ssDNA Drop-In Method (Basic Protocol 1)

Why single-stranded donors?

For ssDNA homology donors, short homology arms, typically 100 nucleotides (nts), are used to facilitate integration (Beumer, Trautman, Mukherjee, & Carroll, 2013; Gratz et al., 2013; Wissel et al., 2016). This is short enough to be included in PCR primers as 5’ overhangs, rather than requiring PCR amplification and cloning of homology arms, which is a requirement for the standard method. Moreover, PCR conditions do not change from gene to gene since the PCR template does not change. Thus, as compared with making standard donors, making donors using the ssDNA Drop-In method is faster and more scalable. In addition, different donor constructs can be amplified with the same primers. The method we use to generate ssDNA homology donors was modified from the ssDNA production method described in (Higuchi & Ochman, 1989).

Why provide the fluorescent tag as an artificial exon?

In our experience, integration of donor cassettes is not always precise. With an artificial exon approach, small indels are unlikely to affect function because the ssDNA Drop-In cassette is inserted into an intron (i.e. would not affect the resulting mature mRNA). Moreover, a comparison of data from an intronic tagging effort as reported in (Nagarkar-Jaiswal et al., 2015) to data from a C-terminal tagging effort as reported in (Sarov et al., 2016) suggests that intronic tagging leads to a slightly higher percentage of functional tagged proteins (75% of intronic tagged proteins vs. 67% of C-terminally tagged proteins were functional). Selecting introns that do not bifurcate functional domains will likely increase the chance of obtaining a functional tagged protein. Based on our own analysis of the Drosophila reference genome at FlyBase (Thurmond et al., 2019), about 40% of Drosophila protein-coding genes contain an intron with suitable sgRNA sites and of sufficient size (i.e. at least ~150 nucleotides) to support the ssDNA Drop-In artificial exon approach.

Rationale—CRISPaint method (Basic Protocol 2)

Whereas the standard and ssDNA Drop-In approaches rely on homology-directed repair (HDR) to integrate donor DNA with homology arms, the CRISPaint method uses an NHEJ-mechanism to insert a universal donor plasmid into a target gene (Schmid-Burgk et al., 2016). This accelerates up-front molecular steps by eliminating the need for PCR amplification of long homology donor arms (as for the standard approach) or generating ssDNA. Furthermore, publicly available universal donor plasmids containing different insert sequences provide flexibility (e.g. the CRISPaint Gene Tagging Kit, Addgene #1000000086). As mentioned, one trade-off is that the entire plasmid will insert into the locus, such that for fluorescence tagging, this approach is only useful for C-terminal tagging.

What cell line should I start with?

Different cell lines will be optimal for different targets, as different cell lines express different subsets of Drosophila genes. As mentioned in the Introduction, the protocols described here are based on use of a Cas9-expressing S2R+ cell line generated in (Viswanatha et al., 2018) and made available through the DGRC (catalog #268). For other cell lines, Cas9 could be provided transiently via co-transfection with a Cas9 expression vector or a Cas9-expression cell line could be established. For stable transfection protocols, see (Santos, Jorge, Brillet, & Pereira, 2007) and https://fgr.hms.harvard.edu/stable-fly-cell-lines.

Is your goal to tag any allele or all alleles?

In the protocols presented, we make the assumption that generating any tagged allele will result in a cell clone useful for the project. In some cases, however, the goal might be to isolate a cell line in which the fusion protein is the only source of the protein. This could be achieved either by isolating cell clones in which all alleles were converted to the knock-in allele or the non-knock-in alleles were disrupted by NHEJ-induced indels. Given that Drosophila S2R+ cells are polyploid, isolation of cells in which the tagged protein is the only source of the protein could be challenging. This would likely require additional molecular analyses not described here to identify and characterize non-tagged alleles (e.g. PCR amplification of the non-tagged alleles and next-generation sequencing of the product to detect indels), and might require a multi-step approach in which remaining non-tagged alleles are targeted for CRISPR knockout following successful isolation of a knock-in event.

What about knock-in of other types of sequences?

Knock-in of other sequences, including non-fluorescent tags, could be approached using protocols similar to those presented here. With introduction of a fluorescent tag or reporter, FACS can be used to identify and isolate from a live cell population the subset of single cells that are positive for the fluorescent marker. However, for most or all non-fluorescent tags or other knock-in events, live-cell FACS could not be used to identify and isolate single cells positive for the insertion. Instead, detection of non-fluorescent tags would require screening single-cell clones for tag expression by methods such as immunoblot or molecular analysis. In this case, antibiotic enrichment of correct insertion events, as is possible using the CRISPaint method, could make identification of positive cells much more feasible by enriching for successful insertion events prior to single-cell isolation and analysis.

BASIC PROTOCOL 1: Knock-in into Cas9-positive S2R+ cells using the ssDNA Drop-In approach

This protocol describes a method for CRISPR mediated knock-in of a fluorescent protein that relies on a ssDNA donor to provide the fluorescent protein ORF as an artificial exon, referred to as the ssDNA Drop-In method (Kanca et al., 2019). Following design of the knock-in and corresponding ssDNA and sgRNA, these molecular reagents are generated and transfected into cells. Cells are then single cell isolated by FACS and grown to form colonies, and individual colonies are tested using imaging and molecular analysis. The most effective method for molecular confirmation is PCR amplification of each junction site using a genomic-specific primer and an insertion cassette-specific primer, followed by sequencing. The protocol takes approximately two months to complete.

Materials

Cas9+ Drosophila cells, such as S2R+-MT::Cas9 (Viswanatha et al., 2018) (DGRC #268)

Schneider’s Media (see Reagents & Solutions)

Genomic DNA extraction kit, such as a Zymo Quick-DNA MiniPrep Kit (Zymo Research #D3024)

GFP Flanking Primer R1: 5’ ACCCTGAAGTTCATCTGCAC 3’

GFP Flanking Primer F2: 5’ GCATCACCCTGGGCATGGAT 3’

PCR polymerase and buffer such as High Fidelity Phusion Polymerase (NEB #M0530) and 5X Buffer

dNTP Mixture (2.5 mM each)

Ethidium Bromide

Agarose

QIAquick Gel Extraction Kit (Qiagen #28704)

Fluorescent Microscope

Image analysis software such as CellProfiler (Carpenter et al., 2006)

Eppendorf Tubes

Tabletop Centrifuge

PCR Machine

Spectrophotometer such as a NanoDrop microvolume spectrophotometer

Standard agarose gel electrophoresis apparatus

Target selection, knock-in design, and isolation of single-cell clones

  • 1

    Obtain the gene structure from FlyBase GBrowse or import it from NCBI with the sequence accession number to DNA editing software program such as SnapGene.

  • 2

    Choose a target intron. If there are multiple suitable introns, select the introns shared in all annotated transcripts and do not divide known or putative functional domains (e.g. using the SMART database, http://smart.embl-heidelberg.de/)(Letunic & Bork, 2018).

  • 3

    Scan the selected intron for sgRNAs. Select sgRNA target sites that are >50 nt away from endogenous splice donor/splice acceptor sites to ensure proper splicing of the artificial exon in the mature transcript. Also apply general sgRNA design principles (Support Protocol 1).

  • 4

    Clone the sgRNA (Support Protocol 1).

  • 5

    Generate the ssDNA donor (Support Protocol 2).

  • 6

    Transfect cells with the ssDNA donor and sgRNA plasmid (Support Protocol 3 or 4).

  • 7

    Optional: View the cells using a fluorescence microscope (40x or 60x objective). Some GFP-positive cells might be detectable.

    Visualize cells with a 40x or 60x microscope objective. If S2R+-MT::Cas9 cells are used, then all cells will be positive for the mCherry signal. We see a range of percent positive cells with this approach and even in cases where a signal is not obvious, GFP-positive cells might be identified, so we continue the workflow. For knock-in cell lines reported in (Kanca et al., 2019), we observed a range of ~0.5 to 7%, depending on the target.

  • 8

    Grow the cells to confluency in Schneider’s Media.

  • 9

    Isolate single cells and grow to form colonies (Support Protocol 5).

Validation of single-cell clones

  • 10

    With aliquots of cells isolated in step (8), identify strong GFP-expressing clones using fluorescence microscopy and an image analysis software package such as CellProfiler (Carpenter et al., 2006). See also Basic Protocol 2, step 5. The mCherry signal present in parental S2R+-MT::Cas9 cells can be used to define all cells and compared with the GFP signal to determine brightness and localization.

    For the work described in (Kanca et al., 2019), we identified the three brightest clones using Cell Profiler version 2.1.1 (see Internet Resources) and selected these for follow-up. You can cryopreserve the remaining cells, either individually or as a pool, so they are available for testing if the initial candidates fail validation.

  • 11

    Grow each candidate clone until the cells have reached confluency in a T25 flask. Resuspend the cells in media and aliquot 1 mL into an Eppendorf tube.

  • 12

    Prepare genomic DNA. If using a Zymo gDNA Miniprep Kit, spin the cells for 5 minutes at 500 rpm, discard supernatant, and resuspend the pellet in 1,000 μL of Genomic Lysis Buffer, then follow the rest of the manufacturer’s protocol to isolate genomic DNA. Measure the concentration of the sample.

  • 13

    Design a forward primer that amplifies ~300 bp upstream of the insert sequence in the target locus using the GFP Flanking Primer R1: 5’ ACCCTGAAGTTCATCTGCAC 3’ as the reverse primer. This will be Flanking Primer F1. See Fig. 3.

  • 14

    Design a reverse primer that amplifies ~300 bp downstream of the insert sequence in the target genome using the GFP Flanking Primer F2: 5’ GCATCACCCTGGGCATGGAT 3’ as the forward primer. This primer will be Flanking Primer R2.

  • 15

    Run a PCR reaction following the parameters set in Table 1 and assess the products by gel analysis and Sanger or next-generation sequencing.

  • 16

    Optional: Detect GFP fusion proteins by immunoblotting.

    Grow cell lines in 6-well plates until confluent, resuspend cells, and transfer 1 ml of resuspended cells into a 1.5 ml centrifuge tube. Centrifuge at 250 g for 10 min to pellet the cells. Aspirate the supernatant and gently resuspend cells in 1ml of ice-cold 1x PBS. Centrifuge at 250 g for 10min to pellet the cells. Lyse and denature the cell pellet by boiling in 250 μl of 2x SDS-Sample Buffer for 5 min. Load 10 μl of protein onto a SDS-Page gel, transfer to blotting paper, and detect GFP fusion proteins using an anti-GFP antibody at an appropriate dilution.

  • 17

    For successful clones, further expand and cryopreserve, following standard protocols such as those found at https://dgrc.bio.indiana.edu/include/file/FreezingCells.pdf.

Figure 3: Example design for ssDNA Drop-In into Fibrillarin.

Figure 3:

The position of the ssDNA donor and location of the primers used to amplify the 5’ and 3’ insertion sites for molecular validation are shown.

Table 1:

PCR conditions for amplification of the junctions between the ssDNA Drop-In cassette and the intron into which it has inserted.

Component Volume (μL)
Nuclease-free water 12.4
5X Phusion HF 4
2.5 mM dNTPs 0.4
10 μM F1 or F2 1
10 μM R1 or R2 1
100 ng/μL Genomic DNA 1
Phusion DNA Polymerase 0.2
Step Temperature Time
Initial Denaturation 98˚C 30 s
35 Cycles 98˚C 10 s
54˚C 30 s
72˚C 30 s
Elongation 72˚C 10 min
Hold 4˚C

BASIC PROTOCOL 2: Knock-in into Cas9-positive S2R+ cells by homology-independent insertion of universal donor plasmids that provide mNeonGreen (CRISPaint method)

This protocol describes CRISPR/Cas9 knock-in using the CRISPaint approach (Bosch et al., 2019; Schmid-Burgk et al., 2016), which uses an NHEJ-mechanism to insert a universal donor plasmid into a target gene (Fig. 4). To tag proteins with mNeonGreen in S2R+-MT::Cas9 cells (Viswanatha et al., 2018), you will first need to design and clone sgRNA-expressing plasmid(s) that target your gene(s) of interest. Next, for each target you will transfect the target-specific sgRNA plasmid along with two publicly available plasmids, a frame-selector sgRNA plasmid and the mNeonGreen universal donor plasmid. After transfection, puromycin selection can be used to enrich for in-frame insertions, followed by single-cell isolation, visualization of the tagged protein, and molecular confirmation.

Figure 4: Step-wise schematic of mNeonGreen-T2A-PuroR knock-in using homology-independent insertion.

Figure 4:

mNeonGreen-T2A-PuroR is inserted into 3’ coding sequence. From (Bosch et al., 2019); used with permission.

Materials

Cas9+ Drosophila cells, such as S2R+-MT::Cas9 (Viswanatha et al., 2018) (DGRC #268)

Schneider’s Media (Reagents & Solutions)

Schneider’s Media with Puromycin (Reagents & Solutions)

Frame selector plasmids (pCFD3-frame_selector_(0,1,or 2) (Addgene #s 127553–127555; DGRC#s 1482–1484)

CRISPaint donor plasmid(s) (See Addgene Kit # 1000000086; pCRISPaint-mNeonGreen-T2A-PuroR cannot be distributed through Addgene and is available directly from Hornung lab; (Schmid-Burgk et al., 2016))

Optional control sgRNA, pCFD3-Act5c (Addgene #130278; DGRC #1492)

Genomic DNA extraction kit, such as a Zymo Quick-DNA MiniPrep Kit (Zymo Research #D3024)

PCR polymerase and buffer such as High Fidelity Phusion Polymerase (NEB #M0530) and 5X Buffer

dNTP Mixture (2.5 mM each)

Ethidium Bromide

Agarose

Fluorescent Microscope

Image analysis software such as CellProfiler (Carpenter et al., 2006)

Eppendorf Tubes

Tabletop Centrifuge

PCR Machine

Spectrophotometer, such as a NanoDrop microvolume spectrophotometer

Standard agarose gel electrophoresis apparatus

Target selection, knock-in design, and isolation of single-cell clones

  • 1

    For each target gene, identify a sgRNA target site in the 3’ coding sequence. The sgRNA should be as close to the stop codon as possible (<100 bp away) and follow general rules for sgRNA design (Support Protocol 1).

  • 2

    For each target cut site, identify a matching frame-selector sgRNA (named frame 0, 1, or 2) (Fig. 4, 5). The frame-selector sgRNA is used to cut and linearize the donor plasmid. Matching the cutting frame of the donor plasmid with the target gene improves the chances of generating seamless in-frame insertions (see (Schmid-Burgk et al., 2016)). Choose an appropriate frame-selector sgRNA by analyzing the location of the target gene sgRNA DNA cleavage site relative to the reading frame. Note that the frame-selector sgRNA #’s are reversed relative to the traditional coding frame numbering system (Fig. 5).

    We analyze the genomic sequence of the target gene using Lasergene DNAstar software, though other DNA analysis programs are available. This helps locate and annotate the sgRNA target site, predicted DNA cleavage site, and amino acid reading frame. To help this analysis, we recommend using the online sgRNA prediction tool CRISPR3 (www.flyrnai.org/crispr3/web), which reports the cutting frame of the sgRNA in the target gene (Fig. 5). The orientation of the target gene sgRNA site (5’ or 3’) does not matter.

  • 3

    Clone the target-specific sgRNA (Support Protocol 1). Plasmids for Drosophila expression of the frame-selector sgRNAs can be obtained from Addgene or the DGRC (see Materials above for catalog numbers).

  • 4

    Transfect donor and sgRNA plasmids into cells (Support Protocol 3 or 4).

    The work described in (Bosch et al., 2019) used Effectene (Support Protocol 3). The experimental transfection mix will contain a donor plasmid (pCRISPaint-mNeonGreen-T2A-PuroR), the appropriate frame-selector sgRNA plasmid, and the target gene sgRNA plasmid. As a positive control for knock-in, you can use pCFD3-Act5c, frame selector 2, and the pCRISPaint-mNeonGreen-T2A-PuroR donor plasmid.

  • 5

    Optional: Visualize cells on an inverted fluorescence microscope. Cells should be proliferating and some might be noticeably fluorescent.

    Visualize cells using a 40x or 60x microscope objective. If S2R+-MT::Cas9 is used as the starting cell line, then all cells will be positive for mCherry signal. The number of knock-in tagged fluorescent cells is dependent on the transfection and knock-in efficiency, and the level of fluorescence in cells is dependent on the expression of the target gene. Act5c-mNeonGreen positive control integration events should be visible at this stage, with ~3% cells expected to be positive for mNeonGreen signal.

  • 6

    3–4 days after transfection, split cells into new 6-well plates at a 1:6 dilution with fresh Schneider’s Medium with puromycin at a final concentration of 2 μg/ml (1:5000 dilution of 10 mg/ml stock; see Reagents & Solutions). Incubate plates at 25˚C.

    It is possible that genes with lower expression levels may require lower concentrations of puromycin, or longer recovery period after transfection before puromycin treatment. We use the DGET tool from the Drosophila RNAi Screening Center (DRSC) (https://www.flyrnai.org/tools/dget/web/) to determine gene expression levels in S2R+ cells from RNA-seq data. We also note that you could skip the puromycin selection step and go to step 8 (FACS isolation).

  • 7

    Every 3–5 days, gently replace the media with fresh Schneider’s Medium with 2 μg/ml puromycin. Monitor the growth of cells on an inverted microscope. Cultures should become confluent after 12–16 days at 25˚C.

    To avoid disturbing the adherent cells when changing the media, we use vacuum aspiration to remove the media and cell debris, and add fresh Schneider’s Medium to the side of the well. Clones of adherent mNeonGreen+ cells may be observed using an inverted microscope even after a few days of puromycin selection. Use positive control wells (e.g. pCFD3-Act5c) and negative control wells (e.g. untransfected) to help determine the success of the puromycin selection. If cells are growing but not yet confluent after 12–16 days, continue replacing puromycin and monitoring every 3–5 days.

  • 8

    Single-cell isolate cells positive for the fluorescent tag (Support Protocol 5).

Figure 5: Diagram to help determine the appropriate CRISPaint frame-selector.

Figure 5:

A. Conversion table for target gene cut frame and CRISPaint frame selector sgRNA. B. Screenshot of output of the Find CRISPRs tool (www.flyrnai.org/crispr3/web). Arrow, column in a search results table showing the target gene sgRNA cut frame. The example sgRNA shown was used to generate His2Av-mNeonGreen knock-in cell line in Bosch et al. 2019. C. Schematic of gene targeting when using the three CRISPaint frame selector sgRNAs (0, 1, 2).

Validation of single-cell clones

  • 9

    Examine mNeonGreen localization in single-cell cloned lines using a confocal microscope or inverted fluorescence microscope. Retain cell lines that exhibit correct localization and robust growth. See Fig. 6 for representative results.

    To image live cell lines with high resolution and in a high-throughput manner, we transfer cell lines to a glass bottom 384 well plate and obtain images on an In Cell 6000 microscope using a 60x objective.

  • 10

    Remove the media from cells growing in 96 well plates and add 100 μl of QuickExtract solution to each well. Pipette up and down to resuspend and lyse the cells, then transfer the solution to a 96-well PCR plate or 8-well PCR strip tube. Incubate at 65˚C for 15 min, then 98˚C for 2 min. Store genomic DNA at 4˚C.

    We typically prepare a 96 well plate containing replicate cultures of each cell line in an organized layout to facilitate downstream PCR analysis. We allow the cells to adhere to the plate for at least 2 hrs before harvesting. Genomic DNA can be extracted from cells between 1×106-1×107 cells/ml.

  • 11

    Design a gene-specific forward primer upstream of the insertion site, to be used with the mNeonGreen_R reverse primer in a PCR reaction.

    We design the gene-specific forward primer using Primer3 (http://bioinfo.ut.ee/primer3-0.4.0/) to result in an amplified DNA fragment size of 300–1000 bp when used with mNeonGreen_R

  • 12

    Run a PCR reaction following the parameters in Table 1. See Fig. 7.

    The successful amplification of a DNA fragment indicates that the cell line contains at least one correct orientation insertion of mNeonGreen into the target cut site.

  • 13

    Purify amplified DNA fragments from agarose gels and submit for Sanger sequencing to determine the sequence of the mNeonGreen insertion site. See Fig. 5 for representative data.

    We use QIAquick columns to purify DNA fragments from agarose gels. We use the same primers for sequencing as were used for PCR amplification. We analyze the sequence of the insertion site using Lasergene software. First, using SeqBuilder, we create a DNA sequence file representing the hypothetical seamless mNeonGreen insertion site. Next, we align the chromatogram sequences to this reference file using SeqMan. Finally, we use SeqBuilder to annotate any differences between the two. If indels are present at the insertion site, the predicted amino acid sequence is analyzed to determine if mNeonGreen is in coding frame with the target gene. If more than one type of insertion allele is present (double peaks in sanger sequencing data), users will have to use a different method to resolve the sequences. This can be done by TOPO cloning the PCR fragment and sequence individual plasmids to identify the different allele sequences, or using next-generation sequencing.

  • 14

    Optional: PCR amplify the non-insertion locus using the gene-specific forward primer and a gene-specific reverse primer that flanks the insertion site. Analyze DNA fragments by gel imaging and sequencing as described above.

    If more than one indel allele is present, users will have to TOPO clone the PCR fragment and sequence individual plasmids to identify the different allele sequences, or use next-generation sequencing.

  • 15

    Optional: Detect mNeonGreen fusion proteins by immunoblotting.

    Grow cell lines in 6-well plates until confluent, resuspend cells, and transfer 1 ml of resuspended cells into a 1.5 ml centrifuge tube. Centrifuge at 250 g for 10 min to pellet the cells. Aspirate the supernatant and gently resuspend cells in 1ml of ice-cold 1x PBS. Centrifuge at 250 G for 10min to pellet the cells. Lyse and denature the cell pellet by boiling in 250 μl of 2x SDS-Sample Buffer for 5 min. Load 10 μl of protein onto a SDS-Page gel, transfer to blotting paper, and detect mNeonGreen fusion proteins using mouse anti-mNeonGreen antibody at 1:1000 concentration. See Figure 2F in (Bosch et al., 2019) for representative data.

  • 16

    For successful clones, further expand and cryopreserve (see for example https://dgrc.bio.indiana.edu/include/file/FreezingCells.pdf).

Figure 6. Confocal images of live mNeonGreen-expressing single-cell cloned S2R+ lines.

Figure 6.

Results with targeting four genes are shown. Images show fluorescence from Clic-mCherry (red), which is present in the parental Cas9-positive cell line, and mNeonGreen (green). Scale bar, 25 μm. Modified from (Bosch et al., 2019); used with permission.

Figure 7. Agarose gel with PCR fragments amplified from knock-in (Gene_F/mNeonGreen_R) and non-knock-in loci (Gene_F/R).

Figure 7.

Positive control bands were amplified from Rp49 genomic sequence. From (Bosch et al., 2019); used with permission.

Support Protocol 1: sgRNA Design and Cloning

Factors relevant to sgRNA design include (a) position of the sgRNA relative to the target, with the specific approach in mind (see Basic Protocol 1 and Basic Protocol 2), (b) predicted effectiveness of the sgRNA, and (c) the presence of single nucleotide polymorphisms (SNPs) in the target region in the cell line being used. Online resources such as DRSC Find CRISPRs or CRISPR Optimal Target Finder can be used to identify and evaluate appropriate sgRNAs (see below and Internet Resources). Cloning of the sgRNA is straightforward.

Materials

pCFD3 (Addgene # 49410)

Chemically competent E. coli, such as TOP10 cells (C404010, Invitrogen)

BbsI-HF enzyme (New England Biolabs, R3539)

T4 DNA ligase (New England Biolabs, M0202)

Lysogeny broth (LB) (L3022–1KG, Sigma)

LB agarose plates with 50ug/ml carbenicillin (C1389–10G, Sigma)

LB with Ampicillin at 100μg/ml (10835269001, Roche)

QIAprep Spin Miniprep Kit (27104, Qiagen)

Oligos to anneal for sgRNA (user specific)

Shaking incubator at 37˚C (MS012T6, Multitron)

14ml culture tubes (60818–703, VWR)

42˚C water bath (182, Precision)

  1. Use a Drosophila sgRNA design resource such as the DRSC Find CRISPRs tool or CRISPR Optimal Target Finder to select sgRNA target sites (see Internet Resources). For the DRSC Find CRISPRs tool, optimal designs have a seed score of 12 or 13, and an efficiency score of >5. When possible, you should exclude target sites that overlap known genomic variation in the cell line. The DRSC Find CRISPRs tool displays whole genome variation data for the S2R+ Cas9 cell line. Users working with cell lines that have not had their genome sequenced should PCR amplify and sequence the target region to determine if the region contains variants that would affect sgRNA effectiveness.

  2. Evaluate cutting efficiency of sgRNAs that do not have any predicted off target sites for cutting using CRISPR Efficiency Predictor (https://www.flyrnai.org/evaluateCrispr/). If available, choose sgRNAs with efficiency scores of >5.

    It is important to avoid possible U6 Terminator (TTTT) in the sgRNA sequence, since the pCFD3 plasmid uses the U6 promoter.

  3. Clone the sgRNA-expressing plasmid(s). For each sgRNA target site, design and order two complementary oligonucleotides (IDT or equivalent company) encoding the sgRNA. Anneal each pair of oligos, creating overhanging sticky ends, and ligate the annealed oligos into a sgRNA expression plasmid backbone.

    We clone single sgRNAs into pCFD3 (Port, Chen, Lee, & Bullock, 2014), which contains the Drosophila U6:3 promoter (see also https://www.crisprflydesign.org/grna-expression-vectors/). Briefly, this involves digesting pCFD3 with BbsI-HF, ligating annealed oligos into the digested pCFD3 backbone using T4 DNA ligase, transforming ligated plasmid into chemically competent TOP10 bacteria using a 42˚C heat shock, and plating cells onto LB carbenicillin agar plates for incubation overnight at 37˚C. Single bacterial colonies are cultured in LB ampicillin shaking overnight at 37˚C. Plasmids are isolated from cultures using a miniprep kit (Qiagen or equivalent) and sequenced by Sanger sequencing using the pCFD3 sequencing primer. We typically resuspend final plasmids in water (or Qiagen EB) at a final concentration of 200 ng/μl. Alternatively, sgRNAs can be cloned into alternative plasmid backbones or synthesized and ordered from a company.

SUPPORT PROTOCOL 2: ssDNA Donor Synthesis

This protocol describes amplification of the ssDNA Drop-In method cassette using primers that recognize the donor vector and add gene-specific homology arms, and subsequent production of ssDNA from the amplified fragment.

Materials

Drop-In SA-sfGFP-SD donor template plasmid (Kanca et al., 2019)

Q5 Hot start high fidelity 2X master mix (NEB #M0494)

Lambda exonuclease (NEB #M0262)

Qiaquick PCR purification kit (Qiagen #28106)

Monarch PCR and DNA Cleanup kit (NEB #T1030)

Optional: 2X QX RNA denaturation solution (Qiagen #929607)

0.2 μL thin-walled PCR tubes

PCR machine

  1. Design PCR primers with gene-specific homology arms as 5’overhangs. See Fig. 8.

    Primer binding sites in the template vector are 5’-GAATTCTGTAAAACGACGGCCAGTGG-3’ for forward primer and 5’-GCCCCCAGGAAACAGCTATGACGG-3’ for reverse primer. Example gene specific primers would be:
    • Forward primer: 5’ 100nts left homology arm- GAATTCTGTAAAACGACGGCCAGTGG 3’
    • Reverse primer: 5’ 100nt right homology arm (reverse complement)- GCCCCCAGGAAACAGCTATGACGG 3’.

    Left homology arm is 100 nts upstream of 3 nts prior to Protospacer Adjacent Motif (PAM) and right homology arm is 100 nts downstream of 3 nts prior to PAM. An easy way to design oligos is to integrate the template construct (from 5’ primer binding site to the 3’ primer binding site) in silico to the targeted region (3 nts prior to PAM sequence) and design the primers on this in silico generated sequence. SA-sfGFP-SD construct is directional. It is important to know the orientation of the gene relative to the reference genome sequence while designing the knock-in approach. We work on in silico files that are orientated according to the orientation of gene of interest (i.e. if the gene of interest is transcribed on the minus strand of genomic reference sequence in FlyBase, then we use the reverse complement of the sequence for in silico design steps). In addition, you should PCR amplify and sequence the homology arm region using genomic DNA from your target cell type, to make sure that there are not SNPs in this region as compared with the reference genome.

  2. Order high-quality oligo primers (such as Ultramers from Integrated DNA Technologies; IDT, Coralville IA). One of the primers should be ordered with the 5’ phosphorylation modification. This will ensure that one of the resulting PCR strands is 5’ phosphorylated and the other strand not phosphorylated, which is critical for production of ssDNA from the PCR product.

  3. Spin down the lyophilized primers briefly and dissolve in 40 μL distilled water at 50℃ for 15 minutes.

  4. Set up four 50 μL PCR reactions per construct using Q5 Hot Start High Fidelity 2X Master Mix (NEB #M0494). A master mix of the reactions should be set up with 1 μL of each primer, 4 μL of template (diluted to 1 ng/μL), 94 μL ddH2O and 100 μL 2X Master Mix. Distribute 200 μL reaction to 4 PCR tubes. PCR conditions:
    Step Temperature Time
    Initial Denaturation 98℃ 30 sec
    35 Cycles 98℃ 10 sec
    70℃ 30 sec
    72℃ 40 sec
    Elongation 72℃ 2 min
    Hold 12℃

    The template should be selected according to the last codon of preceding exon in order to avoid frame shift mutations. There are three donor template vectors: phase 0, phase 1 and phase 2. If the splice donor is situated at position 0 at a codon (i.e. if the last codon in the exon before splice donor is not divided) phase 0 PCR template is used. If the last codon in preceding exon is divided in position one (one nucleotide in preceding exon and two nucleotides in following exon) then phase 1 template is used. If the last codon is divided in position 2 (two nucleotides in preceding exon, one nucleotide in the following exon) then phase 2 template is used.

  5. PCR reactions result in 1392 nt amplicons. Pool the resulting PCR reactions and run

    10μL on an agarose gel to confirm the band size.

    This results in a single strong band at 1392 nts and a weak band of a smaller size that we assume is comprised of primer dimers.

  6. Distribute the remaining PCR products in two tubes and isolate amplicons using two Qiaquick PCR purification kit columns. Elute each column in 50 μL elution buffer and mix the two isolated samples. Measure DNA concentration in NanoDrop using 1 μL sample with a dsDNA protocol.

    A typical yield is 100 μL of dsDNA in solution at a concentration of ~100–200 ng/μL.

  7. Set up two Lambda exonuclease reactions using 4–6 μg of DNA, 10 μl of 10X Lambda exonuclease buffer, 8 μl of Lambda exonuclease, ddH2O to 100 μl.

    We distribute the two samples in a total of four PCR tubes (50 μl per tube) and do the reactions in a thermocycler for convenience.

  8. Incubate at 37℃ for 1 hour, 10 min at 75℃ and hold at 4℃.

  9. Pool the reaction results in two 100 μl samples and isolate using Monarch PCR and DNA Cleanup kit using single stranded DNA isolation protocol in two columns. Elute each column in 10 μl 55℃ elution buffer per column. Pool the eluates and measure DNA concentration with NanoDrop using ssDNA protocol.

    We typically achieve a yield of ssDNA at ~100–150 ng/μl.

  10. Optional: Denature the ~200–300 ng ssDNA by mixing ssDNA with equal volume 2X QX RNA denaturation solution and incubating at 70℃ for 2 min and on ice 1 min. Load samples on 2% agarose gel along with untreated double stranded PCR products to confirm the band size.

    The single stranded donor construct runs at a lower size than double stranded controls (~650–850 bps).

Figure 8. Schematic of PCR-based generation of drop-in ssDNA constructs to tag genes with sfGFP using an artificial exon.

Figure 8.

Gray boxes, UTRs; orange boxes, coding exons; yellow line, coding introns; black line, outside coding introns and exons . sfGFP: superfolderGFP; SA: Splice Acceptor of mhc; SD: Splice Donor of mhc; L: flexible linker that consists of four copies of Gly-Gly-Ser. Adapted from (Kanca et al., 2019).

SUPPORT PROTOCOL 3: Transfection using Effectene

This protocol describes transfection using the chemical transfection reagent Effectene (Qiagen), the method used in (Bosch et al., 2019), and follows closely the protocol provided by the manufacturer. Another commercial transfection reagent shown to be effective for Drosophila cultured cells could be used, following the manufacturer’s protocol.

Materials

Cas9-expressing S2R+ cells (S2R+-MT::Cas9, DGRC Stock #268)

Schneider’s Medium (see Reagents & Solutions)

Effectene transfection reagent (301427, Qiagen)

sgRNA plasmid for each target/design (see Basic Protocol 1 or 2 for design and Support Protocol 1 for synthesis)

ssDNA donor (see Basic Protocol 1, Support Protocol 2) or CRISPaint donor plasmid (see Basic Protocol 2)

For Basic Protocol 2, frame selector plasmid (pCFD3-frame_selector_(0,1,or 2) (Addgene #s 127553–127555)

Optional control sgRNA for Basic Protocol 2, pCFD3-Act5c (GP07595, Addgene # 130278)

6-well tissue culture plates (3516, Costar)

  1. Plate Drosophila Cas9-expressing S2R+ cells into 6-well dishes at a concentration of 1.8×106 cells/ml (2 ml volume each well, 3.6×106 cells/well) in Schneider’s Medium. Incubate the plates at 25˚C for at least 4 hours to allow the cells to adhere to the bottom of the plate.

    We grow cells for at least two passages after thawing before using them for transfections, at which point cells should be growing robustly. We also avoid using cells that have gone through more than 30 passages.

  2. Prepare transfection mixture with sgRNA plasmid(s) and donor. As a negative control, use untransfected cells or omit the gene-targeting sgRNA plasmid from the transfection mix.

    For the work described in (Bosch et al., 2019), we used Qiagen Effectene transfection reagent, though similar products are available. Briefly, 400 ng of each plasmid are diluted in EC buffer up to a total volume of 90.4 μl and mixed by brief vortexing. Diluted plasmids are mixed with 9.6 μl of Enhancer, vortexed, and incubated for 2–5 minutes. 15 μl Effectene is added to the mixture, vortexed, and incubated for 10–15 minutes.

  3. Add transfection mixture to cultured cells prepared as described in step 1.

    Gently transfer 600 μl of media from a 6-well culture well to the transfection mix, mix by pipetting up and down, and add the media + transfection mix back to the same culture well in a gentle drop wise manner.

SUPPORT PROTOCOL 4: Electroporation of S2R+-MT::Cas9 Drosophila Cells

This protocol describes electroporation of Drosophila S2R+ cells using the Nucleofect system (Lonza V4XC-2032). This method was used to generate the ssDNA Drop-In tagged cell lines reported in (Kanca et al., 2019).

Materials

Drosophila cultured cells at ~70% confluency

Schneider’s Media (see Reagents and Solutions)

sgRNA plasmid for each target/design (see Basic Protocol 1 or 2 for design and Support Protocol 1 for synthesis)

ssDNA donor (see Basic Protocol 1, Support Protocol 2) or CRISPaint donor plasmid (see Basic Protocol 2)

For Basic Protocol 2, frame selector plasmid (pCFD3-frame_selector_(0,1,or 2) (Addgene #s 127553–127555)

Optional control sgRNA for Basic Protocol 2, pCFD3-Act5c (GP07595, Addgene # 130278)

SF Cell Line Solution Box (Lonza, cat. no. V4XC-2032)

Lonza 4D Nucleofector (Lonza, cat. no. AAF-1002B)

48-well Flat-Bottom Plate (Corning, cat. no. 29442–952)

Standard Bright Field Microscope or equivalent (such as the Leica DM IL)

Bright-Line hemocytometer with coverslip (Sigma-Aldrich, cat. no. Z359629)

15 mL Conical Tubes (Falcon, cat. no. 352097)

Centrifuge

PCR Tubes

  1. Fill the wells of a 48-well plate with 500 μL of room temperature Schneider’s Media per reaction and set aside for after the cells are electroporated.

    Even though the outermost wells will not be used with cells, they should be filled with media to help maintain local humidity. The cells have difficulty growing after electroporation and humidity encourages growth.

  2. Grow cells to ~70% confluency prior to electroporation.

    The cells should not reach full confluency before being electroporated. If they do, fewer viable cells will be recovered post-electroporation.

  3. Gently resuspend ~70% confluent Drosophila S2R+-MT::Cas9 cells in media. Extract 10 μL of the media, load onto a hemocytometer, and count the cells.

  4. For each reaction, load 4×105 cells in a 15 mL conical tube and centrifuge at 700 RPM for 10 minutes. Aspirate the media.

  5. Gently resuspend the cells in 17 μL of SF Cell Line Solution and aliquot the reactions into separate PCR tubes.

  6. Add sgRNA(s) and donor to the cell solution for a final volume of 20 μL.

    For (Kanca et al., 2019), we used 1 μL of sgRNA (~100 ng/μL) and 2 μL of the ssDNA donor (~100 ng/μL).

  7. Add the entire reaction volume to an electroporation cuvette by pipetting into the space between the electrode plate.

    Avoid making bubbles by pipetting slowly. If bubbles are present or if solution is unevenly placed between the electrodes, gently tap the cuvette on the counter a few times to settle the solution.

  8. Load the electroporation cuvette into the Lonza 4D Nucleofector, select the cuvette wells to be electroporated, and run Program DS-137.

  9. Remove the cells from the cuvette immediately following transfection and place into the individual wells of the prepared 48-well plate.

  10. Observe the cells daily for health and growth. After two days, remove half of the media along with any floating cells and replace with room temperature regular media.

    Fluorescence from the knock-in tag should be visible after 7–10 days.

Support Protocol 5: Single-Cell Isolation of Fluorescent Cells Using FACS

This protocol describes use of FACS to isolate single cells. If FACS is not available, serial dilution can be used as an alternative method for single cell isolation (see Internet Resources). Single cells will not survive in Schneider’s Medium alone. To support growth, use conditioned media (see Reagents & Solutions) or feeder cells (see Critical Parameters).

Materials

Schneider’s Medium (see Reagents & Solutions)

Conditioned Medium (see Reagents & Solutions)

FACS machine, such as the FACSAria II (BD Biosciences)

Parafilm (PM-999, Bemis)

Plastic box with lid to create a humidity chamber

96-well Flat Bottom Plate (Corning, cat. no. 29442–056)

12-well Flat Bottom Plate (Corning, cat. no. 29442–040)

6-well Flat Bottom Plate (Corning, cat. no. 29442–042)

T25 Flacon Tissue Culture Treated Flasks (VWR, cat. no. 29185–300)

T75 TC Treated Flasks (Genesee Scientific, cat. no. 25–209)

  1. Monitor growth of the cells after transfection/electroporation. Expand the cells by resuspending the cells when confluent and transferring them to plates/flasks of increasing size: 12-well plate, 6-well plate, T25 flask, T75 flask. Use an aliquot of each cell population to document the GFP fluorescence using a fluorescent microscope.

    Confluent cells have a fast growth rate. You should be able to expand cells to a larger vessel each day. Check expanded cells the next morning and expand further if confluent.

  2. Prepare conditioned media (Reagents & Solutions) at least 2–3 days before single cell sorting the clones. Conditioned media can be stored at 4˚C for a few weeks. Prepare several 96-well plates filled with 100 μL of conditioned media per well.

    To ensure the recovery of sufficient numbers of single cell clones, we prepare three 96 well plates per knock-in experiment. Do not sort cells into the outer wells. Fill the outer wells of the 96-well plates with 200 μL of regular media instead of conditioned media. This is extremely important as the 96-well plates are to be left alone for ~20 days, and filling the outer wells with extra media will provide a mini-humidity chamber that will last the 20 days.

  3. Grow cells to near confluency (~80%) in a T75 flask, resuspend the cells in media, and count the cells on the hemocytometer. Remove at least 105 cells and filter through a 40 μm filter into a 15 mL conical tube. If necessary, add regular media to bring the total volume in the tube to 1 mL. Include a sample of untransfected S2R+-MT::Cas9 cells as a negative control for GFP.

  4. Pass the samples through a FACS machine (such as the BD Aria II), using the untransfected S2R+-MT::Cas9 cells to set the gates as shown in Fig. 9. Sort the cells that are positive for GFP (Gate P4) into the inner wells of the 96-well plates filled with conditioned media.

  5. Wrap each 96-well plate and lid with parafilm and incubate at 25˚C.

  6. To further prevent the plates from drying out, we place plates inside a plastic box containing wet strips of paper towels. Alternatively, plates can be kept in an incubator with 33% relative humidity. Users can also keep plates at room temperature in a dark drawer.

  7. About 3 weeks after FACS isolation of single cells, look at the wells of the 96-well plate under a bright field microscope to find growing single cells clones.

    Clones that are growing well are usually visible by eye as tiny specks in the well. Under the microscope, such a clone should look similar to the example shown in Fig. 10. However, it is important to look at all of the wells under the microscope as some clones might be present but not large enough to be seen by eye. Cells in small clusters (5 cells or less) have a much lower chance of surviving. Clusters of ~6 cells or more are likely to grow to confluence, they will just take a little longer to grow.

  8. Resuspend the clump of cells in the media to spread out the cells and encourage growth to confluency. Check the cells in a few hours or the following day for confluency. If confluent, expand the cells as previously described using regular media.

    We find it helpful to assign unique identifiers to each clone to keep track of the targeted gene and the number of clones. We typically obtain single cell cloning efficiencies of ~10–30%. In our experience, some individual wells might contain mold. If this is widespread, consult the FACS facility or review sterile techniques. If this is rare, we recommend to continue the workflow but avoid selecting colonies from these wells.

Figure 9: FACS gating to identify potential GFP-positive cell populations.

Figure 9:

Demonstration of the three different gates used to separate living and healthy cells from debris and dead cells (Gates P1-P3). Healthy cells are then gated by the presence of GFP (Gate P4). The top row shows the S2R+-MT::Cas9 cells used to set up the gates that are then applied to the electroporated populations. The bottom row is an electroporated population showing the initial gating of the entire population (Gate P1) and then the GFP positive cells (Gate P4). Cells in this gate are then single-cell sorted.

Figure 10: Example clone in a 96-well plate well 20 days after single-cell isolation.

Figure 10:

A clone of this size can be expected to expand easily after transfer to Schneider’s Media in a flask.

REAGENTS AND SOLUTIONS

Schneider’s Medium

Schneider’s Drosophila Media 1X (ThermoFisher Scientific, cat. no. 21720024)

10% FBS (A3912, Sigma)

1% penicillin/streptomycin (15070–063, ThermoFisher)

Prepare Schneider’s Medium by adding FBS (10% final) and penicillin and streptomycin (final 500U/ml) to Schneider’s Drosophila Media and filter sterilize at 0.2μm.

Schneiders’ Medium + Puromycin

Schneider’s Medium (see above)

10mg/ml stock solution of Puromycin in H20 (540411 Calbiochem)

Prepare Schneiders’ Medium + Puromycin by diluting the 10mg/ml Puromycin stock solution to a final concentration of 2 μg/ml (1:5000 dilution).

Conditioned Media

Schneider’s Medium

Cultured cells, e.g. Drosophila S2R+ cells

Inoculate T175 flasks with 1×106 cells/ml S2R+ cells and collect the conditioned media 1 week later, when the cultures are confluent. Gently pour off media from a culture flask containing adherent confluent cells into a conical centrifuge tube, centrifuge at 250 G for 10 min to pellet any remaining cells and pass the supernatant through a 0.2 μm filter. We store conditioned media at 4˚C and use it for as long 1 month. See also (Housden, Nicholson, & Perrimon, 2017) and https://fgr.hms.harvard.edu/single-cell-isolation.

COMMENTARY

CRITICAL PARAMETERS

Location of the tag

Before attempting a knock-in, it is important to predict whether a fluorescent protein fusion will impact the target protein localization or function. This can be accomplished using a combination of literature review and bioinformatic tools. For example, users should search for publications in which their target protein (or homolog) was previously tagged, making note of the location of the tag (i.e. N-terminus, C-terminus, or internal) and whether the tagged protein was functional. Users should also analyze the amino acid sequence of their target protein for insights into whether the location of a tag is acceptable. For example, users should tag a protein in a region that is less evolutionarily conserved, unstructured, or does not disrupt a protein domain. Unlike the standard knock-in approach, which has the most flexibility, the ssDNA Drop-In knock-in method is limited to tagging at an internal protein location defined by an intron and the CRISPaint knock-in method is limited to C-terminal tags. Furthermore, the CRISPaint method will delete a small portion of C-terminus from the final target protein. For all methods, when possible, users should compare their final tagged protein localization to the localization of untagged protein using antibody staining.

Gene expression in cells

Another important consideration is the expression levels of the target gene, because isolation of the knock-in cell line is dependent on expression of the inserted tag; fluorescence from the endogenously tagged protein is detected by FACS and used to sort single cells. See also Strategic Planning and DGET (Hu et al., 2017) in Internet Resources. Furthermore, when selecting knock-in cells by antibiotic resistance (standard and CRISPaint methods), co-expression of the antibiotic resistance gene is required. Note that we have not empirically determined the lower limit of gene expression to obtain a knock-in. If target gene expression is off or low but induced by a biological or chemical factor (such as a signaling ligand or drug), we recommend inducing expression of the gene before FACS or antibiotic selection.

sgRNA design

A functional sgRNA is critical to generate knock-in events. We recommend using established protocols and online resources to design sgRNAs with efficient binding to their target site (see Support Protocol 1 and Internet Resources). In addition, the exact location of the sgRNA target site is an important decision. For the standard and ssDNA Drop-In method, the sgRNA-binding site should be close to the homology arms (Paquet et al., 2016). Moreover, for the ssDNA Drop-In method, the chosen intron(s) must be of sufficient length, and the sgRNA binding site should not disrupt non-coding regulatory elements. For the CRISPaint method, the sgRNA binding site in 3’ coding sequence must be as close to the stop codon as possible to limit the removal of C-terminal amino acids. To increase the chances of obtaining a knock-in cell line, users may want to perform parallel knock-ins using independent sgRNAs. For the standard and Drop-In methods, donor DNAs corresponding to each sgRNA would have to be constructed. In contrast, for the CRISPaint method, the same universal donor can be used with multiple sgRNAs.

Transfection

The efficiency of introducing DNA into the cells (via electroporation or transfection) is also an important variable. We strongly recommend including positive controls to ensure that multiple steps in the protocol are performed correctly. For the CRISPaint method, pCFD3-Act5c can be used as a positive control, where ~3% of cells are expected to be positive for the knock-in.

Single-cell isolation and growth

Drosophila cultured cells can be difficult to grow following single-cell isolation. As presented, we use conditioned media to support growth of cells following FACS. There are alternative methods for support of cell growth following single-cell isolation, such as use of irradiated feeder cells. Available methods are reviewed in (Luhur, Klueg, & Zelhof, 2019) and feeder cells are available from the DGRC (https://dgrc.bio.indiana.edu/cells/FeederCells).

TROUBLESHOOTING

Table 2 below describes common problems and corresponding troubleshooting advice.

Table 2:

Troubleshooting

Problem Potential causes Solutions
No fluorescent cells (or, for Basic Protocol 2, no puromycin resistant cells) Target gene sgRNA does not cut efficiently Confirm sgRNA can cut target site using a T7 endonuclease assay; design and clone an alternate sgRNA if necessary; test cutting efficiency prior to attempting the knock-in
SNP in the sgRNA target region Check for SNPs; design and clone an alternate sgRNA if necessary
Target protein not expressed or expressed at low levels Check RNAseq or qPCR data for the target; detect fluorescent tagged protein using an anti-tag antibody by immunostaining or immunoblot
Cells are positive by FACS (or for Basic Protocol 2, cells are puromycin resistant) but do not appear fluorescent by live-cell imaging Target protein expressed at low levels Check by qRT-PCR for the target; detect fluorescent tagged protein using an anti-tag antibody by immunostaining or immunoblot; check for the presence of the fluorescent ORF transcript by qRT-PCR
Unexpected fusion protein localization within cells Fusion protein affects protein localization If possible, target the tag to different location in the protein, or a different protein in the complex or organelle
Unidentified on- or off-target genome changes Screen multiple single cell cloned lines for expected localization; bioinformatics analysis of off targets

UNDERSTANDING RESULTS

Both protocols can result in successful fusions. Example results, including information about molecular and image-based validation of successful knock-in events, are presented in (Kanca et al., 2019) and (Bosch et al., 2019). If fluorescent protein-tagged knock-in events are not obtained, a number of steps can be looked at, including sgRNA design and cell health following transfection, FACS, and growth (see Troubleshooting). For some targets, reconsideration of the design might be needed, for example in the rare case that the fusion generates a dominant negative mutation that prevents successful events from being isolated. Moreover, successful detection of a fusion protein does not ensure that it is localized properly. For proteins with an expected localization based on antibody detection or other data, confirmation that the knock-in results in the expected localization pattern is relatively easy. When an antibody is not available and localization data has not been reported, it can be more difficult to have confidence that the correct localization pattern is observed. In addition, for tagged proteins with the expected subcellular distribution, there is no guarantee that the fusion protein is functional. Additional testing would be required to determine if that is the case.

As mentioned in Strategic Planning, the protocols presented here describe methods for tagging any allele, not all alleles. For many assays, such as when the tagged protein is being used to visualize an organelle or compartment, the fact that wild-type alleles are present does not matter. For other assays, however, it might be important that only the fusion protein is expressed. In those cases, additional molecular analysis would be necessary to determine if the knock-in is present in all alleles or a subset, and if any remaining non-tagged alleles are wildtype or modified, e.g. by an NHEJ-mediated indel. If antibodies against the target protein are available, immunoblotting could be used to determine if both fusion and wild-type proteins are present. Finally, we have not observed loss of the fusion protein in culture but if multiple types of alleles are present, then this could occur, such as by gene conversion. Thus, cells should be monitored during repeated passages to ensure that the fusion protein is retained.

TIME CONSIDERATIONS

Both protocols take about two to two and half months from start to finish. Timeline for the ssDNA Drop-In method (Basic Protocol 1): design, sgRNA cloning, and ssDNA synthesis, 4 days; transfection and growth, 7 days; FACS, single-cell cloning and characterization, 30 days. Timeline for the CRISPaint method (Basic Protocol 2): design and sgRNA cloning, 4 days; transfection and growth, 7 days; selection in puromycin-containing media, 20 days (optional); single-cell cloning and characterization, 30 days.

INTERNET RESOURCES

FlyBase

http://flybase.org/

Information on fly genes, bioinformatics tools, and fly genome browser. Described in (Thurmond et al., 2019).

DRSC Find CRISPR online resource, updated version

https://www.flyrnai.org/crispr3/web/

Useful for sgRNA design for Drosophila melanogaster. Includes ability to check if there are SNPs in the sgRNA region in the genome sequence of the S2R+-MT::Cas9 cell line. With this online resource, you can also view the frame of the cut site for a given sgRNA (see Fig. 4B).

CRISPR Optimal Target Finder

http://targetfinder.flycrispr.neuro.brown.edu/

Alternative online resource for design of sgRNAs for Drosophila melanogaster.

DRSC Efficiency Predictor

https://www.flyrnai.org/evaluateCrispr/

Useful for evaluation of sgRNA designs for Drosophila melanogaster made using any design tool. Described in (Housden et al., 2015).

Drosophila Gene Expression Tool (DGET)

https://www.flyrnai.org/tools/dget/web/

Useful for query of modENCODE transcriptomics data for S2R+ and other cell lines, for confirmation that a target gene is expressed at levels likely to be sufficient for detection of a fluorescent fusion protein. Described in (Hu et al., 2017).

DRSC protocols for Drosophila cell culture

https://fgr.hms.harvard.edu/fly-cell-culture

Step-by-step protocols and media formulations for Drosophila cell culture.

DGRC protocols for Drosophila cell culture

https://dgrc.bio.indiana.edu/Protocols?tab=cells

Step-by-step protocols and media formulations for Drosophila cell culture. See also (Luhur et al., 2019).

DGRC cell catalog

https://dgrc.bio.indiana.edu/Protocols?tab=cells

Catalog of cell lines available, including S2R+-MT::Cas9, fluorescence-tagged cell lines generated using the approaches presented here (Bosch et al., 2019; Kanca et al., 2019), and many other Drosophila cultured cell lines. See also (Luhur et al., 2019).

CRISPR Fly Design website

https://www.crisprflydesign.org

Online resource for Drosophila CRISPR protocols and information, including cloning of sgRNAs into the pCFD3 plasmid vector. See also (Port et al., 2014).

CellProfiler image analysis software user manual resources

http://cellprofiler-manual.s3.amazonaws.com/CellProfiler-3.0.0/modules/objectprocessing.html#identifyprimaryobjects

Details how to identify Primary Objects (for these protocols, mCherry-positive cells) and Secondary Objects (for these protocols, the GFP or mNeonGreen-tagged fusion proteins) from fluorescent micrographs of cells. See also (Carpenter et al., 2006).

http://cellprofiler-manual.s3.amazonaws.com/CellProfiler-3.0.0/modules/measurement.html#measureobjectintensity

Details how to measure the intensity of the targeted object of interest (for these protocols, brightness of the GFP or mNeonGreen fusion proteins). See also (Carpenter et al., 2006).

Addgene protocol for single-cell isolation by dilution

https://www.addgene.org/protocols/limiting-dilution/

KEY REFERENCES

(Viswanatha et al., 2018)

Describes generation of Cas9-expressing S2R+ cells (S2R+-MT::Cas9, DGRC catalog #268).

(Kanca et al., 2019)

First demonstration of the use of the ssDNA Drop-In method in Drosophila cells (and use of related technologies in vivo).

(Bosch et al., 2019)

First demonstration of the use of the CRISPaint method in Drosophila cells (and in vivo).

(Schmid-Burgk et al., 2016)

First demonstration of homology-independent insertion in mammalian cells using the CRISPaint system. Generated CRISPaint donor plasmids that are compatible in Drosophila cultured cells.

(Higuchi & Ochman, 1989)

First demonstration of production of ssDNA following PCR amplification.

(Port et al., 2014)

Describes the pCFD3 plasmid vector.

(Housden & Perrimon, 2016)

Describes production of long homology arm donors for the standard knock-in approach.

ACKNOWLEDGEMENTS

We thank Jonathan Schmid-Burgk for advice and the CRISPaint-mNeonGreen donor plasmid, and thank Rich Binari and Cathryn Murphy for general assistance. Figure 2 was drawn using BioRender. This work was supported by NIH ORIP R24OD019847 (N. Perrimon, PI; A. Simcox, Co-PI; S.E. Mohr, Co-I) and NIH NIGMS R01GM067858 (H.J. Bellen, PI). Additional relevant support from NIGMS R01 GM067761 and NIGMS P41 GM132087 (N. Perrimon, PI; S.E. Mohr, Co-I). S.E.M. is also supported in part by the Dana Farber/Harvard Cancer Center, which is supported in part by NIH NCI Cancer Center Support Grant # NIH 5 P30 CA06516 (L. Glimcher, PI). J.A.B. was supported by the Damon Runyon Foundation. N.P. and H.J.B. are investigators of the Howard Hughes Medical Institute.

LITERATURE CITED

  1. Beumer KJ, Trautman JK, Mukherjee K, & Carroll D (2013). Donor DNA Utilization During Gene Targeting with Zinc-Finger Nucleases. G3 (Bethesda), 3(4), 657–664. doi: 10.1534/g3.112.005439 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bosch JA, Colbeth R, Zirin J, & Perrimon N (2019). Gene Knock-Ins in Drosophila Using Homology-Independent Insertion of Universal Donor Plasmids. Genetics. doi: 10.1534/genetics.119.302819 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bottcher R, Hollmann M, Merk K, Nitschko V, Obermaier C, Philippou-Massier J, … Forstemann K (2014). Efficient chromosomal gene modification with CRISPR/cas9 and PCR-based homologous recombination donors in cultured Drosophila cells. Nucleic Acids Res, 42(11), e89. doi: 10.1093/nar/gku289 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Carpenter AE, Jones TR, Lamprecht MR, Clarke C, Kang IH, Friman O, … Sabatini DM (2006). CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol, 7(10), R100. doi: 10.1186/gb-2006-7-10-r100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cherbas L, Willingham A, Zhang D, Yang L, Zou Y, Eads BD, … Cherbas P (2011). The transcriptional diversity of 25 Drosophila cell lines. Genome Res, 21(2), 301–314. doi: 10.1101/gr.112961.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Gratz SJ, Cummings AM, Nguyen JN, Hamm DC, Donohue LK, Harrison MM, … O’Connor-Giles KM (2013). Genome engineering of Drosophila with the CRISPR RNA-guided Cas9 nuclease. Genetics, 194(4), 1029–1035. doi: 10.1534/genetics.113.152710 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Higuchi RG, & Ochman H (1989). Production of single-stranded DNA templates by exonuclease digestion following the polymerase chain reaction. Nucleic Acids Res, 17(14), 5865. doi: 10.1093/nar/17.14.5865 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Housden BE, Nicholson HE, & Perrimon N (2017). Synthetic Lethality Screens Using RNAi in Combination with CRISPR-based Knockout in Drosophila Cells. Bio Protoc, 7(3). doi: 10.21769/BioProtoc.2119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Housden BE, & Perrimon N (2016). Design and Generation of Donor Constructs for Genome Engineering in Drosophila. Cold Spring Harb Protoc, 2016(9). doi: 10.1101/pdb.prot090787 [DOI] [PubMed] [Google Scholar]
  10. Housden BE, Valvezan AJ, Kelley C, Sopko R, Hu Y, Roesel C, … Perrimon N (2015). Identification of potential drug targets for tuberous sclerosis complex by synthetic screens combining CRISPR-based knockouts with RNAi. Sci Signal, 8(393), rs9. doi: 10.1126/scisignal.aab3729 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Hu Y, Comjean A, Perrimon N, & Mohr SE (2017). The Drosophila Gene Expression Tool (DGET) for expression analyses. BMC Bioinformatics, 18(1), 98. doi: 10.1186/s12859-017-1509-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Kanca O, Zirin J, Garcia-Marques J, Knight SM, Yang-Zhou D, Amador G, … Bellen HJ (2019). An efficient CRISPR-based strategy to insert small and large fragments of DNA using short homology arms. Elife, 8. doi: 10.7554/eLife.51539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Kunzelmann S, Bottcher R, Schmidts I, & Forstemann K (2016). A Comprehensive Toolbox for Genome Editing in Cultured Drosophila melanogaster Cells. G3 (Bethesda), 6(6), 1777–1785. doi: 10.1534/g3.116.028241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Letunic I, & Bork P (2018). 20 years of the SMART protein domain annotation resource. Nucleic Acids Res, 46(D1), D493–D496. doi: 10.1093/nar/gkx922 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Luhur A, Klueg KM, & Zelhof AC (2019). Generating and working with Drosophila cell cultures: Current challenges and opportunities. Wiley Interdiscip Rev Dev Biol, 8(3), e339. doi: 10.1002/wdev.339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Nagarkar-Jaiswal S, Lee PT, Campbell ME, Chen K, Anguiano-Zarate S, Gutierrez MC, … Bellen HJ (2015). A library of MiMICs allows tagging of genes and reversible, spatial and temporal knockdown of proteins in Drosophila. Elife, 4. doi: 10.7554/eLife.05338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Neumuller RA, Wirtz-Peitz F, Lee S, Kwon Y, Buckner M, Hoskins RA, … Perrimon N (2012). Stringent analysis of gene function and protein-protein interactions using fluorescently tagged genes. Genetics, 190(3), 931–940. doi: 10.1534/genetics.111.136465 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Paquet D, Kwart D, Chen A, Sproul A, Jacob S, Teo S, … Tessier-Lavigne M (2016). Efficient introduction of specific homozygous and heterozygous mutations using CRISPR/Cas9. Nature, 533(7601), 125–129. doi: 10.1038/nature17664 [DOI] [PubMed] [Google Scholar]
  19. Port F, Chen HM, Lee T, & Bullock SL (2014). Optimized CRISPR/Cas tools for efficient germline and somatic genome engineering in Drosophila. Proc Natl Acad Sci U S A, 111(29), E2967–2976. doi: 10.1073/pnas.1405500111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Santos MG, Jorge SA, Brillet K, & Pereira CA (2007). Improving heterologous protein expression in transfected Drosophila S2 cells as assessed by EGFP expression. Cytotechnology, 54(1), 15–24. doi: 10.1007/s10616-007-9060-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Sarov M, Barz C, Jambor H, Hein MY, Schmied C, Suchold D, … Schnorrer F (2016). A genome-wide resource for the analysis of protein localisation in Drosophila. Elife, 5, e12068. doi: 10.7554/eLife.12068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Schmid-Burgk JL, Honing K, Ebert TS, & Hornung V (2016). CRISPaint allows modular base-specific gene tagging using a ligase-4-dependent mechanism. Nat Commun, 7, 12338. doi: 10.1038/ncomms12338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Thurmond J, Goodman JL, Strelets VB, Attrill H, Gramates LS, Marygold SJ, … FlyBase C (2019). FlyBase 2.0: the next generation. Nucleic Acids Res, 47(D1), D759–D765. doi: 10.1093/nar/gky1003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Viswanatha R, Li Z, Hu Y, & Perrimon N (2018). Pooled genome-wide CRISPR screening for basal and context-specific fitness gene essentiality in Drosophila cells. Elife, 7. doi: 10.7554/eLife.36333 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Wang H, Becuwe M, Housden BE, Chitraju C, Porras AJ, Graham MM, … Farese RV (2016). Seipin is required for converting nascent to mature lipid droplets. Elife, 5. doi: 10.7554/eLife.16582 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Wissel S, Kieser A, Yasugi T, Duchek P, Roitinger E, Gokcezade J, … Neumuller RA (2016). A Combination of CRISPR/Cas9 and Standardized RNAi as a Versatile Platform for the Characterization of Gene Function. G3 (Bethesda), 6(8), 2467–2478. doi: 10.1534/g3.116.028571 [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES