A model for accurate quantification of CRISPR effects in pooled FACS screens

Harold Pimentel; Jacob W Freimer; Maya M Arce; Christian M Garrido; Alexander Marson; Jonathan K Pritchard

doi:10.1101/2024.06.17.599448

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2024 Jun 18:2024.06.17.599448. [Version 1] doi: 10.1101/2024.06.17.599448

A model for accurate quantification of CRISPR effects in pooled FACS screens

Harold Pimentel ^1,^11,^12,^*, Jacob W Freimer ^1,^2,^3,^10,^12,^*, Maya M Arce ^2,⁴, Christian M Garrido ², Alexander Marson ^2,^3,^4,^5,^6,^7,^8,^13,^*, Jonathan K Pritchard ^1,^9,^13,^*

PMCID: PMC11213010 PMID: 38948774

Abstract

CRISPR screens are powerful tools to identify key genes that underlie biological processes. One important type of screen uses fluorescence activated cell sorting (FACS) to sort perturbed cells into bins based on the expression level of marker genes, followed by guide RNA (gRNA) sequencing. Analysis of these data presents several statistical challenges due to multiple factors including the discrete nature of the bins and typically small numbers of replicate experiments. To address these challenges, we developed a robust and powerful Bayesian random effects model and software package called Waterbear. Furthermore, we used Waterbear to explore how various experimental design parameters affect statistical power to establish principled guidelines for future screens. Finally, we experimentally validated our experimental design model findings that, when using Waterbear for analysis, high power is maintained even at low cell coverage and a high multiplicity of infection. We anticipate that Waterbear will be of broad utility for analyzing FACS-based CRISPR screens.

Introduction

Genetic screening is a powerful technique to identify the genes that underlie a phenotype or that are involved in a particular biological process. The ability of CRISPR/Cas9 to induce genetic perturbations efficiently has facilitated large-scale screens in many mammalian cell types^1,2. CRISPR screens can be paired with FACS to map the genetic wiring underlying complex phenotypes by identifying key upstream regulators of specific, relevant target genes^3–8. These screens use fluorescent reporters or fluorescent antibodies to directly measure the expression level of a protein of interest or a protein that is a surrogate marker of a biological process (such as a phosphorylated protein at the end of a signaling cascade). For simplicity, we will refer to any target measured by FACS – either an endogenous protein or a reporter protein – as a marker throughout the rest of the text. After pooled CRISPR perturbations, FACS is used to sort cells into different bins based on the fluorescence intensity of the marker. By sequencing the relative abundance of gRNAs in each bin it is possible to associate genetic perturbations with their effect on the levels of the marker.

There are a number of experimental and computational challenges when performing CRISPR FACS screens. These screens must balance a desire to perturb many genes, with high cell coverage for each perturbation, against costs and experimental demands. Furthermore, there is increasing interest in performing such screens in primary cells or in in vivo models which are more relevant for disease, but for which the number of cells that can be used is often limited ^9–11. CRISPR screens are also usually only performed with two or three replicates. Limiting numbers of cells and replicates reduce the number of times each gRNA is measured, which increases noise and uncertainty. This noise compounds with other sources of variability between replicates and donors, between different gRNAs targeting the same gene, and imprecise FACS gates. Finally, FACS screens are further complicated by the fact that they involve a pool of perturbed cells so the effect of each gRNA on the marker cannot be measured directly, but must instead be inferred by the relative abundance of gRNAs in different FACS bins. These challenges necessitate the development of new analysis methods designed specifically to analyze CRISPR FACS screens. Furthermore, there are often not principled guidelines on how different parameters affect the statistical power of these screens to inform experimental design.

We developed a computational framework, Waterbear, that (1) performs robust inference of CRISPR FACS screens and (2) informs optimal experimental design by iterating over thousands of plausible experimental configurations through simulation. Given parameters learned from real data, the model can generate realistic simulations of experiments at the single-cell level using a generative view of the model. The generative view of the model enables stepping through each parameter of the model to generate data that is consistent with parameters learned from real data while still introducing randomness at every stage of the model consistent with biological and experimental variability. Once a simulation is done, the statistical power of that experimental configuration can be estimated using our gene-level inference model which is a simplified version of the cell-level model which more closely mimics how the data is observed in practice. The inference model aggregates the cell-level information into a count observation and models the gRNA count distribution across discrete bins as is observed in actual screens. Waterbear is robust in that it can infer bin sizes, model the latent effects of the gRNAs on the marker distribution, and share information across guides, genes, and replicates to assess uncertainty. Further, this model is also used to analyze real data where the cell-level information is not available.

Waterbear is designed to use all available information to make informed decisions about whether each perturbed gene affects the marker distribution by modeling several relationships that are inherent to such screens. This model enables inferring thousands of parameters by shrinking the results towards a shared prior across relevant dimensions. For example, on average, gRNAs targeting the same gene should behave similarly, so in the model, gRNAs targeting the same gene share a “parent” distribution, while allowing each guide to have a unique effect size. While some gRNAs will produce off-target effects, Waterbear’s design does not ignore them, but rather downweights the evidence of the gene-level effect size if the off-target gRNA is inconsistent with other guides. Similarly, negative controls are used to infer experiment-level parameters such as the null marker distribution, variance between replicates, and variance between gRNAs. Additionally, Waterbear uses a sparse prior for gene-level effects since out of thousands of gRNAs, only a modest fraction of gRNAs will have a true measurable effect on the marker distribution.

While other tools have been used widely for FACS-based screen analysis, the current tools all have limitations for this purpose. MAGeCK was originally designed to analyze cell abundance screens and is one of the most commonly used CRISPR screen analysis tools ¹². However, MAGeCK only supports comparisons between two populations, and therefore it cannot take advantage of the additional information collected with more than two FACS bins, preventing it from modeling the underlying marker distribution. In contrast, MAUDE was developed specifically for the analysis of FACS screens ¹³. However, MAUDE does not explicitly handle replicates, requires a separate input population, and requires precise bin sizes to be manually specified. Finally, RELICS shares technical similarities with Waterbear as it is also a Bayesian hierarchical model. However, RELICS is designed for CRISPR tiling screens that perturb non-coding sequences where one would expect spatial correlations between guides, and thus the software is designed around the concept of finding “functional sites’’, rather than finding gene-gene relationships as we describe here. Waterbear was designed to overcome these limitations.

In addition to being able to analyze data, an expanded cell-level view of the Waterbear model can be used to simulate experiments to explore a wide range of experimental parameters and their implication on the design of these screens. We used Waterbear to explore how FACS bin size, gRNA coverage (the number of times a gRNA is measured), and lentiviral gRNA library multiplicity of infection ¹⁴ (MOI) affect the power of FACS-based screens and show that it is possible to reduce the number of cells required while still maintaining high sensitivity. Unlike existing simulation methods ¹⁵, we simulate each cell individually, enabling us to change cell-level parameters such as the MOI. We validate our simulations and inference procedure by repeating previous screens ³ at a higher MOI and with lower coverage of the gRNAs. Our results provide a roadmap to reduce the number of cells required for FACS screens and we introduce a powerful new analysis tool to analyze such screens. These advances open the door for future screens addressing novel biological questions in rare, primary cell types and in vivo models. Waterbear is available as free and open-source software at https://github.com/pimentel/waterbear.

Results

Overview of CRISPR FACS screens and tunable parameters

To understand the genetic regulation of a marker of interest, the most straightforward approach would be to individually perturb the expression of candidate regulatory genes and then measure the expression of the marker. This approach is challenging to scale and instead perturbations are often performed in a pool of cells with each cell containing a different perturbation ¹⁶. When the cells are pooled, the distribution of the marker reflects a mixture of many different perturbations and the effect of each individual perturbation cannot be directly observed (Figure 1 A). However, using FACS, the perturbed cells can be sorted into different bins based on the expression of the marker ¹⁶. By sequencing the gRNAs in the sorted cells and identifying which gRNAs are differentially enriched between the sorted populations, one can identify regulators of the marker (Figure 1A).

Figure 1: — A) Schematic of pooled CRISPR FACS screens. Perturbations that differentially affect the expression of a target protein of interest are mixed together in a pool of modified cells. To infer the effect of each gRNA, cells are sorted into bins based on expression of a target protein using FACS; gRNA abundance is compared across bins through sequencing. B) Experimental design considerations focused on reducing cell requirements, including the effect of changing i. gRNA coverage, ii. MOI, and iii. FACS bin configuration.

The field lacks principled guidelines for how experimental design choices affect the statistical power to detect differentially enriched gRNAs. Coverage, MOI, and FACS bin size are highly interrelated experimental parameters that can be altered to balance the number of cells needed for an experiment and the accuracy of gRNA abundance measurements (Figure 1B). For instance, at high MOI there will be multiple gRNAs per cell, which increases the effective coverage for a fixed number of cells while also increasing the variance of the observations. Here, we provide a framework to explore how these experimental parameters affect false discoveries and statistical power and provide principled guidelines for future screens.

A statistical model for CRISPR FACS screen data

We present two possible generative views of the data; one with cell-level information (the “cell-level” view) and one with aggregate information as is observed in FACS screen data (the “gene-level” view). The cell-level view offers details on all guides within a single cell (as in in single-cell sequencing), while the gene-level view provides only the total occurrences of a guide-bin combination, the default in FACS-based screens. For simplicity, we describe the gene-level model in detail below, but the more general cell-level model is discussed in detail in Supplementary Section 1. Importantly, we perform simulated screens using the cell-level model while iterating over the parameter space, but perform inference using the gene-level model to match the data available from a typical FACS screen.

In the Waterbear model (Figure 2A), there are two classes of genes, those that have no effect on the marker (denoted by $ψ_{i} = 0$ ), and those that have an effect on the marker $(ψ_{i} = 1)$ . By classifying genes into two discrete groups, our model defines different gRNA-marker behavior based on the inferred class. The central goal of Waterbear is to report, for each gene, the posterior probability that $ψ_{i} = 1$ , as well as estimated effect sizes.

When a gene has no effect $(ψ_{i} = 0)$ , all of the gRNAs targeting this gene should be similar to the null marker distribution with effects being shrunk towards zero. Thus, deviations in the observed sequencing counts between bins represent noise in the experiment. In particular, control gRNAs enable us to force $ψ_{i} = 0$ which enables a direct estimation of the experimental noise.

When a gene has an effect $(ψ_{i} = 1)$ the model allows the guide level effect estimates to vary and we employ a hierarchical process which assumes gRNAs targeting that gene behave similarly. The true gene-level effect is drawn from a Gaussian effect distribution enabling a large range of true effects, as is common in Bayesian models due to its flexibility and modeling convenience ¹⁷. Then, each gRNA has its own random effect centered around the gene-level effect distribution. Each gRNA thus results in a different marker distribution that is shifted around the gene-level effect, resulting in a gRNA-specific gRNA-marker bin count distribution.

As a result of this two-class approach, rather than focusing on gRNA- or gene-level p-values, Waterbear prioritizes genes where the individual gRNAs targeting that gene produce consistent shifts in the marker distribution (Figure 2B). In particular, a gene is considered a notable target if the inferred posterior inclusion probability (PIP), $P r [ψ_{i} = 1 | data]$ , is sufficiently close to one. Genes with high PIP have gRNAs with consistent, non-zero effects and genes with low PIP have gRNAs with effects close to zero. As the count of inconsistent gRNAs targeting a gene rises, the PIP will decrease, indicating higher uncertainty in the relationship between the target gene and the marker. Statistically, this is one of the major contributions of Waterbear; rather than aggregating results from each guide, Waterber fits a holistic model in which the data help infer whether the “effect” class or the “no effect” class is more consistent over all guides targeting a gene.

Importantly, Waterbear learns experimental parameters including the sizes of the sorted FACS bins using either the counts of the control gRNAs or, in the absence of controls, by assuming that a subset of gRNAs do not have an effect (Figure 2C). Experimentally, FACS bin sizes can shift during long sorts and opposing bins (e.g. bin one and bin four) might not always be collected in equal proportions. Using the control gRNA counts in each bin, Waterbear infers for each replicate what bin cutoffs divide the marker distribution to produce the observed count distribution.

Waterbear has relatively high sensitivity while controlling the false discovery rate

To establish the performance of Waterbear and compare it to other methods, we used the cell-level view of the generative model which includes a number of tunable parameters (Supplementary Section 2). We performed simulations and analyzed the results with the collapsed “gene-level” view of Waterbear, MAGeCK, and MAUDE (Methods) ^12,13 to observe how changing these parameters affect each method’s ability to detect hits.

To assess each method’s calibration, we compared the estimated false discovery rate (FDR) to the true FDR. Ideally, the true FDR should be at or below the estimated FDR. We first ran simulations across a range of coverage levels with 10% of the gRNAs in the library having a true effect on the marker. At all tested coverage levels, Waterbear and MAGeCK maintained lower true FDRs at the estimated 10% FDR (Figure 3A). In contrast, MAUDE’s true FDR was nearly 50% across all coverage levels. We also evaluated the calibration with low cell:gRNA ratios while varying the MOI. Increasing the MOI resulted in a higher effective coverage without having to increase cell numbers (Figure 1B). Again, MAGeCK and Waterbear controlled the FDR, while MAUDE had a highly inflated FDR for nearly all MOI levels (Figure 3B).

Figure 3: — Sensitivity and calibration plots comparing Waterbear, MAGeCK, and MAUDE on simulated screen data. For each method, we consider a hit to be ‘significant’ if the estimated FDR is less than q = 0.10. Since Waterbear produces posterior inclusion probabilities, we consider a test to be significant if PIP > 1 − q and the gene effect size (1 − q) credibility interval does not include zero. True FDR of the methods across various coverage levels (A) and various MOIs (B). Sensitivity of the methods across various coverage levels (C) and various MOIs (D).

Experiments are often collected under less ideal conditions than simulations. Given that Waterbear learns most parameters from the observed data rather than making assumptions about the experiment, we thought that it should be more robust than existing tools under non-ideal conditions. For instance, since MAGeCK was not designed to analyze FACS-based screens, using it for this type of analysis requires the implicit assumption that all FACS bins are equally sized with each other and between donors. While MAUDE supports uneven bin sizes, it requires the user to manually specify the exact bin sizes collected. In contrast, for each sample Waterbear learns the sizes of individual FACS bins directly from the data. Therefore, we expected that Waterbear would outperform both MAGeCK and MAUDE in situations where the bin sizes were misspecified. In simulations with uneven bin sizes, Waterbear and, surprisingly, MAGeCK controlled the FDR, while MAUDE did not (Supplementary Figure 2.8). It is important to note that since MAUDE does not estimate the bin sizes, we input the true simulated bin sizes for MAUDE, while letting Waterbear estimate the bin sizes and MAGeCK normalize the counts. These results show that MAGeCK’s RNA-seq style normalization still performs well in this setting.

We next tested which methods had the highest sensitivity while controlling the FDR. Ideally, sensitivity should be close to one, while the FDR is close to zero. Consistently across different coverage levels, MOI, and bin sizes, Waterbear had the highest sensitivity while controlling the FDR (Figures 3C–D). While MAUDE technically had the highest sensitivity, this came at a high false discovery cost, as nearly half of the calls were false positives. Despite not being designed for FACS screens, MaGeCK had low FDR and still had relatively high sensitivity, albeit, nearly always second to Waterbear.

Waterbear simulations suggest high sensitivity is maintained at low cell counts and high MOI

As discussed previously, we posited that the gRNA coverage, FACS bin configuration, and MOI have the largest impacts on experiments, so we focused on these parameters in our simulations. Given that we know the ground truth for these simulations, we can calculate how changing any of these parameters affects the sensitivity of the screen. Our cell-level generative model and effect size distributions are detailed in Supplementary Section 2.

Increasing the gRNA coverage with fixed MOI enables a more accurate quantification of how perturbations affect the marker (Figure 4A). However, at high coverage levels, a large increase in cell number yields only diminishing returns for quantifying gRNAs. This result suggests that identifying the minimum coverage needed to detect significant hits would have a meaningful impact on reducing the resources required for FACS screens. For example, Figure 4A shows that 38% of true hits will still be detected with only 50X coverage at a true FDR of 0.05. Furthermore, in this scenario, almost 80% of large effect size hits, defined as having an effect size greater than 0.2 standard deviations, will be detected (Figure 4B). Therefore, the strongest hits in a given screen will be detected even when the coverage is as low as 50X per gRNA. Increasing the coverage increases the number of hits detected and allows the detection of hits with a smaller effect size. However, the sensitivity starts to saturate around 1000X coverage, suggesting that it is not worthwhile to collect additional cells.

We next asked, given limited experimental resources, would it be better to collect a third biological replicate or to collect two replicates with higher coverage. At lower coverage levels, adding a third replicate only increased the mean sensitivity roughly half as much as doubling the coverage of the existing two replicates (Figure 4C). At higher coverage levels, the two replicates already start to saturate the mean sensitivity, suggesting that a third replicate would only have minimal benefit. To confirm this result on real data, we down-sampled data from Figure 5 to two replicates and found comparable sensitivity across the down-sampled counterparts and that the top hits in every configuration replicated (Supplementary Section 3.1).

Figure 5: — A) CRISPR FACS screens to identify regulators of IL2RA were performed at low (~0.3) and high (~2) MOI and low and high coverage. Low coverage (average of 195x coverage for MOI 0.3, 208x coverage for MOI 2) and high coverage (average of 1662x coverage for MOI 0.3, 1180 x coverage for MOI 2) n = 3 donors. The relative number of cells collected for each condition is shown. B) Quantification of the number of viral integrations per cell using droplet digital PCR in the unsorted cells (total) or after sorting cells that express GFP in the gRNA lentiviral construct (GFP+). C) Comparison of screens hits from the high coverage, low MOI screen vs low coverage, high MOI screen analyzed using Waterbear. D) Experimentally validated regulators of IL2RA were detected as screen hits in the low coverage, high MOI screen using Waterbear.

Commonly, screens are often performed using a low MOI of 0.3 – 0.5 to minimize the number of cells that contain more than one gRNA. However, with an MOI of 0.3, approximately 74% of cells will receive no gRNAs during the viral infection, which often limits the number of gRNAs that can be screened, especially when dealing with a limited number of starting cells. Therefore, at a fixed coverage for the experiment, even modestly increasing the MOI drastically reduces the number of cells needed (Figures 1Bii). We fixed the number of cells at 50,000 with a 6,000 gRNA library. Increasing the MOI served to effectively increase the coverage, while not having any negative effect on mean sensitivity even as the MOI approached 10 (Figure 4D). This trend is also observed when going to higher numbers of total guides and different proportions of guides with an effect (Supplementary Section 2.2). To check if the number of controls affected the inference, we also performed many of the simulations with 10, 100, and 1000 controls (Supplementary Section 3.2) which verified that this does not affect the final result. Thus, increasing the MOI, even a marginal amount, can greatly improve sensitivity.

While increasing the MOI will result in some cells containing random combinations of multiple gRNAs, in our previous screens less than 10% of the gRNAs had a statistically significant effect. In similar screens, the majority of cells will contain combinations of gRNAs where none or only one of the gRNAs have an effect (Supplementary Section 4). We define cells containing “gRNA collisions” as cells which contain two or more gRNAs that exhibit an effect on the marker distribution, but even at moderate MOIs gRNA collisions are rare. If a gRNA has an effect then cells containing both this gRNA and random mostly no-effect background gRNAs should still end up enriched in either the low or high FACS bins. However, if a gRNA has no effect and it is randomly paired with mostly no-effect background gRNAs, cells containing these no-effect gRNAs will end up equally distributed between the high and low FACS bins.

Many FACS screens only collect the outer two bins. However, we expected that sequencing four bins spanning the full distribution would increase the gRNA coverage data that Waterbear uses for inference without requiring additional input cells. We asked, given the same number of cells: How moving from two bins to four bins affects the sensitivity to detect hits (Figure 4E)? With 15 % outer bins, four bins is almost always preferable, however, the increase in sensitivity is relatively minor. However, performance greatly degrades if one only collects two outerbins at the tail of the extremes of the marker distribution (5% and 1% outer bins), but is maintained if one also collects the inner additional bins. In summary, sequencing four bins provides consistent results across coverage levels tested and is preferable when coverage is low (<= 50X gRNA coverage).

While these simulations suggest that the cumulative coverage is an important driver of sensitivity when four gRNAs target the same gene, it is unclear whether four gRNAs are necessary. We next explored how the number of gRNAs affects the sensitivity when the coverage is fixed per gene, under the assumption of high-quality gRNAs. To test the effect of the number of gRNAs targeting a gene, we performed an analysis where the total gene coverage was fixed, but was achieved using 1, 2, 3, or 4 gRNAs (Figure 4F). For example, at 1000X coverage, 1 gRNA represents 1000 cells for that gRNA, whereas in the 4 gRNA case, it represents 250 cells per gRNA. Using one gRNA almost always has less power than other configurations. As coverage increases, more gRNAs are helpful up to about three guides. This result is likely due to the additional evidence against the null as each additional gRNA can be thought of as its own “experiment” during inference, akin to a meta-analysis.

While these simulations suggest that increasing the coverage by many different means will increase the ability to detect hits, they also provide a path to reducing the number of cells and thus enabling novel screens in systems previously limited by the number of cells available. In particular, increasing the MOI while capturing four bins can increase the overall coverage with relatively small experimental burden and no additional cells.

Experiments validate that high sensitivity is maintained at low coverage and high MOI

We previously performed CRISPR FACS-based screens in primary human T cells to identify the upstream regulators of IL2RA, an important cell surface receptor implicated in numerous autoimmune diseases ^3,18,19. However, these screens were expensive and experimentally demanding because they were performed at 640–2273x coverage and each screen required 100–290 million primary cells. However, our simulations suggest that we could have identified the top hits using much lower coverage and a higher MOI. To confirm these results, we repeated the screen using an MOI of 0.3 (low MOI) and 2 (high MOI). For both MOI conditions, we collected the cells from 3 donors across 4 FACS bins at high coverage (average of 1662x coverage for MOI 0.3, 1180x coverage for MOI 2) and at low coverage (average of 195x coverage for MOI 0.3, 208x coverage for MOI 2) (Figure 5A).

To confirm that the cells were infected at the desired MOI, we quantified the genomic integration copy number for the gRNA lentiviral construct. We used droplet digital PCR to quantify the number of copies of GFP, which is part of the gRNA lentiviral construct, relative to the number of copies of the control gene RPP30 in the cells. The average copy number of GFP in the population ranged from 0.28 to 0.33 for the MOI 0.3 condition and from 1.9 to 2.3 for the MOI 2 condition across the 3 donors, closely matching the theoretical copy number for each condition (Figure 5B). These data suggest that the cells were infected at the desired MOIs and that the majority of cells in the high MOI condition contained more than one gRNA per cell. Given that the majority of cells in the low MOI condition contained no gRNAs, one would need ~5.5 fold more cells to obtain the same effective coverage as the high MOI condition.

We next compared the significant hits in the high coverage, low MOI condition compared to the low coverage, high MOI condition using Waterbear. Despite having multiple gRNAs per cell and being collected at 4.6-fold lower coverage, the top hits were highly correlated between the two conditions (Figure 5C). We previously validated 26/33 hits from our original screen by performing individual knockouts and directly measuring the effect on IL2RA protein levels using flow cytometry ³. We used these 26 genes as a set of high confidence positive controls (Supplementary Section 3.2). Waterbear detected 24 out of 26 of these validated hits in the low coverage, high MOI screen (Figure 5D). Together, these results experimentally validate both the predictions regarding coverage and MOI from our simulations with Waterbear as well as demonstrate that Waterbear is a powerful tool to analyze these screens, even under challenging conditions.

Given that Waterbear outperformed MAGeCK and MAUDE in our simulations, we wanted to compare Waterbear’s sensitivity to these tools under real conditions. On the low coverage, high MOI screen MAGeCK detected 17 out of 26 of these validated hits (Supplementary Figure 3.6). MaGeCK’s lower sensitivity to Waterbear’s (24/26) is consistent with observation that in general MaGeCK generally calls fewer hits with lower sensitivity; Waterbear called 79/1350 and MAGeCK called 31/1350 genes significant. MAUDE reports many more signals (406/1350 and detects 25/26 hits, Supplementary Figure 3.7), however we regard these as inflated given the simulations showing poor calibration at 10% FDR. These results further validate that Waterbear maintains high sensitivity, without seemingly over-calling relationships.

Discussion

Genetic screens are a powerful approach to link genes to phenotypes. Coupling CRISPR perturbations with FACS in mammalian cells enables mapping the genetic basis of many biological processes and regulatory relationships. Early CRISPR screens were performed in abundant cell lines ^20–22, but increasingly, these screens are being performed in rare primary cell types or in vivo models with limited cell numbers ^4,9,23,24. In these screens, researchers must often make choices about experimental conditions based on their intuition as there have not been good guidelines to inform how changing experimental parameters affect the ability to identify hits. Furthermore, changes such as lowering gRNA coverage or increasing the MOI increase the statistical challenge of identifying hits.

We solve these experimental design and analysis gaps by introducing Waterbear, an end-to-end experimental design and inference procedure for CRISPR FACS screens. Our cell-level generative implementation enables exploration of experimental parameters such as effect size distribution, gRNA distribution, and MOI. In conjunction with a “gene-level” version of this model for inference, Waterbear enabled us to show that (1) sensitivity saturates relatively quickly, and thus if using four bins can be dropped from about 1,000X coverage to 250X coverage with little loss in sensitivity, (2) increasing the MOI modestly from 0.3 to 2 improves effective coverage while greatly reducing the number of input cells needed, (3) the number of gRNAs targeting each gene can be reduced from four to three while achieving similar sensitivity.

Overall, our results demonstrate that the number of cells for such screens can be reduced, enabling one to assay significantly smaller cell populations. The prevailing view has been that such screens should be performed with only a single perturbation per cell. These guidelines likely emerged from siRNA screens where there are many more off-target effects. Consistent with other recent reports ^25–27, we demonstrate that increasing the MOI greatly reduces the number of uninfected input cells that are thrown away during screening, while having a similar sensitivity as the low MOI screen. Coupled with our results that coverage and gRNA number can be reduced without impacting sensitivity, these guidelines should reduce the resources and effort required to perform CRISPR FACS screens and enable a next generation of CRISPR screens in rare cell types and with in vivo models that will be essential to understand many disease-relevant processes.

Equipped with knowledge of the important experimental aspects, we developed a hierarchical statistical model that enables principled inference of guide-effects with replicates. The unique approach models the unobserved FACS marker and connects it to the observed sequencing counts. This key hierarchy is what sets the Waterbear method apart from existing methods and makes it adaptable for other FACS-based sequencing experiments. The statistical challenge lies in how one connects the unobserved marker distribution to the FACS bins, while maintaining the correlation structure in the bins and not treating each bin independently, but rather, as samples joint from an unobserved marker. Concretely, this is modeled through our function $q (\cdot)$ in the methods section.

Importantly, the Waterbear model infers experimental parameters and is robust in small sample settings which are common in these screens. We do so by “shrinking” estimates in the hierarchy, thus jointly modeling experimental parameters across replicates or within replicates when appropriate. To address the concerns of model assumptions and general performance, we additionally demonstrated that our model outperforms tools commonly used for these types of analyses (MAGeCK and MAUDE) in many different experimental settings and on real biological data.

Through both simulations and follow-up experiments, we have demonstrated an approach of model driven experimental design followed by experimentally guided inference models that can be seen as a vignette for other similar screens. For example, both the experimental design and inference framework can be modified to inform design and inference of proliferation screens, in vivo screens, scRNA-seq screens, and potentially multi-ome readout screens.

Data Availability

The raw sequencing files generated during this study are available at GEO: GSE242880.

Code Availability

The code to reproduce all of the analyses in this paper can be found at: https://github.com/pimentel/waterbear_analysis. The pipeline tool Snakemake ²⁸ was used to run Waterbear, MAGeCK, and MAUDE.

Methods

Sample collection

This study was approved by the University of California, San Francisco (UCSF) Committee on Human Research and Stanford University Panel on Medical Human Subjects (IRB#53302) and written consent was obtained from all donors. Primary human T cells were obtained through consented Leukopaks (STEMCELL) (Catalog #70500.2).

Isolation, culture and expansion of human CD4+CD25− effector T cells

PBMCs from Leukopaks (STEMCELL) were diluted 1:1 with PBS containing 2% FCS and 1mM EDTA and spun at 500g for 10 minutes. The StemCell EasySep Human Isolation Kit (Catalog # 18063) was used to isolate CD4+CD25− effector T cells from washed PBMCs while excluding CD4+CD25+ regulatory T cells. Isolated cells were then stimulated with Immunocult Human CD3/CD28/CD2 T Cell Activator (STEMCELL, Cat #10970) at 6.25 uL per 1E6 cells and grown in RPMI with 50 U/mL IL-2 (Amerisource Bergen, Cat #10101641) at a concentration of 1E6 cells/mL.

Pooled CRISPR screens

Pooled CRISPR screens were performed as in Freimer et al. ³.

Lentiviral transduction

Approximately twenty-four hours post stimulation, lentivirus containing the sgRNA library was added directly to cultured T cells at various multiplicity of infections (MOIs). After an additional twenty-four hours, the media was changed.

Cas9-ribonucleotide protein (RNP) preparation

Cas9 (MacroLab, Berkeley, 40 μM stock) ribonucleoprotein complex was delivered into the cells using a modified Guide Swap technique ²⁹. Lyophilized Dharmacon Edit-R crRNA Non-targeting Control #3 (Dharmacon, Cat #U-007503-01-05) and Dharmacon Edit-R CRISPR-Cas9 Synthetic tracrRNA (Dharmacon, Cat #U-002005-20) were resuspended at a stock concentration of 160 uM in 10 mM Tris-HCl (pH 7.4) with 150 mM KCl. They were mixed at a 1:1 ratio and incubated at 37°C for 30 minutes. A single-stranded donor oligonucleotide (ssODN; sequence: TTAGCTCTGTTTACGTCCCAGCGGGCATGAGAGTAACAAGAGGGTGTGGTAATATTACGGTACCGAGCACTATCGATACAATATGTGTCATACGGACACG) was then added at a 1:1 molar ratio of the final Cas9-Guide complex and the solution was mixed well by pipetting. The solution was incubated for an additional 5 minutes at 37°C. Cas9 protein was then added slowly at a 1:1 volume and incubated at 37°C for 15 minutes.

Electroporation

Approximately twenty-four hours after viral transduction the cells were centrifuged at 100 g for 10 minutes and then resuspended in room temperature Lonza P3 electroporation buffer (Lonza, Cat #V4XP-3032) at 1–2E6 cells per 17.8 μL. For every 17.8 μL of cells, 7.2 μL of the RNP-ssODN complex was added and the solution was mixed well. 23 uL of the cells-RNP-ssODN mixture was added to each well of a 96 well electroporation cuvette plate (Lonza, Cat #VVPA-1002), and nucleofected using the pulse code EH-115. Immediately after electroporation, 90 μL of warm media was added to each well and incubated at 37°C for 15 minutes. Cells were then pooled and grown at a concentration of 1E6 cells/mL.

Screen phenotyping and cell sorting

Cells were collected for analysis 6 days after electroporation. Cells were stained for IL2RA using an APC fluorescent antibody (Tonbo, Cat #20-0259-T100) at a 1:25 dilution according to the manufacturer’s protocol. GFP positive cells were sorted into 4 bins based on IL2RA protein levels using a BD FACS Aria II and FACSDiva version 8.0.1. Bulk GFP positive and negative population were also collected for ddPCR assessment.

GFP copy number assessment using Droplet Digital PCR

Following genomic DNA purification, an aliquot was reserved for ddPCR analysis. For each sample, 10 ng of purified genomic DNA was added to a reaction consisting of 10 μL of ddPCR Supermix for Probes (Bio-Rad, Cat #1863024), 1 μL of MseI (New England Biolabs, Cat #R0525S), 1 μL of both the reference and target primer assays, and water to a total volume of 20 μL. A Bio-Rad validated HEX ddPCR Copy Number Assay targeting the reference gene, RPP30, (Cat #10031243) was used in addition to a custom FAM Copy Number Assay targeting GFP (Bio-Rad, Cat #10042958). After assembling the ddPCR reactions, the samples were emulsified in oil droplets using the QX200 Droplet Generator (Bio-Rad, Cat # 1864002) following the manufacturer’s instructions. In brief, a DG8tm Cartridge (Cat #1864008) was inserted into a DG8 Cartridge Holder (Cat # 1863051) and each 20 μL reaction mixture was transferred to a well, followed by 70 μL of Droplet Generation Oil for Probes (Cat #1863005). The cartridge was covered with a gasket (Cat # 1863009) and loaded into the Droplet Generator. Following emulsification, the droplets were transferred to a 96 well plate (Cat #12001925). This process was repeated until all samples were transferred to the plate, which was then sealed using a pierceable heat seal (Cat # 1814040) and the PX1 PCR Plate Sealer (Cat #1814000). DNA was was fragmented and amplified using the BioRad C1000 Thermocycler (Cat #1851196) programmed to the following specifications: 95 °C for 10 minutes, followed by 40 cycles of 94°C for 20 seconds and 57°C for 1 minute, and ending with 98 °C for 10 minutes and a final 4°C hold until data acquisition. Each step was performed with a ramp rate of 2°C/sec until the final cool down at 1°C/sec. Amplification was determined with the QX200 Droplet Reader (Cat # 1864003) using the Copy Number Variant (CNV) assay on the QuantaSoft^™ Software. Positive and negative populations in each channel were manually defined using the oval tool.

Lentiviral production

14 E6 HEK 293T cells were seeded in a 15 cm tissue culture dish (Corning, Cat #430599) in Opti-MEM (UCSF CCF, Cat #CCFAC008) approximately twenty-four hours prior to transfection. Cells were transfected with the sgRNA library plasmid, and two lentiviral packaging plasmids, pMD2.G (Addgene, Cat #12259) and psPAX2 (Addgene, Cat #12260) using Lipofectamine 3000 (Lifetech, Cat #L3000075). Cells were incubated for 5 hours at 37°C. The media was then replaced with fresh Opti-MEM containing ViralBoost at 1x (Alstem, Cat #VB100). The cells were cultured for approximately twenty-four hours and then the media was collected and spun down at 300 g for 5 minutes to remove cellular debris. The media was then filtered using 0.45-μm filter and one volume of cold Lentivirus Precipitation Solution (Alstem, Cat #VC125) was added for every four volumes of lentivirus-containing media. Samples were mixed and then put at 4°C overnight. The viral media was then spun in a centrifuge at 1500 g for 30 minutes at 4°C, followed by a second spin at 1500 g for 5 minutes to concentrate the virus. The viral pellet was then resuspended in 4°C PBS (Fisher Scientific, Cat #10010049) at a 1:100 dilution of the original media volume. The concentrated virus was stored at −80°C until use.

Culture media

Cells were grown in RPMI (Sigma, Cat # R0883) with 10% FCS (Sigma, Cat # F0926), with 100 U/mL Pen-Strep (Gibco, Cat # 15140–122), 2mM L-Glutamine (Sigma, Cat # G7513), 10mM HEPES (Sigma, Cat # H0887), 1X MEM Non-essential Amino Acids (Gibco, Cat # 11140–050), 1 mM Sodium Pyruvate (Gibco, Cat # 11360–070), and 50 U/mL IL-2 (Amerisource Bergen, Cat #10101641) at a concentration of 1E6 cells/mL.

Genomic DNA extraction and preparation for next generation sequencing

After sorting, cells were washed with PBS, counted, and resuspended at up to 5E6 cells per 400 μl of lysis buffer (1 % SDS, 50 mM Tris, pH 8, 10 mM EDTA). 16μl of NaCl (5M) was added and the sample was incubated overnight at 66°C. Then 8μl of RNAse A (10mg/ml) (Thermo Scientific, Cat #EN0531) was added and incubated at 37°C for 1 hour. Then 8μl of Proteinase K (20 mg/ml) (Thermo Scientific, Cat # AM2548) was added and incubated at 55°C for 1 hour. A phase lock tube (Quantabio, Cat #2302820) was prepared for each sample and then 400μl of Phenol:Chloroform:Isoamyl Alcohol (25:24:1) was added to each tube. 400μl of the sample was then added and the tube was shaken vigorously. The sample was centrifuged for 5 min at max speed at room temperature. The aqueous phase was transferred to a low-bind Eppendorf tube (Eppendorf, Cat #022431021). 40μl of Sodium Acetate (3M), 1μl GlycoBlue (Invitrogen, Cat # AM9515), and 600μl of room temperature isopropanol were added to the sample. The sample was stored at −80°C for 30 minutes and then centrifuged for 30 minutes at max speed at 4°C. The pellet was washed with fresh 70% room temperature Ethanol and allowed to air dry for 15 minutes. Pellets were then resuspended in Zymo DNA elution buffer (Zymo, Cat No: D3004-4-10), and then incubated at 65°C for 1 hour to completely dissolve the genomic DNA.

sgRNAs were amplified from the genomic DNA as initially described by Joung et al. ³⁰. Up to 2.5 μg of genomic DNA was added to each 50 μL PCR reaction with 25 μL of NEBNext Ultra II Q5 master mix (NEB, Cat #M0544L), 1.25 μL of the 10 μM forward primer and 1.25 μL of the 10 μM reverse primer, and H2O to 50 uL. The reaction was then amplified with the following cycling conditions: 98°C for 3 minutes, followed by 23 cycles at 98°C for 10 seconds, 63°C for 10 seconds, and 72°C for 25 seconds, and finally 2 minutes at 72°C. The amplicons were cleaned with Sera-Mag Speed Beads (Cytiva, Cat #65152105050250) used at a 1X v/v ratio. The concentration of each sample was then measured using the Qubit dsDNA high sensitivity assay kit (Thermo Fisher Scientific, Cat #Q32854) and the successful removal of adapter dimers confirmed with the 4200 TapeStation system (Agilent, Cat# G299 1BA). Samples were then sequenced on an Illumina HiSeq 4000 (Illumina, Cat #15017872) using a custom sequencing primer.

Statistics and analysis

A cell-level generative model for FACS screens

For Figures 3 and 4, we simulate from a cell-level generative model. The full description of the cell-level generative model is in Supplementary Section 1.1, but we briefly describe the cell-level process here. First, a number of guides is chosen according to a Poisson distribution (MOI). Next, the guides integrating with this cell are chosen at random, according to a Dirichlet. If more than one of the included guides has an effect, we assume additive, linear effects, and no interaction effects. A value from the resulting marker distribution is drawn and the corresponding guide bin count is incremented. The final observation is a Dirichlet Multinomial centered around the total bin counts corresponding to a noisy observation of the entire process.

Parameters for simulations

The guide representation proportion is modeled as $D i r i c h l e t (τ / p 1_{p})$ where $τ > 0$ represents the dispersion and $p$ represents the number of guides. The parameter $τ$ was learned from the GFP+ population on real data. Further, effect sizes were also learned on this data set. See Supplementary Section 2 for full details.

A model for gene-level inference

Waterbear implements a hierarchical Bayesian model to infer the latent effect sizes at the gene-level. In order to allow inference with small sample sizes that are common in screens, we use various shared priors in the hierarchy. We begin with the observed bin counts in sample $n$ , resulting from guide $h$ by,

B_{n h} ∣ c_{n h}, ϕ_{n}, t_{n}, β_{h} \sim D i r M u l t (c_{n h}, ϕ_{n} q (t_{n}, β_{h})),

where $c_{n h}$ is the guide coverage, $ϕ_{n}$ is the sample specific dispersion, and $q (t_{n}, β_{h})$ takes the null probability mass in each bin ( $t_{n})$ and the guide effect size $(β_{h})$ and returns the probability mass function across the bins. In particular, $t_{n j}$ is the mass in bin $j$ under the null marker and thus we model the joint over $t_{n}$ as a Dirichlet. If we assume the marker distribution to be $N o r m a l (β_{h}, 1)$ , the mass in bin $j$ is defined as,

q {(t_{n}, β_{h})}_{j} = \Pr (Φ^{- 1} (t_{n, j - 1}) < Z - β_{h} < Φ^{- 1} (t_{n, j})),

where we define $t_{n, 0} = 0$ and $t_{n, 5} = 1$ (under a 4-bin experiment), $Z$ is a standard normal random variable, and $Φ^{- 1} (\cdot)$ is the inverse CDF of a standard normal.

To draw guide level effect sizes, first we must decide if the gene is included in the model $ψ_{g} = 1$ or if it is not $(ψ_{g} = 0)$ . Guide-level effect sizes are drawn from a prior centered at the latent gene effect, $μ_{g}$ if the gene is included in the model or a point mass at 0 if it is not according to a spike-and-slab prior,

\begin{matrix} μ_{g} ∣ ψ_{g}, σ^{2} \sim ψ_{g} N (0, σ^{2}) + (1 - ψ_{g}) δ_{0} \\ β_{h} ∣ ψ_{g}, μ_{g} \sim N (μ_{g}, τ^{2}) \\ δ_{0} \equiv 0 . \end{matrix}

At the population level, the parameter $π$ dictates the proportion of genes that are allowed to be included in the model $(ψ_{g})$ . Thus $ψ_{g} ∣ π \sim B e r n o u l l i (π)$ Of note, this prior encourages guide-level effects if there is evidence with many guides. A full specification of all priors and hyper parameters can be found in Supplementary Section 1.2. Posterior sampling is performed with a MCMC sampler implemented in NIMBLE ³¹. All results in this paper were run with 4 chains, each chain producing 5,000 samples preceded by 5,000 adaptive burn-in samples.

MAGeCK

MAGeCK was run using the following parameters:

mageck test -k [INPUT_COUNTS] -t [LOW_BIN_COLUMNS] -c [HIGH_BIN_columns]
--sort-criteria pos -n [OUTPUT_DIRECTORY]

Since MAGeCK does not model the bins, we provide the outermost bins in every analysis. Additionally, MAGeCK does not provide a two-sided test at the gene-level. The heuristic we used to deal with this was to take the minimum of the reported gene-level FDR in the positive and negative direction.

MAUDE

MAUDE was run using default parameters. For specifics, please see the pipeline code. Since MAUDE requires the gRNA proportions, we gave it the true bin fraction in every simulation. In experimental data we gave MAUDE the GFP+ proportion. Since MAUDE does not have a direct way to merge replicates, we took the gene-level replicate p-values and merged them using Fisher’s method. The resulting Fisher’s p-value was then Benjamini-Hochberg FDR correcte d³².

Supplementary Material

Supplement 1

media-1.pdf^{(1.4MB, pdf)}

Acknowledgements

We thank Clemens Weiss for helpful comments. This research was supported by NIH grant R01HG008140 and UO1HG012069 (JKP). A.M. is a member of the Parker Institute for Cancer Immunotherapy (PICI), and has received funding from the Innovative Genomics Institute (IGI), the Cancer Research Institute (CRI) Lloyd J. Old STAR award, a gift from the Jordan Family, a gift from the Byers family, and funds from the Simons Foundation and the CRISPR Cures for Cancer Initiative. H.P. is supported by the HHMI Hanna H. Gray Fellowship and the Sloan Foundation Fellowship. Sequencing was carried out at the UCSF CAT, supported by a PBBR grant. We would like to acknowledge the support of the Gladstone Institute Flow Cytometry Core. We would like to thank the Stanford Research Computing Center for providing computational resources and support.

Footnotes

Competing Interests

A.M. is a cofounder of Site Tx, Arsenal Biosciences, Spotlight Therapeutics and Survey Genomics, serves on the boards of directors at Site Tx, Spotlight Therapeutics and Survey Genomics, is a member of the scientific advisory boards of Site Tx, Arsenal Biosciences, Spotlight Therapeutics, Survey Genomics, NewLimit, Amgen, and Tenaya, owns stock in Arsenal Biosciences, Site Tx, Spotlight Therapeutics, NewLimit, Survey Genomics, Tenaya and Lightcast and has received fees from Site Tx, Arsenal Biosciences, Spotlight Therapeutics, NewLimit, 23andMe, PACT Pharma, Juno Therapeutics, Tenaya, Lightcast, Trizell, Vertex, Merck, Amgen, Genentech, GLG, ClearView Healthcare, AlphaSights, Rupert Case Management, Bernstein and ALDA. A.M. is an investor in and informal advisor to Offline Ventures and a client of EPIQ. The Marson laboratory has received research support from the Parker Institute for Cancer Immunotherapy, the Emerson Collective, Juno Therapeutics, Epinomics, Sanofi, GlaxoSmithKline, Gilead and Anthem and reagents from Genscript and Illumina. J.W.F. was a consultant for NewLimit and has been an employee of Genentech since July 2022. J.W.F. owns stock in Roche. The remaining authors declare no competing interests.

References

1.Sanjana N. E. Genome-scale CRISPR pooled screens. Anal. Biochem. 532, 95–99 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Doench J. G. Am I ready for CRISPR? A user’s guide to genetic screens. Nat. Rev. Genet. 19, 67–80 (2017). [DOI] [PubMed] [Google Scholar]
3.Freimer J. W. et al. Systematic discovery and perturbation of regulatory genes in human T cells reveals the architecture of immune networks. Nat. Genet. 54, 1133–1144 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Cortez J. T. et al. CRISPR screen in regulatory T cells reveals modulators of Foxp3. Nature 582, 416–420 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Parnas O. et al. A Genome-wide CRISPR Screen in Primary Immune Cells to Dissect Regulatory Networks. Cell 162, 675–686 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Henriksson J. et al. Genome-wide CRISPR Screens in T Helper Cells Reveal Pervasive Crosstalk between Activation and Differentiation. Cell vol. 176 882–896.e18 Preprint at 10.1016/j.cell.2018.11.044 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Brockmann M. et al. Genetic wiring maps of single-cell protein states reveal an off-switch for GPCR signalling. Nature 546, 307–311 (2017). [DOI] [PubMed] [Google Scholar]
8.Schmidt R. et al. CRISPR activation and interference screens decode stimulation responses in primary human T cells. Science 375, eabj4008 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Kuhn Maria, Santinha António J., Platt Randall J.. Moving from in vitro to in vivo CRISPR screens. Gene and Genome Editing 2, 100008 (2021). [Google Scholar]
10.Manguso R. T. et al. In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target. Nature 547, 413–418 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Wei J. et al. Targeting REGNASE-1 programs long-lived effector T cells for cancer therapy. Nature 576, 471–476 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Li W. et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 15, 1–12 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
13.de Boer C. G., Ray J. P., Hacohen N. & Regev A. MAUDE: inferring expression changes in sorting-based CRISPR screens. Genome Biol. 21, 1–16 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Shabram P. & Aguilar-Cordova E. Multiplicity of infection/multiplicity of confusion. Mol. Ther. 2, 420–421 (2000). [DOI] [PubMed] [Google Scholar]
15.Nagy T. & Kampmann M. CRISPulator: a discrete simulation tool for pooled genetic screens. BMC Bioinformatics 18, 1–12 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Bock C. et al. High-content CRISPR screening. Nature Reviews Methods Primers 2, 1–23 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Gelman A. Bayesian Data Analysis. (Chapman and Hall/CRC, 1995). [Google Scholar]
18.Abbas A. K., Trotta E., R Simeonov D., Marson A. & Bluestone J. A. Revisiting IL-2: Biology and therapeutic prospects. Sci Immunol 3, (2018). [DOI] [PubMed] [Google Scholar]
19.Spolski R., Li P. & Leonard W. J. Biology and regulation of IL-2: from molecular mechanisms to human therapy. Nat. Rev. Immunol. 18, 648–659 (2018). [DOI] [PubMed] [Google Scholar]
20.Shalem O. et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science 343, 84–87 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Wang T., Wei J. J., Sabatini D. M. & Lander E. S. Genetic screens in human cells using the CRISPR-Cas9 system. Science 343, 80–84 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Gilbert L. A. et al. Genome-Scale CRISPR-Mediated Control of Gene Repression and Activation. Cell 159, 647–661 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Dong M. B. et al. Systematic Immunotherapy Target Discovery Using Genome-Scale In Vivo CRISPR Screens in CD8 T Cells. Cell 178, 1189–1204.e23 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Chen Z. et al. In vivo CD8 T cell CRISPR screening reveals control by Fli1 in infection and cancer. Cell 184, 1262–1280.e22 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Zhu S. et al. Guide RNAs with embedded barcodes boost CRISPR-pooled screens. Genome Biol. 20, 20 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Gasperini M. et al. A Genome-wide Framework for Mapping Gene Regulation via Cellular Genetic Screens. Cell 176, 1516 (2019). [DOI] [PubMed] [Google Scholar]
27.Yao D. et al. Scalable genetic screening for regulatory circuits using compressed Perturb-seq. Nat. Biotechnol. (2023) doi: 10.1038/s41587-023-01964-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Köster J. & Rahmann S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 34, 3600–3600 (2018). [DOI] [PubMed] [Google Scholar]
29.Ting P. Y. et al. Guide Swap enables genome-scale pooled CRISPR-Cas9 screening in human primary cells. Nat. Methods 15, 941–946 (2018). [DOI] [PubMed] [Google Scholar]
30.Joung J. et al. Genome-scale CRISPR-Cas9 knockout and transcriptional activation screening. Nat. Protoc. 12, 828–863 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Valpine P. de et al. Programming With Models: Writing Statistical Algorithms for General Model Structures With NIMBLE. Journal of Computational and Graphical Statistics vol. 26 403–413 Preprint at 10.1080/10618600.2016.1172487 (2017). [DOI] [Google Scholar]
32.Hochberg Y. B. A. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society: Series B (Methodological) 289–300. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1

media-1.pdf^{(1.4MB, pdf)}

Data Availability Statement

The raw sequencing files generated during this study are available at GEO: GSE242880.

[R1] 1.Sanjana N. E. Genome-scale CRISPR pooled screens. Anal. Biochem. 532, 95–99 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Doench J. G. Am I ready for CRISPR? A user’s guide to genetic screens. Nat. Rev. Genet. 19, 67–80 (2017). [DOI] [PubMed] [Google Scholar]

[R3] 3.Freimer J. W. et al. Systematic discovery and perturbation of regulatory genes in human T cells reveals the architecture of immune networks. Nat. Genet. 54, 1133–1144 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Cortez J. T. et al. CRISPR screen in regulatory T cells reveals modulators of Foxp3. Nature 582, 416–420 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Parnas O. et al. A Genome-wide CRISPR Screen in Primary Immune Cells to Dissect Regulatory Networks. Cell 162, 675–686 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Henriksson J. et al. Genome-wide CRISPR Screens in T Helper Cells Reveal Pervasive Crosstalk between Activation and Differentiation. Cell vol. 176 882–896.e18 Preprint at 10.1016/j.cell.2018.11.044 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Brockmann M. et al. Genetic wiring maps of single-cell protein states reveal an off-switch for GPCR signalling. Nature 546, 307–311 (2017). [DOI] [PubMed] [Google Scholar]

[R8] 8.Schmidt R. et al. CRISPR activation and interference screens decode stimulation responses in primary human T cells. Science 375, eabj4008 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Kuhn Maria, Santinha António J., Platt Randall J.. Moving from in vitro to in vivo CRISPR screens. Gene and Genome Editing 2, 100008 (2021). [Google Scholar]

[R10] 10.Manguso R. T. et al. In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target. Nature 547, 413–418 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Wei J. et al. Targeting REGNASE-1 programs long-lived effector T cells for cancer therapy. Nature 576, 471–476 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Li W. et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 15, 1–12 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] 13.de Boer C. G., Ray J. P., Hacohen N. & Regev A. MAUDE: inferring expression changes in sorting-based CRISPR screens. Genome Biol. 21, 1–16 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Shabram P. & Aguilar-Cordova E. Multiplicity of infection/multiplicity of confusion. Mol. Ther. 2, 420–421 (2000). [DOI] [PubMed] [Google Scholar]

[R15] 15.Nagy T. & Kampmann M. CRISPulator: a discrete simulation tool for pooled genetic screens. BMC Bioinformatics 18, 1–12 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Bock C. et al. High-content CRISPR screening. Nature Reviews Methods Primers 2, 1–23 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] 17.Gelman A. Bayesian Data Analysis. (Chapman and Hall/CRC, 1995). [Google Scholar]

[R18] 18.Abbas A. K., Trotta E., R Simeonov D., Marson A. & Bluestone J. A. Revisiting IL-2: Biology and therapeutic prospects. Sci Immunol 3, (2018). [DOI] [PubMed] [Google Scholar]

[R19] 19.Spolski R., Li P. & Leonard W. J. Biology and regulation of IL-2: from molecular mechanisms to human therapy. Nat. Rev. Immunol. 18, 648–659 (2018). [DOI] [PubMed] [Google Scholar]

[R20] 20.Shalem O. et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science 343, 84–87 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Wang T., Wei J. J., Sabatini D. M. & Lander E. S. Genetic screens in human cells using the CRISPR-Cas9 system. Science 343, 80–84 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Gilbert L. A. et al. Genome-Scale CRISPR-Mediated Control of Gene Repression and Activation. Cell 159, 647–661 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] 23.Dong M. B. et al. Systematic Immunotherapy Target Discovery Using Genome-Scale In Vivo CRISPR Screens in CD8 T Cells. Cell 178, 1189–1204.e23 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Chen Z. et al. In vivo CD8 T cell CRISPR screening reveals control by Fli1 in infection and cancer. Cell 184, 1262–1280.e22 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Zhu S. et al. Guide RNAs with embedded barcodes boost CRISPR-pooled screens. Genome Biol. 20, 20 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Gasperini M. et al. A Genome-wide Framework for Mapping Gene Regulation via Cellular Genetic Screens. Cell 176, 1516 (2019). [DOI] [PubMed] [Google Scholar]

[R27] 27.Yao D. et al. Scalable genetic screening for regulatory circuits using compressed Perturb-seq. Nat. Biotechnol. (2023) doi: 10.1038/s41587-023-01964-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Köster J. & Rahmann S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 34, 3600–3600 (2018). [DOI] [PubMed] [Google Scholar]

[R29] 29.Ting P. Y. et al. Guide Swap enables genome-scale pooled CRISPR-Cas9 screening in human primary cells. Nat. Methods 15, 941–946 (2018). [DOI] [PubMed] [Google Scholar]

[R30] 30.Joung J. et al. Genome-scale CRISPR-Cas9 knockout and transcriptional activation screening. Nat. Protoc. 12, 828–863 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] 31.Valpine P. de et al. Programming With Models: Writing Statistical Algorithms for General Model Structures With NIMBLE. Journal of Computational and Graphical Statistics vol. 26 403–413 Preprint at 10.1080/10618600.2016.1172487 (2017). [DOI] [Google Scholar]

[R32] 32.Hochberg Y. B. A. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society: Series B (Methodological) 289–300. [Google Scholar]

PERMALINK

This is a preprint.

A model for accurate quantification of CRISPR effects in pooled FACS screens

Harold Pimentel

Jacob W Freimer

Maya M Arce

Christian M Garrido

Alexander Marson

Jonathan K Pritchard

Roles

Abstract

Introduction

Results

Overview of CRISPR FACS screens and tunable parameters

Figure 1: Overview of CRISPR FACS screens and tunable parameters.

A statistical model for CRISPR FACS screen data

Figure 2: A statistical model for CRISPR FACS screen data.

Waterbear has relatively high sensitivity while controlling the false discovery rate

Figure 3: Waterbear has relatively high sensitivity while controlling the false discovery rate.

Waterbear simulations suggest high sensitivity is maintained at low cell counts and high MOI

Figure 4: Waterbear simulations suggest high sensitivity is maintained at low cell counts and high MOI.

Figure 5: Experiments validate that high sensitivity is maintained at low cell counts and high MOI.

Experiments validate that high sensitivity is maintained at low coverage and high MOI

Discussion

Data Availability

Code Availability

Methods

Sample collection

Isolation, culture and expansion of human CD4+CD25− effector T cells

Pooled CRISPR screens

Lentiviral transduction

Cas9-ribonucleotide protein (RNP) preparation

Electroporation

Screen phenotyping and cell sorting

GFP copy number assessment using Droplet Digital PCR

Lentiviral production

Culture media

Genomic DNA extraction and preparation for next generation sequencing

Statistics and analysis

A cell-level generative model for FACS screens

Parameters for simulations

A model for gene-level inference

MAGeCK

MAUDE

Supplementary Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases