Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Jul 11.
Published in final edited form as: Nat Methods. 2018 Dec;15(12):992–993. doi: 10.1038/s41592-018-0225-6

CRISPR-SURF: discovering regulatory elements by deconvolution of CRISPR tiling screen data

Jonathan Y Hsu 1,2, Charles P Fulco 3,4, Mitchel A Cole 5,6,7, Matthew C Canver 2,5,6,7, Danilo Pellin 8,9, Falak Sher 5,6,7, Rick Farouni 2,3,10, Kendell Clement 2,3,10, Jimmy A Guo 2, Luca Biasco 8,9, Stuart H Orkin 5,6,7,11, Jesse M Engreitz 3,12, Eric S Lander 3,4,13, J Keith Joung 2,10, Daniel E Bauer 5,6,7,*, Luca Pinello 2,3,10,*
PMCID: PMC6620603  NIHMSID: NIHMS1030070  PMID: 30504875

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

To the Editor –

Tiling screens that use CRISPR-Cas technologies provide a powerful approach for the mapping of regulatory elements to phenotypes of interest16. Here we present CRISPR screening uncharacterized region function (CRISPR-SURF), a deconvolution framework that can be used to identify functional regulatory regions in the genome from data generated by CRISPR-Cas nuclease, CRISPR interference (CRISPRi), or CRISPR activation (CRISPRa) tiling screens. CRISPR-SURF can be run as a stand-alone command line utility (https://github.com/pinellolab/CRISPR-SURF) or as a web application (http://crisprsurf.pinellolab.org/) (Supplementary Note 1).

The methodology underlying the CRISPR-SURF framework leverages the concept that single guide RNAs (sgRNAs) represent a functional readout for base pairs within the perturbation range. This range depends on the CRISPR screening approach used: CRISPR-Cas nucleases introduce insertion and deletion (indel) mutations of varying lengths (typically < 30 bp, although potentially varying with cell type), whereas CRISPRi and CRISPRa strategies may remodel chromatin structure across hundreds of nucleotides. Importantly, each CRISPR technology offers its own advantage: CRISPRi and CRISPRa strategies increase the likelihood of detecting regulatory elements, given their larger perturbation ranges, whereas CRISPR-Cas nucleases provide higher resolution on the boundaries of regulatory elements, given their sharper perturbation windows. Because each sgRNA perturbs variable-size regions around its target site, the sgRNA data from CRISPR tiling screens can be seen as imprecise measurements of an underlying genomic regulatory signal. To address this variable, we model these imprecise measurements by means of a convolution operation that accounts for the perturbation profiles associated with different CRISPR technologies.

CRISPR-SURF deconvolves tiling screen data to find the genomic regulatory signal that best explains the observed sgRNA scores given the perturbation profile and sgRNA spacing (Fig. 1). The CRISPR-SURF framework accounts for overlapping perturbation profiles between neighboring sgRNAs and leverages shared information to infer the underlying genomic regulatory signal even from noisy measurements. The exact sgRNA targeting coordinates are also taken into account, thus allowing for location-dependent statistical tests with a power that reflects the local density of sgRNAs in a region. This enables CRISPR-SURF to estimate perturbation-specific and position-specific statistical power for CRISPR tiling screens (Supplementary Note 2).

Fig. 1 |. CRISPR-SURF deconvolution framework.

Fig. 1 |

An illustration of the deconvolution based on sgRNA targeting positions, different perturbation profiles (CRISPRi/a and CRISPR-Cas), and enrichment scores.

We evaluated the performance of CRISPR-SURF by using three published CRISPR tiling screens spanning CRISPR-Cas91, CRISPRi2, and CRISPRa3 modalities. For all three datasets, CRISPR-SURF reliably identified all of the experimentally validated regulatory elements. CRISPR-SURF further identified potentially novel regulatory regions supported by both chromatin accessibility and epigenetic marks (Supplementary Notes 3 and 4, Supplementary Figs. 13). We elaborate on key differences between CRISPR-SURF and the analysis methods used in these previous studies in Supplementary Notes 5 and 6.

Furthermore, we carried out two matched CRISPR tiling screens using CRISPR-Cas9 (SpCas9) and CRISPRi (dCas9-KRAB) on the BCL11A locus (Supplementary Note 7) and found that significant regions identified within previously validated functional enhancers1,7,8 were narrower in the CRISPR- Cas9 screen than in the CRISPRi screen, consistent with the narrower perturbation profiles of CRISPR-Cas9 indel mutations compared with those of CRISPRi epigenetic modifications (Supplementary Fig. 4). In summary, CRISPR-SURF leverages the broad CRISPRi and CRISPRa perturbation profile for efficient enhancer discovery and the narrow CRISPR-Cas perturbation profile for high-resolution mapping of critical elements within enhancers.

Supplementary Material

Supplementary Text and Figures

Acknowledgements

We thank R. Kurita and Y. Nakamura (RIKEN BioResource Center, Tsukuba, Japan) for sharing HUDEP-2 cells. L.P. is supported by NHGRI Career Development Award R00 HG008399 and CEGS RM1HG009490. D.E.B. is support by NIH R03 DK109232, NIH DP2 0D022716, NIH P01 HL032262, the Burroughs Wellcome Fund, and the Doris Duke Charitable Foundation. J.K.J. is supported by NIH R35 GM118158, NIH RM1 HG009490, and the Desmond and Ann Heathwood MGH Research Scholar Award. M.C.C. is supported by NIDDK Award F30-DK103359. J.M.E. is supported by NIH NHGRI 1K99HG009917-01 and the Harvard Society of Fellows.

Footnotes

Competing interests

J.K.J. has financial interests in Beam Therapeutics,Editas Medicine, Endcadia, Monitor Biotechnologies (formerly known as Beacon Genomics), Pairwise Plants, Poseida Therapeutics, and Transposagen Biopharmaceuticals. J.K.J. holds equity in EpiLogic Therapeutics. J.K.J’s interests were reviewed and are managed by Massachusetts General Hospital and Partners HealthCare in accordance with their conflict-of-interest policies. J.K.J. is a member of the Board of Directors of the American Society of Gene and Cell Therapy, and an inventor on patents and patent applications covering CRISPR-based nucleases and gene regulatory proteins. E.S.L. serves on the Board of Directors for Codiak BioSciences and Neon Therapeutics, and on the Scientific Advisory Board of F-Prime Capital Partners and Third Rock Ventures; he is also affiliated with several nonprofit organizations, including through his service on the Board of Directors of the Innocence Project and Biden Cancer Initiative and the Board of Trustees for the Parker Institute for Cancer Immunotherapy. He has served and continues to serve on various federal advisory committees. The Broad Institute, which E.S.L. directs, holds patents and has filed patent applications on technologies related to other aspects of CRISPR.

Additional information

Supplementary information is available for this paper at https://doi.org/10.1038/s41592-018-0225-6.

Data availability

The data are available at SRA under project number PRJNA494935.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Text and Figures

RESOURCES