Abstract
CRISPR-Cas9 is a powerful genome editing technology in which a single guide RNA (sgRNA) confers target site specificity to achieve Cas9-mediated genome editing. Numerous sgRNA design tools have been developed based on reference genomes for humans and model organisms. However, existing resources are not optimal as genetic mutations or single nucleotide polymorphisms (SNPs) within the targeting region affect the efficiency of CRISPR-based approaches by interfering with guide-target complementarity. To facilitate identification of sgRNAs (1) in non-reference genomes, (2) across varying genetic backgrounds, or (3) for specific targeting of SNP-containing alleles, for example, disease relevant mutations, we developed a web tool, SNP-CRISPR (https://www.flyrnai.org/tools/snp_crispr/). SNP-CRISPR can be used to design sgRNAs based on public variant data sets or user-identified variants. In addition, the tool computes efficiency and specificity scores for sgRNA designs targeting both the variant and the reference. Moreover, SNP-CRISPR provides the option to upload multiple SNPs and target single or multiple nearby base changes simultaneously with a single sgRNA design. Given these capabilities, SNP-CRISPR has a wide range of potential research applications in model systems and for design of sgRNAs for disease-associated variant correction.
Keywords: genome editing, CRISPR, genome variant
The CRISPR-Cas9 system, a repurposed bacterial adaptive immune system, is a powerful programmable genome editing tool for research, including in eukaryotic systems, that also has potential for gene therapy (Pickar-Oliver and Gersbach 2019). With this system, Streptococcus pyogenes Cas9 nuclease is directed to a target site or sites in the genome that have a unique 20 nt sequence followed by a 3 bp sequence conforming to NGG known as the protospacer adjacent motif (PAM). A double-strand break (DSB) induced by Cas9 nuclease recruits the cellular machinery, which can repair the break either through the error-prone non-homologous end-joining (NHEJ) pathway or through homology directed repair (HDR). NHEJ often results in insertions and/or deletions (indels), which can result in frameshift mutations. HDR allows researchers to introduce or ‘knock in’ specific DNA sequences, such as precise nucleotide changes or reporter cassettes.
In addition, catalytically dead forms of Cas9 have been fused with different effector proteins to manipulate DNA or gene expression (Pickar-Oliver and Gersbach 2019). For example, to correct disease-causative point mutations, CRISPR-Cas9 mediated DNA base editing has been developed as a promising method to convert undesired spontaneous point mutations to the wild-type nucleotide (Gaudelli et al. 2017; Komor et al. 2016; Pickar-Oliver and Gersbach 2019). DNA base editing can be achieved by fusing a Cas9 nickase with a cytidine deaminase enzyme and uracil glycosylate inhibitor to achieve a C->T (or G->A) substitution. Similarly, a transfer RNA adenosine deaminase is fused to a catalytically dead Cas9 to generate A->G (or T->C) conversion. Notably, unlike for knock-in, DNA editing-induced changes occur without a DSB and without the need for introduction of a donor template. Disease-relevant mutations in mammalian cells can be corrected with base editing strategies (Dandage et al. 2019). Prime Editing based on the fusion of Cas9 and reverse transcriptase, is another recently published technique that could add more precision and flexibility to CRISPR editing (Anzalone et al. 2019). Thus, programmable editing of a target base in genomic DNA provides a potential therapy for genetic diseases that arise from point mutations.
Single-nucleotide polymorphisms (SNPs) can be defined as single-nucleotide differences from reference genomes. The targeting efficiency of Cas9 has been examined using data from genome-wide studies combined with machine learning (Chuai et al. 2018; Doench et al. 2014; Listgarten et al. 2018; Najm et al. 2018; Tycko et al. 2019). The position of specific nucleotides in the target sequences has been shown to affect targeting efficiency, which is the major determinant of CRISPR-Cas9 dependent genetic modification (Doench et al. 2014; Housden et al. 2015). Therefore, the presence of a SNP (or of an indel) can cause inefficient binding of the Cas9-sgRNA ribonucleoprotein (RNP) complex, resulting in inefficient genome editing.
Many rules for sgRNA design are generalizable and many web tools have been developed to predict sgRNA sequences for the human genome and genomes of numerous model organisms. There are two types of input that sgRNA design tools typically accept: (1) gene symbols or genome coordinates and (2) sequences. Resources that support the former typically precompute sgRNAs based on annotated reference genome information. Moreover, these sgRNA sequences are designed based on a single wild-type reference sequence without considering variants (e.g., CHOPCHOP, GuideScan; Table 1). With the second type of input, some tools BLAST user input sequence against a reference genome and correct any differences introduced by the user, thus making it impossible to design sgRNAs against a variant allele (this is the case for example for CRISPR-ERA and CRISPR-DT; Table 1). With others, it is possible to design a sgRNA to target variant allele (e.g., E-CRISPR, CRISPOR and CRISPRscan; Table 1). However, these tools require that the user retrieve the genomic sequences surrounding the variant and select designs that specifically target the variant region after the program sends back all the results. For a bench scientist, this is a time-consuming and error-prone process. For example, when the coding variant is near an exon-intron boundary, the user needs to retrieve the exon sequence as well as the intron sequence and enter these into the program. In addition, the user cannot do batch entries with most of the online tools that take sequence as an input (e.g., E-CRISPR, CRISPOR and CRISPRscan; Table 1). A few command line programs that take sequence as the input were developed for batch design; however, based on our experience, these tools or specific features either do not work or are not easily configured by bench scientists without programing experience (Table 1). In addition, researchers might need features that are missing from current tools, such as an option to target SNPs either together or independently of one another when the SNPs are nearby one another. Moreover, the ability to compare sgRNA designs targeting the same locus in the wild-type and the variant allele in terms of efficiency and specificity would also be very useful when targeting a heterozygous variant.
Table 1. A Survey of CRISPR design tools.
Tool | Type | Web Input | Web batch entry? | Consider variant? | Variant specific designs compared to SNP-CRISPR | URL/Reference |
---|---|---|---|---|---|---|
CHOPCHOP | web-based | Gene, Genome coordinates | No | No | None | chopchop.rc.fas.harvard.edu (Labun et al. 2019) |
GuideScan | web-based | Gene, Genome coordinates | Yes | No | None | guidescan.com (Perez et al. 2017) |
DRSC find CRISPR Tool | web-based | Gene, Genome coordinates | No | No | None | flyrnai.org/crispr (Housden et al. 2015) |
E-CRISPR | web-based | Gene, Sequence | No | Possible | Fewera | e-crisp.org (Heigwer et al. 2014) |
CRISPOR | web-based | Gene, Sequence | No (seq) Yes (gene) | Possible | Samea | crispor.tefor.net (Haeussler et al. 2016) |
CRISPRscan | web-basedb | Gene, Sequence | No | Possible | Fewera | crisprscan.org (Moreno-Mateos et al. 2015) |
CRISPRdirect | web-based | Gene, Sequence | No | Possible | Samea | crispr.dbcls.jp (Naito et al. 2015) |
CRISPR-ERA | web-based | Gene, Sequence, Genome coordinates | No | No | None | crispr-era.stanford.edu (Liu et al. 2015) |
CRISPR-DT | web-based | Sequence | No | No | None | bioinfolab.miamioh.edu/CRISPR-DT (Zhu and Liang 2019) |
DeepCRISPR | web-based | Sequence | No | Undeterminedc | deepcrispr.net (Chuai et al. 2018) | |
GT-Scan | web-based | Sequence | No | Possible | Samea | gt-scan.csiro.au/ (Oliveros et al. 2016) |
GPP sgRNA Designer | web-based | Gene, Sequence | Yes | Possible | Samea | portals.broadinstitute.org/gpp/public/analysis-tools/sgrna-design (Sanson et al. 2018) |
CCTop | web-based | Sequence | Yes | Possible | Samea | crispr.cos.uni-heidelberg.de (Stemmer et al. 2015) |
Cas-Designer | web-based | Sequence | Yes | Possible | Samea | rgenome.net/cas-designer (Park et al. 2015) |
CRISPR Optimal Target Finder | web-based | Sequence | No | Possible | Samea | targetfinder.flycrispr.neuro.brown.edu (Gratz et al. 2014) |
Breaking-Cas | web-basedb | Sequence | Yes | Possible | Samea | bioinfogp.cnb.csic.es/tools/breakingcas (Oliveros et al. 2016) |
Off-Spotter | web-basedb | Sequence | No | Possible | Samea | cm.jefferson.edu/Off-Spotter (Pliatsika and Rigoutsos 2015) |
Protospacer | GUI (OSX only) | NA | NA | No | None | protospacer.com |
CrisPam | command line | NA | NA | Undeterminedc | github.com/ristllin/CrisPam (Rabinowitz et al. 2019) | |
CRISPRseek | command line | NA | NA | Possible | Samea | bioconductor.org/packages/release/bioc/html/CRISPRseek.html (Zhu et al. 2014) |
AlleleAnalyzer | command line | NA | NA | Yes | Same | github.com/keoughkath/AlleleAnalyzer (Keough et al. 2019) |
SNP-CRISPR | web-basedb | Variants (e.g., VCF file) | Yes | Yes | NA | flyrnai.org/tools/snp_crispr |
Note: We limited our survey to CRISPR design tools that do not require registration or user login.
Users need to provide flanking sequence and filter out irrelevant designs.
A command line version is also available.
Test was attempted but results were not obtained.
To broaden the application of sgRNA design tools to better accommodate SNPs and small indels, we developed SNP-CRISPR. SNP-CRISPR is a web-based tool that accepts variant annotations as the input and uses rigorous off-target search algorithms to predict the specificity of each target site in the genome for wild-type and variant sequences. SNP-CRISPR offers customized options and allows users to easily and rapidly select optimal variant-specific CRISPR-Cas9 target sequences in genes from a variety of organisms.
Methods
Pipeline development
The SNP-CRISPR pipeline environment is managed using the Conda package and environment management system (Anaconda 2016). This allows for convenient reproduction of the necessary software dependencies and versions on different machines. The majority of the pipeline logic at SNP-CRISPR is written in Python using Biopython, with some Perl used for the BLAST and efficiency score analysis (Cock et al. 2009). Potential off-target loci are evaluated by performing a BLAST search of each design against the species reference genome. An off-target score is assigned based on both the number of hits found in the BLAST results and the number of mismatched nucleotides per off-target hit. Designs are also assigned an efficiency score that was computed using a position matrix; detailed information about the input dataset and algorithm can be found in (Housden et al. 2015). GNU Parallel is used to allow for parallelized computation of designs on different chromosomes and with different parameters for improved performance on multi-core systems (Tange 2018). The full source code of the pipeline, including instructions for installation and use, is available at https://github.com/jrodiger/snp_crispr.
Implementation of the web-based tool
The SNP-CRISPR web tool (https://www.flyrnai.org/tools/snp_crispr/) is located at the web site of the Drosophila RNAi Screening Center (DRSC). The back-end is written in PHP using the Symfony framework and the front end HTML pages take advantage of the Twig template engine. The JQuery JavaScript library with the DataTables plugin is used for handling Ajax calls and displaying table views. The Bootstrap framework and some custom CSS is also used on the user interface. Hosting by Harvard Medical School Research Computing makes it possible to provide a web-facing user interface to run the SNP-CRISPR core pipeline on Harvard Medical School’s “O2” high-performance computing cluster. When jobs are submitted from the website, the form parameters and uploaded input file path are passed to a bash script controlling the pipeline, which is then run as a cluster job. When the job is complete, an E-mail is sent to the user with a URL that contains a unique ID used to retrieve the corresponding results.
Data availability
SNP-CRISPR is available for online use without any restrictions at https://www.flyrnai.org/tools/snp_crispr.
The source code for the pipeline, including instructions for installation and use, is available at https://github.com/jrodiger/snp_crispr
Results
SNP-CRISPR web tool
The web-based version of SNP-CRISPR provides the functionality of the design pipeline with an easy-to-use interface and interactive results view. Users can select up to 2,000 variants of interest in Variant Call Format (VCF) or a csv file in the provided format, and then upload this file on the SNP-CRISPR homepage. The acceptable variants include single nucleotide changes, small insertions and small deletions. The user then chooses the species and whether to create designs that target each input variant individually or to target all SNPs within each potential sgRNA sequence. When a user submits input, the web logic starts a job on the Harvard Medical School “O2” high-performance computing cluster, using the uploaded file and parameters as input for the pipeline. After the pipeline finishes running, an automated E-mail is sent to the user with a link to a webpage at which the user can view and export results. For a couple of variants, the design pipeline usually takes up to a few minutes and with an input of 2,000 human SNP variants, it takes about half an hour for users to receive the results by E-mail. The result page shows the wild-type and variant designs with corresponding scores in a tabular view that can be sorted by one or more columns. The output table also lists the genome targeting position of each sgRNA and the position of the variant within the sgRNA sequence relevant to the PAM sequence. The variant is shown in lowercase, which can be easily spotted by users. Using the checkboxes in the left-most column, users can opt to export all or only selected rows to an Excel or csv file. Currently, SNP-CRISPR supports reference genomes from human, mouse, rat, fly and zebrafish (Figure 1).
Computation of potential variant-targeting sgRNAs
Users are required to upload variant information in one of the supported formats including the genome coordinates, the sequence of the reference alleles and the sequence of the variant alleles. First, SNP-CRISPR validates the input reference sequences and will warn users if the submitted reference sequences does not match, which might reflect a different version of the genome assembly being used in the user input vs. SNP-CRISPR. After validation, SNP-CRISPR then re-constructs the template sequence, swapping the reference nucleotide with the variant nucleotide for SNPs, while inserting or deleting the corresponding fragment for indel type variants. Second, SNP-CRISPR computes potential variant-targeting sgRNAs based on availability of PAM sequences in the neighboring region since the presence of a PAM sequence (NGG or NAG) is one of the few requirements for binding. Third, sgRNA designs that contain four or more consecutive thymine residues, which can result in termination of RNA transcription by RNA polymerase III, are filtered out (Gao et al. 2018). Cas9 can have off-target activity across the genome and tolerance to mismatches shows significant variance depending on the position within the sgRNA (Fu et al. 2013; Hsu et al. 2013). Therefore, for each sgRNA design, SNP-CRISPR computes an efficiency score (Housden et al. 2015) and a specificity score calculated based on BLAST results against the reference genome. All possible sgRNAs are provided to the user along with specificity and efficiency scores, without further filtering; filtering options are available for custom applications based on user needs (Figure 2). With the command line version of the program, it is possible to calculate specificity scores based on non-reference genomes if users also provide the non-reference genome as additional input. Users can generate the non-reference genome by modifying the reference based on genome-scale variant information or via de novo assembly. However, supporting this with the online version is not practical.
To facilitate identification of the best variant-specific sgRNAs, we provide information about both sgRNAs targeting specific variants and sgRNAs targeting the reference sequence in the same region. The efficiency score and an off-target score are provided, and the positions of relevant SNPs or indels in the sgRNA are included so that users can select the most suitable sgRNA or filter out less optimal ones.
The web tool supports up to 2,000 variants per batch while the command line version has no limit with the number of variants and can be used for any annotated genome. The command line version also provides better performance on large inputs when run multi-threaded. For example, a multi-threaded test run was able to process over 1,000 human SNPs per minute on Harvard Medical School’s “O2” high-performance computing cluster. We pre-computed sgRNA designs (NGG-PAM) for all clinically associated SNPs annotated at the Ensembl genome browser (ftp://ftp.ensembl.org/pub/release-97/variation/gvf/homo_sapiens/homo_sapiens_clinically_associated.gvf.gz) using the command line version of the pipeline and the designs can be found at https://github.com/jrodiger/snp_crispr/tree/master/results.
Conclusion
SNP-CRISPR is a unique web tool that designs sgRNAs targeting specific SNPs or indels. SNP-CRISPR is user-friendly and provides all possible CRISPR-Cas9 target sites in a given genomic region with required parameters, allowing users to select an optimal sgRNA. SNP-CRISPR provides not only efficiency scores but also off-target information for sgRNAs targeting sequences with and without SNPs and/or indels of interest in the same genomic region. SNP-CRISPR supports the human reference genome and genomes from major model organisms; namely, mouse, rat, fly and zebrafish. Conveniently, SNP-CRISPR displays the positions of variant nucleotides in each sgRNA region as part of the design output. Moreover, SNP-CRIPSR accepts up to 2,000 inputs per batch for design of large-scale experiments at the website. The command line version has no limit as to the number of variants and can be used for any genome that has been properly annotated. Altogether, SNP-CRISPR improves the ability of researchers to edit SNP or indel-containing loci by facilitating the design of sgRNAs that target specific variants. As such, SNP-CRISPR provides a valuable new resource to the genome editing technology field.
More and more variant data has become available in recent years, and much current research focuses on the biological impact of variants (Amberger and Hamosh 2017; Bragin et al. 2014; Landrum et al. 2014; Song et al. 2016), motivating us to develop a variant-centered tool. For instance, a CRISPR/Cas9-based targeting approach has been used to specifically correct heterozygous missense mutations associated with dominantly inherited conditions by including the mutated base in the sgRNA sequence (Courtney et al. 2016). CRISPR/Cas9-based therapeutic approaches show great promise for permanent correction of genetic disorders in somatic cells. In addition, to facilitate direct research in gene therapy of human diseases, SNP-CRISPR will be valuable for modeling human disease using model organisms. With a vast and growing amount of sequences from different strains of model organisms such as Drosophila melanogaster, millions of novel sequence variants have been identified (Huang et al. 2014; Wang et al. 2015). However, the biological significance of most of these sequence variants is still unclear. By facilitating design of sgRNAs targeting variant-specific alleles, including at a large scale, SNP-CRISPR makes it more feasible to study these variants systematically.
Acknowledgments
Relevant grant support includes NIH NIGMS R01 GM067761 and P41 GM132087. In addition, C.C. is supported by R21 ES025615, and S.E.M. is supported in part by the Dana Farber/Harvard Cancer Center, which is supported in part by NIH NCI Cancer Center Support Grant P30 CA006516. N.P. is an investigator of Howard Hughes Medical Institute.
Footnotes
Communicating editor: B. Oliver
Literature Cited
- Amberger, J. S., and A. Hamosh, 2017 Searching Online Mendelian Inheritance in Man (OMIM): A Knowledgebase of Human Genes and Genetic Phenotypes. Curr Protoc Bioinformatics 58: 1 2 1–1 2 12. [DOI] [PMC free article] [PubMed]
- Anaconda, 2016 Anaconda Software Distribution. Computer software. Vers. 2–2.4.0. Web. https://anaconda.com
- Anzalone A. V., Randolph P. B., Davis J. R., Sousa A. A., Koblan L. W. et al. , 2019. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576: 149–157. 10.1038/s41586-019-1711-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bragin E., Chatzimichali E. A., Wright C. F., Hurles M. E., Firth H. V. et al. , 2014. DECIPHER: database for the interpretation of phenotype-linked plausibly pathogenic sequence and copy-number variation. Nucleic Acids Res. 42: D993–D1000. 10.1093/nar/gkt937 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chuai G., Ma H., Yan J., Chen M., Hong N. et al. , 2018. DeepCRISPR: optimized CRISPR guide RNA design by deep learning. Genome Biol. 19: 80 10.1186/s13059-018-1459-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cock P., Antao T., Chang J., Chapman B., Cox C. et al. , 2009. Biopython: freely available Python tools [DOI] [PMC free article] [PubMed]
- Courtney D. G., Moore J. E., Atkinson S. D., Maurizi E., Allen E. H. et al. , 2016. CRISPR/Cas9 DNA cleavage at SNP-derived PAM enables both in vitro and in vivo KRT12 mutation-specific targeting. Gene Ther. 23: 108–112. 10.1038/gt.2015.82 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dandage R., Despres P. C., Yachie N., and Landry C. R., 2019. beditor: A Computational Workflow for Designing Libraries of Guide RNAs for CRISPR-Mediated Base Editing. Genetics 212: 377–385. 10.1534/genetics.119.302089 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doench J. G., Hartenian E., Graham D. B., Tothova Z., Hegde M. et al. , 2014. Rational design of highly active sgRNAs for CRISPR-Cas9-mediated gene inactivation. Nat. Biotechnol. 32: 1262–1267. 10.1038/nbt.3026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fu Y., Foden J. A., Khayter C., Maeder M. L., Reyon D. et al. , 2013. High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nat. Biotechnol. 31: 822–826. 10.1038/nbt.2623 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao Z., Herrera-Carrillo E., and Berkhout B., 2018. Delineation of the Exact Transcription Termination Signal for Type 3 Polymerase III. Mol. Ther. Nucleic Acids 10: 36–44. 10.1016/j.omtn.2017.11.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gaudelli N. M., Komor A. C., Rees H. A., Packer M. S., Badran A. H. et al. , 2017. Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage. Nature 551: 464–471. 10.1038/nature24644 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gratz S. J., Ukken F. P., Rubinstein C. D., Thiede G., Donohue L. K. et al. , 2014. Highly specific and efficient CRISPR/Cas9-catalyzed homology-directed repair in Drosophila. Genetics 196: 961–971. 10.1534/genetics.113.160713 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haeussler M., Schonig K., Eckert H., Eschstruth A., Mianne J. et al. , 2016. Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR. Genome Biol. 17: 148 10.1186/s13059-016-1012-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heigwer F., Kerr G., and Boutros M., 2014. E-CRISP: fast CRISPR target site identification. Nat. Methods 11: 122–123. 10.1038/nmeth.2812 [DOI] [PubMed] [Google Scholar]
- Housden B. E., Valvezan A. J., Kelley C., Sopko R., Hu Y. et al. , 2015. Identification of potential drug targets for tuberous sclerosis complex by synthetic screens combining CRISPR-based knockouts with RNAi. Sci. Signal. 8: rs9 10.1126/scisignal.aab3729 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hsu P. D., Scott D. A., Weinstein J. A., Ran F. A., Konermann S. et al. , 2013. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat. Biotechnol. 31: 827–832. 10.1038/nbt.2647 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang W., Massouras A., Inoue Y., Peiffer J., Ramia M. et al. , 2014. Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines. Genome Res. 24: 1193–1208. 10.1101/gr.171546.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keough K. C., Lyalina S., Olvera M. P., Whalen S., Conklin B. R. et al. , 2019. AlleleAnalyzer: a tool for personalized and allele-specific sgRNA design. Genome Biol. 20: 167 10.1186/s13059-019-1783-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Komor A. C., Kim Y. B., Packer M. S., Zuris J. A., and Liu D. R., 2016. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533: 420–424. 10.1038/nature17946 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Labun K., Montague T. G., Krause M., Torres Cleuren Y. N., Tjeldnes H. et al. , 2019. CHOPCHOP v3: expanding the CRISPR web toolbox beyond genome editing. Nucleic Acids Res. 47: W171–W174. 10.1093/nar/gkz365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Landrum M. J., Lee J. M., Riley G. R., Jang W., Rubinstein W. S. et al. , 2014. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42: D980–D985. 10.1093/nar/gkt1113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Listgarten J., Weinstein M., Kleinstiver B. P., Sousa A. A., Joung J. K. et al. , 2018. Prediction of off-target activities for the end-to-end design of CRISPR guide RNAs. Nat. Biomed. Eng. 2: 38–47. 10.1038/s41551-017-0178-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu H., Wei Z., Dominguez A., Li Y., Wang X. et al. , 2015. CRISPR-ERA: a comprehensive design tool for CRISPR-mediated gene editing, repression and activation. Bioinformatics 31: 3676–3678. 10.1093/bioinformatics/btv423 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moreno-Mateos M. A., Vejnar C. E., Beaudoin J. D., Fernandez J. P., Mis E. K. et al. , 2015. CRISPRscan: designing highly efficient sgRNAs for CRISPR-Cas9 targeting in vivo. Nat. Methods 12: 982–988. 10.1038/nmeth.3543 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Naito Y., Hino K., Bono H., and Ui-Tei K., 2015. CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites. Bioinformatics 31: 1120–1123. 10.1093/bioinformatics/btu743 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Najm F. J., Strand C., Donovan K. F., Hegde M., Sanson K. R. et al. , 2018. Orthologous CRISPR-Cas9 enzymes for combinatorial genetic screens. Nat. Biotechnol. 36: 179–189. 10.1038/nbt.4048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oliveros J. C., Franch M., Tabas-Madrid D., San-Leon D., Montoliu L. et al. , 2016. Breaking-Cas-interactive design of guide RNAs for CRISPR-Cas experiments for ENSEMBL genomes. Nucleic Acids Res. 44: W267–W271. 10.1093/nar/gkw407 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park J., Bae S., and Kim J. S., 2015. Cas-Designer: a web-based tool for choice of CRISPR-Cas9 target sites. Bioinformatics 31: 4014–4016. [DOI] [PubMed] [Google Scholar]
- Perez A. R., Pritykin Y., Vidigal J. A., Chhangawala S., Zamparo L. et al. , 2017. GuideScan software for improved single and paired CRISPR guide RNA design. Nat. Biotechnol. 35: 347–349. 10.1038/nbt.3804 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pickar-Oliver A., and Gersbach C. A., 2019. The next generation of CRISPR-Cas technologies and applications. Nat. Rev. Mol. Cell Biol. 20: 490–507. 10.1038/s41580-019-0131-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pliatsika V., and Rigoutsos I., 2015. “Off-Spotter”: very fast and exhaustive enumeration of genomic lookalikes for designing CRISPR/Cas guide RNAs. Biol. Direct 10: 4 10.1186/s13062-015-0035-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rabinowitz R., Darnell R., and Offen D., 2019. CrisPam – a tool for designing gRNA sequences to specifically target a variant allele using CRISPR. Cytotherapy 21: e6 10.1016/j.jcyt.2019.04.021 [DOI] [Google Scholar]
- Sanson K. R., Hanna R. E., Hegde M., Donovan K. F., Strand C. et al. , 2018. Optimized libraries for CRISPR-Cas9 genetic screens with multiple modalities. Nat. Commun. 9: 5416 10.1038/s41467-018-07901-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song W., Gardner S. A., Hovhannisyan H., Natalizio A., Weymouth K. S. et al. , 2016. Exploring the landscape of pathogenic genetic variation in the ExAC population database: insights of relevance to variant classification. Genet. Med. 18: 850–854. 10.1038/gim.2015.180 [DOI] [PubMed] [Google Scholar]
- Stemmer M., Thumberger T., Del Sol Keyer M., Wittbrodt J., and Mateo J. L., 2015. CCTop: An Intuitive, Flexible and Reliable CRISPR/Cas9 Target Prediction Tool. PLoS One 10: e0124633 10.1371/journal.pone.0124633 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tange, O., 2018 GNU Parallel 2018, ISBN 9781387509881, 10.5281/zenodo.1146014 [DOI] [Google Scholar]
- Tycko J., Wainberg M., Marinov G. K., Ursu O., Hess G. T. et al. , 2019. Mitigation of off-target toxicity in CRISPR-Cas9 screens for essential non-coding elements. Nat. Commun. 10: 4063 10.1038/s41467-019-11955-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang F., Jiang L., Chen Y., Haelterman N. A., Bellen H. J. et al. , 2015. FlyVar: a database for genetic variation in Drosophila melanogaster. Database (Oxford) 2015 10.1093/database/bav079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu H., and Liang C., 2019. CRISPR-DT: designing gRNAs for the CRISPR-Cpf1 system with improved target efficiency and specificity. Bioinformatics 35: 2783–2789. 10.1093/bioinformatics/bty1061 [DOI] [PubMed] [Google Scholar]
- Zhu L. J., Holmes B. R., Aronin N., and Brodsky M. H., 2014. CRISPRseek: a bioconductor package to identify target-specific guide RNAs for CRISPR-Cas9 genome-editing systems. PLoS One 9: e108424 10.1371/journal.pone.0108424 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
SNP-CRISPR is available for online use without any restrictions at https://www.flyrnai.org/tools/snp_crispr.
The source code for the pipeline, including instructions for installation and use, is available at https://github.com/jrodiger/snp_crispr