Skip to main content
Synthetic and Systems Biotechnology logoLink to Synthetic and Systems Biotechnology
. 2016 Feb 12;1(2):118–121. doi: 10.1016/j.synbio.2016.01.003

CRISPy-web: An online resource to design sgRNAs for CRISPR applications

Kai Blin a, Lasse Ebdrup Pedersen a, Tilmann Weber a,*, Sang Yup Lee a,b,**
PMCID: PMC5640694  PMID: 29062934

Abstract

CRISPR/Cas9-based genome editing has been one of the major achievements of molecular biology, allowing the targeted engineering of a wide range of genomes. The system originally evolved in prokaryotes as an adaptive immune system against bacteriophage infections. It now sees widespread application in genome engineering workflows, especially using the Streptococcus pyogenes endonuclease Cas9. To utilize Cas9, so-called single guide RNAs (sgRNAs) need to be designed for each target gene. While there are many tools available to design sgRNAs for the popular model organisms, only few tools that allow designing sgRNAs for non-model organisms exist. Here, we present CRISPy-web (http://crispy.secondarymetabolites.org/), an easy to use web tool based on CRISPy to design sgRNAs for any user-provided microbial genome. CRISPy-web allows researchers to interactively select a region of their genome of interest to scan for possible sgRNAs. After checks for potential off-target matches, the resulting sgRNA sequences are displayed graphically and can be exported to text files. All steps and information are accessible from a web browser without the requirement to install and use command line scripts.

Keywords: CRISPR, Cas9, sgRNA, Guide RNA, Genome editing, Genome engineering

1. Introduction

“Clustered regularly interspaced short palindromic repeats” (CRISPR) and their associated RNA-guided endonucleases are bacterial adaptive immune systems protecting the bacteria from infections with bacteriophages.1 The biotechnological application of this system is currently revolutionizing molecular biology and provides new opportunities for synthetic biology applications.2 The CRISPR system allows targeted genome engineering of bacteria,3, 4, 5 and also eukaryotes including yeast,6, 7 plants,8 human cell lines9 and many more. Although several alternative CRISPR systems have recently been described (e.g., References 10, 11, 12, 13), most CRISPR systems for genome engineering are based on the Streptococcus pyogenes type II CRISPR/Cas9 system. Cas9 is an RNA-guided endonuclease, which introduces double strand DNA breaks at positions that are complementary to a crRNA sequence that binds to Cas9 in a duplex with a tracrRNA. It has also been demonstrated that Cas9 accepts artificially fused crRNA:tracrRNA-hybrids termed “single guide RNAs” (sgRNAs) to direct it to the target cut sites.14 Thus CRISPR can be used as an “in vivo programmable restriction enzyme,” which cuts the target DNA within an exactly defined target sequence determined by the sgRNA.

One essential step in the design of CRISPR experiments is to identify suitable sgRNA sequences within the target gene(s), which have to fulfill certain conditions: (i) the 20 bp target sequence has to be directly upstream of a “protospacer adjacent motif” (PAM). For the S. pyogenes PAM, this motif is “NGG”, (ii) it is desirable that the sequence of the target motif is unique within the genome of the organisms to prevent off-target activity, i.e. the cleavage of the chromosome at wrong positions.

Many programs and web-servers exist to assist biologists in finding such CRISPR/Cas9 target sites. However, most of these tools (e.g. CCTop,15 CHOPCHOP,16 CRISPR Design,17 WU-CRISPR,18 WGE CRISPR Finder,19 and CRISPy CHO20) are limited to a narrow set of – mostly eukaryotic – target genomes of model organisms. Only very few tools (e.g. sgRNAcas921) allow running Cas9 target searches on user-specified genomes. Unfortunately, to the best of our knowledge, none of these generally applicable tools are provided asuser-friendly web services, which can be used by non-computer scientists.

Here, we therefore present a web server implementation of the sgRNA prediction software CRISPy (http://crispy.secondarymetabolites.org/) that was originally designed for CRISPR-based engineering of Chinese hamster ovary (CHO) cells.20 It is implemented as a stand-alone web application for Cas9 target prediction that can design sgRNA for any gene/region of interest in user-submitted genome data. In addition, CRISPy-web tightly connects with the popular natural product prediction tool antiSMASH22, 23, 24 to assist researchers working on microbial secondary metabolites in leveraging the CRISPR/Cas9 system. Using antiSMASH or working on natural products of course is not a prerequisite for using CRISPy-web.

2. Design and implementation

CRISPy-web consists of multiple decoupled service components tied together by a message queue. The main services are the CRISPy worker service, the web application programming interface (API) server and the web user interface (UI). Sequences to be scanned for Cas9 target sites can be uploaded directly (in GenBank format) or transferred from antiSMASH22, 23, 24 by specifying the antiSMASH job ID. In the next version of antiSMASH, a direct link from antiSMASH will also be implemented.

2.1. CRISPy worker service

Two steps are performed by the worker service: preparing a sequence for analysis and then scanning and ranking Cas9 target sites. In the preparation step, the input files are parsed and – if the files were transferred from antiSMASH – annotated with secondary metabolite gene cluster locations using Biopython.25

Based on the CRISPy algorithm initially used in Ronda et al.,20 the scanning step searches for the ‘NGG’ PAM sequence on both the forward and the reverse strands of the selected region of interest, counting the number of direct off-target hits for sequences with the same 13 bp core sequence upstream of the PAM. It then calculates the number of off-target hits containing 0, 1, 2, or 3 bp mismatches on the whole genome for the 13 bp sequence by considering all possible combinations.

After collecting all possible Cas9 targets, sites that have direct off-target matches, sites with off-target matches with a single base-pair mutation, or sites with more than ten off-target matches with a two base-pair mutation are discarded. All these hits are stored in a GenBank file. To further reduce the number of sites displayed on the user interface, the targets are now sorted by 2 bp mismatch off-targets, 3 bp mismatch off-targets and coordinates on the genome, and only the first 1000 targets are displayed to the user.

2.2. Web application programing interface server

The web API server provides a Representational State Transfer (REST) interface26 that both the CRISPy-web UI and other tools can use to submit jobs to and obtain results from the CRISPy worker service. Data are transferred using the JavaScript Object Notation (JSON) format. The API server is implemented in Python using the Flask web framework and a Redis server for the message queue.

2.3. Web user interface

Built around the AngularJS framework and D3 for cluster visualization, the web UI is the primary method of interacting with CRISPy-web. Communicating with the web API server via asynchronous JavaScript (AJAX) calls, the web UI leads the user through the process of identifying Cas9 target sites in regions of interest.

3. Example usage: scanning for sgRNAs in the actinorhodin gene cluster of Streptomyces coelicolor

To scan for sgRNAs in S. coelicolor, the user first needs to upload the corresponding genome file in GenBank format. Such a file can easily be obtained from NCBI with the RefSeq ID NC_003888. After a short preliminary scan of the genome, CRISPy-web displays the target selection page. The target selection page gives a short overview of the uploaded genome, such as organism, description, sequence length and the number of annotated genes. Targets can be selected as a nucleotide range or by providing a locus tag, gene name, or protein-ID if annotated in the genome. If available, annotated gene clusters will also be shown in a list and can be selected as regions to scan. On S. coelicolor, the actinorhodin cluster is located from 5,509,800 to 5,552,424 bp on the genome, so that is the range to select as target region. After a click on “Find targets”, the scan for sgRNAs starts.

Depending on the size of the selected region and the overall genome size, this sgRNA scan will take several minutes. After the scan is completed, a page graphically displays the scanned region (Fig. 1A). Genes are shown as gray arrows, sgRNAs on the forward strand are displayed as red boxes above the genes. sgRNAs on the reverse strand are displayed below the genes. A table below the graphical overview gives details on the identified sgRNAs, ranked by the least number of potential off-target matches. Hovering over the table row of an sgRNA hit highlights the corresponding sgRNA box in the graphics, so you can quickly locate the hit on the region.

Fig. 1.

Fig. 1

Example output of a CRISPy-web run to identify sgRNAs in the actinorhodin biosynthetic gene cluster of Streptomyces coelicolor A3(2) (NCBI GenBank ID: NC_003888.3). (A) Overview of sgRNAs identified in the complete actinorhodin gene cluster. (B) Zoom view of SCO5087 (actIORF1). (C) List of sgRNAs selected for export.

As there are a lot of hits for this region, it is possible to zoom in on a particular gene of interest by clicking on the gene arrow and selecting “show results for this gene only”. Because the whole region already was scanned, this operation is instantaneous (Fig. 1B). On both the overview and on the zoomed view, sgRNAs you want to save for download can be added to the download basket by clicking the corresponding table row. The download basket button on the upper right will track the number of selected sgRNAs.

A click on the download basket icon will display the download overview page (Fig. 1C). There, the selected sgRNAs can be verified again. A click on the “Download CSV file” button will start the download of all selected sgRNAs in comma-separated value format. Files in this format can easily be used in a spreadsheet application, processed for downstream computational analysis or oligonucleotide orders.

Acknowledgment

This work was funded by a grant of the Novo Nordisk Foundation.

Footnotes

Peer review under responsibility of KeAi Communications Co., Ltd.

Contributor Information

Tilmann Weber, Email: tiwe@biosustain.dtu.dk.

Sang Yup Lee, Email: leesy@kaist.ac.kr.

References

  • 1.Horvath P., Barrangou R. CRISPR/Cas, the immune system of Bacteria and Archaea. Science. 2010;327:167–170. doi: 10.1126/science.1179555. [DOI] [PubMed] [Google Scholar]
  • 2.Doudna J.A., Charpentier E. Genome editing. The new frontier of genome engineering with CRISPR-Cas9. Science. 2014;346 doi: 10.1126/science.1258096. 1258096. [DOI] [PubMed] [Google Scholar]
  • 3.Jiang W., Bikard D., Cox D., Zhang F., Marraffini L.A. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat Biotechnol. 2013;31:233–239. doi: 10.1038/nbt.2508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Tong Y., Charusanti P., Zhang L., Weber T., Lee S.Y. CRISPR-Cas9 based engineering of actinomycetal genomes. ACS Synth Biol. 2015;4:1020–1029. doi: 10.1021/acssynbio.5b00038. [DOI] [PubMed] [Google Scholar]
  • 5.Cobb R.E., Wang Y., Zhao H. High-efficiency multiplex genome editing of Streptomyces species using an engineered CRISPR/Cas system. ACS Synth Biol. 2015;4:723–728. doi: 10.1021/sb500351f. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ronda C., Maury J., Jakociunas T., Jacobsen S.A., Germann S.M., Harrison S.J. CrEdit: CRISPR mediated multi-loci gene integration in Saccharomyces cerevisiae. Microb Cell Fact. 2015;14:97. doi: 10.1186/s12934-015-0288-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Jakociunas T., Bonde I., Herrgard M., Harrison S.J., Kristensen M., Pedersen L.E. Multiplex metabolic pathway engineering using CRISPR/Cas9 in Saccharomyces cerevisiae. Metab Eng. 2015;28:213–222. doi: 10.1016/j.ymben.2015.01.008. [DOI] [PubMed] [Google Scholar]
  • 8.Shan Q., Wang Y., Li J., Zhang Y., Chen K., Liang Z. Targeted genome modification of crop plants using a CRISPR-Cas system. Nat Biotechnol. 2013;31:686–688. doi: 10.1038/nbt.2650. [DOI] [PubMed] [Google Scholar]
  • 9.Shalem O., Sanjana N.E., Hartenian E., Shi X., Scott D.A., Mikkelsen T.S. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science. 2014;343:84–87. doi: 10.1126/science.1247005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Zetsche B., Gootenberg J.S., Abudayyeh O.O., Slaymaker I.M., Makarova K.S., Essletzbichler P. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system. Cell. 2015;163(3):759–771. doi: 10.1016/j.cell.2015.09.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Esvelt K.M., Mali P., Braff J.L., Moosburner M., Yaung S.J., Church G.M. Orthogonal Cas9 proteins for RNA-guided gene regulation and editing. Nat Methods. 2013;10:1116–1121. doi: 10.1038/nmeth.2681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Friedland A.E., Baral R., Singhal P., Loveluck K., Shen S., Sanchez M. Characterization of Staphylococcus aureus Cas9: a smaller Cas9 for all-in-one adeno-associated virus delivery and paired nickase applications. Genome Biol. 2015;16:257. doi: 10.1186/s13059-015-0817-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Shmakov S., Abudayyeh O.O., Makarova K.S., Wolf Y.I., Gootenberg J.S., Semenova E. Discovery and functional characterization of diverse class 2 CRISPR-cas systems. Mol Cell. 2015;60:385–397. doi: 10.1016/j.molcel.2015.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jinek M., Chylinski K., Fonfara I., Hauer M., Doudna J.A., Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Stemmer M., Thumberger T., Del Sol Keyer M., Wittbrodt J., Mateo J.L. CCTop: an intuitive, flexible and reliable CRISPR/Cas9 target prediction tool. PLoS ONE. 2015;10:e0124633. doi: 10.1371/journal.pone.0124633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Montague T.G., Cruz J.M., Gagnon J.A., Church G.M., Valen E. CHOPCHOP: a CRISPR/Cas9 and TALEN web tool for genome editing. Nucleic Acids Res. 2014;42:W401–7. doi: 10.1093/nar/gku410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hsu P.D., Scott D.A., Weinstein J.A., Ran F.A., Konermann S., Agarwala V. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat Biotechnol. 2013;31:827–832. doi: 10.1038/nbt.2647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Wong N., Liu W., Wang X. WU-CRISPR: characteristics of functional guide RNAs for the CRISPR/Cas9 system. Genome Biol. 2015 doi: 10.1186/s13059-015-0784-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Hodgkins A., Farne A., Perera S., Grego T., Parry-Smith D.J., Skarnes W.C. WGE: a CRISPR database for genome engineering. Bioinformatics. 2015;31:3078–3080. doi: 10.1093/bioinformatics/btv308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Ronda C., Pedersen L.E., Hansen H.G., Kallehauge T.B., Betenbaugh M.J., Nielsen A.T. Accelerating genome editing in CHO cells using CRISPR Cas9 and CRISPy, a web-based target finding tool. Biotechnol Bioeng. 2014;111:1604–1616. doi: 10.1002/bit.25233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Xie S., Shen B., Zhang C., Huang X., Zhang Y. sgRNAcas9: a software package for designing CRISPR sgRNA and evaluating potential off-target cleavage sites. PLoS ONE. 2014;9:e100448. doi: 10.1371/journal.pone.0100448. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Weber T., Blin K., Duddela S., Krug D., Kim H.U., Bruccoleri R. antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res. 2015;43:W237–43. doi: 10.1093/nar/gkv437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Blin K., Medema M.H., Kazempour D., Fischbach M., Breitling R., Takano E. antiSMASH 2.0 – a versatile platform for genome mining of secondary metabolite producers. Nucleic Acids Res. 2013;41:W204–12. doi: 10.1093/nar/gkt449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Medema M.H., Blin K., Cimermancic P., de Jager V., Zakrzewski P., Fischbach M.A. antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39:W339–46. doi: 10.1093/nar/gkr466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Cock P.J., Antao T., Chang J.T., Chapman B.A., Cox C.J., Dalke A. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009;25:1422–1423. doi: 10.1093/bioinformatics/btp163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Fielding R.T., Taylor R.N. Principled design of the modern Web architecture. ACM T Internet Techn. 2002;2:115–150. [Google Scholar]

Articles from Synthetic and Systems Biotechnology are provided here courtesy of KeAi Publishing

RESOURCES