Abstract
Background
Zinc Finger Nucleases (ZFNs) are man-made restriction enzymes useful for manipulating genomes by cleaving target DNA sequences. ZFNs allow therapeutic gene correction or creation of genetically modified model organisms. ZFN specificity is not absolute; therefore, it is essential to select ZFN target sites without similar genomic off-target sites. It is important to assay for off-target cleavage events at sites similar to the target sequence.
Results
ZFN-Site is a web interface that searches multiple genomes for ZFN off-target sites. Queries can be based on the target sequence or can be expanded using degenerate specificity to account for known ZFN binding preferences. ZFN off-target sites are outputted with links to genome browsers, facilitating off-target cleavage site screening. We verified ZFN-Site using previously published ZFN half-sites and located their target sites and their previously described off-target sites. While we have tailored this tool to ZFNs, ZFN-Site can also be used to find potential off-target sites for other nucleases, such as TALE nucleases.
Conclusions
ZFN-Site facilitates genome searches for possible ZFN cleavage sites based on user-defined stringency limits. ZFN-Site is an improvement over other methods because the FetchGWI search engine uses an indexed search of genome sequences for all ZFN target sites and possible off-target sites matching the half-sites and stringency limits. Therefore, ZFN-Site does not miss potential off-target sites.
Background
The ability to create double-stranded DNA breaks at specific genomic sequences is important for gene correction therapeutics, targeted gene integration and gene modification for research models as well as gene disruption [1]. Zinc Finger Nucleases (ZFNs) are promising candidates for such specific nucleases. ZFNs consist of the sequence-independent FokI nuclease domain fused to zinc finger proteins (ZFPs). ZFPs can be altered to change their sequence specificity. Cleavage of targeted DNA requires binding of two ZFNs (designated left and right) to adjacent half-sites on opposite strands with correct orientation and spacing, thus forming a FokI dimer [2]. The requirement for dimerization increases ZFN specificity significantly. Three or four finger ZFPs target ~9 or 12 bases per ZFN, or ~18 or 24 bases for the ZFN pair. ZFN pairs have been used for gene targeting at specific genomic loci in insect, plant, animal and human cells [3-10] (and reviewed in [11,12]). Methods are available to measure general ZFN toxicity or the amount of unrepaired DNA ends resulting from ZFN treatment [13-16]; however, determining all possible off-target cleavage sites may be challenging, as some possible cleavage sites can be missed by BLAST and similar methods. ZFN-Site determines the most probable off-target sites for further analysis or testing. Several ZFN design web tools exist that offer BLAST-based searches for potential ZFN off-target sites [17-22]. BLAST searches, which implement a local alignment search, are not optimal for finding ZFN off-target sites and may miss some sites because they utilize seed-based methods with a non-overlapping word index to search only for perfect matches, rather than longer imperfect matches. BLAST also uses an E-value threshold that does not directly correspond to a "# of mismatches" threshold. ZFN-Site is more thorough because it scans one index entry for each nucleotide in the genome, ensuring that no matches are missed. ZFN-Site was created to provide a simple, easy-to-use interface that does not require the end user to possess specialized bioinformatics or search algorithm expertise. ZFN-Site provides an interface that searches multiple genomes for sites with ambiguities, mismatches, multiple spacings, hetero-dimeric binding sites and homo-dimeric binding sites composed of two left or two right ZFN half-sites. Changing these parameters can expand the number of possible off-target sites returned to match the purpose. A larger list enables thorough screening for potential ZFN off-target sites using new methods, such as high-throughput sequencing or mutation screens.
Implementation
ZFN-Site was developed to quickly locate all possible ZFN target and off-target sites that might be cleaved. Based on the tailoring of search parameters, ZFN-Site generates sets of search strings. To ensure that all sites matching these criteria are found in the requested genomes, ZFN-Site employs the FetchGWI search engine [23]. The input can be either the nucleotide sequence of the intended target site of each ZFN (basic search) or information about each ZFN's binding specificity (relaxed specificity). The number of possible sites is expanded by choice of ZFN spacing, the possibility of ZFN homo-dimerization (see below) and the number of allowed mismatches. The output from ZFN-Site aids in the choice of ZFN pairs that minimize potential off-target sites and allows experimental testing of each ZFN pairs' off-target sites in cells or in mutated animals. Experimentally testing the list of found sites under a series of different conditions may determine the conditions favoring more specific targeting and less off-target cleavage events.
Basic Target Search
The simplest search method uses the intended target site to scan whole genomes. This type of search is valuable when choosing prospective target sites or when there is no available ZFP mismatch specificity data. ZFN-Site allows searches for off-target sites containing up to two mismatches per half-site. ZFN-Site outputs all target and off-target sites matching the selection criteria.
The genome or genomes to be searched are chosen by clicking on the species list on the left side of the ZFN-Site web page. Scrolling down reveals the full list. Use command-click (mac) or control-click (pc) to choose multiple genomes to be searched simultaneously. A click on ALL searches the entire list of genomes shown in Table 1.
Table 1.
Genome Release (Code) | Species |
---|---|
Homo sapiens (HS) | Human |
Mus musculus (MM) | Mouse |
Danio rerio Zv6 (DR) | Zebrafish |
Danio rerio Zv5 (DR5) | Zebrafish |
Drosophila melanogaster (DM) | Fruit Fly |
Apis mellifera (AME) | Bee |
Bos taurus (BT) | Cow |
Caenorhabditis elegans NCBIWS170(CE) | Nematode |
Canis familiaris (CFA) | Dog |
Pan troglodytes (PTR) | Chimpanzee |
Rattus norvegicus (RN) | Rat |
Saccharomyces cerevisiae (SCE) | Yeast |
Tribolium castaneum (TCA) | Beetle |
All genomes (ALL) | All of the above |
Half-sites are entered without spaces, 5' to 3', as they occur on the opposite strand of a ZFN target. The following sequence is an example of the top DNA strand of a three finger ZFN pair target site: 5'-CGGAGCCGCTTTaacccACTCTGTGGAAG-3'[3]. The right ZFN half-site is underlined and should be entered into the program 5'-3' as ACTCTGTGGAAG. The left ZFN half-site is the reverse complement of the bold sequence and should be entered 5'-3' as AAAGCGGCTCCG (Figure 1).
The sequence of the DNA spacer between ZFN half-sites (lower case, above) does not greatly influence ZFN specificity, but the length of the spacer between half-sites influences how well a site is cleaved [24]. The allowed number of spacer nucleotides depends on the ZFP-to-FokI linker and is usually five or six nucleotides, although ZFNs with altered linkers have different nucleotide length preferences [25,26]. Genome searches can be run on ZFN-Site with one allowed spacing between half-sites or two spacings if entered separated by a comma (e.g., 5,6). Searches can be repeated using alternate spacings if searching with more than two spacings is required.
In addition to a left ZFN and a right ZFN binding as hetero-dimers, two left or two right ZFNs can bind correctly spaced sites to form homo-dimers and cleave off-target sites [16]. If the "Allow Left and Right Protein Homo-dimerization" box is checked, ZFN-Site also searches for homo-dimeric sites. Use of modified FokI domains may prevent cleavage at most homo-dimeric sites [13,27]. However, identification of homo-dimeric sites and experimental testing for cleavage at each site on these output lists may be necessary to quantitate low levels of cleavage and generate further predictive rules for off-target cleavage events. The specificity of nuclease variants can be experimentally tested using cleavage analysis on the sites comprising the lists of possible off-target sites generated by ZFN-Site [13,25,27-29].
ZFN-Site expands the query targets into a list of queries (or tags) based on the half-sites and inputs. Using increased ambiguities broadens the search. Degenerate nucleotides (specified by standard IUPAC codes) are allowed in the half-site queries because they are then expanded into all possible matching tags. These queries are submitted to an exact search algorithm (described in [23]). The number of such queries increases with the required mismatches and ambiguities (such as Ns and nucleotide IUPAC codes), thus increasing RAM and search time required. Very complex searches may be achieved by breaking the search into parts to speed processing and prevent stalling.
The number of mismatches per half-site (0, 1 or 2) is inputted into the last box. Use 0 to scan only for sites exactly matching the half-sites. This mode is useful for verifying the location of target sites in one or more genomes. The number of off-target sites returned can be greatly increased by allowing 1 or 2 mismatches per half-site. The use of ambiguous nucleotides in the half-sites does not count as a mismatch, and both can be used if needed. Mismatches are allowed in degenerate positions as well. If the user specifies a search with one or two mismatches, ZFN-Site will generate all possible sequence tags that match the target up to the specified number of mismatches.
Once the information above is entered, clicking run will display the query sequences on the next web page, while the genome searches are performed using the FetchGWI program (see paragraph on FetchGWI below). ZFN-Site outputs a list of half-site matches sorted by genome position. This list is scanned by a second program that extracts all combinations on each DNA strand that have the required spacing. For fast performance on the Web, we have limited the number of possible mismatches per ZFN half-site to two. The total number of degenerate nucleotides is also limited to two, such that the computational complexity is manageable.
Based on these inputs, ZFN-Site generates a list of genomic sequences that are exact or near-exact matches to the input query set, along with chromosomal coordinates (including NCBI chromosomal accession number and the start and end positions within the chromosome), DNA strand and HTML links to their exact location on ENSEMBL, UCSC and NCBI browsers [23] (Figure 2). Results are output under "WORD MATCHES" in a two-line format for each genomic sequence returned. The top line of each pair of lines depicts the genomic sequence. The lower line displays the differences from the query sequence. Spacer nucleotides are indicated in blue, and in cases where there are ambiguous nucleotides, genomic nucleotides matching an unambiguous portion of the query sequence are in blue. The number of nucleotides in the spacer is indicated by the number of green Ns in the lower line. Red nucleotides depict mismatches. The number of mismatches is displayed, not including positions with degenerate nucleotides (unless mismatches occur at degenerate positions). The next four columns list the matched sequence's "Species", "Chromosomal Coordinates [start..end]", "Strand" and "Links to Genome Browsers". Clicking on the HTML links to the right of a matched genomic sequence will open a browser in either the ENSEMBL, UCSC or NCBI genome browsers. This will direct the user to that exact location, allowing one to identify whether that targeted sequence is in an annotated gene, intron, exon or regulatory sequence.
ZFN-Site can be used to determine if ZFNs may be used to specifically target sites in multiple different genomes. ZFN-Site can scan multiple genomes simultaneously using the same settings or can be run sequentially.
Relaxed Specificity Search
Previous in vitro and cellular ZFP specificity studies may help determine other sequences that may be possibly cleaved by a ZFN pair. This information can come from studies of individual fingers [30-32]. Without Systematic Evolution of Ligands by Exponential Enrichment (SELEX) or similar data (described below), the specificity of a ZFN can be approximated by combining the specificity of the individual fingers, even though this fails to account for the effects of adjacent fingers. There are many manuscripts detailing individual ZFP specificity; non-exhaustive examples include [30-35]. Approximating the specificity of the whole ZFN by compiling the relaxed specificity of the constituent ZFPs may provide more predictive results than using the basic target search, as the individual finger data may help determine the non-specified bases. If there are individual nucleotide positions where the ZFPs can bind several nucleotides, standard IUPAC ambiguity codes should be entered in the half-site.
More specific information comes from binding studies of full ZFPs or ZFNs using SELEX. Searches based on experimentally determined specificity are more informative than searches with increased mismatches. If there is SELEX or similar data describing each ZFN's binding specificity, it is also entered in 5' to 3' orientation using standard IUPAC ambiguity codes (as in Figure 3). This allows relaxed specificity searches. For example, a nucleotide in a half-site that can be bound if it is either G or T can be entered as a K. Any non-specified position can be represented by an N (N=A, C, G or T). If scanning with two mismatches, the pair of half-sites should contain less than three ambiguities to prevent computational stalling (see above).
FetchGWI
ZFN-Site uses FetchGWI to perform rapid and accurate searches of the large sequence databases comprising full genomes. FetchGWI is a C program that relies on pre-computed genome indices and is best used in cases where queries must be mapped very rapidly and efficiently. To get maximal search speed, FetchGWI only searches within the index files that represent the genome sequences. There is one index entry for each nucleotide in the genome. This exhaustive index also ensures that no match can possibly be missed. Other programs, such as BLAST, occasionally scan non-overlapping words and thus can miss possible off-target sites (see below) [20].
Testing Located Off-Target Sites
Predicted genomic off-target sites should be tested for cleavage. The HTML links are used to download the sequences flanking the site, for use in designing amplification primers for either mutation or sequence analysis. The listed potential off-target sites can be assayed by PCR and mutation detection [7] or deep sequencing [5] to determine ZFN specificity.
If ZFN-Site locates more sites that match the selected criteria than can be tested, the criteria may be narrowed by using less mismatches or using less ambiguous nucleotides for relaxed searches. The list of found sites can also be narrowed using the text output. If the text output link is clicked, the found sites are outputted in another screen in order of increasing number of total mismatches. If a search is conducted using two mismatches per half-site, the output can be greatly narrowed by selecting the genomic sequences at the top of the list with three or fewer total mismatches.
This list of possible target sequences can be further analysed using other computer programs. For example, the output can be ranked using an excel spreadsheet containing a positional weight matrix based on experimentally determined specificity data as described below.
Results
ZFN-Site was validated by comparing our results to a previously published study by Perez et al. [7]. Perez et al. looked for off-target cleavage by a pair of ZFNs specific for the gene coding for human C-C chemokine receptor type 5 (CCR5). This study used an unpublished algorithm to identify potential off-target sites by scanning the human genome using in vitro SELEX selection specificity data [7]. Their sequencing of the identified off-target sites revealed that a site in the related CCR2 gene was also cleaved at a low frequency. The left and right ZFN half-sites, including ambiguities suggested from their SELEX data, were compiled and entered into ZFN-Site (Figure 3). ZFN-Site found the CCR5 target site and each of the off-target sites on their list, including the experimentally verified CCR2 off-target cleavage site (Figure 4). Additional file 1, Figure S1 contains ZFN-Site output with less than three total mismatches.
Multiple BLAST searches sometimes accomplish the same function as ZFN-Site if one inputs all possible permutations of homo/heterodimers, spacings and relaxed specificities. This can be labor-intensive. For example, six BLAST queries for permutations of the Perez et al. ZFNs could replace one ZFN-Site search without ambiguities (Figure 5). However, in contrast to ZFN-Site, BLAST does not allow ambiguous bases. While BLAST could return these sites, user intervention would be required to distinguish these from true mismatches. ZFN-Site thus simplifies the process of searching for ZFN off-target sites.
ZFN-Site locates every site matching the specified search criteria. In contrast, it has been noted that the BLAST methodology may not find every ZFN site [20]. Because BLAST searches implement a local alignment search, they are incapable of reproducing the same type of results as ZNF-Site. To compare results to the single ZFN-Site search above, six sets of BLAST searches for the CCR5 ZFN pair were done to include homo- and hetero-dimerization at both 5 bp and 6 bp spacing. Some of the sites found by Perez et al. and by ZFN-Site were not found using BLAST, although the BLAST parameters were optimized to attempt to return all matches (Additional file 2, Figure S2). The BLAST search for the right homo-dimer pair with six base spacing failed to return two sequences found by Perez et al. and ZFN-Site (numbers 10 and 11). This search returned 474 genomic sequences, many of which were too dissimilar to be likely off-target sites. Because BLAST outputs the matching portion of the sequences with the ends truncated, further user intervention was required to verify the total similarity of these sequences.
In some cases, ZFN-Site may return a large number of sequences. The degree to which one may wish to narrow a list of ZFN-Site outputs depends on the experimental means used to search for off-target cleavage and the resources for scanning multiple sites. The use of deep sequencing may require less narrowing of the list because one can quantitatively test hundreds of sites. Until more information is available on the actual prevalence of ZFN off- target cleavage, it would be desirable to test as many potential off-target sites as experimentally feasible.
A post-processing step using positional weight matrices (PWM) can be used to rank the output of ZFN-Site. Additional file 3 is an example of a spreadsheet used to rank ZFN-Site output using PWMs based on the graph of nucleotide frequencies in Perez et al. [7]. The top putative target sites could then be tested experimentally.
Conclusions
ZFN-Site is applicable to genome searches for pairs of half-sites in nucleases or other types of DNA binding proteins. Here, we have presented a user friendly interface allowing a directed search of multiple genomes and have validated its use for finding ZFN sites and off-target sites in the human genome. Experimental testing for ZFN cleavage at the potential sites found by ZFN-Site using large scale sequencing or mutation detection may provide a more thorough understanding of the determinants of ZFN specificity and allow optimization for decreased off-target cleavage events. These results can also be compared with results from other methods for detecting off-target cleavage and toxicity [13-16].
Recently, other nucleases, such as TALE nucleases, have been used for genome alteration [36-39]. While ZFN-Site was tailored to locate ZFN off-target sites, it can also be used to find targets for TALE nucleases. A spreadsheet for creating PWMs and ranking output for TALE nucleases is available upon request.
Authors' contributions
TJC provided the initial concept, methods and pseudo-code. GA redesigned the querying methods and implemented the Web interface. TJC tested and benchmarked early versions and provided spreadsheet and supplemental files. CI developed the FetchGWI interface with contribution from GA. TJC & APM wrote the manuscript with contributions from GA and PB.
Availability and requirements
ZFN-Site is available freely on our web site, http://ccg.vital-it.ch/tagger/targetsearch.html[40], and the FetchGWI source code is also available at Source Forge, http://sourceforge.net/projects/tagger/[41].
Project name: ZFN-Site
Project home page: http://ccg.vital-it.ch/tagger/targetsearch.html
Operating system(s): Platform independent
Programming language: C and Perl
Other requirements: None
License: GNU General Public License (GPL), version 2
Any restrictions to use by non-academics: No
Supplementary Material
Contributor Information
Thomas J Cradick, Email: tj@alum.mit.edu.
Giovanna Ambrosini, Email: giovanna.ambrosini@epfl.ch.
Christian Iseli, Email: christian.iseli@licr.org.
Philipp Bucher, Email: philipp.bucher@isb-sib.ch.
Anton P McCaffrey, Email: antonmccaffrey@gmail.com.
Acknowledgements
The authors would like to thank Ramona McCaffrey for editorial assistance. This work was supported by the National Institutes of Health [grant number R01 5R01AI068885-03] (TC & AM). Conflict of Interests: none declared.
References
- Porteus MH, Baltimore D. Chimeric nucleases stimulate gene targeting in human cells. Science. 2003;300(5620):763. doi: 10.1126/science.1078395. [DOI] [PubMed] [Google Scholar]
- Bitinaite J, Wah DA, Aggarwal AK, Schildkraut I. FokI dimerization is required for DNA cleavage. Proc Natl Acad Sci USA. 1998;95(18):10570–10575. doi: 10.1073/pnas.95.18.10570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Urnov FD, Miller JC, Lee YL, Beausejour CM, Rock JM, Augustus S, Jamieson AC, Porteus MH, Gregory PD, Holmes MC. Highly efficient endogenous human gene correction using designed zinc-finger nucleases. Nature. 2005. [DOI] [PubMed]
- Beumer K, Bhattacharyya G, Bibikova M, Trautman JK, Carroll D. Efficient gene targeting in Drosophila with zinc-finger nucleases. Genetics. 2006;172(4):2391–2403. doi: 10.1534/genetics.105.052829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meng X, Noyes MB, Zhu LJ, Lawson ND, Wolfe SA. Targeted gene inactivation in zebrafish using engineered zinc-finger nucleases. Nat Biotechnol. 2008;26(6):695–701. doi: 10.1038/nbt1398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morton J, Davis MW, Jorgensen EM, Carroll D. Induction and repair of zinc-finger nuclease-targeted double-strand breaks in Caenorhabditis elegans somatic cells. Proc Natl Acad Sci USA. 2006;103(44):16370–16375. doi: 10.1073/pnas.0605633103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perez EE, Wang J, Miller JC, Jouvenot Y, Kim KA, Liu O, Wang N, Lee G, Bartsevich VV, Lee YL. et al. Establishment of HIV-1 resistance in CD4+ T cells by genome editing using zinc-finger nucleases. Nat Biotechnol. 2008;26(7):808–816. doi: 10.1038/nbt1410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai CQ, Doyon Y, Ainley WM, Miller JC, Dekelver RC, Moehle EA, Rock JM, Lee YL, Garrison R, Schulenberg L. et al. Targeted transgene integration in plant cells using designed zinc finger nucleases. Plant Mol Biol. 2009;69(6):699–709. doi: 10.1007/s11103-008-9449-7. [DOI] [PubMed] [Google Scholar]
- Geurts AM, Cost GJ, Freyvert Y, Zeitler B, Miller JC, Choi VM, Jenkins SS, Wood A, Cui X, Meng X. et al. Knockout rats via embryo microinjection of zinc-finger nucleases. Science. 2009;325(5939):433. doi: 10.1126/science.1172447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hockemeyer D, Soldner F, Beard C, Gao Q, Mitalipova M, Dekelver RC, Katibah GE, Amora R, Boydston EA, Zeitler B, Efficient targeting of expressed and silent genes in human ESCs and iPSCs using zinc-finger nucleases. Nat Biotechnol. 2009. [DOI] [PMC free article] [PubMed]
- Carroll D. Progress and prospects: zinc-finger nucleases as gene therapy agents. Gene Ther. 2008;15(22):1463–1468. doi: 10.1038/gt.2008.145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Urnov FD, Rebar EJ, Holmes MC, Zhang HS, Gregory PD. Genome editing with engineered zinc finger nucleases. Nat Rev Genet. 2010;11(9):636–646. doi: 10.1038/nrg2842. [DOI] [PubMed] [Google Scholar]
- Szczepek M, Brondani V, Buchel J, Serrano L, Segal DJ, Cathomen T. Structure-based redesign of the dimerization interface reduces the toxicity of zinc-finger nucleases. Nat Biotechnol. 2007;25(7):786–793. doi: 10.1038/nbt1317. [DOI] [PubMed] [Google Scholar]
- Cornu TI, Thibodeau-Beganny S, Guhl E, Alwin S, Eichtinger M, Joung JK, Cathomen T. DNA-binding specificity is a major determinant of the activity and toxicity of zinc-finger nucleases. Mol Ther. 2008;16(2):352–358. doi: 10.1038/sj.mt.6300357. [DOI] [PubMed] [Google Scholar]
- Pruett-Miller SM, Reading DW, Porter SN, Porteus MH. Attenuation of zinc finger nuclease toxicity by small-molecule regulation of protein levels. PLoS Genet. 2009;5(2):e1000376. doi: 10.1371/journal.pgen.1000376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Radecke S, Radecke F, Cathomen T, Schwarz K. Zinc-finger nuclease-induced gene repair with oligodeoxynucleotides: wanted and unwanted target locus modifications. Mol Ther. 2010;18(4):743–753. doi: 10.1038/mt.2009.304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zinc Finger Tools. http://www.scripps.edu/mb/barbas/zfdesign/zfdesignhome.php http://www.scripps.edu/mb/barbas/zfdesign/zfdesignhome.php
- Mandell JG, Barbas CF. Zinc Finger Tools: custom DNA-binding domains for transcription factors and nucleases. Nucleic Acids Res. 2006;34(Web Server issue):W516–523. doi: 10.1093/nar/gkl209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ZiFit. http://zifit.partners.org/ZiFiT/ http://zifit.partners.org/ZiFiT/
- Sander JD, Maeder ML, Reyon D, Voytas DF, Joung JK, Dobbs D. ZiFiT (Zinc Finger Targeter): an updated zinc finger engineering tool. Nucleic Acids Res. 2010;38(Suppl):W462–468. doi: 10.1093/nar/gkq319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ZiFDB. http://bindr.gdcb.iastate.edu:8080/ZiFDB http://bindr.gdcb.iastate.edu:8080/ZiFDB
- Fu F, Sander JD, Maeder M, Thibodeau-Beganny S, Joung JK, Dobbs D, Miller L, Voytas DF. Zinc Finger Database (ZiFDB): a repository for information on C2H2 zinc fingers and engineered zinc-finger arrays. Nucleic Acids Res. 2009;37(Database issue):D279–283. doi: 10.1093/nar/gkn606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iseli C, Ambrosini G, Bucher P, Jongeneel CV. Indexing strategies for rapid searches of short words in genome sequences. PLoS ONE. 2007;2(6):e579. doi: 10.1371/journal.pone.0000579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bibikova M, Carroll D, Segal DJ, Trautman JK, Smith J, Kim YG, Chandrasegaran S. Stimulation of homologous recombination through targeted cleavage by chimeric nucleases. Mol Cell Biol. 2001;21(1):289–297. doi: 10.1128/MCB.21.1.289-297.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Handel EM, Alwin S, Cathomen T. Expanding or restricting the target site repertoire of zinc-finger nucleases: the inter-domain linker as a major determinant of target site selectivity. Mol Ther. 2009;17(1):104–111. doi: 10.1038/mt.2008.233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shimizu Y, Bhakta MS, Segal DJ. Restricted spacer tolerance of a zinc finger nuclease with a six amino acid linker. Bioorg Med Chem Lett. 2009. [DOI] [PMC free article] [PubMed]
- Miller JC, Holmes MC, Wang J, Guschin DY, Lee YL, Rupniewski I, Beausejour CM, Waite AJ, Wang NS, Kim KA. et al. An improved zinc-finger nuclease architecture for highly specific genome editing. Nat Biotechnol. 2007;25(7):778–785. doi: 10.1038/nbt1319. [DOI] [PubMed] [Google Scholar]
- Fajardo-Sanchez E, Stricher F, Paques F, Isalan M, Serrano L. Computer design of obligate heterodimer meganucleases allows efficient cutting of custom DNA sequences. Nucleic Acids Res. 2008;36(7):2163–2173. doi: 10.1093/nar/gkn059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo J, Gaj T, Barbas CF. Directed evolution of an enhanced and highly efficient FokI cleavage domain for zinc finger nucleases. J Mol Biol. 2010;400(1):96–107. doi: 10.1016/j.jmb.2010.04.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Q, Xia Z, Zhong X, Case CC. Validated zinc finger protein designs for all 16 GNN DNA triplet targets. J Biol Chem. 2002;277(6):3850–3856. doi: 10.1074/jbc.M110669200. [DOI] [PubMed] [Google Scholar]
- Segal DJ, Dreier B, Beerli RR, Barbas CF. Toward controlling gene expression at will: selection and design of zinc finger domains recognizing each of the 5'-GNN-3' DNA target sequences. Proc Natl Acad Sci USA. 1999;96(6):2758–2763. doi: 10.1073/pnas.96.6.2758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Choo Y, Klug A. Selection of DNA binding sites for zinc fingers using rationally randomized DNA reveals coded interactions. Proc Natl Acad Sci USA. 1994;91(23):11168–11172. doi: 10.1073/pnas.91.23.11168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dreier B, Segal DJ, Barbas CF. Insights into the molecular recognition of the 5'-GNN-3' family of DNA sequences by zinc finger domains. J Mol Biol. 2000;303(4):489–502. doi: 10.1006/jmbi.2000.4133. [DOI] [PubMed] [Google Scholar]
- Dreier B, Beerli RR, Segal DJ, Flippin JD, Barbas CF. Development of zinc finger domains for recognition of the 5'-ANN-3' family of DNA sequences and their use in the construction of artificial transcription factors. J Biol Chem. 2001;276(31):29466–29478. doi: 10.1074/jbc.M102604200. [DOI] [PubMed] [Google Scholar]
- Dreier B, Fuller RP, Segal DJ, Lund CV, Blancafort P, Huber A, Koksch B, Barbas CF. Development of zinc finger domains for recognition of the 5'-CNN-3' family DNA sequences and their use in the construction of artificial transcription factors. J Biol Chem. 2005;280(42):35588–35597. doi: 10.1074/jbc.M506654200. [DOI] [PubMed] [Google Scholar]
- Christian M, Cermak T, Doyle EL, Schmidt C, Zhang F, Hummel A, Bogdanove AJ, Voytas DF. Targeting DNA double-strand breaks with TAL effector nucleases. Genetics. 2010;186(2):757–761. doi: 10.1534/genetics.110.120717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li T, Huang S, Jiang WZ, Wright D, Spalding MH, Weeks DP, Yang B. TAL nucleases (TALNs): hybrid proteins composed of TAL effectors and FokI DNA-cleavage domain. Nucleic Acids Res. 2010;39(1):359–372. doi: 10.1093/nar/gkq704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller JC, Tan S, Qiao G, Barlow KA, Wang J, Xia DF, Meng X, Paschon DE, Leung E, Hinkley SJ. et al. A TALE nuclease architecture for efficient genome editing. Nat Biotechnol. 2010;29(2):143–148. doi: 10.1038/nbt.1755. [DOI] [PubMed] [Google Scholar]
- Mahfouz MM, Li L, Shamimuzzaman M, Wibowo A, Fang X, Zhu JK. De novo-engineered transcription activator-like effector (TALE) hybrid nuclease with novel DNA binding specificity creates double-strand breaks. Proc Natl Acad Sci USA. 2011;108(6):2623–2628. doi: 10.1073/pnas.1019533108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ZFN-Site. http://ccg.vital-it.ch/tagger/targetsearch.html http://ccg.vital-it.ch/tagger/targetsearch.html
- Source Forge. http://sourceforge.net/projects/tagger/ http://sourceforge.net/projects/tagger/
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.