PEM-seq comprehensively quantifies DNA repair outcomes during gene-editing and DSB repair

Yang Liu; Jianhang Yin; Tingting Gan; Mengzhu Liu; Changchang Xin; Weiwei Zhang; Jiazhi Hu

doi:10.1016/j.xpro.2021.101088

. 2022 Jan 17;3(1):101088. doi: 10.1016/j.xpro.2021.101088

PEM-seq comprehensively quantifies DNA repair outcomes during gene-editing and DSB repair

Yang Liu ^1,^2,^3,^∗, Jianhang Yin ^1,², Tingting Gan ^1,², Mengzhu Liu ¹, Changchang Xin ¹, Weiwei Zhang ¹, Jiazhi Hu ^1,^4,^∗∗

PMCID: PMC9019705 PMID: 35462794

Summary

The repair products of double-stranded DNA breaks (DSBs) are crucial for investigating the mechanism underlying DNA damage repair as well as evaluating the safety and efficiency of gene-editing; however, a comprehensively quantitative assay remains to be established. Here, we describe the step-by-step instructions of the primer extension-mediated sequencing (PEM-seq), followed by the framework of data processing and statistical analysis. PEM-seq presents a full spectrum of repair outcomes for both genome-editing-induced and endogenous DSBs in mouse and human cells.

For complete details on the use and execution of this profile, please refer to Gan et al. (2021), Yin et al. (2019), Liu et al. (2021a), and Zhang et al. (2021).

Subject areas: Bioinformatics, CRISPR, High Throughput Screening, Molecular Biology, Sequence analysis, Sequencing

Graphical abstract

Highlights

•
PEM-seq comprehensively quantifies DSB repair outcomes
•
PEM-seq evaluates the efficiency and safety of genome-editing tools
•
PEM-seq studies the impact of DNA damage response pathways on DSB repair
•
PEM-seq identifies endogenous DNA damage sites and DNA fragment integrations

Before you begin

Double-stranded DNA breaks (DSBs) are intrinsic to DNA metabolism processes, including DNA replication (Liu et al., 2021b; Tubbs et al., 2018), transcription (Liu et al., 2021a; Meng et al., 2014), DNA damage repair (Tubbs and Nussenzweig, 2017), V(D)J recombination (Hu et al., 2015), and antibody class switch recombination (Dong et al., 2015). Besides, the emerging nuclease-mediated genome editing also induces DSBs, including FokI domain-containing nucleases, transcription activator-like effector nucleases (TALENs), and clustered regularly interspaced palindromic repeats (CRISPR)-Cas (Li et al., 2020). DSBs are sealed by two main DSB repair pathways in mammalian cells, homologous recombination (HR) and non-homologous end joining (NHEJ) (Figure 1A). Although alternative end joining (a-EJ), including microhomology-mediated end joining (MMEJ), is discovered when NHEJ is deficient, it also participates in DSB repair in the presence of NHEJ and HR (Figure 1A). These DSB repair pathways are triggered under different scenarios and generate diverse repair outcomes that are mirrors of both DSBs and involved repair process(es) (Liu et al., 2021a). For instance, the repair of a single DSB would lead to perfect re-joinings, small indels, microhomology-mediated deletions, and large deletions (Figure 1A). While the repair of multiple DSBs induce not only the above mentioned products but also large deletions and intra- or inter-chromosomal translocations (Figure 1A). Regarding genome editing, large deletions and translocations usually are unwanted editing products. With these regards, a quantitative assay to profile a full spectrum of repair outcomes is urgently demanded to study DSB repair pathway(s) and evaluate the efficiency and safety of gene-editing tools (Saha et al., 2021).

The design and experimental procedure of PEM-seq

(A) DSB repair pathways and repair outcomes. A DSB formed at the designed site, termed bait DSB, is mainly repaired by two types of repair processes, which are characterized by the end resection or not. With ends resection, bait DSB will be repaired by 1) the homologous recombination (HR) to form an error-free product that is identical to the reference sequence of bait DSB, termed perfect re-joining, 2) microhomology-mediated end joining (MMEJ) inducing microhomology (MH)-mediated deletions, or 3) non-homologous end joining (NHEJ) producing large deletions if 3′-overhang at the resected ends are removed and DNA ends are ligated. Without end resection, bait DSB will be re-joined by NHEJ to form perfect re-joining and small insertion or deletions (indels). When another DSB (prey DSB) is formed simultaneously with the bait DSB, they may join together and form intra- or inter-chromosomal translocations. The orange boxes are microhomology around the bait DSB.

(B) Procedures for the preparation of PEM-seq library and following analysis. With one round primer extension of a biotinylated primer and the following on-beads ligation with the barcoded bridge adapter, PEM-seq captures and quantifies multiple types of DSB repair outcomes at the bait DSB and genome-wide translocations. The major steps of PEM-seq are shown on the left; the right panel highlights indicated operations. Green boxes, with yellow-shadow background, show the bio-primer targeted regions. RMB, random molecular barcode.

To capture and quantify the DSB repair outcomes, we developed the primer extension-mediated sequencing (PEM-seq) by using primer-extension amplification and introducing random molecular barcode (RMB) (Liu et al., 2021a; Yin et al., 2019). As shown in Figure 1B, the procedure starts with the generation of DSB at the target site, followed by a limited duration to allow the formation of DSB repair products (Step 1). Extracted genomic DNA (Step 2) is sheared to 300–700 bp fragments by sonication (Step 3). All products containing the complementary sequence of the biotinylated primer are then amplified by a one-round primer extension (Step 4). After removal of exceeded biotinylated primer (Step 5), biotin-labeled ssDNA is enriched by streptavidin C1 beads (Step 6) and ligated with a bridge adapter containing a 14-bp RMB (Step 7). Adapter-ligated ssDNA fragments are subjected to nested PCR (Step 8), size selection (Step 9), amplification with indexed Illumina primers (Step 10), size selection again (Step 11), and finally are sequenced by Hi-seq with 2×150 bp reads (Step 12). PEM-Q is applied to data processing and statistical analysis (Step 13) (Liu et al., 2021a).

PEM-seq can be used to determine the editing efficiency, off-target sites, and unwanted products of genome editing (Yin et al., 2019; Zhang et al., 2021). In addition, PEM-seq can also be applied to interrogate the DSB level, endogenous DSB hotspots, DSB repair pathway choice, and the underlying molecular mechanism (Liu et al., 2021a). Though couples of high-throughput sequencing approaches have been developed to evaluate the efficiency or off-target activity of gene-editing tools, including LAM-HTGTS, GUIDE-seq, DISCOVER-seq, CIRCLE-seq, SITE-seq, Digenome-seq, BLESS, etc., which have been well-reviewed elsewhere (Hu et al., 2016; Kim et al., 2019). However, PEM-seq is the only one that quantitively profiles a full spectrum of repair outcomes. Here, we provide a step-by-step protocol for PEM-seq, based on our earlier publications (Gan et al., 2021; Liu et al., 2021a; Yin et al., 2019; Zhang et al., 2021). The protocol described below depicts the specific steps for using HEK-293T cells. However, we have also successfully applied this protocol in human primary T cells, cancer cell lines (HeLa, MRC-5, K562, etc.), and mouse abelson virus-transformed pro-B, CH12F3, and mouse embryonic stem cells (mESCs). Before initiating the experiment, the audience should select a desired DSB site, termed the bait DSB, and design primers for PEM-seq.

Bait DSB selection

PEM-seq depends on the use of recurrent DSB as bait DSB to achieve the best performance. If there is a recurrent DSB site in your experiments, such as V(D)J recombination loci in lymphocytes, antibody class-switch recombination loci in B cells, or genome editing target sites, please skip this section and start to design primers for PEM-seq analysis. If not, the audience should introduce a bait DSB into the cell. PEM-seq is compatible with multiple types of DSB, including blunt ends, sticky ends, ends with adducts, hairpin, and nick transformed DSBs (Figure 2A). Therefore, the bait DSB can also be generated by a broad scope of enzymes, such as AsiSI, I-SceI, RAG, AID, transposons (e.g., Cre), and genome editing tools, e.g., FokI domain-containing nucleases, TALENs, and CRISPR-Cas. Typically, we use CRISPR-SpCas9 to introduce the bait DSB at the c-Myc locus in mouse and human cells, which are provided in Table 1.

Principles for PEM-seq analysis

(A) PEM-seq is compatible with multiple types of bait and prey DSBs. Left: different types of bait DSBs, including blunt or sticky ends, ends with adducts, hairpin, nick-transformed DSB, etc., that can be analyzed by PEM-seq. Right: both off-target dependent and independent DSBs can be captured and analyzed by PEM-seq.

(B) Principles for the primer design. Generally, the distance from the start site of the nested primer to the cleavage site ranges from 50 to 110 bp, termed bait length, and the optimized distance between biotinylated primer (bio-primer) and nested primer is 10–50 bp.

(C) Definition of the repair products in PEM-Q. Events located in the ±500 kb of bait DSB are grouped into insertions or deletions, and events out of the ±500 kb of bait DSB are translocations.

Table 1.

Bait DSBs for PEM-seq analysis

REAGENT or RESOURCE	SOURCE	IDENTIFIER
Chemicals and recombinant proteins

Nuclease-free water	Milli-Q, 0.22 mm filtered	N/A
DMEM	Corning	Cat#10-013-CV
opti-MEM	Gibco	Cat#31985070
Fetal Bovine Serum	Gibco	Cat#10091148
L-Glutamine	Corning	Cat#25-005-CI
Penicillin-Streptomycin Solution, 100×	Corning	Cat#30-002-CI
β-mercaptoethanol	Sigma-Aldrich	Cat#M3148
1× PBS, pH 7.4	Gibco	Cat#10010031
Polyethylenimine	Sigma-Aldrich	Cat#919012
Trypsin	Corning	Cat#25-052-CV
1 M Tris-HCl, pH 7.5	Invitrogen	Cat#15567027
5 M NaCl	Sigma-Aldrich	Cat#S6546
0.5 M EDTA, pH 8.0	Invitrogen	Cat#15575020
10% (wt/vol) SDS solution	Invitrogen	Cat#15553027
Proteinase K (20 mg/mL)	Invitrogen	Cat#AM2546
Isopropanol	Sigma-Aldrich	Cat#I9516
Ethanol	Sigma-Aldrich	Cat#1085430250
10× Isothermal Amplification Buffer II Pack	New England BioLabs	Cat#B0374S
Bst 3.0 DNA Polymerase	New England BioLabs	Cat#M0374L
dNTPs, 2.5 mM each	TransGen Biotech	Cat#AD101-01
Betaine solution, 5M	Sigma-Aldrich	Cat#B0300
Triton X-100	Sigma-Aldrich	Cat#T8787
Sodium hydroxide solution, 10 M	Sigma-Aldrich	Cat#72068
10× T4 DNA ligase buffer	Thermo Scientific	Cat#EL0011
T4 DNA ligase	Thermo Scientific	Cat#EL0011
PEG8000	Sigma-Aldrich	Cat#89510
10× EasyTaq buffer	TransGen Biotech	Cat#AP111-01
EasyTaq DNA Polymerase	TransGen Biotech	Cat#AP111-01
AMPure XP	Beckman Coulter	Cat#A63880
5× FastPfu buffer	TransGen Biotech	Cat#AP221-01
FastPfu DNA Polymerase	TransGen Biotech	Cat#AP221-01
Agarose	Thermo Scientific	Cat#75510019
Trans DNA Marker II	TransGen Biotech	Cat#BM411-01
6× DNA loading buffer	Beyotime Biotech	Cat#D0072
50× TAE buffer	Thermo Scientific	Cat#B49
GelRed Nucleic Acid Stain 10000× Water	Merck Millipore	Cat#SCT123
GeneJET Gel Extraction Kit	Thermo Scientific	Cat#K0692

Critical commercial assays

Dynabeads MyOne streptavidin C1 beads	Invitrogen	Cat#65002

Experimental models: Organisms/strains

Human cell line: HEK-293T	Lab stock	N/A

Oligonucleotides

See Tables 1 and 2 for details	Sangon Biotech	N/A

Recombinant DNA

pX330	(Cong et al., 2013)	Addgene; Cat#42230
pX330-MYC1	Lab stock	N/A; sgRNA targeting human c-MYC is cloned into pX330 by BsaI
pX330-GFP	Lab stock	N/A; SpCas9 is replaced by EGFP

Deposited data

Example sequencing datasets & Original pictures for Figure 3	This paper; Mendeley Data	http://doi.org/10.17632/gjhk3wk4h4.1

Software and algorithms

ImageJ 1.53c	(Schneider et al., 2012)	https://imagej.nih.gov/ij/download.html
PEM-Q	(Liu et al., 2021a)	https://github.com/liumz93/PEM-Q
Circos 0.69	(Krzywinski et al., 2009)	http://circos.ca/software/download/circos/
Prism 8	GraphPad Software	https://www.graphpad.com/scientific-software/prism/

Others

CO2 incubator	Nuaire	Cat#NU-5700
Thermomixer	Eppendorf	Cat#5382000023
Centrifuge	Eppendorf	Cat#5406000291
Spectrophotometer	DeNovix	Cat#DS-11 FX+
M220 Focused-ultrasonicator	Covaris	Cat#500295
MicroTUBE-130 AFA Fiber	Covaris	Cat#520045
PCR machine	Eppendorf	Cat#6336000074
DynaMag-2	Invitrogen	Cat#12321D
DynaMa-PCR Magnet	Invitrogen	Cat#492025
Vortex Genie 2	VWR	Cat#G560E
VWR tube rotator UK plug	VWR	Cat#10136-084
Electrophoresis system	Thermo Scientific	Cat#FB-SBR-2025
UV Transilluminators	UVP	Cat#95-0461-02
ChemiDoc MP imaging system	Bio-Rad Laboratories	Cat#17001402
1.5 mL Microtubes	Axygen	Cat#MCT-150-C
0.2 mL PCR strip tubes	Axygen	Cat#PCR-0208-CP-C

Reagent	Final concentration	Amount
DMEM	n/a	440 mL
Fetal bovine serum	10% (vol/vol)	50 mL
L-Glutamine, 100× (vol/vol)	1× (vol/vol)	5 mL
Penicillin-Streptomycin Solution, 100× (vol/vol)	1× (vol/vol)	5 mL
β-mercaptoethanol (50 mM)	50 μM	0.5 mL
Total	n/a	500 mL

Reagent	Final concentration	Amount
Polyethylenimine	1 mg/mL	10 mg
ddH₂O	n/a	10 mL
Total	n/a	10 mL

Reagent	Final concentration	Amount
Tris-HCl, pH 7.5 (1 M)	10 mM	0.5 mL
NaCl (5 M)	200 mM	2 mL
EDTA, pH 8.0 (0.5 M)	2 mM	0.2 mL
10% SDS (wt/vol)	0.2% (wt/vol)	1 mL
ddH₂O	n/a	46.3 mL
Total	n/a	50 mL

Reagent	Final concentration	Amount
Tris-HCl, pH 7.5 (1 M)	5 mM	0.25 mL
NaCl (5 M)	1 M	10 mL
EDTA, pH 8.0 (0.5 M)	1 mM	0.1 mL
ddH₂O	n/a	39.65 mL
Total	n/a	50 mL

Reagent	Final concentration	Amount
Bridge adapter-upper/-lower	400 μM	100 nmol
ddH₂O	n/a	250 μL
Total	n/a	250 μL

Reagent	Final concentration	Amount
PEG8000	50% (wt/vol)	5 g
ddH₂O	n/a	10 mL
Total	n/a	10 mL

Reagent	Final concentration	Amount
NaOH (10 M)	10 mM	0.05 mL
ddH₂O	n/a	49.95 mL
Total	n/a	50 mL

Parameter	Setting value
Temperature (°C)	4
Peak Incident Power (W)	50
Duty Factor (%)	20
Treatment Time (sec)	50–65
Cycles per Burst (cpb)	200

Reagent	Final concentration	Amount
10× Bst buffer	1×	16 μL
Bio-primer-MYC1 (+) (1 μM)	25 nM	4 μL
5 M Betaine	1 M	32 μL
sonicated DNA	6.25–250 ng/μL	1–40 μg
ddH₂O	n/a	To 160 μL
Total	n/a	160 μL

PCR cycling conditions
Steps	Temperature	Time	Cycles
Initial Denaturation	95°C	3 min	1
Denaturation	95°C	2 min	5 cycles
Annealing	58°C	3 min	5 cycles
Final Annealing	58°C	3 min	1
Hold	10°C	Forever

PCR cycling conditions
Steps	Temperature	Time	Cycles
Primer extension	65°C	15 min	1
Inactivation	80°C	5 min	1
Hold	25°C	Forever

PCR cycling conditions
Steps	Temperature	Time	Cycle
Denaturation	95°C	3 min	1
Annealing	85°C, Ramp at 0.1°C/s	1 min	1
Annealing	80°C, Ramp at 0.1°C/s	1 min	1
Annealing	75°C, Ramp at 0.1°C/s	1 min	1
Annealing	70°C, Ramp at 0.1°C/s	1 min	1
Annealing	65°C, Ramp at 0.1°C/s	1 min	1
Annealing	60°C, Ramp at 0.1°C/s	1 min	1
Annealing	55°C, Ramp at 0.1°C/s	1 min	1
Annealing	50°C, Ramp at 0.1°C/s	1 min	1
Annealing	45°C, Ramp at 0.1°C/s	1 min	1
Annealing	40°C, Ramp at 0.1°C/s	1 min	1
Annealing	35°C, Ramp at 0.1°C/s	1 min	1
Annealing	30°C, Ramp at 0.1°C/s	1 min	1
Hold	10°C	forever

PCR cycling conditions
Steps	Temperature	Time	Cycles
Initial Denaturation	95°C	5 min	1
Denaturation	95°C	1 min	15 cycles
Annealing	58°C	45 s
Extension	72°C	1 min
Final Extension	72°C	5 min	1
Hold	10°C	forever

Reagent	Final concentration	Amount
10× Bst buffer	1×	4 μL
dNTPs (2.5 mM each)	50 μM	4 μL
Bst 3.0 DNA polymerase (8 U/μL)	0.1 U/μL	2.5 μL
ddH₂O	n/a	29.5 μL
Total	n/a	40 μL

Reagent	Final concentration	Amount
Denatured DNA (from step 33)	n/a	200 μL
NaCl (5 M)	1 M	50 μL
EDTA (0.5 M, pH 8.0)	5 mM	2.5 μL
10% (vol/vol) Triton X-100	0.02%	0.5 μL
Total	n/a	253 μL

Reagent	Final concentration	Amount
Bridge adapter-upper (400 μM)	200 μM	20 μL
Bridge adapter-lower (400 μM)	200 μM	20 μL
Total	n/a	40 μL

Reagent	Final concentration	Amount
ssDNA on C1 beads (from step 44)	n/a	42.4 μL
10× T4 DNA ligase buffer	1×	8 μL
Bridge adapter (50 μM from step 47)	1 μM	1.6 μL
T4 DNA ligase (400 U/μL)	20 U/μL	4 μL
50% (wt/vol) PEG8000	15%	24 μL
Total	n/a	80 μL

Reagent	Final concentration	Amount
On-beads ligation products (from step 55)	n/a	73 μL
10× EasyTaq buffer	1×	10 μL
dNTPs (2.5 mM each)	200 μM	8 μL
I5-Nested-1-MYC1 (+) (10 μM)	400 nM	4 μL
I7-index primer (10 μM)	400 nM	4 μL
EasyTaq DNA polymerase (5 U/μL)	0.05 U/μL	1 μL
Total	n/a	100 μL

Reagent	Final concentration	Amount
PCR products (from step 67)	n/a	63 μL
5× FastPfu buffer	1×	20 μL
dNTPs (2.5 mM each)	200 μM	8 μL
P5-I5 (10 μM)	400 nM	4 μL
P7-tag (10 μM)	400 nM	4 μL
FastPfu DNA polymerase (2.5 U/μL)	0.025 U/μL	1 μL
Total	n/a	100 μL

Example information for PEM-Q analysis
Name of bait DSB	Genome	Name	Cut-site	Chromosome	P-start	P-end	P-strand	P-sequence (5' > 3′)
Myc6	mm10	Name of your .fastq file	61986726	chr15	61986633	61986652	+	GGAAACCAGAGGGAATCCTC
MYC1 (+)	hg38	Name of your .fastq file	127743326	chr8	127743238	127743257	+	CCTCAGAATAGGAGAGAGTG
MYC1 (–)	hg38	Name of your .fastq file	127743326	chr8	127743381	127743400	–	AGAGCCATTCTCTGGCTCAG
MYC-YH	hg38	Name of your .fastq file	127738978	chr8	127738879	127738898	+	AGTCCTGCGCCTCGCAAGAC

Information for vector integration analysis only (Optional)
Name	Name of the vector sequence file (.fa file)	Genome	Chromosome	P-strand	sgRNA-start	sgRNA-end

Editing events after PEM-Q analysis
Types	Categories	Definition	Analysis	Files
Editing events	All editing events	Deletions, insertions, inversions, and translocations	Editing efficiency; Frequency of each type of editing events, & The top rank of editing events	∗_Editing_events.tab ∗_Editing_events_dot_plot.pdf ∗_Editing_events.html ∗_statistics.txt
Deletions	Small deletions	Deletions, 1–100 bp around the bait DSB	Length distribution & Microhomology	∗_Deletion.tab ∗_deletion_length.txt ∗_del_len_statistics.txt
Deletions	Large deletions	Deletions, 0.1–500 kb around the bait DSB	Length distribution & Microhomology
Insertions	Small insertions	Insertions, < 20 bp in size, 0–500 kb around the bait DSB	Length distribution, Inserted sequences & Plasmid integration	∗_Insertion.tab ∗_insertion_length.txt ∗_inser_len_statistics.txt
Insertions	Large insertions	Insertions, ≥ 20 bp in size, 0–500 kb around the bait DSB
Translocations	Intra-chromosomal translocations	Junctions on the bait chromosome, but out of the +/- 500 kb around the bait DSB	Prey DSBs & Off-targets analysis	∗_Translocation.tab
Translocations	Inter-chromosomal translocations	Junctions on other chromosomes	Prey DSBs & Off-targets analysis	∗_Translocation.tab
Vector integrations	Vector integrations	Junctions on vector	Distribution of vector integrations	∗_all_vector_2.2.tab

Libray name	Control	SpCas9-MYC1 treated
Host & cell lines	Human, HEK-293T	Human, HEK-293T
Bait DSB	None	SpCas9-MYC1
Bio-primer & nested primer	Bio-/nested MYC1 (+)	Bio-/nested MYC1 (+)
Events	Hits or percentage	Hits or percentage
NoJunction (perfect re-joinings or non-cuttings)	587,577	118,768
Deletions	2,455	104,408
Small_deletions (<=100bp)	2,368	99,899
Large_deletions (>100bp)	87	4,509
Insertions	2,062	41,190
1_bp insertions	472	25,977
Small_insertions (<20bp)	985	34,048
Large_insertions (≥20bp)	1,077	7,142
Translocations	106	11,170
Vector integrations	106	3,119
Editing events	4,623	156,768
Total events	592,222	282,624
Editing efficiency (%)	0.78%	55.47%
Deletions (%)	0.41%	36.94%
Insertions (%)	0.35%	14.57%
Translocations (%)	0.02%	3.95%

PERMALINK

PEM-seq comprehensively quantifies DNA repair outcomes during gene-editing and DSB repair

Yang Liu

Jianhang Yin

Tingting Gan

Mengzhu Liu

Changchang Xin

Weiwei Zhang

Jiazhi Hu

Summary

Graphical abstract

Highlights

Before you begin

Figure 1.

Bait DSB selection

Figure 2.

Table 1.

Primer design

Table 2.

Key resources table

Materials and equipment

Step-by-step method details

Generating bait DSB

Isolating genomic DNA

Sonication

Primer extension

Primer removal

Streptavidin purification

On-beads ligation

Nested PCR

Size selection

Tagged PCR

Size selection

High-throughput sequencing

Sequence reads processing

Expected outcomes

Figure 3.

Quantification and statistical analysis

Table 3.

Table 4.

Figure 4.

Limitations

Troubleshooting

Problem 1

Potential solution

Problem 2

Potential solution

Problem 3

Potential solution

Problem 4

Potential solution

Problem 5

Potential solution

Problem 6

Potential solution

Resource availability

Lead contact

Materials availability

Acknowledgments

Author contributions

Declaration of interests

Contributor Information

Data and code availability

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Off target	Chr	Start position	End position	Off-target sequences with PAM (5' > 3′)	Junctions in treated cells
OT1	chr9	127166926	127166949	AGGAAGTGGAGCTTGGCCTT GGG	62
OT2	chr8	19171846	19171869	AGGAAGTGGAGCTTGGCCTT GGG	31
OT3	chr8	19171768	19171791	GGGGTGTGGAGCTTGACTAT GAG	31
OT4	chr4	144444276	144444299	TGGGAGTGGAGCTTGGTTTT GGG	25
OT5	chr3	15065290	15065313	AGGATGAAGAGATTGGCTAT GGG	24
OT6	chr12	21299145	21299168	GGGAAGTGGAACCTGGCTCT GGG	14
OT7	chr3	159731043	159731066	TGGATGTGCAGCCTGGCTAT TGG	10
OT8	chr8	19171905	19171928	GGAATTTGGCGCTTGATTAT AGA	5
OT9	chr12	3250772	3250795	GGTATGCAGAGCTTGGCTTT CGG	3