Abstract
Linkage and association studies have mapped thousands of genomic regions that contribute to phenotypic variation, but narrowing these regions to the underlying causal genes and variants has proven much more challenging. Resolution of genetic mapping is limited by the recombination rate. We developed a method that uses CRISPR to build mapping panels with targeted recombination events. We tested the method by generating a panel with recombination events spaced along a yeast chromosome arm, mapping trait variation, and then targeting a high density of recombination events to the region of interest. Using this approach, we fine-mapped manganese sensitivity to a single polymorphism in the transporter Pmr1. Targeting recombination events to regions of interest allows us to rapidly and systematically identify causal variants underlying trait differences.
Identification of DNA sequence differences that underlie trait variation is a central goal of modern genetic research. The primary tools for connecting genotype and phenotype are linkage and association studies. In these studies, co-inheritance of genetic markers with the trait of interest in large panels of individuals is used to localize variants that influence the trait to specific regions of the genome. The localization relies on meiotic recombination events that break up linkage between markers on a chromosome. Therefore, the spatial resolution of genetic mapping is limited by the recombination rate. In practice, the recombination rate in most settings is too low to resolve mapped regions to individual genes, much less to specific variants within genes. Increasing mapping resolution requires construction of ever-larger panels of individuals and/or additional generations of recombination, and these approaches are laborious to the point of often being impractical. As a consequence, the genes and variants underlying trait variation remain unidentified for the vast majority of regions implicated by linkage or association mapping.
To address this problem, we have devised a new method for genetic mapping that precisely targets recombination events to regions of interest. The method uses recombination events that occur during mitosis rather than meiosis. Rare mitotic recombination events occur naturally when a chromosomal double strand break (DSB) is repaired by homologous recombination (HR) that leads to the formation of a recombined chromosome (1). In a heterozygous individual, cell division can then generate daughter cells with a new genotype that is completely homozygous from the recombination site to the telomere and unchanged heterozygous everywhere else (Fig. 1A); such an event is termed “loss of heterozygosity” (LOH). Individuals with LOH events at various locations in the genome have been used to construct a genetic map (2), and this and related approaches (3) can, in principle, be used to map the genetic basis of trait variation (Fig. 1B). However, this approach has been limited in practice by the very low frequency of natural mitotic recombination events.
Fig. 1.
DSBs generated by Cas9 in diploid mitotic cells can lead to mitotic recombination and loss of heterozygosity (LOH). (A) LOH can result from repair after a DSB in mitotically dividing cells, which is generated by CRISPR. (The Streptococcus pyogenes Cas9 protein is depicted as a green cartoon.) Individuals with LOH events are isolated via the loss of a heterozygous dominant marker, denoted with a green bar. The orange and purple chromosomes are homologs. (B) By measuring trait values in a panel of individuals with LOH events distributed across a region of interest, we can map genetic variants that contribute to trait variation. The process can be iterated to increase mapping resolution.
We have leveraged the CRISPR-Cas9 system to produce targeted mitotic recombination events at high frequency and at any desired location, allowing facile construction of LOH-based mapping panels. In the CRISPR (clustered, regularly interspaced, short palindromic repeats) system, the endonuclease Cas9 creates a DSB at a site specified by the targeting sequence of a bound guide RNA (gRNA) (4). Successful cutting requires the targeted sequence to be followed by an invariant protospacer-adjacent motif (PAM). In a heterozygous diploid individual, an LOH event can be generated by cutting only one chromosome, leaving its homolog intact to serve as a template for repair by HR. This is accomplished by using polymorphic heterozygous PAM sites.
To demonstrate that LOH events can be targeted to precise loci using CRISPR, we designed 95 gRNAs targeting the bacterial Streptococcus pyogenes Cas9 to sites distributed across the left arm of the yeast Saccharomyces cerevisiae chromosome 7 (Chr 7L). The gRNAs targeted heterozygous sites in a diploid yeast strain generated by crossing a lab strain (BY) and a vineyard strain (RM), using PAMs polymorphic between the two strains. After cutting, repair, and mitosis, cells in which the DSB repair led to an LOH event were isolated by fluorescence-activated cell sorting (FACS) through their loss of a telomere-proximal green fluorescent protein (GFP) gene. We picked approximately four GFP(−) lines per targeted site, for a total of 384 lines. Whole-genome sequencing demonstrated that CRISPR-induced recombination was highly effective, with LOH events in more than 95% of lines and few off-target effects (5). 75% of LOH recombination events occurred within 20 kb of the targeted site (Fig. 2A), consistent with previous measurements of LOH gene conversion tract length (6). LOH events were generated at sites across the entire targeted chromosome arm (Fig. 2A), demonstrating that our method is not limited to certain genomic contexts.
Fig. 2.
LOH events generated at sites across a chromosome arm mapped manganese sensitivity. (A) For each individual in the panel with a Chr 7L recombination event, the site of its recombination event is plotted against the site targeted for DSB formation in that individual. Individuals targeted to gain BY and RM homozygosity are plotted in orange and purple, respectively. The dashed lines enclose individuals with recombination events within 20 kb of the targeted site. The location of the Chr 7 centromere is denoted by “cen”. (B) Sensitivity to manganese vs. observed LOH recombination location. For each individual in the Chr 7L panel, the site of the LOH recombination event is plotted against manganese sensitivity, measured as colony radius after growth on 10 mM manganese sulfate plates. Orange and purple points denote individuals that are homozygous BY and RM to the left of their recombination events, respectively. (All individuals are heterozygous BY/RM to the right of their recombination events.) The gray line plots the LOD score by position along Chr 7L for manganese sensitivity. Dashed vertical lines denote the QTL support interval.
We next used the LOH panel to map quantitative traits to loci on Chr 7L. We measured growth of the 384 LOH lines in 12 different conditions, chosen because we previously mapped quantitative trait loci (QTLs) for growth in these conditions to Chr 7L (7). In parallel, we measured growth of 768 segregants from a cross between BY and RM. One of the traits, growth on 10 mM manganese sulfate, mapped to a large-effect QTL with a maximum logarithm-of-odds (LOD) score of 109.4 in the LOH panel (Fig. 2B). The confidence interval obtained with the 384 LOH lines overlapped with and was narrower (2.9 kb) than that obtained with 768 segregants (3.9 kb). The LOH-based interval contained two genes and 12 polymorphisms between BY and RM. We identified concordant QTLs of smaller effect in the two panels for eight other traits (fig. S1). Two traits mapped to a QTL of small effect in just one panel, likely due to low statistical power (fig. S2). One trait lacked a Chr 7L QTL in both panels.
To rapidly fine-map the causal variant for manganese sensitivity, we generated a second panel of LOH lines whose recombination events were all targeted to the mapped manganese sensitivity interval. We took advantage of the fact that LOH gene conversion tracts vary in length, which means that in different individuals, DSBs generated by the same gRNA can lead to slightly different LOH crossover sites, typically within 10 kb of the DSB (6). We isolated 358 GFP(−) lines generated with three gRNAs that target sites near the mapped interval. Sequencing revealed that 46 lines (13.1%) had a recombination event within the 2.9 kb QTL interval; together, the recombination events separated almost all the variants in the interval (Fig. 3A). In contrast, only 0.7% of segregants had recombination events in the interval (7). To obtain a comparable number of recombination events at this locus by random meiotic segregation, a segregant panel would require more than 7,500 lines. Thus, with targeted LOH events, we can generate very strong mitotic recombination hotspots at any region of interest (fig. S3).
Fig. 3.
Targeted high-resolution mapping of manganese sensitivity. (A) Ratio of recombination rate (in centimorgans; cM) to physical distance (in kilobases; kb) near the manganese sensitivity QTL, for the manganese fine-mapping LOH panel (black line) and a segregant panel (red line) (7). The ratio is plotted for every interval between adjacent BY/RM polymorphisms that are at least 300 bp apart. The fine-mapping panel contains recombination events between all such pairs of polymorphisms in the interval, as indicated by the observation that the ratio does not drop to zero. The 2.9 kb QTL interval is denoted with dashed lines. (B) Recombination sites of individuals in the fine-mapping panel plotted against their manganese sensitivity, as in Figure 2B, near the manganese sensitivity QTL. Dashed blue lines denote the QTL support interval for the fine-mapping panel and dashed black lines denote the QTL support interval for the whole-Chr 7L panel. Shown below the plot are all BY/RM polymorphisms in the region (black bars), as well as all open reading frames (red lines).
We measured manganese sensitivity in this fine-mapping panel (Fig. 3B). Comparison of the panel phenotypes with the breakpoint locations pinpointed a single polymorphism as responsible for increased sensitivity in BY. The variant encodes a phenylalanine in BY and a leucine in RM at position 548 of Pmr1, a manganese transporter. Six lines had recombination events between Pmr1-F548L and the closest polymorphism to the right, 402 bp away, and were either fully sensitive or resistant to manganese, depending on which Pmr1-F548L allele was homozygous in the line. One line had a recombination between Pmr1-F548L and the closest polymorphism to the left, 125 bp away, and showed the intermediate manganese sensitivity phenotype expected for a heterozygote at the causal variant. LOD score analysis of the fine-mapping panel also identified a support interval containing only Pmr1-F548L (Fig. 3B).
To directly test the effect of Pmr1 variants on manganese sensitivity, we individually engineered into BY the RM alleles of Pmr1-F548L, the two neighboring polymorphisms, and the two remaining nonsynonymous Pmr1 polymorphisms, using a CRISPR-based variant replacement approach. As expected from the LOH fine-mapping, changing phenylalanine-548 to leucine conferred significant manganese resistance, whereas none of the other four polymorphisms had a significant effect (Fig. 4).
Fig. 4.
Direct introduction of Pmr1-F548L into BY enhances manganese resistance. Boxplots of manganese sensitivity for strains with single PMR1 variants introduced from RM into BY, along with the BY and RM parental strains (first and second leftmost boxes). n ≥ 10 for all genotypes. * p < 0.001 in comparison to BY, Welch's two-sided T-test.
PMR1 encodes an ion pump that transports manganese and calcium into the Golgi (8). Pmr1 is a member of the P-type ATPase family of ion and lipid pumps found in all branches of life, and many other P-type ATPases have a conserved leucine at the position homologous to phenylalanine-548 of Pmr1. Solved structures of P-type ATPases with this leucine (9) (10) show it directly contacting ATP (fig. S4). Furthermore, mutating the homologous leucine of the rabbit calcium pump to phenylalanine decreases its function by affecting ATP binding (11). Thus, the F548L polymorphism is expected to reduce the ability of Pmr1BY to transport manganese into the Golgi, relative to Pmr1RM, consistent with BY's manganese sensitivity.
Pmr1 leucine-548 is conserved across fungi, with some species having an isoleucine or valine at the homologous position, and none with phenylalanine (fig. S5). In the S. cerevisiae population, almost all sequenced PMR1 alleles have leucine-548, with phenylalanine-548 found only in BY and other laboratory strains (12, 13) whose PMR1 alleles are likely directly related to BY (14). BY is derived from EM93, a diploid strain isolated from a fig (15). Sequencing of PMR1 in EM93 revealed that EM93 is heterozygous for Pmr1-F548L (fig. S6), suggesting that either the mutation is not laboratory-derived or that it occurred between EM93's isolation and its entry into a stock collection.
Decades of mapping studies have uncovered loci for myriad traits, but identification of the underlying genes and variants has lagged. Our CRISPR-assisted mapping approach promises to close this gap. In contrast to previous strategies, our method generates a higher density of recombination events, is easily targetable to any region of the genome, and does not require time-consuming extra generations of crossing to increase recombination frequency. Conversely, the strength of a traditional meiotic mapping panel is the ability to scan the entire genome. Complex traits, with multiple small-effect QTLs, pose a greater challenge for any mapping method. Importantly, in LOH mapping the rest of the genome outside the region targeted for LOH is held constant when a given QTL is being queried, thus effectively reducing the complexity of a trait by eliminating variance due to other segregating QTLs.
We anticipate that trait mapping with targeted LOH panels will aid efforts to understand the genetic basis of trait variation. In addition to applications in single-celled organisms, LOH panels could be generated from cultured cells, enabling in vitro genetic dissection of human traits with cellular phenotypes. In multicellular organisms, mapping resolution could be enhanced with CRISPR-directed meiotic recombination events. Indeed, the mutagenic chain reaction system developed in vivo in fruit flies (16) and mosquitos (17, 18) uses CRISPR to generate gene conversion events in meiosis with high efficiency. Additionally, LOH in early development could generate chimeric individuals. The targeted LOH method also has the potential to be applied to viable interspecies hybrids that cannot produce offspring, allowing trait variation between species to be studied genetically beyond the few systems where it is currently possible (19, 20).
In addition to their research applications, targetable endonucleases hold promise for gene therapy (21, 22). Certain disease alleles may be difficult to directly target by CRISPR because of their sequence complexity, such as the expanded trinucleotide repeats that underlie Huntington's disease. In these cases, directing a DSB to occur in the vicinity of a pathogenic allele so that it is replaced with its nonpathogenic counterpart by LOH may represent a more feasible alternative.
Supplementary Material
One Sentence Summary.
We report a method that uses CRISPR to generate targeted recombination events for rapid and systematic identification of causal variants that underlie trait differences.
Acknowledgements
We thank Kruglyak laboratory members for helpful discussion, S. Clarke for strain BY4742, G. Church for plasmids, and S. Kosuri for his flow cytometer. Funding was provided by the Howard Hughes Medical Institute and NIH grants R01 GM102308 (L.K.) and F32 GM116318 (M.J.S). Sequencing data was deposited at the Sequence Read Archive under accession no. SRP072527, and other data and code was deposited at https://github.com/joshsbloom/crispr_loh.
References and Notes
- 1.Yin Y, Petes TD. Genome-Wide High-Resolution Mapping of UV-Induced Mitotic Recombination Events in Saccharomyces cerevisiae. PLoS Genet. 2013;9:e1003894+. doi: 10.1371/journal.pgen.1003894. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Henson V, Palmer L, Banks S, Nadeau JH, Carlson GA. Loss of heterozygosity and mitotic linkage maps in the mouse. Proc. Natl. Acad. Sci. U. S. A. 1991;88:6486–6490. doi: 10.1073/pnas.88.15.6486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Laureau R, et al. Extensive Recombination of a Yeast Diploid Hybrid through Meiotic Reversion. PLOS Genet. 2016;12:e1005781. doi: 10.1371/journal.pgen.1005781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Doudna JA, Charpentier E. The new frontier of genome engineering with CRISPR-Cas9. Science (80−.) 2014;346:1258096–1258096. doi: 10.1126/science.1258096. [DOI] [PubMed] [Google Scholar]
- 5.Materials and methods are available as supporting material on Science Online
- 6.St. Charles J, Petes TD. High-Resolution Mapping of Spontaneous Mitotic Recombination Hotspots on the 1.1 Mb Arm of Yeast Chromosome IV. PLoS Genet. 2013;9 doi: 10.1371/journal.pgen.1003434. doi:10.1371/journal.pgen.1003434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bloom JS, Ehrenreich IM, Loo WT, V Lite T-L, Kruglyak L. Finding the sources of missing heritability in a yeast cross. Nature. 2013;494:234–237. doi: 10.1038/nature11867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Culotta VC, Yang M, Hall MD. Manganese transport and trafficking: lessons learned from Saccharomyces cerevisiae. Eukaryot. Cell. 2005;4:1159–65. doi: 10.1128/EC.4.7.1159-1165.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hilge M, et al. ATP-induced conformational changes of the nucleotide- binding domain of Na, K-ATPase. Nat. Struct. Biol. 2003;10:468–474. doi: 10.1038/nsb924. [DOI] [PubMed] [Google Scholar]
- 10.Toyoshima C, Mizutani T. Crystal structure of the calcium pump with a bound ATP analogue. Nature. 2004;430:529–535. doi: 10.1038/nature02680. [DOI] [PubMed] [Google Scholar]
- 11.Clausen JD, McIntosh DB, Vilsen B, Woolley DG, Andersen JP. Importance of conserved N-domain residues Thr441, Glu442, Lys515, Arg560, and Leu562 of sarcoplasmic reticulum Ca2+-ATPase for MgATP binding and subsequent catalytic steps. Plasticity of the nucleotide-binding site. J. Biol. Chem. 2003;278:20245–20258. doi: 10.1074/jbc.M301122200. [DOI] [PubMed] [Google Scholar]
- 12.Liti G, et al. Population genomics of domestic and wild yeasts. Nature. 2009;458:337–341. doi: 10.1038/nature07743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Song G, et al. AGAPE (Automated Genome Analysis PipelinE) for pan-genome analysis of Saccharomyces cerevisiae. PLoS One. 2015;10:e0120671. doi: 10.1371/journal.pone.0120671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schacherer J, Shapiro JA, Ruderfer DM, Kruglyak L. Comprehensive polymorphism survey elucidates population structure of Saccharomyces cerevisiae. Nature. 2009;458:342–345. doi: 10.1038/nature07670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Mortimer RK, Johnston JR. Genealogy of principal strains of the yeast genetic stock center. Genetics. 1986;113:35–43. doi: 10.1093/genetics/113.1.35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Gantz VM, Bier E. The mutagenic chain reaction: A method for converting heterozygous to homozygous mutations. Science (80−.) 2015;3042:1–7. doi: 10.1126/science.aaa5945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Gantz VM, et al. Highly efficient Cas9-mediated gene drive for population modification of the malaria vector mosquito Anopheles stephensi. Proc. Natl. Acad. Sci. 2015 doi: 10.1073/pnas.1521077112. 201521077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hammond A, et al. A CRISPR-Cas9 gene drive system targeting female reproduction in the malaria mosquito vector Anopheles gambiae. Nat. Biotechnol. 2015;34:1–8. doi: 10.1038/nbt.3439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Orr HA, Presgraves DC. Speciation by postzygotic isolation: Forces, genes and molecules. BioEssays. 2000;22:1085–1094. doi: 10.1002/1521-1878(200012)22:12<1085::AID-BIES6>3.0.CO;2-G. [DOI] [PubMed] [Google Scholar]
- 20.Woodruff GC, Eke O, Baird SE, Félix M-AA, Haag ES. Insights into species divergence and the evolution of hermaphroditism from fertile interspecies hybrids of Caenorhabditis nematodes. Genetics. 2010;186:997–1012. doi: 10.1534/genetics.110.120550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tebas P, et al. Gene editing of CCR5 in autologous CD4 T cells of persons infected with HIV. N. Engl. J. Med. 2014;370:901–10. doi: 10.1056/NEJMoa1300662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hsu PD, Lander ES, Zhang F. Development and Applications of CRISPR-Cas9 for Genome Engineering. Cell. 2014;157:1262–1278. doi: 10.1016/j.cell.2014.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gibson DG, et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods. 2009;6:343–5. doi: 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
- 24.Dicarlo JE, et al. Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems. Nucleic Acids Res. 2013;41:4336–4343. doi: 10.1093/nar/gkt135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Becker DM, Lundblad V. Introduction of DNA into yeast cells. Curr. Protoc. Mol. Biol. 2001;Chapter 13(Unit13.7) doi: 10.1002/0471142727.mb1307s27. [DOI] [PubMed] [Google Scholar]
- 26.Huh W-K, et al. Global analysis of protein localization in budding yeast. Nature. 2003;425:686–691. doi: 10.1038/nature02026. [DOI] [PubMed] [Google Scholar]
- 27.Bolger AM, Lohse M, Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Albert FW, Treusch S, Shockley AH, Bloom JS, Kruglyak L. Genetics of single-cell protein abundance variation in large yeast populations. Nature. 2014;506:1–19. doi: 10.1038/nature12904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Bloom JS, et al. Genetic interactions contribute less than additive effects to quantitative trait variation in yeast. Nat. Commun. 2015;6:8712. doi: 10.1038/ncomms9712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Pau G, Fuchs F, Sklyar O, Boutros M, Huber W. EBImage-an R package for image processing with applications to cellular phenotypes. Bioinformatics. 2010;26:979–981. doi: 10.1093/bioinformatics/btq046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lynch M, Walsh B. Genetics and Analysis of Quantitative Traits. 1998. pp. 319–532. [Google Scholar]
- 34.Hill JT, et al. Poly peak parser: Method and software for identification of unknown indels using sanger sequencing of polymerase chain reaction products. Dev. Dyn. 2014;243:1632–6. doi: 10.1002/dvdy.24183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Byrne KP, Wolfe KH. The Yeast Gene Order Browser: Combining curated homology and syntenic context reveals gene fate in polyploid species. Genome Res. 2005;15:1456–1461. doi: 10.1101/gr.3672305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Sievers F, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.