Abstract
Robust technology has been developed to systematically quantify fitness landscapes that provide critical opportunities to improve our understanding of drug resistance and define new avenues to develop drugs with reduced resistance susceptibility. We outline the critical importance of drug resistance studies and the potential for fitness landscape approaches to contribute to this effort. We describe the major technical advancements in mutational scanning, which is the primary approach used to quantify protein fitness landscapes. There are a number of complex steps to consider in planning and executing mutational scanning projects including developing a selection scheme, generating mutant libraries, tracking the frequency of variants using next-generation sequencing, and processing and interpreting the data. Key experimental parameters impacting each of these steps are discussed to provide to aid in planning fitness landscape studies. There is a strong need for improved understanding of drug resistance, and fitness landscapes provide a promising new approach.
Introduction
The promise of fitness landscapes to study drug resistance
The efficient and systematic quantification of drug resistant mutations provides a powerful tool that can contribute to the development of inhibitors with reduced potential for resistance (Figure 1). Once a selection scheme and library have been developed, fitness landscapes can be efficiently generated in parallel with hundreds to thousands of inhibitors. These comprehensive maps of mutations that are compatible with function in the presence of inhibitors could be mined to determine structure-resistance relationships and to guide the development of inhibitors with reduced susceptibility to resistance. In addition, resistant maps also provide the opportunity to identify inhibitors with non-overlapping resistance profiles that could be used in combination to decrease susceptibility to resistance. With the available advancements in technology and the increasing threats to human health from drug resistance, it is an apt time to apply fitness landscapes to understand and combat resistance.
Figure 1.

Fitness landscapes can assess drug resistance potential early in the drug development process. The current throughput of mutational scans is compatible with rapid and comprehensive identification of drug resistant mutations for thousands of compounds. Determination of resistance landscapes at the lead development stage of drug development would enable rationale strategies to limit the evolution of resistance.
A number of studies have demonstrated the biological relevance of resistance fitness landscapes. In bacteria, a number of groups have determined fitness landscapes of β-lactamase with inhibitors1 that have recapitulated clinically relevant mutations as well as provided an understanding of the mechanism underlying resistance. In bacteria, mutational scanning has also been utilized to identify resistant mutations in deoxyxylulose phosphate reductoisomerase to the inhibitor fosmidomycin2, and in RNA polymerase to the inhibitor rifamycin3. These studies demonstrate the feasibility of using fitness landscapes to comprehensively identify drug resistant mutations.
Systematic mutational scanning approaches have also identified clinically relevant drug resistant mutations in oncogenes and viruses. Prior to clinical observation, a fitness landscape of the BRAFV600E oncogene that causes melanoma identified L505H as a resistant-causing mutation. While most clinical resistance to BRAF drugs have been caused by mutations in other genes (e.g. MEK), L505H was subsequently identified as causing resistance in some patients4. In BCR-ABL1-driven leukemia, clinically relevant resistance mutations have been well studied and are predominantly caused by mutations in BCR-ABL1. A fitness landscape in mammalian cells of a region of BCR-ABL1 near the active site revealed a strong correlation between the clinical frequency of resistant mutations and experimental measurements5. A nearly complete fitness map of ERK2 mutations6 identified 15-fold more resistant mutations than a prior study using random mutagenesis. Similarly, in multiple studies of viruses, systematic mutational scans have both recovered known resistance mutations and identified previously unappreciated resistant variants7, 8. Together, these findings demonstrate the ability of fitness landscapes to predict clinical resistant outcomes and serve as comprehensive guides in developing inhibitors with reduced susceptibility to resistance.
Development of approaches to measure an experimental fitness landscape
The tremendous efficiency of next-generation sequencing powers most approaches to determine fitness landscapes (Figure 2). Next-generation sequencing yields hundreds of millions of individual reads in a lane, enabling tracking of the frequency of a hundred thousand mutational variants in a single experiment9. This approach can be used to follow randomly introduced mutations10 or when combined with systematic mutational approaches to illuminate comprehensive fitness landscapes11. These sequencing-driven approaches have broad applications in the investigation of sequence-function-fitness relationships including drug resistance12. The general approach, termed mutational scanning, is sufficiently robust and effective that it has been applied by a large number of laboratories to a wide array of genes and selection schemes. Here, we provide a guide to the use of systematic fitness landscapes for the analysis of drug resistance.
Figure 2.

Mutational scanning approach to measure fitness landscapes. First, a DNA library is generated that includes all possible amino acid changes in a target gene. Next, the library is introduced into a biological form, usually cells. Each cell will contain a single mutant copy. In the next step, a selection is performed on the cells to enrich or deplete variants based on their function. For example, a drug may be applied that will hinder the growth of cells harboring a mutant with normal or defective function, but permit the growth of cells harboring a resistant mutant. Finally, next-generation sequencing is used to estimate the frequency of each mutant before and after selection. Enrichment or depletion of a mutant in the sequencing reactions is used to calculate the function of the variant under the selection conditions.
Approaches to determine fitness landscapes have undergone many key developments since the initial report from about 10 years ago11. For many applications, systematic and extensive coverage of all possible mutations is critical. For example, in the analysis of drug resistance, any mutation that is unassessed causes a blind spot that hinders clinical predictions and development of inhibitors with reduced likelihood of resistance evolution. The development of approaches to systematically generate and analyze all possible amino acid changes was critical for the generation of comprehensive fitness landscapes11, 13. Another key technical advancement has been the use of nucleic acid barcodes to track mutant variants5, 14, 15. The pre-association of barcodes with mutations and subsequent tracking of barcodes reduces the impacts of sequencing errors on fitness estimates and enables the analyses of mutations spread across large regions of a gene that cannot be directly sequenced by the limited read length of next-generation sequencing platforms. Numerous additional approaches to generate and analyze systematic mutations have been developed and these are covered later in this work. These approaches provide a broad technical foundation for quantifying fitness landscapes and investigating the underpinnings of function and resistance.
A wide array of selections have been developed for mutational scanning readouts that range from in vitro affinity to growth impacts in microbes, viruses, and mammalian cells. Many selections for in vitro binding involve the use of either yeast16, 17 or phage display18 where binding is quantified based on the enrichment of variants in bound populations. In these binding studies, fluorescently labeled ligands are frequently used so that flow accelerated cell sorting (FACS) can be applied to isolate bound and unbound fractions. In a conceptually related approach, the separation of active and inactive enzyme variants using microfluidic sorting has provided a fitness landscape of a glycosidase with a fluorogenic substrate19. GFP fusions can be used to analyze the steady state expression level of mutant variants that can serve as a proxy for function for some genes20. While in vitro assays can provide fitness landscapes, in our experience assays based on growth rate have generally proven faster to develop and more reproducible. Growth assays for mutational scans have been developed in bacteria1, 21, yeast15, 22, viruses23, and mammalian cells5, 6, 24, 25. Of note, the relationship between protein function and fitness are typically non-linear, making it challenging to relate the two types of measurements17, 26. While both in vitro and growth assays can provide information on drug resistance mutations, growth provides direct experimental fitness measurements that should be of greater biological relevance.
Steps involved in using a fitness landscape to study drug resistance
Developing a selection to distinguish the function of mutant variants
More than any other step, the development of a robust selection step is critical to the success of fitness landscape studies. There are a large number of potential options that can have tremendous impacts on the reproducibility of results, the clarity of interpretations, and biological relevance. Ideally, planning the selection system should be done with a deep understanding of the gene and organism of focus so that the studies are both feasible and relevant. We suggest the following as a rough guideline in considering different selection options.
Because of the influence of the selection scheme on conceptual conclusions, we recommend pilot experiments testing multiple options. For example, there may be three potential cell lines that could be used to study drug resistance in an oncogene. While you may prefer one cell line because of its genetic relationship to the native cancer, it may prove challenging to work with experimentally (e.g., it may be challenging to efficiently introduce mutant variants into the genome). Pilot experiments with mutant variants including positive controls (e.g., those with known resistance) and negative controls (e.g., internal stop codons that destroy function) can provide information regarding feasibility including the efficiency of genetic manipulation as well as biological relevance. The efficiency of genetic manipulation influences the complexity of variants that can be analyzed. As a general guideline, we aim for 10-fold or greater coverage of the number of variants in a library at each genetic manipulation step. This limits bottlenecks that can skew the frequency of or eliminate variants from the analyses. We also suggest analyzing a small library focusing on a critical region of the gene as a valuable approach to assess potential selection schemes. These pilot studies can also be used to explore different strengths and lengths of selection to identify experimental parameters that provide the best signal to noise. Performing a biological repeat with the small library will reveal the accuracy of measurements in each setup. In all selection experiments, it is important to have an unselected control in order to quantify mutant frequency in the absence of selection. We often analyze variant frequencies in the plasmid library and just after introduction into the selection system (e.g., integration into the chromosome) as separate assessments of mutant frequencies without selection.
Prior studies in different systems can also provide a guideline for planning selection schemes. In studies of resistance in oncogenes, we suggest performing selection for growth in an immortalized cell line with variants integrated into a consistent location in the genome and tracked using a barcoding strategy because this provides results that are highly reproducible and predictive of clinical prevalence5. Similarly, in studying bacterial resistance, we recommend performing selections that mimic the clinical setting including genetic location (plasmid or genome depending on the gene) and similarity to the pathogenic strain or species if the actual pathogen is not feasible. Viruses provide some unique challenges because their compact genomes may not allow for neutral barcodes, though this has not been extensively examined. The tracking of mutant frequencies in viruses has typically involved directly sequencing of mutations, which can require specialized strategies to reduce the noise from misreads8. If growth assays are not feasible for viruses (e.g. due to biosafety concerns), in vitro selection schemes can provide reasonable alternatives. When using in vitro selections, we suggest examining the relationship between protein function and growth rate in order to predict growth effects from in vitro findings.
Generating systematic libraries of mutant variants
A wide array of approaches have been used to generate systematic libraries of mutants. These include approaches based on Kunkel mutagenesis27, cassette ligation11, 13, various PCR strategies28, and nicking mutagenesis29. While we reference these here for completeness, in our opinion commercial options for gene synthesis of site saturation libraries provide a reliable option and are the approach that we use for most current projects. Generating systematic libraries involves extensive optimization and dedication using all approaches that we have explored. Many companies use oligonucleotide directed gene synthesis approaches to generate linear DNA encoding all 19 possible amino acid changes at a position for a cost of about $50 per position. Of note, prior fitness landscape studies have demonstrated that synonymous mutations tend to have little to no observable impact. For most fitness landscape applications, it is sufficient to measure one codon for each amino acid. When generating libraries, we recommend leaving at least 20 base pairs at either end invariant for use as handles (e.g., primer binding sites) for downstream manipulations. As we describe below, these libraries can then be easily transferred to different plasmids and barcoded.
Transferring libraries into plasmids and genomes, and barcoding
Libraries can be efficiently transferred into plasmids and barcoded for most applications (Figure 3). We transfer libraries into plasmids using either Gibson Assembly or NEBuilder HiFi DNA Assembly enzyme blends (both from New England Biolabs). Following the vendor recommended conditions, we routinely obtain greater than 100,000 independent transformants from a single reaction. The efficiency and flexibility of this process makes it simple and fast to transfer complex libraries while maintaining variant diversity. For libraries with larger than 10,000 variants, the reactions can be scaled up to achieve 10x coverage.
Figure 3.

Outline of approaches to transfer libraries into plasmids and add barcodes. (A) Regions of 20–30 identical bases enable efficient transfer of mutant versions of a gene into a plasmid using Gibson Assembly (NEB) or related enzyme blends. (B) Duplex DNA barcodes (e.g., N18) with constant regions at both ends and overhangs that match with plasmid ends can be efficiently ligated.
We introduce DNA barcodes into plasmid libraries using a ligation strategy (Figure 3). We recommend using restriction sites with 8 base recognition sites (e.g., NotI and PacI) for this step because it will reduce the likelihood that the enzyme cuts a mutant variant. For yeast applications, we typically use a random 18 base barcode (N18) either upstream of the promoter or downstream of the 5’ UTR. This barcode includes 418 or roughly 1011 possible combinations. Even for large variant libraries (e.g., with 106 variants), only a minute fraction of possible barcodes will be utilized. Therefore, almost all variants are barcoded with sequences that differ from all other variant barcodes by more than one base. Thus, when barcodes are sequenced, most errors in sequencing result in an unused barcode that can be detected and either corrected or ignored. Of note, the diversity of the barcode also makes it unlikely that a perfect pair will be present when samples of N18 oligos are annealed. For this reason, we typically add at least 12 bases of invariant sequence at either end of each N18 oligo to facilitate stable duplex ends for ligation. In addition, we have found that purifying the vector on a silica column without gel purification provides the best results. For efficient ligation reactions, we dephosphorylate the vector prior to ligation and phosphorylate each oligo individually prior to annealing them. It is also critical to dilute the annealed oligo so that the ligation reaction contains a ratio of about 1 vector molecule for every two duplex barcode molecules. Using these optimizations, we typically obtain about 50,000 independent transformants from a standard barcode ligation reaction, and this can readily be scaled up when necessary. Individual colonies from the N18 ligation transformation typically have a mixture of two barcodes such that the barcode diversity is twice the number of transformants. This does not have a large impact on downstream steps, but can be confusing when first encountered. For barcoding mammalian genes, we have used an intron barcoding strategy 5. The main difference of the intron barcode from N18 is that it avoids recognition motifs involved in mRNA splicing. As with the N18 barcode, the intron barcode can also be introduced into plasmid libraries using ligation.
While many mutational libraries will contain copies of the wildtype (WT) gene because of the manner in which they are built, we have found it useful to add a small fraction of additional WT molecules. We do this by ligating barcodes into a plasmid containing the WT gene and isolating plasmid DNA from about 100 colonies scraped from a plate of bacterial transformants. We then identify the barcodes associated with the WT sample by sequencing the barcodes as a small fraction of an Illumina sequencing lane.
Multiple strategies have been used to efficiently introduce variant libraries into the genomes of mammalian cells. Of note, genome integration is critical for reproducible results in mammalian cells as introducing libraries into random locations using transduction strategies24 can lead to variable translation efficiencies with dramatic impacts on phenotypes including growth and drug resistance. Genomic integration efficiencies of 8% have been achieved using either an optimized CRISPR/Cas9 strategy5 or a Bxb1 recombinase strategy30. For studies of resistance in cancer, these strategies provide tools that can readily interrogate comprehensive fitness landscapes for most oncogenes.
Next-generation sequencing and data processing
The use of barcodes to track variant frequencies provides multiple benefits, but require an additional sequencing step to associate barcodes with mutant variants. In addition to reducing the impact of misreads on estimates of variant frequency that was discussed previously, barcoding can reduce sequencing costs when analyzing variant frequency. Because barcodes are typically 18–20 bases, they can be read using short reads that are cost effective, especially when fitness landscapes will be quantified under multiple conditions (e.g., different drug environments). Of note, with Illumina sequencing, the length of the DNA analyzed by sequencing influences the diameter of the bridge amplicons on the chip. Larger DNA generates larger diameter amplicon clusters and fewer reads per lane. It is necessary to have a sufficiently large DNA that it can form bridges and we typically use DNA that is about 270 base pairs in length for optimal Illumina sequencing. As discussed further below, reading barcodes of 18–20 bases enables highly efficient Illumina sequencing reactions for the quantification of variant frequencies.
For the association of barcodes with mutant variants (Figure 4), we recommend using circular consensus sequencing (CCS) available on PacBio instruments. By performing multiple reads of single molecules of circular DNA, this approach provides both long read lengths (>10 kb) and outstanding accuracy (>99% for each base call). Of note, we have found that PCR of large (>1 kb) fragments of DNA containing point mutant variants can lead to up to 25% of molecules recombining. For this reason, we typically generate samples for PacBio sequencing by cutting with restriction enzymes close to the ends of the barcode and location of mutations, blunting the ends, and sending the sample for purification on a BluePippin instrument (Sage Biosciences), followed by circularization, and PacBio sequencing. It is important not to expose the DNA to UV light or ethidium bromide during the purification steps as these will damage the DNA and impair PacBio sequencing. The resulting CCS files of libraries with up to 100,000 barcode variants typically contain multiple independent reads of each barcode with identical mutations associated for almost all (>99%) barcode containing reads.
Figure 4.

Outline of sequencing to associate barcodes with mutations and assess the frequency of barcodes. (A) Barcodes can be associated with mutations using PacBio circular consensus sequencing that provides long reads and high accuracy. Linear DNA containing the open reading frame and barcode need to be prepared. The addition of proprietary hairpin adapters and sequencing can be performed by commercial sources. (B) The frequency of each barcode in multiple samples can be assessed in the same lane of Illumina sequencing by using different index sequences for each sample. Index sequences are encoded on primers used to amplify the barcodes. All index sequences should differ from one another by two or more bases to reduce the likelihood of misreads leading to mistakes in assigning a read to the correct sample. The primer used to read the index in Illumina sequencing is called i7.
We process CCS data using custom analysis scripts that are available upon request. Our analysis pipeline includes organizing the data by barcode, and then for each barcode detailing the mutations associated with each read. For barcodes with two or more reads, we typically find complete agreement between the mutations in each read. In these cases, we accept the barcode and add it to a file listing the barcode and the associated mutations. This file is then used as a lookup table when we quantify barcode frequencies in samples from selection experiments.
In our experience, Illumina sequencing currently provides the most effective tool for analyzing barcode frequencies in samples from selection experiments (Figure 4). For accurate estimates of barcode frequency, we aim for an average of more than 100 reads of the average barcode. For an experiment with 100,000 barcodes, we would aim for 107 reads for each condition (e.g., drug condition). The throughput of Illumina sequencing (~4×108 reads per lane) enables us to analyze multiple conditions (up to 40 in this example) in a single lane. To distinguish samples in the sequencing sample, we add index sequences using a PCR strategy (Figure 4). We read the index sequence using the illumine index sequencing primer. Of note, because Illumina sequencing reads one base at a time from clonal clusters, it can be flooded if the same base is present at the same position in all clusters (especially when reading the initial bases used in defining cluster positions). To avoid sequencing failures, we blend samples from different genes in order to generate sequence diversity at all positions and/or blend in DNA with a random sequence (e.g. sheered phi-X DNA) at 25%.
We analyze Illumina barcode reads using custom scripts that are available upon request. Our analysis pipeline includes steps to organize the data by barcode and index sequence, tabulate reads of each barcode with each identifier, lookup the associated mutations with each barcode, and output the sequencing observations of each mutant variant under each condition. Fitness effects are estimated by comparing the frequency of each variant in an unselected control compared to its frequency under selection. Fitness effects are often normalized to WT and stops in order to facilitate comparison between different experiments and conditions.
Comparison of selection on the change in mutant variant frequency to WT can be used to identify mutations with functional impacts. The individual barcodes of WT provide an internal estimate of measurement variation. Multiple statistical approaches can be applied to assess variants that are significantly distinct from the WT distribution. One of the simplest approaches is to use twice the standard deviation to define a 95% confidence interval for WT fitness effects. Corrections can be made for multiple tests (e.g., Bonferroni corrections) and where experimental replicates are available variations in both WT and each mutant variant can be included in the statistical analyses.
Assessing evolutionary potential
Evolutionary potential is determined by both selection and mutation. While fitness landscapes described in this work are determined by selection only, evolutionary potential can be estimated by including a model of mutation. For example, in BCR-ABL1, we observed that drug resistant amino acid changes requiring more than one nucleotide change were rarely if ever observed in patients5. This is consistent with the known prevalence of single-base mutations in mammalian cancer cells. However, incorporating a simple mutational model provided predictions of evolutionary potential that closely matched clinical observations5. The availability of genome sequencing of parents and offspring for many organisms provides accurate mutational models that can be readily utilized to assess evolutionary potential from fitness landscapes.
Interpreting functional effects in the light of protein structure provides opportunities to understand mechanism and the biophysical principles that underly evolutionary potential. For example, structural modeling can indicate if drug resistant mutations act by directly interfering with inhibitor binding, or through longer-range allosteric or conformational changes. Also, identifying the set of mutations that are compatible with function in the absence of inhibitors provides the potential to structurally define target sites for drugs that can only be disrupted by mutations that also disturb function. Structural analyses of fitness landscapes can identify additional pockets beyond the active site that are critical for function (e.g., allosteric sites) that may be targeted by inhibitors.
Summary and conclusions
Drug resistance continues to increase in prevalence and new approaches to understand and combat it are critical for human health. The technology for measuring fitness landscapes can be widely applied to drug resistance and has the potential to aid in the development of next generation therapies with reduced potential for resistance. Given the state of resistance and fitness landscape technology, this is a critical time for collaborative efforts between scientists in academics, biotech, and pharmaceuticals.
Acknowledgements
This work was supported by grants R01GM112844 from the National Institutes of Health to D.N.A.B.
References
- (1).Deng Z; Huang W; Bakkalbasi E; Brown NG; Adamski CJ; Rice K; Muzny D; Gibbs RA; Palzkill T Deep sequencing of systematic combinatorial libraries reveals beta-lactamase sequence constraints at high resolution. J Mol Biol 2012, 424 (3–4), 150–167. DOI: 10.1016/j.jmb.2012.09.014. [DOI] [PMC free article] [PubMed] [Google Scholar]; Firnberg E; Labonte JW; Gray JJ; Ostermeier M A comprehensive, high-resolution map of a gene’s fitness landscape. Mol Biol Evol 2014, 31 (6), 1581–1592. DOI: 10.1093/molbev/msu081. [DOI] [PMC free article] [PubMed] [Google Scholar]; Jacquier H; Birgy A; Le Nagard H; Mechulam Y; Schmitt E; Glodt J; Bercot B; Petit E; Poulain J; Barnaud G; et al. Capturing the mutational landscape of the beta-lactamase TEM-1. Proc Natl Acad Sci U S A 2013, 110 (32), 13067–13072. DOI: 10.1073/pnas.1215206110. [DOI] [PMC free article] [PubMed] [Google Scholar]; Stiffler MA; Hekstra DR; Ranganathan R Evolvability as a function of purifying selection in TEM-1 beta-lactamase. Cell 2015, 160 (5), 882–892. DOI: 10.1016/j.cell.2015.01.035. [DOI] [PubMed] [Google Scholar]
- (2).Pines G; Oh EJ; Bassalo MC; Choudhury A; Garst AD; Fankhauser RG; Eckert CA; Gill RT Genomic Deoxyxylulose Phosphate Reductoisomerase (DXR) Mutations Conferring Resistance to the Antimalarial Drug Fosmidomycin in E. coli. ACS Synth Biol 2018, 7 (12), 2824–2832. DOI: 10.1021/acssynbio.8b00219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (3).Choudhury A; Fenster JA; Fankhauser RG; Kaar JL; Tenaillon O; Gill RT CRISPR/Cas9 recombineering-mediated deep mutational scanning of essential genes in Escherichia coli. Mol Syst Biol 2020, 16 (3), e9265. DOI: 10.15252/msb.20199265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (4).Choi J; Landrette SF; Wang T; Evans P; Bacchiocchi A; Bjornson R; Cheng E; Stiegler AL; Gathiaka S; Acevedo O; et al. Identification of PLX4032-resistance mechanisms and implications for novel RAF inhibitors. Pigment Cell Melanoma Res 2014, 27 (2), 253–262. DOI: 10.1111/pcmr.12197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (5).Ma L; Boucher JI; Paulsen J; Matuszewski S; Eide CA; Ou J; Eickelberg G; Press RD; Zhu LJ; Druker BJ; et al. CRISPR-Cas9-mediated saturated mutagenesis screen predicts clinical drug resistance with improved accuracy. Proc Natl Acad Sci U S A 2017, 114 (44), 11751–11756. DOI: 10.1073/pnas.1708268114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (6).Brenan L; Andreev A; Cohen O; Pantel S; Kamburov A; Cacchiarelli D; Persky NS; Zhu C; Bagul M; Goetz EM; et al. Phenotypic Characterization of a Comprehensive Set of MAPK1/ERK2 Missense Mutants. Cell Rep 2016, 17 (4), 1171–1183. DOI: 10.1016/j.celrep.2016.09.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Dingens AS; Arenz D; Overbaugh J; Bloom JD Massively Parallel Profiling of HIV-1 Resistance to the Fusion Inhibitor Enfuvirtide. Viruses 2019, 11 (5). DOI: 10.3390/v11050439. [DOI] [PMC free article] [PubMed] [Google Scholar]; Qi H; Olson CA; Wu NC; Ke R; Loverdo C; Chu V; Truong S; Remenyi R; Chen Z; Du Y; et al. A quantitative high-resolution genetic profile rapidly identifies sequence determinants of hepatitis C viral fitness and drug sensitivity. PLoS Pathog 2014, 10 (4), e1004064. DOI: 10.1371/journal.ppat.1004064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Soh YQS; Malone KD; Eguia RT; Bloom JD Comprehensive Profiling of Mutations to Influenza Virus PB2 That Confer Resistance to the Cap-Binding Inhibitor Pimodivir. Viruses 2021, 13 (7). DOI: 10.3390/v13071196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (9).Melamed D; Young DL; Gamble CE; Miller CR; Fields S Deep mutational scanning of an RRM domain of the Saccharomyces cerevisiae poly(A)-binding protein. RNA 2013, 19 (11), 1537–1551. DOI: 10.1261/rna.040709.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (10).Fowler DM; Araya CL; Fleishman SJ; Kellogg EH; Stephany JJ; Baker D; Fields S High-resolution mapping of protein sequence-function relationships. Nat Methods 2010, 7 (9), 741–746. DOI: 10.1038/nmeth.1492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Hietpas RT; Jensen JD; Bolon DN Experimental illumination of a fitness landscape. Proc Natl Acad Sci U S A 2011, 108 (19), 7896–7901. DOI: 10.1073/pnas.1016024108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Canale AS; Cote-Hammarlof PA; Flynn JM; Bolon DN Evolutionary mechanisms studied through protein fitness landscapes. Curr Opin Struct Biol 2018, 48, 141–148. DOI: 10.1016/j.sbi.2018.01.001. [DOI] [PubMed] [Google Scholar]
- (13).Hietpas R; Roscoe B; Jiang L; Bolon DN Fitness analyses of all possible point mutations for regions of genes in yeast. Nat Protoc 2012, 7 (7), 1382–1396. DOI: 10.1038/nprot.2012.069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (14).Mavor D; Barlow K; Thompson S; Barad BA; Bonny AR; Cario CL; Gaskins G; Liu Z; Deming L; Axen SD; et al. Determination of ubiquitin fitness landscapes under different chemical stresses in a classroom setting. Elife 2016, 5. DOI: 10.7554/eLife.15802. [DOI] [PMC free article] [PubMed] [Google Scholar]; Hiatt JB; Patwardhan RP; Turner EH; Lee C; Shendure J Parallel, tag-directed assembly of locally derived short sequence reads. Nat Methods 2010, 7 (2), 119–122. DOI: 10.1038/nmeth.1416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (15).Mighell TL; Evans-Dutson S; O’Roak BJ A Saturation Mutagenesis Approach to Understanding PTEN Lipid Phosphatase Activity and Genotype-Phenotype Relationships. Am J Hum Genet 2018, 102 (5), 943–955. DOI: 10.1016/j.ajhg.2018.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (16).Reich LL; Dutta S; Keating AE SORTCERY-A High-Throughput Method to Affinity Rank Peptide Ligands. J Mol Biol 2015, 427 (11), 2135–2150. DOI: 10.1016/j.jmb.2014.09.025. [DOI] [PMC free article] [PubMed] [Google Scholar]; Starr TN; Greaney AJ; Hilton SK; Ellis D; Crawford KHD; Dingens AS; Navarro MJ; Bowen JE; Tortorici MA; Walls AC; et al. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell 2020, 182 (5), 1295–1310 e1220. DOI: 10.1016/j.cell.2020.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (17).Roscoe BP; Bolon DN Systematic exploration of ubiquitin sequence, E1 activation efficiency, and experimental fitness in yeast. J Mol Biol 2014, 426 (15), 2854–2870. DOI: 10.1016/j.jmb.2014.05.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (18).Starita LM; Young DL; Islam M; Kitzman JO; Gullingsrud J; Hause RJ; Fowler DM; Parvin JD; Shendure J; Fields S Massively Parallel Functional Analysis of BRCA1 RING Domain Variants. Genetics 2015, 200 (2), 413–422. DOI: 10.1534/genetics.115.175802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Romero PA; Tran TM; Abate AR Dissecting enzyme function with microfluidic-based deep mutational scanning. Proc Natl Acad Sci U S A 2015, 112 (23), 7159–7164. DOI: 10.1073/pnas.1422285112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Matreyek KA; Starita LM; Stephany JJ; Martin B; Chiasson MA; Gray VE; Kircher M; Khechaduri A; Dines JN; Hause RJ; et al. Multiplex assessment of protein variant abundance by massively parallel sequencing. Nat Genet 2018, 50 (6), 874–882. DOI: 10.1038/s41588-018-0122-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (21).Mehlhoff JD; Stearns FW; Rohm D; Wang B; Tsou EY; Dutta N; Hsiao MH; Gonzalez CE; Rubin AF; Ostermeier M Collateral fitness effects of mutations. Proc Natl Acad Sci U S A 2020, 117 (21), 11597–11607. DOI: 10.1073/pnas.1918680117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (22).Flynn JM; Rossouw A; Cote-Hammarlof P; Fragata I; Mavor D; Hollins C 3rd; Bank C; Bolon DN Comprehensive fitness maps of Hsp90 show widespread environmental dependence. Elife 2020, 9. DOI: 10.7554/eLife.53810. [DOI] [PMC free article] [PubMed] [Google Scholar]; Roscoe BP; Thayer KM; Zeldovich KB; Fushman D; Bolon DN Analyses of the effects of all ubiquitin point mutants on yeast growth rate. J Mol Biol 2013, 425 (8), 1363–1377. DOI: 10.1016/j.jmb.2013.01.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (23).Canale AS; Venev SV; Whitfield TW; Caffrey DR; Marasco WA; Schiffer CA; Kowalik TF; Jensen JD; Finberg RW; Zeldovich KB; et al. Synonymous Mutations at the Beginning of the Influenza A Virus Hemagglutinin Gene Impact Experimental Fitness. J Mol Biol 2018, 430 (8), 1098–1115. DOI: 10.1016/j.jmb.2018.02.009. [DOI] [PubMed] [Google Scholar]; Jiang L; Liu P; Bank C; Renzette N; Prachanronarong K; Yilmaz LS; Caffrey DR; Zeldovich KB; Schiffer CA; Kowalik TF; et al. A Balance between Inhibitor Binding and Substrate Processing Confers Influenza Drug Resistance. J Mol Biol 2016, 428 (3), 538–553. DOI: 10.1016/j.jmb.2015.11.027. [DOI] [PubMed] [Google Scholar]; Duenas-Decamp M; Jiang L; Bolon D; Clapham PR Saturation Mutagenesis of the HIV-1 Envelope CD4 Binding Loop Reveals Residues Controlling Distinct Trimer Conformations. PLoS Pathog 2016, 12 (11), e1005988. DOI: 10.1371/journal.ppat.1005988. [DOI] [PMC free article] [PubMed] [Google Scholar]; Doud MB; Bloom JD Accurate Measurement of the Effects of All Amino-Acid Mutations on Influenza Hemagglutinin. Viruses 2016, 8 (6). DOI: 10.3390/v8060155. [DOI] [PMC free article] [PubMed] [Google Scholar]; Haddox HK; Dingens AS; Bloom JD Experimental Estimation of the Effects of All Amino-Acid Mutations to HIV’s Envelope Protein on Viral Replication in Cell Culture. PLoS Pathog 2016, 12 (12), e1006114. DOI: 10.1371/journal.ppat.1006114. [DOI] [PMC free article] [PubMed] [Google Scholar]; Thyagarajan B; Bloom JD The inherent mutational tolerance and antigenic evolvability of influenza hemagglutinin. Elife 2014, 3. DOI: 10.7554/eLife.03300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (24).Wagenaar TR; Ma L; Roscoe B; Park SM; Bolon DN; Green MR Resistance to vemurafenib resulting from a novel mutation in the BRAFV600E kinase domain. Pigment Cell Melanoma Res 2014, 27 (1), 124–133. DOI: 10.1111/pcmr.12171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (25).Matreyek KA; Stephany JJ; Chiasson MA; Hasle N; Fowler DM An improved platform for functional assessment of large protein libraries in mammalian cells. Nucleic Acids Res 2020, 48 (1), e1. DOI: 10.1093/nar/gkz910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (26).Kacser H; Burns JA The molecular basis of dominance. Genetics 1981, 97 (3–4), 639–666. [DOI] [PMC free article] [PubMed] [Google Scholar]; Jiang L; Mishra P; Hietpas RT; Zeldovich KB; Bolon DN Latent effects of Hsp90 mutants revealed at reduced expression levels. PLoS Genet 2013, 9 (6), e1003600. DOI: 10.1371/journal.pgen.1003600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (27).Firnberg E; Ostermeier M PFunkel: efficient, expansive, user-defined mutagenesis. PLoS One 2012, 7 (12), e52031. DOI: 10.1371/journal.pone.0052031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).Bloom JD An experimentally determined evolutionary model dramatically improves phylogenetic fit. Mol Biol Evol 2014, 31 (8), 1956–1978. DOI: 10.1093/molbev/msu173. [DOI] [PMC free article] [PubMed] [Google Scholar]; Kitzman JO; Starita LM; Lo RS; Fields S; Shendure J Massively parallel single-amino-acid mutagenesis. Nat Methods 2015, 12 (3), 203–206, 204 p following 206. DOI: 10.1038/nmeth.3223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (29).Wrenbeck EE; Klesmith JR; Stapleton JA; Adeniran A; Tyo KE; Whitehead TA Plasmid-based one-pot saturation mutagenesis. Nat Methods 2016, 13 (11), 928–930. DOI: 10.1038/nmeth.4029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (30).Matreyek KA; Stephany JJ; Fowler DM A platform for functional assessment of large variant libraries in mammalian cells. Nucleic Acids Res 2017, 45 (11), e102. DOI: 10.1093/nar/gkx183. [DOI] [PMC free article] [PubMed] [Google Scholar]
