Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Aug 1.
Published in final edited form as: Clin Pharmacol Ther. 2019 May 30;106(2):290–294. doi: 10.1002/cpt.1468

Applying multiplex assays to understand variation in pharmacogenes

Melissa Chiasson 1, Maitreya J Dunham 1,4, Allan E Rettie 2, Douglas M Fowler 1,3,4,*
PMCID: PMC6663607  NIHMSID: NIHMS1024238  PMID: 31145826

Introduction

Genome sequencing has enabled the detection of unprecedented numbers of new pharmacogene variants. But, interpreting how these variants affect pharmacogene biology and ultimately drug response is difficult. Multiplexed assays for variant effects (MAVEs) leverage high throughput DNA sequencing to assess the functional consequences of thousands of variants simultaneously. We discuss the utility of large-scale functional data in pharmacogene variant interpretation and suggest that implementing MAVEs could empower pharmacogenetics and improve patient care.

Genomes can now be sequenced with ease, but understanding the effect of the variants found therein poses a major challenge. Each uninterpreted variant represents a missed opportunity to improve patient outcomes. For example, the Clinical Pharmacogenetics Implementation Consortium (CPIC) lists 358 gene-drug pairs where variation can change drug response. For 63 of these 358 pairs, CPIC has issued guidelines regarding clinical interventions that may improve patient care. These guidelines focus on common variants (minor allele frequencies, MAF, typically >5%) whose clinical consequences are most clearly documented. However, understanding the effects of rare variants (MAF < 0.5–1%) is also essential, and this goal is far from realized.

The magnitude of the unmet need requires consideration of the totality of rare variation that will be identified as sequencing becomes more common. As of February 2019, the Genome Aggregation Database contained ~125,000 exomes and ~15,000 genomes, which included 404 rare coding single nucleotide variants in CYP2C9 alone, 212 of which were singletons. Only 55 of these variants were in the PharmVar database (accessed 2/10/19), and only about a dozen have been functionally annotated. Undoubtedly, as sequencing continues, many more CYP2C9 variants will be identified. This issue is not confined to CYP2C9: 731 novel non-synonymous variants in 12 CYP genes were discovered in the exomes of ~6,500 individuals1. ~10% of individuals carried at least one of these potentially deleterious novel variants. These results, obtained from a handful of genes in a few individuals relative to the number that will ultimately be sequenced, illustrate that an onslaught of new and potentially important variants is coming.

The challenge of variant functional analysis

Current methods for determining the impact of pharmacogene variants fall into two categories. Biochemical assays using known substrates for drug disposition genes can reveal variant functional consequences. However, this approach is limited in scale to tens or hundreds of missense variants. Computational predictions can scale to all possible variants of a gene of interest, but are of limited value as they often produce incorrect or conflicting results. For example, the CYP2C9*3 variant, present in ~7% of Caucasians, confers ~90% loss of function according to experimental data2, but is predicted computationally to be benign. To overcome the limitations of biochemical assays and computational predictions, an experimental approach to assess pharmacogene variants on a massive scale is needed.

MAVEs can characterize tens of thousands of variants simultaneously

A multiplex assay for variant effects (MAVE) measures the functional consequences of a large library of genetic variants simultaneously3,4. MAVEs can be applied to a wide range of genetic elements including mRNA UTRs, promoters, enhancers, splice sites, and proteins. The result of a MAVE is a variant effect map that reveals the functional consequences of all possible single variants in the genetic element.

All MAVEs share the same basic design (Figure 1A, reviewed in 3,4). First, a pooled library of variants is constructed either by PCR-based mutagenesis or synthesized oligo arrays programmed with mutations of interest. The library is then introduced into an experimental system, typically yeast or cultured human cells. Each cell must express a single variant to maintain the link between variant sequence and phenotype. For example, in human cells, expression of a single variant is typically achieved using lentiviral transduction or recombinase-based systems. Cells expressing the library of interest are then assayed for a phenotype of interest, like growth or reporter activation. These assays stratify variants based on their phenotypic effect. For example, in a growth assay, cells expressing wild type-like variants grow rapidly whereas cells expressing loss-of-function variants grow slowly. In a fluorescent reporter assay, wild type-like variants drive high fluorescence whereas loss-of-function variants drive low fluorescence. Cells are sorted into bins according to fluorescence. High throughput sequencing is used to measure a variant’s frequency in the assay, either before and after growth or across bins. Variant frequencies are then used to compute effect scores.

Figure 1.

Figure 1

A) Overview of MAVEs. A library of variants of the genetic element of interest is created and introduced into cells. The cells are subjected to a growth- or fluorescent reporter-based assay. High-throughput sequencing is used to determine the frequency of variants before and after the assay, and variant frequencies are used to calculate functional scores. B) VAMP-seq uses a GFP fusion reporter to measure steady-state variant abundance. GFP was fused N-terminally to a library of TPMT variants; mCherry was used as a transcriptional control. This library was introduced into HEK293T cells using a serine integrase landing pad system such that only one variant is expressed per cell. Cells were sorted based on their fluorescence into four bins. High-throughput sequencing was used to determine the frequency of every variant in each bin. Frequencies were then converted to abundance scores. C) A FACS plot of WT TPMT (red) and three high-frequency variants known to be low abundance (blue): A80P, A154T, and Y240C. The library of TPMT variants and bins used for sorting are shown (gray). D) Density plot of abundance scores, with dotted blue line showing distribution of nonsense variants and red dotted line showing synonymous variants. The missense variant distribution is shaded from blue (low abundance) to red (high abundance). E) Heatmap of TPMT abundance scores shaded from blue (low abundance) to red (high abundance); gray indicates missing data. F) Abundance scores from four replicates for six new TPMT variants found in gnomAD.

MAVEs for coding and noncoding variants differ in the type of assays used. For example, noncoding MAVEs generally measure how variants affect expression, often by quantifying mRNA transcripts or using a fluorescent reporter. Coding MAVEs measure different aspects of a protein’s function. For example, reporter assays can measure specific protein properties like abundance or substrate binding using fluorescent protein tags or fluorophore-labeled antibodies. Growth-based assays measure each variant’s ability to drive cell growth, either in the context of a deletion of the genomic copy of the protein or by using a metabolic reporter.

MAVEs have the power to functionally annotate variants in many, if not most, pharmacogenes. However, achieving this goal will take time and effort, requiring the implementation of existing MAVEs and the development of new assays. To illustrate these issues, we first discuss the recent application of a MAVE to thiopurine methyltransferase (TPMT) and then consider other pharmacogenes that could benefit most from MAVEs.

Analyzing TPMT abundance reveals new variants that confer thiopurine toxicity risk

TPMT inactivates thiopurine drugs commonly used to treat cancer and autoimmune diseases, including 6-thioguanine and 6-mercaptopurine (6-MP). Thus, TPMT reduces the quantity of drug available for transformation into thioguanine nucleotides, which inhibit de novo purine synthesis. During routine dosing with thiopurines, TPMT deficiency results in high levels of thioguanine nucleotides and, ultimately, hematopoietic toxicity. Three variants, A80P, A154T, and Y240C, are known to lead to decreased TPMT function. CPIC recommends testing for these three variants, enabling patients to be classified as normal, intermediate, or poor metabolizers based on diplotype, with doses adjusted accordingly.

Previously, we applied Variant Abundance by Massively Parallel sequencing (VAMP-seq), a generalizable, multiplex assay for measuring protein abundance inside cells, to TPMT5 (Figure 1B). We generated abundance scores for 3,689 of the 4,655 possible variants (Figure 1C, D, and E). A80P, A154T, and Y240C were all low abundance variants, in accordance with their poor metabolizer status. In contrast, four rare variants from a clinical study of acute lymphoblastic leukemia (S125L, Q179H, R215H, R226Q) were all wild type (WT)-like in abundance, and patients with these variants tolerated higher doses of 6-MP better than those with A80P, A154T, or Y240C. We then identified 31 reduced abundance variants in gnomAD, and suggested that patients with these variants could have increased risk for thiopurine toxicity. Since our publication of the TPMT variant abundance map, seven new TPMT variants have been added to gnomAD: K77E, W78R, G83V, L155S, P160A, K191E, and C216Y. VAMP-seq data indicate that K77E, W78R, G83V, L155S, and K191E are of low abundance relative to WT (Figure 1F). Accordingly, these variants might confer drug sensitivity in patients that carry them.

Thus, protein abundance is a useful phenotype for identifying loss-of-function variants. We also anticipate that measurement of protein activity will be necessary for many pharmacogenes. Fortunately, in some cases, existing low-throughput activity assays can be adapted. For example, a reporter cell line developed to measure vitamin K oxidoreductase (VKOR) activity6 could be combined with a variant library to assess activity of all VKOR missense variants. Since some VKOR variants confer resistance to warfarin, cells could also be treated with warfarin to reveal the relationship between activity and resistance. Ultimately, the activity and resistance scores from such an assay could be used to help predict a patient’s warfarin dose based on their VKOR sequence.

MAVEs could aid pharmacogene variant interpretation

Including TPMT, CPIC lists 127 genes that have differing levels of evidence for identification as an actionable pharmacogene. 5,132,280 possible single nucleotide variants exist amongst these genes. Assaying such a large number of variants is possible, but daunting. Thus, we suggest prioritization of the most promising pharmacogenes.

We focused solely on missense variants for this analysis; however many pharmacogenes have noncoding variants that contribute to drug response and could be assayed with an appropriately designed noncoding MAVE. First, we restricted our analysis to the 31 genes that are designated as CPIC level A or B where genetic information can be used to guide drug therapy. We annotated each gene according to the localization of the protein it encodes, length, number of missense variants already in gnomAD, and number of variants registered in PharmVar (Table 1).

Table 1.

31 genes designated by CPIC as A or B level genes, along with factors to consider when designing MAVEs.

Gene Drug(s) Length (AA) Total possible single AA variants Missense variants in gnomAD Missense variants in PharmVar Localization
MT-RNR1 aminoglycoside antibacterials 16 300 0 0 Secreted
NUDT15 azathioprine, mercaptopurine, thioguanine 164 3,260 83 12 Cytoplasm
IFNL3 peginterferon alfa-2a, peginterferon alfa-2b, ribavirin 196 3,900 152 0 Secreted
HPRT1 mycophenolic acid 218 4,340 20 0 Cytoplasm
TPMT azathioprine, mercaptopurine, thioguanine 245 4,880 119 0 Cytoplasm
OTC valproic acid 354 7,060 88 0 Mitochondrion matrix
HLA-B abacavir, allopurinol, carbamazepine, oxcarbazepine 362 7,220 180 0 Membrane; Single-pass type I membrane protein
HLA-A carbamazepine, allopurinol 365 7,280 194 0 Membrane; Single-pass type I membrane protein
ASS1 valproic acid 412 8,220 211 0 Cytoplasm
ASL valproic acid 464 9,260 242 0 Cytoplasm, extracellular exosome
CYP2C9 phenytoin, warfarin, acenocoumarol 490 9,780 381 55 Endoplasmic reticulum membrane, peripheral membrane
CYP2C19 amitriptyline, clopidogrel, citalopram, voriconazole 490 9,780 375 5 Endoplasmic reticulum membrane, peripheral membrane
CYP2B6 efavirenz, methadone 491 9,800 331 0 Endoplasmic reticulum membrane, peripheral membrane
CYP2D6 codeine, oxycodone, tamoxifen, tramadol 497 9,920 374 30 Endoplasmic reticulum membrane, peripheral membrane
CYP3A5 tacrolimus 502 10,020 215 11 Endoplasmic reticulum membrane, peripheral membrane
G6PD rasburicase, chloramphenicol, chloroquine, ciprofloxacin 515 10,280 171 0 Cytoplasm, extracellular exosome, nucleus
CYP4F2 warfarin, acenocoumarol 520 10,380 344 2 Endoplasmic reticulum membrane, peripheral membrane
UGT1A1 atazanavir, irinotecan, belinostat 533 10,640 308 0 Endoplasmic reticulum membrane; Single-pass membrane protein
NAGS carglumic acid 534 10,660 216 0 Mitochondrion matrix
GBA velaglucerase alfa 536 10,700 247 0 Lysosome membrane, peripheral membrane protein
SLCO1B1 simvastatin, cerivastatin 691 13,800 399 0 Basolateral cell membrane, Multi-pass membrane protein
DPYD capecitabine, fluorouracil 1,025 20,480 566 0 Cytoplasm
ABL2 valproic acid 1,182 23,620 509 0 Cytoplasm, cytoskeleton
POLG valproic acid 1,239 24,760 762 0 Mitochondrion, mitochondrion matrix, mitochondrion nucleoid
ABCB1 antidepressants, digoxin 1,280 25,580 578 0 Cell membrane, multi-pass membrane protein
CFTR ivacaftor 1,480 29,580 991 0 Apical cell membrane
CPS1 valproic acid 1,500 29,980 679 0 Mitochondrion, nucleus, nucleolus
CACNA1S desflurane, enflurane, isoflurane, halothane 1,873 37,440 1,071 0 Cell membrane, sarcolemma, T-tubule, multi-pass membrane protein
SCN1A carbamazepine 2,009 40,160 587 0 Cell membrane, multi-pass membrane protein
RYR1 desflurane, enflurane, isoflurane, halothane 5,038 100,740 2,663 0 Sarcoplasmic reticulum membrane, multi-pass membrane protein

Among this list, small proteins should be given high priority, since they have fewer possible variants and are thus easier to assay. Larger proteins affecting dosing of multiple, widely-prescribed drugs should also be prioritized, as they impact many patients. For these, we suggest focusing initial efforts on functionally important domains. All the genes have tens to thousands of variants deposited in gnomAD; however, most genes do not have any variants deposited yet in PharmVar. Therefore, concentrating on the genes that have the greatest number of rare variants in gnomAD, but no information in PharmVar, would yield new insight. Two pharmacogenes, IFN3 and MT-RNR1, encode secreted proteins requiring new assays that maintain the sequence-phenotype link. In addition to these factors, analyzing published CRISPR screen data will identify which of these genes cause growth defects in a relevant cell line; growth-based MAVEs would be an attractive starting point for these. For the remainder, we suggest applying reporter-based assays such as VAMP-seq.

Despite their promise, MAVEs also have limitations. MAVEs often take the genetic element of interest out of its endogenous genomic or cellular context and thus demand careful validation of results. Data generated from MAVEs, while comprehensive, can be noisy. Thus, adequate replication is required to improve measurement accuracy and facilitate error estimation. Finally, MAVEs generally focus on one or a few experimental conditions and so may not fully capture condition-dependent effects. For pharmacogenes, therefore, it will be critical to evaluate variants in physiologically relevant concentration, time and drug contexts.

In summary, a community-wide effort to apply MAVEs to high-priority pharmacogenes would result in variant effect maps that could aid in the interpretation of variants seen in the clinic. As pharmacogene variant effect maps are produced, they will yield a better understanding of pharmacogene biology and create opportunities for more rigorous, data-driven customization of patient treatment.

Acknowledgments

Funding: This work was supported by the National Institute of General Medical Sciences (5R24GM115277 to D.M.F., A.R., and M.D., P01 GM116691 for A.R.). D.M.F. is a Canadian Institute for Advanced Research Azrieli Global Scholar. M.D. is a Senior Fellow in the Genetic Networks program at the Canadian Institute for Advanced Research. M.D. is supported in part by a Faculty Scholars grant from the Howard Hughes Medical Institute.

Footnotes

Conflict of Interest: The authors declared no competing interests for this work.

References

  • 1.Gordon AS et al. Quantifying rare, deleterious variation in 12 human cytochrome P450 drug-metabolism genes in a large-scale exome dataset. Hum. Mol. Genet 23, 1957–1963 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Rettie AE & Jones JP Clinical and toxicological relevance of CYP2C9: drug-drug interactions and pharmacogenetics. Annu. Rev. Pharmacol. Toxicol 45, 477–494 (2005). [DOI] [PubMed] [Google Scholar]
  • 3.Starita LM et al. Variant Interpretation: Functional Assays to the Rescue. Am. J. Hum. Genet 101, 315–325 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Weile J & Roth FP Multiplexed assays of variant effects contribute to a growing genotype-phenotype atlas. Hum. Genet 137, 665–678 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Matreyek KA et al. Multiplex assessment of protein variant abundance by massively parallel sequencing. Nat. Genet 50, 874–882 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Haque JA, McDonald MG, Kulman JD & Rettie AE A cellular system for quantitation of vitamin K cycle activity: structure-activity effects on vitamin K antagonism by warfarin metabolites. Blood 123, 582–589 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES