Applying multiplex assays to understand variation in pharmacogenes

Melissa Chiasson; Maitreya J Dunham; Allan E Rettie; Douglas M Fowler

doi:10.1002/cpt.1468

. Author manuscript; available in PMC: 2020 Aug 1.

Published in final edited form as: Clin Pharmacol Ther. 2019 May 30;106(2):290–294. doi: 10.1002/cpt.1468

Applying multiplex assays to understand variation in pharmacogenes

Melissa Chiasson ¹, Maitreya J Dunham ^1,⁴, Allan E Rettie ², Douglas M Fowler ^1,^3,^4,^*

PMCID: PMC6663607 NIHMSID: NIHMS1024238 PMID: 31145826

Introduction

Genome sequencing has enabled the detection of unprecedented numbers of new pharmacogene variants. But, interpreting how these variants affect pharmacogene biology and ultimately drug response is difficult. Multiplexed assays for variant effects (MAVEs) leverage high throughput DNA sequencing to assess the functional consequences of thousands of variants simultaneously. We discuss the utility of large-scale functional data in pharmacogene variant interpretation and suggest that implementing MAVEs could empower pharmacogenetics and improve patient care.

Genomes can now be sequenced with ease, but understanding the effect of the variants found therein poses a major challenge. Each uninterpreted variant represents a missed opportunity to improve patient outcomes. For example, the Clinical Pharmacogenetics Implementation Consortium (CPIC) lists 358 gene-drug pairs where variation can change drug response. For 63 of these 358 pairs, CPIC has issued guidelines regarding clinical interventions that may improve patient care. These guidelines focus on common variants (minor allele frequencies, MAF, typically >5%) whose clinical consequences are most clearly documented. However, understanding the effects of rare variants (MAF < 0.5–1%) is also essential, and this goal is far from realized.

The magnitude of the unmet need requires consideration of the totality of rare variation that will be identified as sequencing becomes more common. As of February 2019, the Genome Aggregation Database contained ~125,000 exomes and ~15,000 genomes, which included 404 rare coding single nucleotide variants in CYP2C9 alone, 212 of which were singletons. Only 55 of these variants were in the PharmVar database (accessed 2/10/19), and only about a dozen have been functionally annotated. Undoubtedly, as sequencing continues, many more CYP2C9 variants will be identified. This issue is not confined to CYP2C9: 731 novel non-synonymous variants in 12 CYP genes were discovered in the exomes of ~6,500 individuals¹. ~10% of individuals carried at least one of these potentially deleterious novel variants. These results, obtained from a handful of genes in a few individuals relative to the number that will ultimately be sequenced, illustrate that an onslaught of new and potentially important variants is coming.

The challenge of variant functional analysis

Current methods for determining the impact of pharmacogene variants fall into two categories. Biochemical assays using known substrates for drug disposition genes can reveal variant functional consequences. However, this approach is limited in scale to tens or hundreds of missense variants. Computational predictions can scale to all possible variants of a gene of interest, but are of limited value as they often produce incorrect or conflicting results. For example, the CYP2C9*3 variant, present in ~7% of Caucasians, confers ~90% loss of function according to experimental data², but is predicted computationally to be benign. To overcome the limitations of biochemical assays and computational predictions, an experimental approach to assess pharmacogene variants on a massive scale is needed.

MAVEs can characterize tens of thousands of variants simultaneously

A multiplex assay for variant effects (MAVE) measures the functional consequences of a large library of genetic variants simultaneously^3,4. MAVEs can be applied to a wide range of genetic elements including mRNA UTRs, promoters, enhancers, splice sites, and proteins. The result of a MAVE is a variant effect map that reveals the functional consequences of all possible single variants in the genetic element.

All MAVEs share the same basic design (Figure 1A, reviewed in ³^,⁴). First, a pooled library of variants is constructed either by PCR-based mutagenesis or synthesized oligo arrays programmed with mutations of interest. The library is then introduced into an experimental system, typically yeast or cultured human cells. Each cell must express a single variant to maintain the link between variant sequence and phenotype. For example, in human cells, expression of a single variant is typically achieved using lentiviral transduction or recombinase-based systems. Cells expressing the library of interest are then assayed for a phenotype of interest, like growth or reporter activation. These assays stratify variants based on their phenotypic effect. For example, in a growth assay, cells expressing wild type-like variants grow rapidly whereas cells expressing loss-of-function variants grow slowly. In a fluorescent reporter assay, wild type-like variants drive high fluorescence whereas loss-of-function variants drive low fluorescence. Cells are sorted into bins according to fluorescence. High throughput sequencing is used to measure a variant’s frequency in the assay, either before and after growth or across bins. Variant frequencies are then used to compute effect scores.

A) Overview of MAVEs. A library of variants of the genetic element of interest is created and introduced into cells. The cells are subjected to a growth- or fluorescent reporter-based assay. High-throughput sequencing is used to determine the frequency of variants before and after the assay, and variant frequencies are used to calculate functional scores. B) VAMP-seq uses a GFP fusion reporter to measure steady-state variant abundance. GFP was fused N-terminally to a library of TPMT variants; mCherry was used as a transcriptional control. This library was introduced into HEK293T cells using a serine integrase landing pad system such that only one variant is expressed per cell. Cells were sorted based on their fluorescence into four bins. High-throughput sequencing was used to determine the frequency of every variant in each bin. Frequencies were then converted to abundance scores. C) A FACS plot of WT TPMT (red) and three high-frequency variants known to be low abundance (blue): A80P, A154T, and Y240C. The library of TPMT variants and bins used for sorting are shown (gray). D) Density plot of abundance scores, with dotted blue line showing distribution of nonsense variants and red dotted line showing synonymous variants. The missense variant distribution is shaded from blue (low abundance) to red (high abundance). E) Heatmap of TPMT abundance scores shaded from blue (low abundance) to red (high abundance); gray indicates missing data. F) Abundance scores from four replicates for six new TPMT variants found in gnomAD.

MAVEs for coding and noncoding variants differ in the type of assays used. For example, noncoding MAVEs generally measure how variants affect expression, often by quantifying mRNA transcripts or using a fluorescent reporter. Coding MAVEs measure different aspects of a protein’s function. For example, reporter assays can measure specific protein properties like abundance or substrate binding using fluorescent protein tags or fluorophore-labeled antibodies. Growth-based assays measure each variant’s ability to drive cell growth, either in the context of a deletion of the genomic copy of the protein or by using a metabolic reporter.

MAVEs have the power to functionally annotate variants in many, if not most, pharmacogenes. However, achieving this goal will take time and effort, requiring the implementation of existing MAVEs and the development of new assays. To illustrate these issues, we first discuss the recent application of a MAVE to thiopurine methyltransferase (TPMT) and then consider other pharmacogenes that could benefit most from MAVEs.

Analyzing TPMT abundance reveals new variants that confer thiopurine toxicity risk

TPMT inactivates thiopurine drugs commonly used to treat cancer and autoimmune diseases, including 6-thioguanine and 6-mercaptopurine (6-MP). Thus, TPMT reduces the quantity of drug available for transformation into thioguanine nucleotides, which inhibit de novo purine synthesis. During routine dosing with thiopurines, TPMT deficiency results in high levels of thioguanine nucleotides and, ultimately, hematopoietic toxicity. Three variants, A80P, A154T, and Y240C, are known to lead to decreased TPMT function. CPIC recommends testing for these three variants, enabling patients to be classified as normal, intermediate, or poor metabolizers based on diplotype, with doses adjusted accordingly.

Previously, we applied Variant Abundance by Massively Parallel sequencing (VAMP-seq), a generalizable, multiplex assay for measuring protein abundance inside cells, to TPMT⁵ (Figure 1B). We generated abundance scores for 3,689 of the 4,655 possible variants (Figure 1C, D, and E). A80P, A154T, and Y240C were all low abundance variants, in accordance with their poor metabolizer status. In contrast, four rare variants from a clinical study of acute lymphoblastic leukemia (S125L, Q179H, R215H, R226Q) were all wild type (WT)-like in abundance, and patients with these variants tolerated higher doses of 6-MP better than those with A80P, A154T, or Y240C. We then identified 31 reduced abundance variants in gnomAD, and suggested that patients with these variants could have increased risk for thiopurine toxicity. Since our publication of the TPMT variant abundance map, seven new TPMT variants have been added to gnomAD: K77E, W78R, G83V, L155S, P160A, K191E, and C216Y. VAMP-seq data indicate that K77E, W78R, G83V, L155S, and K191E are of low abundance relative to WT (Figure 1F). Accordingly, these variants might confer drug sensitivity in patients that carry them.

Thus, protein abundance is a useful phenotype for identifying loss-of-function variants. We also anticipate that measurement of protein activity will be necessary for many pharmacogenes. Fortunately, in some cases, existing low-throughput activity assays can be adapted. For example, a reporter cell line developed to measure vitamin K oxidoreductase (VKOR) activity⁶ could be combined with a variant library to assess activity of all VKOR missense variants. Since some VKOR variants confer resistance to warfarin, cells could also be treated with warfarin to reveal the relationship between activity and resistance. Ultimately, the activity and resistance scores from such an assay could be used to help predict a patient’s warfarin dose based on their VKOR sequence.

MAVEs could aid pharmacogene variant interpretation

Including TPMT, CPIC lists 127 genes that have differing levels of evidence for identification as an actionable pharmacogene. 5,132,280 possible single nucleotide variants exist amongst these genes. Assaying such a large number of variants is possible, but daunting. Thus, we suggest prioritization of the most promising pharmacogenes.

We focused solely on missense variants for this analysis; however many pharmacogenes have noncoding variants that contribute to drug response and could be assayed with an appropriately designed noncoding MAVE. First, we restricted our analysis to the 31 genes that are designated as CPIC level A or B where genetic information can be used to guide drug therapy. We annotated each gene according to the localization of the protein it encodes, length, number of missense variants already in gnomAD, and number of variants registered in PharmVar (Table 1).

Table 1.

31 genes designated by CPIC as A or B level genes, along with factors to consider when designing MAVEs.

Gene	Drug(s)	Length (AA)	Total possible single AA variants	Missense variants in gnomAD	Missense variants in PharmVar	Localization
MT-RNR1	aminoglycoside antibacterials	16	300	0	0	Secreted
NUDT15	azathioprine, mercaptopurine, thioguanine	164	3,260	83	12	Cytoplasm
IFNL3	peginterferon alfa-2a, peginterferon alfa-2b, ribavirin	196	3,900	152	0	Secreted
HPRT1	mycophenolic acid	218	4,340	20	0	Cytoplasm
TPMT	azathioprine, mercaptopurine, thioguanine	245	4,880	119	0	Cytoplasm
OTC	valproic acid	354	7,060	88	0	Mitochondrion matrix
HLA-B	abacavir, allopurinol, carbamazepine, oxcarbazepine	362	7,220	180	0	Membrane; Single-pass type I membrane protein
HLA-A	carbamazepine, allopurinol	365	7,280	194	0	Membrane; Single-pass type I membrane protein
ASS1	valproic acid	412	8,220	211	0	Cytoplasm
ASL	valproic acid	464	9,260	242	0	Cytoplasm, extracellular exosome
CYP2C9	phenytoin, warfarin, acenocoumarol	490	9,780	381	55	Endoplasmic reticulum membrane, peripheral membrane
CYP2C19	amitriptyline, clopidogrel, citalopram, voriconazole	490	9,780	375	5	Endoplasmic reticulum membrane, peripheral membrane
CYP2B6	efavirenz, methadone	491	9,800	331	0	Endoplasmic reticulum membrane, peripheral membrane
CYP2D6	codeine, oxycodone, tamoxifen, tramadol	497	9,920	374	30	Endoplasmic reticulum membrane, peripheral membrane
CYP3A5	tacrolimus	502	10,020	215	11	Endoplasmic reticulum membrane, peripheral membrane
G6PD	rasburicase, chloramphenicol, chloroquine, ciprofloxacin	515	10,280	171	0	Cytoplasm, extracellular exosome, nucleus
CYP4F2	warfarin, acenocoumarol	520	10,380	344	2	Endoplasmic reticulum membrane, peripheral membrane
UGT1A1	atazanavir, irinotecan, belinostat	533	10,640	308	0	Endoplasmic reticulum membrane; Single-pass membrane protein
NAGS	carglumic acid	534	10,660	216	0	Mitochondrion matrix
GBA	velaglucerase alfa	536	10,700	247	0	Lysosome membrane, peripheral membrane protein
SLCO1B1	simvastatin, cerivastatin	691	13,800	399	0	Basolateral cell membrane, Multi-pass membrane protein
DPYD	capecitabine, fluorouracil	1,025	20,480	566	0	Cytoplasm
ABL2	valproic acid	1,182	23,620	509	0	Cytoplasm, cytoskeleton
POLG	valproic acid	1,239	24,760	762	0	Mitochondrion, mitochondrion matrix, mitochondrion nucleoid
ABCB1	antidepressants, digoxin	1,280	25,580	578	0	Cell membrane, multi-pass membrane protein
CFTR	ivacaftor	1,480	29,580	991	0	Apical cell membrane
CPS1	valproic acid	1,500	29,980	679	0	Mitochondrion, nucleus, nucleolus
CACNA1S	desflurane, enflurane, isoflurane, halothane	1,873	37,440	1,071	0	Cell membrane, sarcolemma, T-tubule, multi-pass membrane protein
SCN1A	carbamazepine	2,009	40,160	587	0	Cell membrane, multi-pass membrane protein
RYR1	desflurane, enflurane, isoflurane, halothane	5,038	100,740	2,663	0	Sarcoplasmic reticulum membrane, multi-pass membrane protein

Open in a new tab

Among this list, small proteins should be given high priority, since they have fewer possible variants and are thus easier to assay. Larger proteins affecting dosing of multiple, widely-prescribed drugs should also be prioritized, as they impact many patients. For these, we suggest focusing initial efforts on functionally important domains. All the genes have tens to thousands of variants deposited in gnomAD; however, most genes do not have any variants deposited yet in PharmVar. Therefore, concentrating on the genes that have the greatest number of rare variants in gnomAD, but no information in PharmVar, would yield new insight. Two pharmacogenes, IFN3 and MT-RNR1, encode secreted proteins requiring new assays that maintain the sequence-phenotype link. In addition to these factors, analyzing published CRISPR screen data will identify which of these genes cause growth defects in a relevant cell line; growth-based MAVEs would be an attractive starting point for these. For the remainder, we suggest applying reporter-based assays such as VAMP-seq.

Despite their promise, MAVEs also have limitations. MAVEs often take the genetic element of interest out of its endogenous genomic or cellular context and thus demand careful validation of results. Data generated from MAVEs, while comprehensive, can be noisy. Thus, adequate replication is required to improve measurement accuracy and facilitate error estimation. Finally, MAVEs generally focus on one or a few experimental conditions and so may not fully capture condition-dependent effects. For pharmacogenes, therefore, it will be critical to evaluate variants in physiologically relevant concentration, time and drug contexts.

In summary, a community-wide effort to apply MAVEs to high-priority pharmacogenes would result in variant effect maps that could aid in the interpretation of variants seen in the clinic. As pharmacogene variant effect maps are produced, they will yield a better understanding of pharmacogene biology and create opportunities for more rigorous, data-driven customization of patient treatment.

Acknowledgments

Funding: This work was supported by the National Institute of General Medical Sciences (5R24GM115277 to D.M.F., A.R., and M.D., P01 GM116691 for A.R.). D.M.F. is a Canadian Institute for Advanced Research Azrieli Global Scholar. M.D. is a Senior Fellow in the Genetic Networks program at the Canadian Institute for Advanced Research. M.D. is supported in part by a Faculty Scholars grant from the Howard Hughes Medical Institute.

Footnotes

Conflict of Interest: The authors declared no competing interests for this work.

References

1.Gordon AS et al. Quantifying rare, deleterious variation in 12 human cytochrome P450 drug-metabolism genes in a large-scale exome dataset. Hum. Mol. Genet 23, 1957–1963 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Rettie AE & Jones JP Clinical and toxicological relevance of CYP2C9: drug-drug interactions and pharmacogenetics. Annu. Rev. Pharmacol. Toxicol 45, 477–494 (2005). [DOI] [PubMed] [Google Scholar]
3.Starita LM et al. Variant Interpretation: Functional Assays to the Rescue. Am. J. Hum. Genet 101, 315–325 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Weile J & Roth FP Multiplexed assays of variant effects contribute to a growing genotype-phenotype atlas. Hum. Genet 137, 665–678 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Matreyek KA et al. Multiplex assessment of protein variant abundance by massively parallel sequencing. Nat. Genet 50, 874–882 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Haque JA, McDonald MG, Kulman JD & Rettie AE A cellular system for quantitation of vitamin K cycle activity: structure-activity effects on vitamin K antagonism by warfarin metabolites. Blood 123, 582–589 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] 1.Gordon AS et al. Quantifying rare, deleterious variation in 12 human cytochrome P450 drug-metabolism genes in a large-scale exome dataset. Hum. Mol. Genet 23, 1957–1963 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Rettie AE & Jones JP Clinical and toxicological relevance of CYP2C9: drug-drug interactions and pharmacogenetics. Annu. Rev. Pharmacol. Toxicol 45, 477–494 (2005). [DOI] [PubMed] [Google Scholar]

[R3] 3.Starita LM et al. Variant Interpretation: Functional Assays to the Rescue. Am. J. Hum. Genet 101, 315–325 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Weile J & Roth FP Multiplexed assays of variant effects contribute to a growing genotype-phenotype atlas. Hum. Genet 137, 665–678 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Matreyek KA et al. Multiplex assessment of protein variant abundance by massively parallel sequencing. Nat. Genet 50, 874–882 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Haque JA, McDonald MG, Kulman JD & Rettie AE A cellular system for quantitation of vitamin K cycle activity: structure-activity effects on vitamin K antagonism by warfarin metabolites. Blood 123, 582–589 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Applying multiplex assays to understand variation in pharmacogenes

Melissa Chiasson

Maitreya J Dunham

Allan E Rettie

Douglas M Fowler

Introduction

The challenge of variant functional analysis

MAVEs can characterize tens of thousands of variants simultaneously

Figure 1.

Analyzing TPMT abundance reveals new variants that confer thiopurine toxicity risk

MAVEs could aid pharmacogene variant interpretation

Table 1.

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Applying multiplex assays to understand variation in pharmacogenes

Melissa Chiasson

Maitreya J Dunham

Allan E Rettie

Douglas M Fowler

Introduction

The challenge of variant functional analysis

MAVEs can characterize tens of thousands of variants simultaneously

Figure 1.

Analyzing TPMT abundance reveals new variants that confer thiopurine toxicity risk

MAVEs could aid pharmacogene variant interpretation

Table 1.

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases