Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Sep 16.
Published in final edited form as: Nat Rev Cardiol. 2025 Sep 1;23(3):149–163. doi: 10.1038/s41569-025-01201-7

Creating an atlas of variant effects to resolve variants of uncertain significance and guide cardiovascular medicine

Andrew M Glazer 1, Daniel R Tabet 2,3, Victoria N Parikh 4, Brett M Kroncke 1, Atina G Cote 2,3, Yuta Yamamoto 4, Qianru Wang 4, Ayesha Muhammad 5, Megan C Lancaster 6, Matthew J O’Neill 1,7, Jochen Weile 2,3, Tao Yang 1, Calum A Macrae 7, Euan A Ashley 4, Frederick P Roth 2,3,8, Dan M Roden 9
PMCID: PMC12434709  NIHMSID: NIHMS2109968  PMID: 40890511

Abstract

Cardiovascular diseases are leading global causes of death and disability, often presenting as interrelated phenotypes of atherosclerotic vascular disease, heart failure, and arrhythmias. They arise from interactions between environmental factors and predisposing genotypes and include relatively common Mendelian lipid disorders, cardiomyopathies, and arrhythmia syndromes. The identification of a pathogenic variant through genetic testing can inform disease diagnosis, risk prediction, treatment, and family screening. However, a major roadblock in genomic medicine is that many variants—especially missense variants—lack sufficient evidence to enable a definitive classification and are therefore deemed “variants of uncertain significance” (VUS). In this Review, we describe how multiplexed assays of variant effect (MAVEs) can enable functional assessment of nearly all coding variants in a target sequence, potentially offering evidence proactively for variants observed later in a patient. We discuss validation, including the role of in silico variant effect predictors (VEPs), and how multiplexed experimental methods are illuminating cardiovascular disease biology and ultimately resolving the VUS problem at scale.

Introduction

Major cardiovascular disease phenotypes – ischemia, heart failure, and arrhythmias – are driven by a complex mixture of common polygenic variation, rare large-effect variants, and environmental factors16. Familial hypercholesterolemia (FH) affects nearly 1% of Americans7. Channelopathies such as long QT syndrome (LQTS) and Brugada Syndrome (BrS) can cause sudden cardiac death (SCD) and predispose individuals to drug-induced arrhythmias8,912. Hypertrophic cardiomyopathy (HCM), has been revealed by sensitive imaging to affect 1 in 200 people13. Inherited dilated cardiomyopathy (DCM) is now recognized as the predominant cause of “nonischemic cardiomyopathy” (~1/250)14,15, and many of the underlying genes also contribute to atrial fibrillation susceptibility16,17. In a recent survey, 32% of 709 patients in a general cardiology clinic had actionable genetic testing results18. Accurately diagnosing cardiovascular genetic diseases has immediate implications for prevention, treatment, and cost of care. For example, FH is often underdiagnosed7, but early genetic diagnosis enables therapy that prevents stroke and early myocardial infarction19. Early diagnosis of heritable arrhythmias and cardiomyopathies can reduce catastrophic outcomes like SCD2022.

Accessible sequencing and the power of genetic diagnosis are propelling a new era of genomic medicine23,24. However, one factor constraining implementation is a limited ability to determine the pathogenicity of most newly discovered variants, which are therefore frequently classified as variants of uncertain significance (VUS). In patients with possible Mendelian cardiovascular conditions, each VUS annotation represents a failure to illuminate the genetic underpinning of disease. VUS classifications can prevent the identification of at-risk individuals, hinder family screening, or lead to unnecessary clinical surveillance in low-risk patients. As more gene-specific risks and therapies are identified for cardiovascular diseases, VUS also impede the application of potentially life-saving, guideline-driven precision care, such as early SCD prevention in LMNA cardiomyopathy or early treatment of familial hypercholesterolemia with cholesterol-lowering medication22,25,26.

The scope of the variant interpretation problem is vast. One study found that only 2% of the over 4 million missense variants in the Genome Aggregation Database (gnomAD)27 had a clinical interpretation in the NIH database ClinVar. Furthermore, the most common classification for variants with an interpretation was VUS28,29. In the eMERGE-Seq study30, sequencing of 27 cardiovascular disease genes in nearly 22,000 participants identified 6,840 unique coding variants, of which 6,095 (89%) either lacked a definitive classification in ClinVar, or were absent from the database entirely. The number of variants in ClinVar has increased significantly due to the growing use and reporting of genetic tests in clinical and population screening settings. As of January 2025, there were 35,395 missense variants in ClinVar across 50 ClinGen-validated Mendelian cardiac disease genes (Figure 1). Of these variants, 79% are VUS, 12% have conflicting interpretations, and only 5% and 3% have P/LP or B/LB classifications, respectively. Figure 1C shows missense variants in three key cardiovascular disease genes, KCNQ1, LDLR, and MYH7, in the UK Biobank and All of Us cohorts. For all three genes, over 75% of all detected variants are VUS or have conflicting annotations in ClinVar, or are absent from ClinVar. Furthermore, given the de novo mutation rate, the size of the human genome, and the global population size, nearly every missense variant compatible with life is already present in dozens of individuals in the living human population31,32. Currently, individuals with non-European ancestry have a higher rate of VUS in cardiac genes than individuals with European-ancestry33, emphasizing the need for larger non-European datasets which will enable more accurate and equitable variant classification. Thus, there is a critical need for systematic evaluation of the pathogenicity of even the rarest variants. This need was highlighted by the 2020 NIH National Human Genome Research Institute Strategic Plan, which set a goal of making the term “VUS” obsolete by 203024.

Figure 1: The Variant of Uncertain Significance problem in cardiovascular genes.

Figure 1:

A) Missense variants in ClinVar over time in 50 genes associated with Mendelian cardiovascular disorders. B) Barplot by disease class for the same set of genes shown in panel A. Truncating variants in titin (TTN) are associated with DCM and early-onset atrial fibrillation, but due to TTN’s large size and the unclear impact of TTN missense variants, TTN is listed as a separate category and not included in panel A. ClinVar variants in panel B were categorized into disease classes based on ClinGen-validated gene-disease associations, rather than based on phenotype assertions for individual variants provided in ClinVar. Additionally, in panel B, some genes are present in multiple disease classes. C) Number of missense variants in 3 key cardiovascular genes in the UK Biobank and All of Us. Variants are colored by their Clinvar classification. VUS = variant of uncertain significance, P/LP = pathogenic/likely pathogenic, B/LB = benign/likely benign, Arrhyth. = Arrhythmia, AC = arrhythmogenic cardiomyopathy, DCM = dilated cardiomyopathy, FH = familial hypertension, HCM = hypertrophic cardiomyopathy.

Multiplexed Assays of Variant Effect (MAVEs) are in vitro functional assays that can simultaneously assess the function of thousands of variants in a gene or genomic region of interest. In this Review, we summarize the growing use of MAVEs to help solve the VUS problem for cardiovascular genes. We also discuss variant effect predictors (VEPs), and how they can be integrated with MAVEs to help inform cardiovascular genomic medicine.

MAVEs: functional genomics on a massive scale

Overview of MAVEs

Functional assessments of genetic variants have traditionally focused on a few variants at a time, or occasionally dozens of variants using non-multiplexed assays like automated patch clamping3436. As more variants are discovered, the proportion of observed variants with high-quality functional data is rapidly decreasing29. Multiplexed Assays of Variant Effect (MAVEs), which can investigate the function of all possible single nucleotide variants in a target genomic region37, can address issues of both standardization and scale (Figure 2). The MAVE process (outlined in Figure 3 and discussed below) involves saturation mutagenesis of a target gene or region; expression of pools of mutants in an appropriate cell model; en masse multiplexed selection for cellular phenotypes that reflect variant effects on relevant protein function(s) (Figure 4); next-generation sequencing to compare variant abundance in pre- and post-selection pools; and data analysis and display (Figure 5)29,32,37,38. In the following paragraphs, we summarize key aspects of MAVEs (e.g., mutagenesis approach, cell or model organism type, library delivery method, and assay type). Examples of published published and potential cardiovascular MAVE experiments are presented in Table 1.

Figure 2. Overview of lead candidate genes (left) and assay methods (right) for cardiovascular Multiplexed Assays of Variant Effects (MAVEs).

Figure 2.

As described further in the text and Figures 35, some assay methods can be implemented across multiple genes and disease domains, while in other cases, gene-specific (“bespoke”) assays are needed.

Figure 3: Multiplexed Assay of Variant Effect approach.

Figure 3:

Pools of cells each expressing a single variant are generated using gene editing (top left) or saturation mutagenesis in plasmids (bottom left) which are then integrated into cells for study. The cell pool is then challenged as described in the text (drug challenge, assay of cell surface expression, etc), variant abundance is assayed pre- and post-challenge (e.g., Figure 4), and the data are displayed as a variant effect map (e.g., Figure 5).

Figure 4: Examples of MAVE assays.

Figure 4:

A. Cell surface expression assay. A pool of cells is sorted as a function of cell surface expression of the protein of interest. The abundance of each variant across bins of expression is assayed to establish the cell surface expression phenotype. B. Functional assay. In this example, the pool of cells is grown under conditions favoring survival of loss of function variants, as described further in the text. Comparing survival at baseline and after a period of growth is then used as the functional readout.

Figure 5: Variant effect map for KCNE1, a 129 amino acid potassium channel subunit.

Figure 5:

A. ClinVar annotations. The amino acid position is shown on the x-axis and the amino acid substitutions on the y-axis. The dots indicate the wild-type amino acid at each position. Pathogenic or likely pathogenic (P/LP) variants are shown in orange, benign or likely benign (B/LB) variants in green and variants of uncertain significance (VUS) in gray. B. A map of function assayed by survival of cells coexpressing the gain of function KCNQ1 variant S140G (details in the text). Loss of function variants are shown in shades of red, gain of function variants in blue, and no change in white; no data were generated for cells shown in gray. Panel B reproduced from52.

Table 1:

Example published and possible MAVE studies of cardiovascular genes

Gene Mutagenesis approach Cell type Library delivery Assay type Reference
Published cardiovascular MAVE studies
CALM1/2/3 POPCode Yeast Complementation Cell survival 38
SCN5A (12 aa) Inverse PCR HEK293 Landing pad Cell survival 71
MTHFR POPCode Yeast Complementation Cell survival 111
CYP2C9 Inverse PCR Yeast, HEK293 Transformation, landing pad Click-seq activity probe, protein abundance 119
KCNJ2 (mouse) SPINE mutagenesis HEK293 Landing pad Surface abundance, membrane potential reporter 62
HNF1A Commercial synthesis HUH7 Lentivirus Downstream marker expression 118
KCNH2 Inverse PCR HEK293 Landing pad Surface abundance 65
KCNE1 Inverse PCR HEK293 Landing pad Surface abundance, cell survival 52
MYH7 (5 aa) CRaTER iPSC-CM Integration at endogenous locus Protein abundance, cell survival 58
Potential cardiovascular MAVE studies
Lipoprotein metabolism (e.g. LDLR, PCSK9, LPL, APOB) Exogenous library synthesis HEK293 or Hela Landing pad LDL metabolism, triglyceride hydrolysis, surface abundance
Ion channel (e.g. KCNQ1, RYR2, CACNA1C) Exogenous library synthesis HEK293 or Hela Landing pad Surface abundance, cell survival, intracellular calcium reporter
Cardiomyopathy (e.g. MYBPC3, PKP2, RBM20) Endogenous locus editing or integration iPSC-CM Endogenous locus BNP reporter, cell survival, cell size, cell morphology

Saturation mutagenesis and generation of cell pools

A key step in a MAVE experiment is to generate a large pool of cells, each expressing a single variant in the target genomic region, on which the selection assay will be deployed. Experimental methods for library generation and integration into a model system have been discussed extensively in previous reviews37,39 and are described in brief below.

The generation of mutant libraries can be accomplished using genome editing at the endogenous locus, or by exogenous delivery and expression of a variant library (Figure 3)40,41. Libraries can be synthesized with a variety of methods including error-prone PCR, oligonucleotide-targeted editing, or commercial synthesis37. For exogenous delivery, the mutagenized library can be cloned en masse into an appropriate expression vector and integrated at an engineered ‘landing pad’ locus to ensure that a single variant is expressed per cell42,43,44. Alternatively, CRISPR-based gene editing technologies have introduced a range of options for mutagenizing the genome at the endogenous locus40,45,46,47. These include saturation genome editing based on replacement of an endogenous segment with a mutagenized donor template; direct base editing (e.g., adenine and cytosine base editors directed using catalytically inactive Cas9 or “dCas9”); or more recently using prime editing, using CRISPR-directed introduction of a mutagenized oligonucleotide at the endogenous locus45,48. One method, CRISPR-X, achieves relatively wide base-editing windows by combining dCas9 and sgRNA with activation-induced cytidine deaminase-mediated somatic hypermutation40. These technologies are especially useful in cell types that are not tractable for genome engineering with large constructs, or for assessing variant function in cellular and disease contexts such as the effect of HCM variants on indices of contractility in cardiomyocytes. One advantage of endogenous DNA-editing methods over cDNA delivery approaches is the ability to measure the functional effects of non-coding variants, which are not present in cDNA constructs.

Functional assays for cardiovascular disease–associated genes

Assays have been developed to investigate disease-relevant phenotypes such as cell surface trafficking, response to a selection intervention (such as a drug challenge), or cellular phenotypes such as contractility or cell size. Some assay methods are broadly applicable to multiple genes, while others are gene-specific “bespoke” assays, optimized to measure pathophysiologic effects unique to variants in a particular gene. Typically, assays are developed based on known pathophysiology and are validated using conventional low-throughput measurements of control variants. Central to assay development is the choice of cell system. In some instances, protein function is examined in uniquely disease-relevant cell types, such as cardiomyocytes derived from induced pluripotent stem cells (iPSC-CMs). On the other hand, human protein variant function can often instead be accurately assessed in tractable heterologous cell models, such as human embryonic kidney (HEK) cells51,52 or even yeast38,53.

Multiple MAVEs may be required to provide a comprehensive functional assessment, using assays that either measure overall protein function or specific subfunctions. Here, we present examples with a focus on Mendelian cardiovascular disease genes.

Protein abundance:

A major class of loss-of-function variants are variants that disrupt protein stability or subcellular localization5457. One method, Variant Abundance by Massively Parallel Sequencing (VAMP-seq)51, involves the creation of a library of variants in a protein fused to a fluorescent protein such as GFP. Variants that disrupt protein abundance result in lower fusion protein abundance and lower fluorescence; pools of variant-expressing cells are sorted using Fluorescence Activated Cell Sorting (FACS) and deep-sequenced51. This method can be applied to a broad set of cardiovascular proteins where such variants are a major disease mechanism. For example, a multiplexed study of 113 variants in the HCM-associated gene MYH7 used VAMP-seq to identify pathogenic variants that disrupted β-myosin heavy chain protein abundance.58 This assay accurately classified 3 benign and 3 pathogenic control variants. Another method, the abundance protein fragment complementation assay (aPCA), uses a fragment of an essential enzyme fused to the protein of interest59. This technique translates differences in protein stability into changes in growth rate in yeast cells; pools of mutant-expressing cells can thus be assessed by deep sequencing. aPCA has been applied on a massive scale to assess the stability of thousands of variants in 500 protein domains60. Although protein abundance assays can be readily applied to many genes to identify function-disrupting variants, a limitation of this assay type is that most genes likely have function-disrupting variants that do not affect protein abundance.

Cell surface abundance:

Many cardiovascular disease genes are cell surface proteins, including ion channels associated with arrhythmias and receptors and other proteins that regulate lipid processing. A decrease in cell surface abundance, typically due to misfolding and/or mistrafficking, is a common loss-of-function mechanism for cell surface proteins61. Cell surface proteins can be labeled using a fluorescently-labeled antibody against the protein or or an epitope tag (e.g. HA or myc tag; Figure 4A) on live cells. This cell surface abundance assay is applicable to a large set of cardiovascular genes, including key hyperlipidemia and arrhythmia genes such as LDLR, LPL, SCN5A, KCNE1, KCNQ1, KCNH2, and KCNJ2. A comprehensive scan of cell-surface abundance of KCNJ2 missense variants identified regions especially sensitive to mutation, including regions in the transmembrane helices and ER and Golgi export motifs62. A cell surface abundance assay of all possible KCNE1 variants identified known loss of trafficking variants associated with LQTS and predicted new loss-of-trafficking variants52. This study also identified glycosylation and N-terminal cysteine mutations that affect cell surface trafficking, and surprisingly revealed that large C-terminal truncations do not disrupt trafficking to the cell surface. Nearly 50% of possible variants in a KCNH2 hotspot region disrupted trafficking and MAVE trafficking scores were strongly correlated with automated patch clamp measurements of protein function63,64. Extension of this trafficking assay to over 18,000 variants across KCNH2 revealed that the MAVE data, in combination with other variant features, predicted severe cardiac events65. As with protein abundance assays, a limitation of cell surface trafficking assays is that cell-surface genes likely have function-disrupting variants that do not affect trafficking.

Ion channel function can also be assessed by MAVE-compatible voltage-sensitive fluorescent markers. For instance, overexpression of the cardiac potassium channel gene KCNJ2 in HEK293 cells leads to a decrease in the resting membrane potential. A membrane potential-sensitive fluorescent reporter was used in a MAVE to comprehensively measure the function of KCNJ2 variants60. This dataset was compared to the cell-surface trafficking MAVE described above to identify variants that specifically altered channel function without affecting trafficking. For example, the lipid PIP2 activates KCNJ2, and the “function-only” variants included residues involved in binding to and responding to PIP2.

Cellular growth or survival:

Functional variant assays based on cellular growth have been the basis of many MAVE studies (Figure 4B). For example, Findlay and colleagues47 exploited the observation that functional BRCA1 is required for survival of HAP1 cells, a haploid cell line. After saturation genome editing of BRCA1 exons encoding critical functional domains, high-throughput sequencing of variant pools was used to identify loss of function and normal function variants. The assay nearly perfectly validated 375 known pathogenic or benign variants and discovered ~700 new loss of function variants out of 3,893 missense variants tested. This approach is currently being deployed for additional genes that are essential in the HAP1 cell line6668. A map for the three genes CALM1, CALM2, and CALM3, which encode the identical calmodulin protein product, was generated based on cellular growth and survival in a yeast selection assay38. This map has already been used to interpret clinical variants in 3 patients presenting with variants in CALM1/2/3 and arrhythmias (see below for additional details)53. In other work, a multiplexed study of 113 MYH7 variants in iPSC-CMs identified a cell survival phenotype that could accurately classify 8 control benign and pathogenic variants58.

Many Mendelian cardiovascular genes are not essential in commonly-used tractable cell lines, limiting applicability of this relatively straightforward cellular survival MAVE approach. However, it is often possible to impose environmental or genetic contexts that sensitize cells to the function of the gene of interest. For example, when the fatty acid synthase gene (FASN) is knocked out in HAP1 cells, the loss of LDLR (which encodes the low-density lipoprotein (LDL) receptor) causes a more pronounced growth impact69. Other examples involve SCN5A and KCNE1, ion channel genes linked to arrhythmias. A scan of a 12 residue voltage sensor region in the sodium channel SCN5A was performed using a triple drug selection assay in which veratridine and brevetoxin inhibited normal channel inactivation, and the sodium-potassium ATPase inhibitor ouabain sensitized cells expressing normal or gain of function channels to sodium overload and cell death70,71. Similarly, a KCNE1 map was generated in HEK293 cells stably expressing the KCNQ1 variant S140G, which exhibits gain of function properties when coexpressed with KCNE1 (Figure 4B)72,73. Cells expressing normal function KCNE1 variants were depleted from the cell pool, whereas cells expressing KCNE1 loss of function variants remained in the cell pool.52 This assay generated functional data for 2,534 variants, of which only 111 of which are annotated in ClinVar, most of which are VUS (Figure 5).

Cardiomyopathy phenotypes:

Many pathogenic HCM- or DCM-causing variants cause specific contractile phenotypes that are difficult to assess in non-cardiomyocyte cells, and iPSC-CMs are a promising system in these cases. Direct measurements of hypercontractility or hypocontractility — primary cellular correlates of HCM and DCM — are possible in single iPSC-CMs74, 2D monolayer, or engineered 3D tissue of iPSC-CMs75, but these measurements are difficult to scale to multiplexed assays. However, cardiomyopathy is also associated with other phenotypic changes, including altered cellular size, stiffness, sarcomeric structure, and expression of markers such as BNP. Assays to detect these phenotypes may be useful for genes harboring variants that cause sarcomeric hyperactivation (MYH7, MYBPC3, TNNT2 and TPM1)76,77 or aberrant sarcomeric and cytoskeletal stretch-sensing (TTN, LMNA, DSP)7880. For example, a non-multiplexed study of 51 TNNT2 variants identified changes in contractility in iPSC-CMs expressing pathogenic variants74. The same study also distinguished variants using a fluorescent reporter for expression of NPPB (encoding BNP), an approach that is compatible with MAVE experiments. Passive size-based cell sorting could also provide a direct measure of single iPSC-CM hypertrophy81. High speed microscopy and advanced image analysis using deep learning can define static and dynamic cell morphology characteristics82 and may enable active sorting of cells based on multi-feature morphology assessment49,58. Pathogenic variants in cardiomyopathy genes might also cause changes in protein abundance or cell survival in iPSC-CMs that are compatible with the corresponding scalable assays described above58.

Uptake and secretion:

The ability of cultured cells to effectively take up LDL and other lipoprotein particles that have been labeled with a traceable fluorophore facilitates a range of scalable functional assays for dyslipidemia genes83. For instance, cells expressing a disrupted LDLR gene exhibit decreased internalization of fluorescently labeled LDL84. This approach could be expanded to other genes, such as PCSK9 and LDLRAP1, which are involved in LDL uptake, as well as other lipoprotein subtypes and receptors (e.g., very-low-density lipoprotein and VLDLR).

The general MAVE approach (Figure 3) relies on culturing a large number of cells that each express a different variant. This is not ideal for secreted proteins like PCSK9 because, in a heterogeneous culture, the impact of a secreted protein variant becomes decoupled from the genotype of its cell of origin. To address this issue, the secreted protein can be tethered to the surface of the secreting cell, for example, using a glycosylphosphatidylinositol anchor85. Alternatively, secreted proteins can be bound to their cell of origin by expressing them as a fusion with a selected transmembrane domain. This strategy was applied to coagulation factor IX, a secreted serine protease linked with hemophilia B86.

Gene expression and splicing assays:

Most MAVEs and clinical classification efforts to date have focused on protein-altering coding variants. Non-coding regions are often more difficult to interpret, due to their large size and the complexity of their effects on regulation of gene expression or function. However, non-coding regions are increasingly recognized for their contribution to disease, both through common risk loci detected through genome-wide association studies and rare, large-effect variants. Regulation of gene expression and aberrant splicing can also be examined using variant effect mapping.

Enhancers can be studied with Massively Parallel Reporter Assays (MPRAs). MPRAs involve the creation of a library of candidate enhancer or promoter regions, followed by multiplexed expression in cells and high-throughput sequencing. MPRAs can be applied to inform disease processes for a variety of common and rare cardiovascular disorders87. MPRAs of candidate cardiovascular enhancers can be performed in cell lines or in vivo, e.g. by transduction of mouse hearts88,89. The LDLR promoter has been comprehensively scanned, identifying loss of function regulatory variants that may be linked to FH90. A recent MPRA study of over 680,000 candidate enhancer sequences included saturation mutagenesis of an enhancer of PKLR, a gene linked to pyruvate kinase deficiency and hemolytic anemia91.

To address the cis effects of variants in splicing regions of disease genes, previous studies have identified coding variants that lead to reduced transcript expression by RNA sequencing47. In some cases, nonsense-mediated decay-affected variants can be identified using genotypes and expression levels derived by RNAseq92. Exploration of the cis splicing effects of intronic or exonic variants in genes can be performed with pooled minigene splice reporter assays93. In addition, single cell RNA sequencing can be used to quantify the splicing of specific transcripts94, although existing methods for simultaneous DNA and RNA sequencing in single cells remain too resource-intensive for broad use in variant effect mapping95, These latter methods may be more fruitfully deployed for examination of pathogenic missense variation in trans splicing regulators, such as RBM20 and RBFOX2 in cardiomyocytes96.

Other methods to probe variant-to-function relationships in regulatory regions are emerging. A single cell imaging and gene expression platform (LipocyteProfiler) was applied to variants in the 2p23.3 locus (previously linked to insulin resistance97) and identified mitochondrial function phenotypes during early differentiation which progressed to altered lipid-droplet formation in mature visceral adipocytes98. More generally, methods that can evaluate variants within risk loci will continue to be important for predictive cardiovascular medicine.

Data analysis and display

In MAVE experiments, the abundance of each variant is quantified by high-throughput sequencing in the pre-selection pool, either in the delivered library or in the cell pool. After phenotypic selection using one of the assay approaches described above, the abundance of each variant is again quantified by high-throughput sequencing and compared with the pre-selection pools. This quantification can be accomplished either by direct sequencing or by identification of barcodes paired with known variants38,64,71. Variant effect scores are then calculated using standardized and reproducible data analysis pipelines99. These scores are then normalized to internal controls, e.g., rescaling scores so that synonymous variants have a median score of 1 and nonsense variants have a median score of 0. Larger numbers of replicates allow more accurate estimation of random error in scores, although even two replicates allows error estimation via trends that relate pre-selection variant abundance to replicate agreement38,99,100. Scores (and estimated errors) can then be visualized as a “variant effect map”, as illustrated in Figure 5.

Validation and implementation of MAVEs

Using MAVEs in clinical variant classification

MAVE datasets are already being used in clinical settings to help interpret cardiovascular variants. A MAVE for the three identical calmodulin genes, CALM1, CALM2, and CALM3, was performed using a yeast selection assay38. This map was deployed to interpret clinical variants in three patients, each presenting with arrhythmias or sudden unexplained death and distinct variants in CALM1, CALM2, or CALM353. For two of the patients, the MAVE data supported the loss of function of the variants, providing rapid evidence for variant reclassification towards pathogenic. For the third patient, the MAVE data, combined with additional clinical and family information indicating a phenotype atypical for calmodulin deficiency, supported the conclusion that the CALM1 variant was not the major cause of the patient’s phenotype.

Catalogs of variant function will be increasingly useful for annotating clinical variants and reducing the burden of VUS. MAVEs generate a large amount of variant data using a single standardized assay, thus minimizing the between-lab variability associated with ‘small-batch’ approaches while also facilitating assessments of assay quality. Notably, some “medium-throughput” assays such as automated patch clamping can also generate large, standardized functional datasets101; correlation between MAVE data and these types of datasets can be an important validation step65. In the initial ACMG/AMP guidelines, the PS3 criterion was applied if a well-validated functional assay showed a damaging effect or the BS3 criterion was applied if a well-validated functional assay showed no damaging effect102, with each of these criteria being applied at the “strong” level. The ClinGen Sequence Variant Interpretation (SVI) working group recommended a more nuanced approach, in which functional assays are validated with a large set of accepted pathogenic and benign controls, and this functional evidence can be applied at variable evidence strengths towards variant classification (Box 1)103. One potential caveat of MAVEs is that no single assay interrogates all functions of a protein. Because of this, “edge case” variants may be discovered with borderline assertions or disagreements between MAVEs and other data sources such as patient datasets or computational predictors. These variants can be further assessed in other model systems such as zebrafish or medium-throughput assays such as automated patch clamping.

Box 1: Calibrating MAVE assays using benign and pathogenic controls.

Box 1: Calibrating MAVE Assays Using the ClinGen SVI Approach.

Box 1:

Example calibration of two MAVE assays. P=pathogenic, B=benign. The left image shows MAVE assay results for 30 pathogenic and 30 benign controls, with the first assay having better performance than the second assay. The middle panel shows two 2×2 matrices summarizing MAVE score results by variant classification. The right panel shows Log Likelihood Ratios for pathogenicity and benignity (LLRP and LLRB), and how those convert to evidence strengths in the ACMG classification scheme.

The ACMG/AMP guidelines originally categorized functional evidence from well-established assays as either PS3 (abnormal assay result) or BS3 (normal assay result), both at a strong evidence level97. More recently, the ClinGen Sequence Variant Interpretation (SVI) group recommended calibrating functional assays using a large set of control variants known to be benign or pathogenic98. This calibration allows the calculation of a likelihood ratio of pathogenicity (LLRP, sometimes called “OddsPath”98). Depending on LLRP values, the strength of evidence for pathogenicity or benignity can range from supporting to strong or even very strong (for pathogenicity). MAVEs are well-suited for this approach because they generate data for many control variants, enabling implementation of the ClinGen SVI framework100. For example, in the figure, two MAVE assays are calibrated: one achieves PS3_strong and BS3_strong, while the other achieves PS3_moderate and BS3_moderate. A limitation of the original “OddsPath” approach is the use of fixed score thresholds, which apply the same strength of evidence to all variants above a threshold. Newer methods allow finer calibration, assigning different levels of evidence strength to variants based on their LLRP values, even within the same dataset53,124.

For some genes, large sets of control variants are available in ClinVar. However, for many genes, the number of reference variants is limited—especially for benign variants. This is partly due to reporting bias and the difficulty of proving a variant has no disease impact. Control variants can be supplemented with additional pathogenic variants from disease-focused datasets like SHARE125 or with variants designated benign because they are present at high frequency (sometimes in only some ancestries) in resources like gnomAD27,98. Finally, misannotated variants in reference sets can affect calibration accuracy, often underestimating evidence strength. The calibration approach described here can also be adapted for other types of data, such as computational variant effect predictors117.

The performance of a MAVE dataset is often assessed against existing clinical variant classifications (e.g. by precision-recall analysis). A recent method evaluated the performance of computational variant effect predictors based on their ability to infer human phenotypes in two large population-based cohorts, UK Biobank and All of Us104. Applying this approach to assess MAVE data could facilitate performance benchmarking across multiple MAVE datasets and could help interpret variants found in ancestrally diverse cohorts105.

A long-term goal of MAVE science is to improve variant classifications and accelerate the use of genetics in clinical care. A recent analysis of MAVE datasets for germline cancer variants revealed that 15–69% of VUS could be reclassified with the inclusion of MAVE data, depending on the gene/dataset106. A variant effect map for calmodulin was used to prospectively inform variant interpretation, diagnosis, and patient care53. “Best practice” guidelines have been established for reporting MAVE data107. These guidelines are designed to enhance the quality of MAVE datasets and broaden their clinical applications across populations of diverse ancestry105,108,109.

MAVEs also offer significant potential for gene-disease pairs with limited or conflicting evidence by generating robust datasets to address critical gaps in variant interpretation. For genes with few known pathogenic variants, such as KCNE1 or JPH2, MAVEs can identify a broader set of function-disrupting variants to clarify their impact on disease risk (Figure 4)52. To maximize their utility, gene prioritization for MAVE studies should consider clinical relevance, evidence gaps, and assay feasibility.

The ACMG/AMP system is primarily designed to classify large-effect variants, but there is growing recognition of low-penetrance or ‘modifier’ variants that do not fit neatly within the 5-tiered framework110. MAVEs offer some promise in modeling these variants, as low-penetrance variants may have moderately dysfunctional molecular phenotypes that can be measured in MAVE experiments. Additionally, low-penetrance common variants can interact synergistically with large-effect variants, as exemplified by MTHFR, which is linked to an inherited disorder of folate metabolism. The common partial loss-of-function MTHFR variant p.Ala222Val interacts with a key environmental factor (dietary folate yielding folinate) to modulate the phenotype. Combinatorial MAVEs have been used to model these interactions, examining the effects of p.Ala222Val, other MTHFR variants, and folinate concentration111.

Insight into underlying biology

High-throughput in vitro studies of variant function can not only enhance variant classification, but also serve as powerful tools to reveal disease biology. These datasets readily identify residues, motifs or domains that are highly intolerant to variation. Pairing this map information to experimentally-derived and computed protein structures112114 can provide insights into underlying mechanisms. Functional scores that incorporate disease-cohort and population variation are being used to identify novel functional domains in genes causing arrhythmogenic cardiomyopathy80,115, to probe the functional consequences of missense variation in channelopathies35,64, and to inform quantitative models of pathogenicity and penetrance71,116,117. MAVEs can provide insight into unexpected mechanisms of protein function that lead to follow-up investigations. For example, the recent KCNE1 map highlighted the high number of loss of function variants at protein-protein (KCNE1-KCNQ1-calmodulin) interaction sites52. Variant effect maps can also reveal underlying variant mechanisms, i.e. to determine whether a variant causes loss or gain of function, or affects the stability or function of a protein. MAVEs can also reveal the disease implications of variants in genes not traditionally considered Mendelian cardiovascular disease genes. For example, loss of function variants in HNF1A are linked to monogenic diabetes. An analysis of HNF1A gain of function variants identified by a HNF1A MAVE in biobank cohorts found an unexpected association between these variants and the hepatic secretion of atherogenic lipoproteins118. MAVEs of the cytochrome P450 genes CYP2C9119 and CYP2C19120 identified novel function-altering variants predicted to impact drug metabolism. These studies have implications for prescribing the anticoagulant/antiplatelet drugs warfarin and clopidogrel, which are affected by CYP2C9 and CYP2C19 genotypes, respectively.

The Atlas of Variant Effect Alliance has developed a controlled vocabulary for MAVE studies107, implemented in the MaveDB repository109. This vocabulary enables a standardized description of key MAVE features, including library generation methods, cellular systems, phenotypic assay types, and sequencing methods. As with all in vitro models, it is essential to critically evaluate whether MAVE assays accurately reflect the biological processes and mechanisms relevant to the disease being studied. MAVEs report the specific functional feature(s) being assayed in vitro across large numbers of variants. The extent to which variants that alter function in a MAVE generate clinical phenotypes, i.e., penetrance, is highly variable and dependent on multiple and incompletely understood factors. These include genetic background (currently often captured by polygenic scores) and environmental influences; in addition, the severity of protein dysfunction captured by a MAVE may influence disease penetrance. Public resources like ClinVar108 and MaveDB109 make MAVE data accessible in standardized formats, which will help clinical labs integrate this evidence into variant classification efficiently.

Variant Effect Predictors in clinical variant classification

Overview of Variant Effect Predictors

In the last decade, Variant Effect Predictors (VEPs) have begun to offer an increasingly reliable means of predicting variant effects. We note that our use of the term “VEP” does not refer to the Ensembl Variant Effect Predictor (VEP) annotation tool, but to the general class of variant computational prediction tools. These models make predictions based on patterns derived from input features (e.g., evolutionary, biochemical, or population-level evidence) and can be broadly categorized by whether they were developed using supervised or unsupervised machine learning methods (see Box 2). Because they can be scaled to provide exome-wide predictions, computational predictors can be readily applied to a large set of human genetic variants—comparatively, MAVE functional assessments have to date covered fewer than 1% of human genes37.

Box 2: Variant effect predictors.

The most important input for essentially every well-performing VEP is sequence conservation. This feature has been traditionally derived from multiple sequence alignments, and indeed VEPs have generally performed best for regions that are well aligned across many species126. Since the earliest examples115,127129, most VEPs have used a protein-specific multiple sequence alignment, with the frequency of each residue at each position informing the probability that a given substitution is damaging. Here, substitutions are more likely to disrupt function when they are at conserved positions and also involve amino acid exchanges that are less-commonly seen in aligned homologous sequences (as captured in substitution matrices such as BLOSUM62130). Later methods integrated other protein information such as biochemical features, site annotations (e.g., binding sites), and (often-predicted) secondary and tertiary structure131. More recently, deep learning models have been employed to capture higher-order (between-position) dependencies between amino acid substitutions through the course of evolution126. Most VEPs have been “supervised” in the sense that their models have been optimized to perform well for a training set of variants that have been labeled according to pathogenicity. One recent high-performing VEP, AlphaMissense, did not use pathogenicity labels, but instead optimized models using the absence of variants in a population-level cohort (gnomAD) as a proxy for pathogenicity. An important class of “unsupervised” methods (e.g., ESM1b113) exploits conservation information by ‘learning the language’ of protein sequences drawn from many species, without ever using pathogenicity annotations. Given that these VEPs do not require multiple sequence alignments, they have greater potential to estimate variant effects within intrinsically-disordered or other less well-aligned regions.

Some VEPs can predict the impact of noncoding variants, including both single-nucleotide variants and short insertions and deletions. Some such VEPs predict the effect of variants genome-wide132,133 while others are optimized for sequence subtypes such as 5’-UTRs134 or promoters135 or to predict epigenetic effects136. These VEPs can incorporate a broad range of features such as chromatin accessibility, transcript levels, epigenetic marks to infer regulatory variant effects, cross-species conservation, or sequence properties. Importantly, comprehensive ‘truth sets’ and independent functional assessments are less available for noncoding variants, complicating the assessment and interpretation of their performance.

Where they have been ‘overfit’ to training labels, supervised VEPs are generally more prone to inflated estimates of performance, e.g., where performance has been judged with variants not used in training the VEP. Use of allele frequency information either directly or indirectly (e.g. using knowledge of a variant’s appearance in a population cohort) can introduce some circularity, given the strong role of allele frequency in determining benign labels112. While unsupervised methods can circumvent some of these issues (by not training on labelled data), some VEPs advertised as unsupervised have used labeled data for parameter optimization. Finally, performance of all VEPs may be inflated where related evidence (e.g., sequence conservation) has been used in assigning pathogenicity labels used in testing.

Predictors are increasingly incorporating the outputs of other prediction algorithms as features in training (typically in a supervised framework). These ‘meta-predictors’ now rank among the top-performing methods (e.g., VARITY114, REVEL137)99,112. However, given the breadth of data they incorporate in training (both directly and indirectly through the use of other VEPs), it remains challenging to obtain unbiased ground-truth sets by which to benchmark meta-predictors. Measuring the ability of VEPs to predict human phenotypes in independent cohorts offers an unbiased approach99.

To assess VEP accuracy and inform predictor choice, the predictors must be benchmarked. This is generally done using a ‘gold standard’ reference set of annotated variants from available databases (e.g., ClinVar). Benchmarking of individual VEPs has been performed with deep mutational scan data121,122, clinical classifications121,122, biobank cohorts104, and case-control and population cohorts123. These benchmarking studies summarize individual VEPs, their main features, and their performances. Importantly, performance evaluations can depend on the benchmarking dataset123 and their reliability is often limited by circularity and bias, which can have a profound effect on the perceived performance of a given method.124 For instance, where data that were used to train a predictor are later re-used in its evaluation, performance estimates may be inflated124,125. Moreover, where training data are skewed towards variants of a certain class (e.g., pathogenic or benign variants within proteins or protein families), a predictor may perform well against variants in this class, but poorly in classes where annotations are more balanced124,125. Some predictors will have also incorporated historical bias given the rates at which variants have been annotated as pathogenic or benign for a given gene, even where these rates do not reflect the a priori probability that a given patient’s variant is pathogenic. In an effort to limit concerns over circularity and bias, Livesey and Marsh have used experimental functional measurements from MAVEs (which have not been seen in training) to benchmark predictor performance122,124,126. Alternatively, measuring the correspondence between functional predictions and human traits in large population-based cohorts has enabled unbiased benchmarking104, and this can be applied in cohorts of diverse ancestry and ethnicity.

VEPs are becoming increasingly accurate, with recently published methods (e.g., AlphaMissense114, ESM1b127, and VARITY128) outperforming commonly-used early predictors (e.g., PolyPhen2129 and SIFT130)104,126. As with MAVEs, predictors should be calibrated against control variants to inform evidence strength prior to use in clinical variant classification (e.g., PP3 and BP4; Box 1)131. Recently, the ClinGen SVI estimated the score intervals corresponding to each evidence strength for commonly-used predictors131, as well as more recent methods132, finding top-performing predictors from independent benchmarking efforts (e.g., REVEL, MutPred2, AlphaMissense, ESM1b, and VARITY) to reach Strong evidence towards pathogenicity and benignity for stringent score windows. One limitation to this genome-wide calibration, however, is that it may mask disparate performance between genes133. Ideally, as described for MAVEs, calibration should be carried out more accurately at the gene level. (Note: Initial reports of new VEPs often recommend thresholds for pathogenicity or benignity. However, in the absence of independent score calibration131,132, self-reported thresholds and classifications should be taken with extreme caution).

Opportunities for synergy between variant effect predictors and MAVEs

In vitro and in silico methods are already complementary in that agreement among MAVEs, VEPs, and reference sets can increase mutual confidence in each approach, and MAVEs have become a useful means of benchmarking VEPs121,122,126. Beyond this, functional data from MAVEs are increasingly used to refine and validate computational variant effect predictors, and functional evidence has even been incorporated into VEP training128,134, although this approach can complicate the use of MAVEs to benchmark VEPs).

An important form of synergy between VEPs and MAVEs is that, under current variant classification guidelines, they are considered independent sources of evidence102, so that they are not “in competition”. However, as MAVEs and VEPs become increasingly accurate at predicting functional impacts of variation, their outputs are expected to become dependent126. Indeed, some extensions of the ACMG framework allow use of either VEP or experimental functional evidence, but not both135. How best to combine VEP and MAVE evidence together with other criteria (including variable clinical phenotypes) in variant interpretation remains an open and important research question. For example, for a subset of variants, VEP and MAVE evidence might disagree with each other or with clinical case or allele frequency evidence. A close examination of disagreements between these datasets should identify areas where VEPs or MAVEs are underperforming in modeling disease pathophysiology.

There are opportunities for MAVE strategies aimed that augment or increase the utility of VEP methods. For example, current predictors have been shown to underperform in predicting the effects of non-loss-of-function variants—particularly gain-of-function and dominant-negative mutations136. Thus, functional assays could be targeted to regions where VEP performance is known or expected to be less reliable (e.g., positions under strong positive selection or intrinsically-disordered regions). A complete understanding of variant effects may require functional assessments in many relevant genetic or environmental contexts. Current VEPs do not consider context-dependence of variant effects (e.g., lipoprotein subtypes, co-expression of binding partners, or drug exposures), MAVEs are scalable across such diverse physiologic contexts and thus well-suited for those functional assessments. Thus, MAVEs and VEPs represent complementary approaches in classifying and understanding human variants and their biology at scale and a future vision of VUS reclassification will engage both.

Conclusion

The VUS problem poses a growing challenge to the implementation of genomic medicine for cardiovascular disorders. It is increasingly evident that a major part of the solution will be high-throughput functional genomic experiments such as MAVEs, which can provide functional evidence for all possible variants within a genetic element. Through MAVEs, multiple features of variants in cardiovascular proteins can be defined; examples include measurements of protein abundance, cell surface trafficking, lipoprotein uptake, or electrical signaling, or cellular contractile properties.

MAVEs can provide new insights into underlying biological functions of key cardiovascular proteins and accelerate personalized care for patients and their families. Because they can be used to study every possible variant in a gene or target region, MAVEs can partially address current inequities in clinical genetics by reducing the higher rate of VUS that exist in populations of non-European-ancestry33,105. MAVEs are complementary to other emerging variant interpretation methods, such as VEPs and large patient cohorts. MAVEs, when integrated with these other datasets, will help improve variant annotations and make significant progress towards making the term “VUS” obsolete24.

Key points.

  • While genetic testing is increasingly used in clinical management of inherited cardiovascular disorders, most variants are classified as ‘variants of uncertain significance’ (VUS).

  • Multiplexed assays of variant effect (MAVEs) can assess function of nearly all coding variants in a target sequence, providing proactive evidence for variants observed in patients.

  • In silico variant effect predictors (VEPs) are becoming increasingly accurate and can provide predicted variant effects for nearly every variant in the genome.

  • MAVEs and VEPs provide complementary information to illuminate cardiovascular disease biology and resolve the VUS problem.

Acknowledgements

This work was supported by NIH grants HL149826, HL164675, HL168059, GM150465, HG010904, HL160863, HG010461, and HG011989. We gratefully acknowledge UK Biobank and All of Us participants for their contributions, without whom this research would not have been possible. We also thank the UK Biobank and the NIH All of Us Research Program for making available the participant data examined in this study. F.P.R. is an advisor for and shareholder of Constantiam Biosciences.

Footnotes

Competing interests

F.P.R. is an advisor for and shareholder of Constantiam Biosciences. The other authors declare no competing interests.

Ethics approval and consent to participate

Consent to participate was obtained via the UK Biobank and All of Us projects, and datasets were analyzed in accordance with the associated data use agreements. The transfer of human data was approved and overseen by The UK Biobank Ethics Advisory Committee. This study was performed in alignment with the ethical principles outlined in the All of Us Policy on the Ethical Conduct of Research.

Data availability

This study used data from the All of Us Research Program’s Controlled Tier Dataset version 8, available to authorized users on the Researcher Workbench via https://www.workbench.researchallofus.org. The UK Biobank dataset is available by application via https://www.ukbiobank.ac.uk/.qq

References

  • 1.van der Harst P, Verweij N. Identification of 64 Novel Genetic Loci Provides an Expanded View on the Genetic Architecture of Coronary Artery Disease. Circ Res. 2018;122:433–443. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Nelson CP, Goel A, Butterworth AS, Kanoni S, Webb TR, Marouli E, Zeng L, Ntalla I, Lai FY, Hopewell JC, Giannakopoulou O, Jiang T, Hamby SE, Di Angelantonio E, Assimes TL, et al. Association analyses based on false discovery rate implicate new loci for coronary artery disease. Nat Genet. 2017;49:1385–1391. [DOI] [PubMed] [Google Scholar]
  • 3.Khera AV, Kathiresan S. Genetics of coronary artery disease: discovery, biology and clinical translation. Nat Rev Genet. 2017;18:331–344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bezzina CR, Barc J, Mizusawa Y, Remme CA, Gourraud J-B, Simonet F, Verkerk AO, Schwartz PJ, Crotti L, Dagradi F, Guicheney P, Fressart V, Leenhardt A, Antzelevitch C, Bartkowiak S, et al. Common variants at SCN5A-SCN10A and HEY2 are associated with Brugada syndrome, a rare disease with high risk of sudden cardiac death. Nat Genet. 2013;45:1044–1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lahrouchi N, Tadros R, Crotti L, Mizusawa Y, Postema PG, Beekman L, Walsh R, Hasegawa K, Barc J, Ernsting M, Turkowski KL, Mazzanti A, Beckmann BM, Shimamoto K, Diamant U-B, et al. Transethnic Genome-Wide Association Study Provides Insights in the Genetic Architecture and Heritability of Long QT Syndrome. Circulation. 2020;142:324–338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Shah S, Henry A, Roselli C, Lin H, Sveinbjörnsson G, Fatemifar G, Hedman ÅK, Wilk JB, Morley MP, Chaffin MD, Helgadottir A, Verweij N, Dehghan A, Almgren P, Andersson C, et al. Genome-wide association and Mendelian randomisation analysis provide insights into the pathogenesis of heart failure. Nat Commun. 2020;11:163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Abul-Husn NS, Manickam K, Jones LK, Wright EA, Hartzel DN, Gonzaga-Jauregui C, O’Dushlaine C, Leader JB, Lester Kirchner H, Lindbuchler DM, Barr ML, Giovanni MA, Ritchie MD, Overton JD, Reid JG, et al. Genetic identification of familial hypercholesterolemia within a single U.S. health care system. Science [Internet]. 2016;354. [Google Scholar]
  • 8.Schwartz PJ, Stramba-Badiale M, Crotti L, Pedrazzini M, Besana A, Bosi G, Gabbarini F, Goulene K, Insolia R, Mannarino S, Mosca F, Nespoli L, Rimini A, Rosati E, Salice P, et al. Prevalence of the Congenital Long-QT Syndrome. Circulation. 2009;120:1761–1767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Roden DM. Drug-induced prolongation of the QT Interval. N Engl J Med. 2004;350:1013–1022. [DOI] [PubMed] [Google Scholar]
  • 10.Kaab S, Crawford DC, Sinner MF, Behr ER, Kannankeril PJ, Wilde AA, Bezzina CR, Schulze-Bahr E, Guicheney P, Bishopric NH, Myerburg RJ, Schott JJ, Pfeufer A, Beckmann BM, Martens E, et al. A large candidate gene survey identifies the KCNE1 D85N polymorphism as a possible modulator of drug-induced torsades de pointes. Circ Cardiovasc Genet. 2012;5:91–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Darbar D, Yang T, Churchwell K, Wilde AA, Roden DM. Unmasking of brugada syndrome by lithium. Circulation. 2005;112:1527–1531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wada Y, Yang T, Shaffer CM, Daniel LL, Glazer AM, Davogustto GE, Lowery BD, Farber-Eger E, Wells QS, Roden DM. Common Ancestry-specific Ion Channel Variants Predispose to Drug-induced Arrhythmias. Circulation. 2022;145:299–308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Marian AJ, Braunwald E. Hypertrophic Cardiomyopathy: Genetics, Pathogenesis, Clinical Manifestations, Diagnosis, and Therapy. Circ Res. 2017;121:749–770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hershberger RE, Hedges DJ, Morales A. Dilated cardiomyopathy: the complexity of a diverse genetic architecture. Nat Rev Cardiol. 2013;10:531–547. [DOI] [PubMed] [Google Scholar]
  • 15.McNally EM, Mestroni L. Dilated Cardiomyopathy: Genetic Determinants and Mechanisms. Circ Res. 2017;121:731–748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Choi SH, Weng LC, Roselli C, Lin H, Haggerty CM, Shoemaker MB, Barnard J, Arking DE, Chasman DI, Albert CM, Chaffin M, Tucker NR, Smith JD, Gupta N, Gabriel S, et al. Association Between Titin Loss-of-Function Variants and Early-Onset Atrial Fibrillation. JAMA. 2018;320:2354–2364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Yoneda ZT, Anderson KC, Quintana JA, O’Neill MJ, Sims RA, Glazer AM, Shaffer CM, Crawford DM, Stricker T, Ye F, Wells Q, Stevenson LW, Michaud GF, Darbar D, Lubitz SA, et al. Early-Onset Atrial Fibrillation and the Prevalence of Rare Variants in Cardiomyopathy and Arrhythmia Genes [Internet]. JAMA Cardiology. 2021;6:1371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Murdock DR, Venner E, Muzny DM, Metcalf GA, Murugan M, Hadley TD, Chander V, de Vries PS, Jia X, Hussain A, Agha AM, Sabo A, Li S, Meng Q, Hu J, et al. Genetic testing in ambulatory cardiology clinics reveals high rate of findings with clinical management implications. Genet Med [Internet]. 2021; [Google Scholar]
  • 19.Luirink IK, Wiegman A, Kusters DM, Hof MH, Groothoff JW, de Groot E, Kastelein JJP, Hutten BA. 20-Year Follow-up of Statins in Children with Familial Hypercholesterolemia. N Engl J Med. 2019;381:1547–1556. [DOI] [PubMed] [Google Scholar]
  • 20.Maron BJ. Clinical Course and Management of Hypertrophic Cardiomyopathy. N Engl J Med. 2018;379:655–668. [DOI] [PubMed] [Google Scholar]
  • 21.Rowin EJ, Burrows A, Madias C, Estes NAM 3rd, Link MS, Maron MS, Maron BJ. Long-Term Outcome in High-Risk Patients With Hypertrophic Cardiomyopathy After Primary Prevention Defibrillator Implants. Circ Arrhythm Electrophysiol. 2020;13:e008123. [DOI] [PubMed] [Google Scholar]
  • 22.Al-Khatib SM, Stevenson WG, Ackerman MJ, Bryant WJ, Callans DJ, Curtis AB, Deal BJ, Dickfeld T, Field ME, Fonarow GC, Gillis AM, Granger CB, Hammill SC, Hlatky MA, Joglar JA, et al. 2017 AHA/ACC/HRS Guideline for Management of Patients With Ventricular Arrhythmias and the Prevention of Sudden Cardiac Death. Circulation. 2018;138:e272–e391. [DOI] [PubMed] [Google Scholar]
  • 23.Green ED, Guyer MS. Charting a course for genomic medicine from base pairs to bedside. Nature. 2011;470:204–213. [DOI] [PubMed] [Google Scholar]
  • 24.Green ED, Gunter C, Biesecker LG, Di Francesco V, Easter CL, Feingold EA, Felsenfeld AL, Kaufman DJ, Ostrander EA, Pavan WJ, Phillippy AM, Wise AL, Dayal JG, Kish BJ, Mandich A, et al. Strategic vision for improving human health at The Forefront of Genomics. Nature. 2020;586:683–692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Grundy SM, Stone NJ, Bailey AL, Beam C, Birtcher KK, Blumenthal RS, Braun LT, de Ferranti S, Faiella-Tommasino J, Forman DE, Goldberg R, Heidenreich PA, Hlatky MA, Jones DW, Lloyd-Jones D, et al. 2018 AHA/ACC/AACVPR/AAPA/ABC/ACPM/ADA/AGS/APhA/ASPC/NLA/PCNA Guideline on the Management of Blood Cholesterol: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation. 2019;139:e1046–e1081. [DOI] [PubMed] [Google Scholar]
  • 26.Mach F, Baigent C, Catapano AL, Koskinas KC, Casula M, Badimon L, Chapman MJ, De Backer GG, Delgado V, Ference BA, Graham IM, Halliday A, Landmesser U, Mihaylova B, Pedersen TR, et al. 2019 ESC/EAS Guidelines for the management of dyslipidaemias: lipid modification to reduce cardiovascular risk. Eur Heart J. 2020;41:111–188. [DOI] [PubMed] [Google Scholar]
  • 27.Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, Collins RL, Laricchia KM, Ganna A, Birnbaum DP, Gauthier LD, Brand H, Solomonson M, Watts NA, Rhodes D, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–443. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Landrum MJ, Lee JM, Riley GR, Jang W, Rubinstein WS, Church DM, Maglott DR. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 2014;42:D980–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Starita LM, Ahituv N, Dunham MJ, Kitzman JO, Roth FP, Seelig G, Shendure J, Fowler DM. Variant Interpretation: Functional Assays to the Rescue. Am J Hum Genet. 2017;101:315–325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gordon AS, Zouk H, Venner E, Eng CM, Funke BH, Amendola LM, Carrell DS, Chisholm RL, Chung WK, Denny JC, Fedotov A, Hakonarson H, Kullo IJ, LE B, Leduc MS, et al. Frequency of genomic secondary findings among 21,915 eMERGE network participants. Genet Med [Internet]. 2020; [Google Scholar]
  • 31.Shirts BH, Pritchard CC, Walsh T. Family-Specific Variants and the Limits of Human Genetics. Trends Mol Med. 2016;22:925–934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Weile J, Roth FP. Multiplexed assays of variant effects contribute to a growing genotype-phenotype atlas. Hum Genet. 2018;137:665–678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Pottinger TD, Puckelwartz MJ, Pesce LL, Robinson A, Kearns S, Pacheco JA, Rasmussen-Torvik LJ, Smith ME, Chisholm R, McNally EM. Pathogenic and Uncertain Genetic Variants Have Clinical Cardiac Correlates in Diverse Biobank Participants. J Am Heart Assoc. 2020;9:e013808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Vanoye CG, Desai RR, Fabre KL, Gallagher SL, Potet F, DeKeyser J-M, Macaya D, Meiler J, Sanders CR, George AL. High-Throughput Functional Evaluation of KCNQ1 Decrypts Variants of Unknown Significance. Circulation: Genomic and Precision Medicine. 2018;11:e002345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Glazer AM, Wada Y, Li B, Muhammad A, Kalash OR, O’Neill MJ, Shields T, Hall L, Short L, Blair MA, Kroncke BM, Capra JA, Roden DM. High-Throughput Reclassification of SCN5A Variants. Am J Hum Genet. 2020;107:111–123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Glazer AM, Davogustto GE, Shaffer CM, Vanoye CG, Desai RR, Farber-Eger E, Dikilitas O, Shang N, Pacheco JA, Yang T, Muhammad A, Mosley JD, Van Driest SL, Wells QS, Shaffer LL, et al. Arrhythmia Variant Associations and Reclassifications in the eMERGE-III Sequencing Study. Circulation [Internet]. 2021; [Google Scholar]
  • 37.Tabet D, Parikh V, Mali P, Roth FP, Claussnitzer M. Scalable Functional Assays for the Interpretation of Human Genetic Variation. Annu Rev Genet. 2022;56:441–465. [DOI] [PubMed] [Google Scholar]
  • 38.Weile J, Sun S, Cote AG, Knapp J, Verby M, Mellor JC, Wu Y, Pons C, Wong C, van Lieshout N, Yang F, Tasan M, Tan G, Yang S, Fowler DM, et al. A framework for exhaustively mapping functional missense variants. Mol Syst Biol. 2017;13:957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Fowler DM, Adams DJ, Gloyn AL, Hahn WC, Marks DS, Muffley LA, Neal JT, Roth FP, Rubin AF, Starita LM, Hurles ME. An Atlas of Variant Effects to understand the genome at nucleotide resolution. Genome Biol. 2023;24:147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Hess GT, Frésard L, Han K, Lee CH, Li A, Cimprich KA, Montgomery SB, Bassik MC. Directed evolution using dCas9-targeted somatic hypermutation in mammalian cells. Nat Methods. 2016;13:1036–1042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Findlay GM, Boyle EA, Hause RJ, Klein JC, Shendure J. Saturation editing of genomic regions by multiplex homology-directed repair. Nature. 2014;513:120–123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Matreyek KA, Stephany JJ, Chiasson MA, Hasle N, Fowler DM. An improved platform for functional assessment of large protein libraries in mammalian cells. Nucleic Acids Res. 2020;48:e1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Matreyek KA, Stephany JJ, Fowler DM. A platform for functional assessment of large variant libraries in mammalian cells. Nucleic Acids Res. 2017;45:e102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Zhu F, Gamboa M, Farruggio AP, Hippenmeyer S, Tasic B, Schüle B, Chen-Tsai Y, Calos MP. DICE, an efficient system for iterative genomic editing in human pluripotent stem cells. Nucleic Acids Res. 2014;42:e34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Anzalone AV, Randolph PB, Davis JR, Sousa AA, Koblan LW, Levy JM, Chen PJ, Wilson C, Newby GA, Raguram A, Liu DR. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature. 2019;576:149–157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Thuronyi BW, Koblan LW, Levy JM, Yeh W-H, Zheng C, Newby GA, Wilson C, Bhaumik M, Shubina-Oleinik O, Holt JR, Liu DR. Continuous evolution of base editors with expanded target compatibility and improved activity. Nat Biotechnol. 2019;37:1070–1079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Findlay GM, Daza RM, Martin B, Zhang MD, Leith AP, Gasperini M, Janizek JD, Huang X, Starita LM, Shendure J. Accurate classification of BRCA1 variants with saturation genome editing. Nature. 2018;562:217–222. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Koblan LW, Doman JL, Wilson C, Levy JM, Tay T, Newby GA, Maianti JP, Raguram A, Liu DR. Improving cytidine and adenine base editors by expression optimization and ancestral reconstruction. Nat Biotechnol. 2018;36:843–846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Friedman CE, Fayer S, Pendyala S, Chien W-M, Loiben A, Tran L, Chao LS, Mckinstry A, Ahmed D, Karbassi E, Fenix AM, Murry CE, Starita LM, Fowler DM, Yang K-C. CRaTER enrichment for on-target gene editing enables generation of variant libraries in hiPSCs. J Mol Cell Cardiol. 2023;179:60–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Dainis A, Zaleta-Rivera K, Ribeiro A, Chang ACH, Shang C, Lan F, Burridge PW, Liu WR, Wu JC, Chang ACY, Pruitt BL, Wheeler M, Ashley E. Silencing of MYH7 ameliorates disease phenotypes in human iPSC-cardiomyocytes. Physiol Genomics. 2020;52:293–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Matreyek KA, Starita LM, Stephany JJ, Martin B, Chiasson MA, Gray VE, Kircher M, Khechaduri A, Dines JN, Hause RJ, Bhatia S, Evans WE, Relling MV, Yang W, Shendure J, et al. Multiplex assessment of protein variant abundance by massively parallel sequencing. Nat Genet. 2018;50:874–882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Muhammad A, Calandranis ME, Li B, Yang T, Blackwell DJ, Harvey ML, Smith JE, Daniel ZA, Chew AE, Capra JA, Matreyek KA, Fowler DM, Roden DM, Glazer AM. High-throughput functional mapping of variants in an arrhythmia gene, KCNE1, reveals novel biology. Genome Med. 2024;16:73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Floyd BJ, Weile J, Kannankeril PJ, Glazer AM, Reuter CM, MacRae CA, Ashley EA, Roden DM, Roth FP, Parikh VN. Proactive variant effect mapping aids diagnosis in pediatric cardiac arrest. Circ Genom Precis Med [Internet]. 2023; [Google Scholar]
  • 54.Wang Z, Moult J. SNPs, protein structure, and disease. Hum Mutat. 2001;17:263–270. [DOI] [PubMed] [Google Scholar]
  • 55.Yue P, Li Z, Moult J. Loss of protein structure stability as a major causative factor in monogenic disease. J Mol Biol. 2005;353:459–473. [DOI] [PubMed] [Google Scholar]
  • 56.Schlebach JP, Sanders CR. The safety dance: biophysics of membrane protein folding and misfolding in a cellular context. Q Rev Biophys. 2015;48:1–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Zhou Y, Bowie JU. Building a thermostable membrane protein. J Biol Chem. 2000;275:6975–6979. [DOI] [PubMed] [Google Scholar]
  • 58.Friedman CE, Fayer S, Pendyala S, Chien W-M, Loiben A, Tran L, Chao LS, McKinstry A, Ahmed D, Farris SD, Stempien-Otero A, Jonlin EC, Murry CE, Starita LM, Fowler DM, et al. Multiplexed Functional Assessments of MYH7 Variants in Human Cardiomyocytes. Circ Genom Precis Med. 2024;17:e004377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Levy ED, Kowarzyk J, Michnick SW. High-resolution mapping of protein concentration reveals principles of proteome architecture and adaptation. Cell Rep. 2014;7:1333–1340. [DOI] [PubMed] [Google Scholar]
  • 60.Beltran A, Jiang X ‘er, Shen Y, Lehner B. Site-saturation mutagenesis of 500 human protein domains. Nature. 2025;637:885–894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Anderson CL, Delisle BP, Anson BD, Kilby JA, Will ML, Tester DJ, Gong Q, Zhou Z, Ackerman MJ, January CT. Most LQT2 mutations reduce Kv11.1 (hERG) current by a class 2 (trafficking-deficient) mechanism. Circulation. 2006;113:365–373. [DOI] [PubMed] [Google Scholar]
  • 62.Coyote-Maestas W, Nedrud D, He Y, Schmidt D. Determinants of trafficking, conduction, and disease within a K+ channel revealed through multiparametric deep mutational scanning. Elife [Internet]. 2022;11. [Google Scholar]
  • 63.Ng C-A, Ullah R, Farr J, Hill AP, Kozek KA, Vanags LR, Mitchell DW, Kroncke BM, Vandenberg JI. A massively parallel assay accurately discriminates between functionally normal and abnormal variants in a hotspot domain of KCNH2. Am J Hum Genet. 2022;109:1208–1216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Kozek KA, Glazer AM, Ng CA, Blackwell D, Egly CL, Vanags LR, Blair M, Mitchell D, Matreyek KA, Fowler DM, Knollmann BC, Vandenberg JI, Roden DM, Kroncke BM. High-throughput discovery of trafficking-deficient variants in the cardiac potassium channel K(V)11.1. Heart Rhythm [Internet]. 2020; [Google Scholar]
  • 65.O’Neill MJ, Ng C-A, Aizawa T, Sala L, Bains S, Winbo A, Ullah R, Shen Q, Tan C-Y, Kozek K, Vanags LR, Mitchell DW, Shen A, Wada Y, Kashiwa A, et al. Assays of variant effect and automated patch clamping improve KCNH2-LQTS variant classification and cardiac event risk stratification. Circulation [Internet]. 2024. [cited 2024 Oct 6]; [Google Scholar]
  • 66.Radford EJ, Tan H-K, Andersson MHL, Stephenson JD, Gardner EJ, Ironfield H, Waters AJ, Gitterman D, Lindsay S, Abascal F, Martincorena I, Kolesnik-Taylor A, Ng-Cordell E, Firth HV, Baker K, et al. Saturation genome editing of DDX3X clarifies pathogenicity of germline and somatic variation. Nat Commun. 2023;14:7702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Buckley M, Terwagne C, Ganner A, Cubitt L, Brewer R, Kim D-K, Kajba CM, Forrester N, Dace P, De Jonghe J, Shepherd STC, Sawyer C, McEwen M, Diederichs S, Neumann-Haefelin E, et al. Saturation genome editing maps the functional spectrum of pathogenic VHL alleles. Nat Genet. 2024;56:1446–1455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Olvera-León R, Zhang F, Offord V, Zhao Y, Tan HK, Gupta P, Pal T, Robles-Espinoza CD, Arriaga-González FG, Matsuyama LSAS, Delage E, Dicks E, Ezquina, Rowlands CF, Turnbull C, et al. High-resolution functional mapping of RAD51C by saturation genome editing. Cell. 2024;187:5719–5734.e19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Aregger M, Lawson KA, Billmann M, Costanzo M, Tong AHY, Chan K, Rahman M, Brown KR, Ross C, Usaj M, Nedyalkova L, Sizova O, Habsid A, Pawling J, Lin Z-Y, et al. Systematic mapping of genetic interactions for de novo fatty acid synthesis identifies C12orf49 as a regulator of lipid metabolism. Nat Metab. 2020;2:499–513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Ov VV, Stys PK, Hu B. Regulation of Na+,K+-ATPase by persistent sodium accumulation in adult rat thalamic neurones. J Physiol. 2000;525 Pt 2: 343–353. [Google Scholar]
  • 71.Glazer AM, Kroncke BM, Matreyek KA, Yang T, Wada Y, Shields T, Salem J-E, Fowler DM, Roden DM. Deep Mutational Scan of an SCN5A Voltage Sensor. Circ Genom Precis Med. 2020;13:e002786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Chen Y-H, Xu S-J, Bendahhou S, Wang X-L, Wang Y, Xu W-Y, Jin H-W, Sun H, Su X-Y, Zhuang Q-N, Yang Y-Q, Li Y-B, Liu Y, Xu H-J, Li X-F, et al. KCNQ1 gain-of-function mutation in familial atrial fibrillation. Science. 2003;299:251–254. [DOI] [PubMed] [Google Scholar]
  • 73.Peng G, Barro-Soria R, Sampson KJ, Larsson HP, Kass RS. Gating mechanisms underlying deactivation slowing by two KCNQ1 atrial fibrillation mutations. Sci Rep. 2017;7:45911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Pettinato AM, Ladha FA, Mellert DJ, Legere N, Cohn R, Romano R, Thakar K, Chen Y-S, Hinson JT. Development of a Cardiac Sarcomere Functional Genomics Platform to Enable Scalable Interrogation of Human TNNT2 Variants. Circulation. 2020;142:2262–2275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Dou W, Malhi M, Zhao Q, Wang L, Huang Z, Law J, Liu N, Simmons CA, Maynes JT, Sun Y. Microengineered platforms for characterizing the contractile function of in vitro cardiac models. Microsyst Nanoeng. 2022;8:26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Nag S, Trivedi DV, Sarkar SS, Adhikari AS, Sunitha MS, Sutton S, Ruppel KM, Spudich JA. The myosin mesa and the basis of hypercontractility caused by hypertrophic cardiomyopathy mutations. Nat Struct Mol Biol. 2017;24:525–533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Garcia-Pavia P, Kim Y, Restrepo-Cordoba MA, Lunde IG, Wakimoto H, Smith AM, Toepfer CN, Getz K, Gorham J, Patel P, Ito K, Willcox JA, Arany Z, Li J, Owens AT, et al. Genetic Variants Associated With Cancer Therapy-Induced Cardiomyopathy. Circulation. 2019;140:31–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Smith ED, Lakdawala NK, Papoutsidakis N, Aubert G, Mazzanti A, McCanta AC, Agarwal PP, Arscott P, Dellefave-Castillo LM, Vorovich EE, Nutakki K, Wilsbacher LD, Priori SG, Jacoby DL, McNally EM, et al. Desmoplakin Cardiomyopathy, a Fibrotic and Inflammatory Form of Cardiomyopathy Distinct From Typical Dilated or Arrhythmogenic Right Ventricular Cardiomyopathy. Circulation. 2020;141:1872–1884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Mazzarotto F, Tayal U, Buchan RJ, Midwinter W, Wilk A, Whiffin N, Govind R, Mazaika E, de Marvao A, Dawes TJW, Felkin LE, Ahmad M, Theotokis PI, Edwards E, Ing AY, et al. Reevaluating the Genetic Contribution of Monogenic Dilated Cardiomyopathy. Circulation. 2020;141:387–398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Parikh VN, Caleshu C, Reuter C, Lazzeroni LC, Ingles J, Garcia J, McCaleb K, Adesiyun T, Sedaghat-Hamedani F, Kumar S, Graw S, Gigli M, Stolfo D, Dal Ferro M, Ing AY, et al. Regional Variation in RBM20 Causes a Highly Penetrant Arrhythmogenic Cardiomyopathy. Circ Heart Fail. 2019;12:e005371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Masaeli M, Gupta D, O’Byrne S, Tse HT, Gossett DR, Tseng P, Utada AS, Jung HJ, Young S, Clark AT, Di Carlo D. Multiparameter mechanical and morphometric screening of cells. Sci Rep. 2016;6:37863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Mavropoulos A, Johnson C, Lu V, Nieto J, Schneider EC, Saini K, Phelan ML, Hsie LX, Wang MJ, Cruz J, Mei J, Kim JJ, Lian Z, Li N, Boutet SC, et al. Artificial Intelligence-Driven Morphology-Based Enrichment of Malignant Cells from Body Fluid. Mod Pathol. 2023;36:100195. [DOI] [PubMed] [Google Scholar]
  • 83.Ritter P, Yousefi K, Ramirez J, Dykxhoorn DM, Mendez AJ, Shehadeh LA. LDL Cholesterol Uptake Assay Using Live Cell Imaging Analysis with Cell Health Monitoring. J Vis Exp [Internet]. 2018; [Google Scholar]
  • 84.Jasiecki J, Targońska M, Janaszak-Jasiecka A, Chmara M, Żuk M, Kalinowski L, Waleron K, Wasąg B. Novel Tools for Comprehensive Functional Analysis of LDLR (Low-Density Lipoprotein Receptor) Variants. Int J Mol Sci [Internet]. 2023;24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Zurzolo C, Simons K. Glycosylphosphatidylinositol-anchored proteins: Membrane organization and transport. Biochim Biophys Acta. 2016;1858:632–639. [DOI] [PubMed] [Google Scholar]
  • 86.Popp Nicholas A., Powell Rachel L., Wheelock Melinda K., Zapp Brendan D., Holmes Kristen J., Sheldon Kathryn M., Fletcher Shelley N., Wu Xiaoping, Fayer Shawn, View ORCID ProfileRubin Alan F., Lannert Kerry W., Chang Alexis T., Sheehan John P., Johnsen Jill M., Fowler Douglas M.. Multiplex, multimodal mapping of variant effects in secreted proteins. BioRxiv [Internet]. 2024; [Google Scholar]
  • 87.Zheng Y, VanDusen NJ. Massively parallel reporter assays for high-throughput in vivo analysis of Cis-regulatory elements. J Cardiovasc Dev Dis [Internet]. 2023. [cited 2024 Oct 6];10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Cao Y, Zhang X, Akerberg BN, Yuan H, Sakamoto T, Xiao F, VanDusen NJ, Zhou P, Sweat ME, Wang Y, Prondzynski M, Chen J, Zhang Y, Wang P, Kelly DP, et al. In vivo dissection of chamber-selective enhancers reveals estrogen-related receptor as a regulator of ventricular cardiomyocyte identity. Circulation. 2023;147:881–896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Akerberg BN, Gu F, VanDusen NJ, Zhang X, Dong R, Li K, Zhang B, Zhou B, Sethi I, Ma Q, Wasson L, Wen T, Liu J, Dong K, Conlon FL, et al. A reference map of murine cardiac transcription factor chromatin occupancy identifies dynamic and conserved enhancers. Nat Commun. 2019;10:4907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Kircher M, Xiong C, Martin B, Schubach M, Inoue F, Bell RJA, Costello JF, Shendure J, Ahituv N. Saturation mutagenesis of twenty disease-associated regulatory elements at single base-pair resolution. Nat Commun. 2019;10:3583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Agarwal V, Inoue F, Schubach M, Penzar D, Martin BK, Dash PM, Keukeleire P, Zhang Z, Sohota A, Zhao J, Georgakopoulos-Soares I, Noble WS, Yardımcı GG, Kulakovskiy IV, Kircher M, et al. Massively parallel characterization of transcriptional regulatory elements. Nature [Internet]. 2025. [cited 2025 Feb 7]; [Google Scholar]
  • 92.Neums L, Suenaga S, Beyerlein P, Anders S, Koestler D, Mariani A, Chien J. VaDiR: an integrated approach to Variant Detection in RNA. Gigascience. 2018;7:1–13. [Google Scholar]
  • 93.O’Neill MJ, Yang T, Laudeman J, Calandranis ME, Harvey ML, Solus JF, Roden DM, Glazer AM. ParSE-seq: a calibrated multiplexed assay to facilitate the clinical classification of putative splice-altering variants. Nat Commun. 2024;15:8320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Rosenberg AB, Roco CM, Muscat RA, Kuchina A, Sample P, Yao Z, Graybuck LT, Peeler DJ, Mukherjee S, Chen W, Pun SH, Sellers DL, Tasic B, Seelig G. Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding. Science. 2018;360:176–182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Macaulay IC, Haerty W, Kumar P, Li YI, Hu TX, Teng MJ, Goolam M, Saurat N, Coupland P, Shirley LM, Smith M, Van der Aa N, Banerjee R, Ellis PD, Quail MA, et al. G&T-seq: parallel sequencing of single-cell genomes and transcriptomes. Nat Methods. 2015;12:519–522. [DOI] [PubMed] [Google Scholar]
  • 96.van den Hoogenhof MMG, Pinto YM, Creemers EE. RNA Splicing: Regulation and Dysregulation in the Heart. Circ Res. 2016;118:454–468. [DOI] [PubMed] [Google Scholar]
  • 97.Orho-Melander M, Melander O, Guiducci C, Perez-Martinez P, Corella D, Roos C, Tewhey R, Rieder MJ, Hall J, Abecasis G, Tai ES, Welch C, Arnett DK, Lyssenko V, Lindholm E, et al. Common missense variant in the glucokinase regulatory protein gene is associated with increased plasma triglyceride and C-reactive protein but lower fasting glucose concentrations. Diabetes. 2008;57:3112–3121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Laber S, Strobel S, Mercader JM, Dashti H, Dos Santos FRC, Kubitz P, Jackson M, Ainbinder A, Honecker J, Agrawal S, Garborcauskas G, Stirling DR, Leong A, Figueroa K, Sinnott-Armstrong N, et al. Discovering cellular programs of intrinsic and extrinsic drivers of metabolic traits using LipocyteProfiler. Cell Genom. 2023;3:100346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Rubin AF, Gelman H, Lucas N, Bajjalieh SM, Papenfuss AT, Speed TP, Fowler DM. A statistical framework for analyzing deep mutational scanning data. Genome Biol. 2017;18:150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Faure AJ, Schmiedel JM, Baeza-Centurion P, Lehner B. DiMSum: an error model and pipeline for analyzing deep mutational scanning data and diagnosing common experimental pathologies. Genome Biol. 2020;21:207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Ma JG, O’Neill MJ, Richardson E, Thomson KL, Ingles J, Muhammad A, Solus JF, Davogustto G, Anderson KC, Shoemaker MB, Stergachis AB, Floyd BJ, Dunn K, Parikh VN, Chubb H, et al. Multisite validation of a functional assay to adjudicate SCN5A Brugada syndrome-associated variants. Circ Genom Precis Med. 2024;17:e004569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E, Voelkerding K, Rehm HL. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17:405–423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Brnich SE, Abou Tayoun AN, Couch FJ, Cutting GR, Greenblatt MS, Heinen CD, Kanavy DM, Luo X, McNulty SM, Starita LM, Tavtigian SV, Wright MW, Harrison SM, Biesecker LG, Berg JS, et al. Recommendations for application of the functional evidence PS3/BS3 criterion using the ACMG/AMP sequence variant interpretation framework. Genome Med. 2019;12:3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Tabet DR, Kuang D, Lancaster MC, Li R, Liu K, Weile J, Coté AG, Wu Y, Hegele RA, Roden DM, Roth FP. Benchmarking computational variant effect predictors by their ability to infer human traits. Genome Biol. 2024;25:172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Dawood M, Fayer S, Pendyala S, Post M, Kalra D, Patterson K, Venner E, Muffley LA, Fowler DM, Rubin AF, Posey JE, Plon SE, Lupski JR, Gibbs RA, Starita LM, et al. Using multiplexed functional data to reduce variant classification inequities in underrepresented populations. Genome Med. 2024;16:143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Fayer S, Horton C, Dines JN, Rubin AF, Richardson ME, McGoldrick K, Hernandez F, Pesaran T, Karam R, Shirts BH, Fowler DM, Starita LM. Closing the gap: Systematic integration of multiplexed functional data resolves variants of uncertain significance in BRCA1, TP53, and PTEN. Am J Hum Genet. 2021;108:2248–2258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Claussnitzer M, Parikh VN, Wagner AH, Arbesfeld JA, Bult CJ, Firth HV, Muffley LA, Nguyen Ba AN, Riehle K, Roth FP, Tabet D, Bolognesi B, Glazer AM, Rubin AF. Minimum information and guidelines for reporting a multiplexed assay of variant effect. Genome Biol. 2024;25:100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Landrum MJ, Lee JM, Benson M, Brown GR, Chao C, Chitipiralla S, Gu B, Hart J, Hoffman D, Jang W, Karapetyan K, Katz K, Liu C, Maddipatla Z, Malheiro A, et al. ClinVar: improving access to variant interpretations and supporting evidence. Nucleic Acids Res. 2018;46:D1062–D1067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Rubin AF, Stone J, Bianchi AH, Capodanno BJ, Da EY, Dias M, Esposito D, Frazer J, Fu Y, Grindstaff SB, Harrington MR, Li I, McEwen AE, Min JK, Moore N, et al. MaveDB 2024: a curated community database with over seven million variant effects from multiplexed functional assays. Genome Biol. 2025;26:13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Schmidt RJ, Steeves M, Bayrak-Toydemir P, Benson KA, Coe BP, Conlin LK, Ganapathi M, Garcia J, Gollob MH, Jobanputra V, Luo M, Ma D, Maston G, McGoldrick K, Palculict TB, et al. Recommendations for risk allele evidence curation, classification, and reporting from the ClinGen Low Penetrance/Risk Allele Working Group. Genet Med. 2024;26:101036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Weile J, Kishore N, Sun S, Maaieh R, Verby M, Li R, Fotiadou I, Kitaygorodsky J, Wu Y, Holenstein A, Bürer C, Blomgren L, Yang S, Nussbaum R, Rozen R, et al. Shifting landscapes of human MTHFR missense-variant effects [Internet]. The American Journal of Human Genetics. 2021;108:1283–1300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, Bridgland A, Meyer C, Kohl SAA, Ballard AJ, Cowie A, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Tunyasuvunakool K, Adler J, Wu Z, Green T, Zielinski M, Žídek A, Bridgland A, Cowie A, Meyer C, Laydon A, Velankar S, Kleywegt GJ, Bateman A, Evans R, Pritzel A, et al. Highly accurate protein structure prediction for the human proteome. Nature. 2021;596:590–596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Cheng J, Novati G, Pan J, Bycroft C, Žemgulytė A, Applebaum T, Pritzel A, Wong LH, Zielinski M, Sargeant T, Schneider RG, Senior AW, Jumper J, Hassabis D, Kohli P, et al. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science. 2023;381:eadg7492. [DOI] [PubMed] [Google Scholar]
  • 115.Dries AM, Kirillova A, Reuter CM, Garcia J, Zouk H, Hawley M, Murray B, Tichnell C, Pilichou K, Protonotarios A, Others. The genetic architecture of Plakophilin 2 cardiomyopathy. Genet Med. 2021;1–8. [DOI] [PubMed] [Google Scholar]
  • 116.Kroncke BM, Mendenhall J, Smith DK, Sanders CR, Capra JA, George AL, Blume JD, Meiler J, Roden DM. Protein structure aids predicting functional perturbation of missense variants in SCN5A and KCNQ1. Comput Struct Biotechnol J. 2019;17:206–214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Kroncke BM, Smith DK, Zuo Y, Glazer AM, Roden DM, Blume JD. A Bayesian method to estimate variant-induced disease penetrance. PLoS Genet. 2020;16:e1008862. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.DeForest N, Kavitha B, Hu S, Isaac R, Krohn L, Wang M, Du X, De Arruda Saldanha C, Gylys J, Merli E, Abagyan R, Najmi L, Mohan V, Alnylam Human Genetics, AMP-T2D Consortium, et al. Human gain-of-function variants in HNF1A confer protection from diabetes but independently increase hepatic secretion of atherogenic lipoproteins. Cell Genom. 2023;3:100339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Amorosi CJ, Chiasson MA, McDonald MG, Wong LH, Sitko KA, Boyle G, Kowalski JP, Rettie AE, Fowler DM, Dunham MJ. Massively parallel characterization of CYP2C9 variant enzyme activity and abundance. Am J Hum Genet. 2021;108:1735–1751. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Boyle GE, Sitko KA, Galloway JG, Haddox HK, Bianchi AH, Dixon A, Wheelock MK, Vandi AJ, Wang ZR, Thomson RES, Garge RK, Rettie AE, Rubin AF, Geck RC, Gillam EMJ, et al. Deep mutational scanning of CYP2C19 in human cells reveals a substrate specificity-abundance tradeoff. Genetics [Internet]. 2024. [cited 2025 Jan 28];228. [Google Scholar]
  • 121.Livesey BJ, Marsh JA. Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations. Mol Syst Biol. 2020;16:e9380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Livesey BJ, Marsh JA. Updated benchmarking of variant effect predictors using deep mutational scanning. Mol Syst Biol. 2023;19:e11474. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Finucane HK, Parsa S, Guez J, Kanai M, Satterstrom FK, Nkambule LL, Daly MJ, Seed C, Karczewski KJ. Variant scoring performance across selection regimes depends on variant-to-gene and gene-to-disease components. bioRxivorg [Internet]. 2024; 10.1101/2024.09.17.613327. [DOI] [Google Scholar]
  • 124.Livesey BJ, Marsh JA. Interpreting protein variant effects with computational predictors and deep mutational scanning. Dis Model Mech [Internet]. 2022. [cited 2024 Oct 6];15. [Google Scholar]
  • 125.Grimm DG, Azencott C, Aicheler F, Gieraths U, MacArthur DG, Samocha KE, Cooper DN, Stenson PD, Daly MJ, Smoller JW, Duncan LE, Borgwardt KM. The Evaluation of Tools Used to Predict the Impact of Missense Variants Is Hindered by Two Types of Circularity. Human Mutation. 2015;36:513–523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Livesey BJ, Marsh JA. Variant effect predictor correlation with functional assays is reflective of clinical classification performance [Internet]. bioRxiv. 2024; [Google Scholar]
  • 127.Brandes N, Goldman G, Wang CH, Ye CJ, Ntranos V. Genome-wide prediction of disease variant effects with a deep protein language model. Nat Genet. 2023;55:1512–1522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Wu Y, Li R, Sun S, Weile J, Roth FP. Improved pathogenicity prediction for rare human missense variants. Am J Hum Genet. 2021;108:1891–1906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7:248–249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Vaser R, Adusumalli S, Leng SN, Sikic M, Ng PC. SIFT missense predictions for genomes. Nat Protoc. 2016;11:1–9. [DOI] [PubMed] [Google Scholar]
  • 131.Pejaver V, Byrne AB, Feng B-J, Pagel KA, Mooney SD, Karchin R, O’Donnell-Luria A, Harrison SM, Tavtigian SV, Greenblatt MS, Biesecker LG, Radivojac P, Brenner SE, ClinGen Sequence Variant Interpretation Working Group. Calibration of computational tools for missense variant pathogenicity classification and ClinGen recommendations for PP3/BP4 criteria. Am J Hum Genet. 2022;109:2163–2177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Bergquist T, Stenton SL, Nadeau EAW, Byrne AB, Greenblatt MS, Harrison SM, Tavtigian SV, O’Donnell-Luria A, Biesecker LG, Radivojac P, Brenner SE, Pejaver V, ClinGen Sequence Variant Interpretation Working Group. Calibration of additional computational tools expands ClinGen recommendation options for variant classification with PP3/BP4 criteria. bioRxivorg [Internet]. 2024; [Google Scholar]
  • 133.Tejura M, Fayer S, McEwen AE, Flynn J, Starita LM, Fowler DM. Calibration of variant effect predictors on genome-wide data masks heterogeneous performance across genes. Am J Hum Genet. 2024;111:2031–2043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Riesselman AJ, Ingraham JB, Marks DS. Deep generative models of genetic variation capture the effects of mutations. Nat Methods. 2018;15:816–822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Nykamp K, Anderson M, Powers M, Garcia J, Herrera B, Ho Y-Y, Kobayashi Y, Patil N, Thusberg J, Westbrook M, Invitae Clinical Genomics Group, Topper S. Sherloc: a comprehensive refinement of the ACMG-AMP variant classification criteria. Genet Med. 2017;19:1105–1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Gerasimavicius L, Livesey BJ, Marsh JA. Loss-of-function, gain-of-function and dominant-negative mutations have profoundly different effects on protein structure. Nat Commun. 2022;13:3895. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.van Loggerenberg W, Sowlati-Hashjin S, Weile J, Hamilton R, Chawla A, Sheykhkarimli D, Gebbia M, Kishore N, Frésard L, Mustajoki S, Pischik E, Di Pierro E, Barbaro M, Floderus Y, Schmitt C, et al. Systematically testing human HMBS missense variants to reveal mechanism and pathogenic variation. Am J Hum Genet. 2023;110:1769–1786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Ho CY, Day SM, Ashley EA, Michels M, Pereira AC, Jacoby D, Cirino AL, Fox JC, Lakdawala NK, Ware JS, Caleshu CA, Helms AS, Colan SD, Girolami F, Cecchi F, et al. Genotype and Lifetime Burden of Disease in Hypertrophic Cardiomyopathy: Insights from the Sarcomeric Human Cardiomyopathy Registry (SHaRe). Circulation. 2018;138:1387–1398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Frazer J, Notin P, Dias M, Gomez A, Min JK, Brock K, Gal Y, Marks DS. Disease variant prediction with deep generative models of evolutionary data. Nature. 2021;599:91–95. [DOI] [PubMed] [Google Scholar]
  • 140.Ioannidis NM, Rothstein JH, Pejaver V, Middha S, McDonnell SK, Baheti S, Musolf A, Li Q, Holzinger E, Karyadi D, Cannon-Albright LA, Teerlink CC, Stanford JL, Isaacs WB, Xu J, et al. REVEL: An ensemble method for predicting the pathogenicity of rare missense variants. Am J Hum Genet. 2016;99:877–885. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This study used data from the All of Us Research Program’s Controlled Tier Dataset version 8, available to authorized users on the Researcher Workbench via https://www.workbench.researchallofus.org. The UK Biobank dataset is available by application via https://www.ukbiobank.ac.uk/.qq

RESOURCES