Skip to main content
eBioMedicine logoLink to eBioMedicine
. 2022 Jun 24;81:104115. doi: 10.1016/j.ebiom.2022.104115

Predicting the functional effects of voltage-gated potassium channel missense variants with multi-task learning

Christian Malte Boßelmann a,b, Ulrike BS Hedrich a, Peter Müller a, Lukas Sonnenberg c, Shridhar Parthasarathy d,e,f, Ingo Helbig d,e,f,g, Holger Lerche a,1,, Nico Pfeifer b,h,i,j,1,
PMCID: PMC9250003  PMID: 35759918

Summary

Background

Variants in genes encoding voltage-gated potassium channels are associated with a broad spectrum of neurological diseases including epilepsy, ataxia, and intellectual disability. Knowledge of the resulting functional changes, characterized as overall ion channel gain- or loss-of-function, is essential to guide clinical management including precision medicine therapies. However, for an increasing number of variants, little to no experimental data is available. New tools are needed to evaluate variant functional effects.

Methods

We catalogued a comprehensive dataset of 959 functional experiments across 19 voltage-gated potassium channels, leveraging data from 782 unique disease-associated and synthetic variants. We used these data to train a taxonomy-based multi-task learning support vector machine (MTL-SVM), and compared performance to several baseline methods.

Findings

MTL-SVM maintains channel family structure during model training, improving overall predictive performance (mean balanced accuracy 0·718 ± 0·041, AU-ROC 0·761 ± 0·063) over baseline (mean balanced accuracy 0·620 ± 0·045, AU-ROC 0·711 ± 0·022). We can obtain meaningful predictions even for channels with few known variants (KCNC1, KCNQ5).

Interpretation

Our model enables functional variant prediction for voltage-gated potassium channels. It may assist in tailoring current and future precision therapies for the increasing number of patients with ion channel disorders.

Funding

This work was supported by intramural funding of the Medical Faculty, University of Tuebingen (PATE F.1315137.1), the Federal Ministry for Education and Research (Treat-ION, 01GM1907A/B/G/H) and the German Research Foundation (FOR-2715, Le1030/16-2, He8155/1-2).

Keywords: Potassium channel, Ion channel, Electrophysiology, Machine learning, Multi-task learning, Support vector machine

Abbreviations: AU-PRC, area under the precision-recall curve; AU-ROC, area under the receiver operator characteristic; BA, Balanced Accuracy; GOF, gain of function; LOF, loss of function; MCC, Matthews correlation coefficient; MTL, multi-task learning; RBF, radial basis function; SVM, support vector machine


Research in context.

Evidence before this study

In the last few years, next-generation sequencing has outpaced the ability of some traditional experimental methods to generate causative evidence for variant interpretation. At the same time, precision therapies dependent on knowledge of variant functional effects have become available. Predictive algorithms may bridge this gap in translational research by providing supportive in silico evidence. We conducted a literature search using the PubMed database for articles published between 01/12/2011 and 01/12/2021 using the search terms (prediction OR machine learning) AND (ion channel OR voltage-gated OR channelopathy) AND (GOF OR gain-of-function) AND (LOF OR loss-of-function). This search yielded 31 results, which were filtered for papers describing the implementation of machine-learning based methods. This identified a single previous study, in which Heyne and colleagues demonstrated the feasibility of functional variant prediction in voltage-gated sodium and calcium channels.

Added value of this study

In the present study, we developed a machine-learning based method that integrates experimental data from heterogeneous sources to predict gain-of-function or loss-of-function in voltage-gated potassium channel variants. To this end, we curated a large and representative data set of electrophysiological experiments reporting variant effects, sequence- and structure-based features, and expert annotation. Our multi-task learning support-vector machine (MTL-SVM), which leverages prior knowledge on the implicit relationship between channels, presents a uniquely suited and scalable framework for functional variant prediction. Model validation demonstrates the advantages over conventional single-task algorithms.

Implications of all the available evidence

Our results indicate that functional variant prediction, complementary to current methods of pathogenicity prediction, is feasible. Model performance can be improved by adopting multi-task learning algorithms and integrating heterogeneous data sources. This addresses a current issue in translational research, by providing supportive evidence for variant interpretation that may assist clinicians in tailoring current and future precision therapies for the increasing number of patients affected by ion channel disorders.

Alt-text: Unlabelled box

Introduction

Genetic disorders caused by variants in genes encoding voltage-gated ion channels are associated with a broad and heterogeneous spectrum of hereditary neurological, cardiac, nephrological, and other diseases.1 Among the ion channels associated with human disease, voltage-gated potassium channels (KV 1–12) constitute the largest and most diverse super-family with regards to their structure, gating kinetics, expression profiles, pharmacology, and associated phenotypes. Variants in these channels are involved in developmental and epileptic encephalopathy (KCNA2, KCNB1, KCNQ2, KCNQ3), benign familial neonatal seizures (KCNQ2, KCNQ3), episodic ataxia or paroxysmal dyskinesia (KCNA1), nonsyndromic hearing loss (KCNQ1, KCNE1, KCNQ4), and long or short QT syndrome (KCNQ1).2, 3, 4, 5 Many of these disease associations have only very recently been described, such as KCNQ5-associated intellectual disability, epileptic encephalopathy, or genetic generalized epilepsy, as well as KCNC1-associated progressive myoclonus epilepsy, intellectual disability, isolated myoclonus, or ataxia.6, 7, 8

In channelopathies, genetic and clinical data are inherently intertwined with the causative functional evidence provided by electrophysiological experiments. Voltage-clamp or patch-clamp studies in heterologous or neuronal expression systems are currently the methods of choice to bridge the translational gap between genetic diagnostics and clinical presentation, by establishing variant pathogenicity and elucidating the mechanism of channel dysfunction at a molecular and cellular level. For example, a variant may alter expression level, membrane trafficking, or the gating properties of a variant channel. The functional outcome can, in most cases, be characterized as an overall gain-of-function (GOF) or loss-of-function (LOF) with respect to subunit and channel function, with potential downstream effects on neuronal firing rate or network (dis)inhibition.9 However, we have to consider that cases exist, in which a net gain- or loss-of-function can be difficult to determine and may even differ between the type of neurons investigated.10,11 For clear net GOF or LOF cases, representing the majority of disease-associated variants, functional studies contribute to variant classification and inform clinical management, including genetic counselling and precision therapies.12 These cases often show clear phenotype-genotype correlations, facilitating diagnosis and offering therapeutic and prognostic guidance. But perhaps most interestingly, recent advances aim to correct the underlying biophysical defects. For example, retigabine selectively enhances the M-current in KCNQ2-5 by stabilizing the open-pore conformation, which reverses the effects of some LOF variants.13,14 Conversely, 4-aminopyridine acts as a non-selective blocker of KV1.x channels and may be a safe and effective treatment option in patients with developmental and epileptic encephalopathy due to GOF variants in KCNA2.15

However, the unprecedented pace of genetic discoveries has surpassed the ability of traditional methods to provide sufficient causal evidence for variant interpretation.16 Consequently, there is an increasing number of variants for which no direct (i.e. specific variant studied in the same channel), indirect, or no functional data at all are available. Electrophysiological studies present a bottleneck in translational research: they are laborious, expensive and time-intensive, and therefore limited to the analysis of just a few variants at a time. High-throughput automated electrophysiology platforms for large-scale functional studies are currently in development and may ameliorate some of these disadvantages.17 But this also presents an opportunity to develop predictive frameworks that integrate computational, biochemical, evolutionary, and experimental sources of information.

Here, we present a machine learning-based model that leverages data from electrophysiological experiments to predict the functional effects of non-synonymous missense variants in voltage-gated potassium channels. Our model efficiently integrates data from two very different sources – synthetic variants from biophysical studies designed to understand channel function, as well as patient variants from clinical genetic testing. We demonstrate that taxonomy-based multi-task learning enables us to preserve information on the relatedness of ion channels, improving predictive performance. Our framework for functional variant prediction is scalable and can easily be extended to other use cases.

Methods

Data

Our dataset consisted of previously published, publicly available data from patch-clamp or voltage-clamp experiments, with each experiment describing the functional effects of a non-synonymous missense variant in a voltage-gated potassium channel. Many of these variants were previously found to be disease-causing in patients and hence functionally characterized. Notably, we also included data from non-human channels (Shaker, KCNAS) and from experiments that studied synthetic variants to understand biophysical channel function and structure-function relationships. Although these variants may not appear in disease context or in the general population, they increase the sample size and contain salient information which is useful for our model to learn. For each experiment, we collected the biophysical properties of the mutant channels, including information on the gating kinetics, peak current or conductance, and expression level or trafficking defects, if available. We used this information to assign one label to each variant, the label categorizing the net overall effect on ion channel function at subunit level: gain-of-function (GOF), loss-of-function (LOF), and no functional effect (neutral).

In total, we curated the results of 959 electrophysiological experiments from 163 publications. 22 experiments were excluded due to conflicting evidence, where contradicting functional effects for the same variant were described in two or more independent publications. For 67 experiments, we were unable to ascertain a label due to mixed or unclear functional effects without clear net gain- or loss-of-function. These variants, too, were excluded from the training set. Finally, 88 experiments were marked as duplicate and removed from the data set if two or more experiments reported the same functional effect for the same variant. The final dataset used for model training included 782 unique non-synonymous missense variants across 19 voltage-gated potassium channels: 165 GOF, 544 LOF and 73 neutral (Supplementary Figure 1). The dataset was independently validated by two expert electrophysiologists, and disagreements were resolved by consensus-based discussion.

Features

Each variant was identified by its channel, sequence position, and amino acid substitution. From there, we extracted sequence- and structure-based features as follows: The residue-level properties of the original and substituted amino acids were included as features by physiochemical encoding, representing them in a lower-dimensional property space.18 Hydrophobicity was included as a separate feature, by principal component analysis of 98 hydrophobicity scales from Simm et al.19 Frequency and radicality of amino acid substitutions were described by the BLOSUM62 substitution matrix and Grantham score.20,21 Paralog conservation scores were obtained from PER viewer.22 Predictions of structural features were computed with PredictProtein, NetSurfP-2.0, and IUPred2A.23, 24, 25 This included information on secondary structure, residue accessibility, conservation, disordered regions, and interaction sites. We then annotated domains and motifs through UniProt and expert knowledge, including rich information on the position of gating charges, pore helix, S4-S5 linker, selectivity filter, PVP motif, and others.26 For preprocessing of our data, we used one-hot encoding of categorical features and scaling of numerical features. A list of features and the annotated training data set are available as Supplementary Tables 1 and 2.

Models

In the previous steps, we have obtained a data matrix X with n observations (variants) by p dimensions (features). Each observation n belonged to one of three classes y1,,yn{LOF,Neutral,GOF} and to one of 19 tasks (voltage-gated potassium channels) t1,,tn{KCNA1,KCNA2,...,KCNQ5}. For a previously unseen variant xX we wanted to predict the functional effect, class label y^. Intuitively, if a new variant is similar to a known variant, it should receive the same class label. We first considered several other methods including logistic regression and tree-based models (Random forest, Gradient Boosting) which are reported as Supplementary Figure 2. For this problem, we chose to implement a support vector machine (C-SVM) with a radial basis function (RBF) kernel Kn with k(xi,xj)=exp(xixj22σ2)xi,xjX.27 We constructed two methods for baseline comparison. For the single method, one SVM was trained for all observations belonging to one task, which yields t task-specific models. Each model was independently tuned and assessed. We did not attempt to train a model if there were less than 20 observations for the task, or if all observations had been assigned the same class label (e.g., channels for which only loss-of-function variants are known). In both cases, the resulting models would not have been meaningful. For the base method, one SVM was trained on all observations, irrespective of task membership.

We took measures to reduce overfitting with nested k-fold cross validation, k=10, and the cost regularization hyperparameter C. To correct for class imbalance, we assigned class weights w inversely proportional to the frequency of classt, wt=nall/nt. The minority class is assigned a larger wtand smaller C, thus increasing the misclassification penalty. The optimal values for the hyperparameters were estimated via grid search, with σ={1e5,1e4,,1e0} and C={1e4,1e2,,1e4}. We used Platt scaling to transform the output of our non-probabilistic binary linear classifier into a distribution of class probabilities, extended for multi-class classification. The model performance estimates, i.e. the expected generalization performance of the model on previously unseen data, were obtained by nested k-fold cross validation – a technique in which the model is trained and tested on different random (sub-)partitions of the data throughout multiple exhaustive iterations. A pseudocode and graphical representation of this method are provided as Supplementary Figures 3 and 4.

We report the following metrics: i) Balanced Accuracy (BA), defined as the average of sensitivity and specificity for each class; ii) Matthews correlation coefficient (MCC), a symmetric correlation coefficient between true and predicted classes; iii) Cohen's Kappa, a measure of agreement between true and predicted classes; iv) F1 score, the harmonic mean of precision and recall. The multi-class extension of these metrics has been described elsewhere.28 The area under the receiver operator characteristics (AU-ROC) curve was extended to the multi-class case using the method described by Hand and Till.29 We calculated the area under the precision-recall curve (AU-PRC) by taking the average of a one-versus-all approach. Further model information is available in our AIMe Registry report (https://aime.report/IFtQVF).

Multi-task learning

To define our hierarchy of tasks T, we first created a multiple sequence alignment of all tasks with MUSCLE.30 For each observation, we noted the relative position on the consensus sequence and included this as an additional feature for all models. Each pairwise distance on the multiple sequence alignment ds,t was transformed to a pairwise similarity measure by γs,t=ads,t/dmax to obtain our task similarity matrix Kt. Here, awas a hyperparameter controlling the baseline similarity between tasks and was included in our grid search with a={1,3,5,10,100}, while dmax was the maximum distance between two tasks in hierarchy T. This approach to taxonomy-based multitask learning is further detailed in Widmer et al. and is motivated by the previous work of Jacob and Vert.31,32 The final multi-task learning kernel matrix Km was obtained by element-wise multiplication of the task kernel matrix Kt with our RBF kernel matrix Kn. Intuitively, Km embeds information on task similarity and instance similarity into the kernel matrix. The task hierarchy T and kernel matrix Kt for voltage-gated potassium channels are shown in Supplementary Figure 5. For MTL-SVM, best performance was seen with σ=1e2, C=1, and a=1. A pre-trained model object is available as part of our Data Sharing Statement.

Interpretation

Explainable model decisions improve transparency and user trust. We carried out a feature correlation analysis with Wilcoxon-Mann-Whitney two-sample tests to compare group differences for each class in a one-versus-all approach, adjusting the p-values for multiple testing with the Benjamini-Hochberg method, thus highlighting discriminative features (Supplementary Table 1). We also implemented a model-agnostic fast approximation of Shapley values via the SHAP framework, which offers additive feature importance methods.33 In short, Shapley values express the contribution of a feature to a prediction.

Software

This study was carried out in the R programming language, version 4.1.0, with RStudio, version 1.4.1106. Packages used include tidyverse (data preparation, preprocessing and visualization), the Bioconductor collection (feature extraction and multiple sequence alignment), kernlab and the e1071 interface for LIBSVM (model training), yardstick and pROC (model assessment), fastshap (model interpretation), and Shiny (graphical user interface and web application). R Session Information is included as part of our data availability statement.

Ethics statement

Not applicable.

Role of the funding source

The funders had no role in study design, data collection, data analysis or interpretation, and decision to prepare or publish the manuscript.

Results

For our first naïve baseline method, we trained one separate SVM for variants of each voltage-gated potassium channel (single method). For tasks with sufficiently large and balanced training data sets, these single channel-specific models were already feasible, such as for variants in KCNA2 (BA 0·711), KCNQ1 (BA 0·784), or KCNQ4 (BA 0.838). However, we did not obtain meaningful models for 6/19 channels (Figure 1). For these channels, either an insufficient sample size of variants was available for training, or each variant had been assigned the same label (e.g. only loss-of-function variants). Notably, this included channels that have only recently been associated with human disease, such as KCNC1 or KCNQ5, for which little functional data are available. The single method is thus limited to a few common channels, but performs reasonably well on average (Table 1).

Figure 1.

Figure 1

Task-specific cross-validation model performance estimate of mean Balanced Accuracy for each method. Channels are given with their respective number of variants (n) included in the training data set. Two tasks, KCNC2 and KCNH5, contained just one variant each and are not included in the performance estimate. Abbreviation: BA – Balanced Accuracy.

Table 1.

Nested cross-validation model performance estimates for each method. Metrics are reported as mean ± standard deviation, where applicable. Metrics are reported for variants from common tasks, as well as for variants from rare tasks (KCNA4, KCNC1, KCNC2, KCNC3, KCNH1, KCNQ3, KCNQ5), depending on variant frequency in the training data set (cut-off ≤20 variants for rare tasks). For the subgroup of rare tasks, BA and AU-ROC are shown. The highest mean metrics over all methods and subgroups are shown in bold. Abbreviations: AU-PRC – area under the precision-recall curve; AU-ROC – area under the receiver operator characteristic; BA – Balanced Accuracy; MCC – Matthews correlation coefficient.

Single
Base
MTL
common tasks rare tasks common tasks rare tasks common tasks rare tasks
BA 0·745 ± 0·042 0·620 ± 0·045 0·558 0·718 ± 0·041 0·792
AU-ROC 0·758 ± 0·057 0·711 ± 0·022 0·417 0·761 ± 0·063 0·857
Cohen's κ 0·298 ± 0·103 0·192 ± 0·064 0·310 ± 0·092
MCC 0·303 ± 0·105 0·224 ± 0·066 0·328 ± 0·099
F1 score 0·659 ± 0·056 0·494 ± 0·065 0·624 ± 0·055
AU-PRC 0·557 ± 0·065 0·498 ± 0·057 0·555 ± 0·063

Next, we trained one SVM on a pooled dataset of all variants across the voltage-gated potassium channel family (base method). This represents a conventional single-task approach, for which data is shared between all tasks without differentiation. Clearly, sharing data is beneficial, enabling us to obtain predictions for variants from all 19 channels in our dataset. However, predictive performance was low for some channels, e.g. KCNA4, KCND2, and KCNQ5 (Figure 1). We also observed that the task-specific model performance estimate of the base method was lower than that of some single SVMs, and deteriorated further for some rare channels (Table 1). Put in other words, adding data from other channels generally caused the model to perform worse than a single channel-specific model. It stands to reason that one global model cannot fit all these channels equally well, some of which are markedly different from each other in structure and function.

Our multi-task learning approach (mtl method) embeds structural and evolutionary pairwise similarity between ion channels in the kernel matrix, efficiently integrating prior knowledge across all channels and channel sub-families. Intuitively, the model learns to assign more weight to variants from the same or similar channels, and less or no weight to variants from more distantly related channels. At the same time, transfer learning of a shared representation allows the model to learn higher-level concepts that apply across all channels, such as the relative feature importance of substitution radicality or residue conservation. Thus, this method allowed us to obtain meaningful predictions for variants from every channel, while maintaining a good overall performance (Figure 1). More importantly, multi-task learning achieved strong performance even for variants in rare channels (BA 0·792 and AU-ROC 0·857 versus BA 0·558 and AU-ROC 0·417, Table 1). The superior performance of the multi-task learning method over the base method is also shown as an overall gain in AU-ROC (mean AU-ROC 0·761 ± 0·063 versus AU-ROC 0·711 ± 0·022) (Figure 2).

Figure 2.

Figure 2

ROC of the baseline (“base”) versus multi-task learning (“mtl”) models. a: The multi-task learning model better discriminates between GOF and LOF variants. b: One-versus-one ROC curves for the baseline model. c: One-versus-one ROC curves for the multi-task learning model. Abbreviations: AU-ROC – area under the receiver operating characteristic; GOF – gain-of-function; LOF – loss-of-function.

Additionally, some new missense variants in voltage-gated potassium channels have very recently been described, which have not yet been included in the training set. These may not be sufficient for external validation, but they may serve as an example of the expected model use case. Firstly, Miceli et al. described four heterozygous de novo variants in KCNA1: P403S, P405L, P405S and A261T.34 Our MTL-SVM correctly predicted three out of four variants with high confidence (Figure 3a). The A261T variant, which resulted in a hyperpolarizing shift of the voltage-dependence of activation (GOF), was incorrectly predicted as LOF. This is expected behaviour: Gain-of-function in this channel, previously only seen in some synthetic variants from biophysical research, is very rare and has only now been associated with disease-causing variants. Clearly, experimental evidence still remains crucial to generate causative evidence. Secondly, Imbrici et al. described a heterozygous de novo variant in KCNA2, E236K.35 This variant caused a hyperpolarizing shift of the voltage-dependence of activation and slowed kinetics of activation and deactivation, resulting in an overall mixed GOF/LOF. Variants without a clear net overall functional effect were excluded from our model's training set. Still, the model correctly recognized that E236K is equally likely to result in a GOF or LOF (Figure 3). For both use cases presented here (KCNA1, KCNA2), possible precision therapies based on variant function have been proposed.15,34,35

Figure 3.

Figure 3

Radar plots of class probability distributions for each variant, with class probabilities shown in the legend. a: KCNA1 p.P403S was correctly predicted as LOF (P405L and P405S are not shown). b: For KCNA2 p.E236K, a variant with mixed functional effects, the model correctly predicted an equal probability of GOF and LOF. c: KCNA1 p.A261T was incorrectly predicted as LOF.

Explainable decisions contribute to user trust and model transparency. We calculated Shapley values, which are a model-agnostic approximation of a feature's marginal contribution (Figure 4). The feature names are shown here in brackets, with additional feature information available in Supplementary Table 1. Predictions for gain-of-function variants are based on paralog-conserved sites, low local protein disorder, and relative sequence position or ‘hotspots’, among others. Features contributing to predictions for loss-of-function variants are secondary structure, protein-protein interaction sites, and paralog-conserved sites. Variants are more likely to be predicted as neutral based on their ortholog conservation, protein-protein interaction sites, and local protein disorder. Overall, the model also recognizes the importance of domain and motif annotation including variant position in the S6 segment, transmembrane domains, or cytoplasmic domains.

Figure 4.

Figure 4

SHAP (SHapley Additive exPlanations) contribution plots and feature importance. Shapley values are the average expected marginal contribution for each feature after all possible feature combinations have been considered. a: Mean Shapley values for GOF variants. b: Mean Shapley values for LOF variants. c: Mean Shapley values for neutral variants. Contribution may either be positive (green) or negative (red). d: Mean absolute Shapley values for all classes. Feature abbreviations are shown in brackets, with additional information available in Supplementary Table 1.

Discussion

Next-generation sequencing continues to generate large datasets of genetic variants at an ever-increasing pace, driven by both clinical genetic diagnostics and the international cooperation of sequencing consortia. In silico tools can aid in predicting the pathogenicity of missense variants of unknown significance. Some of these prediction models are tailored to specific genes, such as Q1VarPred for KCNQ1, which outperforms other methods for pathogenicity prediction by incorporating some functional data.36 A gene-family-specific tool, KvSNP, outperforms other methods for pathogenicity prediction across all voltage-gated potassium channels.37 These are complementary to the method presented here. However, none of these tools are designed to provide information on the alteration of biophysical function at a molecular and cellular level: This is the difference between traditional pathogenicity prediction and functional variant prediction. In ion channel disorders, this functional knowledge is crucial to guide diagnosis, treatment, and prognosis.

Few studies have attempted to address this unmet need. Recently, Heyne and colleagues presented funNCion, a statistical method for functional variant prediction of missense variants in voltage-gated sodium (NaV) and calcium channels (CaV).38 In these channel families, de novo variants are enriched at paralog-conserved sites. Heyne and colleagues hence pooled variants for further analysis, under the assumption of similar molecular mechanisms across individual channels. Their training data set included 827 LOF and GOF variants in NaV and CaV, where the functional label was inferred from phenotype data alone (e.g., age at seizure onset). Using a Gradient Boosting machine, they predicted an independent test set of 87 variants with an estimated performance of BA 0·73, AU-ROC 0·73, and MCC 0·45. Interestingly, they reported a similar performance when predicting NaV variants with a training set of either NaV alone, or both NaV and CaV. Conversely, the prediction on a test set of CaV was worse when the training set contained just CaV alone, while performance improved when training on just NaV, or both NaV and CaV. They concluded that the increased power obtained by combining NaVs and CaVs outweighs the differences between these channel types. Evidently, pooling data may be beneficial. But how do we formalize the general notion of difference between ion channels, and how do we then decide to which variants in the training data we want to assign more weight?

The answer lies in the notion of an implicit or latent task relatedness. If we consider each member of the family of voltage-gated potassium channels as a single task, we can then employ multi-task learning, in which information is shared between the tasks: We learn our tasks in parallel using a shared representation, which has been shown to outperform the conventional single-task learning methods introduced above.39 Task relatedness may be learnt from the data structure, but we may also view this as an opportunity to introduce prior domain knowledge. For example, Jacob and Vert used a user-defined or supertype-based measure of similarity to improve peptide-MHC-I binding models across different alleles.32 They demonstrated that the method can be decomposed into two steps: i) Choosing an explicit description or pairwise similarity kernel for our tasks (channels) and observations (variants); ii) Applying an SVM to the product kernel. For our task kernel, we chose the taxonomy-based method by Widmer et al., for which task relatedness is derived from the hierarchical structure of a phylogeny tree, which is readily obtained through multiple sequence alignment.31 This is the same principle hierarchy that motivates the classification of voltage-gated potassium channel families, and it appropriately separates channels both according to their function (i.e. different potassium currents) and their tissue expression levels.40

Here, we have demonstrated the feasibility of functional variant prediction in voltage-gated potassium channels, supported by a considerable experimental data set. We then introduced an MTL-SVM framework which is uniquely suited to handle predictions across large and diverse channel families. Our multi-task learning method improves upon conventional single-task learning baseline models. Notably, we achieved good predictive performance even in channels for which data from functional studies are sparse, such as KCNC1 or KCNQ5, which have only recently been associated with human disease.6,7 Thus, our method may prospectively enable predictions for emerging channels and new disease-associated variants as they become relevant. Another strength of our approach is that we can leverage data from non-human channels and synthetic variants, making efficient use of prior information on structure-function relationships from decades of biophysical research to improve model performance. This tool may generate supportive evidence for in silico variant interpretation, and may assist clinicians and researchers in generating new hypotheses about voltage-gated potassium channel disorders, including potential new applications for precision therapies. The tool and a web-based graphical user interface are available as part of our Data Sharing Statement.

Importantly, our model predicts functional changes on a main/alpha subunit level and does not include information on modifying subunits (KCNFx, KCNGx, KCNVx, KCNSx, etc.) or experimental metadata (e.g., expression system, recording protocol). Care should be taken when generalizing the results of the prediction to a neuronal, network, or phenotype level. In silico predictions should always be considered in concert with other available sources of evidence. By themselves, in silico predictions are not considered strong or causal evidence by the genetic testing standards established by the American College of Medical Genetics and Genomics.41 We have assembled a large and representative data set of variants across the superfamily of voltage-gated potassium channels but, given their diversity and the wealth of published literature, this data set is far from complete. There is some concern for selection bias, as electrophysiological experiments are more likely to study either pathogenic variants, or those in locations of structural and functional importance. Further work is clearly required to improve model performance. These methods will benefit from large homogeneous data sets provided by high-throughput electrophysiology, as well as standardized experimental and phenotype data curation and integration frameworks that are currently in development. Additionally, static and dynamic structural information from AlphaFold and molecular dynamics may prove useful.

Contributors

Resources: HL, NP. Funding acquisition: CMB, HL, NP. Supervision: HL, NP. Methodology: CMB. Data Curation: CMB, UBSH, PM. Data Validation: UBSH, PM. Software: CMB. Formal analysis: CMB, LS, SP, IH. Writing – Original Draft: CMB. Visualization: CMB. Writing – Review & Editing: LS, IH, HL, NP. All authors have read and approved the final version of the manuscript.

Data sharing statement

Data and code generated during this study, including pre-processed training data and pre-trained models, will be made available on Zenodo at time of publication (DOI: https://doi.org/10.5281/zenodo.5749260).

Declaration of interests

The authors declare no conflict of interest related to this work.

Acknowledgements

This work was supported by intramural funding of the Medical Faculty, University of Tuebingen (PATE F.1315137.1 to CB), the Federal Ministry for Education and Research (Treat-ION, 01GM1907A/B/G/H to CB, UBSH, and HL) and the German Research Foundation (Research Unit FOR-2715, Le1030/16-2 to HL, He8155/1-2 to UBSH, and He5415/7-2 to IH). The funders had no role in study design, data collection, data analysis or interpretation, and decision to prepare or publish the manuscript. The authors would like to thank Mahmoud Koko (Department of Neurology and Epileptology, Hertie Institute for Clinical Brain Research, University of Tübingen, Tübingen, Germany) for his assistance in data preparation for KCND2/KCND3 variants.

Footnotes

Supplementary material associated with this article can be found in the online version at doi:10.1016/j.ebiom.2022.104115.

Contributor Information

Holger Lerche, Email: holger.lerche@med.uni-tuebingen.de.

Nico Pfeifer, Email: pfeifer@informatik.uni-tuebingen.de.

Appendix. Supplementary materials

mmc1.pdf (3.4MB, pdf)

References

  • 1.Lehmann-Horn F, Jurkat-Rott K. Voltage-gated ion channels and hereditary disease. Physiol Rev. 1999;79(4):1317–1372. doi: 10.1152/physrev.1999.79.4.1317. [DOI] [PubMed] [Google Scholar]
  • 2.Miceli F, Soldovieri MV, Joshi N, et al. In: GeneReviews((R)) Adam MP, Ardinger HH, Pagon RA, et al., editors. 1993. KCNQ2-related disorders. Seattle (WA) [Google Scholar]
  • 3.Paulhus K, Ammerman L, Glasscock E. Clinical spectrum of KCNA1 mutations: new insights into episodic ataxia and epilepsy comorbidity. Int J Mol Sci. 2020;21(8) doi: 10.3390/ijms21082802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hedley PL, Jorgensen P, Schlamowitz S, et al. The genetic basis of long QT and short QT syndromes: a mutation update. Hum Mutat. 2009;30(11):1486–1511. doi: 10.1002/humu.21106. [DOI] [PubMed] [Google Scholar]
  • 5.Jung J, Choi HB, Koh YI, et al. Whole-exome sequencing identifies two novel mutations in KCNQ4 in individuals with nonsyndromic hearing loss. Sci Rep. 2018;8(1):16659. doi: 10.1038/s41598-018-34876-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Krüger J, Schubert J, Kegele J, et al. Loss-of-function variants in the KCNQ5 gene are associated with genetic generalized epilepsies. medRxiv. 2021.04.20.21255696. [DOI] [PMC free article] [PubMed]
  • 7.Park J, Koko M, Hedrich UBS, et al. KCNC1-related disorders: new de novo variants expand the phenotypic spectrum. Ann Clin Transl Neurol. 2019;6(7):1319–1326. doi: 10.1002/acn3.50799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Cameron JM, Maljevic S, Nair U, et al. Encephalopathies with KCNC1 variants: genotype-phenotype-functional correlations. Ann Clin Transl Neurol. 2019;6(7):1263–1272. doi: 10.1002/acn3.50822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hubner CA, Jentsch TJ. Ion channel diseases. Hum Mol Genet. 2002;11(20):2435–2445. doi: 10.1093/hmg/11.20.2435. [DOI] [PubMed] [Google Scholar]
  • 10.de Kovel CG, Meisler MH, Brilstra EH, et al. Characterization of a de novo SCN8A mutation in a patient with epileptic encephalopathy. Epilepsy Res. 2014;108(9):1511–1518. doi: 10.1016/j.eplepsyres.2014.08.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Makinson CD, Dutt K, Lin F, et al. An Scn1a epilepsy mutation in Scn8a alters seizure susceptibility and behavior. Exp Neurol. 2016;275(Pt 1):46–58. doi: 10.1016/j.expneurol.2015.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wolff M, Johannesen KM, Hedrich UBS, et al. Genetic and phenotypic heterogeneity suggest therapeutic implications in SCN2A-related disorders. Brain. 2017;140(5):1316–1336. doi: 10.1093/brain/awx054. [DOI] [PubMed] [Google Scholar]
  • 13.Orhan G, Bock M, Schepers D, et al. Dominant-negative effects of KCNQ2 mutations are associated with epileptic encephalopathy. Ann Neurol. 2014;75(3):382–394. doi: 10.1002/ana.24080. [DOI] [PubMed] [Google Scholar]
  • 14.Wuttke TV, Seebohm G, Bail S, Maljevic S, Lerche H. The new anticonvulsant retigabine favors voltage-dependent opening of the Kv7.2 (KCNQ2) channel by binding to its activation gate. Mol Pharmacol. 2005;67(4):1009–1017. doi: 10.1124/mol.104.010793. [DOI] [PubMed] [Google Scholar]
  • 15.Hedrich UBS, Lauxmann S, Wolff M, et al. 4-Aminopyridine is a promising treatment option for patients with gain-of-function KCNA2-encephalopathy. Sci Transl Med. 2021;13(609):eaaz4957. doi: 10.1126/scitranslmed.aaz4957. [DOI] [PubMed] [Google Scholar]
  • 16.Cooper GM, Shendure J. Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data. Nat Rev Genet. 2011;12(9):628–640. doi: 10.1038/nrg3046. [DOI] [PubMed] [Google Scholar]
  • 17.Vanoye CG, Desai RR, Fabre KL, et al. High-throughput functional evaluation of KCNQ1 decrypts variants of unknown significance. Circ Genom Precis Med. 2018;11(11) doi: 10.1161/CIRCGEN.118.002345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Maetschke S, Towsey M, BodéN M. BLOMAP: an encoding of amino acids which improves signal peptide cleavage site prediction. Proceedings of the 3rd Asia-Pacific Bioinformatics Conference: Imperial College Press; World Scientific Publishing Co.; 2005. pp. 141–150. [Google Scholar]
  • 19.Simm S, Einloft J, Mirus O, Schleiff E. 50 years of amino acid hydrophobicity scales: revisiting the capacity for peptide classification. Biol Res. 2016;49(1):31. doi: 10.1186/s40659-016-0092-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Henikoff S, Henikoff JG. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA. 1992;89(22):10915–10919. doi: 10.1073/pnas.89.22.10915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Grantham R. Amino acid difference formula to help explain protein evolution. Science. 1974;185(4154):862–864. doi: 10.1126/science.185.4154.862. [DOI] [PubMed] [Google Scholar]
  • 22.Perez-Palma E, May P, Iqbal S, et al. Identification of pathogenic variant enriched regions across genes and gene families. Genome Res. 2020;30(1):62–71. doi: 10.1101/gr.252601.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Klausen MS, Jespersen MC, Nielsen H, et al. NetSurfP-2.0: improved prediction of protein structural features by integrated deep learning. Proteins. 2019;87(6):520–527. doi: 10.1002/prot.25674. [DOI] [PubMed] [Google Scholar]
  • 24.Erdos G, Dosztanyi Z. Analyzing protein disorder with IUPred2A. Curr Protoc Bioinformatics. 2020;70(1):e99. doi: 10.1002/cpbi.99. [DOI] [PubMed] [Google Scholar]
  • 25.Bernhofer M, Dallago C, Karl T, et al. PredictProtein - Predicting protein structure and function for 29 years. Nucleic Acids Res. 2021;49(W1):W535–W540. doi: 10.1093/nar/gkab354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.UniProt C. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 2021;49(D1):D480–D489. doi: 10.1093/nar/gkaa1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Schölkopf B, Smola AJ. MIT Press; 2002. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. [Google Scholar]
  • 28.Grandini M, Bagli E, Visani G. Metrics for multi-class classification: an overview. ArXiv2020; abs/2008.05756.
  • 29.Hand DJ, Till RJ. A Simple generalisation of the area under the ROC curve for multiple class classification problems. Machine Learning. 2001;45(2):171–186. [Google Scholar]
  • 30.Madeira F, Park YM, Lee J, et al. The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res. 2019;47(W1):W636–W641. doi: 10.1093/nar/gkz268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Widmer C, Leiva J, Altun Y, Rätsch G. Springer Berlin Heidelberg; Berlin, Heidelberg: 2010. Leveraging Sequence Classification by Taxonomy-Based Multitask Learning; pp. 522–534. [Google Scholar]
  • 32.Jacob L, Vert J-P. Efficient peptide–MHC-I binding prediction for alleles with few known binders. Bioinformatics. 2007;24(3):358–366. doi: 10.1093/bioinformatics/btm611. [DOI] [PubMed] [Google Scholar]
  • 33.Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems; Long Beach, California, USA: Curran Associates Inc.; 2017. pp. 4768–4777. [Google Scholar]
  • 34.Miceli F, Guerrini R, Nappi M, et al. Distinct epilepsy phenotypes and response to drugs in KCNA1 gain- and loss-of function variants. Epilepsia. 2022;63(1):e7–e14. doi: 10.1111/epi.17118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Imbrici P, Conte E, Blunck R, et al. A novel KCNA2 variant in a patient with non-progressive congenital ataxia and epilepsy: functional characterization and sensitivity to 4-aminopyridine. Int J Mol Sci. 2021;22(18) doi: 10.3390/ijms22189913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Li B, Mendenhall JL, Kroncke BM, et al. Predicting the functional impact of KCNQ1 variants of unknown significance. Circ Cardiovasc Genet. 2017;10(5) doi: 10.1161/CIRCGENETICS.117.001754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Stead LF, Wood IC, Westhead DR. KvSNP: accurately predicting the effect of genetic variants in voltage-gated potassium channels. Bioinformatics. 2011;27(16):2181–2186. doi: 10.1093/bioinformatics/btr365. [DOI] [PubMed] [Google Scholar]
  • 38.Heyne HO, Baez-Nieto D, Iqbal S, et al. Predicting functional effects of missense variants in voltage-gated sodium and calcium channels. Sci Transl Med. 2020;12(556) doi: 10.1126/scitranslmed.aay6848. [DOI] [PubMed] [Google Scholar]
  • 39.Caruana R. Multitask learning. Machine Learning. 1997;28(1):41–75. [Google Scholar]
  • 40.Gutman GA, Chandy KG, Grissmer S, et al. International union of pharmacology. LIII. Nomenclature and molecular relationships of voltage-gated potassium channels. Pharmacol Rev. 2005;57(4):473–508. doi: 10.1124/pr.57.4.10. [DOI] [PubMed] [Google Scholar]
  • 41.Richards S, Aziz N, Bale S, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American college of medical genetics and genomics and the association for molecular pathology. Genet Med. 2015;17(5):405–424. doi: 10.1038/gim.2015.30. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.pdf (3.4MB, pdf)

Articles from eBioMedicine are provided here courtesy of Elsevier

RESOURCES