Abstract
Epilepsy is a highly heritable disorder affecting over 50 million people worldwide, of whom about one-third are resistant to current treatments. Here we report a multi-ancestry genome-wide association study including 29,944 cases, stratified into three broad categories and seven subtypes of epilepsy, and 52,538 controls. We identify 26 genome-wide significant loci, 19 of which are specific to genetic generalized epilepsy (GGE). We implicate 29 likely causal genes underlying these 26 loci. SNP-based heritability analyses show that common variants explain between 39.6% and 90% of genetic risk for GGE and its subtypes. Subtype analysis revealed markedly different genetic architectures between focal and generalized epilepsies. Gene-set analyses of GGE signals implicate synaptic processes in both excitatory and inhibitory neurons in the brain. Prioritized candidate genes overlap with monogenic epilepsy genes and with targets of current antiseizure medications. Finally, we leverage our results to identify alternate drugs with predicted efficacy if repurposed for epilepsy treatment.
Subject terms: Genome-wide association studies, Epilepsy
Genome-wide association meta-analyses identify 26 risk loci for epilepsy, including 19 loci specific to genetic generalized epilepsy. Prioritized candidate genes implicate synaptic processes and overlap with targets of antiseizure medications.
Main
The epilepsies are a heterogeneous group of neurological disorders, characterized by an enduring predisposition to generate unprovoked seizures1. It is estimated that over 50 million people worldwide have active epilepsy, with an annual cumulative incidence of 68 per 100,000 persons2.
Similar to other common neurodevelopmental disorders, epilepsies have substantial genetic risk contributions from both common and rare genetic variations. Analysis of the epilepsies benefits from deep phenotyping, which allows clinical subtypes to be distinguished3, in contrast to other common neurodevelopmental disorders, where phenotypic subtypes are more difficult to define. Differences in the genetic architecture of clinical subtypes of epilepsy are also emerging, to complement the clinical partitioning4–7. The rare but severe epileptic encephalopathies are usually nonfamilial and are largely caused by single de novo dominant variants, often involving genes encoding ion channels or proteins of the synaptic machinery8. Both common and rare variants have been shown to contribute to the milder and more common focal and generalized epilepsies. This is particularly true for generalized epilepsy, which is primarily constituted by genetic generalized epilepsy (GGE)4,5,9,10. Nevertheless, previous genetic studies of common epilepsies have explained only a limited proportion of this common genetic variant, or single-nucleotide polymorphism (SNP)-based, heritability—9.2% for focal and 32.1% for GGE4–6,10.
Epilepsy is typically treated using antiseizure medications (ASMs). However, despite the availability of over 25 licensed ASMs worldwide, a third of people with epilepsy experience continuing seizures11. Diet, surgery and neuromodulation represent additional treatment options that can be effective in small subgroups of patients12. Accurate classification of clinical presentations is an important guiding factor in epilepsy treatment.
Here we report the third epilepsy genome-wide association study (GWAS) meta-analysis by the International League Against Epilepsy (ILAE) Consortium on complex epilepsies, comprising a total of 29,944 deeply phenotyped cases recruited from tertiary referral centers and 52,538 controls, approximately doubling the previous sample size4. Results suggest markedly different genetic architectures between focal and generalized forms of epilepsy. Combining these results with those from less-stringently phenotyped biobank and deCODE genetics epilepsy cases did not substantially increase the association signal, despite almost doubling the sample size to 51,678 cases and 1,076,527 controls. Our findings shed light on the enigmatic biology of generalized epilepsy and the importance of accurate syndromic phenotyping and may facilitate drug repurposing for new therapeutic approaches.
Results
Study overview
We performed a GWAS meta-analysis by combining the previously published effort from our consortium4 with unpublished data from the Epi25 collaborative10 and four additional cohorts (Supplementary Tables 1 and 2). Our primary mixed model meta-analysis comprised 4.9 million SNPs tested in 52,538 controls and 29,944 people with epilepsy, of whom 16,384 had neurologist-classified focal epilepsy (FE) and 7,407 had GGE. The epilepsy cases were primarily of European descent (92%), with a smaller proportion of African (3%) and Asian (5%) ancestry (Supplementary Table 3). Cases were matched with controls of the same ancestry, and GWAS analyses were performed separately per ancestry, before performing multi-ancestry meta-analyses for the broad epilepsy phenotypes ‘FE’ (n = 16,384 cases) and ‘GGE’ (n = 7,407 cases). We further conducted meta-analyses in individuals of European ancestry of the well-defined GGE subtypes of juvenile myoclonic epilepsy (JME; n = 1,732), childhood absence epilepsy (CAE; n = 1,049), juvenile absence epilepsy (JAE; n = 662) and generalized tonic-clonic seizures alone (GTCSA; n = 485), as well as the FE subtypes of FE with hippocampal sclerosis (HS; n = 1,260), FE with other lesions (n = 4,213) and lesion-negative FE (n = 5,778). The same controls (n = 42,436) were shared across the different subphenotypes. Sample size prevented the inclusion of other ancestries in the subtype analyses. We ran a variety of follow-up analyses to identify potential sex-specific signals and to obtain biological insights and opportunities for drug repurposing.
GWAS for the epilepsies
Our ‘all epilepsy’ meta-analysis revealed four genome-wide significant loci, of which two are new (Fig. 1). Similar to our previous GWAS4, the 2q24.3 locus was composed of two independently significant signals (Supplementary Table 4). Using ASSET to determine the extent of FE and GGE-related pleiotropy, the 2q24.3 and 9q21.13 signals showed pleiotropic effects at a genome-wide significance level, with concordant SNP effect directions for both forms of epilepsy (Supplementary Table 5). The 2p16.1 and 10q24.32 loci were primarily derived from GGE. The FE analysis did not reveal any genome-wide significant signals.
Fig. 1. Manhattan plot of multi-ancestry all epilepsy (n = 29,944), focal epilepsy (n = 16,384) and genetic generalized epilepsy (n = 7,407) genome-wide meta-analyses, obtained by fixed-effects meta-analysis weighted by effective sample sizes.
The red line shows the genome-wide significance threshold (5 × 10−8). Chromosome and position are displayed on the x axis, and −log10 of the two-sided P value is on the y axis. New genome-wide significant loci are highlighted in red, and loci previously associated with epilepsy are highlighted in orange. New loci are those not reported as genome-wide significant in previous epilepsy GWASs. Annotated genes are those implicated by our gene prioritization analyses. See Supplementary Fig. 7 for quantile–quantile (QQ) plots.
Our ‘GGE’ meta-analysis uncovered a total of 25 independent genome-wide significant signals across 22 loci, of which 13 loci are new. The strongest signal of association (P = 6.6 × 10−21), located at 2p16.1, constitutes three independently significant signals. Similarly, the new locus 12q13.13 was composed of two independently significant signals (Supplementary Table 4). Forest plots and P–M plots of these signals show that they appear consistent across all four GGE subphenotypes, with some exceptions (Supplementary Figs. 1 and 2).
We applied multitrait analysis of GWAS (MTAG)17 to exploit the correlation between FE and GGE, boosting the effective sample size. Results were concordant with our main analysis, and new signals did not emerge (Supplementary Fig. 3).
Functional annotation of the 1,082 genome-wide significant SNPs across the 22 GGE loci and 270 SNPs from the ‘all epilepsy’ loci revealed that most variants were intergenic or intronic (Supplementary Data 1). Eight of 1,082 (0.7%) GGE SNPs were exonic, of which five were located in protein-coding genes and were missense variants. We identified one exonic ‘all epilepsy’ SNP (rs7580482, synonymous), located in SCN1A. Seventy-four percent of ‘all epilepsy’ SNPs and 64% of GGE SNPs were located in open chromatin regions, as indicated by a minimum chromatin state of 1–7 (ref. 14). Further annotation by Combined Annotation-Dependent Depletion (CADD) scores predicted that 11 ‘all epilepsy’ and 50 GGE SNPs were deleterious (CADD score > 12.37) (ref. 15). LDAK heritability analyses showed significant enrichment of signal in ‘super-enhancers’ (Supplementary Table 6), suggesting that GGE SNPs regulate clusters of transcriptional enhancers that control the expression of genes that define cell identity16.
To assess potential syndrome-specific loci, we performed GWAS on seven well-defined FE and GGE subtypes (Supplementary Fig. 4a–g). We found three genome-wide significant loci associated specifically with JME (n = 1,813), of which one was new (8q23.1) and the other two (4p12 and 16p11.2) previously reported4. Our analysis of CAE (n = 1,072) consolidated an established genome-wide significant signal at 2p16.1, which was also observed in the GGE and all epilepsy GWAS. We did not find any genome-wide significant loci for JAE (n = 671), GTCSA (n = 499), ‘nonlesional FE’ (n = 6,367), ‘FE with HS’ (n = 1,375) or ‘FE with other lesions’ (n = 4,661).
MTAG17 analysis of individual GGE subphenotypes showed concordance with the main GGE GWAS, without identifying new loci. In addition, this analysis confirmed that the majority of GWAS-significant SNPs in GGE are overlapping (Supplementary Figs. 5 and 6 and Supplementary Table 7).
The vast majority of loci reported in our previous effort4 remained genome-wide significant. A summary of loci that fell below the genome-wide significance threshold is provided in Supplementary Table 8.
Genomic inflation was comparable to our previous GWAS, and all linkage-disequilibrium score regression (LDSC) intercepts were lower (Supplementary Table 9)4, suggesting that the signals are primarily driven by polygenicity. Computation of the attenuation ratio suggested that part of the inflation signal, in particular for FE (0.58), might be due to some form of bias (for example, confounding or population stratification)13. The attenuation ratio was lowest for GGE (0.11), which includes the vast majority of significant loci (Supplementary Table 9).
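As an illustration of the attenuation ratio discussed above, the following Python sketch applies its conventional definition, (LDSC intercept − 1) / (mean χ² − 1). The function name and the input values are hypothetical placeholders chosen for illustration; they are not the values reported in Supplementary Table 9.

```python
def attenuation_ratio(ldsc_intercept: float, mean_chi2: float) -> float:
    """Share of test-statistic inflation not attributed to polygenicity.

    Conventional definition: (intercept - 1) / (mean chi-square - 1).
    A value near 0 suggests inflation is mostly polygenic signal; a value
    near 1 suggests it is mostly bias (e.g., confounding or stratification).
    """
    if mean_chi2 <= 1.0:
        raise ValueError("mean chi-square must exceed 1 for the ratio to be defined")
    return (ldsc_intercept - 1.0) / (mean_chi2 - 1.0)


# Hypothetical inputs, for illustration only (not taken from the study).
ratio = attenuation_ratio(ldsc_intercept=1.029, mean_chi2=1.05)
print(f"attenuation ratio = {ratio:.2f}")
```

Under this definition, a low ratio such as that reported for GGE indicates that most of the inflation reflects polygenic signal rather than bias.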
Locus annotation, gene-based analyses and gene prioritization
Using FUMA18 (Methods), the ‘all epilepsy’ meta-analysis was mapped to 43 genes and the GGE analysis to 278 genes (Supplementary Data 2). Thirty-nine of the 43 ‘all epilepsy’ genes overlapped with GGE, resulting in a total of 282 uniquely mapped genes. These 282 genes were enriched for monogenic epilepsy genes (hypergeometric test, 18/837 genes overlapped; odds ratio (OR) = 1.51, P = 0.04) and targets of ASMs (hypergeometric test, 9/191 genes overlap; OR = 3.39, P = 5.4 × 10−4).
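The enrichment tests above can be sketched as a one-sided hypergeometric test together with a 2 × 2 odds ratio. The Python example below is a minimal sketch, not the authors' implementation: the function names are illustrative, and the background gene count (`BACKGROUND`) is a hypothetical assumption because the size of the gene universe is not stated in this section. The printed values therefore need not match the reported statistics.

```python
from math import comb


def hypergeom_upper_tail(overlap: int, population: int, set_size: int, draws: int) -> float:
    """P(X >= overlap) when `draws` genes are sampled without replacement from
    `population` genes, of which `set_size` belong to the reference gene set."""
    total = comb(population, draws)
    upper = min(set_size, draws)
    numerator = sum(
        comb(set_size, k) * comb(population - set_size, draws - k)
        for k in range(overlap, upper + 1)
    )
    return numerator / total  # exact integer arithmetic until the final division


def odds_ratio(overlap: int, population: int, set_size: int, draws: int) -> float:
    """Odds ratio of the 2 x 2 table (mapped vs. not mapped, in set vs. not in set)."""
    a = overlap                    # mapped and in the reference set
    b = draws - overlap            # mapped, not in the reference set
    c = set_size - overlap         # not mapped, in the reference set
    d = population - draws - c     # neither
    return (a * d) / (b * c)


BACKGROUND = 19_000  # hypothetical gene universe; an assumption, not from the study

# 18 of 837 monogenic epilepsy genes overlap the 282 mapped genes (counts from the text).
p_value = hypergeom_upper_tail(18, BACKGROUND, 837, 282)
print(f"P = {p_value:.3g}, OR = {odds_ratio(18, BACKGROUND, 837, 282):.2f}")
```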
We calculated a gene-based association score based on the aggregate of all SNPs inside each gene using MAGMA (Methods)19. This analysis yielded 39 significant genic associations—six with ‘all epilepsy’ and 37 with GGE (four overlapped with the ‘all epilepsy’ analysis), after correction for 16,371 tested genes (P < 0.05/16,371 genes; Supplementary Data 3). Thirteen of these 39 genes mapped to regions outside of the genome-wide significant loci from the single SNP analyses.
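The gene counts and the significance threshold quoted in the two preceding paragraphs follow from simple arithmetic, which the short Python sketch below reproduces. The helper names are illustrative only; the input numbers are taken from the text.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold after Bonferroni correction."""
    return alpha / n_tests


def union_size(n_a: int, n_b: int, n_both: int) -> int:
    """Number of unique items across two overlapping lists (inclusion-exclusion)."""
    return n_a + n_b - n_both


# MAGMA: correction for 16,371 tested genes.
gene_threshold = bonferroni_threshold(0.05, 16_371)

# MAGMA: six 'all epilepsy' genes, 37 GGE genes, four shared.
magma_genes = union_size(6, 37, 4)

# FUMA: 43 'all epilepsy' genes, 278 GGE genes, 39 shared.
fuma_genes = union_size(43, 278, 39)

print(f"gene-level threshold = {gene_threshold:.2e}")
print(f"unique MAGMA genes = {magma_genes}, unique FUMA genes = {fuma_genes}")
```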
Next, we performed a transcriptome-wide association study (TWAS) to assess whether epilepsy was associated with differential gene expression in the brain (Methods)20,21. These analyses revealed significant associations with 27 genes in total: 13 genes with ‘all epilepsy,’ 16 with GGE and two with both phenotypes (Supplementary Data 4). Nineteen of the 27 genes mapped outside of the 26 loci identified through the GWAS. Using summary-data-based Mendelian randomization (SMR)22, we determined a potentially causal relationship between brain expression of RMI1 and ‘all epilepsy,’ and between brain expression of RMI1, CDK5RAP3 and TVP23B and GGE (Supplementary Data 5).
Of note, expression of RMI1 was associated with GGE in both TWAS (P = 4.0 × 10−10) and SMR (P = 5.2 × 10−8), as well as with ‘all epilepsy’ (TWAS P = 1.3 × 10−6; SMR P = 2.6 × 10−6). RMI1 has a crucial role in genomic stability23 and has not been previously associated with epilepsy or any other Mendelian trait (OMIM, 610404).
We used a combination of ten different criteria to identify the most likely implicated gene within each of the 26 associated loci from the meta-analysis (Methods). This resulted in a shortlist of 29 genes (Table 1; see Supplementary Data 6 for scores of all mapped genes), of which ten are monogenic epilepsy genes, seven are known targets of currently licensed ASMs and 17 are associated with epilepsy for the first time.
Table 1.
Genome-wide significant loci and prioritized genes
| Phenotype | Locus | New/replication | Lead SNP (A1:A2) | Freq1 | Z score | P value | Genes | Total | Missense | TWAS | SMR | MAGMA | PoPS | Brain exp | Brain-coX | KO mouse | AED target | Monogenic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All epilepsy | 2p16.1 | Replication | rs13032423 (A:G) | 0.53 | −7.04 | 1.85 × 10−12 | BCL11A | 5 | – | – | – | – | * | * | * | * | – | * |
| | 2q24.3 | Replication | rs59237858 (T:C) | 0.23 | −6.89 | 5.75 × 10−12 | SCN1A | 8 | * | – | – | * | * | * | * | * | * | * |
| | 9q21.13 | New | rs4744696 (A:G) | 0.82 | −5.74 | 9.69 × 10−9 | RORB | 4 | – | – | – | – | * | * | * | * | – | – |
| | 10q24.32 | New | rs3740422 (C:G) | 0.33 | 6.04 | 1.52 × 10−9 | KCNIP2 | 3 | – | – | – | * | – | * | * | – | – | – |
| GGE | 1q43 | New | rs876793 (T:C) | 0.67 | −5.95 | 2.64 × 10−9 | RYR2 | 4 | – | – | – | – | * | * | * | * | – | – |
| | | | | | | | CHRM3 | 4 | – | – | – | – | – | * | * | * | * | – |
| | 2p16.1 | Replication | rs11688767 (A:T) | 0.53 | 9.38 | 6.58 × 10−21 | BCL11A | 5 | – | – | – | – | * | * | * | * | – | * |
| | 2q12.1 | New | rs62151809 (T:C) | 0.43 | 6.77 | 1.28 × 10−11 | POU3F3 | 3 | – | – | – | – | * | * | – | * | – | – |
| | 2q24.3 | Replication | rs11890028 (T:G) | 0.72 | 5.63 | 1.73 × 10−8 | SCN1A | 8 | * | – | – | * | * | * | * | * | * | * |
| | 2q32.2 | Replication | rs6721964 (A:G) | 0.66 | −6.18 | 6.54 × 10−10 | GLS | 4 | – | – | – | – | – | * | * | * | – | * |
| | 3p22.3 | New | rs9861238 (A:G) | 0.41 | −6.42 | 1.33 × 10−10 | STAC | 2 | – | – | – | – | * | – | * | – | – | – |
| | 3p21.31 | New | rs739431 (A:G) | 0.84 | 6.23 | 4.82 × 10−10 | CACNA2D2 | 6 | – | – | – | * | – | * | * | * | * | * |
| | 4p15.1 | Replication | rs1463849 (A:G) | 0.59 | −6.59 | 4.38 × 10−11 | PCDH7 | 3 | – | – | – | * | * | – | * | – | – | – |
| | 5q22.3 | Replication | rs4596374 (T:C) | 0.55 | −6.98 | 2.91 × 10−12 | KCNN2 | 6 | – | – | – | * | * | * | * | * | – | * |
| | 5q31.2 | New | rs2905552 (C:G) | 0.48 | −6.33 | 2.49 × 10−10 | SPOCK1 | 5 | – | – | – | * | * | * | * | * | – | – |
| | 6q22.33 | Replication | rs13219424 (T:C) | 0.29 | −5.49 | 3.87 × 10−8 | PTPRK | 3 | – | – | – | – | * | – | * | * | – | – |
| | 7p14.1 | New | rs37276 (T:G) | 0.26 | −5.69 | 1.29 × 10−8 | SUGCT | 2 | – | * | – | – | * | – | – | – | – | – |
| | 9q21.32 | New | rs2780103 (T:C) | 0.26 | −6.93 | 4.34 × 10−12 | RMI1 | 5 | * | * | * | * | – | – | – | * | – | – |
| | 10q24.32 | New | rs11191156 (A:G) | 0.67 | −7.55 | 4.41 × 10−14 | KCNIP2 | 4 | – | – | – | * | * | * | * | – | – | – |
| | 12q13.13 | New | rs114131287 (A:T) | 0.02 | 5.83 | 5.46 × 10−9 | SCN8A | 6 | – | – | – | – | * | * | * | * | * | * |
| | 16p13.3 | New | rs62014006 (T:G) | 0.05 | 5.88 | 4.22 × 10−9 | RBFOX1 | 5 | – | – | – | * | * | * | * | * | – | – |
| | 17p13.1 | New | rs2585398 (A:C) | 0.53 | −6.37 | 1.84 × 10−10 | ARHGEF15 | 6 | * | * | * | * | – | – | * | * | – | – |
| | 17q21.32 | Replication | rs16955463 (T:G) | 0.25 | −5.97 | 2.30 × 10−9 | CDK5RAP3 | 4 | – | * | * | * | – | – | – | * | – | – |
| | 19p13.3 | New | rs75483641 (T:C) | 0.14 | −6.22 | 4.85 × 10−10 | AP3D1 | 5 | * | – | * | * | – | – | – | * | – | * |
| | 21q21.1 | New | rs1487946 (A:G) | 0.59 | 5.47 | 4.41 × 10−8 | TMPRSS15 | 1 | – | – | – | – | * | – | – | – | – | – |
| | 21q22.1 | Replication | rs7277479 (A:G) | 0.36 | −6.82 | 8.94 × 10−12 | GRIK1 | 4 | – | – | – | – | – | * | * | * | * | – |
| | 22q13.32 | New | rs469999 (A:G) | 0.31 | −6.32 | 2.65 × 10−10 | FAM19A5 | 2 | – | – | – | – | * | * | – | – | – | – |
| CAE | 2p16.1 | Replication | rs12185644 (A:C) | 0.70 | −7.12 | 1.04 × 10−12 | BCL11A | 5 | – | – | – | – | * | * | * | * | – | * |
| JME | 4p12 | Replication | rs17537141 (T:C) | 0.851 | −5.47 | 4.62 × 10−8 | GABRA2 | 6 | – | – | – | * | – | * | * | * | * | * |
| | 8q23.1 | New | rs3019359 (T:C) | 0.414 | −5.55 | 2.89 × 10−8 | RSPO2 | 3 | – | – | – | – | * | * | * | – | – | – |
| | | | | | | | TMEM74 | 3 | – | – | – | – | – | * | * | * | – | – |
| | 16p11.2 | Replication | rs1046276 (T:C) | 0.353 | 6.19 | 6.05 × 10−10 | STX1B | 5 | – | – | – | * | – | * | * | * | – | * |
| | | | | | | | CACNA1I | 5 | – | – | – | – | – | * | * | * | * | * |
Genome-wide significant loci are annotated with details from the lead-SNP and prioritized genes. Loci were classified as new or replication according to the genome-wide significant results of previous GWAS publications. Genes were scored based on ten criteria/methods, after which the gene with the highest score in the locus was selected as the prioritized gene. Genomic coordinates for each locus (hg19) can be found in Supplementary Table 4. Two-tailed P values and z scores were obtained by fixed-effects meta-analysis weighted by effective sample sizes.
Total, number of satisfied criteria for gene prioritization; missense, the locus contains a missense variant in the gene; TWAS, significant transcriptome-wide association with the gene; SMR, significant summary-based Mendelian randomization association with the gene; MAGMA, significant genome-wide gene-based association; PoPS, gene prioritized by polygenic priority score; brain exp, the gene is preferentially expressed in brain tissue; brain-coX, the gene is prioritized as co-expressed with established epilepsy genes; KO mouse, knockout of the gene causes a neurological phenotype in mouse models; AED target, the gene is a known target of a currently licensed ASM; monogenic, the gene is a known cause of monogenic epilepsy.
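The z scores and two-tailed P values in Table 1 are linked through the standard normal distribution, and a meta-analysis weighted by effective sample sizes can be sketched as a weighted sum of per-cohort z scores. The Python example below is a minimal sketch under stated assumptions, not the pipeline used in this study: the effective sample size formula 4 / (1/N_cases + 1/N_controls) and the weights √N_eff follow a common convention for case-control meta-analysis, and the function names and per-cohort inputs are hypothetical.

```python
import math


def two_sided_p(z: float) -> float:
    """Two-sided P value for a standard-normal z score.

    erfc is used because it stays accurate far in the tail.
    """
    return math.erfc(abs(z) / math.sqrt(2.0))


def effective_n(n_cases: int, n_controls: int) -> float:
    """Effective sample size of a case-control cohort (assumed convention)."""
    return 4.0 / (1.0 / n_cases + 1.0 / n_controls)


def weighted_z(z_scores, n_effs):
    """Combine per-cohort z scores using weights sqrt(N_eff) (assumed convention)."""
    weights = [math.sqrt(n) for n in n_effs]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    denominator = math.sqrt(sum(w * w for w in weights))
    return numerator / denominator


# z score of the GGE lead SNP at 2p16.1, as listed in Table 1.
p_lead = two_sided_p(9.38)
print(f"P for z = 9.38: {p_lead:.2e}")

# Two hypothetical cohorts of equal size with the same z score.
n_eff = effective_n(1_000, 2_000)
z_meta = weighted_z([3.0, 3.0], [n_eff, n_eff])
print(f"combined z = {z_meta:.3f}")
```

Because the tabulated z scores are rounded to two decimals, P values recomputed this way agree with the table only approximately.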
The strongest association signal for GGE was found at 2p16.1, consistent with our previous results where we implicated VRK2 or FANCL24. Our gene prioritization analysis suggests the transcription factor BCL11A as the culprit gene, located 2.5 Mb upstream of the lead SNPs at this locus. Two of three lead SNPs are in enhancer regions (as assessed by chromatin states in brain tissue) that are linked to the BCL11A promoter via 3D chromatin interactions (Supplementary Fig. 8). Rare variants in BCL11A were recently associated with intellectual disability and epileptic encephalopathy25. However, interrogation of the MetaBrain expression quantitative trait loci (eQTL) database did not reveal a significant association of our lead SNPs with BCL11A expression.
The HLA system and common epilepsies
The highly polymorphic HLA region has been associated with various neuropsychiatric and autoimmune neurological disorders. Therefore, we imputed HLA alleles and amino acid residues using CookHLA v1.0.1 (ref. 26) and ran association analyses across the ‘all epilepsy,’ FE and GGE phenotypes, as well as the seven subphenotypes (Methods). No SNP, amino acid residue or HLA allele reached genome-wide significance (Supplementary Fig. 9). The most significant signal was an aspartate amino acid residue in exon 2 of HLA-B (position 31432494), which had a P value of 3.8 × 10−7 for GGE.
SNP-based heritability
We calculated SNP-based heritability using LDAK to determine the proportion of epilepsy risk attributable to common genetic variants. We observed liability scale SNP-based heritabilities of 17.7% (95% confidence interval (CI): 15.5–19.9%) for all epilepsy, 16.0% (14.0–18.0%) for FE and 39.6% (34.3–44.6%) for GGE. Heritabilities were notably higher for all individual GGE subtypes, ranging from 49.6% (14.0–85.3%) for GTCSA to 90.0% (63.3–116.6%) for JAE (Supplementary Table 10).
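The upper confidence limit above 100% for JAE is what a symmetric, normal-approximation interval produces when the standard error is large, because such an interval is not bounded by the parameter range. The Python sketch below assumes that the reported intervals are approximately of the form estimate ± 1.96 × s.e.; this is an assumption for illustration, and the helper names are hypothetical.

```python
from statistics import NormalDist

Z95 = NormalDist().inv_cdf(0.975)  # two-sided 95% critical value, about 1.96


def se_from_ci(lower: float, upper: float) -> float:
    """Back out the standard error implied by a symmetric 95% interval."""
    return (upper - lower) / (2.0 * Z95)


def ci_from_estimate(estimate: float, se: float):
    """Symmetric 95% interval; not truncated to the 0-100% range."""
    half_width = Z95 * se
    return estimate - half_width, estimate + half_width


# Intervals as reported in the text (percent, liability scale).
jae_se = se_from_ci(63.3, 116.6)
gge_se = se_from_ci(34.3, 44.6)
print(f"implied s.e.: JAE = {jae_se:.1f}%, GGE = {gge_se:.1f}%")

# Rebuilding the JAE interval around the 90.0% point estimate.
low, high = ci_from_estimate(90.0, jae_se)
print(f"JAE interval = {low:.2f}% to {high:.2f}%")
```

The much larger implied standard error for JAE than for GGE is consistent with the smaller number of cases in the individual subtypes.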
Using a univariate causal mixture model27 (Methods), we estimated that 2,850 causal SNPs (s.e.: 200) underlie 90% of the SNP-based heritability of GGE, comparable with previous estimates9. Power analysis demonstrated that the current genome-wide significant SNPs only explain 1.5% of the phenotypic variance, whereas an estimated sample size of around 2.5 million individuals would be necessary to identify the causal SNPs that explain 90% of GGE SNP-based heritability (Supplementary Fig. 10).
To further explore the heritability of the different epilepsy phenotypes, we used LDSC to perform genetic correlation analyses28. We found evidence for a strong genetic correlation among all four GGE syndromes (Supplementary Fig. 11 and Supplementary Table 11). We also observed the previously reported significant genetic correlation4 between the focal nonlesional and JME syndromes. Here, CAE also showed a significant genetic correlation with the focal nonlesional cohort. Multivariate modeling of genetic correlation using genomic structural equation modeling (SEM)29 confirmed that most of the heritability signal is shared among the four GGE syndromes, with some subtype-specific signals (Supplementary Fig. 12).
Tissue and cell type enrichment
To further illuminate the underlying biological causes of the epilepsies, we used MAGMA19 and data from the gene–tissue expression (GTEx) consortium to assess whether our GGE-associated genes were enriched for expression in specific tissues and cell types (Methods). We identified significant enrichment of associated genes expressed in brain and pituitary tissue (Supplementary Fig. 13). The implication of the pituitary gland in GGE might reflect a hormonal component to seizure susceptibility. Further subanalyses showed that our results were enriched for genes expressed in almost all brain regions, including subcortical structures such as the hypothalamus, hippocampus and amygdala (Supplementary Fig. 14). We did not find enrichment for genes expressed at specific developmental stages in the brain (Supplementary Fig. 15).
Cell-type specificity analyses of GGE data using various single-cell RNA-sequencing reference datasets (Methods) revealed enrichment in excitatory as well as inhibitory neurons, but not in other brain cells like astrocytes, oligodendrocytes or microglia (Supplementary Fig. 16). Similarly, stratified linkage-disequilibrium (LD)-score regression using single-cell expression data (Methods) did not reveal a difference between excitatory and inhibitory neurons (P = 0.18).
Gene-set analyses
MAGMA gene-set analyses showed significant associations between GGE and biological processes involving various functions in the synapse (Supplementary Data 7). To further refine the synaptic signal, we performed a gene-set analysis using lists of expert-curated gene sets involving 18 different synaptic functions30. These analyses showed that GGE was associated with intracellular signal transduction (n = 139 genes, P = 9.6 × 10−5) and excitability in the synapse (n = 54 genes, P = 0.0074). None of the other 16 synaptic functions showed any association (Supplementary Data 7). Genes involved with excitability include the N-type calcium channel gene CACNA2D2, implicated at the new GGE locus 3p21.31. N-type calcium channel blockers such as levetiracetam and lamotrigine are among the most widely used and effective ASMs for GGE as well as FE31–33. Together, these results suggest that the genes associated with GGE are expressed in excitatory as well as inhibitory neurons in various brain regions, where they affect excitability and intracellular signal transduction at the synapse.
Sex-specific analyses
There are known sex-related patterns in the epidemiology of epilepsy. Although females have a marginally lower incidence of epilepsy than males, GGE is known to occur more frequently in females34. To test whether this sex divergence has a genetic basis, we performed sex-specific GWAS for ‘all’, GGE and FE (Supplementary Figs. 17–19). These analyses revealed one female-specific genome-wide significant signal at 10q24.32 (lead SNP: rs72845653), containing KCNIP2. This locus was also implicated in our main GGE meta-analysis (lead SNP: rs11191156); however, the lead SNPs of these two signals show low allelic correlation (r2 = 0.05; D′ = 0.87). Interestingly, the direction of effect of this signal is opposite in females and males. This sex difference is further corroborated by significant sex heterogeneity (P = 1.54 × 10−8) and sex-differentiated GWAS (P = 5.6 × 10−9) (ref. 35). Sex-related differences in transcription levels in human heart have previously been reported for KCNIP2 (ref. 36). We did not find any sex-divergent signals for ‘all’ or FE. These analyses were limited by a reduction in sample size and prone to random fluctuation.
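A low r2 together with a high D′, as reported for the two lead SNPs at 10q24.32, typically arises when the two variants have very different allele frequencies: the rarer allele occurs almost exclusively on one background (high D′), yet it predicts the commoner allele poorly (low r2). The Python sketch below computes both statistics from haplotype frequencies; the frequencies used are hypothetical and are not estimates for rs72845653 or rs11191156.

```python
def ld_stats(p_a: float, p_b: float, p_ab: float):
    """Return (r2, d_prime) for two biallelic SNPs.

    p_a, p_b: frequencies of allele A at SNP 1 and allele B at SNP 2.
    p_ab:     frequency of the haplotype carrying both A and B.
    Assumes both SNPs are polymorphic (frequencies strictly between 0 and 1).
    """
    d = p_ab - p_a * p_b  # linkage disequilibrium coefficient D
    r2 = d * d / (p_a * (1.0 - p_a) * p_b * (1.0 - p_b))
    if d >= 0:
        d_max = min(p_a * (1.0 - p_b), (1.0 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1.0 - p_a) * (1.0 - p_b))
    d_prime = abs(d) / d_max
    return r2, d_prime


# Hypothetical frequencies: a rare allele (5%) found mostly on a common (50%) background.
r2, d_prime = ld_stats(p_a=0.05, p_b=0.50, p_ab=0.047)
print(f"r2 = {r2:.2f}, D' = {d_prime:.2f}")
```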
We used LDSC to assess the genetic correlation between male-only and female-only GWAS. The male and female GWAS of ‘all epilepsy,’ FE and GGE were strongly genetically correlated (all rG > 0.9), and none of these correlations were significantly different from 1 (all P > 0.05). These results suggest that, with the exception of the female-specific 10q24.32 signal, the overall genetic basis of common epilepsy appears largely similar between males and females.
Genetic overlap between epilepsy and other phenotypes
To explore the genetic overlap of epilepsy with other diseases, we first used the GWAS Catalog37 to cross-reference the 26 genome-wide significant epilepsy loci with other traits with significant associations (P < 5 × 10−8) for the same SNP, or SNPs in strong LD with our lead SNPs (as detailed in Table 1). This analysis revealed 18 likely pleiotropic loci, with previous associations reported across a variety of traits, the most common being cognitive, sleep, psychiatric, coronary and blood cell-related (Supplementary Fig. 20). The remaining eight loci appear to be specific to epilepsy (3p22.3, 4p12, 5q31.2, 7p14.1, 8q23.1, 9q21.13, 21q21.1 and 21q22.1).
We then performed genetic correlation analyses between 18 selected traits (Supplementary Table 12) and ‘all’, GGE and FE using LDSC13. The selected traits had either, or a combination of, epilepsy as a common comorbidity or pleiotropic loci shared with epilepsy. Significant correlations (P < 0.05/54 = 0.0009) were found with febrile seizures, stroke, headache, ADHD, type 2 diabetes and intelligence (Fig. 2).
Fig. 2. Genetic correlations of epilepsy with other phenotypes.
The genetic correlation coefficient was calculated with LDSC and is denoted by color scale from −1 (red; negatively (anti-)correlated) to +1 (blue; positively correlated). The square size relates to the absolute value of the corresponding correlation coefficient. Single asterisk indicates two-sided P < 0.05 and double asterisk indicates two-sided P < 0.0009 (Bonferroni corrected).
Genetic correlation analyses assess the aggregate of shared genetic variants associated with two phenotypes. However, genetic correlations can become close to zero when there is inverse directionality of SNP effects between two phenotypes38. To explore this further, we applied MiXeR v1.2.0 to quantify the polygenic overlap between GGE and the same 18 selected traits, irrespective of genetic correlation (Methods). Results showed a large polygenic overlap between epilepsy and various other brain traits (Supplementary Fig. 21). For most selected brain traits, the direction of effect was concordant for 40–60% of SNPs. This might explain why some LDSC correlations were low, together with other relevant factors including sample size, polygenicity and trait genetic architecture. In combination, these analyses suggest that the SNPs involved with GGE are highly pleiotropic; a large proportion of the ~2,850 causal SNPs underlying GGE seem to underlie the risk of a wide range of other brain diseases and traits, often with opposing directions of effect. These results emphasize that each phenotype has a specific underlying distribution of effect sizes and directions among shared causal variants, which together explain the shared and unique risk for different brain diseases.
Leveraging GWAS for drug repurposing
We next tested the potential of our meta-analysis to inform drug repurposing, by predicting the relative efficacy of drugs for epilepsy (Methods). This analysis was based on the predicted ability of each drug to modulate epilepsy-related changes in the function and abundance of proteins, as inferred from the GWAS summary statistics (Methods)39. In our predictions for all epilepsy, current ASMs were ranked higher than expected by chance (P < 1 × 10−6) and higher than drugs used to treat any other human disease (Supplementary Data 8). These observations were also true for a ‘test set’ (randomly selected 50%) of ASMs, when the remaining ASMs (‘training set’) were used for optimizing the predictions.
For GGE, broad-spectrum ASMs were predicted to be more effective than narrow-spectrum ASMs (P < 1 × 10−6), consistent with clinical experience40. Furthermore, the predicted order of efficacy for GGE of individual ASMs matched their observed order in the largest head-to-head randomized controlled clinical trials for generalized epilepsy33,41, an observation unlikely to occur by chance (P < 1 × 10−6).
Using this approach, we highlight the top 20 drugs that are licensed for conditions other than epilepsy but are predicted to be efficacious for generalized epilepsy, and that additionally have published evidence of antiseizure efficacy from multiple studies and multiple animal models (Supplementary Table 13). The full list of all predictions can be found in Supplementary Data 9.
GWAS in epilepsies ascertained from population biobanks
Finally, we leveraged the data from several large-scale population biobanks and from deCODE genetics to explore the consistency of the epilepsy loci in cohorts that were less deeply phenotyped (total cases n = 21,734, total controls n = 1,023,989, phenotyped using International Classification of Diseases (ICD) codes; Methods; Supplementary Table 14). Forest plots showed a consistent direction of effect between the biobanks and our primary GWAS for all biobank-genotyped genome-wide significant top SNPs of the ‘all epilepsy’ GWAS and for all but one GGE top SNP (Supplementary Figs. 22 and 23). Although the biobank and deCODE genetics-specific GWAS did not identify any genome-wide significant loci for GGE or ‘all epilepsy,’ one significant locus at 2q22.1 (nearest gene, NXPH2) emerged for FE (Supplementary Fig. 24).
Meta-analysis of the biobank and deCODE genetics summary statistics with those from the primary epilepsy GWAS identified seven significant loci for the ‘all epilepsy’ phenotype. Six of these signals were previously identified in the primary ‘all epilepsy’ (n = 4) or the ‘GGE’ GWAS (n = 2). One locus (2q12.1) was new. The combined biobank and deCODE genetics meta-analysis for GGE identified five new loci, but four loci from our primary GWAS fell below the threshold of significance (Supplementary Fig. 25). The combined FE meta-analysis showed no significant associations. LDSC between the biobank/deCODE genetics and the primary GWAS results showed genetic correlations ranging between 0.31 and 0.74 (Supplementary Table 15).
Discussion
In this study, we leveraged a substantial increase in sample size to uncover 26 common epilepsy risk loci, of which 16 have not been reported previously. Using a combination of ten post-GWAS analysis methods, we pinpointed 29 genes that most likely underlie these signals of association. These signals showed enrichment throughout the brain and indicate an important role for synapse biology in excitatory as well as inhibitory neurons. Drug prioritization from the genetic data highlighted licensed ASMs, ranked the ASMs broadly in line with clinical experience and pointed to drugs for potential repurposing. These findings further our understanding of the pathophysiology of common epilepsies and provide new leads for therapeutics.
The 26 associated loci included some notable monogenic epilepsy genes. These include the calcium channel gene CACNA2D2, an established epileptic encephalopathy gene42 that is directly targeted by ten currently licensed drugs, including two ASMs (gabapentin and pregabalin) as well as the Parkinson’s disease drug safinamide and the nonsteroidal anti-inflammatory drug celecoxib. Both safinamide and celecoxib have evidence of antiseizure activity43,44. SCN8A, which encodes a voltage-gated sodium channel, is an established epileptic encephalopathy gene and is associated here with common epilepsies. Nav1.6 (encoded by SCN8A) is targeted by commonly used sodium channel-blocking drugs, which are the most efficacious ASMs for people with monogenic SCN8A-related epilepsies; these epilepsies are often caused by gain-of-function pathogenic variants45. Additional drugs targeting Nav1.6 include safinamide and quinidine. RYR2 encodes a ryanodine receptor, is an established cardiac disorder gene, has recently been implicated in epilepsy46,47 and is targeted by caffeine as well as simvastatin, atorvastatin and carvedilol. The acetylcholine receptor gene CHRM3 has been previously associated with epilepsy48 and is targeted by drugs including solifenacin, used to treat urinary incontinence.
We found that GGE, in particular, has a strong contribution from common genetic variation. When analyzing individual GGE syndromes, we found that up to 90% of liability is attributable to common variants in the JAE subtype, making it among the highest of over 700 traits reported in a large GWAS atlas49 (albeit with relatively large CIs; Supplementary Table 10). The heritability estimates decrease to 40% for the collective GGE phenotype, possibly due to increased heterogeneity from combining syndromes with pleiotropic as well as syndrome-specific risk loci. Although statistical power drastically decreased when assessing specific GGE syndromes, three loci appeared specific to JME. These findings highlight the unique genetic architecture of the subtypes of common epilepsies, which are characterized by a high degree of both shared and syndrome-specific genetic risk.
In contrast to GGE, for FEs, we found only a minor contribution of common variants, with no variant reaching genome-wide significance. It would seem that FEs, as a group, are far more heterogeneous than GGE, lack (common-variation) loci with high effect sizes, have a higher degree of polygenicity and/or have a lower contribution of common heritable risk variation. Our attempt to mitigate this heterogeneity by performing subtype analysis contrasted with the results from GGE, suggesting different genetic architectures, consistent with the experience from studies of common9 and rare5 genetic variation and polygenic risk score analyses6. There is also emerging evidence for a substantial role of noninherited, somatic mutations in FEs50.
This work highlights the challenges of working with epilepsy cohorts ascertained through large biobanking initiatives. Accurate classification of epilepsy requires a combination of clinical features, electrophysiology and neuroimaging. Such details were absent from the biobanks we worked with. Rather, phenotypes were generally limited to ICD codes, which are prone to misclassification51. Population biobanks are also probably ascertaining milder epilepsies that are responsive to treatment, contrasting with the enrichment for refractory epilepsies at tertiary referral centers.
Moreover, a proportion of adults with epilepsy have an acquired brain lesion, such as stroke, tumors or head trauma. Biobanks typically provide self-reported clinical information and codes from primary care and inpatient hospital care episodes, but not neurological specialist outpatient records that would indicate whether previous brain insults were considered relevant to epilepsy. As a result, the inclusion of the biobank data appeared to introduce more heterogeneity. This contrasts with genetic mapping of other polygenic diseases like type 2 diabetes and migraine, which are relatively easy and reliable to diagnose and classify, resulting in a great increase in GWAS loci when including data from the same biobanks as included in our study52,53.
We found enrichment of GGE variants in brain-expressed genes, involving excitatory and inhibitory neurons, but not any other brain cell type. This contrasts with other neurological diseases. For example, microglia are involved in Alzheimer’s disease54 and multiple sclerosis55, whereas migraine does not appear to have brain cell specificity53. We further refine this signal by showing the involvement of synapse biology, primarily intracellular signal transduction and synapse excitability. These findings suggest an important role of synaptic processes in excitatory and inhibitory neurons throughout the brain, which could be a potential therapeutic target. Indeed, synaptic vesicle transport is a known target of the ASMs levetiracetam and brivaracetam56.
We confirmed that our GWAS-identified genes had substantial overlap with monogenic epilepsy genes. A similar convergence of common and rare variant associations has been observed for other neurological and neuropsychiatric conditions including schizophrenia57 and ALS58. The genes prioritized in our GWAS signals also overlapped with known targets of current ASMs4, and we have provided a list of other drugs that directly target these genes. Moreover, using a systems-based approach39, we highlight drugs that are predicted to be efficacious when repurposed for epilepsy, based on their ability to perturb function and abundance in gene expression. Insights from GWAS of epilepsy have the potential to accelerate the development of new treatments via the identification of promising drug repurposing candidates for clinical trials59. We anticipate that follow-up studies of the highlighted drugs in this study could show clinical efficacy in epilepsy treatment.
In summary, these new data reveal markedly different genetic architectures between the milder and more common focal and generalized epilepsies, provide new biological insights to disease etiology and highlight drugs with predicted efficacy when repurposed for epilepsy treatment.
Methods
Inclusion and ethics statement
Local institutional review boards approved study protocols at each contributing site. All study participants provided written, informed consent for the use of their data in genetic studies of epilepsy. For minors, written informed consent was obtained from their parents or legal guardian.
Sample and phenotype descriptions
This meta-analysis combines previously published datasets with new genotyped cohorts. Descriptions of the 24 cohorts included in our previous analysis can be found in the Supplementary Table 6 of that publication4. Here we included five new cohorts (Supplementary Table 1), comprising 14,732 epilepsy cases and 22,362 controls, resulting in a total sample size of 29,944 cases and 52,538 controls. Classification of epilepsy was performed as described previously (see Supplementary Note for a detailed description)4. In brief, we assigned people with epilepsy to FE, GGE or unclassified epilepsy. ‘All epilepsy’ was the combination of GGE, focal and unclassified epilepsy. Where possible, we used EEG, MRI and clinical history to further refine the subphenotypes—JME, CAE, JAE, GTCSA, nonlesional FE, FE with HS and FE with lesions other than HS.
Genotyping, quality control (QC) and imputation
Study participants were genotyped on SNP arrays (see Supplementary Table 1 for an overview of genotyping in new cohorts). QC was performed separately for each cohort. Pre-imputation QC included removal of SNPs with low call rate (<98%), differential missing rate, duplicated and monomorphic SNPs, SNPs with batch association (P < 10−4) and violation of Hardy–Weinberg equilibrium (P < 10−10). In addition, the Epi25 cohort was split by ancestry, based on principal component analysis. Individuals were removed if their heterozygous/homozygous ratio was >4 s.d. from the mean. We also removed one from each pair of related samples (determined by identity-by-descent >0.2) and removed individuals with ambiguous or nonmatching genetically imputed sex. Furthermore, 3,180 duplicates between the Epi25 cohort and the previously published genome-wide mega-analysis4 were identified based on genotype and were removed from the Epi25 cohort. Of the 3,180 duplicates, 1,226 were GGE and 1,402 FE. Before imputation, cohorts were cross-referenced to the Haplotype Reference Consortium (HRC) panel to ensure SNPs matched in terms of strand, position and ref/alt allele assignment. Additionally, SNPs were removed if they were absent in the HRC panel, if they had a >20% allele frequency difference with the HRC panel or if any AT/GC SNPs had MAFs >40%, using tools available from https://www.well.ox.ac.uk/~wrayner/tools/. Data from Janssen Pharmaceuticals, Austrian GenEpa, Swiss GenEpa, Norwegian GenEpa and BPCCC were then imputed using the Wellcome Sanger Institute’s imputation server (https://imputation.sanger.ac.uk/), using EAGLE v2.4.1 (ref. 60) for phasing, and the Positional Burrows–Wheeler Transform algorithm61 v3.1 for imputation. The HRC reference panel r1.1 was used as a reference for imputation (n = 32,470) (ref. 62). Similarly, data from the Epi25 cohort were imputed using the Michigan Imputation server (https://imputationserver.sph.umich.edu/). 
We used the HRC r1.1 as the reference panel for individuals of European and Asian ancestry and the 1000 Genomes Phase 3 v5 (n = 2,504) for individuals of African ancestry. Default imputation parameters were used. Due to data sharing restrictions and with the Epi25 cohort data located in the USA and the other cohorts located in the European Union, we were unable to merge the data or use the same imputation server. Postimputation QC was largely similar among all cohorts. The Epi25 cohort used an in-house pipeline, where imputed dosages were used for genome-wide association analyses, retaining SNPs with imputation INFO > 0.3 and removing SNPs with MAF < 1%, genotype coverage <0.98 or Hardy–Weinberg violations (P < 10−5). For all other cohorts, the same procedures as our previous study4 were used: imputed datasets were converted to hard-coded PLINK format, requiring a more stringent imputation filtering of INFO > 0.9 (as opposed to dosages, where imputation inaccuracy is incorporated in downstream analyses). Furthermore, we removed SNPs with MAF < 5%, genotype coverage <0.98 and Hardy–Weinberg violations (P < 10−5) (ref. 4). We removed SNPs <5% MAF in the Janssen Pharmaceuticals, Austrian GenEpa, Swiss GenEpa, Norwegian GenEpa and BPCCC cohorts for QC reasons, and note there will be a corresponding loss in study power for lower frequency SNPs in the ‘focal’ and ‘all epilepsy’ analyses.
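As an illustration of the sample-level heterozygosity filter described above (removal of individuals whose heterozygous/homozygous ratio lies more than 4 s.d. from the mean), the following minimal sketch flags outliers from a table of per-sample ratios. The function name, the input layout and the use of the sample standard deviation are assumptions made for illustration; this is not the pipeline used in the study.

```python
from statistics import mean, stdev

def het_outliers(ratios, n_sd=4.0):
    """Return sample IDs whose het/hom ratio is > n_sd s.d. from the cohort mean.

    ratios: dict mapping a (hypothetical) sample ID to its
            heterozygous/homozygous genotype ratio.
    """
    values = list(ratios.values())
    mu = mean(values)
    sd = stdev(values)  # sample standard deviation (an assumption)
    return sorted(s for s, r in ratios.items() if abs(r - mu) > n_sd * sd)
```

In practice this filter would be applied per cohort (and per ancestry group), because the expected ratio differs between genotyping arrays and populations.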
Genome-wide association analyses
GWAS of the Janssen Pharmaceuticals, Swiss GenEpa, Norwegian GenEpa and Austrian GenEpa cohorts was performed as a mega-analysis, as described previously4. GWAS of the Epi25 cohort was performed with a generalized mixed model using SAIGE v0.38 (ref. 63). SAIGE was run in two steps. First, we fit the null logistic mixed model to estimate the variance component and other model parameters. For this step, SNPs were filtered on call rate >0.98 and MAF > 5%, and SNPs were pruned to obtain approximately independent markers (window size of 100 SNPs and r2 > 0.3). Second, we tested for the association between each genetic variant and phenotypes by applying the saddlepoint approximation (SPA) to the score test statistics. Next, we performed P value-based fixed-effects meta-analyses with METAL v2020-05-05 (ref. 64) for each of the main phenotypes (‘all’, GGE and FE), as well as the subphenotypes, weighted by effective sample sizes (neff = 4/(1/ncases + 1/ncontrols)) to account for case–control imbalance. We performed multi-ancestry and European-only meta-analyses for the main phenotypes, and restricted the subphenotype analyses to Europeans only, due to limited sample size in other ancestries. We included all SNPs (~4.9 million, MAF > 1%) that were present in at least the previous mega-analysis and the Epi25 dataset, which together account for 88% of the total sample size. We calculated genomic inflation factors (λ), mean χ2 and LD-score regression intercepts to assess potential inflation of the test statistic. Because λ is known to scale with sample size, we also calculated λ1000, which is λ corrected for an equivalent sample size of 1,000 cases and 1,000 controls65. We limited these analyses to participants of European ancestry because LD-structure depends on ethnicity and Europeans constituted 92% of cases. For forest plots of genome-wide significant hits, Beta/SE was estimated from METAL z scores using a previously published formula22. 
For P–M plots, m values were generated using the default settings of the tool Metasoft v2.0.0 (ref. 66).
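The weighting scheme and the λ1000 rescaling described above can be sketched as follows. This is a minimal illustration, not the METAL implementation: it assumes the standard sample-size-based scheme in which each cohort's z score is weighted by the square root of its effective sample size, and it assumes the commonly used λ1000 rescaling formula. All function names are hypothetical.

```python
import math

def effective_n(n_cases, n_controls):
    """Effective sample size: neff = 4 / (1/ncases + 1/ncontrols)."""
    return 4.0 / (1.0 / n_cases + 1.0 / n_controls)

def weighted_z(z_scores, n_effs):
    """Combine per-cohort z scores using weights sqrt(neff) (assumed scheme)."""
    weights = [math.sqrt(n) for n in n_effs]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    denominator = math.sqrt(sum(w * w for w in weights))
    return numerator / denominator

def two_sided_p(z):
    """Two-sided P value for a standard normal z score."""
    return math.erfc(abs(z) / math.sqrt(2.0))

def lambda_1000(lam, n_cases, n_controls):
    """Rescale lambda to an equivalent study of 1,000 cases and 1,000 controls."""
    scale = (1.0 / n_cases + 1.0 / n_controls) / (1.0 / 1000 + 1.0 / 1000)
    return 1.0 + (lam - 1.0) * scale
```

Because neff shrinks toward four times the smaller group as case-control imbalance grows, a cohort with very many controls but few cases contributes less weight than its total sample size would suggest.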
Data sources for the biobank and deCODE genetics GWAS
Summary statistics for epilepsy GWAS were obtained from three population biobanks (UK Biobank67, Biobank Japan68,69 and FinnGen release R6 (ref. 70)) and from deCODE genetics71 (Iceland). The Biobank Japan, FinnGen and deCODE genetics epilepsy cases were further assigned into either ‘focal’ or ‘generalized’ epilepsy, whereas the UK Biobank samples were not subdivided based on seizure localization, as the relevant clinical details were unavailable to facilitate an accurate subdivision (see Supplementary Table 14 for sample sizes per biobank and deCODE genetics). Control data were population-matched samples with no history of epilepsy.
Fixed-effects meta-analyses were conducted using METAL v2020-05-05 (ref. 64), weighted by effective sample size (neff = 4/(1/ncases + 1/ncontrols)) to account for case–control imbalance.
UK Biobank
We identified people with epilepsy from the UK Biobank using an analysis of self-reported data, inpatient hospital episode statistics, death certificate diagnostic data and primary care diagnostic data as described elsewhere72. This allowed us to interrogate the evidence available to support a diagnosis of epilepsy rather than relying purely on UK Biobank-generated data fields 131048 and 131049 based on ICD-10 G40 mapping.
FinnGen
Epilepsy was determined with ICD-10 G40, ICD-9 345, ICD-8 345 and Social Insurance Institution of Finland (KELA) code 111. Exclusion criteria were ICD-9 3452/3453 and ICD-8 34520. GGE was determined with ICD-10 G40.3, ICD-9 345(0-3) and ICD-8 34519. Exclusion criteria were ICD-8 34511. FE was determined with ICD-10 G40.0, G40.1, G40.2, ICD-9 345(45) and ICD-8 3453.
deCODE genetics
Epilepsy was determined with ICD-10 G40 and ICD-9 345 excluding 3452/3453. GGE with ICD-10 G40.3/G40.4/G40.6/G40.7 or ICD-9 3450/3451/3456, and FE with ICD-10 G40.0/G40.1/G40.2 or ICD-9 3454/3455.
Biobank Japan
Cases were classified into ‘Broad_Epilepsy,’ being any form of epilepsy; ‘Idiopathic_Epilepsy,’ being epilepsy with onset under 40 years and no known cause or ‘Idiopathic_Focal_Epilepsy’ and ‘Idiopathic_Generalized_Epilepsy,’ where focal and generalized syndromes could be ascertained.
Control data were population-matched samples with no history of epilepsy. GWAS fixed-effects meta-analyses were conducted using METAL64. To account for case–control imbalance, the effective sample size for each cohort was calculated as neff = 4/(1/ncases + 1/ncontrols). GWAS Manhattan plots were generated using the qqman package73 in R v3.6.0. Genome-wide significant loci were mapped onto genes using the FUMA web platform18.
We performed three meta-analyses. As a primary analysis, we meta-analyzed all nonbiobank samples, then we meta-analyzed only biobank/deCODE genetics samples and finally, we performed a combined meta-analysis of biobank/deCODE genetics and nonbiobank samples.
Pleiotropy analysis
ASSET74 is a meta-analysis-based pleiotropy detection approach that identifies common or shared genetic effects between two or more related, but distinct traits. We used ASSET v2.2.0 with a genome-wide significance level of α = 5 × 10−8. We applied ASSET to the subset of European-ancestry samples, comprising 6,952 (3,244 + 3,708) GGE cases and 14,939 (5,344 + 9,095) FE cases from the Epi25 and our consortium as well as 42,434 partially overlapping controls from both consortia. Note that ASSET accounts for sample overlap in the analysis. Effect sizes, standard errors and effective sample sizes were taken from the main meta-analysis.
HLA association
Given the prior association of the HLA with autoimmune epilepsy75,76, we included a specific analysis of the HLA. HLA types and amino acid residues were imputed using CookHLA software v1.0.1 (ref. 26), with the 1000 Genomes Phase 3 used as a reference panel77. Samples were grouped by genetic ancestry for imputation.
Following imputation, association analysis was conducted using the HLA Analysis Toolkit (HATK) v1.2 (ref. 78). The following three phenotypes were analyzed: ‘all epilepsy’, FE and GGE. Samples from the ILAE and Epi25 datasets were analyzed separately, and the association results were meta-analyzed across datasets and ancestries using PLINK v1.9 (ref. 79).
Functional annotation
We annotated all genome-wide significant SNPs and tagged SNPs within the loci from our multi-ancestry meta-analyses. ANNOVAR v2017-07-17 was used to retrieve the location and function of each SNP80, the CADD score was used as a measure of predicted deleteriousness81 and chromatin states were incorporated from the ENCODE and NIH Roadmap Epigenomics Mapping Consortium14,82. We used FUMA v1.3.8 to define the independently significant SNPs within loci; that is, SNPs that were genome-wide significant but not in LD (r2 < 0.2 in Europeans) with the lead SNP in the locus.
MTAG
MTAG v1.0.8 (ref. 17) was used (with default settings) to increase the effective sample size from our European ancestry GGE subphenotype analysis by pairing it with the strongly correlated overall GGE GWAS with a larger sample size. MTAG accounts for sample overlap between traits and uses the fact that estimations of effect size and standard error of a primary GWAS, in this case GGE subtypes, can be improved by matching them to a genetically correlated secondary GWAS, in this case GGE17. Similarly, we applied MTAG to combine FE with GGE.
Gene mapping
To map genome-wide significant loci from our multi-ancestry meta-analyses to specific genes, we used FUMA v1.3.8 (ref. 18) with the same parameters as published previously4. We defined genome-wide significant loci as the region encompassing all SNPs with P < 10−4 that were in LD (r2 > 0.2) with the lead SNP (that is, the SNP with the strongest association within the region). We used a combination of positional mapping (within 250 kb from the locus), eQTL mapping (SNPs with FDR corrected eQTL P < 0.05 in blood or brain tissue) and 3D Chromatin Interaction Mapping (FDR P < 10−6 in brain tissue).
Genome-wide gene-based association study (GWGAS) and gene-set analyses
We performed the GWGAS using the default settings of MAGMA v1.08, as implemented in FUMA v1.3.8, which calculates an association P value based on all the associations of all SNPs within each gene in the GWAS19. Based on these GWGAS results, we performed competitive gene-set analyses with default MAGMA settings, using 15,483 default gene sets and GO-terms from MsigDB. In addition, we specifically assessed 18 curated gene sets involving different synaptic functions30.
TWAS
TWAS was performed with FUSION v3, with default settings20. We imputed gene expression based on our European-only GWAS (because the method relies on LD reference data) and eQTL data from the PsychENCODE consortium, which includes dorsolateral prefrontal cortex tissue from 1,695 individuals21.
SMR
SMR v1.03 is an additional method to assess the association between epilepsy and expression of specific genes22. Although TWAS and SMR have similar aims, the differences in methods and reference datasets result in complementary information. As opposed to the FUSION TWAS method, which uses multi-SNP imputation of gene expression, SMR uses Mendelian randomization to test whether the effect size of an SNP on epilepsy is mediated by the expression of specific genes. We performed SMR analyses with default settings, using European-only GWAS and the MetaBrain expression data as reference, a new eQTL dataset including 2,970 human brain samples83.
Sex-specific analyses
We performed a GWAS, as described above, for all epilepsy (13,889 female cases and 19,676 female controls; 12,259 male cases and 18,645 male controls) and GGE (3,946 female cases and 19,676 female controls; 2,603 male cases and 18,645 male controls) separately for participants of either sex, after which we performed fixed-effects meta-analyses with METAL to merge the different cohorts. We performed meta-analyses between the male and female GWAS with GWAMA v2.2.2 (ref. 84) to assess the heterogeneity of effect sizes between sexes and sex-differentiated associations35. Sex-differentiated analyses are meta-analyses between female-only and male-only GWAS, allowing for different effect sizes between the sexes, while sex-heterogeneity tests the difference in effect size for each SNP between female-only and male-only GWAS35.
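The relationship between the sex-differentiated and sex-heterogeneity tests can be sketched for a single SNP as follows. This is a minimal illustration under stated assumptions, not the GWAMA code: it assumes inverse-variance fixed-effects weighting, a 2-degree-of-freedom sex-differentiated statistic equal to the sum of the two sex-specific chi-square statistics, and a 1-degree-of-freedom heterogeneity statistic equal to that sum minus the sex-combined chi-square. The function name and return layout are hypothetical.

```python
import math

def sex_tests(beta_f, se_f, beta_m, se_m):
    """Sex-differentiated (2 df) and sex-heterogeneity (1 df) tests for one SNP."""
    w_f, w_m = 1.0 / se_f ** 2, 1.0 / se_m ** 2
    # Sex-combined fixed-effects estimate and its chi-square (1 df).
    beta_c = (w_f * beta_f + w_m * beta_m) / (w_f + w_m)
    chi2_combined = beta_c ** 2 * (w_f + w_m)
    # Sex-differentiated test: allows different effects in each sex (2 df).
    chi2_diff = (beta_f / se_f) ** 2 + (beta_m / se_m) ** 2
    # Heterogeneity: the part of the signal not explained by a shared effect.
    # Algebraically equal to (beta_f - beta_m)^2 / (se_f^2 + se_m^2).
    chi2_het = max(chi2_diff - chi2_combined, 0.0)
    return {
        "chi2_differentiated": chi2_diff,
        "p_differentiated": math.exp(-chi2_diff / 2.0),         # chi-square, 2 df
        "chi2_heterogeneity": chi2_het,
        "p_heterogeneity": math.erfc(math.sqrt(chi2_het / 2.0)),  # chi-square, 1 df
    }
```

A SNP with equal effects in both sexes gives a heterogeneity statistic of zero, whereas a SNP with opposite effects can have no sex-combined signal at all yet still be detected by the sex-differentiated test.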
Gene prioritization
We combined ten methods to prioritize the most likely biological candidate gene within each genome-wide significant locus. For each gene in each locus, we assessed the following criteria:
Missense: we assessed whether the SNPs tagged in the genome-wide significant locus contained an exonic missense variant in the gene, as annotated by ANNOVAR v2017-07-17.
TWAS: we assessed whether imputed gene expression was significantly associated with the epilepsy phenotype, based on the FUSION TWAS as described above, Bonferroni corrected for each mapped gene with expression information.
SMR: we assessed whether the gene had a significant SMR association with the epilepsy phenotype, based on the SMR analyses as described above, Bonferroni corrected for each mapped gene with expression information.
MAGMA: we assessed whether the gene was significantly associated with the epilepsy phenotype through a GWGAS analysis, Bonferroni corrected for each mapped gene.
PoPS: we calculated the polygenic priority score (PoPS)85, a method that combines GWAS summary statistics with biological pathways, gene expression and protein–protein interaction data, to pinpoint the most likely causal genes. We scored the gene with the highest PoPS score within each locus.
Brain expression: for each mapped gene, we calculated the mean expression in all brain and nonbrain tissues based on data from the GTEx project v8 (ref. 86). Next, we assessed whether the gene was more strongly expressed in brain tissues than nonbrain tissues, by comparing the average expression in all brain tissues with all nonbrain tissues.
Brain-coX: we assessed whether genes were prioritized as co-expressed with established epilepsy genes in more than a third of brain tissue resources used, using the tool brain-coX (Supplementary Fig. 26)87.
Target of AED: we assessed whether the gene is a known target of an anti-epileptic drug, as detailed in the drug–gene interaction database (www.DGidb.com; accessed on 26-11-2021) and a list of drug targets from a recent publication (Supplementary Data 10)88.
Knockout mouse: we assessed whether a knockout of the gene in a mouse model results in a nervous system (phenotype ID: MP:0003631) or a neurological/behavior phenotype (MP:0005386) in the Mouse Genome Informatics database (http://www.informatics.jax.org; accessed on 26-11-2021).
Monogenic epilepsy gene: we evaluated whether the gene is listed as a monogenic epilepsy gene, in a curated list maintained by the Epilepsy Research Center at the University of Melbourne89 (Supplementary Data 10).
Similar to previous studies4,90, we scored all genes based on the number of criteria being met (range: 0–10; all criteria had an equal weight). The gene with the highest score was chosen as the most likely implicated gene (see Supplementary Data 6 for a complete list of scores for all genes in each locus). We implicated both genes if they had an identical, highest score. We calculated Pearson correlation coefficients between the ten criteria (Supplementary Table 16) and note that most correlations were low (range: −0.13 to 0.39), suggesting that they convey complementary information.
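The equal-weight scoring and tie handling described above can be sketched as follows. The criterion keys, gene names and input layout are hypothetical and chosen only for illustration; each flag would be derived from the corresponding analysis listed above.

```python
# Hypothetical keys for the ten criteria; each contributes one point.
CRITERIA = (
    "missense", "twas", "smr", "magma", "pops",
    "brain_expression", "brain_cox", "asm_target",
    "knockout_mouse", "monogenic",
)

def score_gene(flags):
    """Count how many of the ten criteria a gene meets (range 0-10)."""
    return sum(1 for criterion in CRITERIA if flags.get(criterion, False))

def prioritize_locus(locus_genes):
    """Return (top_genes, top_score) for one locus.

    locus_genes: dict mapping gene name -> dict of criterion -> bool.
    All genes sharing the highest score are returned, so ties
    implicate more than one gene.
    """
    scores = {gene: score_gene(flags) for gene, flags in locus_genes.items()}
    best = max(scores.values())
    top = sorted(gene for gene, s in scores.items() if s == best)
    return top, best
```

For example, a locus in which two genes each meet three criteria would return both genes, mirroring the rule that both are implicated when they share an identical highest score.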
Long-distance expression regulation of BCL11A
Most eQTL databases, like PsychENCODE and MetaBrain, restrict eQTL analyses to 1 Mb distance between genes and SNPs. To specifically assess the hypothesis of long-distance regulation of BCL11A by the lead SNPs in the 2p16.1 epilepsy locus, we manually interrogated the MetaBrain database83 without distance restraints. Next, we calculated the association between the three lead SNPs in the locus (rs11688767, rs77876353 and rs13416557) with BCL11A expression.
Heritability analyses
We calculated SNP-based heritability on the European-only GWAS using LDAK v5.2, as it was recently shown to give more accurate heritability estimates for complex traits, when compared to other methods including LDSC91,92. We used default settings in LDAK and precalculated LD weights from 2,000 European (white British) reference samples under the BLD–LDAK SumHer model92. SNP-based heritabilities were converted to liability scale heritability estimates, using the following formula: $h^2_l = h^2_o \times \frac{K^2(1 - K)^2}{p(1 - p)\,Z^2}$, where $h^2_l$ and $h^2_o$ are the liability-scale and observed-scale heritabilities, K is the disease prevalence, p is the proportion of cases in the sample and Z is the standard normal density at the liability threshold. To decrease downward bias, we performed these calculations based on the effective sample sizes (see calculation above), after which p = 0.5 can be assumed93, with the same population prevalences as our previous study (Supplementary Table 10)4. The total amount of causally associated variants (that is, variants with nonzero additive genetic effect) underlying epilepsy risk was calculated by a causal mixture model (MiXeR) v1.2.0 (ref. 38). MiXeR uses a likelihood-based framework to estimate the amount of causal SNPs underlying a trait, without the need to pinpoint which specific SNPs are involved. Furthermore, MiXeR allows for power calculations to assess the required sample size to explain a certain proportion of SNP-based heritability by genome-wide significant SNPs.
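The liability-scale conversion above can be written as a short function. This is a minimal sketch that assumes the liability threshold is the standard normal quantile at 1 − K; the function name is hypothetical, and the default p = 0.5 reflects the effective-sample-size convention described in the text.

```python
from statistics import NormalDist

def liability_h2(h2_observed, prevalence, case_fraction=0.5):
    """Convert observed-scale SNP heritability to the liability scale.

    h2_observed:   observed-scale heritability (h2o)
    prevalence:    population prevalence of the disease (K)
    case_fraction: proportion of cases in the sample (p)
    """
    normal = NormalDist()
    threshold = normal.inv_cdf(1.0 - prevalence)  # liability threshold
    z = normal.pdf(threshold)                     # normal density at threshold (Z)
    k, p = prevalence, case_fraction
    return h2_observed * (k ** 2 * (1.0 - k) ** 2) / (p * (1.0 - p) * z ** 2)
```

The conversion factor depends strongly on the assumed prevalence K, so liability-scale estimates for rare subtypes are sensitive to the prevalence values chosen.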
Genomic SEM
Genomic SEM entails two stages of estimation29. In the first stage, the empirical genetic covariance matrix and sampling covariance matrix are estimated using an extension of multivariable LDSC. This matrix is extended to include SNP effects for the multivariate GWAS SEM. In the second stage, an SEM is specified, and its parameters are estimated such that the discrepancies in the model covariance matrix and the empirical covariance matrix are minimized. The Genomic SEM models are specified such that the SNP effect, defined by multiple traits, occurs at a level of a latent factor (Fg), and the model fit is assessed using model chi-square, Akaike information criterion and standardized root mean square residual. This method also provides evidence of heterogeneity between the phenotypes via the QSNP statistics, which show the extent to which the univariate regression effects of SNPs for each phenotype are explained by a common genetic factor. QSNP is a chi-square distributed statistic that can test whether SNPs act entirely through a common factor.
Enrichment analyses
We used MAGMA v1.08 (as implemented in FUMA) to perform tissue and cell-type enrichment based on our multi-ancestry meta-analyses. First, we assessed whether our GGE GWAS was enriched for specific tissues from the GTEx database. Similarly, we assessed the enrichment of genes expressed in the brain at 11 general developmental stages, using data from the BrainSpan consortium. Next, we assessed whether GGE was associated with specific cell types, by cross-referencing two single-cell RNA-sequencing databases of human developmental and adult brain samples. The PsychENCODE database contains RNA-sequencing data from 4,249 human brain cells from developmental stages and 27,412 human adult brain cells94. The Zhong dataset (GSE104276) contains RNA-sequencing data from 2,309 human brain cells at different stages of development95. We performed FDR correction across datasets to assess which cell types were significantly associated with GGE. As a sensitivity analysis, we performed stratified LDSC with default settings using the cell-specific gene expression weights from the PsychENCODE consortium to compare GABAergic with glutamatergic neuron enrichment96.
Genetic overlap with other diseases
Using the FUMA web application, we searched the GWAS catalog for previously reported associations with P < 5 × 10−8 for SNPs at all 26 genome-wide significant loci.
Genetic correlations between ‘all’, FE and GGE and 18 other traits were computed with LDSC v1.01, using default settings. For these analyses, we used our European-only GWAS. Traits highlighted by the GWAS catalog analysis and/or those with established epilepsy comorbidity were prioritized and pursued provided recent summary statistics were available for public download (Supplementary Table 12). Although estimates are in general consistent between LDSC and LDAK90, we decided to use LDSC as it is the more established method of the two for genetic correlations and used by almost all genetic correlation atlases and databases97,98.
We used a recently described bivariate causal mixture model (MiXeR) v1.2.0 to quantify the polygenic overlap between GGE with the same 18 traits as assessed with LDSC. Bivariate MiXeR analyses estimate the total amount of causal SNPs underlying each trait, after which it assesses how many of these SNPs are shared between two traits. Notably, the number of overlapping SNPs is calculated regardless of the direction of effect. This makes it different from overall genetic correlation analyses such as LDSC, where overlapping SNPs with mixed directions of effect can cancel each other out, resulting in low genetic correlation. We used the same publicly available summary statistics as used for LDSC (Supplementary Table 12), after which bivariate MiXeR was run with default settings.
Drug-repurposing analyses
We used a recently developed method that uses the GWAS for a disease to predict the relative efficacy of drugs for the disease39. We applied this method to ‘all’ epilepsy and GGE GWAS results, using (1) imputed gene expression data from the FUSION analyses, as described above, and (2) gene-based P values from MAGMA (see above), with default settings. We predicted the relative efficacy of 1,343 drugs in total (Supplementary Data 8). We determined if our predictions correctly identify (area under the receiver operating characteristic curve) and prioritize (median rank) known clinically effective antiseizure drugs, as previously described39. We determined the statistical significance of drug identification and prioritization results by comparing the results to those from a null distribution generated by performing 10^6 random permutations of the scores assigned to drugs.
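The permutation test for drug prioritization can be sketched as follows. This is a minimal illustration rather than the published method: it assumes that a higher score means higher predicted efficacy, that rank 1 is the best-ranked drug, and that significance is the proportion of permutations whose median rank for the known drugs is at least as good as the observed one. Function names, the default number of permutations and the add-one correction are assumptions.

```python
import random
from statistics import median

def median_rank(scores, known_drugs):
    """Median rank of known drugs; rank 1 = highest predicted efficacy."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    ranks = {drug: i + 1 for i, drug in enumerate(ordered)}
    return median(ranks[d] for d in known_drugs)

def permutation_p(scores, known_drugs, n_perm=10_000, seed=0):
    """Empirical P value for the observed median rank of known drugs."""
    rng = random.Random(seed)
    observed = median_rank(scores, known_drugs)
    drugs = list(scores)
    values = list(scores.values())
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(values)                      # permute scores across drugs
        permuted = dict(zip(drugs, values))
        if median_rank(permuted, known_drugs) <= observed:
            hits += 1
    # Add-one correction keeps the empirical P value strictly above zero.
    return observed, (hits + 1) / (n_perm + 1)
```

The same permutation scheme could be applied to the area under the receiver operating characteristic curve by replacing the median rank with that statistic.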
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41588-023-01485-w.
Supplementary information
Supplementary Tables 1–16, Supplementary Figs. 1–26, Supplementary Note, source code for Fig. 2 and Supplementary References.
Genome-wide significant SNPs across epilepsy types. Functional annotation of the 2,355 genome-wide significant SNPs across the 22 GGE loci and 612 SNPs from all epilepsy loci.
Genes mapped for the meta-analysis results. Gene mapping of the ‘all epilepsy’ meta-analysis and the GGE analysis using FUMA.
GWGAS results. Analysis of gene-based association score based on the aggregate of all SNPs inside each gene using MAGMA.
TWAS results. Analysis of differential gene expression in brain for epilepsy using FUSION TWAS.
SMR results. Analysis of potentially causal relationship between brain expression and epilepsy using SMR.
Scores of biological prioritization criteria for each mapped gene, of each genome-wide significant locus.
Gene-set analyses. Analysis of biological processes in association with GGE using MAGMA.
Median ranks and AUROCs of all drug groups.
Prediction of the relative efficacy of drugs for epilepsy.
Gene lists used in gene prioritization.
Acknowledgements
Some of the data reported in this study were collected as part of a project undertaken by the ILAE and some of the authors are experts selected by the ILAE. Opinions expressed by the authors, however, do not necessarily represent the policy or position of the ILAE.
This study received support from Science Foundation Ireland (SFI; 16/RC/3948), cofunded under the European Regional Development Fund, the Research Unit FOR-2715 of the German Research Foundation (MN: NO755/6-1 and NO755/13-1), from Wellcome Trust (grant 084730), European Union’s Seventh Framework Program (FP7/2007-2013) under grant agreement 279062 (EpiPGX), The Muir Maxwell Trust and the Epilepsy Society, UK and Fonds National de la Recherche Luxembourg (Research Unit FOR-2715, FNR grant INTER/DFG/21/16394868 MechEPI2) to P.M. and R. Krause. Part of this work was undertaken at University College London Hospitals, which received a proportion of funding from the NIHR Biomedical Research Centers funding scheme. Further support was received by a ‘Vrienden WKZ’ fund 1616091 (MING) to R. Stevelink and B.P.C.K., a National Health and Medical Research Council (NHMRC) of Australia Program Grant (1091593) to S.F.B. and I.E.S. and an NHMRC Investigator grant (APP1195236) to M.B. The Australian Government Research Training Program Scholarship (APP533086) provided by the Australian Commonwealth Government and the University of Melbourne supports K.L.O., a Wellcome Clinical Ph.D. Fellowship on the 4Ward North program (203914/Z/16/Z) supported D.L.-S., the UKRI MRC award MR/S02638X/1 and the NIHR Imperial Biomedical Research Center (BRC) support M.R.J., and Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP), Brazil (grant 2013/07559-3) supported I.L.-C. The funding bodies had no role in the study design, data collection, analysis and interpretation, or in writing the manuscript.
We thank the Epi25 principal investigators, local staff from individual cohorts and all patients with epilepsy who participated in research studies at local centers for making possible this global collaboration and resource to advance epilepsy genetics research. This work is part of the Centers for Common Disease Genomics (CCDG) program, funded by the National Human Genome Research Institute (NHGRI), The Eunice Kennedy Shriver National Institute of Child Health and Human Development and the National Heart, Lung and Blood Institute (NHLBI). CCDG-funded Epi25 research activities at the Broad Institute, including genomic data generation in the Broad Genomics Platform, were supported by NHGRI grant UM1 HG008895 (PIs: E. Lander, S. Gabriel, M. Daly, S. Kathiresan). The Genome Sequencing Program efforts were also supported by NHGRI under grant 5U01HG009088-02. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. We thank the Stanley Center for Psychiatric Research at the Broad Institute for supporting the genomic data generation efforts as well as the aggregation of control samples and cohorts to contribute to the Epi25 GWAS analyses. In particular, the Genomic Psychiatry Cohort controls were genotyped on the GSA-MD v1.0 by the Broad Genomics Platform with funding from NIH grant U01MH105641 and the Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard. The FINRISK controls were part of the FINRISK studies supported by the THL (formerly KTL: National Public Health Institute) through budgetary funds from the government, with additional funding from institutions such as the Academy of Finland, the European Union, ministries and national and international foundations and societies to support specific research purposes. 
The collection of the Hong Kong Osteoporosis Study (HKOS) control samples was funded by the Bone Health Fund and Research Grants Council—Early Career Scheme (project 27100416). Other control datasets included IBD NIDDK and samples from the Mass General Brigham (MGB) Biobank available from dbGaP under study accession number phs002018.v1.p1.
We acknowledge the participants and investigators of the FinnGen study. The FinnGen project is funded by two grants from Business Finland (HUS 4685/31/2016 and UH 4386/31/2016) and the following industry partners: AbbVie, AstraZeneca UK, Biogen MA, Bristol Myers Squibb (and Celgene Corporation & Celgene International II Sàrl), Genentech, Merck Sharp & Dohme Corp, Pfizer, GlaxoSmithKline Intellectual Property Development, Sanofi US Services, Maze Therapeutics, Janssen Biotech, Novartis AG and Boehringer Ingelheim. The following biobanks are acknowledged for delivering biobank samples to FinnGen: Auria Biobank (www.auria.fi/biopankki), THL Biobank (www.thl.fi/biobank), Helsinki Biobank (www.helsinginbiopankki.fi), Biobank Borealis of Northern Finland (https://www.ppshp.fi/Tutkimus-ja-opetus/Biopankki/Pages/Biobank-Borealis-briefly-in-English.aspx), Finnish Clinical Biobank Tampere (www.tays.fi/en-US/Research_and_development/Finnish_Clinical_Biobank_Tampere), Biobank of Eastern Finland (www.ita-suomenbiopankki.fi/en), Central Finland Biobank (www.ksshp.fi/fi-FI/Potilaalle/Biopankki), Finnish Red Cross Blood Service Biobank (www.veripalvelu.fi/verenluovutus/biopankkitoiminta) and Terveystalo Biobank (www.terveystalo.com/fi/Yritystietoa/Terveystalo-Biopankki/Biopankki/). All Finnish biobanks are members of BBMRI.fi infrastructure (www.bbmri.fi). Finnish Biobank Cooperative—FINBB (https://finbb.fi/)—is the coordinator of BBMRI-ERIC operations in Finland. The Finnish biobank data can be accessed through the Fingenious services (https://site.fingenious.fi/en/) managed by FINBB.
Source data
Correlation and P values for Fig. 2 as matrix.
Author contributions
Data analysis: Analytical design, imputation. O.M.A., M.B., C. Campbell (lead analyst), G.L.C., S.C. (lead analyst), Y.-C.A.F., E. Hassanin, B.P.C.K., R. Krause (data management), D. Lal, C.L., N.M., M.N., K.L.O., R. Stevelink (lead analyst). Data generation and quality control and management: L.B., D.R.B., J.P.B., R.J.B., G.L.C., F. Cerrato, S.S.C., C. Churchhouse, C. Cusick, Y.-C.A.F., N.G., H. Hakonarson, E.L.H., I.H., D.P.H., D.K., B.P.C.K., R. Krause, D. Lal, Z.L., C.L., I.L.-C., P.M., N.M., B.M.N., P.N., S.P., T. Sander, D.S., R. Stevelink, F. Zara, W.Z. Analysis coordination: G.L.C. (Cochair), B.P.C.K. (Cochair). External data resources and analysis: UK Biobank: C. Campbell, D.L.-S., R.H.T. BioBank Japan: Y. Kamatani, M. Kanai, M. Kato, Y.O. FinnGen: M.J.D., H.O.H., R. Kälviäinen, M.I.K., A. Palotie. deCODE genetics: S.M., E.Ó., H. Stefansson, K.S., U.U. Writing committee: O.M.A., M.B., S.F.B., C. Campbell, G.L.C., S.C., B.P.C.K., K.L.O., R. Stevelink (wrote first draft). Strategy committee: L.B., S.F.B. (Chair), R.J.B., G.L.C., H. Hakonarson, E.L.H., M.R.J., R. Kälviäinen, B.P.C.K., R. Krause, P. Kwan, D. Lal, H.L., Q.S.L., I.L.-C., D.H.L., T.J.O’B., S.M.S. Phenotyping committee: C.D., D.J.D., W.S.K., P. Kwan, D.H.L. (Chair), A.G.M., P. Striano. Governance committee: S.F.B., A. Compston, A.-E.L., D.H.L. Patient recruitment and phenotyping: B.A.-K., Z.A., E.A., A. Anderson, J.A., D.M.A., G.A., P.A., A. Avbersek, M.D.B., G.B., S.B., C. Barba, K. Barboza, F. Bartolomei, T. Bast, T. Baumgartner, B. Baykan, N. Bebek, A.J.B., F. Becker, C.A.B., B. Berghuis, S.F.B., A.B., C. Bianchini, F. Bisulli, I. Blatt, I. Borggraefe, C. Bosselmann, V. Braatz, K. Brockmann, R.J.B., R.M.B., H.C., E.C., L.C., C. Canavati, G.D.C., B.C., C.B.C., F. Chassoux, K.C., I.-J.C., S.-K.C., P.O.C., A.J.C., A. Coppola, M.C., P.C., J.J.C., L.K.D., G.-J.d.H., N.D., C.D., P.D., O.D., L.D.V., D.J.D., V.D., C.P.D., H.E.-N., C.E.E., C.A.E., A. Faucon, L. Ferguson, T.N. Ferraro, L. Ferri, M. Feucht, M. Fitzgerald, B. Fonferko-Shadrach, F. Fortunato, S. Franceschetti, J.A.F., E.F., M.G., A. Gambardella, E.B.G., T. Giangregorio, L.G., T. Glauser, E.G., A. Goldman, T. Granata, D.A.G., R.G., K.F.H., K.H., M.H., I.H., C.H., S.H., E. Hirsch, H. Hjalgrim, D.H., P.-C.H., M.I., L.L.I., Y.I., A.I., J.J.-K., L.J., M.R.J., R. Kälviäinen, M. Kanaan, A.-M.K., B.K., S.M.K., D.K.-N.T., J.K., Y. Kesim, N.K.-Z., C.K., H.E.K., K.M.K., G. Kluger, S.K., R.C.K., A.D.K., A.K., I. Kousiappa, M. Krenn, H.K., I. Krey, W.S.K., G. Kurlemann, R. Kuzniecky, P. Kwan, A. Labate, A. Lacey, S. Lauxmann, S.L.L., A.-E.L., J.R.L., H.L., G.L., N.L., Q.S.L., L. Licchetta, K.-L.L., D. Lindhout, T.L., I.L.-C., D.H.L., C.H.T.L., F.M., A.G.M., C.M.M., D.M., R.M., R.S.M., M.M., B.M., L.M., H.M., K.M.-S., I.M.N., W.N., B.N., C.R.J.C.N., T.J.O’B., Ç.Ö., S.S.P., E.P., M. Pendziwiat, W.O.P., R.P., T.P., A. Poduri, F. Pondrelli, R.H.W.P., M. Privitera, A. Rademacher, R.R., F. Ragona, S. Rau, M.I.R., B.M.R., P.S.R., S. Rhelms, A. Riva, F. Rosenow, P.R., A. Saarela, L.G.S., J.W.S., T. Sander, M.S., T. Scattergood, S.C.S., C.J.S., I.E.S., B.S., S.S., S.S.-B., A.S.-B., P. Scudieri, B.R.S., J.J.S., G.J.S., S.M.S., M.C.S., P.E.S., A.C.M.S., M.R.S., B.J.S., U.S., W.C.S., C.S., P. Striano, H. Stroink, A. Strzelczyk, R. Surges, T. Suzuki, K.M.T., R.S.T., G.A.T., E.T., L.L.T., O.T., P. Tinuper, M.T., P. Topaloglu, R.T., M.-H.T., B.T., D.T., A.U., P.V., L.V., A.v.B., A.V., E.P.G.V., F.V., S.v.B., R.v.W., R.G.W., Y.G.W., S. Weckhuysen, J.W., M. Weller, P.W.-W., M. Wolff, S. Wolking, D.W., K.Y., Z.Y., E.Y., S.Z., F. Zahnert, F. Zimprich, G.Z., Q.Z.A. Control cohorts: L.C.B., C.-L.C., J.G.E., A. Franke, H. Hakonarson, Y.-L.L., G.H.-Y.L., J.L.M., A.M.M., M.M.N., A. Palotie, F. Pangilinan, C.N.P., M.T.P., P. Sham, H. Stroink, G.N.T., W. Yang.
Consortium coordination: K.L.O.
Peer review
Peer review information
Nature Genetics thanks Manuel Mattheisen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
The GWAS summary statistics that support the findings of this study (for both multi-ancestry and European-only analyses) are publicly available at https://www.epigad.org/ and in the NHGRI-EBI GWAS Catalog at https://www.ebi.ac.uk/gwas/ (accession IDs: GCST90271608, GCST90271609, GCST90271610, GCST90271611, GCST90271612, GCST90271613, GCST90271614, GCST90271615, GCST90271616, GCST90271617, GCST90271618, GCST90271619 and GCST90271620). Individual-level GSA-MD v1.0 data for the Epi25 case samples and HKOS control samples are available in dbGaP/AnVIL under phs001489.v2.p2. GSA-MD v1.0 data for Genomic Psychiatry Cohort (GPC) control samples will be made available in dbGaP/AnVIL under study phs002041. Individual-level SNP genotype data for other cohorts used as controls in the Epi25 analyses are accessible via an application through the THL Biobank portal (https://thl-biobank.elixir-finland.org/) for FINRISK, and in dbGaP/AnVIL under study accession numbers phs001642 (NIDDK IBDGC) and phs002018.v1.p1 (MGB Biobank) (see Supplementary Note for more details). Data relating to UK Biobank are available via application to UK Biobank (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access). The FinnGen data (release R6) can be accessed through the Fingenious services (https://site.fingenious.fi/en/) managed by FINBB. The summary statistics of the Japanese GWAS in this study are publicly available from the National Bioscience Database Center (https://biosciencedbc.jp/en) under research ID: hum0014. We also accessed data from the following online database: www.DGidb.com (accessed on 26 November 2021). Source data are provided with this paper.
Code availability
No custom code was used in this study. Publicly available software tools were used to perform genetic analyses and are referenced throughout the manuscript.
Competing interests
G.L.C. is in receipt of research funding from Congenica and Janssen Pharmaceuticals and has conducted consultancy for Ono Pharmaceuticals. S.F.B. received funding from UCB Pharma and Eisai and has been a consultant for Praxis Precision Medicines and Sequiris. Q.S.L. is an employee of Janssen Research & Development, LLC and a shareholder in Johnson & Johnson, which is the parent company of the Janssen companies. B.M.N. currently serves as a member of the scientific advisory board at Deep Genomics and Neumora (previously RBNC) and as a consultant for Camp4 Therapeutics. S.P. is an employee and shareholder of AstraZeneca. U.U., S.M., H. Stefansson and K.S. are employees of deCODE genetics/Amgen.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A list of authors and their affiliations appears at the end of the paper.
Contributor Information
International League Against Epilepsy Consortium on Complex Epilepsies:
Remi Stevelink, Ciarán Campbell, Siwei Chen, Bassel Abou-Khalil, Oluyomi M. Adesoji, Zaid Afawi, Elisabetta Amadori, Alison Anderson, Joseph Anderson, Danielle M. Andrade, Grazia Annesi, Pauls Auce, Andreja Avbersek, Melanie Bahlo, Mark D. Baker, Ganna Balagura, Simona Balestrini, Carmen Barba, Karen Barboza, Fabrice Bartolomei, Thomas Bast, Larry Baum, Tobias Baumgartner, Betül Baykan, Nerses Bebek, Albert J. Becker, Felicitas Becker, Caitlin A. Bennett, Bianca Berghuis, Samuel F. Berkovic, Ahmad Beydoun, Claudia Bianchini, Francesca Bisulli, Ilan Blatt, Dheeraj R. Bobbili, Ingo Borggraefe, Christian Bosselmann, Vera Braatz, Jonathan P. Bradfield, Knut Brockmann, Lawrence C. Brody, Russell J. Buono, Robyn M. Busch, Hande Caglayan, Ellen Campbell, Laura Canafoglia, Christina Canavati, Gregory D. Cascino, Barbara Castellotti, Claudia B. Catarino, Gianpiero L. Cavalleri, Felecia Cerrato, Francine Chassoux, Stacey S. Cherny, Ching-Lung Cheung, Krishna Chinthapalli, I-Jun Chou, Seo-Kyung Chung, Claire Churchhouse, Peggy O. Clark, Andrew J. Cole, Alastair Compston, Antonietta Coppola, Mahgenn Cosico, Patrick Cossette, John J. Craig, Caroline Cusick, Mark J. Daly, Lea K. Davis, Gerrit-Jan de Haan, Norman Delanty, Chantal Depondt, Philippe Derambure, Orrin Devinsky, Lidia Di Vito, Dennis J. Dlugos, Viola Doccini, Colin P. Doherty, Hany El-Naggar, Christian E. Elger, Colin A. Ellis, Johan G. Eriksson, Annika Faucon, Yen-Chen A. Feng, Lisa Ferguson, Thomas N. Ferraro, Lorenzo Ferri, Martha Feucht, Mark Fitzgerald, Beata Fonferko-Shadrach, Francesco Fortunato, Silvana Franceschetti, Andre Franke, Jacqueline A. French, Elena Freri, Monica Gagliardi, Antonio Gambardella, Eric B. Geller, Tania Giangregorio, Leif Gjerstad, Tracy Glauser, Ethan Goldberg, Alicia Goldman, Tiziana Granata, David A. Greenberg, Renzo Guerrini, Namrata Gupta, Kevin F. Haas, Hakon Hakonarson, Kerstin Hallmann, Emadeldin Hassanin, Manu Hegde, Erin L. Heinzen, Ingo Helbig, Christian Hengsbach, Henrike O. Heyne, Shinichi Hirose, Edouard Hirsch, Helle Hjalgrim, Daniel P. Howrigan, Donald Hucks, Po-Cheng Hung, Michele Iacomino, Lukas L. Imbach, Yushi Inoue, Atsushi Ishii, Jennifer Jamnadas-Khoda, Lara Jehi, Michael R. Johnson, Reetta Kälviäinen, Yoichiro Kamatani, Moien Kanaan, Masahiro Kanai, Anne-Mari Kantanen, Bülent Kara, Symon M. Kariuki, Dalia Kasperavičiūte, Dorothee Kasteleijn-Nolst Trenite, Mitsuhiro Kato, Josua Kegele, Yeşim Kesim, Nathalie Khoueiry-Zgheib, Chontelle King, Heidi E. Kirsch, Karl M. Klein, Gerhard Kluger, Susanne Knake, Robert C. Knowlton, Bobby P. C. Koeleman, Amos D. Korczyn, Andreas Koupparis, Ioanna Kousiappa, Roland Krause, Martin Krenn, Heinz Krestel, Ilona Krey, Wolfram S. Kunz, Mitja I. Kurki, Gerhard Kurlemann, Ruben Kuzniecky, Patrick Kwan, Angelo Labate, Austin Lacey, Dennis Lal, Zied Landoulsi, Yu-Lung Lau, Stephen Lauxmann, Stephanie L. Leech, Anna-Elina Lehesjoki, Johannes R. Lemke, Holger Lerche, Gaetan Lesca, Costin Leu, Naomi Lewin, David Lewis-Smith, Gloria H.-Y. Li, Qingqin S. Li, Laura Licchetta, Kuang-Lin Lin, Dick Lindhout, Tarja Linnankivi, Iscia Lopes-Cendes, Daniel H. Lowenstein, Colin H. T. Lui, Francesca Madia, Sigurdur Magnusson, Anthony G. Marson, Patrick May, Christopher M. McGraw, Davide Mei, James L. Mills, Raffaella Minardi, Nasir Mirza, Rikke S. Møller, Anne M. Molloy, Martino Montomoli, Barbara Mostacci, Lorenzo Muccioli, Hiltrud Muhle, Karen Müller-Schlüter, Imad M. Najm, Wassim Nasreddine, Benjamin M. Neale, Bernd Neubauer, Charles R. J. C. Newton, Markus M. Nöthen, Michael Nothnagel, Peter Nürnberg, Terence J. O’Brien, Yukinori Okada, Elías Ólafsson, Karen L. Oliver, Çiğdem Özkara, Aarno Palotie, Faith Pangilinan, Savvas S. Papacostas, Elena Parrini, Carlos N. Pato, Michele T. Pato, Manuela Pendziwiat, Slavé Petrovski, William O. Pickrell, Rebecca Pinsky, Tommaso Pippucci, Annapurna Poduri, Federica Pondrelli, Rob H. W. Powell, Michael Privitera, Annika Rademacher, Rodney Radtke, Francesca Ragona, Sarah Rau, Mark I. Rees, Brigid M. Regan, Philipp S. Reif, Sylvain Rhelms, Antonella Riva, Felix Rosenow, Philippe Ryvlin, Anni Saarela, Lynette G. Sadleir, Josemir W. Sander, Thomas Sander, Marcello Scala, Theresa Scattergood, Steven C. Schachter, Christoph J. Schankin, Ingrid E. Scheffer, Bettina Schmitz, Susanne Schoch, Susanne Schubert-Bast, Andreas Schulze-Bonhage, Paolo Scudieri, Pak Sham, Beth R. Sheidley, Jerry J. Shih, Graeme J. Sills, Sanjay M. Sisodiya, Michael C. Smith, Philip E. Smith, Anja C. M. Sonsma, Doug Speed, Michael R. Sperling, Hreinn Stefansson, Kári Stefansson, Bernhard J. Steinhoff, Ulrich Stephani, William C. Stewart, Carlotta Stipa, Pasquale Striano, Hans Stroink, Adam Strzelczyk, Rainer Surges, Toshimitsu Suzuki, K. Meng Tan, R. S. Taneja, George A. Tanteles, Erik Taubøll, Liu Lin Thio, G. Neil Thomas, Rhys H. Thomas, Oskari Timonen, Paolo Tinuper, Marian Todaro, Pınar Topaloğlu, Rossana Tozzi, Meng-Han Tsai, Birute Tumiene, Dilsad Turkdogan, Unnur Unnsteinsdóttir, Algirdas Utkus, Priya Vaidiswaran, Luc Valton, Andreas van Baalen, Annalisa Vetro, Eileen P. G. Vining, Frank Visscher, Sophie von Brauchitsch, Randi von Wrede, Ryan G. Wagner, Yvonne G. Weber, Sarah Weckhuysen, Judith Weisenberg, Michael Weller, Peter Widdess-Walsh, Markus Wolff, Stefan Wolking, David Wu, Kazuhiro Yamakawa, Wanling Yang, Zuhal Yapıcı, Emrah Yücesan, Sara Zagaglia, Felix Zahnert, Federico Zara, Wei Zhou, Fritz Zimprich, Gábor Zsurka, and Quratulain Zulfiqar Ali
References
- 1. Fisher RS, et al. ILAE official report: a practical clinical definition of epilepsy. Epilepsia. 2014;55:475–482. doi: 10.1111/epi.12550.
- 2. Fiest KM, et al. Prevalence and incidence of epilepsy: a systematic review and meta-analysis of international studies. Neurology. 2017;88:296–303. doi: 10.1212/WNL.0000000000003509.
- 3. Scheffer IE, et al. ILAE classification of the epilepsies: position paper of the ILAE Commission for Classification and Terminology. Epilepsia. 2017;58:512–521. doi: 10.1111/epi.13709.
- 4. International League Against Epilepsy Consortium on Complex Epilepsies. Genome-wide mega-analysis identifies 16 loci and highlights diverse biological mechanisms in the common epilepsies. Nat. Commun. 2018;9:5269. doi: 10.1038/s41467-018-07524-z.
- 5. Epi4K Consortium & Epilepsy Phenome/Genome Project. Ultra-rare genetic variation in common epilepsies: a case–control sequencing study. Lancet Neurol. 2017;16:135–143.
- 6. Leu C, et al. Polygenic burden in focal and generalized epilepsies. Brain. 2019;142:3473–3481. doi: 10.1093/brain/awz292.
- 7. Koko M, et al. Distinct gene-set burden patterns underlie common generalized and focal epilepsies. EBioMedicine. 2021;72:103588. doi: 10.1016/j.ebiom.2021.103588.
- 8. McTague A, Howell KB, Cross JH, Kurian MA, Scheffer IE. The genetic landscape of the epileptic encephalopathies of infancy and childhood. Lancet Neurol. 2016;15:304–316. doi: 10.1016/S1474-4422(15)00250-1.
- 9. Speed D, et al. Describing the genetic architecture of epilepsy through heritability analysis. Brain. 2014;137:2680–2689. doi: 10.1093/brain/awu206.
- 10. Motelow JE, et al. Sub-genic intolerance, ClinVar, and the epilepsies: a whole-exome sequencing study of 29,165 individuals. Am. J. Hum. Genet. 2021;108:965–982. doi: 10.1016/j.ajhg.2021.04.009.
- 11. Chen Z, Brodie MJ, Liew D, Kwan P. Treatment outcomes in patients with newly diagnosed epilepsy treated with established and new antiepileptic drugs: a 30-year longitudinal cohort study. JAMA Neurol. 2018;75:279–286. doi: 10.1001/jamaneurol.2017.3949.
- 12. Devinsky O, et al. Epilepsy. Nat. Rev. Dis. Primers. 2018;4:18024. doi: 10.1038/nrdp.2018.24.
- 13. Bulik-Sullivan BK, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 2015;47:291–295. doi: 10.1038/ng.3211.
- 14. Ernst J, Kellis M. ChromHMM: automating chromatin-state discovery and characterization. Nat. Methods. 2012;9:215–216. doi: 10.1038/nmeth.1906.
- 15. Kircher M, et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 2014;46:310–315. doi: 10.1038/ng.2892.
- 16. Hnisz D, et al. Super-enhancers in the control of cell identity and disease. Cell. 2013;155:934–947. doi: 10.1016/j.cell.2013.09.053.
- 17. Turley P, et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat. Genet. 2018;50:229–237. doi: 10.1038/s41588-017-0009-4.
- 18. Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 2017;8:1826. doi: 10.1038/s41467-017-01261-5.
- 19. De Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 2015;11:e1004219. doi: 10.1371/journal.pcbi.1004219.
- 20. Gusev A, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 2016;48:245–252. doi: 10.1038/ng.3506.
- 21. Gandal MJ, et al. Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder. Science. 2018;362:eaat8127. doi: 10.1126/science.aat8127.
- 22. Zhu Z, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 2016;48:481–487. doi: 10.1038/ng.3538.
- 23. Xu C, et al. Knockdown of RMI1 impairs DNA repair under DNA replication stress. Biochem. Biophys. Res. Commun. 2017;494:158–164. doi: 10.1016/j.bbrc.2017.10.062.
- 24. International League Against Epilepsy Consortium on Complex Epilepsies. Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies. Lancet Neurol. 2014;13:893–903. doi: 10.1016/S1474-4422(14)70171-1.
- 25. Yoshida M, et al. Identification of novel BCL11A variants in patients with epileptic encephalopathy: expanding the phenotypic spectrum. Clin. Genet. 2018;93:368–373. doi: 10.1111/cge.13067.
- 26. Cook S, et al. Accurate imputation of human leukocyte antigens with CookHLA. Nat. Commun. 2021;12:1264. doi: 10.1038/s41467-021-21541-5.
- 27. Holland D, et al. Beyond SNP heritability: polygenicity and discoverability of phenotypes estimated with a univariate Gaussian mixture model. PLoS Genet. 2020;16:e1008612. doi: 10.1371/journal.pgen.1008612.
- 28. Bulik-Sullivan B, et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 2015;47:1236–1241. doi: 10.1038/ng.3406.
- 29. Grotzinger AD, et al. Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat. Hum. Behav. 2019;3:513–525. doi: 10.1038/s41562-019-0566-x.
- 30. Ruano D, et al. Functional gene group analysis reveals a role of synaptic heterotrimeric G proteins in cognitive ability. Am. J. Hum. Genet. 2010;86:113–125. doi: 10.1016/j.ajhg.2009.12.006.
- 31. Lukyanetz EA, Shkryl VM, Kostyuk PG. Selective blockade of N-type calcium channels by levetiracetam. Epilepsia. 2002;43:9–18. doi: 10.1046/j.1528-1157.2002.24501.x.
- 32. Wang SJ, Huang CC, Hsu KS, Tsai JJ, Gean PW. Inhibition of N-type calcium currents by lamotrigine in rat amygdalar neurones. Neuroreport. 1996;7:3037–3040. doi: 10.1097/00001756-199611250-00048.
- 33. Marson A, et al. The SANAD II study of the effectiveness and cost-effectiveness of levetiracetam, zonisamide, or lamotrigine for newly diagnosed focal epilepsy: an open-label, non-inferiority, multicentre, phase 4, randomised controlled trial. Lancet. 2021;397:1363–1374. doi: 10.1016/S0140-6736(21)00247-6.
- 34. Christensen J, Kjeldsen MJ, Andersen H, Friis ML, Sidenius P. Gender differences in epilepsy. Epilepsia. 2005;46:956–960. doi: 10.1111/j.1528-1167.2005.51204.x.
- 35. Magi R, Lindgren CM, Morris AP. Meta-analysis of sex-specific genome-wide association studies. Genet. Epidemiol. 2010;34:846–853. doi: 10.1002/gepi.20540.
- 36. Gaborit N, et al. Gender-related differences in ion-channel and transporter subunit expression in non-diseased human hearts. J. Mol. Cell. Cardiol. 2010;49:639–646. doi: 10.1016/j.yjmcc.2010.06.005.
- 37. Buniello A, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47:D1005–D1012. doi: 10.1093/nar/gky1120.
- 38. Frei O, et al. Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation. Nat. Commun. 2019;10:2417. doi: 10.1038/s41467-019-10310-0.
- 39. Mirza N, et al. Using common genetic variants to find drugs for common epilepsies. Brain Commun. 2021;3:fcab287. doi: 10.1093/braincomms/fcab287.
- 40. Bourgeois BFD. Chronic management of seizures in the syndromes of idiopathic generalized epilepsy. Epilepsia. 2003;44:27–32. doi: 10.1046/j.1528-1157.44.s.2.1.x.
- 41. Marson AG, et al. The SANAD study of effectiveness of valproate, lamotrigine, or topiramate for generalised and unclassifiable epilepsy: an unblinded randomised controlled trial. Lancet. 2007;369:1016–1026. doi: 10.1016/S0140-6736(07)60461-9.
- 42. Punetha J, et al. Biallelic CACNA2D2 variants in epileptic encephalopathy and cerebellar atrophy. Ann. Clin. Transl. Neurol. 2019;6:1395–1406. doi: 10.1002/acn3.50824.
- 43. Fariello RG. Safinamide. Neurotherapeutics. 2007;4:110–116. doi: 10.1016/j.nurt.2006.11.011.
- 44. Alsaegh H, Eweis H, Kamal F, Alrafiah A. Celecoxib decrease seizures susceptibility in a rat model of inflammation by inhibiting HMGB1 translocation. Pharmaceuticals. 2021;14:380. doi: 10.3390/ph14040380.
- 45. Johannesen KM, et al. Genotype-phenotype correlations in SCN8A-related disorders reveal prognostic and therapeutic implications. Brain. 2021;145:2991–3009. doi: 10.1093/brain/awab321.
- 46. Ma M-G, et al. RYR2 mutations are associated with benign epilepsy of childhood with centrotemporal spikes with or without arrhythmia. Front. Neurosci. 2021;15:629610. doi: 10.3389/fnins.2021.629610.
- 47. Yap SM, Smyth S. Ryanodine receptor 2 (RYR2) mutation: a potentially novel neurocardiac calcium channelopathy manifesting as primary generalised epilepsy. Seizure. 2019;67:11–14. doi: 10.1016/j.seizure.2019.02.017.
- 48. EPICURE Consortium, et al. Genome-wide association analysis of genetic generalized epilepsies implicates susceptibility loci at 1q43, 2p16.1, 2q22.3 and 17q21.32. Hum. Mol. Genet. 2012;21:5359–5372. doi: 10.1093/hmg/dds373.
- 49. Canela-Xandri O, Rawlik K, Tenesa A. An atlas of genetic associations in UK Biobank. Nat. Genet. 2018;50:1593–1599. doi: 10.1038/s41588-018-0248-z.
- 50. Heinzen EL. Somatic variants in epilepsy—advancing gene discovery and disease mechanisms. Curr. Opin. Genet. Dev. 2020;65:1–7. doi: 10.1016/j.gde.2020.04.004.
- 51. Beesley LJ, et al. The emerging landscape of health research based on biobanks linked to electronic health records: existing resources, statistical challenges, and potential opportunities. Stat. Med. 2020;39:773–800. doi: 10.1002/sim.8445.
- 52. Xue A, et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat. Commun. 2018;9:2941. doi: 10.1038/s41467-018-04951-w.
- 53. Hautakangas H, et al. Genome-wide analysis of 102,084 migraine cases identifies 123 risk loci and subtype-specific risk alleles. Nat. Genet. 2022;54:152–160. doi: 10.1038/s41588-021-00990-0.
- 54. Wightman DP, et al. A genome-wide association study with 1,126,563 individuals identifies new risk loci for Alzheimer’s disease. Nat. Genet. 2021;53:1276–1282. doi: 10.1038/s41588-021-00921-z.
- 55. International Multiple Sclerosis Genetics Consortium. Multiple sclerosis genomic map implicates peripheral immune cells and microglia in susceptibility. Science. 2019;365:eaav7188. doi: 10.1126/science.aav7188.
- 56. Wood MD, Gillard M. Evidence for a differential interaction of brivaracetam and levetiracetam with the synaptic vesicle 2A protein. Epilepsia. 2017;58:255–262. doi: 10.1111/epi.13638.
- 57. Singh T, et al. Rare coding variants in ten genes confer substantial risk for schizophrenia. Nature. 2022;604:509–516. doi: 10.1038/s41586-022-04556-w.
- 58. Van Rheenen W, et al. Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biology. Nat. Genet. 2021;53:1636–1648. doi: 10.1038/s41588-021-00973-1.
- 59. Reay WR, Cairns MJ. Advancing the use of genome-wide association studies for drug repurposing. Nat. Rev. Genet. 2021;22:658–671. doi: 10.1038/s41576-021-00387-z.
- 60. Loh P-R, Palamara PF, Price AL. Fast and accurate long-range phasing in a UK Biobank cohort. Nat. Genet. 2016;48:811–816. doi: 10.1038/ng.3571.
- 61. Rubinacci S, Ribeiro DM, Hofmeister RJ, Delaneau O. Efficient phasing and imputation of low-coverage sequencing data using large reference panels. Nat. Genet. 2021;53:120–126. doi: 10.1038/s41588-020-00756-0.
- 62. McCarthy S, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 2016;48:1279–1283. doi: 10.1038/ng.3643.
- 63. Zhou W, et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 2018;50:1335–1341. doi: 10.1038/s41588-018-0184-y.
- 64. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340.
- 65. De Bakker PIW, et al. Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum. Mol. Genet. 2008;17:R122–R128. doi: 10.1093/hmg/ddn288.
- 66. Han B, Eskin E. Interpreting meta-analyses of genome-wide association studies. PLoS Genet. 2012;8:e1002555. doi: 10.1371/journal.pgen.1002555.
- 67.Sudlow C, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:e1001779. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Nagai A, et al. Overview of the BioBank Japan Project: study design and profile. J. Epidemiol. 2017;27:S2–S8. doi: 10.1016/j.je.2016.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Ishigaki K, et al. Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat. Genet. 2020;52:669–679. doi: 10.1038/s41588-020-0640-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Locke AE, et al. Exome sequencing of Finnish isolates enhances rare-variant association power. Nature. 2019;572:323–328. doi: 10.1038/s41586-019-1457-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Gudbjartsson DF, et al. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet. 2015;47:435–444. doi: 10.1038/ng.3247. [DOI] [PubMed] [Google Scholar]
- 72.Campbell, C. et al. Polygenic risk score analysis reveals shared genetic burden between epilepsy and psychiatric comorbidities. Preprint at medRxiv10.1101/2023.07.04.23292071 (2023).
- 73.Turner, S. D. qqman: an R package for visualizing GWAS results using Q–Q and Manhattan plots. J. Open Source Softw. 3, 731 (2018).
- 74.Bhattacharjee S, et al. A subset-based approach improves power and interpretation for the combined analysis of genetic association studies of heterogeneous traits. Am. J. Hum. Genet. 2012;90:821–835. doi: 10.1016/j.ajhg.2012.03.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Kim T-J, et al. Anti-LGI1 encephalitis is associated with unique HLA subtypes. Ann. Neurol. 2017;81:183–192. doi: 10.1002/ana.24860. [DOI] [PubMed] [Google Scholar]
- 76.Van Sonderen A, et al. Anti-LGI1 encephalitis is strongly associated with HLA-DR7 and HLA-DRB4. Ann. Neurol. 2017;81:193–198. doi: 10.1002/ana.24858. [DOI] [PubMed] [Google Scholar]
- 77.1000 Genomes Project Consortium. et al. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Choi W, Luo Y, Raychaudhuri S, Han B. HATK: HLA analysis toolkit. Bioinformatics. 2021;37:416–418. doi: 10.1093/bioinformatics/btaa684. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Chang CC, et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019;47:D886–D894. doi: 10.1093/nar/gky1016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Roadmap Epigenomics Consortium. et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.De Klein N, et al. Brain expression quantitative trait locus and network analysis reveals downstream effects and putative drivers for brain-related diseases. Nat. Genet. 2023;55:377–388. doi: 10.1038/s41588-023-01300-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Mägi R, Morris AP. GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics. 2010;11:288. doi: 10.1186/1471-2105-11-288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Weeks EM, et al. Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases. Nat. Genet. 2023 doi: 10.1038/s41588-023-01443-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.GTEx Consortium. et al. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–213. doi: 10.1038/nature24277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Freytag S, Burgess R, Oliver KL, Bahlo M. Brain-coX: investigating and visualising gene co-expression in seven human brain transcriptomic datasets. Genome Med. 2017;9:55. doi: 10.1186/s13073-017-0444-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Rodriguez-Acevedo AJ, Gordon LG, Waddell N, Hollway G, Vadlamudi L. Developing a gene panel for pharmacoresistant epilepsy: a review of epilepsy pharmacogenetics. Pharmacogenomics. 2021;22:225–234. doi: 10.2217/pgs-2020-0145. [DOI] [PubMed] [Google Scholar]
- 89.Oliver KL, et al. Genes4Epilepsy: an epilepsy gene resource. Epilepsia. 2023;64:1368–1375. doi: 10.1111/epi.17547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Okada Y, et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature. 2014;506:376–381. doi: 10.1038/nature12873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Speed D, Balding DJ. SumHer better estimates the SNP heritability of complex traits from summary statistics. Nat. Genet. 2019;51:277–284. doi: 10.1038/s41588-018-0279-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Speed D, Holmes J, Balding DJ. Evaluating and improving heritability models using summary statistics. Nat. Genet. 2020;52:458–462. doi: 10.1038/s41588-020-0600-y. [DOI] [PubMed] [Google Scholar]
- 93.Grotzinger AD, de la Fuente J, Nivard MG, Tucker-Drob EM. Pervasive downward bias in estimates of liability scale heritability in GWAS meta-analysis: a simple solution. Biol. Psychiatry. 2023;93:29–36. doi: 10.1016/j.biopsych.2022.05.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Wang D, et al. Comprehensive functional genomic resource and integrative model for the human brain. Science. 2018;362:eaat8464. doi: 10.1126/science.aat8464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Zhong S, et al. A single-cell RNA-seq survey of the developmental landscape of the human prefrontal cortex. Nature. 2018;555:524–528. doi: 10.1038/nature25980. [DOI] [PubMed] [Google Scholar]
- 96.Finucane HK, et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 2018;50:621–629. doi: 10.1038/s41588-018-0081-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Brainstorm Consortium. et al. Analysis of shared heritability in common disorders of the brain. Science. 2018;360:eaap8757. doi: 10.1126/science.aap8757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Zheng J, et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics. 2017;33:272–279. doi: 10.1093/bioinformatics/btw613. [DOI] [PMC free article] [PubMed] [Google Scholar]
Supplementary Materials
Supplementary Tables 1–16, Supplementary Figs. 1–26, Supplementary Note, source code for Fig. 2 and Supplementary References.
Genome-wide significant SNPs across epilepsy types. Functional annotation of the 2,355 genome-wide significant SNPs across the 22 GGE loci and 612 SNPs from all epilepsy loci.
Genes mapped for the meta-analysis results. Gene mapping of the ‘all epilepsy’ meta-analysis and the GGE analysis using FUMA.
GWGAS results. Gene-based association scores, computed with MAGMA from the aggregate of all SNPs within each gene.
TWAS results. Analysis of differential gene expression in brain for epilepsy using FUSION TWAS.
SMR results. Analysis of potentially causal relationship between brain expression and epilepsy using SMR.
Scores of biological prioritization criteria for each mapped gene at each genome-wide significant locus.
Gene-set analyses. Analysis of biological processes in association with GGE using MAGMA.
Median ranks and AUROCs of all drug groups.
Prediction of the relative efficacy of drugs for epilepsy.
Gene lists used in gene prioritization.
Data Availability Statement
The GWAS summary statistics that support the findings of this study (for both multi-ancestry and European-only analyses) are publicly available at https://www.epigad.org/ and in the NHGRI-EBI GWAS Catalog at https://www.ebi.ac.uk/gwas/ (accession IDs: GCST90271608, GCST90271609, GCST90271610, GCST90271611, GCST90271612, GCST90271613, GCST90271614, GCST90271615, GCST90271616, GCST90271617, GCST90271618, GCST90271619 and GCST90271620). Individual-level GSA-MD v1.0 data for the Epi25 case samples and HKOS control samples are available in dbGaP/AnVIL under phs001489.v2.p2. GSA-MD v1.0 data for the Genomic Psychiatry Cohort (GPC) control samples will be made available in dbGaP/AnVIL under study phs002041. Individual-level SNP genotype data for other cohorts used as controls in the Epi25 analyses are accessible via an application through the THL Biobank portal (https://thl-biobank.elixir-finland.org/) for FINRISK, and in dbGaP/AnVIL under study accession numbers phs001642 (NIDDK IBDGC) and phs002018.v1.p1 (MGB Biobank) (see Supplementary Note for more details). Data relating to UK Biobank are available via application to UK Biobank (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access). The FinnGen data (release R6) can be accessed through the Fingenious services (https://site.fingenious.fi/en/) managed by FINBB. The summary statistics of the Japanese GWAS in this study are publicly available from the National Bioscience Database Center (https://biosciencedbc.jp/en) under research ID: hum0014. We also accessed data from the following online database: www.DGidb.com (accessed on 26 November 2021). Source data are provided with this paper.
No custom code was used in this study. Publicly available software tools were used to perform genetic analyses and are referenced throughout the manuscript.