Abstract
Objective
Systemic lupus erythematosus (SLE), an autoimmune disorder, has been associated with nearly 100 susceptibility loci. Nevertheless, these loci only partially explain SLE heritability and their putative causal variants are rarely prioritised, which make challenging to elucidate disease biology. To detect new SLE loci and causal variants, we performed the largest genome-wide meta-analysis for SLE in East Asian populations.
Methods
We newly genotyped 10 029 SLE cases and 180 167 controls and subsequently meta-analysed them jointly with 3348 SLE cases and 14 826 controls from published studies in East Asians. We further applied a Bayesian statistical approach to localise the putative causal variants for SLE associations.
Results
We identified 113 genetic regions including 46 novel loci at genome-wide significance (p<5×10−8). Conditional analysis detected 233 association signals within these loci, which suggest widespread allelic heterogeneity. We detected genome-wide associations at six new missense variants. Bayesian statistical fine-mapping analysis prioritised the putative causal variants to a small set of variants (95% credible set size ≤10) for 28 association signals. We identified 110 putative causal variants with posterior probabilities ≥0.1 for 57 SLE loci, among which we prioritised 10 most likely putative causal variants (posterior probability ≥0.8). Linkage disequilibrium score regression detected genetic correlations for SLE with albumin/globulin ratio (rg=−0.242) and non-albumin protein (rg=0.238).
Conclusion
This study reiterates the power of large-scale genome-wide meta-analysis for novel genetic discovery. These findings shed light on genetic and biological understandings of SLE.
Keywords: lupus erythematosus, systemic, polymorphism, genetic, epidemiology
Key messages.
What is already known about this subject?
Genome-wide association studies have identified nearly 100 susceptibility loci for systemic lupus erythematosus (SLE) risk.
The known SLE loci explain partially the disease heritability.
What does this study add?
This study identified 113 genomic regions including 46 novel loci for SLE risk.
The study prioritised 110 putative causal variants including 10 putative causal variants with high confidence (posterior probability ≥0.8).
How might this impact on clinical practice or future developments?
These findings revealed new genetic basis for SLE and generated molecular mechanisms hypotheses for further investigations.
Introduction
Systemic lupus erythematosus (SLE) is an autoimmune disorder characterised by the production of autoantibodies that damage multiple organs.1 Considerable genetic predisposition contributes to SLE aetiology.2 To date, nearly 100 susceptibility loci have been identified for SLE, mainly through genome-wide association studies (GWASs).3–8 However, these loci collectively only explain ~30% of SLE heritability9 and their biology, in terms of causal variants, effector genes and cell types and pathological pathways that mediate genetic effects, has not yet been fully characterised.10
Genome-wide association meta-analyses have been performed to uncover new genetic associations for SLE in Asians,11 Europeans12 and trans-ancestral populations.9 However, the study sample sizes were relatively modest, which limits their ability for genetic discovery. GWASs have successfully linked genetic variants with human common diseases and traits.13 Nonetheless, only ~8% of GWAS participants are East Asians.14 East Asians have a unique population genetic history and may have ethnicity-specific genetic architecture involved in the development of disease and manifestations. For example, SLE has a remarkably higher prevalence and younger age of onset in Asians.15 16 Genetic heterogeneity may explain, at least partly, the phenotypic diversity of SLE between East Asians and Europeans.9 Hence, large-scale East Asian investigations may provide an opportunity to identify unique genetic associations even for the same diseases and traits that have already been well studied in Europeans.17
Methods
Study participants
We recruited a total of 10 029 SLE cases and 180 167 healthy controls in three independent case–control cohorts from mainland China, Korea and Japan. We analysed additionally 3348 SLE cases and 14 826 controls that were published in our previous East Asian SLE GWASs4 6–9 to increase statistical power. All the cases fulfilled the revised American College of Rheumatology SLE classification criteria or were diagnosed by collagen disease physicians (online supplemental table 1). Each participant provided written informed consent.
annrheumdis-2020-219209supp001.pdf (13MB, pdf)
Genome-wide association analyses
We newly genotyped 10 029 SLE cases and 180 167 controls, and revisited raw genome-wide genotype data in 3348 SLE cases and 14 826 controls from the five published studies.4 6–9 Quality controls were conducted for each of the eight data sets. Genotype imputation was accomplished using reference panels from the 1000 Genomes Project (1KGP) phase 3 v518 and population-specific reference panels19 in IMPUTE2/420 21 or MINIMAC4.22
We tested association between SLE risk and genotype dosages in each data set using a logistic regression or linear mixed model in PLINK,23 SNPTEST24 or EPACTS (https://genome.sph.umich.edu/wiki/EPACTS) (online supplemental table 1). Within each data set, we filtered out association results based on imputation quality (IMPUTE info or MINIMAC r2 ≤0.3), minor allele frequency (MAF) ≤0.5% or Hardy-Weinberg equilibrium test p<1.0×10−6 in controls. For each cohort, the association analysis for the X chromosome was conducted separately by sex and then meta-analysed across both men and women. For data sets analysed using a linear mixed model (online supplemental table 1), allelic effects and standard errors were converted to a log-odds scale to correct for case–control imbalance.25
Fixed-effects meta-analysis
We aggregated the association summary statistics from the eight data sets using a fixed-effects inverse-variance meta-analysis in METAL.26 We applied a genomic control correction to each association summary statistic. Heterogeneity in allelic effect sizes among data sets was assessed using Cochran’s Q statistic. We excluded genetic variants available in only a single data set. We defined SLE susceptibility loci by merging ±250 kilobases (kb) windows around genome-wide associated variants to ensure that lead single nucleotide polymorphisms (SNPs) were at least 500 kb apart. We defined lead variants as the most significant SLE-associated variant within each locus. A locus was considered novel if the lead SNP was at least 500 kb away from any previously reported SLE-associated variants.
Approximate conditional association analysis
To dissect distinct association signals at each SLE locus, we performed an approximate conditional analysis using GCTA COJO27 with genome-wide meta-analysis summary statistics based on linkage disequilibrium (LD) estimated from 7021 unrelated Chinese controls. The Chinese reference individuals for LD calculation were retrieved from the Chinese study using the Illumina Infinium Global Screening Array data (online supplemental table 1), excluding first-degree and second-degree relatives.
Bayesian statistical fine-mapping analysis
To prioritise causal variants in SLE susceptibility loci, a statistical fine-mapping analysis was performed using FINEMAP v1.4 software,28 with meta-analysis z-scores and LD matrices estimated from the 7021 Chinese reference individuals. We used default priors and parameters in FINEMAP, assuming at most five causal signals in the ±250 kb region around a lead variant at each SLE locus. FINEMAP computed a posterior probability (PP) for each genetic variant being the true putative causal variant. For each association signal, we ranked the candidate putative causal variants in a descending order of their PPs, and then built a 95% credible set of causal variants by including the ordered variants until their cumulative PP reached 0.95.
Heritability estimation by LD score regression
Overall SLE heritability h2 explained by genome-wide variants was estimated using the LD score regression model29 with LD scores18 from the 1KGP East Asian descendants, based on an SLE population prevalence of 0.03% in East Asian populations.1 SLE heritability estimate was further partitioned according to known and novel SLE loci using stratified LD score regression.30 The boundary of each SLE locus was arbitrarily defined as ±500 kb flanking the lead SLE-risk variant.
Genetic correlation between SLE and other traits by LD score regression
We calculated genetic correlations between 98 traits (39 diseases17 and 59 quantitative traits31 and SLE by using bivariate LD score regression.32 We used the LD scores18 from the 1KGP East Asian descendants, limited the genetic variants to the HapMap3 SNPs and removed the variants with extended human leucocyte antigen (HLA) region (chromosome 6: 25 to 34 megabases (Mb)).
Patient and public involvement
Patients and the public were not involved in the design or analysis of this study.
Results
Identification of 46 novel SLE susceptibility loci
We performed a large genome-wide association meta-analysis in 13 377 SLE cases and 194 993 controls of East Asians (online supplemental table 1). To the best of our knowledge, this is the largest genetic association study of SLE to date. The effective sample size (Neff=50 072) is three-fold and four-fold larger than that of the largest published trans-ancestry9 and East Asian11 meta-analyses, respectively.
We tested associations for 11 270 530 genetic variants in a fixed-effects meta-analysis. A quantile–quantile plot showed that test statistics were well-calibrated, with a genomic-control inflation factor λGC=1.06 (indicating that ancestry effects had been well controlled; online supplemental figure 1). LD score regression29 showed that polygenic effects (89.4%), rather than biasses, primarily caused the inflation residual (estimated mean χ2=1.32 and LD-score intercept=1.03).
We detected 26 379 genetic variants associated with SLE at p<5×10−8 within 113 loci (figure 1A and online supplemental table 2), of which 46 were novel (table 1). The pairwise LD between lead variants was low (LD r2 <0.002). For seven novel loci, MAFs of the lead SNPs were 10-fold higher in East Asians than in Europeans (figure 1B). Two of them and their LD neighbours (r2 ≥0.2 in either East Asians or Europeans) would be undetectable in Europeans with the same effective sample size and risk magnitude due to low statistical power (<10%; online supplemental table 3).
Figure 1.
Summary of meta-analysis association results and comparison of MAFs for lead variants within the 46 novel loci between East Asians and Europeans. (A) Manhattan plot of genome-wide association meta-analysis results from 208 370 SLE East Asians including 13 377 SLE cases and 194 993 controls. Minus log10-transformed association p values (y-axis) are plotted along chromosomal positions (x-axis). Known and novel loci are highlighted in light blue and pink, respectively. The red dashed line denotes the genome-wide association significance threshold of p=5×10−8. The grey dashed line represents p=10−30, at which the y-axis breaks. (B) Comparison of MAFs of lead variants within the 46 novel loci between East Asians (y-axis) and non-Finnish Europeans (x-axis) in the Genome Aggregation Database (gnomAD) v3. Variants with more than 10 times higher MAFs in East Asians are coloured purple above a red dashed line. MAF, minor allele frequency.
Table 1.
Association results for the 46 novel susceptibility loci for systemic lupus erythematosus
| Region | CHR | Position | Variant | EA | NEA | EAF | OR | SE | P value | I2 | PHet | N | Nearest gene |
| 1 | 1 | 117 043 302 | rs9651076 | A | G | 0.431 | 1.117 | 0.015 | 3.26E−13 | 10.7 | 0.347 | 208 370 | CD58 |
| 2 | 1 | 157 108 159 | rs116785379 | C | G | 0.107 | 1.211 | 0.024 | 6.68E−16 | 43.7 | 0.114 | 208 370 | ETV3 |
| 7 | 1 | 201 979 455 | rs3806357 | A | G | 0.251 | 1.106 | 0.017 | 4.25E−09 | 0.0 | 0.672 | 208 370 | ELF3 |
| 9 | 2 | 7 573 079 | rs75362385 | T | G | 0.321 | 0.887 | 0.017 | 8.40E−13 | 68.3 | 0.007 | 208 370 | LOC100506274 |
| 14 | 2 | 111 877 174 | rs73954925 | C | G | 0.878 | 1.169 | 0.024 | 5.11E−11 | 56.4 | 0.043 | 208 370 | BCL2L11 |
| 18 | 2 | 198 929 806 | rs7572733 | T | C | 0.260 | 1.143 | 0.017 | 1.25E−14 | 0.0 | 0.647 | 208 370 | PLCL1 |
| 20 | 3 | 28 072 086 | rs438613 | T | C | 0.588 | 0.920 | 0.014 | 7.52E−09 | 69.4 | 0.006 | 208 370 | LINC01980 |
| 21 | 3 | 72 225 916 | rs7637844 | A | C | 0.871 | 0.877 | 0.023 | 1.28E−08 | 0.0 | 0.906 | 208 370 | LINC00870 |
| 25 | 4 | 2 700 844 | rs231694 | T | C | 0.380 | 1.111 | 0.018 | 9.71E−09 | 23.7 | 0.269 | 57 253 | FAM193A |
| 26 | 4 | 40 307 587 | rs113284964 | G | GCTTC | 0.371 | 1.134 | 0.015 | 1.35E−16 | 67.2 | 0.009 | 208 370 | LINC02265 |
| 27 | 4 | 79 644 279 | rs6533951 | A | G | 0.350 | 1.111 | 0.016 | 1.25E−10 | 61.4 | 0.024 | 208 370 | LINC01094 |
| 28 | 4 | 84 146 996 | rs6841907 | T | C | 0.729 | 0.906 | 0.016 | 1.10E−09 | 43.5 | 0.115 | 208 370 | COQ2 |
| 31 | 4 | 109 061 618 | rs58107865 | C | G | 0.227 | 0.802 | 0.021 | 6.57E−25 | 1.1 | 0.409 | 208 370 | LEF1 |
| 34 | 5 | 131 120 338 | rs370449198 | A | AC | 0.922 | 0.721 | 0.060 | 4.41E−08 | 0.0 | 0.408 | 187 562 | FNIP1 |
| 35 | 5 | 131 829 578 | rs2549002 | A | C | 0.682 | 0.905 | 0.016 | 2.40E−10 | 20.6 | 0.279 | 208 370 | IRF1 |
| 40 | 6 | 243 302 | rs9503037 | A | G | 0.693 | 0.881 | 0.016 | 1.36E−15 | 42.3 | 0.123 | 208 370 | LOC285766 |
| 43 | 6 | 36 715 031 | rs34868004 | CA | C | 0.225 | 1.104 | 0.017 | 4.46E−09 | 40.7 | 0.134 | 208 370 | CPNE5 |
| 46 | 6 | 116 690 849 | rs9488914 | T | C | 0.920 | 0.862 | 0.026 | 1.14E−08 | 65.3 | 0.013 | 208 370 | DSE |
| 48 | 6 | 154 570 651 | rs9322454 | A | G | 0.659 | 1.090 | 0.015 | 2.42E−08 | 0.0 | 0.430 | 208 370 | IPCEF1 |
| 54 | 8 | 71 330 166 | rs142937720 | A | AAGTGGCC | 0.383 | 0.894 | 0.016 | 2.27E−12 | 67.9 | 0.008 | 208 370 | NCOA2 |
| 55 | 8 | 72 894 959 | rs17374162 | A | G | 0.411 | 0.917 | 0.015 | 3.02E−09 | 35.7 | 0.169 | 208 370 | MSC-AS1 |
| 56 | 8 | 129 425 593 | rs16902895 | A | G | 0.678 | 1.122 | 0.016 | 1.48E−13 | 0.0 | 0.801 | 208 370 | LINC00824 |
| 58 | 9 | 21 267 087 | rs7858766 | T | C | 0.538 | 1.139 | 0.016 | 2.25E−15 | 0.0 | 0.825 | 208 370 | IFNA22P |
| 59 | 10 | 5 910 746 | rs77448389 | A | G | 0.913 | 0.855 | 0.025 | 7.30E−10 | 0.0 | 0.584 | 208 370 | ANKRD16 |
| 62 | 10 | 64 411 288 | rs10995261 | T | C | 0.240 | 0.909 | 0.017 | 2.57E−08 | 43.9 | 0.113 | 208 370 | ZNF365 |
| 63 | 10 | 73 466 709 | rs10823829 | T | C | 0.718 | 0.910 | 0.016 | 1.05E−09 | 0.0 | 0.771 | 208 370 | CDH23 |
| 64 | 10 | 105 677 911 | rs111447985 | A | C | 0.073 | 1.172 | 0.028 | 1.72E−08 | 0.0 | 0.526 | 208 370 | STN1 |
| 65 | 10 | 112 664 114 | rs58164562 | T | C | 0.748 | 0.892 | 0.016 | 3.14E−12 | 33.3 | 0.186 | 208 370 | BBIP1 |
| 66 | 11 | 4 113 200 | rs3750996 | A | G | 0.834 | 1.167 | 0.022 | 1.89E−12 | 0.0 | 0.522 | 208 370 | STIM1 |
| 67 | 11 | 18 362 382 | rs77885959 | T | G | 0.978 | 1.694 | 0.062 | 3.16E−17 | 0.0 | 0.511 | 204 433 | GTF2H1 |
| 74 | 12 | 4 140 876 | rs2540119 | T | C | 0.544 | 1.086 | 0.015 | 3.51E−08 | 44.9 | 0.106 | 208 370 | PARP11 |
| 77 | 12 | 103 916 080 | rs6539078 | T | C | 0.591 | 0.894 | 0.015 | 9.49E−14 | 0.0 | 0.916 | 208 370 | LOC105369945 |
| 79 | 12 | 121 368 518 | rs3999421 | A | T | 0.506 | 0.910 | 0.016 | 1.29E−09 | 47.3 | 0.091 | 208 370 | XLOC_009911 |
| 81 | 12 | 133 040 182 | rs200521476 | G | GCATCAC | 0.812 | 0.875 | 0.023 | 5.66E−09 | 26.7 | 0.235 | 208 370 | FBRSL1 |
| 86 | 15 | 101 529 012 | rs35985016 | A | G | 0.930 | 0.843 | 0.030 | 1.95E−08 | 0.0 | 0.897 | 204 433 | LRRK1 |
| 90 | 16 | 50 089 207 | rs11288784 | G | GT | 0.365 | 0.902 | 0.016 | 2.38E−10 | 0.0 | 0.664 | 208 370 | HEATR3 |
| 93 | 16 | 79 745 672 | rs11376510 | G | GT | 0.737 | 0.898 | 0.017 | 2.23E−10 | 0.0 | 0.719 | 208 370 | MAFTRR |
| 95 | 17 | 7 240 391 | rs61759532 | T | C | 0.076 | 1.235 | 0.032 | 2.79E−11 | 24.9 | 0.247 | 208 370 | ACAP1 |
| 97 | 17 | 47 468 020 | rs2671655 | T | C | 0.651 | 1.087 | 0.015 | 4.60E−08 | 0.0 | 0.756 | 208 370 | LOC10272459 |
| 98 | 17 | 76 373 179 | rs113417153 | T | C | 0.193 | 0.893 | 0.020 | 1.90E−08 | 2.1 | 0.403 | 208 370 | PGS1 |
| 100 | 18 | 77 386 912 | rs118075465 | A | G | 0.147 | 1.140 | 0.020 | 1.16E−10 | 0.0 | 0.543 | 208 370 | LOC284241 |
| 101 | 19 | 948 532 | rs2238577 | T | C | 0.455 | 0.885 | 0.016 | 1.83E−14 | 60.8 | 0.026 | 208 370 | ARID3A |
| 102 | 19 | 6 697 088 | rs5826945 | A | T | 0.929 | 0.836 | 0.028 | 9.67E−11 | 50.0 | 0.075 | 208 370 | C3 |
| 105 | 19 | 33 072 768 | rs12461589 | T | C | 0.248 | 0.898 | 0.017 | 5.00E−10 | 0.0 | 0.510 | 208 370 | PDCD5 |
| 106 | 19 | 49 851 746 | rs33974425 | CCAGCTGCAT | C | 0.702 | 1.120 | 0.016 | 4.40E−12 | 42.6 | 0.121 | 208 370 | TEAD2 |
| 108 | 22 | 18 649 356 | rs4819670 | T | C | 0.210 | 1.151 | 0.022 | 5.53E−11 | 0.0 | 0.650 | 208 370 | USP18 |
CHR, chromosome; EA, effect allele; EAF, effect allele frequency; I2, genetic heterogeneity I2 statistics at scale of 0% to 100%; N, study sample size; NEA, non-effect allele; OR, Odds ratio; PHet, P-values for the χ2 test of genetic heterogeneity; Region, unique ID for genomic region;SE, Standard error of odds ratio.
Associations at exonic variants
The meta-analysis identified lead missense variants in two novel loci (CHD23 and LRRK1; figure 2A, B and online supplemental table 2). In addition, we detected three new exonic variants (including two missense variants) within the reported SLE loci including CSK (rs11553760), IKBKB (rs2272736) and TYK2 (rs55882956) genes (figure 2C–E and online supplemental table 2). They were not correlated with previously reported exonic variants within the same genes (LD r2 <0.02 in East Asians or Europeans; online supplemental table 4), suggesting possible allelic heterogeneity of these genes. We replicated four known associations for missense variants at AHNAK2 (rs2819426),33 IRAK1 (rs1059702),34 NCF2 (rs13306575) and WDFY4 (rs7097397; online supplemental table 2).35 36
Figure 2.
New lead exonic variants identified at three known (CSK, IKBKB and TYK2) and two novel (CHD23 and LRRK1) loci. (A) rs11553760 (synonymous variant) at CSK. (B) rs2272736 (p.Arg303Gln, missense variant) at IKBKB. (C) rs55882956 (p.Arg703Trp, missense variant) at TYK2. (D) rs10823829 (synonymous variant) at CHD23. (E) rs35985016 (p.Lys203Glu, missense variant) at LRRK1. The lead SNP is labelled as purple diamond. The LD is estimated from 7021 Chinese samples. LD, linkage disequilibrium; Mb, megabases; SNP, single nucleotide polymorphism.
Secondary association signals within SLE loci
To dissect the source of association signals at each locus, we conducted an approximate conditional analysis using GCTA27 with meta-analysis summary statistics and LD estimates from 7021 unrelated Chinese controls. We acknowledge the limitations of using LD estimation from a single population for a meta-analysis of diverse East Asians. We identified a total of 233 independent association signals with conditional p<5×10−8, 169 of which arose from non-HLA regions (online supplemental table 5). We observed from two to four signals at each of 28 non-HLA loci (including seven novel loci). For example, we discovered two distinct association signals within the known STAT4 locus, including the previously reported SNP rs1188934112 and the new insert-deletion variant (indel) rs71403211 (figure 3A). For the 46 novel loci, we discovered 55 distinct signals (online supplemental table 5 and figure 2). We noticed that most of the signal index variants (n=190, 82%) are common (MAF ≥5%) with modest effects (online supplemental table 5).
Figure 3.
Two independent association signals identified. (A) At two intronic variants within known STAT4 locus. (B) At known (rs7097397, p.Arg1816Gln) and new (rs7072606, p.Ser214Pro) missense variants within WDFY4 locus. (C) A known intronic variant within ATXN2 gene and a new (rs1131476, p.Ala352Thr) missense variant within OAS1 gene. The lead and secondary index variants are labelled in diamond. The lead variant and its LD proxies are in red while the secondary signal index variant and its LD proxies are in blue. The LD is estimated from 7021 Chinese samples. LD, linkage disequilibrium; Mb, megabases.
Approximate conditional analysis detected two novel missense variants at WDFY4 and OAS1 genes. We detected two distinct signals within WDFY4, including the known (rs7097397)37 and a new (rs7072606) missense variant (LD r2=0.02 between two variants in East Asians), which suggests allelic heterogeneity at this locus (figure 3B). We provided for the first time genome-wide association evidence at a missense variant within OAS1 (rs1131476, LD r2=0.78 with rs1051042, which is a known missense variant but only exhibited suggestive significance with SLE in previous study,33 figure 3C and online supplemental table 5).
Prioritisation of causal variants
To prioritise putative causal variants, we conducted a Bayesian statistical fine-mapping analysis for 111 loci using FINEMAP28 after excluding complex associations involving the HLA and 7q11.23. We found exactly the same number of association signals in 57 loci between FINEMAP causal configuration with the highest posterior probability and the GCTA approximate conditional test. To be conservative, we only summarised the statistical fine-mapping results for these 57 regions, which contained 65 association signals (online supplemental table 6).
For each signal, we built a credible set of putative causal variants with a 95% probability of including the true causal variants. The size of 28 credible sets was small (size ≤10; figure 4A). Among the 110 putative causal variants with posterior probability ≥0.1 (figure 4B), we found four coding variants (3.6%), which implies that most of these associations are probably induced by non-coding causal variants. The prioritised variants are available to be tested as potential targets in perturbation experiments. For example, the allele-specific regulatory activity of the intronic variant (rs10036748) with the highest posterior probability (0.387) in the TNIP1 locus was recently experimentally characterised in SLE.38
Figure 4.
Results of statistical fine-mapping analysis. (A) Number of 95% credible sets of putative causal variants, binned by their sizes. (B) Number of potential causal variants with posterior probabilities (PP) ≥0.1, which are considered to be the true causal variants.
We pinpointed a single most likely causal variant with high confidence (posterior probability ≥0.8) for four known (ATXN2, BACH2, DRAM1/WASHC3 and NCF2) and six novel (17p13.1, ELF3, GTF2H1, LRRK1, LOC102724596/PHB and STIM1) loci (online supplemental table 6). For example, we prioritised rs61759532 as a putative causal variant at the novel 17p13.1 locus (PP=0.999). This variant is located in an intron of ACAP1, which encodes a key regulator of integrin traffic for cell adhesion and migration.39
SNP-based heritability
To assess the proportion of phenotypic variance explained by common variants, we applied LD score regression29 to the meta-analysis results. Assuming a population prevalence of 0.03% for SLE,1 we estimated the liability-scale SNP-based heritability from all non-HLA variants as h2 SNP = 7.24% (SE=0.78%). The 66 known and 46 novel non-HLA loci explained 62.6% (SE=4.9%) and 22.1% (SE=2.6%) of this overall SNP-based heritability, respectively.
Genetic correlation with other diseases/traits
To explore shared genetics between SLE and various traits, we calculated genetic correlations of SLE with 39 complex diseases and 59 quantitative traits in Biobank Japan participants using bivariate LD score regression32 (online supplemental table 7). As expected, we detected significant positive genetic correlations between SLE and two other autoimmune diseases: rheumatoid arthritis (rg=0.437) and Graves’ disease (rg=0.318). In addition, we found unreported genetic correlations (FDR<0.05) with albumin/globulin ratio (rg=−0.242) and non-albumin protein (rg=0.238).
Discussion
Here, we carried out the largest-ever genome-wide association meta-analysis for SLE and identified 113 risk loci including 46 novel regions for SLE in 208 370 East Asians including 13 377 SLE cases and 194 993 controls. This study revealed new genetic predispositions for SLE and generated hypotheses for further studies to investigate diseases functional mechanisms.
Epidemiological studies have found the higher prevalence of SLE in East Asians and heterogeneous disease manifestations across ethnicities.15 16 Previous investigations suggested genetics might explain the phenotypic heterogeneity.9 We observed that the MAFs of the index variants for several novel genetic associations were much higher in East Asians than in Europeans. Specifically, we suggested two novel loci were more likely specific to East Asians. These findings might help explain the genetic basis of SLE phenotypic heterogeneity between East Asians and Europeans. The results reinforce the power of large-scale genetic association for genetic discovery of SLE in relatively less studied populations.
We identified 11 exonic variants including two missense variants within novel loci CHD23 and LRRK1, four novel missense variants within known SLE loci IKBKB,9 TYK2,9 WDFY4 37 and OAS1, 33 and three known missense variants within known AHNAK2,33 IRAK1 34 and NCF2.35 36 These findings suggested allelic heterogeneity within several of these loci and highlighted the disease-risk effects of genes AHNAK2, CSK, IKBKB, IRAK1, NCF2, OAS1, TYK2 and WDFY4 within eight known loci, and CDH23 and LRRK1 within two novel loci which potentially alter gene product activity in an allele-specific manner. The novel gene CHD23 plays a role in cell migration40 while LRRK1 encodes a multiple-domain leucine-rich repeat kinase. A previous study observed that LRRK1-deficient mice exhibited a profound defect in B-cell proliferation and survival and impaired B-cell receptor-mediated NF-κB activation,41 which suggested that the association within this region might confer the risk of SLE through modulating the NF-κB pathway and the activities of B cells. We noted that the Bayesian statistical fine-mapping analysis prioritised the lead missense variant rs35985016 as the most likely putative causal variant for this association. This variant is highly frequent in our study individuals but is rare in Europeans. The molecular mechanisms in SLE risk worthy further investigations.
In the present study, we localised the putative causal variants for SLE genetic association in high resolution. Our findings indicated that the putative causal variants for the majority of SLE associations were non-coding variants. We provided targets of candidate putative causal variants with high confidence for several SLE loci. These findings are worthy for further exploration in functional experiments. We showed the regulatory effect of one of the putative causal variants in an accompanied paper. We acknowledged the limitation of a small LD reference panel from single population in the Bayesian statistical fine-mapping analysis.
We found for the first time the significant genetic correlations between SLE, albumin/globulin ratio and non-albumin protein. These findings might reflect the renal complications commonly developed in SLE patients who have been reported to have significantly lower albumin/globulin ratio and higher serum globulin than healthy controls in epidemiological studies.42 These shared genetic basis findings might suggest a common pathway underlying the SLE risk and kidney function in addition to the direct damage of SLE autoantibodies on kidney.
In summary, we detected 46 novel loci for SLE risk in the largest meta-analysis and prioritised putative causal variants for 65 causal signals. This study highlights the power of large-scale genetic association study in East Asian populations. The findings reveal the genetic predispositions for SLE and provide clues for further the investigation of disease mechanisms.
Acknowledgments
We acknowledged the participants in this study. We appreciate the contribution of Japanese Research Committee on Idiopathic Osteonecrosis of the Femoral Head. We appreciate all contributors to BioBank Japan. Details are included in supplementary material.
Footnotes
Handling editor: Josef S Smolen
XY, KKim and HS contributed equally.
Contributors: XY, KKim and HS contributed equally to this work, and either has the right to list himself first in bibliographical documents. SCB, YC, CT, XZhang, XY, KKim and HS conceived the study design. SCB, YC, XZhang, SY, KKim and CT acquainted the financial support. XY, KKim, HS, CT, YC and SCB wrote the manuscript. XY, KKim, HS, EH, XZheng, VL and YW conducted all of the analyses with the help of JBH, LCK, MTW, SP, SE, HS, KT, NO, MK, KI and C Terao. KKim, SYB, LW, LL, RXL, YSheng, MYH, WL, KYoon, MC, HH, MW, YTang, HD, CL, CS, WF, KL, BJK, HSL, SCB, SH, YSakamoto, NSugano, MM, DT, KKarino, TMiyamura, JN, GM, TKuroda, HN, TMiyamoto, TT, YKawaguchi, KA, YTada, KYamaji, MS, TA, AS, TSumida, YOkada, KMatsuda, KMatsuo, YKochi, TSeki, YTanaka, TKubo, RH, TYoshioka, MY, TKabata, YA, YOhta, TO, YN, AK, YY, KOhzono, KYamamoto, KOhmura, TYamamoto and SI generated genetic data. SYB, SJ, YCK, WTC, SSL, SCS, YMK, DY, CHS, YBP, JYC, YP, GYA, JMS, YKL, DJP, WY, THK, SY, BJK, NShen, HSL, XZhang, CT and SCB managed the cohort data. All authors reviewed and approved the manuscript.
Funding: This research was supported by General Program (81872516, 81573033, 81872527, 81830019, 81421001), Young Program (81803117, 82003328), Exchange Program (81881340424), and Science Fund for Creative Research Groups (31630021) of National Natural Science Foundation of China (NSFC), Distinguished Young Scholar of Provincial Natural Science Foundation of Anhui (1808085J08), National Program on Key Basic Research Project of China (973 Program) (2014CB541901), China National Key R&D Program (2016YFC0906100), Science Foundation of Ministry of Education of China (213018A), Program for New Century Excellent Talents in University of Ministry of Education of China (NCET-12-0600), The Bio & Medical Technology Development Program of the National Research Foundation, funded by the Ministry of Science & ICT of the Republic of Korea (NRF-2017M3A9B4050355 to SCB), Basic Science Research Program through the National Research Foundation of Korea funded by the Ministry of Science, ICT and Future Planning (2015R1C1A1A02036527 and 2017R1E1A1A01076388 to KKim), National BioBank of Korea, the Centers for Disease Control and Prevention, Republic of Korea (KBN-2018-031 to SSL), Center for Genome Science, Korea National Institute of Health, Republic of Korea (4845-301, 3000-3031 to MYH, KYoon and BJK), Japan Agency for Medical Research and Development (AMED) and the BioBank Japan project supported by the Ministry of Education, Culture, Sports, Sciences and Technology of the Japanese Government and AMED under grant numbers (17km0305002 and 18km0605001), Grant of Japan Orthopaedics and Traumatology Research Foundation, lnc, (No. 350 to YSakamoto), RIKEN Junior Research Associate Program (to H.S.), US NIH grants (AI024717, AI130830, AI148276, HG172111 and AR070549 to JBH), US Department of Veterans Affairs (BX001834 to JBH) and Center for Pediatric Genomics Award and CCRF Endowed Scholar Award of Cincinnati Children’s Hospital (to MTW).
Competing interests: None declared.
Provenance and peer review: Not commissioned; externally peer reviewed.
Supplemental material: This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.
Data availability statement
All data relevant to the study are included in the article or uploaded as supplementary information. The meta-analysis summary association statistics in the current study are available from the corresponding author on reasonable request.
Ethics statements
Patient consent for publication
Not required.
Ethics approval
The study protocol was approved by the Institutional Review Board at each participating institute and the meta-analysis study was additionally approved by the Institutional Review Boards at Anhui Medical University, Hanyang University Hospital of Rheumatic Diseases, and RIKEN Center for Medical Sciences.
References
- 1. Carter EE, Barr SG, Clarke AE. The global burden of SLE: prevalence, health disparities and socioeconomic impact. Nat Rev Rheumatol 2016;12:605–20. 10.1038/nrrheum.2016.137 [DOI] [PubMed] [Google Scholar]
- 2. Guerra SG, Vyse TJ, Cunninghame Graham DS. The genetics of lupus: a functional perspective. Arthritis Res Ther 2012;14:211. 10.1186/ar3844 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Gateva V, Sandling JK, Hom G, et al. A large-scale replication study identifies TNIP1, PRDM1, JAZF1, UHRF1BP1 and IL10 as risk loci for systemic lupus erythematosus. Nat Genet 2009;41:1228–33. 10.1038/ng.468 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Han J-W, Zheng H-F, Cui Y, et al. Genome-Wide association study in a Chinese Han population identifies nine new susceptibility loci for systemic lupus erythematosus. Nat Genet 2009;41:1234–7. 10.1038/ng.472 [DOI] [PubMed] [Google Scholar]
- 5. Cunninghame Graham DS, Morris DL, Bhangale TR, et al. Association of NCF2, IKZF1, IRF8, IFIH1, and TYK2 with systemic lupus erythematosus. PLoS Genet 2011;7:e1002341. 10.1371/journal.pgen.1002341 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Okada Y, Shimane K, Kochi Y, et al. A genome-wide association study identified AFF1 as a susceptibility locus for systemic lupus eyrthematosus in Japanese. PLoS Genet 2012;8:e1002455. 10.1371/journal.pgen.1002455 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Kim K, Bang S-Y, Lee H-S, et al. The HLA-DRβ1 amino acid positions 11-13-26 explain the majority of SLE-MHC associations. Nat Commun 2014;5:5902. 10.1038/ncomms6902 [DOI] [PubMed] [Google Scholar]
- 8. Akizuki S, Ishigaki K, Kochi Y, et al. PLD4 is a genetic determinant to systemic lupus erythematosus and involved in murine autoimmune phenotypes. Ann Rheum Dis 2019;78:509–18. 10.1136/annrheumdis-2018-214116 [DOI] [PubMed] [Google Scholar]
- 9. Morris DL, Sheng Y, Zhang Y, et al. Genome-Wide association meta-analysis in Chinese and European individuals identifies ten new loci associated with systemic lupus erythematosus. Nat Genet 2016;48:940–6. 10.1038/ng.3603 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Catalina MD, Owen KA, Labonte AC, et al. The pathogenesis of systemic lupus erythematosus: harnessing big data to understand the molecular basis of lupus. J Autoimmun 2020;110:102359. 10.1016/j.jaut.2019.102359 [DOI] [PubMed] [Google Scholar]
- 11. Sun C, Molineros JE, Looger LL, et al. High-Density genotyping of immune-related loci identifies new SLE risk variants in individuals with Asian ancestry. Nat Genet 2016;48:323–30. 10.1038/ng.3496 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Bentham J, Morris DL, Graham DSC, et al. Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus. Nat Genet 2015;47:1457–64. 10.1038/ng.3434 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Buniello A, MacArthur JAL, Cerezo M, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res 2019;47:D1005–12. 10.1093/nar/gky1120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Sirugo G, Williams SM, Tishkoff SA. The missing diversity in human genetic studies. Cell 2019;177:26–31. 10.1016/j.cell.2019.02.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Danchenko N, Satia JA, Anthony MS. Epidemiology of systemic lupus erythematosus: a comparison of worldwide disease burden. Lupus 2006;15:308–18. 10.1191/0961203306lu2305xx [DOI] [PubMed] [Google Scholar]
- 16. Morais SA, Isenberg DA. A study of the influence of ethnicity on serology and clinical features in lupus. Lupus 2017;26:17–26. 10.1177/0961203316645204 [DOI] [PubMed] [Google Scholar]
- 17. Ishigaki K, Akiyama M, Kanai M, et al. Large-Scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat Genet 2020;52:669–79. 10.1038/s41588-020-0640-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18., Auton A, Brooks LD, et al. , 1000 Genomes Project Consortium . A global reference for human genetic variation. Nature 2015;526:68–74. 10.1038/nature15393 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Kim Y, Kim B, Han B. The Korean reference genome Project: construction of the refer- ence panel for imputation analysis. Presented at the 61th Annual Meeting of The American Society of Human Genetics; October 6, 2015, Baltimore, MD, 2015. [Google Scholar]
- 20. Bycroft C, Freeman C, Petkova D, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 2018;562:203–9. 10.1038/s41586-018-0579-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 2009;5:e1000529. 10.1371/journal.pgen.1000529 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Das S, Forer L, Schönherr S, et al. Next-Generation genotype imputation service and methods. Nat Genet 2016;48:1284–7. 10.1038/ng.3656 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007;81:559–75. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Marchini J, Howie B. Genotype imputation for genome-wide association studies. Nat Rev Genet 2010;11:499–511. 10.1038/nrg2796 [DOI] [PubMed] [Google Scholar]
- 25. Cook JP, Mahajan A, Morris AP. Guidance for the utility of linear models in meta-analysis of genetic association studies of binary phenotypes. European Journal of Human Genetics 2017;25:240–5. 10.1038/ejhg.2016.150 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 2010;26:2190–1. 10.1093/bioinformatics/btq340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Yang J, Ferreira T, Morris AP, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet 2012;44:369–75. 10.1038/ng.2213 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Benner C, Spencer CCA, Havulinna AS, et al. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 2016;32:1493–501. 10.1093/bioinformatics/btw018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Bulik-Sullivan BK, Loh P-R, Finucane HK, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 2015;47:291–5. 10.1038/ng.3211 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Finucane HK, Bulik-Sullivan B, Gusev A, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet 2015;47:1228–35. 10.1038/ng.3404 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Terao C, Momozawa Y, Ishigaki K, et al. GWAS of mosaic loss of chromosome Y highlights genetic effects on blood cell differentiation. Nat Commun 2019;10:4719. 10.1038/s41467-019-12705-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Bulik-Sullivan B, Finucane HK, Anttila V, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet 2015;47:1236–41. 10.1038/ng.3406 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Wen L, Zhu C, Zhu Z, et al. Exome-wide association study identifies four novel loci for systemic lupus erythematosus in Han Chinese population. Ann Rheum Dis 2018;77:417. 10.1136/annrheumdis-2017-211823 [DOI] [PubMed] [Google Scholar]
- 34. Zhang Y, Zhang J, Yang J, et al. Meta-Analysis of GWAS on two Chinese populations followed by replication identifies novel genetic variants on the X chromosome associated with systemic lupus erythematosus. Hum Mol Genet 2015;24:274–84. 10.1093/hmg/ddu429 [DOI] [PubMed] [Google Scholar]
- 35. Armstrong DL, Eisenstein M, Zidovetzki R, et al. Systemic lupus erythematosus-associated neutrophil cytosolic factor 2 mutation affects the structure of NADPH oxidase complex. J Biol Chem 2015;290:12595–602. 10.1074/jbc.M115.639021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Kim-Howard X, Sun C, Molineros JE, et al. Allelic heterogeneity in NCF2 associated with systemic lupus erythematosus (SLE) susceptibility across four ethnic populations. Hum Mol Genet 2014;23:1656–68. 10.1093/hmg/ddt532 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Yang W, Shen N, Ye D-Q, et al. Genome-Wide association study in Asian populations identifies variants in ETS1 and WDFY4 associated with systemic lupus erythematosus. PLoS Genet 2010;6:e1000841. 10.1371/journal.pgen.1000841 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Pasula S, Tessneer KL, Fu Y, et al. Role of systemic lupus erythematosus risk variants with opposing functional effects as a driver of hypomorphic expression of TNIP1 and other genes within a three-dimensional chromatin network. Arthritis Rheumatol 2020;72:780–90. 10.1002/art.41188 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Chen P-W, Luo R, Jian X, et al. The ARF6 GTPase-activating proteins ARAP2 and ACAP1 define distinct endosomal compartments that regulate integrin α5β1 traffic. J Biol Chem 2014;289:30237–48. 10.1074/jbc.M114.596155 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Sannigrahi MK, Srinivas CS, Deokate N, et al. The strong propensity of Cadherin‐23 for aggregation inhibits cell migration. Mol Oncol 2019;13:1092–109. 10.1002/1878-0261.12469 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Morimoto K, Baba Y, Shinohara H, et al. LRRK1 is critical in the regulation of B-cell responses and CARMA1-dependent NF-κB activation. Sci Rep 2016;6:25738. 10.1038/srep25738 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Kwon OC, Lee JS, Ghang B, et al. Predicting eventual development of lupus nephritis at the time of diagnosis of systemic lupus erythematosus. Semin Arthritis Rheum 2018;48:462–6. 10.1016/j.semarthrit.2018.02.012 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
annrheumdis-2020-219209supp001.pdf (13MB, pdf)
Data Availability Statement
All data relevant to the study are included in the article or uploaded as supplementary information. The meta-analysis summary association statistics in the current study are available from the corresponding author on reasonable request.




