Skip to main content
BMJ Open Access logoLink to BMJ Open Access
. 2020 Dec 3;80(5):632–640. doi: 10.1136/annrheumdis-2020-219209

Meta-analysis of 208370 East Asians identifies 113 susceptibility loci for systemic lupus erythematosus

Xianyong Yin 1,2,3,4,5,6,#, Kwangwoo Kim 7,#, Hiroyuki Suetsugu 8,9,10,#, So-Young Bang 11,12, Leilei Wen 1,2,3, Masaru Koido 9,13, Eunji Ha 7, Lu Liu 1,2,3, Yuma Sakamoto 14, Sungsin Jo 12, Rui-Xue Leng 15, Nao Otomo 8,9,16, Viktoryia Laurynenka 17, Young-Chang Kwon 12, Yujun Sheng 1,2,3, Nobuhiko Sugano 18, Mi Yeong Hwang 19, Weiran Li 1,2,3, Masaya Mukai 20, Kyungheon Yoon 19, Minglong Cai 1,2,3, Kazuyoshi Ishigaki 9,21,22,23, Won Tae Chung 24, He Huang 1,2,3, Daisuke Takahashi 25, Shin-Seok Lee 26, Mengwei Wang 1,2,3, Kohei Karino 27, Seung-Cheol Shim 28, Xiaodong Zheng 1,2,3, Tomoya Miyamura 29, Young Mo Kang 30, Dongqing Ye 15, Junichi Nakamura 31, Chang-Hee Suh 32, Yuanjia Tang 33, Goro Motomura 10, Yong-Beom Park 34, Huihua Ding 33, Takeshi Kuroda 35, Jung-Yoon Choe 36, Chengxu Li 5, Hiroaki Niiro 37, Youngho Park 12, Changbing Shen 38,39, Takeshi Miyamoto 40, Ga-Young Ahn 11, Wenmin Fei 5, Tsutomu Takeuchi 41, Jung-Min Shin 11, Keke Li 5, Yasushi Kawaguchi 42, Yeon-Kyung Lee 11, Yongfei Wang 43, Koichi Amano 44, Dae Jin Park 11, Wanling Yang 43, Yoshifumi Tada 45, Ken Yamaji 46, Masato Shimizu 47, Takashi Atsumi 48, Akari Suzuki 49, Takayuki Sumida 50, Yukinori Okada 51,52, Koichi Matsuda 53,54, Keitaro Matsuo 55,56, Yuta Kochi 57; Japanese Research Committee on Idiopathic Osteonecrosis of the Femoral Head, Leah C Kottyan 17,58, Matthew T Weirauch 17,58, Sreeja Parameswaran 17, Shruti Eswar 17, Hanan Salim 17, Xiaoting Chen 17, Kazuhiko Yamamoto 49, John B Harley 17,58,59, Koichiro Ohmura 60, Tae-Hwan Kim 11,12, Sen Yang 1,2,3, Takuaki Yamamoto 61, Bong-Jo Kim 19, Nan Shen 17,33,62, Shiro Ikegawa 8, Hye-Soon Lee 11,12, Xuejun Zhang 1,2,3,63, Chikashi Terao 9,64,65,, Yong Cui 5,, Sang-Cheol Bae 11,12,
PMCID: PMC8053352  PMID: 33272962

Abstract

Objective

Systemic lupus erythematosus (SLE), an autoimmune disorder, has been associated with nearly 100 susceptibility loci. Nevertheless, these loci only partially explain SLE heritability and their putative causal variants are rarely prioritised, which make challenging to elucidate disease biology. To detect new SLE loci and causal variants, we performed the largest genome-wide meta-analysis for SLE in East Asian populations.

Methods

We newly genotyped 10 029 SLE cases and 180 167 controls and subsequently meta-analysed them jointly with 3348 SLE cases and 14 826 controls from published studies in East Asians. We further applied a Bayesian statistical approach to localise the putative causal variants for SLE associations.

Results

We identified 113 genetic regions including 46 novel loci at genome-wide significance (p<5×10−8). Conditional analysis detected 233 association signals within these loci, which suggest widespread allelic heterogeneity. We detected genome-wide associations at six new missense variants. Bayesian statistical fine-mapping analysis prioritised the putative causal variants to a small set of variants (95% credible set size ≤10) for 28 association signals. We identified 110 putative causal variants with posterior probabilities ≥0.1 for 57 SLE loci, among which we prioritised 10 most likely putative causal variants (posterior probability ≥0.8). Linkage disequilibrium score regression detected genetic correlations for SLE with albumin/globulin ratio (rg=−0.242) and non-albumin protein (rg=0.238).

Conclusion

This study reiterates the power of large-scale genome-wide meta-analysis for novel genetic discovery. These findings shed light on genetic and biological understandings of SLE.

Keywords: lupus erythematosus, systemic, polymorphism, genetic, epidemiology


Key messages.

What is already known about this subject?

  • Genome-wide association studies have identified nearly 100 susceptibility loci for systemic lupus erythematosus (SLE) risk.

  • The known SLE loci explain partially the disease heritability.

What does this study add?

  • This study identified 113 genomic regions including 46 novel loci for SLE risk.

  • The study prioritised 110 putative causal variants including 10 putative causal variants with high confidence (posterior probability ≥0.8).

How might this impact on clinical practice or future developments?

  • These findings revealed new genetic basis for SLE and generated molecular mechanisms hypotheses for further investigations.

Introduction

Systemic lupus erythematosus (SLE) is an autoimmune disorder characterised by the production of autoantibodies that damage multiple organs.1 Considerable genetic predisposition contributes to SLE aetiology.2 To date, nearly 100 susceptibility loci have been identified for SLE, mainly through genome-wide association studies (GWASs).3–8 However, these loci collectively only explain ~30% of SLE heritability9 and their biology, in terms of causal variants, effector genes and cell types and pathological pathways that mediate genetic effects, has not yet been fully characterised.10

Genome-wide association meta-analyses have been performed to uncover new genetic associations for SLE in Asians,11 Europeans12 and trans-ancestral populations.9 However, the study sample sizes were relatively modest, which limits their ability for genetic discovery. GWASs have successfully linked genetic variants with human common diseases and traits.13 Nonetheless, only ~8% of GWAS participants are East Asians.14 East Asians have a unique population genetic history and may have ethnicity-specific genetic architecture involved in the development of disease and manifestations. For example, SLE has a remarkably higher prevalence and younger age of onset in Asians.15 16 Genetic heterogeneity may explain, at least partly, the phenotypic diversity of SLE between East Asians and Europeans.9 Hence, large-scale East Asian investigations may provide an opportunity to identify unique genetic associations even for the same diseases and traits that have already been well studied in Europeans.17

Methods

Study participants

We recruited a total of 10 029 SLE cases and 180 167 healthy controls in three independent case–control cohorts from mainland China, Korea and Japan. We analysed additionally 3348 SLE cases and 14 826 controls that were published in our previous East Asian SLE GWASs4 6–9 to increase statistical power. All the cases fulfilled the revised American College of Rheumatology SLE classification criteria or were diagnosed by collagen disease physicians (online supplemental table 1). Each participant provided written informed consent.

Supplementary data

annrheumdis-2020-219209supp001.pdf (13MB, pdf)

Genome-wide association analyses

We newly genotyped 10 029 SLE cases and 180 167 controls, and revisited raw genome-wide genotype data in 3348 SLE cases and 14 826 controls from the five published studies.4 6–9 Quality controls were conducted for each of the eight data sets. Genotype imputation was accomplished using reference panels from the 1000 Genomes Project (1KGP) phase 3 v518 and population-specific reference panels19 in IMPUTE2/420 21 or MINIMAC4.22

We tested association between SLE risk and genotype dosages in each data set using a logistic regression or linear mixed model in PLINK,23 SNPTEST24 or EPACTS (https://genome.sph.umich.edu/wiki/EPACTS) (online supplemental table 1). Within each data set, we filtered out association results based on imputation quality (IMPUTE info or MINIMAC r2 ≤0.3), minor allele frequency (MAF) ≤0.5% or Hardy-Weinberg equilibrium test p<1.0×10−6 in controls. For each cohort, the association analysis for the X chromosome was conducted separately by sex and then meta-analysed across both men and women. For data sets analysed using a linear mixed model (online supplemental table 1), allelic effects and standard errors were converted to a log-odds scale to correct for case–control imbalance.25

Fixed-effects meta-analysis

We aggregated the association summary statistics from the eight data sets using a fixed-effects inverse-variance meta-analysis in METAL.26 We applied a genomic control correction to each association summary statistic. Heterogeneity in allelic effect sizes among data sets was assessed using Cochran’s Q statistic. We excluded genetic variants available in only a single data set. We defined SLE susceptibility loci by merging ±250 kilobases (kb) windows around genome-wide associated variants to ensure that lead single nucleotide polymorphisms (SNPs) were at least 500 kb apart. We defined lead variants as the most significant SLE-associated variant within each locus. A locus was considered novel if the lead SNP was at least 500 kb away from any previously reported SLE-associated variants.

Approximate conditional association analysis

To dissect distinct association signals at each SLE locus, we performed an approximate conditional analysis using GCTA COJO27 with genome-wide meta-analysis summary statistics based on linkage disequilibrium (LD) estimated from 7021 unrelated Chinese controls. The Chinese reference individuals for LD calculation were retrieved from the Chinese study using the Illumina Infinium Global Screening Array data (online supplemental table 1), excluding first-degree and second-degree relatives.

Bayesian statistical fine-mapping analysis

To prioritise causal variants in SLE susceptibility loci, a statistical fine-mapping analysis was performed using FINEMAP v1.4 software,28 with meta-analysis z-scores and LD matrices estimated from the 7021 Chinese reference individuals. We used default priors and parameters in FINEMAP, assuming at most five causal signals in the ±250 kb region around a lead variant at each SLE locus. FINEMAP computed a posterior probability (PP) for each genetic variant being the true putative causal variant. For each association signal, we ranked the candidate putative causal variants in a descending order of their PPs, and then built a 95% credible set of causal variants by including the ordered variants until their cumulative PP reached 0.95.

Heritability estimation by LD score regression

Overall SLE heritability h2 explained by genome-wide variants was estimated using the LD score regression model29 with LD scores18 from the 1KGP East Asian descendants, based on an SLE population prevalence of 0.03% in East Asian populations.1 SLE heritability estimate was further partitioned according to known and novel SLE loci using stratified LD score regression.30 The boundary of each SLE locus was arbitrarily defined as ±500 kb flanking the lead SLE-risk variant.

Genetic correlation between SLE and other traits by LD score regression

We calculated genetic correlations between 98 traits (39 diseases17 and 59 quantitative traits31 and SLE by using bivariate LD score regression.32 We used the LD scores18 from the 1KGP East Asian descendants, limited the genetic variants to the HapMap3 SNPs and removed the variants with extended human leucocyte antigen (HLA) region (chromosome 6: 25 to 34 megabases (Mb)).

Patient and public involvement

Patients and the public were not involved in the design or analysis of this study.

Results

Identification of 46 novel SLE susceptibility loci

We performed a large genome-wide association meta-analysis in 13 377 SLE cases and 194 993 controls of East Asians (online supplemental table 1). To the best of our knowledge, this is the largest genetic association study of SLE to date. The effective sample size (Neff=50 072) is three-fold and four-fold larger than that of the largest published trans-ancestry9 and East Asian11 meta-analyses, respectively.

We tested associations for 11 270 530 genetic variants in a fixed-effects meta-analysis. A quantile–quantile plot showed that test statistics were well-calibrated, with a genomic-control inflation factor λGC=1.06 (indicating that ancestry effects had been well controlled; online supplemental figure 1). LD score regression29 showed that polygenic effects (89.4%), rather than biasses, primarily caused the inflation residual (estimated mean χ2=1.32 and LD-score intercept=1.03).

We detected 26 379 genetic variants associated with SLE at p<5×10−8 within 113 loci (figure 1A and online supplemental table 2), of which 46 were novel (table 1). The pairwise LD between lead variants was low (LD r2 <0.002). For seven novel loci, MAFs of the lead SNPs were 10-fold higher in East Asians than in Europeans (figure 1B). Two of them and their LD neighbours (r2 ≥0.2 in either East Asians or Europeans) would be undetectable in Europeans with the same effective sample size and risk magnitude due to low statistical power (<10%; online supplemental table 3).

Figure 1.

Figure 1

Summary of meta-analysis association results and comparison of MAFs for lead variants within the 46 novel loci between East Asians and Europeans. (A) Manhattan plot of genome-wide association meta-analysis results from 208 370 SLE East Asians including 13 377 SLE cases and 194 993 controls. Minus log10-transformed association p values (y-axis) are plotted along chromosomal positions (x-axis). Known and novel loci are highlighted in light blue and pink, respectively. The red dashed line denotes the genome-wide association significance threshold of p=5×10−8. The grey dashed line represents p=10−30, at which the y-axis breaks. (B) Comparison of MAFs of lead variants within the 46 novel loci between East Asians (y-axis) and non-Finnish Europeans (x-axis) in the Genome Aggregation Database (gnomAD) v3. Variants with more than 10 times higher MAFs in East Asians are coloured purple above a red dashed line. MAF, minor allele frequency.

Table 1.

Association results for the 46 novel susceptibility loci for systemic lupus erythematosus

Region CHR Position Variant EA NEA EAF OR SE P value I2 PHet N Nearest gene
1 1 117 043 302 rs9651076 A G 0.431 1.117 0.015 3.26E−13 10.7 0.347 208 370 CD58
2 1 157 108 159 rs116785379 C G 0.107 1.211 0.024 6.68E−16 43.7 0.114 208 370 ETV3
7 1 201 979 455 rs3806357 A G 0.251 1.106 0.017 4.25E−09 0.0 0.672 208 370 ELF3
9 2 7 573 079 rs75362385 T G 0.321 0.887 0.017 8.40E−13 68.3 0.007 208 370 LOC100506274
14 2 111 877 174 rs73954925 C G 0.878 1.169 0.024 5.11E−11 56.4 0.043 208 370 BCL2L11
18 2 198 929 806 rs7572733 T C 0.260 1.143 0.017 1.25E−14 0.0 0.647 208 370 PLCL1
20 3 28 072 086 rs438613 T C 0.588 0.920 0.014 7.52E−09 69.4 0.006 208 370 LINC01980
21 3 72 225 916 rs7637844 A C 0.871 0.877 0.023 1.28E−08 0.0 0.906 208 370 LINC00870
25 4 2 700 844 rs231694 T C 0.380 1.111 0.018 9.71E−09 23.7 0.269 57 253 FAM193A
26 4 40 307 587 rs113284964 G GCTTC 0.371 1.134 0.015 1.35E−16 67.2 0.009 208 370 LINC02265
27 4 79 644 279 rs6533951 A G 0.350 1.111 0.016 1.25E−10 61.4 0.024 208 370 LINC01094
28 4 84 146 996 rs6841907 T C 0.729 0.906 0.016 1.10E−09 43.5 0.115 208 370 COQ2
31 4 109 061 618 rs58107865 C G 0.227 0.802 0.021 6.57E−25 1.1 0.409 208 370 LEF1
34 5 131 120 338 rs370449198 A AC 0.922 0.721 0.060 4.41E−08 0.0 0.408 187 562 FNIP1
35 5 131 829 578 rs2549002 A C 0.682 0.905 0.016 2.40E−10 20.6 0.279 208 370 IRF1
40 6 243 302 rs9503037 A G 0.693 0.881 0.016 1.36E−15 42.3 0.123 208 370 LOC285766
43 6 36 715 031 rs34868004 CA C 0.225 1.104 0.017 4.46E−09 40.7 0.134 208 370 CPNE5
46 6 116 690 849 rs9488914 T C 0.920 0.862 0.026 1.14E−08 65.3 0.013 208 370 DSE
48 6 154 570 651 rs9322454 A G 0.659 1.090 0.015 2.42E−08 0.0 0.430 208 370 IPCEF1
54 8 71 330 166 rs142937720 A AAGTGGCC 0.383 0.894 0.016 2.27E−12 67.9 0.008 208 370 NCOA2
55 8 72 894 959 rs17374162 A G 0.411 0.917 0.015 3.02E−09 35.7 0.169 208 370 MSC-AS1
56 8 129 425 593 rs16902895 A G 0.678 1.122 0.016 1.48E−13 0.0 0.801 208 370 LINC00824
58 9 21 267 087 rs7858766 T C 0.538 1.139 0.016 2.25E−15 0.0 0.825 208 370 IFNA22P
59 10 5 910 746 rs77448389 A G 0.913 0.855 0.025 7.30E−10 0.0 0.584 208 370 ANKRD16
62 10 64 411 288 rs10995261 T C 0.240 0.909 0.017 2.57E−08 43.9 0.113 208 370 ZNF365
63 10 73 466 709 rs10823829 T C 0.718 0.910 0.016 1.05E−09 0.0 0.771 208 370 CDH23
64 10 105 677 911 rs111447985 A C 0.073 1.172 0.028 1.72E−08 0.0 0.526 208 370 STN1
65 10 112 664 114 rs58164562 T C 0.748 0.892 0.016 3.14E−12 33.3 0.186 208 370 BBIP1
66 11 4 113 200 rs3750996 A G 0.834 1.167 0.022 1.89E−12 0.0 0.522 208 370 STIM1
67 11 18 362 382 rs77885959 T G 0.978 1.694 0.062 3.16E−17 0.0 0.511 204 433 GTF2H1
74 12 4 140 876 rs2540119 T C 0.544 1.086 0.015 3.51E−08 44.9 0.106 208 370 PARP11
77 12 103 916 080 rs6539078 T C 0.591 0.894 0.015 9.49E−14 0.0 0.916 208 370 LOC105369945
79 12 121 368 518 rs3999421 A T 0.506 0.910 0.016 1.29E−09 47.3 0.091 208 370 XLOC_009911
81 12 133 040 182 rs200521476 G GCATCAC 0.812 0.875 0.023 5.66E−09 26.7 0.235 208 370 FBRSL1
86 15 101 529 012 rs35985016 A G 0.930 0.843 0.030 1.95E−08 0.0 0.897 204 433 LRRK1
90 16 50 089 207 rs11288784 G GT 0.365 0.902 0.016 2.38E−10 0.0 0.664 208 370 HEATR3
93 16 79 745 672 rs11376510 G GT 0.737 0.898 0.017 2.23E−10 0.0 0.719 208 370 MAFTRR
95 17 7 240 391 rs61759532 T C 0.076 1.235 0.032 2.79E−11 24.9 0.247 208 370 ACAP1
97 17 47 468 020 rs2671655 T C 0.651 1.087 0.015 4.60E−08 0.0 0.756 208 370 LOC10272459
98 17 76 373 179 rs113417153 T C 0.193 0.893 0.020 1.90E−08 2.1 0.403 208 370 PGS1
100 18 77 386 912 rs118075465 A G 0.147 1.140 0.020 1.16E−10 0.0 0.543 208 370 LOC284241
101 19 948 532 rs2238577 T C 0.455 0.885 0.016 1.83E−14 60.8 0.026 208 370 ARID3A
102 19 6 697 088 rs5826945 A T 0.929 0.836 0.028 9.67E−11 50.0 0.075 208 370 C3
105 19 33 072 768 rs12461589 T C 0.248 0.898 0.017 5.00E−10 0.0 0.510 208 370 PDCD5
106 19 49 851 746 rs33974425 CCAGCTGCAT C 0.702 1.120 0.016 4.40E−12 42.6 0.121 208 370 TEAD2
108 22 18 649 356 rs4819670 T C 0.210 1.151 0.022 5.53E−11 0.0 0.650 208 370 USP18

CHR, chromosome; EA, effect allele; EAF, effect allele frequency; I2, genetic heterogeneity I2 statistics at scale of 0% to 100%; N, study sample size; NEA, non-effect allele; OR, Odds ratio; PHet, P-values for the χ2 test of genetic heterogeneity; Region, unique ID for genomic region;SE, Standard error of odds ratio.

Associations at exonic variants

The meta-analysis identified lead missense variants in two novel loci (CHD23 and LRRK1; figure 2A, B and online supplemental table 2). In addition, we detected three new exonic variants (including two missense variants) within the reported SLE loci including CSK (rs11553760), IKBKB (rs2272736) and TYK2 (rs55882956) genes (figure 2C–E and online supplemental table 2). They were not correlated with previously reported exonic variants within the same genes (LD r2 <0.02 in East Asians or Europeans; online supplemental table 4), suggesting possible allelic heterogeneity of these genes. We replicated four known associations for missense variants at AHNAK2 (rs2819426),33 IRAK1 (rs1059702),34 NCF2 (rs13306575) and WDFY4 (rs7097397; online supplemental table 2).35 36

Figure 2.

Figure 2

New lead exonic variants identified at three known (CSK, IKBKB and TYK2) and two novel (CHD23 and LRRK1) loci. (A) rs11553760 (synonymous variant) at CSK. (B) rs2272736 (p.Arg303Gln, missense variant) at IKBKB. (C) rs55882956 (p.Arg703Trp, missense variant) at TYK2. (D) rs10823829 (synonymous variant) at CHD23. (E) rs35985016 (p.Lys203Glu, missense variant) at LRRK1. The lead SNP is labelled as purple diamond. The LD is estimated from 7021 Chinese samples. LD, linkage disequilibrium; Mb, megabases; SNP, single nucleotide polymorphism.

Secondary association signals within SLE loci

To dissect the source of association signals at each locus, we conducted an approximate conditional analysis using GCTA27 with meta-analysis summary statistics and LD estimates from 7021 unrelated Chinese controls. We acknowledge the limitations of using LD estimation from a single population for a meta-analysis of diverse East Asians. We identified a total of 233 independent association signals with conditional p<5×10−8, 169 of which arose from non-HLA regions (online supplemental table 5). We observed from two to four signals at each of 28 non-HLA loci (including seven novel loci). For example, we discovered two distinct association signals within the known STAT4 locus, including the previously reported SNP rs1188934112 and the new insert-deletion variant (indel) rs71403211 (figure 3A). For the 46 novel loci, we discovered 55 distinct signals (online supplemental table 5 and figure 2). We noticed that most of the signal index variants (n=190, 82%) are common (MAF ≥5%) with modest effects (online supplemental table 5).

Figure 3.

Figure 3

Two independent association signals identified. (A) At two intronic variants within known STAT4 locus. (B) At known (rs7097397, p.Arg1816Gln) and new (rs7072606, p.Ser214Pro) missense variants within WDFY4 locus. (C) A known intronic variant within ATXN2 gene and a new (rs1131476, p.Ala352Thr) missense variant within OAS1 gene. The lead and secondary index variants are labelled in diamond. The lead variant and its LD proxies are in red while the secondary signal index variant and its LD proxies are in blue. The LD is estimated from 7021 Chinese samples. LD, linkage disequilibrium; Mb, megabases.

Approximate conditional analysis detected two novel missense variants at WDFY4 and OAS1 genes. We detected two distinct signals within WDFY4, including the known (rs7097397)37 and a new (rs7072606) missense variant (LD r2=0.02 between two variants in East Asians), which suggests allelic heterogeneity at this locus (figure 3B). We provided for the first time genome-wide association evidence at a missense variant within OAS1 (rs1131476, LD r2=0.78 with rs1051042, which is a known missense variant but only exhibited suggestive significance with SLE in previous study,33 figure 3C and online supplemental table 5).

Prioritisation of causal variants

To prioritise putative causal variants, we conducted a Bayesian statistical fine-mapping analysis for 111 loci using FINEMAP28 after excluding complex associations involving the HLA and 7q11.23. We found exactly the same number of association signals in 57 loci between FINEMAP causal configuration with the highest posterior probability and the GCTA approximate conditional test. To be conservative, we only summarised the statistical fine-mapping results for these 57 regions, which contained 65 association signals (online supplemental table 6).

For each signal, we built a credible set of putative causal variants with a 95% probability of including the true causal variants. The size of 28 credible sets was small (size ≤10; figure 4A). Among the 110 putative causal variants with posterior probability ≥0.1 (figure 4B), we found four coding variants (3.6%), which implies that most of these associations are probably induced by non-coding causal variants. The prioritised variants are available to be tested as potential targets in perturbation experiments. For example, the allele-specific regulatory activity of the intronic variant (rs10036748) with the highest posterior probability (0.387) in the TNIP1 locus was recently experimentally characterised in SLE.38

Figure 4.

Figure 4

Results of statistical fine-mapping analysis. (A) Number of 95% credible sets of putative causal variants, binned by their sizes. (B) Number of potential causal variants with posterior probabilities (PP) ≥0.1, which are considered to be the true causal variants.

We pinpointed a single most likely causal variant with high confidence (posterior probability ≥0.8) for four known (ATXN2, BACH2, DRAM1/WASHC3 and NCF2) and six novel (17p13.1, ELF3, GTF2H1, LRRK1, LOC102724596/PHB and STIM1) loci (online supplemental table 6). For example, we prioritised rs61759532 as a putative causal variant at the novel 17p13.1 locus (PP=0.999). This variant is located in an intron of ACAP1, which encodes a key regulator of integrin traffic for cell adhesion and migration.39

SNP-based heritability

To assess the proportion of phenotypic variance explained by common variants, we applied LD score regression29 to the meta-analysis results. Assuming a population prevalence of 0.03% for SLE,1 we estimated the liability-scale SNP-based heritability from all non-HLA variants as h2 SNP = 7.24% (SE=0.78%). The 66 known and 46 novel non-HLA loci explained 62.6% (SE=4.9%) and 22.1% (SE=2.6%) of this overall SNP-based heritability, respectively.

Genetic correlation with other diseases/traits

To explore shared genetics between SLE and various traits, we calculated genetic correlations of SLE with 39 complex diseases and 59 quantitative traits in Biobank Japan participants using bivariate LD score regression32 (online supplemental table 7). As expected, we detected significant positive genetic correlations between SLE and two other autoimmune diseases: rheumatoid arthritis (rg=0.437) and Graves’ disease (rg=0.318). In addition, we found unreported genetic correlations (FDR<0.05) with albumin/globulin ratio (rg=−0.242) and non-albumin protein (rg=0.238).

Discussion

Here, we carried out the largest-ever genome-wide association meta-analysis for SLE and identified 113 risk loci including 46 novel regions for SLE in 208 370 East Asians including 13 377 SLE cases and 194 993 controls. This study revealed new genetic predispositions for SLE and generated hypotheses for further studies to investigate diseases functional mechanisms.

Epidemiological studies have found the higher prevalence of SLE in East Asians and heterogeneous disease manifestations across ethnicities.15 16 Previous investigations suggested genetics might explain the phenotypic heterogeneity.9 We observed that the MAFs of the index variants for several novel genetic associations were much higher in East Asians than in Europeans. Specifically, we suggested two novel loci were more likely specific to East Asians. These findings might help explain the genetic basis of SLE phenotypic heterogeneity between East Asians and Europeans. The results reinforce the power of large-scale genetic association for genetic discovery of SLE in relatively less studied populations.

We identified 11 exonic variants including two missense variants within novel loci CHD23 and LRRK1, four novel missense variants within known SLE loci IKBKB,9 TYK2,9 WDFY4 37 and OAS1, 33 and three known missense variants within known AHNAK2,33 IRAK1 34 and NCF2.35 36 These findings suggested allelic heterogeneity within several of these loci and highlighted the disease-risk effects of genes AHNAK2, CSK, IKBKB, IRAK1, NCF2, OAS1, TYK2 and WDFY4 within eight known loci, and CDH23 and LRRK1 within two novel loci which potentially alter gene product activity in an allele-specific manner. The novel gene CHD23 plays a role in cell migration40 while LRRK1 encodes a multiple-domain leucine-rich repeat kinase. A previous study observed that LRRK1-deficient mice exhibited a profound defect in B-cell proliferation and survival and impaired B-cell receptor-mediated NF-κB activation,41 which suggested that the association within this region might confer the risk of SLE through modulating the NF-κB pathway and the activities of B cells. We noted that the Bayesian statistical fine-mapping analysis prioritised the lead missense variant rs35985016 as the most likely putative causal variant for this association. This variant is highly frequent in our study individuals but is rare in Europeans. The molecular mechanisms in SLE risk worthy further investigations.

In the present study, we localised the putative causal variants for SLE genetic association in high resolution. Our findings indicated that the putative causal variants for the majority of SLE associations were non-coding variants. We provided targets of candidate putative causal variants with high confidence for several SLE loci. These findings are worthy for further exploration in functional experiments. We showed the regulatory effect of one of the putative causal variants in an accompanied paper. We acknowledged the limitation of a small LD reference panel from single population in the Bayesian statistical fine-mapping analysis.

We found for the first time the significant genetic correlations between SLE, albumin/globulin ratio and non-albumin protein. These findings might reflect the renal complications commonly developed in SLE patients who have been reported to have significantly lower albumin/globulin ratio and higher serum globulin than healthy controls in epidemiological studies.42 These shared genetic basis findings might suggest a common pathway underlying the SLE risk and kidney function in addition to the direct damage of SLE autoantibodies on kidney.

In summary, we detected 46 novel loci for SLE risk in the largest meta-analysis and prioritised putative causal variants for 65 causal signals. This study highlights the power of large-scale genetic association study in East Asian populations. The findings reveal the genetic predispositions for SLE and provide clues for further the investigation of disease mechanisms.

Acknowledgments

We acknowledged the participants in this study. We appreciate the contribution of Japanese Research Committee on Idiopathic Osteonecrosis of the Femoral Head. We appreciate all contributors to BioBank Japan. Details are included in supplementary material.

Footnotes

Handling editor: Josef S Smolen

XY, KKim and HS contributed equally.

Contributors: XY, KKim and HS contributed equally to this work, and either has the right to list himself first in bibliographical documents. SCB, YC, CT, XZhang, XY, KKim and HS conceived the study design. SCB, YC, XZhang, SY, KKim and CT acquainted the financial support. XY, KKim, HS, CT, YC and SCB wrote the manuscript. XY, KKim, HS, EH, XZheng, VL and YW conducted all of the analyses with the help of JBH, LCK, MTW, SP, SE, HS, KT, NO, MK, KI and C Terao. KKim, SYB, LW, LL, RXL, YSheng, MYH, WL, KYoon, MC, HH, MW, YTang, HD, CL, CS, WF, KL, BJK, HSL, SCB, SH, YSakamoto, NSugano, MM, DT, KKarino, TMiyamura, JN, GM, TKuroda, HN, TMiyamoto, TT, YKawaguchi, KA, YTada, KYamaji, MS, TA, AS, TSumida, YOkada, KMatsuda, KMatsuo, YKochi, TSeki, YTanaka, TKubo, RH, TYoshioka, MY, TKabata, YA, YOhta, TO, YN, AK, YY, KOhzono, KYamamoto, KOhmura, TYamamoto and SI generated genetic data. SYB, SJ, YCK, WTC, SSL, SCS, YMK, DY, CHS, YBP, JYC, YP, GYA, JMS, YKL, DJP, WY, THK, SY, BJK, NShen, HSL, XZhang, CT and SCB managed the cohort data. All authors reviewed and approved the manuscript.

Funding: This research was supported by General Program (81872516, 81573033, 81872527, 81830019, 81421001), Young Program (81803117, 82003328), Exchange Program (81881340424), and Science Fund for Creative Research Groups (31630021) of National Natural Science Foundation of China (NSFC), Distinguished Young Scholar of Provincial Natural Science Foundation of Anhui (1808085J08), National Program on Key Basic Research Project of China (973 Program) (2014CB541901), China National Key R&D Program (2016YFC0906100), Science Foundation of Ministry of Education of China (213018A), Program for New Century Excellent Talents in University of Ministry of Education of China (NCET-12-0600), The Bio & Medical Technology Development Program of the National Research Foundation, funded by the Ministry of Science & ICT of the Republic of Korea (NRF-2017M3A9B4050355 to SCB), Basic Science Research Program through the National Research Foundation of Korea funded by the Ministry of Science, ICT and Future Planning (2015R1C1A1A02036527 and 2017R1E1A1A01076388 to KKim), National BioBank of Korea, the Centers for Disease Control and Prevention, Republic of Korea (KBN-2018-031 to SSL), Center for Genome Science, Korea National Institute of Health, Republic of Korea (4845-301, 3000-3031 to MYH, KYoon and BJK), Japan Agency for Medical Research and Development (AMED) and the BioBank Japan project supported by the Ministry of Education, Culture, Sports, Sciences and Technology of the Japanese Government and AMED under grant numbers (17km0305002 and 18km0605001), Grant of Japan Orthopaedics and Traumatology Research Foundation, lnc, (No. 350 to YSakamoto), RIKEN Junior Research Associate Program (to H.S.), US NIH grants (AI024717, AI130830, AI148276, HG172111 and AR070549 to JBH), US Department of Veterans Affairs (BX001834 to JBH) and Center for Pediatric Genomics Award and CCRF Endowed Scholar Award of Cincinnati Children’s Hospital (to MTW).

Competing interests: None declared.

Provenance and peer review: Not commissioned; externally peer reviewed.

Supplemental material: This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

Data availability statement

All data relevant to the study are included in the article or uploaded as supplementary information. The meta-analysis summary association statistics in the current study are available from the corresponding author on reasonable request.

Ethics statements

Patient consent for publication

Not required.

Ethics approval

The study protocol was approved by the Institutional Review Board at each participating institute and the meta-analysis study was additionally approved by the Institutional Review Boards at Anhui Medical University, Hanyang University Hospital of Rheumatic Diseases, and RIKEN Center for Medical Sciences.

References

  • 1. Carter EE, Barr SG, Clarke AE. The global burden of SLE: prevalence, health disparities and socioeconomic impact. Nat Rev Rheumatol 2016;12:605–20. 10.1038/nrrheum.2016.137 [DOI] [PubMed] [Google Scholar]
  • 2. Guerra SG, Vyse TJ, Cunninghame Graham DS. The genetics of lupus: a functional perspective. Arthritis Res Ther 2012;14:211. 10.1186/ar3844 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Gateva V, Sandling JK, Hom G, et al. A large-scale replication study identifies TNIP1, PRDM1, JAZF1, UHRF1BP1 and IL10 as risk loci for systemic lupus erythematosus. Nat Genet 2009;41:1228–33. 10.1038/ng.468 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Han J-W, Zheng H-F, Cui Y, et al. Genome-Wide association study in a Chinese Han population identifies nine new susceptibility loci for systemic lupus erythematosus. Nat Genet 2009;41:1234–7. 10.1038/ng.472 [DOI] [PubMed] [Google Scholar]
  • 5. Cunninghame Graham DS, Morris DL, Bhangale TR, et al. Association of NCF2, IKZF1, IRF8, IFIH1, and TYK2 with systemic lupus erythematosus. PLoS Genet 2011;7:e1002341. 10.1371/journal.pgen.1002341 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Okada Y, Shimane K, Kochi Y, et al. A genome-wide association study identified AFF1 as a susceptibility locus for systemic lupus eyrthematosus in Japanese. PLoS Genet 2012;8:e1002455. 10.1371/journal.pgen.1002455 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Kim K, Bang S-Y, Lee H-S, et al. The HLA-DRβ1 amino acid positions 11-13-26 explain the majority of SLE-MHC associations. Nat Commun 2014;5:5902. 10.1038/ncomms6902 [DOI] [PubMed] [Google Scholar]
  • 8. Akizuki S, Ishigaki K, Kochi Y, et al. PLD4 is a genetic determinant to systemic lupus erythematosus and involved in murine autoimmune phenotypes. Ann Rheum Dis 2019;78:509–18. 10.1136/annrheumdis-2018-214116 [DOI] [PubMed] [Google Scholar]
  • 9. Morris DL, Sheng Y, Zhang Y, et al. Genome-Wide association meta-analysis in Chinese and European individuals identifies ten new loci associated with systemic lupus erythematosus. Nat Genet 2016;48:940–6. 10.1038/ng.3603 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Catalina MD, Owen KA, Labonte AC, et al. The pathogenesis of systemic lupus erythematosus: harnessing big data to understand the molecular basis of lupus. J Autoimmun 2020;110:102359. 10.1016/j.jaut.2019.102359 [DOI] [PubMed] [Google Scholar]
  • 11. Sun C, Molineros JE, Looger LL, et al. High-Density genotyping of immune-related loci identifies new SLE risk variants in individuals with Asian ancestry. Nat Genet 2016;48:323–30. 10.1038/ng.3496 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Bentham J, Morris DL, Graham DSC, et al. Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus. Nat Genet 2015;47:1457–64. 10.1038/ng.3434 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Buniello A, MacArthur JAL, Cerezo M, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res 2019;47:D1005–12. 10.1093/nar/gky1120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Sirugo G, Williams SM, Tishkoff SA. The missing diversity in human genetic studies. Cell 2019;177:26–31. 10.1016/j.cell.2019.02.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Danchenko N, Satia JA, Anthony MS. Epidemiology of systemic lupus erythematosus: a comparison of worldwide disease burden. Lupus 2006;15:308–18. 10.1191/0961203306lu2305xx [DOI] [PubMed] [Google Scholar]
  • 16. Morais SA, Isenberg DA. A study of the influence of ethnicity on serology and clinical features in lupus. Lupus 2017;26:17–26. 10.1177/0961203316645204 [DOI] [PubMed] [Google Scholar]
  • 17. Ishigaki K, Akiyama M, Kanai M, et al. Large-Scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat Genet 2020;52:669–79. 10.1038/s41588-020-0640-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18., Auton A, Brooks LD, et al. , 1000 Genomes Project Consortium . A global reference for human genetic variation. Nature 2015;526:68–74. 10.1038/nature15393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Kim Y, Kim B, Han B. The Korean reference genome Project: construction of the refer- ence panel for imputation analysis. Presented at the 61th Annual Meeting of The American Society of Human Genetics; October 6, 2015, Baltimore, MD, 2015. [Google Scholar]
  • 20. Bycroft C, Freeman C, Petkova D, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 2018;562:203–9. 10.1038/s41586-018-0579-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 2009;5:e1000529. 10.1371/journal.pgen.1000529 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Das S, Forer L, Schönherr S, et al. Next-Generation genotype imputation service and methods. Nat Genet 2016;48:1284–7. 10.1038/ng.3656 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007;81:559–75. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Marchini J, Howie B. Genotype imputation for genome-wide association studies. Nat Rev Genet 2010;11:499–511. 10.1038/nrg2796 [DOI] [PubMed] [Google Scholar]
  • 25. Cook JP, Mahajan A, Morris AP. Guidance for the utility of linear models in meta-analysis of genetic association studies of binary phenotypes. European Journal of Human Genetics 2017;25:240–5. 10.1038/ejhg.2016.150 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 2010;26:2190–1. 10.1093/bioinformatics/btq340 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Yang J, Ferreira T, Morris AP, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet 2012;44:369–75. 10.1038/ng.2213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Benner C, Spencer CCA, Havulinna AS, et al. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 2016;32:1493–501. 10.1093/bioinformatics/btw018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Bulik-Sullivan BK, Loh P-R, Finucane HK, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 2015;47:291–5. 10.1038/ng.3211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Finucane HK, Bulik-Sullivan B, Gusev A, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet 2015;47:1228–35. 10.1038/ng.3404 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Terao C, Momozawa Y, Ishigaki K, et al. GWAS of mosaic loss of chromosome Y highlights genetic effects on blood cell differentiation. Nat Commun 2019;10:4719. 10.1038/s41467-019-12705-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Bulik-Sullivan B, Finucane HK, Anttila V, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet 2015;47:1236–41. 10.1038/ng.3406 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Wen L, Zhu C, Zhu Z, et al. Exome-wide association study identifies four novel loci for systemic lupus erythematosus in Han Chinese population. Ann Rheum Dis 2018;77:417. 10.1136/annrheumdis-2017-211823 [DOI] [PubMed] [Google Scholar]
  • 34. Zhang Y, Zhang J, Yang J, et al. Meta-Analysis of GWAS on two Chinese populations followed by replication identifies novel genetic variants on the X chromosome associated with systemic lupus erythematosus. Hum Mol Genet 2015;24:274–84. 10.1093/hmg/ddu429 [DOI] [PubMed] [Google Scholar]
  • 35. Armstrong DL, Eisenstein M, Zidovetzki R, et al. Systemic lupus erythematosus-associated neutrophil cytosolic factor 2 mutation affects the structure of NADPH oxidase complex. J Biol Chem 2015;290:12595–602. 10.1074/jbc.M115.639021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Kim-Howard X, Sun C, Molineros JE, et al. Allelic heterogeneity in NCF2 associated with systemic lupus erythematosus (SLE) susceptibility across four ethnic populations. Hum Mol Genet 2014;23:1656–68. 10.1093/hmg/ddt532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Yang W, Shen N, Ye D-Q, et al. Genome-Wide association study in Asian populations identifies variants in ETS1 and WDFY4 associated with systemic lupus erythematosus. PLoS Genet 2010;6:e1000841. 10.1371/journal.pgen.1000841 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Pasula S, Tessneer KL, Fu Y, et al. Role of systemic lupus erythematosus risk variants with opposing functional effects as a driver of hypomorphic expression of TNIP1 and other genes within a three-dimensional chromatin network. Arthritis Rheumatol 2020;72:780–90. 10.1002/art.41188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Chen P-W, Luo R, Jian X, et al. The ARF6 GTPase-activating proteins ARAP2 and ACAP1 define distinct endosomal compartments that regulate integrin α5β1 traffic. J Biol Chem 2014;289:30237–48. 10.1074/jbc.M114.596155 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Sannigrahi MK, Srinivas CS, Deokate N, et al. The strong propensity of Cadherin‐23 for aggregation inhibits cell migration. Mol Oncol 2019;13:1092–109. 10.1002/1878-0261.12469 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Morimoto K, Baba Y, Shinohara H, et al. LRRK1 is critical in the regulation of B-cell responses and CARMA1-dependent NF-κB activation. Sci Rep 2016;6:25738. 10.1038/srep25738 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Kwon OC, Lee JS, Ghang B, et al. Predicting eventual development of lupus nephritis at the time of diagnosis of systemic lupus erythematosus. Semin Arthritis Rheum 2018;48:462–6. 10.1016/j.semarthrit.2018.02.012 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data

annrheumdis-2020-219209supp001.pdf (13MB, pdf)

Data Availability Statement

All data relevant to the study are included in the article or uploaded as supplementary information. The meta-analysis summary association statistics in the current study are available from the corresponding author on reasonable request.


Articles from Annals of the Rheumatic Diseases are provided here courtesy of BMJ Publishing Group

RESOURCES