Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Sep 1.
Published in final edited form as: Arthritis Rheumatol. 2020 Aug 6;72(9):1483–1492. doi: 10.1002/art.41291

Pleiotropy in the Genetic Predisposition to Rheumatoid Arthritis - a phenome-wide association study and inverse-variance weighted meta-analysis

Vivian K Kawai 1, Mingjian Shi 2, Qiping Feng 1, Cecilia P Chung 1,3,4,5, Ge Liu 1, Nancy J Cox 5, Gail P Jarvik 6, Ming TM Lee 7, Scott J Hebbring 8, John B Harley 9,10,11, Kenneth M Kaufman 9,10,11, Bahram Namjou 9,10, Eric Larson 12, Adam S Gordon 13, Dan M Roden 1,2, C Michael Stein 1, Jonathan D Mosley 1,2; eMERGE investigators14
PMCID: PMC7572512  NIHMSID: NIHMS1597503  PMID: 32307929

Abstract

Objective:

To examine the hypothesis that a genetic predisposition toward rheumatoid arthritis (RA) increases the risk for 10 cardiometabolic and autoimmune disorders previously associated with RA in epidemiological studies and to define new genetic pleiotropy.

Methods:

We tested the hypothesis using two approaches. First, we constructed a weighted genetic risk score (wGRS) and studied its association with the 10 prespecified disorders; additionally, we performed a phenome-wide study (PheWAS) to identify potential new associations. Second, we used inverse variance weighted regression (IVWR) meta-analysis to characterize the association between genetic susceptibility to RA and the prespecified disorders.

Results:

The wGRS for RA was significantly associated with type I diabetes (T1D) (OR 1.10 [95% CI 1.04–1.16], P=9.82×10−4) and multiple sclerosis (OR 0.82 [0.77–0.88], P=1.73×10−8), but not with other cardiometabolic phenotypes. In the PheWAS, the wGRS was additionally associated with increased risk of several autoimmune phenotypes including RA, thyroiditis, and systemic sclerosis, and with decreased risk of demyelinating disorders. In the IVWR meta-analyses, RA was significantly associated with T1D (P=1.15×10−14) with evidence of horizontal pleiotropy (MR-Egger intercept estimate P=0.001) likely driven by rs2476601, a PTPN22 variant. The association between T1D and RA remained significant (P=9.53×10−9) after excluding rs2476601 with no evidence of horizontal pleiotropy (intercept estimate P=0.939). RA was also significantly associated with type 2 diabetes and C-reactive protein; these associations were driven by variation in the major histocompatibility complex (MHC).

Conclusion:

We found evidence suggesting pleiotropy between the genetic predisposition for RA and other autoimmune disorders including T1D.

Keywords: rheumatoid arthritis, genetic risk score, pleiotropy

INTRODUCTION

Rheumatoid arthritis (RA) is a chronic inflammatory disease that affects approximately 1% of the population (1). Epidemiologic and clinical studies have noted an increased prevalence of several conditions occurring in association with RA, including coronary artery disease (CAD) (2), ischemic stroke (3), atrial fibrillation (AF) (4), diabetes - particularly type 1 diabetes (T1D) (5, 6), chronic kidney disease (CKD) (7), hypertension (HTN) (8), central obesity (8), and low levels of low density lipoprotein cholesterol (LDL-C) (9), as well as with decreased prevalence of multiple sclerosis (MS) (10). Inflammation contributes to the pathogenesis of many of these conditions, raising the possibility of shared genetic pathogenesis with RA.

Numerous common single nucleotide polymorphisms (SNPs) associated with risk of RA have been identified by genome wide association studies (GWAS) (11). Such information can be harnessed to construct a genetic risk score (GRS) to predict not only RA but also conditions that may have shared pathogenesis; for example, as was used to define the inverse relationship between genetic predictors of LDL-C concentrations and risk of type 2 diabetes (T2D) (12). Moreover, in addition to examining the relationship between a GRS and a few prespecified phenotypes, phenome wide association studies (PheWAS) allow the unbiased identification of new phenotype associations (13).

Summary statistics from large GWAS studies can be used to test whether SNPs associated with one disease or risk factor (e.g. RA) are also associated with a second disease or risk factor (e.g., T1D, LDL-C) using different approaches including inverse variance weighted regression (IVWR) meta-analysis (14). Importantly, the genetics underlying many of the conditions associated with RA have been characterized by large GWAS.

To define whether a genetic predisposition to RA increases risk of other autoimmune and cardiometabolic conditions, we used two approaches: 1) we constructed a weighted GRS for RA and examined its associations with selected prespecified phenotypes reported to be associated with RA and also performed a PheWAS to identify new pleiotropic associations; 2) we used IVWR meta-analysis to test the hypothesis that genetic predisposition to RA increases the risk of selected prespecified phenotypes associated with RA.

METHODS

Data Sources

We studied the association between the genetic liability for RA and 10 autoimmune and cardiometabolic outcomes previously associated with RA: AF, ischemic stroke, CAD, T2D, obesity, HTN, CKD, hyperlipidemia, T1D, and MS using BioVU, the Vanderbilt University Medical Center (VUMC) DNA Biobank. A full description of BioVU has been published (15). Briefly, BioVU accrues DNA from blood samples obtained during routine clinical care from patients who have consented to have a DNA sample collected. DNA is extracted from samples that would otherwise be discarded, de-identified, and linked to a de-identified version of the electronic health record (EHR) at VUMC. Approval for the present study was obtained from the Vanderbilt Institutional Review Board. We used data from the eMERGE network (16) to replicate PheWAS findings. To replicate significant findings in the IVWR analysis, we used summary statistics from the UK Biobank (Neale Lab GWAS round 2 available at http://nealelab.is/uk-biobank).

For the IVWR analyses, we selected the largest genetic meta-analysis with summary-level data available for individuals of European ancestry for the selected outcomes (or proxies when the exact phenotype was not available) previously associated with RA (Supplementary Table S1). We studied the same 10 outcomes as used for the GRS association analyses and also 2 additional immunological biomarkers for which there are no good phenotype equivalents in the EHR, C-reactive protein (CRP) and interleukin 6 (IL-6) concentrations.

Genotyping

In the BioVU EHR cohort, genotyping was performed by the Vanderbilt Technologies for Advance Genomics (VANTAGE) according to standard protocols on the Illumina Infinium Multi-Ethnic Genotyping Array (MEGAEX) platform. eMERGE subjects were genotyped on multiple platforms and underwent QC analyses and imputation as previously described. (17, 18) Quality control (QC) analyses used PLINK version 1.90β3 (19, 20) and included reconciling strand flips, verifying that allele frequencies were concordant among data sets, and identifying duplicate and related individuals (one of each pair of subjects with a pi-hat>0.05 was excluded).. Data sets were standardized using the HRC-1000G-check tool v4.2.5 (http://www.well.ox.ac.uk/~wrayner/tools/) and pre-phased using SHAPEIT (21). BioVU data was imputed using IMPUTE2 (22), in conjunction with the 10/2014 release of the 1000 Genomes cosmopolitan reference haplotypes. All other genetic data were imputed using the Michigan Imputation Server (HRC v1.1)(23). . Imputed data were filtered for a sample missingness rate <2%, a SNP missingness rate <4% and SNP deviation from Hardy-Weinberg P< 10-6. Principal components (PCs) were calculated using the SNPRelate package (24).

Phenotype Data

For the 10 prespecified autoimmune and cardiometabolic outcomes, we extracted clinical diagnoses from the EHR identified by the International Classification of Diseases, Ninth Revision (ICD9) codes that mapped to the prespecified outcomes and transformed these ICD9 codes into phecodes, which aggregate one or more related ICD-9 codes into distinct diseases or traits (13). For each phenotype, cases were defined as individuals with 2 or more instances of the specific phecode in the EHR. Controls were defined as individuals without the phecode or any close related phecode. For the PheWAS analysis, we followed the same procedures and included 1162 clinical phenotypes with 100 or more cases in the EHR to assure statistical power. Controls were matched by the represented age of cases.

Statistical analysis

Because the risk alleles were derived largely from GWAS studies in whites of European ancestry, only individuals of European ancestry were included in the analyses. Genetically-predicted risk for RA was calculated using summary statistics for RA-related SNPs that were fed into a weighted GRS using the following equation:

Weightedgeneticriskscore(wGRS)=i=1#SNPS(βi×[SNPgenotype]i)

where β is the effect size (log odds-ratio) of the risk allele and the genotype is the number of copies of the risk allele coded as 0,1, or 2. Only independent SNPs (r2<0.10) that passed quality control and reached genome-wide significance (P<5×10−8) were included in the wGRS (Supplementary Table S2). A multivariable logistic regression adjusting for 5 PCs, median age in the EHR, and sex was performed for the pre-specified phenotype analysis and the PheWAS using the wGRS as main predictor. For the 10 pre-specified phenotypes, a Bonferroni-corrected P-value<0.005 (0.05/10 phenotypes) was considered significant. For the global PheWAS, a false discovery rate (FDR)<0.1 was considered statistically significant. Analyses were performed using the PheWAS R package (13).

To replicate PheWAS findings in eMERGE (16) we calculated the wGRS in a set of ~26,000 individuals of European ancestry and performed a logistic regression analysis adjusting for 5 PCs, sex, and median age in the EHR for phenotypes that were significantly associated with the wGRS in the PheWAS. Phecodes in eMERGE include ICD9 and ICD10 codes. A Bonferroni-adjusted replication P-value of 0.05 divided by the number of phenotypes tested in the replication analysis was considered significant (<0.05/ #phenotypes for replication).

To study the relationships between genetic liability for RA and selected phenotypes we performed IVWR meta-analyses (25, 26). Summary statistics for the exposure (RA) and the selected outcomes (Supplementary Table S1) were extracted and the SNPs were aligned to the 1000 Genome Project phase 3. To ensure the selection of independent SNPs from the exposure and each outcome dataset, we selected SNPs that were significantly associated with RA at P<5×10−6 (11) and a linkage disequilibrium (LD)-reduced (r2<0.05) set of SNPs with a minor allele frequency>0.05 were selected as instrumental variables (IVs) to estimate the combined effect of these SNPs in the IVWR meta-analysis.

As a sensitivity analysis, we performed weighted median regression since this approach, while less powered than the IVWR approach, provides better estimates of the true effect size when less than 50% of the IVs are not valid (25, 26). In addition, we also tested for unbalanced horizontal pleiotropy using MR-Egger regression (27). The analyses were performed using the Two-Sample Mendelian Randomization R-package and a P-value<0.004 (0.05/12 outcomes) was considered significant. P-value<0.05 for the intercept estimate in the Egger regression was considered significant and indicates the presence of horizontal pleiotropy.

Because the major histocompatibility complex (MHC) region is one of the most polymorphic regions in the human genome and is characterized by a complex LD structure and long-range haplotypes, identifying causal variants is challenging (28). Thus, we performed a sensitivity analysis excluding SNPs in this region (chr.6: from 25–35 Mb, GRCh37). All associations were expressed as odds ratio (OR) and 95% confidence interval (95% CI).

To replicate significant findings in the IVWR analysis, we extracted summary statistics from the UK Biobank. Low confidence SNPs (e.g. minor allele count < 25 or MAF<0.001 in cases) were filtered out and an LD-reduced set of SNPs (as described above) were selected as IVs. A P-value<0.05 was considered significant for replication.

RESULTS

GRS analysis

Ninety five of the 102 RA-associated SNPs passed quality control and were included in the wGRS calculation. The wGRS was calculated in 15,939 (53.4%) women and 13,922 (46.6%) men of European ancestry with genotype information and clinical data available in BioVU. The median (interquartile range) for the GRS was 0.057 (0.054, 0.060) and median age was 59 years (48, 69).

Among the 10 preselected phenotypes, the wGRS for RA was positively associated with T1D (OR [95%CI] =1.10 [1.04, 1.16], P=9.82×10−4) and negatively associated with MS (0.82 [0.77, 0.88], P=1.73×10−8) (Table 1, Figure 1). None of the other preselected phenotypes were significantly associated with the RA wGRS.

Table 1:

Association of the weighted genetic risk score for RA with cardiometabolic and autoimmune phenotypes

Phenotype phecode # cases # controls OR [95%CI] P-value
Atrial fibrillation 427.21 5195 14453 0.98 [0.94, 1.02] 0.249
Ischemic stroke 433.21 1399 23097 1.02 [0.97, 1.08] 0.392
Coronary atherosclerosis 411.4 8720 16746 0.99 [0.96, 1.02] 0.489
Type 2 diabetes 250.2 6309 19612 1.02 [0.98, 1.05] 0.252
Essential hypertension 401.1 16488 11767 0.98 [0.95, 1.01] 0.131
Chronic renal failure 585.3 3486 21257 1.01 [0.97, 1.05] 0.580
Hyperlipidemia 272.13 587.6 15039 1.00 [0.97, 1.03] 0.988
Obesity 278.1 4438 22965 0.99 [0.96, 1.02] 0.621
Type 1 diabetes 250.1 1456 19571 1.10 [1.04, 1.16] 9.82×10−4
Multiple sclerosis 335 1009 21542 0.82 [0.77, 0.88] 1.73×10−8

Figure 1: Clinical diagnoses associated with a weighted genetic risk score (wGRS) for rheumatoid arthritis (RA).

Figure 1:

Green filled triangles and black filled dots in the volcano plot represent significant and non-significant associations at false discovery rate (FDR)< 0.1 in BioVU, respectively. Table in the right provides the clinical labels and results for significant associations in BioVU and replication in eMERGE. * Less than 100 cases were available for this phenotype.

In addition to the findings for T1D and MS, the RA wGRS was associated with increased risk of several autoimmune related disorders including RA (P=1.54×10−27), T1D retinopathy (P=1.56×10−6), phenotypes related to hypothyroidism (P≤3.92×10−6), autoimmune thyroiditis related phenotypes (P≤7.27×10−5), and systemic sclerosis (P=4.33×10−4) in the PheWAS. In addition to the decreased risk of MS, there was also an inverse association with the risk of demyelinating disorders (0.83 [0.77, 0.91], P=4.93×10−5) (Figure 1, Supplementary Table S3). When the RA-associated SNP in the MHC region was excluded from the analysis, the inverse associations between the wGRS and MS and demyelinating disorders and the positive associations with T1D were no longer significant (Supplementary Table S4). Several other immune and non-immune phenotypes, such as psoriasis, infections, cancers, and arrhythmias were nominally associated with the wGRS for RA (P<0.05).

Replication of the significant results in the PheWAS showed that the wGRS for RA remained significantly associated with RA-related phenotypes and with hypothyroidism-related phenotypes in eMERGE (Figure 1) at a P-value threshold of <0.0036 (0.05/14 phenotypes), but the association with MS was nominal (P-value=0.02).

IVWR meta-analyses

A genetic predisposition toward RA was consistently associated with T1D (Figure 2) in the IVWR, weighted median, and Egger analysis (Supplementary Table S5), with evidence for horizontal pleiotropy (y-intercept was significantly different from zero in the Egger regression, P=0.001) and likely driven by rs2476601 (Table 2, Supplementary Figure S1).

Figure 2: IVWR meta-analysis for RA risk alleles.

Figure 2:

#SNPs indicates the number of SNPs that were selected for the meta-analysis, OR [95%CI]: odds ratio [95% confidence interval]. AF: atrial fibrillation, IS: ischemic stroke, CAD: coronary artery disease, T2D: type 2 diabetes, SBP: systolic blood pressure, CKD: chronic kidney disease, LDL: low density lipoprotein-cholesterol, WC: waist circumference, T1D: type 1 diabetes, MS: multiple sclerosis, CRP: C-reactive protein, IL6: Interleukin 6:

Table 2:

Genetic association analysis between RA and IVWR phenotypes

Phenotypes # SNPs IVWR Egger regression


OR [95%CI] P-value OR [95%CI] P-value I.E. P-valuea


T1D
 All RA SNPsb * * * * * * *
 Excluding MHC SNPs 99 1.67 [1.47, 1.90] 1.15×10−14 2.61 [1.96, 3.49] 6.92×10−11 −0.056 0.001
 Excluding rs2476601 98 1.47 [1.29, 1.68] 9.53×10−9 1.50 [0.90, 2.48] 0.117 −0.002 0.939

T2D
 All RA SNPsb 230 1.03 [1.02, 1.04] 1.11×10−9 1.04 [1.03, 1.06] 6.94×10−8 −0.003 0.055
 Excluding MHC SNPs 136 1.03 [1.00, 1.05] 0.020 1.04 [0.98, 1.09] 0.170 −0.001 0.612

CRP
 All RA SNPsb 226 1.02 [1.01, 1.03] 1.72×10−6 1.02 [1.01, 1.04] 0.001 −0.001 0.566
 Excluding MHC SNPs 136 1.02 [1.00, 1.04] 0.033 1.01 [0.97, 1.06] 0.620 0.001 0.753

CAD
 All RA SNPsba 230 1.02 [1.01, 1.03] 3.83×10−4 1.02 [1.00, 1.04] 0.031 0.000 0.924
 Excluding MHC SNPs 136 1.01 [0.99, 1.04] 0.258 1.01 [0.95, 1.08] 0.706 0.000 0.979

RA: rheumatoid arthritis; T1D: Type 1 diabetes; T2D: Type 2 diabetes; CRP: C-reactive protein; CAD: coronary artery disease; SNP: single nucleotide polymorphism; IVWR: Inverse weighted variance regression; OR: odds ratio; 95%CI: 95% confidence interval. I.E.: Intercept estimate for the Egger regression.

a

P-value for the intercept.

b

Includes independent SNPs (R2<0.05) that were previously associated with rheumatoid arthritis (RA) at P <5×10−6.

*

Summary statistics for T1D already excluded the MHC region.

T2D, CRP, and CAD were significantly associated with RA in the IVWR (Figure 1, Table 2); but discrepancies were observed in the weighted median approach for CAD (Supplementary Table S5), suggesting the association observed with CAD in the IVWR is less robust compared to T2D and CRP. A nominal association was found with SBP and CKD, with evidence of horizontal pleiotropy for CKD (P<0.001 for the y-intercept in the Egger regression, Supplementary Table S5).

To examine the possibility that SNPs in the MHC region were responsible for the associations observed with T2D, CRP and CAD, we excluded this region from the analyses and none of the associations remained significant (Table 2, Supplementary Table S7). The MHC region was not included in the summary statistics for T1D thus an analysis excluding this region was not necessary. For T1D, ninety-nine independent SNPs were included in the IVWR analysis and 22 of them were associated with RA and T1D at P<5×10-5. Twenty of these 22 SNPs had the same direction of effect in RA and T1D (Supplementary Table S6) and the most significant SNP associated with both phenotypes was rs2476601, a SNP in PTPN22. The other concordant SNPs mapped to SH2B3, IL2RA, CTLA4, PTPN2, TYK2, CD226, CD28, and AFF3 among other loci (Supplementary table S6). To assess if rs2476601 was responsible for the horizontal pleiotropy observed for T1D, we excluded this SNP from the IVWR analysis, and the association remained significant with no evidence of horizontal pleiotropy in the Egger regression (Table 2).

To validate these findings, we used summary statistics from UK Biobank for self-reported T1D, self-reported T2D, and CRP. As a proxy for CAD, we used summary statistics from self-reported myocardial infarction /heart attack. In T1D, a similar pattern of associations was observed with evidence of horizontal pleiotropy when the MHC region and rs2476601 were included in the analysis (Supplementary Table S8). CAD, T2D and CRP showed a smaller but still significant associations with RA (Supplementary Table S8), with evidence of horizontal pleiotropy for CRP (P=0.02 for the y-intercept in Egger regression, Supplementary Table S8). All the associations were attenuated when the MHC region was excluded from the analysis (Supplementary Table S8).

DISCUSSION

The increased prevalence of several autoimmune and cardiometabolic conditions in patients with RA has raised the possibility of shared genetic predisposition. However, little work has addressed this possibility. We found evidence for pleiotropy between SNPs contributing to a genetic predisposition for RA and other autoimmune disorders including T1D.

The wGRS for RA was associated with increased risk for T1D and related phenotypes, and in the IVWR analysis genetic predisposition to RA was also associated with increased risk of T1D, a finding independent of the PTPN22 variant. Genetic association between RA and T1D has been suggested previously. For example, a previous study comparing underlying genetic susceptibility of T1D and 15 other immune disorders used a variant set enrichment method to show that RA was strongly correlated with T1D (29). In the largest meta-analysis in RA, more than two thirds of non-MHC RA risk loci showed pleiotropic effects across several disorders, including T1D (11), and seven loci were significantly associated with both diseases (AFF3, CTLA4-CD28, BACH2, RASGRP1, PTPN2, TYK, and PTNP22) in a fine-mapping study of RA and T1D (30). We observed that SNPs in several of these loci were not only significantly associated with T1D and RA, but the direction of effect was consistent in both phenotypes.

The genetic association between RA and T1D was independent of rs2476601, a missense variant in PTPN22. Although PTPN22 is the most influential non-MHC gene for autoimmunity (with OR of ~1.7 and 2.0 for RA and T1D, respectively) (31), our findings suggest that the genetic association observed between RA and the increased risk of T1D results from the aggregate effect of the other RA risk alleles.

The wGRS for RA was also associated with a decreased risk of MS and this association disappeared when the single MHC region SNP in the wGRS was excluded. A similar trend was observed in the IVWR analysis where genetic predisposition to RA was nominally associated with a decreased risk for MS, and the exclusion of MHC region attenuated the association. However, the protective signal for MS was only nominally significant in the eMERGE replication set. Previous clinical and population studies have suggested an inverse relationship between RA and MS (10), and genetic studies have found that some risk alleles are significantly associated with both diseases, several with effects in the opposite direction so that the allele that increased risk for RA decreased risk for MS and vice versa (32, 33). The same risk alleles can have opposite directional effects in different autoimmune diseases; for example, a risk allele in CD40 increases the risk of MS and inflammatory bowel disease but decreases risk of RA, lupus, autoimmune thyroiditis and Kawasaki disease (34). Thus, we cannot exclude that genetic predisposition for RA protects against for MS, a finding that needs further exploration.

To our knowledge, this is the first study that explores the potential genetic pleiotropy of RA using a PheWAS with a wGRS for RA. In the PheWAS, we found that the wGRS for RA was associated with several autoimmune disorders besides T1D and MS including RA, T1D-related diagnoses, and autoimmune thyroiditis-related phenotypes. However, only the association with RA-related phenotypes and hypothyroidism were significant in the replication analysis. Interestingly, when the HLA SNP was excluded from the wGRS there were additional significant associations with other autoimmune disorders such as lupus-related phenotypes (nephritis and proteinuria), biliary cirrhosis, systemic sclerosis, and polyarteritis nodosa, among others.

The finding of an association between the non-HLA RA wGRS and other autoimmune disorders is not surprising. Genome-wide genotyping studies have found several risk loci that are shared among autoimmune disorders (35).For example, several non-MHC RA risk alleles have shown pleiotropic effects in several disorders, including T1D, MS, and lupus (11), and a cross phenotype meta-analysis of 7 common autoimmune diseases (RA, T1D, MS, lupus, psoriasis, Crohn’s and celiac disease) found that approximately 44% of disease-associated SNPs were shared by some of these disorders (36). Similarly, a study that explored interconnections across multiple diseases in a large EHR system found that the strongest disease network (indicated by the highest number of shared SNPs) was composed of RA, T1D, MS and psoriasis (37). The SNPs linking these conditions mapped to 18 loci in the MHC region and one in PTPN22.

The association between the wGRS for RA and T2D was not significant; however, genetic liability for RA was associated with increased risk of T2D in the IVWR analysis, probably through variants in the MHC region since its exclusion from the analysis attenuated the association. The differing results in the wGRS and the IVWR analyses is likely explained by the number of SNPs from the MHC region included in each approach; the wGRS includes one variant whereas the IVWR includes 94 independent MHC SNPs. Even though RA and T2D share several risk alleles (38), previous genetic studies using GWAS summary statistics have not found a genetic correlation between RA and T2D (11, 39), likely because the MHC region was omitted in these studies. Thus, it is possible that variants in the MHC region may explain the shared association observed in the IVWR analysis. Indeed, GWAS studies identified several MHC SNPs associated with T2D (40), but the biological mechanisms through which they affect the risk of T2D have not been defined. Also, a recent PheWAS using high resolution typing of 33 human leukocyte antigen (HLA) genes in Japanese individuals found that MHC variability was associated with several cardiometabolic phenotypes including T2D (28), while a study in people of European ancestry found a nominal association (P>1×10−5) between variation in HLA class II genes (HLA-DRB1, HLA-DQB1 and HLA-DPA1) and T2D complications (41).

Similar to the findings with T2D, we found no association between the wGRS for RA and CAD; however, genetic liability for RA was associated with increased risk of CAD only when the MHC region was included in the IVWR analysis. While RA and CAD share several risk alleles (38) that may affect common disease pathways (e.g. NF-kB signaling (42)), genetic pleiotropy between both phenotypes has not been reported in previous studies that excluded MHC genes (11, 39). Since genetic variability in MHC region, particularly in the HLA-DRB1 gene is associated with susceptibility for RA (43) and with CAD in studies that performed direct HLA genotyping (44, 45), it is possible that HLA haplotypes known to affect immune reactions may influence the risk of CAD by affecting presentation of oxidized LDL (46). Nevertheless, discrepancies observed on the weighted median analysis compared to the IVWR and Egger analysis suggest that the genetic association between RA and CAD is not robust and bias needs to be rule out.

We found a positive association between RA and CRP in the IVWR which was driven by variability in the MHC region. Because HLA-DQA1 and HLA-DRB1 play a major role in T cell activation for cytokine production, variation in both genes have been associated with susceptibility for RA (47) and with CRP levels (44, 48). A previous study found no association between genetically-determined variance in CRP and RA (48) using 52 SNPs of which only one was in the MHC region. Although our findings suggest that common susceptibility genes in the MHC region are responsible for the genetic association observed between RA and CRP, this needs to be studied further. In contrast to CRP, we found no genetic association between RA and IL6. All of these associations were replicated in the UK Biobank.

In the IVWR, we found nominal associations between genetic liability for RA and two cardiometabolic phenotype (SBP and CKD) when the MHC region was included. Clinical and animal studies have identified a role for the immune system in blood pressure regulation (49) and there is evidence that SBP shares risk alleles with RA (38). A previous genetic study that included the MHC region did not find genetic overlap between blood pressure and RA (50), and a PheWAS of HLA variants in ~37,000 Caucasians did not find any significant association with any blood pressure-related phenotypes (41). However, a recent PheWAS of 33 HLA genes in ~166,000 Japanese individuals reported a significant association between HLA-B gene and SBP (28). Thus, it is possible that SNPs in the MHC region are responsible for the nominal genetic association observed between RA and SBP through pleiotropic alleles or through the immune cell activation and inflammation that leads to endothelial dysfunction and hypertension (51). As for the nominal inverse association between CKD and genetic liability to RA, genes in the MHC region have been associated with both increased and decreased risk of kidney disorders (52)

Although low LDL-C levels and increased prevalence of central obesity, AF, and ischemic stroke have been associated with RA in epidemiological studies, the genetic association between these disorders and RA has not been well defined. Okada extracted 311 phenotypes from the NHGRI catalog (including LDL-C, obesity, AF, and stroke) to assess region-based pleiotropy for non-HLA RA risk SNPs and none these traits were associated with RA (11). Others have also found no genetic correlation between RA and cardiometabolic phenotypes such as BMI and LDL (53), but a study of 567 patients with RA and 979 controls found that a GRS for RA that excluded the HLA region was associated with decreased LDL-C concentrations (54). Although inflammation is an important component of these cardiometabolic traits and many inflammatory genes have been associated with susceptibility to some of these traits (55, 56), we did not find significant genetic pleiotropy between RA and these phenotypes. An alternative explanation is that it is not a genetic predisposition toward RA, but development of the inflammatory milieu that results from disease which causes the associations with these phenotypes (11,36,37,57).

The study has some limitations. First, because we studied a large cohort of patients of European ancestry from a tertiary care hospital, our findings may not be generalizable to other populations. Second, because PheWAS uses billing codes to assemble phenotypes, the quality of the case-control definition varies across phenotypes and can lead to misclassification (58). However, many of our findings are consistent with what is known about the risk of RA and other related disorders, suggesting the approach was robust to such misclassification. Nevertheless, further replication will be needed to generalize our findings. Third, for the IVWR analyses we extracted summary statistics already available for RA and the selected outcomes, which limited the number of SNPs included in some analyses. For example, while the summary statistics for RA and some of the outcomes were generated using imputed data from GWAS arrays, summary statistics for T1D and MS were generated from non-imputed data from the ImmunoChip, and LDL from non-imputed data from the Metabochip. Thus, we cannot exclude the possibility that the reduced number of SNPs available for some outcomes may have affected our ability to identify shared genetic predisposition. Fourth, although the IVWR analysis suggested the MHC RA risk alleles are associated with increased the risk of T2D and CAD, we cannot exclude the possibility that these risk alleles regulate an unobserved trait that causally influences the risk of these cardiometabolic phenotypes.

Summary conclusions:

In summary, we found that the wGRS for RA was associated with an increased risk of T1D as well as with susceptibility to other autoimmune disorders such as autoimmune thyroiditis and systemic sclerosis. The IVWR meta-analysis showed that genetic predisposition to RA is associated with an increased the risk of T1D, independent of PTPN22. These findings suggest pleiotropy between genetic predisposition for RA and other autoimmune disorders including T1D.

Supplementary Material

sup info

Acknowledgments:

See supplementary text.

Financial support: The study was supported by American Heart Association (AHA) grant 18SFRN34230089. The dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center’s BioVU which is supported by numerous sources: institutional funding, private agencies, and federal grants. These include the NIH funded Shared Instrumentation Grant S10RR025141; and CTSA grants UL1TR002243, UL1TR000445, and UL1RR024975. Genomic data are also supported by investigator-led projects that include U01HG004798, R01NS032830, RC2GM092618, P50GM115305, U01HG006378, U19HL065962, R01HD074711; and additional funding sources listed at https://victr.vanderbilt.edu/pub/biovu/. VKK was supported by NIH/NIGMS K23GM117395 and the Arthritis National Research Foundation – All Arthritis Grant Program Award. CMS is funded by R35GM131770. JDM is funded by the AHA 16FTF30130005 and R01GM130791. QF is funded by R01GM120523, CPC by R01AR073764, R01GM126535 and the Veterans Health Administration Merit Award 1IO1CX001741, and S.H. by R01GM114128. The eMERGE Network was initiated and funded by NHGRI through the following grants for phase II: U01HG006828 (Cincinnati Children’s Hospital Medical Center/Boston Children’s Hospital); U01HG006830 (Children’s Hospital of Philadelphia); U01HG006389 (Essentia Institute of Rural Health, Marshfield Clinic Research Foundation and Pennsylvania State University); U01HG006382 (Geisinger Clinic); U01HG006375 (Group Health Cooperative/University of Washington); U01HG006379 (Mayo Clinic); U01HG006380 (Icahn School of Medicine at Mount Sinai); U01HG006388 (Northwestern University); U01HG006378 (Vanderbilt University Medical Center); U01HG006385 (Vanderbilt University Medical Center serving as the Coordinating Center); U01HG004438 (CIDR) and U01HG004424 (the Broad Institute) serving as Genotyping Centers. For phase I: U01-HG-004610 (Group Health Cooperative/University of Washington); U01-HG-004608 (Marshfield Clinic Research Foundation and Vanderbilt University Medical Center); U01-HG-04599 (Mayo Clinic); U01HG004609 (Northwestern University); U01-HG-04603 (Vanderbilt University Medical Center, also serving as the Administrative Coordinating Center); U01HG004438 (CIDR) and U01HG004424 (the Broad Institute) serving as Genotyping Centers.

Footnotes

Disclosure statements: The authors have nothing to disclose

References

  • 1.Hayter SM, Cook MC. Updated assessment of the prevalence, spectrum and case definition of autoimmune disease. Autoimmun Rev. 2012;11(10):754–65. [DOI] [PubMed] [Google Scholar]
  • 2.Chung CP, Oeser A, Raggi P, Gebretsadik T, Shintani AK, Sokka T, et al. Increased coronary-artery atherosclerosis in rheumatoid arthritis: relationship to disease duration and cardiovascular risk factors. Arthritis Rheum. 2005;52(10):3045–53. [DOI] [PubMed] [Google Scholar]
  • 3.Wiseman SJ, Ralston SH, Wardlaw JM. Cerebrovascular Disease in Rheumatic Diseases: A Systematic Review and Meta-Analysis. Stroke. 2016;47(4):943–50. [DOI] [PubMed] [Google Scholar]
  • 4.Ungprasert P, Srivali N, Kittanamongkolchai W. Risk of incident atrial fibrillation in patients with rheumatoid arthritis: a systematic review and meta-analysis. Int J Rheum Dis. 2017;20(4):434–41. [DOI] [PubMed] [Google Scholar]
  • 5.Liao KP, Gunnarsson M, Kallberg H, Ding B, Plenge RM, Padyukov L, et al. Specific association of type 1 diabetes mellitus with anti-cyclic citrullinated peptide-positive rheumatoid arthritis. Arthritis Rheum. 2009;60(3):653–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lu MC, Yan ST, Yin WY, Koo M, Lai NS. Risk of rheumatoid arthritis in patients with type 2 diabetes: a nationwide population-based case-control study. PLoS One. 2014;9(7):e101528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hickson LJ, Crowson CS, Gabriel SE, McCarthy JT, Matteson EL. Development of reduced kidney function in rheumatoid arthritis. Am J Kidney Dis. 2014;63(2):206–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Chung CP, Oeser A, Solus JF, Avalos I, Gebretsadik T, Shintani A, et al. Prevalence of the metabolic syndrome is increased in rheumatoid arthritis and is associated with coronary atherosclerosis. Atherosclerosis. 2008;196(2):756–63. [DOI] [PubMed] [Google Scholar]
  • 9.Myasoedova E, Crowson CS, Kremers HM, Roger VL, Fitz-Gibbon PD, Therneau TM, et al. Lipid paradox in rheumatoid arthritis: the impact of serum lipid measures and systemic inflammation on the risk of cardiovascular disease. Ann Rheum Dis. 2011;70(3):482–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Somers EC, Thomas SL, Smeeth L, Hall AJ. Autoimmune diseases co-occurring within individuals and within families: a systematic review. Epidemiology. 2006;17(2):202–17. [DOI] [PubMed] [Google Scholar]
  • 11.Okada Y, Wu D, Trynka G, Raj T, Terao C, Ikari K, et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature. 2014;506(7488):376–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Andersson C, Lyass A, Larson MG, Robins SJ, Vasan RS. Low-density-lipoprotein cholesterol concentrations and risk of incident diabetes: epidemiological and genetic insights from the Framingham Heart Study. Diabetologia. 2015;58(12):2774–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Carroll RJ, Bastarache L, Denny JC. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics. 2014;30(16):2375–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Salem JE, Shoemaker MB, Bastarache L, Shaffer CM, Glazer AM, Kroncke B, et al. Association of Thyroid Function Genetic Predictors With Atrial Fibrillation: A Phenome-Wide Association Study and Inverse-Variance Weighted Average Meta-analysis. JAMA Cardiol. 2019;4(2):136–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Roden DM, Pulley JM, Basford MA, Bernard GR, Clayton EW, Balser JR, et al. Development of a large-scale de-identified DNA biobank to enable personalized medicine. Clin Pharmacol Ther. 2008;84(3):362–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.McCarty CA, Chisholm RL, Chute CG, Kullo IJ, Jarvik GP, Larson EB, et al. The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Med Genomics. 2011;4:13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zuvich RL, Armstrong LL, Bielinski SJ, Bradford Y, Carlson CS, Crawford DC, et al. Pitfalls of merging GWAS data: lessons learned in the eMERGE network and quality control procedures to maintain high data quality. Genet Epidemiol. 2011;35(8):887–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Stanaway IB, Hall TO, Rosenthal EA, Palmer M, Naranbhai V, Knevel R, et al. The eMERGE genotype set of 83,717 subjects imputed to ~40 million variants genome wide and association with the herpes zoster medical record phenotype. Genet Epidemiol. 2019;43(1):63–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Delaneau O, Zagury JF, Marchini J. Improved whole-chromosome phasing for disease and population genetic studies. Nat Methods. 2013;10(1):5–6. [DOI] [PubMed] [Google Scholar]
  • 22.Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis GR. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet. 2012;44(8):955–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Das S, Forer L, Schonherr S, Sidore C, Locke AE, Kwong A, et al. Next-generation genotype imputation service and methods. Nat Genet. 2016;48(10):1284–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Zheng X, Levine D, Shen J, Gogarten SM, Laurie C, Weir BS. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics. 2012;28(24):3326–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genet Epidemiol. 2016;40(4):304–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Burgess S, Bowden J, Fall T, Ingelsson E, Thompson SG. Sensitivity Analyses for Robust Causal Inference from Mendelian Randomization Analyses with Multiple Genetic Variants. Epidemiology. 2017;28(1):30–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hirata J, Hosomichi K, Sakaue S, Kanai M, Nakaoka H, Ishigaki K, et al. Genetic and phenotypic landscape of the major histocompatibilty complex region in the Japanese population. Nat Genet. 2019;51(3):470–80. [DOI] [PubMed] [Google Scholar]
  • 29.Onengut-Gumuscu S, Chen WM, Burren O, Cooper NJ, Quinlan AR, Mychaleckyj JC, et al. Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers. Nat Genet. 2015;47(4):381–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Westra HJ, Martinez-Bonet M, Onengut-Gumuscu S, Lee A, Luo Y, Teslovich N, et al. Fine-mapping and functional studies highlight potential causal variants for rheumatoid arthritis and type 1 diabetes. Nat Genet. 2018;50(10):1366–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mustelin T, Bottini N, Stanford SM. The Contribution of PTPN22 to Rheumatic Disease. Arthritis Rheumatol. 2019;71(4):486–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Sirota M, Schaub MA, Batzoglou S, Robinson WH, Butte AJ. Autoimmune disease classification by inverse association with SNP alleles. PLoS Genet. 2009;5(12):e1000792. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Restrepo NA, Butkiewicz M, McGrath JA, Crawford DC. Shared Genetic Etiology of Autoimmune Diseases in Patients from a Biorepository Linked to De-identified Electronic Health Records. Front Genet. 2016;7:185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Smets I, Fiddes B, Garcia-Perez JE, He D, Mallants K, Liao W, et al. Multiple sclerosis risk variants alter expression of co-stimulatory genes in B cells. Brain. 2018;141(3):786–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Yamamoto K, Okada Y. Shared genetic factors and their causality in autoimmune diseases. Ann Rheum Dis. 2019;78(11):1449–51. [DOI] [PubMed] [Google Scholar]
  • 36.Cotsapas C, Voight BF, Rossin E, Lage K, Neale BM, Wallace C, et al. Pervasive sharing of genetic effects in autoimmune disease. PLoS Genet. 2011;7(8):e1002254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Verma A, Bang L, Miller JE, Zhang Y, Lee MTM, Zhang Y, et al. Human-Disease Phenotype Map Derived from PheWAS across 38,682 Individuals. Am J Hum Genet. 2019;104(1):55–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Torkamani A, Topol EJ, Schork NJ. Pathway analysis of seven common diseases assessed by genome-wide association. Genomics. 2008;92(5):265–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Pickrell JK, Berisa T, Liu JZ, Segurel L, Tung JY, Hinds DA. Detection and interpretation of shared genetic influences on 42 human traits. Nat Genet. 2016;48(7):709–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Replication DIG, Meta-analysis C, Asian Genetic Epidemiology Network Type 2 Diabetes C, South Asian Type 2 Diabetes C, Mexican American Type 2 Diabetes C, Type 2 Diabetes Genetic Exploration by Nex-generation sequencing in muylti-Ethnic Samples C, et al. Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat Genet. 2014;46(3):234–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Karnes JH, Bastarache L, Shaffer CM, Gaudieri S, Xu Y, Glazer AM, et al. Phenome-wide scanning identifies multiple diseases and disease severity phenotypes associated with HLA variants. Sci Transl Med. 2017;9(389). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Karczewski KJ, Dudley JT, Kukurba KR, Chen R, Butte AJ, Montgomery SB, et al. Systematic functional regulatory assessment of disease-associated variants. Proc Natl Acad Sci U S A. 2013;110(23):9607–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bowes J, Barton A. Recent advances in the genetics of RA susceptibility. Rheumatology (Oxford). 2008;47(4):399–402. [DOI] [PubMed] [Google Scholar]
  • 44.Paakkanen R, Lokki ML, Seppanen M, Tierala I, Nieminen MS, Sinisalo J. Proinflammatory HLA-DRB1*01-haplotype predisposes to ST-elevation myocardial infarction. Atherosclerosis. 2012;221(2):461–6. [DOI] [PubMed] [Google Scholar]
  • 45.Bjorkbacka H, Lavant EH, Fredrikson GN, Melander O, Berglund G, Carlson JA, et al. Weak associations between human leucocyte antigen genotype and acute myocardial infarction. J Intern Med. 2010;268(1):50–8. [DOI] [PubMed] [Google Scholar]
  • 46.McPherson R, Davies RW. Inflammation and coronary artery disease: insights from genetic studies. Can J Cardiol. 2012;28(6):662–6. [DOI] [PubMed] [Google Scholar]
  • 47.Raychaudhuri S, Sandor C, Stahl EA, Freudenberg J, Lee HS, Jia X, et al. Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis. Nat Genet. 2012;44(3):291–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Ligthart S, Vaez A, Vosa U, Stathopoulou MG, de Vries PS, Prins BP, et al. Genome Analyses of >200,000 Individuals Identify 58 Loci for Chronic Inflammation and Highlight Pathways that Link Inflammation and Complex Disorders. Am J Hum Genet. 2018;103(5):691–706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Norlander AE, Madhur MS, Harrison DG. The immunology of hypertension. J Exp Med. 2018;215(1):21–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Andreassen OA, McEvoy LK, Thompson WK, Wang Y, Reppe S, Schork AJ, et al. Identifying common genetic variants in blood pressure due to polygenic pleiotropy with associated phenotypes. Hypertension. 2014;63(4):819–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Zhu F, Sun Y, Wang M, Ma S, Chen X, Cao A, et al. Correlation between HLA-DRB1, HLA-DQB1 polymorphism and autoantibodies against angiotensin AT(1) receptors in Chinese patients with essential hypertension. Clin Cardiol. 2011;34(5):302–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Dai CS, Chu CC, Chen SF, Sun CY, Lin M, Lee CC. Association between human leucocyte antigen subtypes and risk of end stage renal disease in Taiwanese: a retrospective study. BMC Nephrol. 2015;16:177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Liao KP, Diogo D, Cui J, Cai T, Okada Y, Gainer VS, et al. Association between low density lipoprotein and rheumatoid arthritis genetic factors with low density lipoprotein levels in rheumatoid arthritis and non-rheumatoid arthritis controls. Ann Rheum Dis. 2014;73(6):1170–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Fornage M, Chiang YA, O’Meara ES, Psaty BM, Reiner AP, Siscovick DS, et al. Biomarkers of Inflammation and MRI-Defined Small Vessel Disease of the Brain: The Cardiovascular Health Study. Stroke. 2008;39(7):1952–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Okada R, Wakai K, Naito M, Morita E, Kawai S, Hamajima N, et al. Pro-/anti-inflammatory cytokine gene polymorphisms and chronic kidney disease: a cross-sectional study. BMC Nephrol. 2012;13:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Zhernakova A, Withoff S, Wijmenga C. Clinical implications of shared genetics and pathogenesis in autoimmune diseases. Nat Rev Endocrinol. 2013;9(11):646–59. [DOI] [PubMed] [Google Scholar]
  • 58.Wang L, Damrauer SM, Zhang H, Zhang AX, Xiao R, Moore JH, et al. Phenotype validation in electronic health records based genetic association studies. Genet Epidemiol. 2017;41(8):790–800. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sup info

RESOURCES