Abstract
Chronic kidney disease (CKD) is responsible for a public health burden with multi-systemic complications. Through transancestry meta-analysis of genome-wide association studies of estimated glomerular filtration rate (eGFR) and independent replication (n = 1,046,070), we identified 264 associated loci (166 new). Of these, 147 were likely to be relevant for kidney function on the basis of associations with the alternative kidney function marker blood urea nitrogen (n = 416,178). Pathway and enrichment analyses, including mouse models with renal phenotypes, support the kidney as the main target organ. A genetic risk score for lower eGFR was associated with clinically diagnosed CKD in 452,264 independent individuals. Colocalization analyses of associations with eGFR among 783,978 European-ancestry individuals and gene expression across 46 human tissues, including tubulo-interstitial and glomerular kidney compartments, identified 17 genes differentially expressed in kidney. Fine-mapping highlighted missense driver variants in 11 genes and kidney-specific regulatory variants. These results provide a comprehensive priority list of molecular targets for translational research.
CKD is a major public health issue, with increasing incidence and prevalence worldwide1. Its associated burden of disease encompasses metabolic disturbances, end-stage kidney disease and multi-systemic complications such as cardiovascular disease1–4. CKD is a leading cause of death5 and has shown one of the highest increases in disease-attributable mortality over the last decade2. Nevertheless, public and clinical awareness remain low3. Moreover, clinical trials in nephrology are still under-represented6, which has resulted in a scarcity of therapeutic options to alter disease progression and high costs for health systems7. A major barrier to developing new therapeutics is the limited understanding of the mechanisms underlying kidney function in health and disease, with the consequent lack of therapeutic targets.
Genome-wide association studies (GWAS) and exome-chip studies of the glomerular filtration rate estimated from serum creatinine (eGFR), the main biomarker to quantify kidney function and define CKD, have identified nearly 100 eGFR-associated genetic loci8 in samples of European9–15, Asian16–19 and multiple20 ancestry. However, similarly to other complex traits and diseases, identifying causal genes and molecular mechanisms implicated by genetic associations is challenging and has only been successful for a few kidney-function-associated loci21,22. Advanced statistical fine-mapping approaches and newly emerging multi-tissue gene expression data provide new opportunities for prioritizing putative causal variants, effector genes and target tissues from the results of large-scale GWAS meta-analyses.
We therefore conducted a trans-ancestry GWAS meta-analysis in the CKD Genetics (CKDGen) Consortium (n = 765,348) and replicated findings in the Million Veteran Program (MVP; n = 280,722)23, for a combined sample size of greater than 1 million participants. The first aim of this study was to identify new globally important loci for kidney function through maximizing statistical power (Supplementary Fig 1). Results from GWAS of the complementary kidney function marker blood urea nitrogen (BUN; n = 416,178) were used to prioritize the eGFR-associated loci on the basis of those most likely to be relevant for kidney function. A genetic risk score (GRS) for low eGFR was used to test relevance for clinically diagnosed CKD among 452,264 independent individuals. The second aim was to characterize replicated eGFR-associated loci through complementary computational approaches, including various enrichment and network analyses, fine-mapping, and colocalization with gene expression in 46 tissues and protein levels (Supplementary Fig 1). We focused this aim on European-ancestry individuals, as fine-mapping based on summary statistics requires linkage disequilibrium (LD) reference panels whose sample size scales with that of the GWAS24. The resulting list of prioritized variants and genes provides a rich resource of potential therapeutic targets to improve CKD treatment and prevention.
Results
Discovery trans-ancestry meta-analysis.
We performed 121 GWAS encompassing 765,348 individuals of European (n = 567,460), East Asian (n = 165,726), African-American (n = 13,842), South Asian (n = 13,359) and Hispanic (n = 4,961) ancestry (median age, 54 years; 50% female; Supplementary Table 1). The median of the study-specific mean eGFR values was 89 ml min−1 per 1.73 m2 (interquartile range, IQR: 81, 94). GWAS were based on genotypes imputed from Haplotype Reference Consortium25 or 1000 Genomes Project26 reference panels (Methods and Supplementary Table 2). Following study-specific variant filtering and quality-control procedures, we performed a fixed-effects inverse-variance-weighted meta-analysis, finding no evidence of unmodeled population structure (LD score regression intercept = 1.04; genomic control factor λGC = 1.05). After variant filtering, 8,221,591 SNPs were used for downstream analysis (Methods).
We discovered 308 loci containing at least one eGFR-associated SNP at genome-wide significance (Methods), of which 200 were new and 108 contained an index SNP reported by previous GWAS of eGFR (Fig. 1 and Supplementary Table 3). Regional association plots are shown in Supplementary Fig 2. The minor alleles across index SNPs showed both decreasing and increasing effects on eGFR, with larger effects observed for lower-frequency SNPs (Fig. 1, inset). The 308 index SNPs explained 7.1% of the eGFR variance, nearly doubling recent GWAS-based estimates9, and 19.6% of eGFR genetic heritability (h2 = 39%, 95% credible interval = 32%, 47%), estimated in a participating general-population-based pedigree study (Methods and Supplementary Fig 3). The effects of index SNPs were largely homogeneous across studies (Fig. 2a and Supplementary Table 3) and ancestry groups (Supplementary Table 4 and Supplementary Note 1).
Replication and meta-analysis of more than 1 million individuals.
We assessed replication in an independent trans-ancestry GWAS meta-analysis of eGFR performed among 280,722 MVP participants23. Effect estimates, available for 305 of the 308 SNPs, showed almost perfect directional consistency (302/305 SNPs, 99%) and very strong correlation with the discovery results (Fig. 2b). For these 305 SNPs, we performed a meta-analysis of the 1,046,070 discovery and replication samples. Replication was met by 262 SNPs (Fig. 1, Methods and Supplementary Table 3). Of the three SNPs not available in MVP, the index SNPs at SHROOM3 (P = 3.5 × 10−120) and SH3YL1 (P = 1.2 × 10−11) were also considered to be replicated on the basis of previous evidence15,27, resulting in a total of 264 replicated SNPs (166 new). Of these, 74 SNPs were genome-wide significant in MVP alone (Supplementary Table 3).
Association of eGFR loci with BUN and CKD.
To evaluate whether associations with creatinine-based eGFR were probably related to kidney function or potentially to creatinine metabolism, we assessed the association of the 264 eGFR-associated index SNPs with BUN, an alternative marker of kidney function that is inversely correlated with eGFR. Trans-ancestry meta-analysis of 65 GWAS for BUN (n = 416,178; Supplementary Table 1) showed no evidence of unmodeled population structure (λGC = 1.03; LD score regression intercept = 0.98) and yielded 111 genome-wide-significant loci (15 known, 96 new; Supplementary Fig 4 and Supplementary Table 5).
Of the 264 replicated eGFR index SNPs, 34 and 146 showed genome-wide-significant and nominally significant (P < 0.05) association with BUN, respectively (Supplementary Table 6). SNP effects were inversely correlated (r=−0.65; Fig. 2c). Relevance to kidney function was classified as ‘likely’ for 147 eGFR index SNPs with inverse, significant associations with BUN (one-sided P < 0.05); ‘inconclusive’ for 102 eGFR index SNPs not associated with BUN (P ≥ 0.05); and ‘unlikely’ for 15 eGFR index SNPs showing concordant, significant association with BUN (one-sided P < 0.05; Supplementary Table 6). This comparative analysis of complementary biomarkers supports the idea that signals at the majority of eGFR-associated loci probably reflect kidney function.
Next, we investigated the effects of the eGFR index SNPs on CKD in CKDGen studies (n = 625,219, including 64,164 CKD cases; Methods). GWAS meta-analysis of CKD identified 23 genome-wide-significant loci, including 17 likely relevant for kidney function (SDCCAG8, LARP4B, DCDC1, WDR72, UMOD-PDILT, MYO19, AQP4, NFATC1, PSD4, HOXD8, NRIP1, SHROOM3, FGF5, SLC34A1, DAB2, UNCX and PRKAG2; Supplementary Table 6). The majority of replicated eGFR index SNPs (224 of 264) were associated with CKD (one-sided P < 0.05; Fig. 1, inset), including 130 likely relevant for kidney function (Supplementary Table 6).
Finally, we tested whether a GRS based on the combined effect of the 147 eGFR index SNPs likely relevant for kidney function was associated with clinically diagnosed CKD and CKD-related outcomes in the UK Biobank (n = 452,264; Methods). A lower GRS, reflecting genetically lower eGFR, was associated with higher odds ratios (ORs) of chronic renal failure, glomerular diseases, acute renal failure and hypertensive diseases (Fig. 2d and Supplementary Fig 5). The OR of chronic renal failure per 10% lower GRS-predicted eGFR was 2.13 (95% CI = 1.90, 2.39; P = 8.1 × 10−38). A significant protective association with urolithiasis may reflect a reduced ability to concentrate urine at lower eGFR.
Genetic correlations of eGFR and BUN with other phenotypes.
We assessed genome-wide genetic correlations (rg) of eGFR associations with each of 748 complex traits and diseases (Methods)28. We observed 37 significant correlations (P < 6.7 × 10−5 = 0.05/748; Supplementary Fig 6 and Supplementary Table 7). After serum creatinine, the largest negative correlations were observed between eGFR and serum citrate (rg = −0.27) and urate (rg=−0.23), followed by anthropometric traits including lean mass and physical fitness (for example, rg=−0.20 for left hand grip strength). While the inverse correlation with muscle-mass-related traits probably reflects higher creatinine generation leading to lower creatinine-based eGFR, the correlations with citrate and urate levels probably reflect reduced filtration function, as does the positive correlation with GFR estimated from cystatin C (rg=0.53).
A very similar pattern of genetic correlations was observed for BUN (Supplementary Table 7), but the genetic correlations with muscle-mass-related traits were generally lower than for eGFR. The largest genetic correlation for BUN was observed with CKD (rg = 0.47), as compared to creatinine-based (rg = −0.29) and cystatin C-based (rg = −0.26) eGFR.
In summary, significant genetic correlations with eGFR reflect the two biological components that govern serum creatinine concentrations: its excretion via the kidney and its generation in muscle. The fact that genetic correlations between BUN and muscle-mass-related traits are generally lower than was observed for eGFR underscores the value of using genetic associations with BUN to help prioritize eGFR-associated loci most likely to be relevant for kidney function.
Functional enrichment and pathway analyses.
To identify molecular mechanisms and tissues of importance for kidney function, we assessed the enrichment of the eGFR and BUN genetic associations by using tissue-specific gene expression, regulatory annotations, and gene sets and pathways (Methods). First, we used eGFR-associated SNPs (P < 5 × 10−8) to explore enriched pathways, tissues and cell types on the basis of gene expression data with DEPICT29. We identified 16 significantly enriched physiological systems, cell types and tissues highlighting several aspects of kidney function, physiology and disease. The strongest enrichment was observed for urogenital and renal physiological systems and tissues (kidney, kidney cortex and urinary tract; false-discovery rate (FDR) < 0.05; Supplementary Fig 7a,b). Pathway and gene set enrichment analysis identified three highly correlated and strongly associated meta gene sets (P < 1 × 10−6, FDR < 0.05), including some relevant to the kidney such as polyuria, dilated renal tubules and expanded mesangial matrix, as well as signaling and transcription, and energy metabolism (Supplementary Fig 7c). Tissue and cell-type enrichment analysis of BUN-associated SNPs associated at P < 5 × 10−8 highlighted a very similar pattern (Supplementary Fig 8) but without enrichment for muscle tissues, further supporting the use of BUN to prioritize the loci most likely to be related to kidney function.
Second, we used stratified LD score regression30 on the genome-wide eGFR and BUN summary statistics to identify cell-type groups with enriched heritability on the basis of data from diverse cell-type-specific functional genomic elements. The strongest enrichment for eGFR was observed for the kidney (13.2-fold), followed by the liver (7.3-fold) and adrenal/pancreas (5.7-fold enrichment; Supplementary Table 8). The kidney was also the most enriched cell-type group for BUN (11.5-fold enrichment; Supplementary Table 8).
Finally, by using a complementary approach, we assessed enrichment of eGFR-associated variants in genes in which disruption results in kidney phenotypes in genetically manipulated mice31. From the Mouse Genome Informatics (MGI) database, we selected all genes for which disruption causes abnormal GFR (n = 24), abnormal kidney physiology (n = 453) or abnormal kidney morphology (n = 764) and interrogated their human orthologs in the eGFR summary statistics (Methods). We identified significant associations in ten genes linked to abnormal GFR in mice (enrichment P = 8.9 × 10−4), 55 linked to abnormal kidney physiology (enrichment P = 1.1 × 10−4) and 96 linked to abnormal kidney morphology (enrichment P = 1.8 × 10−5; Fig. 3 and Methods). Of these, 25 genes represent new eGFR candidate genes in humans; that is, they have not previously been reported to contain genome-wide-significant eGFR-associated SNPs or map near known loci (Supplementary Table 9). The existing mouse models may pave the way for experimental confirmation of these findings.
Fine-mapping and secondary signal analysis in European-ancestry individuals.
Conditional and fine-mapping analyses were restricted to European-ancestry participants, for whom data to construct a large enough LD reference panel were publicly available (Methods). Meta-analysis of 85 European-ancestry CKDGen GWAS identified 256 genome-wide-significant loci (Supplementary Table 10). Replication among 216,518 European-ancestry MVP participants confirmed 228 SNPs, including 227 index SNPs that met replication criteria and the SHROOM3 index SNP (Methods and Supplementary Table 10). Of these 228 SNPs, 221 mapped to one of the 264 replicated loci from the trans-ancestry analysis (≤500 kb up- or downstream of the trans-ancestry index SNP), and the remaining 7 showed P < 3.3 × 10−6 in the trans-ancestry discovery analysis. BUN GWAS meta-analysis of CKDGen European-ancestry studies (n = 243,029) allowed us to classify 122 SNPs as likely relevant for kidney function, 90 as inconclusive and 16 as unlikely (Supplementary Table 10).
To conduct statistical fine-mapping of the 228 eGFR loci, we first performed summary-statistics-based conditional analysis and identified 253 independent genome-wide-significant SNPs (Supplementary Table 11) mapping to 189 regions (Methods). For each independent variant, we computed a 99% credible set32, with a median set size of 26 SNPs (IQR: 6, 60). We observed 58 small credible sets (≤5 SNPs), including 20 single-SNP sets: EDEM3, CACNA1S, HOXD11, CPS1, DAB2, SLC34A1, LINC01512, LARP4B, DCDC1, SLC25A45, SLC6A13, GATM, CGNL1, CYP1A1, NRG4, RPL3L, UMOD-PDILT, SLC47A1 and two independent sets at BCL2L14 (Fig. 4 and Supplementary Table 11). Of the 58 small credible sets, 33 were likely relevant for kidney function and contain genes and SNPs that can now be prioritized for further study (Supplementary Table 11).
Credible set SNPs were annotated with respect to their functional consequence and regulatory potential. Missense SNPs with >50% posterior probability (PP) of driving the association and/or mapping to a small credible set are of particular interest because they directly implicate the affected gene. Such missense SNPs were identified in 11 genes (SLC47A1, RPL3L, SLC25A45, CACNA1S, EDEM3, CPS1, KLHDC7A, PPM1J, CERS2, C9 and SLC22A2; Supplementary Table 12), of which CACNA1S, RPL3L, CERS2 and C9 were likely relevant for kidney function (Fig. 4a). The majority of the 11 variants had a combined annotation-dependent depletion (CADD) score greater than 15, indicating potential deleteriousness33. Several identified genes are plausible biological candidates for driving the association signal (Table 1). For example, the missense p.(Ala465Val) SNP in SLC47A1 (PP > 99%) alters the encoded multidrug and toxin extrusion protein (MATE1), a transport protein responsible for the secretion of cationic drugs, toxins and internal metabolites including creatinine across brush border membranes, including kidney-proximal tubules. The fact that Slc47a1-knockout mice have higher blood levels of both creatinine and BUN34 argues against a sole effect on creatinine transport.
Table 1 |.
CACNA1S | rs3850625 | 1 | 1.00 | p.(Arg1539Cys) (NP_000060.2) | 34.0 | - | Encodes a subunit of the slowly inactivating L-type voltage-dependent calcium channel in skeletal muscle. Reports of altered expression in kidney cancer48 and after indoxyl sulfate treatment49. Rare variants can cause autosomal dominant hypokalemic periodic paralysis, type 1 (MIM 170400) or malignant hyperthermia susceptibility (MIM 601887). Common variation at this locus has been reported as associated with eGFR in previous GWAS10,50. |
CPS1 | rs1047891 | 1 | 1.00 | p.(Thr1406Asn) (NP_001866.2) | 22.1 | - | Encodes a key mitochondrial enzyme of the urea cycle that catalyzes the synthesis of carbamoyl phosphate from ammonia and bicarbonate to remove excess urea. Rare mutations cause autosomal recessive carbamoyl phosphate synthetase I deficiency (MIM 237300). GWAS locus for eGFR13, serum metabolites51 and urinary glycine52, as well as for many other quantitative biomarkers. This variant has been reported to associate with hyperammonemia after valproate therapy53. |
EDEM3 | rs78444298 | 1 | 1.00 | p.(Pro746Ser) (NP_079467.3) | 24.6 | - | The gene product accelerates proteasome-mediated ER-associated degradation of glycoproteins by catalyzing mannose trimming from Man8GlcNAc2 to yield Man7GlcNAc2 on N-glycans. This variant has been identified by a previous exome chip association study with eGFR27. |
KLHDC7A | rs11261022 | 7 | 0.71 | p.(Arg160Ser) (NP_689588.2) | 1.1 | Roadmap + ENCODE, kidney | This gene encodes the Kelch-domain-containing 7A protein and is a paralog of KBTBD11. No specific entry in relation to kidney disease in PubMed. |
RPL3L | rs113956264 | 1 | 1.00 | p.(Val262Met) (NP_005052.1) | 27.2 | - | The gene product has sequence similarity with ribosomal protein L3. It has a tissue-specific expression pattern, with the highest levels in skeletal muscle and heart. |
5LC25A45 | rs34400381 | 1 | 1.00 | p.(Arg285Cys) (NP_001070709.2) | 26.0 | ENCODE, kidney | The encoded protein belongs to the SLC25 family of mitochondrial carrier proteins and is an orphan transporter. This variant has already been identified in a GWAS of symmetric dimethylarginine levels54 and in a whole-genome-sequencing analysis of serum creatinine55. SLC25A45 may have a role in biosynthesis of arginine, which is involved in the synthesis of creatine. |
SLC47A1 | rs111653425 | 1 | 1.00 | p.(Ala465Val) (NP_060712.2) | 24.6 | - | Encodes a multidrug and toxin extrusion protein (MATE1), a transport protein responsible for the secretion of cationic drugs and creatinine across brush border membranes. This variant has already been identified in a whole-genome-sequencing analysis of serum creatinine from Iceland5. Rare and common variants in the locus have been identified in exome chip studies27 and GWAS13 of eGFR, respectively. Slc47a1-knockout mice show higher levels of serum creatinine and BUN34, arguing against a sole effect on creatinine transport and supporting an effect on kidney function. |
PPM1J | rs34611728 | 5 | 0.02 | p.(Leu213Phe) (NP_005158.5) | 13.1 | ENCODE, kidney | This gene encodes a serine/threonine protein phosphatase. The variant has been reported in association with eGFR in an exome chip association study27. |
CERS2 | rs267738 | 5 | 0.46 | p.(Glu115Ala) (NP_071358.1) | 32.0/28.2 | - | Encodes ceramide synthase 2, which may be involved in sphingolipid synthesis. Changes in ceramide levels were reported as essential in renal Madin-Darby canine kidney (MDCK) cell differentiation56. Cers2-knockout mice show strongly reduced ceramide levels in the kidney and develop renal parenchyma abnormalities57. This variant has been reported as associated with the rate of albuminuria increase in individuals with diabetes58. |
C9 | rs700233 | 5 | 0.32 | p.(Arg5Trp) (NP_001728.1) | 6.6 | - | Encodes a constituent of the membrane attack complex that has a key role in the innate and adaptive immune responses. Rare mutations can cause C9 deficiency (MIM 613825). C9 is mentioned in several kidney disease case reports, including for patients with congenital factor 9 deficiency showing IgA nephropathy59. |
SLC22A2 | rs316019 | 4 | 0.04 | p.(Ser270Ala) (NP_003049.2) | 12.7 | - | Encodes the polyspecific organic cation transporter (OCT2) that is primarily expressed in the kidney, where it mediates tubular uptake of organic compounds including creatinine from the circulation. Many publications relate SLC22A2 to kidney function. rs316019 is a known pharmacogenomics variant associated with response to metformin and other drugs such as cisplatin. Carriers of the risk allele have a higher risk of cisplatin-induced nephrotoxicity43, indicating that this transporter is essential in excreting toxins. The locus has been reported in previous GWAS of eGFR13. |
Genes are included if they contain a missense SNP with a PP of association of >50% or map to a small credible set (≤5 SNPs). PP, posterior probability; CADD score, combined annotation-dependent depletion (CADD) Phred-like score (Methods); DHSs, DNase I-hypersensitive sites.
Boldface indicates the SNPs most likely to be relevant for kidney function on the basis of combined effects on eGFR and BUN.
To evaluate the regulatory potential of SNPs from small credible sets in the kidney, we annotated them to open chromatin regions identified from primary human tubular and glomerular cell cultures35, as well as from publicly available kidney cell types (Methods). We identified 72 SNPs mapping to one of these annotations, which may thus represent causal regulatory variants (Supplementary Table 12). A particularly interesting finding was the intronic rs77924615 SNP in PDILT, which showed PP > 99% of driving the association at the UMOD locus and mapped to open chromatin in all evaluated resources (native kidney cells, ENCODE and Roadmap kidney cell types; Fig. 4b).
Gene prioritization: colocalization with gene expression.
We performed colocalization analyses for each eGFR-associated locus with gene expression in cis across 46 tissues, including kidney glomerular and tubulo-interstitial compartments (Methods). PP > 80% of colocalization in at least one kidney tissue was observed for 17 transcripts mapping to 16 of the 228 replicated loci (Fig. 5), pointing toward a shared underlying SNP associated with both eGFR and gene expression and implicating the gene encoding the colocalized transcript as the effector gene for the locus.
New insights emerged on several levels: first, UMOD is a well-established causal gene for CKD and can therefore be used to evaluate our workflow. In the tubulo-interstitial compartment, we observed a shared underlying variant associated with higher UMOD gene expression and lower eGFR (Fig. 5), in agreement with previous GWAS of urinary uromodulin concentration, in which alleles associated with lower eGFR at UMOD15 were associated with higher urinary uromodulin concentrations36. The lead SNP at this locus was rs77924615, highlighted above as the candidate causal regulatory variant mapping to an intron of PDILT (upstream of UMOD). The association with differential UMOD but not PDILT gene expression supports UMOD as the causal gene and rs77924615 as a regulatory SNP.
Second, new biologically plausible candidates emerged. For example, our results suggest KNG1 and FGF5 as effector genes in the respective eGFR-associated loci (Fig. 5 and Supplementary Table 13). KNG1 encodes the high-molecular-weight kininogen, which is cleaved to bradykinin. Bradykinin influences blood pressure, natriuresis and diuresis and can be linked to kidney function via the renin-angiotensin-aldosterone system37. FGF5 encodes fibroblast growth factor 5, and the index SNPs for eGFR or highly correlated SNPs (r2 > 0.9) have been identified in multiple GWAS of blood pressure, atrial fibrillation, coronary artery disease, hematocrit and multiple kidney-function-related traits (Supplementary Table 13). The eGFR index SNP rs1458038 (PP > 50%, CADD score = 14.8; Supplementary Table 13) colocalized with the eGFR signal only in the tubulo-interstitial kidney portion (Fig. 5), supporting its regulatory potential in controlling the expression levels of FGF5 in this compartment. Both KNG1 and FGF5 index SNPs were associated with BUN and CKD and are thus probably related to kidney function.
Third, for loci that showed colocalization of eGFR signals with gene expression in kidney and multiple other tissues, in some cases the allelic effect direction on gene expression was concordant across all tissues (for example, METTL10), whereas in other cases it differed by tissue (for example, SH3YL1; Fig. 5). These observations were also reflected broadly across all transcripts with evidence of colocalization in any tissue (Supplementary Fig 9) and highlight tissue-shared and tissue-specific signals38,39.
Finally, trans expression quantitative trait locus (trans-eQTL) annotation of the index SNPs in whole and peripheral blood identified a reproducible link of rs10774625 (12q24.11) with several transcripts (Methods, Supplementary Table 14 and 15, and Supplementary Note 2).
Colocalization with uromodulin protein levels in urine.
The UMOD locus is of particular clinical interest for CKD research21: rare UMOD mutations cause autosomal dominant tubulo-interstitial kidney disease40, and common variants at UMOD give rise to the strongest eGFR and CKD GWAS signals15. We therefore performed conditional analyses based on European-ancestry-specific summary statistics and found two independent variants: rs77924615, mapping upstream of PDILT, and rs34882080, mapping to an intron of UMOD (Fig. 6a). SNP association with the urinary uromodulinto-creatinine ratio (UUCR) in one participating cohort (Fig. 6b) matched the eGFR association pattern. Colocalization of the conditional eGFR and UUCR associations was evaluated separately for rs34882080 (Fig. 6c) and rs77924615 (Fig. 6d). Both regions showed high probability of a shared underlying variant driving the respective associations with eGFR and UUCR levels (PP = 97% and 96%, respectively), further supporting rs77924615 as a causal regulatory variant and UMOD as its effector gene.
A summary of the various gene characterization results for replicated loci from the European-ancestry analysis is shown in Supplementary Table 16, to facilitate selection of the most promising candidates for further experimental studies.
Discussion
This trans-ancestry study is fivefold larger than previous GWAS meta-analyses for eGFR and identified 264 replicated loci, 166 of which are reported here for the first time. By also analyzing BUN, an established complementary marker of kidney function, we highlight eGFR-associated loci that are likely to be important for kidney function as opposed to creatinine metabolism and provide a comprehensive annotation resource. Clinical relevance is supported by associations of a GRS for low eGFR with higher odds of clinically diagnosed CKD, CKD-related phenotypes and hypertension. Enrichment analyses confirm the kidney as the main target organ. Colocalization of associations with eGFR and gene expression in the kidney implicates specific target genes for follow-up. Conditional analyses, fine-mapping and functional annotation at 228 replicated eGFR-associated loci among European-ancestry participants implicate single potentially causal variants at 20 loci.
Most previous GWAS meta-analyses for eGFR have been limited to a single ancestry group8 and did not prioritize causal variants or effector genes in associated loci. Although underpowered to uncover new loci, one previous trans-ancestry study used fine-mapping, resolving one signal to a single variant20, rs77924615 at UMOD-PDILT, which is also identified in our study. At this locus, we further characterized the relationship between the causal variant, UMOD expression in the target tissue and uromodulin protein levels. This increase in resolution—from a locus to a single potentially causal variant with its effector gene, protein and target tissue— represents a critical advance over 10 years of eGFR GWAS15 and is a prerequisite for translational research.
The complementary multi-tissue approaches, including enrichment analyses based on gene expression, regulatory annotations, and gene sets and pathways, highlight the kidney as the most important target organ. However, relatively few kidney-specific experimental datasets are publicly available. For example, the kidney is not well represented in the Genotype-Tissue Expression (GTEx) Project and is not included in its tissue-specific eQTL datasets38, emphasizing the value of open-access resources and in-depth characterization of uncommon tissues and cell types. We were able to specifically investigate the kidney by using a recently published eQTL dataset from glomerular and tubulo-interstitial portions of microdissected human kidney biopsies41, kidney-specific regulatory information from the ENCODE and Roadmap Epigenomics resources, and by obtaining regulatory information from primary cultures of human glomerular and tubulo-interstitial cells35.
Functional follow-up studies of potentially causal variants should benefit from prioritized loci that show clear evidence supporting one or a few SNPs driving the association signal. The fine-mapping workflow allowed us to prioritize several SNPs at single-SNP resolution or at a resolution of ≤5 SNPs, some of which may have broader clinical relevance. For example, the OCT2 protein encoded by SLC22A2 transports several cationic drugs such as metoprolol, cisplatin, metformin and cimetidine across the basolateral membrane of renal tubular cells42. The prioritized missense SNP encodes p.(Ser270Ala), a known pharmacogenomic variant that alters the transport of these drugs and their side effects, such as cisplatin-induced nephrotoxicity43. Along the same lines, the prioritized SNP encoding the p.(Ala465Val) substitution in the transporter MATE1 encoded by SLC47A1 may affect the ability to secrete drugs and other toxins from proximal tubular cells into the urine44 and hence alter CKD risk.
Strengths of this project include the large sample size with dense genotype imputation, standardized and automated phenotype generation and quality control, and independent replication, as well as the advanced and comprehensive downstream bioinformatics analyses. Further strengths are the use of BUN to prioritize eGFR-associated loci likely relevant for kidney function and to provide genome-wide BUN summary statistics as an annotation resource for other studies of eGFR. Moreover, we evaluated a GRS for eGFR for association with clinically diagnosed CKD in a large independent study. Among the limitations, non-European populations are still under-represented in our study, as in many other genomic efforts45. Statistical fine-mapping using trans-ancestry data with different LD structures can potentially narrow association signals. However, a sufficiently large reference dataset to compute ancestry-matched LD structure for summary-statistics-based fine-mapping was only available for European ancestry, highlighting the potential of future large-scale efforts with trans-ancestry fine-mapping and the need to generate data from non-European-ancestry populations, thereby enabling such endeavors. Finally, several SNPs had small effective sample sizes in some subpopulations, which might have affected the ability to assess between-ancestry heterogeneity and potentially underestimated true heterogeneity.
We estimated GFR from serum creatinine, as done in clinical practice and observational studies, because direct measurement of kidney function is invasive, time-consuming and burdensome. Under the assumption that genetic associations supported by multiple markers are less likely to reflect marker metabolism, we used BUN to prioritize eGFR-associated loci likely to be relevant to kidney function. Blood creatinine, urea and cystatin C concentrations are influenced not only by glomerular filtration but also by the synthesis, active secretion and reabsorption of these molecules, as illustrated by loci detected in our study: for example, the GATM locus was associated with eGFR but not with BUN, in agreement with the function of the encoded protein as a rate-limiting enzyme in creatine synthesis46. Conversely, the SLC14A2 locus was associated with BUN but not with eGFR, in line with the function of the encoded protein as a urea transporter47. Even so, lack of association for a SNP with one kidney function marker based on a combination of P value and effect direction may not necessarily mean that the locus is not relevant to kidney function. Our categorization of the eGFR loci into three classes on the basis of direction of effect and significance of BUN association should be interpreted with caution, with ‘likely’ and ‘unlikely’ reflecting uncertainty of the assignment. Factors complicating the comparison of eGFR and BUN associations at the locus level are differential statistical power, differential ancestry distribution and potential allelic heterogeneity. Further large-scale studies with multiple kidney function markers measured in the same individuals are therefore warranted.
To identify broadly representative and generalizable association signals, we focused on SNPs that were present in the majority of the participating studies. This choice might have limited our ability to uncover new variants or to fine-map low-frequency or population-specific variants, which represents a complementary avenue of research. Moreover, even with well-powered fine-mapping approaches, potentially causal SNPs need to be confirmed as functional variants in experimental studies. Although colocalization with gene expression can help prioritize effector genes, these associations are based on measures from a single time point and hence cannot answer whether changes in gene expression precede or follow changes in kidney function.
In summary, we have identified and characterized a large number of loci associated with eGFR and prioritized potential effector genes, driver variants and target tissues. These findings will help direct functional studies and advance the understanding of kidney function biology, a prerequisite to develop novel therapies to reduce the burden of CKD.
Online content
Any methods, additional references, Nature Research reporting summaries, source data, statements of code and data availability and associated accession codes are available at https://doi.org/10.1038/s41588-019-0407-x.
Methods
Overview.
We set up a collaborative meta-analysis based on a distributive data model and quality-control procedures. To maximize phenotype standardization across studies, an analysis plan and a command line script (https://github.com/genepi-freiburg/ckdgen-pheno) were created centrally and provided to all participating studies (mostly population-based studies; Supplementary Table 1). Data processing, analysis and troubleshooting instructions were distributed to all studies via a wiki system (https://ckdgen.eurac.edu/mediawiki/index.php/ CKDGen_Round_4_EPACTS_analysis_plan). Automatically generated summary files were checked centrally. Upon phenotype approval, studies ran their GWAS and uploaded results and imputation quality (IQ) information to a common calculation server. GWAS quality control was performed with GWAtoolbox60 and custom scripts to assess ancestry-matched allele frequencies and variant positions. All studies had their own research protocols approved by the respective local ethics committees. All participants in all studies provided written informed consent.
Phenotype definition.
Each study measured serum creatinine and BUN concentrations as described in Supplementary Table 1. Creatinine values obtained with a Jaffé assay before 2009 were calibrated by multiplying by 0.95 (ref.61). Studies on adults (>18 years of age) estimated GFR with the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) equation62, by using the R package nephro63. Studies on individuals who were 18 years old or younger used the Schwartz formula64. eGFR was winsorized at 15 and 200 ml min−1 per 1.73 m2. CKD was defined as an eGFR below 60 ml min−1 per 1.73 m2. In studies reporting blood urea measurements, BUN was derived as blood urea × 2.8, with units expressed as mg dl−1.
Genotyping and genotype imputation.
Genotypes were imputed on the basis of the Haplotype Reference Consortium v1.1 or 1000 Genomes Project phase 3 v5 (1000Gp3v5) ALL or phase 1 v3 (1000Gp1v3) ALL panel. Imputed variants were coded as allelic dosages accompanied by the corresponding IQ scores (IMPUTE2 info score, MACH/minimac RSQ or as applicable) and annotated on the NCBI b37 (hg19) reference build (see Supplementary Table 2 for study-specific genotyping arrays, haplotype phasing and genotype imputation methods).
Genome-wide association studies. Each study fitted sex- and age-adjusted linear regression models to log(eGFR) and BUN. Regression residuals were regressed on SNP dosage, assuming an additive genetic model. Study site, genetic principal components, relatedness and other study-specific features were accounted for in the study-specific models as appropriate (Supplementary Table 2). Logistic regression models were fitted for CKD.
Trans-ancestry GWAS meta-analysis.
Studies contributed 121 GWAS summary statistics files for eGFR (total post-quality-control n = 765,348), 60 GWAS files for CKD (total post-quality-control n = 625,219, including 64,164 CKD cases) and 65 GWAS files for BUN (total post-quality-control n = 416,178). Ancestry-specific details for eGFR, CKD and BUN are given in Supplementary Table 1.
Before meta-analysis, study-specific GWAS files were filtered to retain only variants with IQ score > 0.6 and minor allele count (MAC) > 10, and genomic control (GC) correction was applied in the case where GC factor λGC > 1. Fixed-effects inverse-variance-weighted meta-analysis was performed with METAL65, which was adapted to increase the precision of effect estimates and their standard errors (seven decimal places instead of four).
After meta-analysis of 43,994,957 SNPs, only SNPs present in ≥50% of the GWAS files and with total MAC ≥ 400 were retained. Across ancestry groups, this yielded 8,221,591 variants for eGFR (8,834,748 in European ancestry), 8,176,554 variants for BUN (8,358,347 in European ancestry) and 9,585,923 variants for CKD. Post-meta-analysis GC correction was not applied (LD score regression intercept ≈ 1 in all analyses of eGFR, BUN and CKD)66. The genome-wide significance level was set at 5 × 10−8. Between-study heterogeneity was assessed with the I2 statistic67. For CKD, variants with I2 ≥ 95% were removed to moderate the influence of single large studies. Variants were assigned to loci by selecting the SNP with the lowest P value across the genome as the index SNP, defining the corresponding locus as the 1-Mb segment centered on the index SNP, and repeating the procedure until no further genome-wide-significant SNPs remained. The extended major histocompatibility complex (MHC) region was considered as a single locus. A locus was considered to be new if not containing any variant identified by previous GWAS of eGFR.
Meta-regression analysis of trans-ancestry GWAS.
For eGFR, we evaluated ancestry-related heterogeneity by using the software Meta-Regression of Multi-Ethnic Genetic Association (MR-MEGA, v0.1.2)68 with study-specific GWAS results. Meta-regression models included three axes of genetic variation. Genomic control correction was applied to the meta-regression results. The 308 genome-wide-significant index SNPs from the trans-ancestry GWAS meta-analysis were tested for ancestry-related heterogeneity of the allelic effects at a significance level of 0.05/308 = 1.6 × 10−4 (referring to the corresponding P value as Panc-het).
Variance explained and genetic heritability.
The proportion of phenotypic variance explained by the index SNPs was estimated as , with β being the SNP effect, p the effect allele frequency and var the variance of the sex- and age-adjusted log(eGFR) residuals (assumed to be 0.016 on the basis of data from 11,827 European-ancestry participants of the population-based ARIC study)9. Genetic heritability for age- and sex-adjusted log(eGFR) was estimated with the R package MCMCglmm69 on the Cooperative Health Research in South Tyrol (CHRIS) study70, a participating pedigree-based study with 186 pedigrees of up to five generations (n = 4,373)71. We fitted two models with and without inclusion of the identified index SNPs (304/308), running 1 million MCMC iterations (burn-in = 500,000)71.
Comparison with and replication of results in the MVP.
The eGFR-associated SNPs identified in the discovery GWAS meta-analyses were tested for replication in a GWAS from the MVP23, an independent trans-ancestry study with participants recruited across 63 US Veterans Administration (VA) medical facilities. Written informed consent was obtained and all documents and protocols were approved by the VA Central Institutional Review Board. After genotyping and quality control, genotypes were phased and imputed on the 1000Gp3v5 reference panel. Serum creatinine was assessed up to 1 year before MVP enrollment by isotope dilution mass spectrometry. GFR was estimated by using the CKD-EPI equation62 after excluding subjects on dialysis, transplant patients, amputees, individuals on HIV medications and those with creatinine values of <0.4 mg dl−1. GWAS of eGFR on SNP dosage were performed by fitting linear regression models adjusted for age at creatinine measurement, age2, sex, body-mass index and the first ten genetic principal components, by using SNPTEST v2.5.4-beta72. All GWAS were stratified by self-reported ancestry (79.6% white non-Hispanic, 20.4% black non-Hispanic), diabetes and hypertension status. Results were combined across strata by fixed-effects inverse-variance-weighted meta-analysis in METAL65. This analysis encompassed a total of 280,722 individuals across all strata, of whom 216,518 were non-Hispanic whites (European ancestry). The MVP is described more extensively in the Supplementary Note 3.
Of the 308 eGFR index SNPs identified in the CKDGen trans-ancestry analysis, 305 variants or their good proxies were available in the MVP GWAS (proxies were required to have r2 ≥ 0.8 with the index SNP and were selected by maximum r2 followed by minimum distance in the case of ties). Replication testing of the 256 European-ancestry-specific index SNPs was restricted to the MVP European-ancestry GWAS. CKDGen and MVP meta-analysis results were pooled via sample-size-weighted meta-analysis of z scores with METAL65. In both the trans-ancestry and European-ancestry-specific analyses, replication was defined by one-sided P < 0.05 in the MVP and genome-wide significance of the CKDGen and MVP meta-analysis result.
Assessment of relevance to kidney function with BUN.
We used genetic associations with BUN to assess replicated eGFR-associated SNPs with respect to their potential relevance to kidney function. Support for kidney function relevance was categorized as ‘likely’ for all eGFR index SNPs with an inverse, significant (one-sided P < 0.05) association with BUN for a given reference allele, ‘inconclusive’ for eGFR index SNPs whose effect on BUN was not different from0 (P ≥ 0.05) and ‘unlikely’ for all eGFR index SNPs with a concordant, significant (one-sided P < 0.05) association with BUN for a given reference allele.
Genetic risk score analysis in the UK Biobank dataset.
To test the combined effect of eGFR-associated SNPs on outcomes related to clinically diagnosed CKD, a GRS-based association analysis was conducted on the basis of summary GWAS results, as described previously73,74. Genetic association results with diseases were obtained for 452,264 UK Biobank participants available in the GeneAtlas75 database for glomerular diseases (ICD-10 codes N00-N08; 2,289 cases); acute renal failure (N17; 4,913 cases); chronic renal failure (N18; 4,905 cases); urolithiasis (N20-N23; 7,053 cases); hypertensive diseases (I10-I15; 84,910 cases); and ischemic heart diseases (I20-I25; 33,387 cases). Asthma (J45; 28,628 cases) was included as a negative control. The log(estimated OR) value provided by the GeneAtlas PheWAS interface was used as the effect size, and its standard error was calculated from the corresponding effect size and P value. When OR = 1, the standard error was imputed by the median value of the remaining associations of the trait. Of the147 eGFR index SNPs from the trans-ancestry GWAS meta-analysis that were replicated and showed likely relevance to kidney function, 144 were available in the UK Biobank dataset, and 259 of all 264 replicated trans-ancestry GWAS meta-analysis SNPs were available. The effect of the GRS association (β) corresponds to the OR for the disease depending on the relative change in eGFR, for example, OR = 1.10β for a 10% change in eGFR. Alternatively, exp(β) can be interpreted as the OR for the disease per unit change in log(eGFR).
Genetic correlations with other complex traits and diseases.
Genome-wide genetic correlation analysis was performed to investigate evidence of co-regulation or shared genetic basis between eGFR and BUN concentrations and other complex traits and diseases, both known and not known to correlate with eGFR and BUN. We estimated pairwise genetic correlation coefficients (rg) between the results of our trans-ancestry meta-analyses of eGFR and BUN and each of 748 precomputed and publicly available GWAS summary statistics for complex traits and diseases available through LD Hub v1.9.0 by using LD score regression28. An overview of the sources of these summary statistics and their corresponding sample sizes is available at http://ldsc.broadinstitute.org/. Statistical significance was assessed at the Bonferroni-corrected level of 0.05/748 = 6.7 × 10−5.
Pathway and tissue enrichment analysis.
We used DEPICT v1 release 194 to perform DEPICT analysis29, including pathway/gene set enrichment and tissue/cell-type analyses as described previously9,10. All 14,461 gene sets were reconstituted by identifying genes that were transcriptionally co-regulated with other genes in a panel of 77,840 gene expression microarrays76, from mouse knockout studies, and molecular pathways from protein-protein interaction screening. In the tissue and cell-type enrichment analysis, we tested whether genes in associated regions were highly expressed in 209 MeSH annotation categories for 37,427 microarrays (Affymetrix U133 Plus 2.0 array platform). For both eGFR and BUN, we included all variants associated with the trait at P < 5 × 10−8 in the trans-ancestry meta-analysis. Independent variant clumping was performed by using PLINK 1.9 (ref.77) with 500-kb flanking regions and r2> 0.01 in the 1000Gp1v3 dataset. After excluding the MHC region, DEPICT was run with 500 repetitions to estimate the FDR and 5,000 permutations to compute P values adjusted for gene length by using 500 null GWAS. All significant gene sets were merged into meta gene sets by running an affinity propagation algorithm78 implemented in the Python scikit-learn package (http://scikit-learn.org/). The resulting network was visualized with Cytoscape (http://cytoscape.org/).
Enrichment of heritability by cell-type group.
We used stratified LD score regression to investigate important tissues and cell types on the basis of the trans-ancestry eGFR and BUN meta-analysis results. Heritability enrichment in ten cell-type groups was assessed by using the default options of stratified LD score regression described previously30. The ten cell-type groups were collapsed from 220 cell-type-specific regulatory annotations for the four histone marks H3K4me1, H3K4me3, H3K9ac and H3K27ac. Enrichment in a cell-type category was defined as the proportion of SNP heritability in that group divided by the proportion of SNPs in the same cell-type group.
Analysis of genes causing kidney phenotypes in mice.
A nested candidate gene analysis was performed with GenToS79 to identify additional genetic associations that were not genome-wide significant. Candidate genes that when manipulated cause kidney phenotypes in mice were selected with the comprehensive MGI phenotype ontology in September 2017 (abnormal renal glomerular filtration rate (MP:0002847); abnormal kidney morphology (MP:0002135); abnormal kidney physiology (MP:0002136)). The human orthologs of these genes were obtained, when available, with the Human-Mouse: Disease Connection webtool (http://www.informatics.jax.org/humanDisease.html). Statistical significance was defined as Bonferroni correction of a type I error level of 0.05 for the number of independent common SNPs across all genes in each of the three candidate gene lists plus their flanking regions, derived from an ancestry-matched reference population. The GWAS meta-analysis summary statistics for eGFR were queried for significantly associated SNPs mapping to the selected candidate genes. Enrichment of significant genetic associations in genes within each candidate list was computed from the complementary cumulative binomial distribution79. GenToS was used with default parameters on each of the three candidate gene lists, with the 1000 Genomes phase 3 release 2 ALL dataset as reference.
Independent variant identification in the European-ancestry meta-analysis.
To identify additional independent eGFR-associated variants within the European-ancestry-specific and replicated loci, approximate conditional analyses were performed on the basis of genome-wide discovery summary statistics that incorporated LD information from an ancestry-matched reference population. These analyses were restricted to participants of European ancestry because an LD reference sample scaled to the size of our meta-analysis could only be constructed from publicly available data for European-ancestry individuals24, for which we randomly selected 15,000 UK Biobank participants (dataset ID 8974). Individuals who withdrew consent and those not meeting data cleaning requirements were excluded, keeping only those who passed a sex-consistency check, had a >95% call rate and did not represent outliers with respect to SNP heterozygosity. For each pair of individuals, the proportion of variants shared identical by descent (IBD) was computed with PLINK80. Only one member of each pair with an IBD coefficient ≥0.1875 was retained. Individuals were restricted to those of European ancestry by excluding outliers along the first two principal components from a principal-component analysis seeded with the HapMap phase 3 release 2 populations as reference. The final dataset to estimate LD included 13,558 European-ancestry individuals and 16,969,363 SNPs.
The basis for statistical fine-mapping was the 228 1-Mb genome-wide-significant loci identified in the European-ancestry meta-analysis, clipping at chromosome borders. Overlapping loci as well as pairs of loci whose respective index SNPs were correlated (r2 > 0.1 in the UK Biobank LD dataset described above) were merged. A single SNP was chosen to represent the MHC region, resulting in a final list of 189 regions before fine-mapping. Within each region, the GCTA COJO Slct algorithm81 was applied to identify independent variants by using a stepwise forward selection approach. We used the default collinearity cutoff of 0.9 (sensitivity analyses showed no major influence of alternative cutoff values; data not shown). We deemed an additional SNP as independently genome-wide significant if the SNP’s P value conditional on all previously identified SNPs in the same region was <5 × 10−8.
Fine-mapping and credible sets in the European-ancestry meta-analysis.
Foreach region containing multiple independent SNPs and for each independent SNP in such regions, approximate conditional analyses were conducted with the GCTA COJO-Cond algorithm to generate approximate conditional association statistics conditioned on the other independent SNPs in the region. By using Wakefield’s formula implemented in the R package gtx82, we derived approximate Bayes factors (ABFs) from conditional estimates in regions with multiple independent SNPs and from the original estimates for regions with a single independent SNP. Given that 95% of the SNP effects on log(eGFR) fell within the range −0.01 to 0.01, the standard deviation prior was chosen as 0.0051 on the basis of formula (8) in the original publication32. Sensitivity analyses showed that results were robust when higher values were used for the standard deviation prior (data not shown). For each variant within an evaluated region, the ABF obtained from the association β values and their standard errors for the marginal (single-signal regions) or conditional (multi-signal regions) estimates was used to calculate the PP for a SNP of driving the association signal (‘causal variant’). We derived 99% credible sets, representing the SNP sets containing the variant(s) driving the association signal with 99% probability, by ranking variants by their PPs and adding them to the set until cumulative PP > 99% was reached in each region.
Variant annotation.
Functional annotation of SNPs mapping to credible sets was performed with SNiPA v3.2 (March 2017)83, on the basis of the 1000Gp3v5 and Ensembl v87 datasets. SNiPA was also used to derive the CADD Phred-like score84, on the basis of CADD v1.3. The Ensembl VEP tool85 was used for prediction of the primary effects of SNPs.
Colocalization of eGFR signal and gene expression in cis.
As the great majority of gene expression datasets are generated on the basis of European-ancestry samples, colocalization analysis was based on genetic associations with eGFR in the European-ancestry sample and with gene expression (eQTLs) quantified from microdissected human glomerular and tubulo-interstitial kidney portions from 187 individuals from the NEPTUNE study41, as well as the 44 tissues included in the GTEx Project v6p release38. The eQTL and GWAS effect alleles were harmonized. For each locus, we identified tissue gene pairs with reported eQTL data within ±100 kb of each GWAS index SNP. The region for each colocalization test was the eQTL cis window defined in the underlying GTEx and NephQTL studies. We used the coloc.fast function, with the default setting, from the R package gtx (https://github.com/tobyjohnson/gtx), which is an adaptation of Giambartolomei’s colocalization method86. The gtx package was also used to estimate the direction of effect over the credible sets as the ratio of the average PP-weighted GWAS effects over the PP-weighted eQTL effects.
Trans-eQTL analysis.
We performed trans-eQTL annotation through LD mapping on the basis of the 1000Gp3v5 European reference panel (r2 cutoff of >0.8). We limited annotation to replicated index SNPs with fine-mapping PP ≥ 1%. Owing to expected small effect sizes, only genome-wide trans-eQTL studies of either peripheral blood mononuclear cells or whole blood with n > 1,000 individuals were considered, resulting in five non-overlapping studies87–91 (Supplementary Table 14). For one study91, we had access to an update with larger sample size (n = 6,645) obtained by combining two non-overlapping studies (LIFE-Heart92 and LIFE-Adult93). To improve the stringency of results, we focused the analysis on interchromosomal trans-eQTLs with P < 5 × 10−8 in ≥2 studies.
Colocalization with urinary uromodulin concentrations.
Association of genetic variants with UUCR at the UMOD-PDILT locus was evaluated in the German Chronic Kidney Disease (GCKD) study94. Uromodulin concentrations were measured from frozen stored urine by an established ELISA with excellent performance36. Concentrations were indexed to creatinine to account for urine dilution. Genetic associations were assessed with the same software and settings as for eGFR association (Supplementary Table 2). Colocalization analyses were performed with identical software and settings as described above for the association with gene expression.
Reporting Summary.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
Genome-wide summary statistics for this study have been made publicly available at http://ckdgen.imbi.uni-freiburg.de.
Supplementary Material
Acknowledgements
We thank D. Di Domizio (Eurac Research) and J. Knaus (University of Freiburg) for IT assistance and T. Johnson (GlaxoSmithKline) for sharing his code and discussion on credible set fine-mapping and colocalization analysis. This research has been conducted using the UK Biobank resource under application number 20272. Study-specific acknowledgements and funding sources are listed in the Supplementary Information.
Footnotes
competing interests
W. Koenig reports modest consultation fees for advisory board meetings from Amgen, DalCor, Kowa, Novartis, Pfizer and Sanofi and modest personal fees for lectures from Amgen, AstraZeneca, Novartis, Pfizer and Sanofi, all outside the scope of the submitted work. W.M. is employed with Synlab Services and holds shares of Synlab Holding Deutschland. D.O.M.-K. is a part-time research consultant at Metabolon. M.A.N. is supported by a consulting contract between Data Tecnica International and the National Institute on Aging (NIA), National Institutes of Health (NIH) and consults for Illumina, the Michael J. Fox Foundation and University of California Healthcare. O.H.F. works in ErasmusAGE, a center for aging research across the life course funded by Nestlé Nutrition (Nestec); Metagenics; and AXA. K.B.S., L.Y.-A., D.M.W. and M.A.L. are full-time employees of GlaxoSmithKline. M.L.O’D. received grant support from GlaxoSmithKline, MSD, Eisai, AstraZeneca, MedCo and Janssen. H.W. received grants and non-financial support from GlaxoSmithKline, during the conduct of the study; grants from Sanofi-Aventis, Eli Lilly, the National Institute of Health, Omthera Pharmaceuticals, Pfizer New Zealand, Elsai Inc. and Dalcor Pharma UK; honoraria and non-financial support from AstraZeneca; and is on advisory boards for Sirtex and Acetilion and received personal fees from CSL Behring and American Regent outside the scope of the submitted work. L. Wallentin received institutional grants from GlaxoSmithKline, AstraZeneca, BMS, Boehringer-Ingelheim, Pfizer, MSD and Roche Diagnostics. D.F.R. and A.I.P. are employees of MSD. M. Scholz received consultancy of and grant support from Merck Serono not related to this project. B.M.P. serves on the DSMB of a clinical trial funded by the manufacturer (Zoll LifeCor) and on the steering committee of the Yale Open Data Access Project funded by Johnson & Johnson. J. Danesh is a member of the Novartis Cardiovascular and Metabolic Advisory Board and received grant support from Novartis. A.S.B. received grants from MSD, Pfizer, Novartis, Biogen and Bioverativ and personal fees from Novartis. V.S. has participated in a conference trip sponsored by Novo Nordisk and received a honorarium from the same source for participating in an advisory board meeting. A. Köttgen received grant support from Gruenenthal. All other authors declare no conflicts of interest.
Supplementary information is available for this paper at https://doi.org/10.1038/s41588-019-0407-x.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Eckardt KU et al. Evolving importance of kidney disease: from subspecialty to global health burden. Lancet 382, 158–169 (2013). [DOI] [PubMed] [Google Scholar]
- 2.Jha V et al. Chronic kidney disease: global dimension and perspectives. Lancet 382, 260–272 (2013). [DOI] [PubMed] [Google Scholar]
- 3.Ene-Iordache B et al. Chronic kidney disease and cardiovascular risk in six regions of the world (ISN-KDDC): a cross-sectional study. Lancet Glob. Health 4, e307–e319 (2016). [DOI] [PubMed] [Google Scholar]
- 4.Go AS, Chertow GM, Fan D, McCulloch CE & Hsu CY Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. N. Engl. J. Med 351, 1296–1305 (2004). [DOI] [PubMed] [Google Scholar]
- 5.GBD 2016 Causes of Death Collaborators. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet 390, 1151–1210 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Inrig JK et al. The landscape of clinical trials in nephrology: a systematic review of ClinicalTrials.gov. Am. J. Kidney Dis. 63, 771–780 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Levin A et al. Global kidney health 2017 and beyond: a roadmap for closing gaps in care, research, and policy. Lancet 390, 1888–1917 (2017). [DOI] [PubMed] [Google Scholar]
- 8.Wuttke M & Kottgen A Insights into kidney diseases from genome-wide association studies. Nat. Rev. Nephrol 12, 549–562 (2016). [DOI] [PubMed] [Google Scholar]
- 9.Gorski M et al. 1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function. Sci. Rep 7, 45040 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Pattaro C et al. Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function. Nat. Commun 7, 10023 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Chasman DI et al. Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function. Hum. Mol. Genet 21, 5329–5343 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pattaro C et al. Genome-wide association and functional follow-up reveals new loci for kidney function. PLoS Genet. 8, e1002584 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kottgen A et al. New loci associated with kidney function and chronic kidney disease. Nat. Genet 42, 376–384 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chambers JC et al. Genetic loci influencing kidney function and chronic kidney disease. Nat. Genet 42, 373–375 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kottgen A et al. Multiple loci associated with indices of renal function and chronic kidney disease. Nat. Genet 41, 712–717 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kanai M et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat. Genet 50, 390–400 (2018). [DOI] [PubMed] [Google Scholar]
- 17.Okada Y et al. Meta-analysis identifies multiple loci associated with kidney function-related traits in east Asian populations. Nat. Genet 44, 904–909 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hishida A et al. Genome-wide association study of renal function traits: results from the Japan Multi-institutional Collaborative Cohort study. Am. J. Nephrol 47, 304–316 (2018). [DOI] [PubMed] [Google Scholar]
- 19.Lee J et al. Genome-wide association analysis identifies multiple loci associated with kidney disease-related traits in Korean populations. PLoS One 13, e0194044 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Mahajan A et al. Trans-ethnic fine mapping highlights kidney-function genes linked to salt sensitivity. Am. J. Hum. Genet 99, 636–646 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Devuyst O & Pattaro C The UMODlocus: insights into the pathogenesis and prognosis of kidney disease. J. Am. Soc. Nephrol 29, 713–726 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Yeo NC et al. Shroom3 contributes to the maintenance of the glomerular filtration barrier integrity. Genome Res. 25, 57–65 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gaziano JM et al. Million Veteran Program: a mega-biobank to study genetic influences on health and disease. J. Clin. Epidemiol 70, 214–223 (2016). [DOI] [PubMed] [Google Scholar]
- 24.Benner C et al. Prospects of fine-mapping trait-associated genomic regions by using summary statistics from genome-wide association studies. Am. J. Hum. Genet 101, 539–551 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.McCarthy S et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet 48, 1279–1283 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Abecasis GR et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li M et al. SOS2 and ACP1 loci identified through large-scale exome chip analysis regulate kidney development and function. J. Am. Soc. Nephrol 28, 981–994 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Bulik-Sullivan B et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet 47, 1236–1241 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Pers TH et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun 6, 5890 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Finucane HK et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet 47, 1228–1235 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Jing J et al. Combination of mouse models and genomewide association studies highlights novel genes associated with human kidney function. Kidney Int. 90, 764–773 (2016). [DOI] [PubMed] [Google Scholar]
- 32.Wakefield J A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am. J. Hum. Genet 81, 208–227 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dong C et al. Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum. Mol. Genet 24, 2125–2137 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Tsuda M et al. Targeted disruption of the multidrug and toxin extrusion 1 (Mate1) gene in mice reduces renal secretion of metformin. Mol. Pharm 75, 1280–1286 (2009). [DOI] [PubMed] [Google Scholar]
- 35.Sieber KB et al. Integrated functional genomic analysis enables annotation of kidney genome-wide association study loci. J. Am. Soc. Nephrol 30, 421–441 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Olden M et al. Common variants in UMOD associate with urinary uromodulin levels: a meta-analysis. J. Am. Soc. Nephrol 25, 1869–1882 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Moreau ME et al. The kallikrein-kinin system: current and future pharmacological targets. J. Pharm. Sci 99, 6–38 (2005). [DOI] [PubMed] [Google Scholar]
- 38.Battle A, Brown CD, Engelhardt BE & Montgomery SB Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Gamazon ER et al. Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation. Nat. Genet 50, 956–967 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Eckardt KU et al. Autosomal dominant tubulointerstitial kidney disease: diagnosis, classification, and management—a KDIGO consensus report. Kidney Int. 88, 676–683 (2015). [DOI] [PubMed] [Google Scholar]
- 41.Gillies CE et al. An eQTL landscape of kidney tissue in human nephrotic syndrome. Am. J. Hum. Genet 103, 232–244 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Dudley AJ, Bleasby K & Brown CD The organic cation transporter OCT2 mediates the uptake of β-adrenoceptor antagonists across the apical membrane of renal LLC-PK1 cell monolayers. Br. J. Pharm 131, 71–79 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Filipski KK, Mathijssen RH, Mikkelsen TS, Schinkel AH & Sparreboom A Contribution of organic cation transporter 2 (OCT2) to cisplatin-induced nephrotoxicity. Clin. Pharm. Ther 86, 396–402 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Motohashi H & Inui K Organic cation transporter OCTs (SLC22) and MATEs (SLC47) in the human kidney. AAPS J 15, 581–588 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Popejoy AB & Fullerton SM Genomics is failing on diversity. Nature 538, 161–164 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Humm A, Huber R & Mann K The amino acid sequences of human and pig L-arginine:glycine amidinotransferase. FEBS Lett. 339, 101–107 (1994). [DOI] [PubMed] [Google Scholar]
- 47.Olives B et al. Molecular characterization of a new urea transporter in the human kidney. FEBS Lett. 386, 156–160 (1996). [DOI] [PubMed] [Google Scholar]
- 48.Phan NN et al. Voltage-gated calcium channels: novel targets for cancer therapy. Oncol. Lett 14, 2059–2074 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Thi Do D, Phan NN, Wang CY, Sun Z & Lin YC Novel regulations of MEF2-A, MEF2-D, and CACNA1S in the functional incompetence of adipose-derived mesenchymal stem cells by induced indoxyl sulfate in chronic kidney disease. Cytotechnology 68, 2589–2604 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Parsa A et al. Common variants in Mendelian kidney disease genes and their association with renal function. J. Am. Soc. Nephrol 24, 2105–2117 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Xie W et al. Genetic variants associated with glycine metabolism and their role in insulin sensitivity and type 2 diabetes. Diabetes 62, 2141–2150 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Raffler J et al. Genome-wide association study with targeted and non-targeted NMR metabolomics identifies 15 novel loci of urinary human metabolic individuality. PLoS Genet. 11, e1005487 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Janicki N et al. Increased occurence of valporoic acid-induced hyperammonemia in carriers of T1405N polymorphism in carbamoyl phosphate synthetase 1 gene. ISRN Neurol. 2013, 261497 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Seppala A et al. Genome-wide association study on dimethylarginines reveals novel AGXT2 variants associated with heart rate variability but not with overall mortality. Eur. Heart J. 35, 524–531 (2014). [DOI] [PubMed] [Google Scholar]
- 55.Sveinbjornsson G et al. Rare mutations associating with serum creatinine and chronic kidney disease. Hum. Mol. Genet 23, 6935–6943 (2014). [DOI] [PubMed] [Google Scholar]
- 56.Pescio LG et al. Changes in ceramide metabolism are essential in Madin-Darby canine kidney cell differentiation. J. Lipid Res. 58, 1428–1438 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Imgrund S et al. Adult ceramide synthase 2 (CERS2)-deficient mice exhibit myelin sheath defects, cerebellar degeneration, and hepatocarcinomas. J. Biol. Chem 284, 33549–33560 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Shiffman D et al. A gene variant in CERS2 is associated with rate of increase in albuminuria in patients with diabetes from ONTARGET and TRANSCEND. PLoS One 9, e106631 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Yoshioka K et al. IgA nephropathy in patients with congenital C9 deficiency. Kidney Int. 42, 1253–1258 (1992). [DOI] [PubMed] [Google Scholar]
- 60.Fuchsberger C, Taliun D, Pramstaller PP & Pattaro C GWAtoolbox: an R package for fast quality control and handling of genome-wide association studies meta-analysis data. Bioinformatics 28, 444–445 (2012). [DOI] [PubMed] [Google Scholar]
- 61.Coresh J et al. Decline in estimated glomerular filtration rate and subsequent risk of end-stage renal disease and mortality. J. Am. Med. Assoc 311, 2518–2531 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Levey AS et al. A new equation to estimate glomerular filtration rate. Ann. Intern. Med 150, 604–612 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Pattaro C et al. Estimating the glomerular filtration rate in the general population using different equations: effects on classification and association. Nephron Clin. Pract 123, 102–111 (2013). [DOI] [PubMed] [Google Scholar]
- 64.Schwartz GJ et al. Improved equations estimating GFR in children with chronic kidney disease using an immunonephelometric determination of cystatin C. Kidney Int. 82, 445–453 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Willer CJ, Li Y & Abecasis GR METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Bulik-Sullivan BK et al. LD Score regression distinguishes confoundingfrom polygenicity in genome-wide association studies. Nat. Genet 47, 291–295 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Higgins JP & Thompson SG Quantifying heterogeneity in a meta-analysis. Stat. Med 21, 1539–1558 (2002). [DOI] [PubMed] [Google Scholar]
- 68.Magi R et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet 26, 3639–3650 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Hadfield J MCMC methods for multi-response generalized linear mixed models: the MCMC glmm R package. J. Stat. Softw 33, 1–22 (2010).20808728 [Google Scholar]
- 70.Pattaro C et al. The Cooperative Health Research in South Tyrol (CHRIS) study: rationale, objectives, and preliminary results. J. Transl. Med 13, 348 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Noce D et al. Sequential recruitment of study participants may inflate genetic heritability estimates. Hum. Genet 136, 743–757 (2017). [DOI] [PubMed] [Google Scholar]
- 72.Marchini J, Howie B, Myers S, McVean G & Donnelly P A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet 39, 906–913 (2007). [DOI] [PubMed] [Google Scholar]
- 73.Kottgen A et al. Genome-wide association analyses identify 18 new loci associated with serum urate concentrations. Nat. Genet 45, 145–154 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Dastani Z et al. Novel loci for adiponectin levels and their influence on type 2 diabetes and metabolic traits: a multi-ethnic meta-analysis of 45,891 individuals. PLoS Genet. 8, e1002607 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Canela-Xandri O, Rawlik K & Tenesa A An atlas of genetic associations in UK Biobank. Nat. Genet 50, 1593–1599 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Fehrmann RS et al. Gene expression analysis identifies global gene dosage sensitivity in cancer. Nat. Genet 47, 115–125 (2015). [DOI] [PubMed] [Google Scholar]
- 77.Chang CC et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Frey BJ & Dueck D Clustering by passing messages between data points. Science 315, 972–976 (2007). [DOI] [PubMed] [Google Scholar]
- 79.Hoppmann AS, Schlosser P, Backofen R, Lausch E & Kottgen A GenToS: use of orthologous gene information to prioritize signals from human GWAS. PLoS One 11, e0162466 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Purcell S et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet 81, 559–575 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Yang J, Lee SH, Goddard ME & Visscher PM GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet 88, 76–82 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Wakefield J Bayes factors for genome-wide association studies: comparison with P-values. Genet. Epidemiol 33, 79–86 (2009). [DOI] [PubMed] [Google Scholar]
- 83.Arnold M, Raffler J, Pfeufer A, Suhre K & Kastenmuller G SNiPA: an interactive, genetic variant-centered annotation browser. Bioinformatics 31, 1334–1336 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Kircher M et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet 46, 310–315 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.McLaren W et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 26, 2069–2070 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Giambartolomei C et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet 10, e1004383 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Zeller T et al. Genetics and beyond—the transcriptome of human monocytes and disease susceptibility. PLoS One 5, e10693 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Fehrmann RS et al. Trans-eQTLs reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the HLA. PLoS Genet. 7, e1002197 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Westra HJ et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet 45, 1238–1243 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Joehanes R et al. Integrated genome-wide analysis of expression quantitative trait loci aids interpretation of genomic association studies. Genome Biol. 18, 16 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Kirsten H et al. Dissecting the genetics of the human transcriptome identifies novel trait-related trans-eQTLs and corroborates the regulatory relevance of non-protein coding loci. Hum. Mol. Genet 24, 4746–4763 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Beutner F et al. Rationale and design of the Leipzig (LIFE) Heart Study: phenotyping and cardiovascular characteristics of patients with coronary artery disease. PLoS One 6, e29070 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Loeffler M et al. The LIFE-Adult-Study: objectives and design of a population-based cohort study with 10,000 deeply phenotyped adults in Germany. BMC Public Health 15, 691 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Eckardt KU et al. The German Chronic Kidney Disease (GCKD) study: design and methods. Nephrol. Dial. Transpl 27, 1454–1460 (2012). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Genome-wide summary statistics for this study have been made publicly available at http://ckdgen.imbi.uni-freiburg.de.