Abstract
Urinary albumin excretion is an early sign of diabetic kidney disease, affecting every third individual with diabetes. Despite substantial estimated heritability, only variants in the GLRA3 gene have been genome-wide significantly associated (p-value < 5 × 10−8) with diabetic albuminuria, in Finnish individuals with type 1 diabetes; However, replication attempt in non-Finnish Europeans with type 1 diabetes showed nominally significant association in the opposite direction, suggesting a population-specific effect, but simultaneously leaving the finding controversial. In this study, the association between the common rs10011025 variant in the GLRA3 locus, and albuminuria, was confirmed in 1259 independent Finnish individuals with type 1 diabetes (p = 0.0013), and meta-analysis of all Finnish individuals yielded a genome-wide significant association. The association was particularly pronounced in subjects not reaching the treatment target for blood glucose levels (HbA1c > 7%; N = 2560, p = 1.7 × 10−9). Even though further studies are needed to pinpoint the causal variants, dissecting the association at the GLRA3 locus may uncover novel molecular mechanisms for diabetic albuminuria irrespective of population background.
Introduction
Diabetic nephropathy (DN) is a devastating disease affecting one third of the individuals with diabetes, with up to 20% of subjects with type 1 diabetes developing end-stage renal disease (ESRD) requiring dialysis treatment or renal transplantation for survival1. DN is classically characterized by onset of albuminuria, and subsequent loss of glomerular filtration rate (GFR). Even though loss of renal function may occur also without albuminuria, albuminuria remains an important early predictor of decline of renal function in diabetes2. Importantly, elevated albuminuria levels, even within the normal range (albumin excretion rate [AER] < 30 mg/24 hours), predict higher renal risk3. Furthermore, already moderately increased AER, i.e. microalbuminuria (30–300 mg/24 hours), is associated with 3-fold increased mortality rate compared with diabetic individuals with normal AER, and patients with macroalbuminuria (AER > 300 mg/24 hours) or ESRD have as high as six to fourteen-fold mortality risk, respectively4. There is also an increasing body of evidence that albuminuria may play a pathogenic role in renal disease2. Thus, it is of importance to target treatment already at the early stage of DN. However, the treatment options for DN are currently mainly limited to antihypertensive medication, and the development of novel medications has proved challenging as the pathology of DN remains poorly understood.
Albuminuria has been shown to have a genetic component, with heritability estimates of 30% to 45% in individuals with type 2 diabetes5,6, and 27% in type 1 diabetes7. However, genome-wide association studies (GWAS) have identified only a few genetic susceptibility loci for DN with genome-wide statistical significance, i.e. p-value < 5 × 10−8, with the main findings often related to the most extreme ESRD phenotype7–10. However, a recent GWAS on albuminuria as continuous trait found suggestive evidence of association between albuminuria and variants in the RAB38 and HS6ST1 genes (p-values < 10−6) in the subset of individuals with diabetes11.
Our previous GWAS on albuminuria, as measured by 24-hour AER, identified five single nucleotide polymorphisms (SNPs) in the GLRA3 gene that were associated with albuminuria with genome-wide significance (p-value = 1.5 × 10−9 for rs10011025) in Finnish individuals with type 1 diabetes in the Finnish Diabetic Nephropathy Study (FinnDiane). Replication in 598 additional FinnDiane individuals with AER measured from timed overnight urine collections (nu-AER) showed a non-significant trend but indeed in the same direction. On the contrary, a replication attempt in 3152 non-Finnish European individuals with type 1 diabetes reached a nominally-significant p-value of 0.03 for a directly genotyped SNP, rs1564939 in the GLRA3 gene, but with the opposite allele associated with albuminuria than in the Finnish subjects. Therefore, the association could not be considered replicated, and remained inconclusive despite the originally genome-wide significant results7.
We hypothesized that the observed associations with the common SNPs on the GLRA3 gene in fact reflect population specific effects and may therefore vary on effect direction. In order to study this topic further, we here assessed the associations of the previously identified genetic variants in additional Finnish individuals recently recruited into the FinnDiane study in order to confirm or to refute the role of these variants for AER in Finnish patients with type 1 diabetes. We further dissected the flanking region to detect low frequency and rare variants contributing to the association seen at the common rs10011025 variant.
Results
Replication of the previous results
We previously selected for replication three SNPs genome-wide significantly associated with albuminuria (p < 5 × 10−8), plus rs11725853 (p = 1.8 × 10−7), from the GLRA3 locus. All were in notable linkage disequilibrium (LD) with the lead SNP rs10011025 in the 1000 Genomes Finnish individuals (r2 = 0.78–0.94, D′ = 0.90–1), while in moderate to high LD in British individuals (r2 = 0.61–0.97, D′ = 0.93–1)12. In this study, we identified 902 newly recruited FinnDiane patients with 24-hour AER measurements, and 357 patients with overnight nu-AER measurements, and with GWAS data available (Table 1). It is of note that these patients had not been included in the previous GWAS or in the previous replication analysis. In a meta-analysis of these two novel replication sets, all four abovementioned SNPs from the GLRA3 region were associated with AER (p < 0.004) with the direction of effect in line with the original GWAS analyses, i.e. the minor alleles were associated with higher levels of AER (Table 2). As in the original GWAS analysis, the strongest association was obtained for rs10011025 with each additional minor G allele estimated to increase log10(AER) by 0.12 (95% confidence interval [CI] 0.05–0.19), equal to multiplying raw AER by 1.12.
Table 1.
Characteristic | 24-hour AER | nu-AER | P |
---|---|---|---|
N | 902 | 357 | |
Men (%) | 457 (50.7) | 176 (49.3) | NS |
Age at onset of diabetes (years) | 16.7 ± 9.5 | 14.9 ± 8.9 | 0.001 |
Age (years) | 38.2 ± 12.3 | 42.1 ± 12.6 | <0.001 |
Duration of diabetes (years) | 21.5 ± 11.0 | 27.2 ± 12.1 | <0.001 |
AHT medication (%) | 288 (31.9) | 183 (51.3) | <0.001 |
SBP (mmHg) | 132.6 ± 17.3 | 136.1 ± 17.8 | 0.0025 |
DBP (mmHg) | 78.9 ± 9.6 | 78.0 ± 10.4 | NS |
HbA1c (%) | 8.5 ± 1.4 | 8.5 ± 1.4 | NS |
HbA1c (mmol/mol) | 69.3 ± 15.0 | 69.1 ± 15.2 | NS |
Mean HbA1c (%) | 8.5 ± 1.3 | 8.5 ± 1.2 | NS |
Mean HbA1c (mmol/mol) | 69.6 ± 13.9 | 69.3 ± 13.6 | NS |
Number of HbA1c measurements | 8 (2, 18) | 15 (6, 31) | <0.001 |
Retinal laser treatment (%) | 20.8% | 35.2% | <0.001 |
24 h AER (mg/24 h), mean ± SD | 119 ± 523 | ||
24 h AER (mg/24 h), median (IQR) | 9 (5, 26) | ||
nu-AER (μg/min), mean ± SD | 114 ± 698 | ||
nu-AER (μg/min), median (IQR) | 8 (3, 24) |
Characteristics were collected at the same visit as the 24-hour/nu-AER. Mean HbA1c refers to mean of all available HbA1c since the onset of diabetes. Data are given as numbers (percent), or mean ± standard deviation (SD), or medians (interquartile range [IQR]). AHT: anti-hypertensive. SBP: systolic blood pressure; DBP: diastolic blood pressure. P: difference between the replication sets; p-values > 0.05 are indicated as non-significant (NS); calculated with Welch two sample t-test for continuous variables, and Pearson’s Chi-squared test for binary variables.
Table 2.
SNP | REF | EA | r2 | 1) Meta novel replication | 2) Meta all Finnish replication | 3) Meta all Finnish | |||||
---|---|---|---|---|---|---|---|---|---|---|---|
P | β (95% CI) | P | β (95% CI) | P | β (95% CI) | EAF | PHET | ||||
rs10011025 | A | G | 0.93 | 0.0013 | 0.119 (0.047–0.192) | 0.0042 | 0.093 (0.029–0.157) | 4.25 × 10–10 | 0.147 (0.101–0.193) | 0.16 | 0.024 |
rs11725853 | G | A | 0.99 | 0.0032 | 0.100 (0.034–0.167) | 0.0053 | 0.081 (0.024–0.137) | 1.08 × 10−8 | 0.113 (0.074–0.151) | 0.20 | 0.179 |
rs12509729 | G | A | 0.92 | 0.0032 | 0.107 (0.036–0.178) | 0.0103 | 0.085 (0.020–0.149) | 3.66 × 10−8 | 0.139 (0.090–0.189) | 0.16 | 0.025 |
rs1564939 | T | C | 0.97 | 0.0033 | 0.105 (0.035–0.175) | 0.0073 | 0.082 (0.022–0.142) | 1.44 × 10−9 | 0.126 (0.085–0.166) | 0.18 | 0.032 |
Results are given for (1) meta-analysis of the two novel replication cohorts (24-hour AER and nu-AER, N = 902 and 357, respectively); (2) meta-analysis of these and the previous replication (nu-AER, N = 598; ref.7); and (3) meta-analysis of all Finnish cohorts, including the original GWAS finding (24-hour AER, N = 1925, ref.7), previous replication study, and the two current replication sets.
REF: Reference allele; EA: Effect allele; r2: imputation quality estimate. β: effect size estimate. Positive β indicates that effect allele (EA) is associated with higher AER. β is calculated for log10 transformed AER values, such that β = 0.119 indicates 1.119 fold change in AER per each additional copy of EA. 95% CI: 95% confidence interval. EAF: Mean effect allele frequency in meta-analysis. PHET: p-value for heterogeneity.
Meta-analysis with the previous results
Meta-analysis with the previous replication results in the 598 Finnish patients were significant (p < 0.05) for all four SNPs, with p = 0.0042 for rs10011025. Finally, meta-analysis of all Finnish patients improved the original GWAS results yielding a p = 4.25 × 10−10 for rs10011025 along with genome-wide significant p-values also for the three other SNPs (Table 2). Modest heterogeneity was observed in the meta-analysis (p-value for heterogeneity 0.02 for rs10011025), with the two largest sets of 24-hour AER measurements driving the signal (Fig. 1).
As the previous GWAS was imputed with the hapMap2 CEU population as the reference panel, we reanalysed the GLRA3 region in our novel GWAS, which was imputed using the denser 1000Genomes as reference panel12. Combining all subjects from the discovery and the replication steps, this novel GWAS included a total of 2864 patients with 24 hour AER, and in addition 748 patients with overnight nu-AER available, overlapping with the original report. In the meta-analysis of these two sets, the rs10011025 remained the most strongly associated SNP with a p-value of 3.29 × 10−8.
When we performed conditional analysis adjusting for rs10011025, 47 SNPs in GLRA3 remained nominally significantly associated with AER (p < 0.05), with the strongest residual association seen at the intronic rs112400253 (p = 0.0012, β [95% CI] = −0.277 [−0.445 – −0.109], minor allele frequency [MAF] = 0.024). However, as the association did not remain significant after correction for multiple testing (1192 SNPs tested), there seems to be no major associations on the locus independent of rs10011025.
Stratification by HbA1c
As the locus showed previously no evidence of association in the non-diabetic individuals, it suggests that the underlying genetic factor only affects albuminuria under diabetic conditions7,13. We therefore stratified the analysis further by HbA1c in the combined set of 2864 patients with updated GWAS data and 24-hour AER measurements available. While no association with AER was seen in the patients reaching the current recommended treatment target of an HbA1c ≤ 7.0% (N = 304, β [95% CI] = −0.017 [−0.137–0.101], p = 0.78), the association was highly significant (p = 1.7 × 10−9) in the patients with an HbA1c > 7.0% (N = 2560, β [95% CI] = 0.17 [0.117–0.230]). The effect size was similar in all the three highest quartiles (7.7% ≤ HbA1c ≤ 16.3%; Fig. 2). Interestingly, association was significant and of similar magnitude across all diabetes duration quartiles, ranging from 10 to 66 years (Supplementary Fig. S1).
Fine-mapping of the associated region with imputation
One possible explanation for the fact that the association between rs10011025 and AER was only seen in Finnish, and not in non-Finnish Europeans, could be that the variant is tagging population specific low frequency variants. As the causal variants may reside far away from the association signal14, we explored a 1 M base pair (bp) region up- and downstream of rs10011025 in the GWAS data. Indeed, there was an enrichment of nominally significant associations across all frequency categories (rare, low frequency, uncommon, common; Supplementary Fig. S2), supporting the possibility that rare/low frequency variants interact with, or underlie the association observed at the common rs10011025. We identified ten missense variants within a 1Mbp region of rs10011025, however, many of these were rare and none was associated with AER (p > 0.05; Supplementary Table S1).
Finemapping of the associated region with whole exome sequencing (WES)
As rare variants are not always well captured by GWAS imputation, we searched for low frequency and rare missense variants in the flanking region in WES data including 328 FinnDiane individuals with data on 24-hour AER, enriched for individuals with DN. As the lead variant rs10011025 is an intronic SNP on the GLRA3 gene, it was not captured by the WES. Two missense mutations were found in the GLRA3 gene in the WES data (rs144082170: NM_006529 Val449Ile, MAF = 0.018, minor allele count [MAC] = 12; and rs142149685 NM_006529 Arg368His, MAF = 0.003, MAC = 2). However, neither of them was significantly associated with 24-hour AER (p-values 0.46 and 0.81, respectively).
As the statistical power is rather low to detect association at single rare variants, we performed gene aggregate tests to detect burden of missense variants in genes near rs10011025. WES included 7 genes within 1Mbp from the rs10011025; 4 of these genes had missense variants in our WES subjects (CEP44, HPGD, ADAM29, and GLRA3). Gene aggregate test with SKAT-O suggested enrichment for missense variants in ADAM29, with three missense variants with MAF < 0.10 yielding an aggregate p-value of 0.0093 (Supplementary Table 2), significant (p < 0.05) after correction for four genes. One of these variants, Leu691 deletion (rs200852076), was nominally associated (p = 0.015) with AER also alone. Adjusting the association at GLRA3 rs10011025 for the Leu691 deletion attenuated the association (rs10011025 β = 0.145, p = 0.158 in WES patients; rs10011025 adjusted for Leu691del β = 0.042, p = 0.680; Supplementary Fig. S3), suggesting that missense variants in ADAM29 may contribute to the association observed at the GLRA3. As the Leu691 deletion is common in Finnish subjects (MAF = 0.096 in 1000 Genomes12, MAF = 0.085 in WES), but has low frequency in non-Finnish Europeans (MAF 0.010–0.028 in 1000 Genomes data12), it might also contribute to the potential population differences. However, as the Leu691 deletion is only in partial LD with rs10011025 in GLRA3 (r2 = 0.22, D′ = 0.70 in the 1000 genomes Finnish population12), it is unlikely to fully explain the observed association.
Discussion
We have previously identified genetic variants in the GLRA3 gene to be genome-wide significantly associated with AER in a Finnish GWAS discovery population (rs10011025 p = 1.5 × 10−9), but we were not able to replicate the signal in 598 additional Finnish individuals with type 1 diabetes. On the contrary, the association had a nominally significant association in the opposite direction in non-Finnish Europeans (N = 5077, p = 0.028), leaving this association vague and of uncertain relevance. In the present study including 902 + 357 new Finnish subjects with 24-hour AER and overnight nu-AER measurements available, respectively, we were able to validate the original finding (replication p = 0.0013, β [95% CI] = 0.119 [0.047–0.192]), yielding a genome-wide significant association in a meta-analysis of all Finnish patients with type 1 diabetes with GWAS and data on AER available. Furthermore, re-analysis on our updated GWAS platform that now included a total of 3612 individuals with type 1 diabetes and AER measurements available, and by using an improved imputation panel (1000 genomes samples) still suggested rs10011025 to be the lead SNP on the GLRA3 locus. Conditional analysis did not reveal any other major independent associations on the locus.
The association at rs10011025 with albuminuria seems to be specific to diabetes: No evidence of association was found in non-diabetic patients of European descent7,13. Of note, no GWAS on AER has been reported in non-diabetic Finnish subjects. When we evaluated the association stratified by HbA1c, the association was particularly strong for patients with HbA1c > 7% (p = 1.7 × 10−9), whereas no association was seen in those with an HbA1c below the currently recommended treatment target of 7%. A similar difference for p-value and effect size was seen for the lowest HbA1c quartile compared with other equally sized quartiles.
This study included sets of patients with either overnight, or 24-hour urine collection. While the combined effects in both replication and in meta-analysis were statistically significant, the association signal was driven by the 24-hour AER collections (Fig. 1). This may be a question of smaller sample size in the overnight collections; or something specific for the 24-hour collection. Importantly, the mean HbA1c was similar for individuals in the novel 24-hour and overnight AER collections, thus not explaining the difference. It is of note that the majority of the previous non-Finnish replication cohorts were based on overnight urine collections7. Interestingly, even light to moderate exercise is known to acutely increase albuminuria due to excess hemodynamic pressure15. Thus it is possible that the carriers of the rs10011025 minor variant are more sensitive to hemodynamic pressure and the effect of exercise, a phenomenon not observable in the overnight urine collected during rest.
The different results between the Finnish and the non-Finnish European subjects may also reflect population differences. Due to population isolation and multiple recent bottlenecks, the Finnish population is genetically homogenous and differs from the non-Finnish European population16,17. Even though Finns have fewer variant sites in the exomes, the variants that passed the population bottlenecks were enriched in frequency, resulting in a higher number of loss-of-function variants found in an average Finnish individual17,18.
While the emerging trans-ethnic GWAS meta-analyses suggest that the majority of GWAS associations show consistent effect across different populations, also population specific associations have been identified19,20. For example, in a trans-ethnic GWAS meta-analysis on HbA1c, the lead variant in TMEM79 was genome-wide significantly associated with HbA1c in the East-Asian population, and yielded a significant association also in the trans-ethnic meta-analysis allowing for population differences, but showed an association in the opposite direction in the European population (p = 0.0169)20. While the nominally significant association in the opposite direction may represent a false positive chance finding (both for our AER finding and for the HbA1c TMEM79 locus), the potential explanations for population specific effects include synthetic associations, i.e. that the lead locus reflects one or more (population specific) unobserved lower-frequency causal alleles with larger effects; or population specific gene – environment; or gene – gene interactions. Population specific gene – environment interactions are unlikely to explain our rs10011025 (GLRA3) finding, as the general environment and treatment can be considered similar in Finland as in other European countries. Gene – gene interactions may explain this, but these are hard to detect, especially if the variant interacting with rs10011025 (GLRA3) is a low frequency or rare variant, and not found in other ethnic groups.
To study the impact of rare variants for the association at GLRA3 rs10011025, we explored the 1Mbp region flanking rs10011025 in 328 FinnDiane patients with WES data. We identified two missense variants in the GLRA3 gene, but neither of them was significantly associated with AER in this data set; of note, the minor allele counts of these variants were very low, 12 and 2 copies, and thus, we had low power to detect association. There was a significant burden of rare missense variants associated with AER in the ADAM29 gene (SKAT-O p = 0.009), which is located only 185kbp from rs10011025. Further studies are needed to pinpoint the true causal variants behind the GWAS association signal, whether affecting ADAM29, GLRA3, or another nearby gene.
ADAM29, encoding the disintegrin and metalloproteinase domain-containing protein 29, is a transmembrane protein highly expressed in testis. In a genome-wide expression study on kidney diseases21 accessed through the NephroSeq data base22, ADAM29 was expressed at low levels both in kidney glomeruli and tubuli; moderate under-expression of ADAM29 was detected in glomeruli of individuals with other kidney diseases (Focal Segmental Glomerulosclerosis, N = 25, p = 1.2 × 10−4; lupus nephritis, N = 32, p = 1.7 × 10−4, gene rank top 8% for both analyses).
The GLRA3 gene encodes the α3 subunit of glycine receptors (GlyR), which are ligand-gated chloride channels triggered by extracellular glycine, an inhibitory neurotransmitter. In addition to their important role in the central nervous system, the GlyR have also many other functions, as reviewed by Van den Eynden et al.23. Even though the exact molecular mechanisms remain unclear, ischemia is thought to cause molecular perturbations in the GlyR channels, leading to porous defects in the plasma membranes, and eventually to cell death. Glycine can protect cells from ischemic cell death, and this cytoprotective effect has been reported in renal cells, hepatocytes, and endothelial cells23. Furthermore, the cytoprotection was attenuated after inhibition of endogenous GlyR expression by RNA interference in Madin–Darby canine kidney (MDCK) cells24. In vivo experiments in rats suggest that glycine increases the effective renal plasma flow and GFR, and decreases proximal and distal tubular sodium reabsorption, potentially through an increase in the renal interstitial hydrostatic pressure25. This might also link to the fact that the association was driven by individuals with 24-h AER collection, in which albuminuria may be elevated due to increased hemodynamic pressure induced by exercise15.
One limitation of the study is the unspecific AER lowering effect of anti-hypertensive medication, which we cannot fully account for. To take this into account, the analyses were adjusted for the use of anti-hypertensive medication; furthermore, when multiple visits were available, we chose the time point with the highest AER to minimize the effect of efficient antihypertensive treatment. Finally, when multiple AER values were measured within one year, we used the geometric mean of the values to increase stability of the values.
While some overlap may exist between the genetic factors for chronic kidney disease in the general population, and the renal complications in patients with diabetes26, our observation that this association is only seen in individuals with high blood glucose levels supports the assumption that there are genetic factors specific for DN, and that these can only be identified in diabetic individuals. However, the genetics of DN remains poorly understood. DN is a heterogeneous complication affected by both glomerular filtration rate and urinary albumin excretion due to defects in the glomerular barrier and also exaggerated tubular reabsorption of glucose and sodium27. Since many of the previous genetic findings for DN were identified for the most severe form, ESRD28, our current observation represents the first locus with a genome-wide significant association and replication for albuminuria in diabetic individuals. Even though the association was only observed in the Finnish population, the finding may improve the biological understanding and profit the diabetic individuals worldwide once the functional mechanism behind the genetic association is revealed. Indeed, more functional work is required, since despite previous pilot sequencing, imputed GWAS data, and novel WES data, we cannot yet pinpoint the culprit causal variant or variants behind the observed association.
Methods
Patients
The present study included 3612 Finnish individuals with type 1 diabetes as diagnosed by their attending physician, age at diabetes onset no more than 40 years, insulin treatment initiated within two years of the diabetes diagnosis, and data on AER and genotypes available: N = 2864 patients with 24-hour (du-)AER [mg/24 hours], and N = 748 with overnight nu-AER [µg/min]. For patients with prevalent or incident ESRD, only values before ESRD were considered. The AER phenotype definition closely followed our previous publication on the topic7. AER values were log10 transformed. For individuals with multiple study visits, the visit with the highest AER value was selected in order to reduce the potential effect of efficient treatment. If multiple AER measures of the same type (nu-AER or 24-hour AER) were available within one year of the study visit, these were combined by calculating their geometric mean.
Whole exome sequencing (WES) data were available for 479 FinnDiane patients with type 1 diabetes, of whom 240 had rapid onset of macroalbuminuria (mean time from diabetes onset to macroalbuminuria 16 ± 3 years) or ESRD (20 ± 3 years; jointly, “cases”) and 239 with long duration of diabetes (43 ± 7 years) without diabetic nephropathy (“controls”)29. 24-hour AER measurements were available for 328 of these patients, of which 212 were classified as controls, and 116 (35%) as DN cases in the original analysis (Supplementary Fig. S4).
Patients gave their informed consent. The study was approved by the ethics committee of the Hospital District of Helsinki and Uusimaa and the local ethics committees, and the reported investigations were carried out in accordance with the principles of the Declaration of Helsinki as revised in 2008.
Genotyping
A total of 6152 unique individuals with type 1 diabetes were genotyped on three batches with Illumina HumanCoreExome Bead arrays 12-1.0, 12-1.1, and 24-1.0 (Illumina, San Diego, CA, USA) at the University of Virginia. Variants were called with zCall30. After quality control to remove variants and subjects with low genotyping quality (e.g. minor allele frequency [MAF] < 0.01, genotyping rate <0.95), the variants were converted to human genome build 37 positive strand, and the batches were merged together. Genotype imputation was performed with minimac3-software31 using the 1000 Genomes phase 3 samples as the reference panel12. Within the 6019 samples passing the quality control, 3612 individuals had AER measurements and fulfilled the inclusion criteria. Imputation quality was good (r2 = 0.93) for the lead SNP rs10011025, and for the three other SNPs selected for replication in the original publication (r2 = 0.92–0.99). These four SNPs were also in Hardy-Weinberg equilibrium (p > 0.05). We further identified in the GWAS data 6690 non-monomorphic variants (i.e. MAC ≥1) with imputation quality r2 ≥0.8 and within 1 M base pairs (bp) of the SNP rs10011025, located on chromosome 4 at 175,654,223 bp.
Sequencing, quality control, variant calling and annotation of the WES data has previously been described29. Briefly, sequencing was performed at the University of Oxford on an Illumina HiSeq 2000 as part of a larger sequencing effort with other studies. An average 20-fold target capture was required for >80% of coverage. Mean sequencing depth was 54.97 for 497 included FinnDiane samples. Sequences were mapped with Burrows–Wheeler aligner v7.432, and variants were called with Genome analysis toolkit (GATK) v2.133. 188,068 polymorphic variants (MAC ≥1) remained for 479 unrelated FinnDiane individuals after quality control. Variants were annotated using CHAos (http://www.well.ox.ac.uk/~kgaulton/chaos.shtml), snpEff34 and VEP35 for functional class and transcript. Results from a meta-analysis of this and two other WES studies on DN have been previously reported29.
Statistical analyses
Genetic association analysis was performed with the rvtests software (version 20160404)36 using the score test and adjusting for duration of diabetes, age at diabetes onset, use of antihypertensive medication, sex, and kinship matrix to account for related individuals and potential population stratification in the data set. Meta-analysis was performed with metal software (version 2011-03-25) based on effect size estimates37. While -log10 transformed AER values were used in replication to allow meta-analysis with previous findings based on the effect size estimates, inversed normal transformed residuals were used for analysis of the larger GLRA3 region to optimize performance with rare variants. Variants were annotated with SNPnexus38.
For the WES data, single variant tests were performed in a similar way as for the GWAS analyses by using rvtests. To test enrichment of rare variants in genes, we used a kernel based gene aggregate method, SKAT-O, also implemented in rvtests36. SKAT-O and single marker tests were adjusted for sex and two principal components. AER was inverse normal transformed to optimize performance with rare variants. Relatives were excluded from the analyses.
Electronic supplementary material
Acknowledgements
The skilled technical assistance of Maikki Parkkonen, Mira Korolainen, Hanna Olanne, Anna-Reetta Salonen, Anna Sandelin, and Jaana Tuomikangas is gratefully acknowledged. The authors also acknowledge all the physicians and nurses at each center participating in the collection of patients (see Supplementary Table S3). This study was funded by the Folkhälsan Research Foundation, the Wilhelm and Else Stockmann Foundation, the Liv och Hälsa Society, the Novo Nordisk Foundation, the Helsinki University Hospital Research Funds (EVO), Päivikki and Sakari Sohlberg Foundation, European Foundation for the Study of Diabetes (EFSD) Young Investigator Research Award funds, and the Academy of Finland (275614 and 299200). Genotyping of the GWAS data was funded by the Juvenile Diabetes Research Foundation (JDRF) within the Diabetic Nephropathy Collaborative Research Initiative (DNCRI), with GWAS quality control and imputation performed at University of Virginia.
Author Contributions
N.S. contributed substantially to study conception and design, acquisition of data, analysis and interpretation of data, and drafted the article. J.K.H., I.T. and E.V. contributed substantially to acquisition of genetic data, and to the analysis and interpretation of data. V.H. and C.F. contributed substantially to acquisition of phenotypic data. P.H.G. contributed substantially to study conception and design and supervised the study. All authors revisited the article critically for important intellectual content and approved the final version to be published.
Data availability statement
The datasets generated and/or analyzed during the current study are not publicly available as the patients’ written consent does not allow data sharing. Data are locally available from the corresponding author on reasonable request.
Competing Interests
P-HG has received investigator research grants from Eli Lilly and Roche, lecture honoraria from AstraZeneca, Boehringer Ingelheim, Eli Lilly, Elo Water, Genzyme, MSD, Novartis, Novo Nordisk, and Sanofi. P-HG is an advisor for AbbVie, AstraZeneca, Boehringer Ingelheim, Cebix, Eli Lilly, Janssen, MSD, Novartis, Novo Nordisk and Sanofi.
Footnotes
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-018-29211-1.
References
- 1.Harjutsalo V, Forsblom C, Groop PH. Time trends in mortality in patients with type 1 diabetes: nationwide population based cohort study. BMJ. 2011;343:d5364. doi: 10.1136/bmj.d5364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Roscioni SS. Lambers Heerspink, H. J. & de Zeeuw, D. Microalbuminuria: target for renoprotective therapy PRO. Kidney Int. 2014;86:40–49. doi: 10.1038/ki.2013.490. [DOI] [PubMed] [Google Scholar]
- 3.Gansevoort RT, et al. Lower estimated GFR and higher albuminuria are associated with adverse kidney outcomes. A collaborative meta-analysis of general and high-risk population cohorts. Kidney Int. 2011;80:93–104. doi: 10.1038/ki.2010.531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Groop PH, et al. The presence and severity of chronic kidney disease predicts all-cause mortality in type 1 diabetes. Diabetes. 2009;58:1651–1658. doi: 10.2337/db08-1543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Forsblom CM, Kanninen T, Lehtovirta M, Saloranta C, Groop LC. Heritability of albumin excretion rate in families of patients with Type II diabetes. Diabetologia. 1999;42:1359–1366. doi: 10.1007/s001250051450. [DOI] [PubMed] [Google Scholar]
- 6.Langefeld CD, et al. Heritability of GFR and albuminuria in Caucasians with type 2 diabetes mellitus. American Journal of Kidney Diseases. 2004;43:796–800. doi: 10.1053/j.ajkd.2003.12.043. [DOI] [PubMed] [Google Scholar]
- 7.Sandholm N, et al. Genome-wide association study of urinary albumin excretion rate in patients with type 1 diabetes. Diabetologia. 2014;57:1143–1153. doi: 10.1007/s00125-014-3202-3. [DOI] [PubMed] [Google Scholar]
- 8.Sandholm N, et al. New susceptibility Loci associated with kidney disease in type 1 diabetes. PLoS Genet. 2012;8:e1002921. doi: 10.1371/journal.pgen.1002921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sandholm N, et al. Chromosome 2q31.1 associates with ESRD in women with type 1 diabetes. J. Am. Soc. Nephrol. 2013;24:1537–1543. doi: 10.1681/ASN.2012111122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Iyengar SK, et al. Genome-Wide Association and Trans-ethnic Meta-Analysis for Advanced Diabetic Kidney Disease: Family Investigation of Nephropathy and Diabetes (FIND) PLoS Genet. 2015;11:e1005352. doi: 10.1371/journal.pgen.1005352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Teumer A, et al. Genome-wide Association Studies Identify Genetic Loci Associated With Albuminuria in Diabetes. Diabetes. 2016;65:803–817. doi: 10.2337/db15-1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. 1000 Genomes Project Consortium et al. A global reference for human genetic variation. Nature526, 68–74 (2015). [DOI] [PMC free article] [PubMed]
- 13.Böger CA, et al. CUBN is a gene locus for albuminuria. J. Am. Soc. Nephrol. 2011;22:555–570. doi: 10.1681/ASN.2010060598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Dickson SP, Wang K, Krantz I, Hakonarson H, Goldstein DB. Rare variants create synthetic genome-wide associations. PLoS Biol. 2010;8:e1000294. doi: 10.1371/journal.pbio.1000294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Climie RE, Srikanth V, Keith LJ, Davies JE, Sharman JE. Exercise excess pressure and exercise-induced albuminuria in patients with type 2 diabetes mellitus. Am. J. Physiol. Heart Circ. Physiol. 2015;308:H1136–42. doi: 10.1152/ajpheart.00739.2014. [DOI] [PubMed] [Google Scholar]
- 16.Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature536, 285–291 (2016). [DOI] [PMC free article] [PubMed]
- 17.Lim ET, et al. Distribution and medical impact of loss-of-function variants in the Finnish founder population. PLoS Genet. 2014;10:e1004494. doi: 10.1371/journal.pgen.1004494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chheda H, et al. Whole-genome view of the consequences of a population bottleneck using 2926 genome sequences from Finland and United Kingdom. Eur. J. Hum. Genet. 2017;25:477–484. doi: 10.1038/ejhg.2016.205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium et al. Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat. Genet.46, 234–244 (2014). [DOI] [PMC free article] [PubMed]
- 20.Wheeler E, et al. Impact of common genetic determinants of Hemoglobin A1c on type 2 diabetes risk and diagnosis in ancestrally diverse populations: A transethnic genome-wide meta-analysis. PLoS Med. 2017;14:e1002383. doi: 10.1371/journal.pmed.1002383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ju W, et al. Defining cell-type specificity at the transcriptional level in human disease. Genome Res. 2013;23:1862–1873. doi: 10.1101/gr.155697.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nephroseq Research Edition. Ann Arbor, MI: University of Michigan; www.nephroseq.org. (April 2018).
- 23.den Eynden JV, et al. Glycine and glycine receptor signalling in non-neuronal cells. Front. Mol. Neurosci. 2009;2:9. doi: 10.3389/neuro.02.009.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Pan C, et al. Cytoprotection by glycine against ATP-depletion-induced injury is mediated by glycine receptor in renal cells. Biochem. J. 2005;390:447–453. doi: 10.1042/BJ20050141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Thomsen K, Nielsen CB, Flyvbjerg A. Effects of glycine on glomerular filtration rate and segmental tubular handling of sodium in conscious rats. Clin. Exp. Pharmacol. Physiol. 2002;29:449–454. doi: 10.1046/j.1440-1681.2002.03683.x. [DOI] [PubMed] [Google Scholar]
- 26.Pattaro C, et al. Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function. Nat. Commun. 2016;7:10023. doi: 10.1038/ncomms10023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Vallon V. The proximal tubule in the pathophysiology of the diabetic kidney. Am. J. Physiol. Regul. Integr. Comp. Physiol. 2011;300:R1009–22. doi: 10.1152/ajpregu.00809.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Dahlstrom E, Sandholm N. Progress in Defining the Genetic Basis of Diabetic Complications. Curr. Diab Rep. 2017;17:80-017–0906-z. doi: 10.1007/s11892-017-0906-z. [DOI] [PubMed] [Google Scholar]
- 29.Sandholm N, et al. The Genetic Landscape of Renal Complications in Type 1 Diabetes. J. Am. Soc. Nephrol. 2017;28:557–574. doi: 10.1681/ASN.2016020231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Goldstein JI, et al. zCall: a rare variant caller for array-based genotyping: genetics and population analysis. Bioinformatics. 2012;28:2543–2545. doi: 10.1093/bioinformatics/bts479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Das S, et al. Next-generation genotype imputation service and methods. Nat. Genet. 2016;48:1284–1287. doi: 10.1038/ng.3656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cingolani P, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strainw1118; iso-2; iso-3. Fly (Austin) 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.McLaren W, et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics. 2010;26:2069–2070. doi: 10.1093/bioinformatics/btq330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zhan X, Hu Y, Li B, Abecasis GR, Liu DJ. RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data. Bioinformatics. 2016;32:1423–1426. doi: 10.1093/bioinformatics/btw079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dayem Ullah AZ, Lemoine NR, Chelala C. A practical guide for the functional annotation of genetic variations using SNPnexus. Brief Bioinform. 2013;14:437–447. doi: 10.1093/bib/bbt004. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated and/or analyzed during the current study are not publicly available as the patients’ written consent does not allow data sharing. Data are locally available from the corresponding author on reasonable request.