Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Feb 1.
Published in final edited form as: Am J Transplant. 2016 Oct 12;17(2):318–319. doi: 10.1111/ajt.14051

Validation is Critical for GWAS-based Associations

William S Oetting 1,*, Pamala A Jacobson 1, Ajay K Israni 2,3,4
PMCID: PMC5266629  NIHMSID: NIHMS818700  PMID: 27640398

One goal of a genome wide association study (GWAS) is to associate an allele, or multiple alleles, of a single nucleotide polymorphism (SNP) to a clinical outcome or phenotype. The hope is to identify risk alleles in individuals to better predict their genetic predisposition to that clinical outcome and individualize treatment to decrease this risk. To this end there have been many successful associations which are documented at the National Human Genome Research Institute-European Bioinformatics Institute (NHGRI-EBI) catalog of published genome-wide association studies (http://www.ebi.ac.uk/gwas/). Unfortunately there have been many more GWAS based results that have been later found to be false positives. This occurs with far too great a frequency and has even been given its own designation for initial GWAS associations later found to be incorrect: The “Winners Curse”, due in part to an overestimation of the genetic effect of a variant (1). This appears to be the case in the report by Philstrøm et al. entitled “Single nucleotide polymorphisms and long term clinical outcome in renal transplant patients. A validation study,” where two SNPs associated with 5-year creatinine and graft survival in kidney recipients could not be validated in an independent cohort (2).

In what was claimed to be the first GWAS using kidney transplant recipients, O’Brien and colleagues reported the association of two SNPs from approximately 511K SNPs (after data cleaning) and DNA from 326 kidney transplant recipients (3). The first outcome reported was 5-year creatinine levels. The SNP rs6565887, with a p-value of 4.04e−8, was reported to explain 8.8% of the variance of 5-year creatinine levels and rs3811321, with a p-value of 7.63e−8, was reported to explain 11.3% of the variance. For this analysis, 63 recipients needed to be dropped due to graft failure before the 5-year time point, leaving 263 recipients with 5-year creatinine levels to be analyzed. Both variants were also found to be predictors of 10-year graft survival. The ‘A’ allele of rs3811321 was associated with increased graft survival (70% vs. 30%, p = 0.004 as was the ‘A’ allele of rs6565887 (70% vs. 45%, p = 0.025).

In this issue, Philstrøm and colleagues attempted to validate these findings for death-censored graft loss and all-cause mortality. Their attempt, using a cohort of over 1,600 recipients, was unsuccessful for both outcomes. A lack of association was found for both SNPs using either univariate or multivariate Cox regression analysis. Additionally, mean serum creatinine levels (mean = 4.6 years after transplantation) were also not found to be significantly associated with these two SNPs in recipients as reported in the initial publication.

The fact that initially reported SNPs in a GWAS failed validation is not new. In response to this common occurrence individuals may be unconcerned believing that erroneous results will eventually be discarded through subsequent studies (4). It is this view that must be must be discarded. Pashler and Harris correctly stated “Unfortunately, however there is every reason to believe that the great majority of errors that do enter the literature will persist uncorrected indefinitely, given current practices. Errors will be propagated through textbooks and review articles, and people interested in a topic will be misinformed for generations.”(4) At least 13 publications have so far cited the results of O’Brien et al., though not all reported specific data from the original publication. For those that did report specific results, their statements ranged from acceptance of the results as fact to stating the need for validation in an independent cohort. None stated the possibility of the study results (associations) being false positives or the low statistical power of the study. O’Brien et al. reported that they had only 80% power to detect an effect size of 14%. The authors stated that replication of their study was needed and would require a larger cohort but this was not related when their results were cited in later publications. Citing results from a GWAS, especially when no validation of those results is being offered, should include a critical evaluation of those results including disclosure of the power of the original study to detect the association. The reporting of erroneous results will continue to occur and is the nature of scientific discovery, but continued citation of false positive results, especially in the absence of a full appraisal of these results, can turn fiction into fact.

The publication by Philstrøm et al., correctly points to the need for greater collaboration among transplant researchers to conduct meta-analysis and cross-validation of GWAS data. Though there are no formal criteria for the validation of GWAS results, typically such a study attempts to validate candidate variants utilizing an independent cohort and similar phenotypes resulting in a statistically significant association (p-value < 0.05 for a single SNP) with a similar effect size and direction. To provide for these types of studies, investigators in transplantation have formed the International Genetics & Translational Research in Transplantation Network (iGeneTRAiN) Consortium and have invited other investigators to join their efforts (5). Through this type of effort, we can have greater confidence in the outcomes related to GWAS produced associations in transplantation research.

Footnotes

Disclosure

The authors of this manuscript have no conflicts of interest to disclose as described by the American Journal of Transplantation.

References

  • 1.Zöllner S, Pritchard JK. Overcoming the winner’s curse: Estimating penetrance parameters from case-control data. Am J Hum Genet. 2007;80:605–615. doi: 10.1086/512821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Philstrøm H, Mjøen G, Mucha S, et al. Single nucleotide polymorphisms and long term clinical outcome in renal transplant patients. A validation study. Am J Transplant. 2016 doi: 10.1111/ajt.13995. IN PRESS-THIS ISSUE. [DOI] [PubMed] [Google Scholar]
  • 3.O'Brien RP, Phelan PJ, Conroy J, et al. A genome-wide association study of recipient genotype and medium-term kidney allograft function. Clin Transplant. 2013;27:379–387. doi: 10.1111/ctr.12093. [DOI] [PubMed] [Google Scholar]
  • 4.Pashler H, Harris CR. Is the replicability crisis overblown? Three arguments examined. Persp Psychol Sci. 2012;7:531–536. doi: 10.1177/1745691612463401. [DOI] [PubMed] [Google Scholar]
  • 5.International Genetics & Translational Research in Transplantation Network (iGeneTRAiN) Design and implementation of the International Genetics & Translational Research in Transplantation Network (iGeneTRAiN) Transplant. 2015;99:2401–2412. doi: 10.1097/TP.0000000000000913. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES