Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2020 Dec;140(12):2380–2385. doi: 10.1016/j.jid.2020.03.970

Genome-Wide Association Study Identifies Genetic Associations with Perceived Age

Victoria Roberts 1, Barry Main 1,2, Nicholas J Timpson 3, Simon Haworth 2,3,
PMCID: PMC7685007  PMID: 32339537

Abstract

Failure of dermal protection or repair mechanisms might lead to visibly aged skin. The study aimed to identify genetic associations with perceived age. A genome-wide association study was undertaken in 423,992 adult participants of UK Biobank, using questionnaire data on perceived age and genetic data imputed to the Haplotype Reference Consortium imputation panel. The study identified 74 independently associated genetic loci, to our knowledge previously unreported (P < 5 × 10−8), which were enriched for cell signaling pathways, including the NEK6 and SMAD2 subnetworks. Common genetic variation was estimated to account for 14% of variation in perceived age, and the heritability of perceived age was partially shared with that of 75 other traits, including multiple traits representing adiposity, suggesting that perceived age may be a useful proxy trait in genetic association studies.

Abbreviations: BMI, body mass index; FDR, false discovery rate; GCT-VL, Complex Trait Genomics Virtual Lab; GWAS, genome-wide association study; LD, linkage disequilibrium; MAF, minor allele frequency; SNV, single nucleotide variant

Introduction

Skin is the interface between the internal and external environment and has functions to prevent and repair damage from exogenous factors such as UV light and bacteria (Lee et al., 2006). Studies that characterize the biological mechanisms underlying normal skin barrier and repair functions may provide insight into diseases where these mechanisms fail (Williams, 2005), such as atopy; proxy phenotypes for skin response to exogenous factors; and improve understanding of these mechanisms.

One challenge is the ability of skin to respond to UV light, which has been explored using genome-wide association studies (GWASs) for proxy phenotypes such as self-reported tanning ability (Nan et al., 2009, Visconti et al., 2018), identifying a range of associated loci. Objective or subjective measures of skin age might represent the ability of skin to respond to a wider range of environmental challenges. Studies have investigated genetic factors associated with skin age, rationalizing that failure of photoprotective or other dermal integrity mechanisms will lead to visibly aged skin (Law et al., 2017, Liu et al., 2016). The results of these studies and GWASs investigating self-reported tanning identified association at the MC1R gene, which regulates pigmentation. Other genes identified have been linked to skin and hair pigmentation, poor tanning ability, increased freckling, and skin cancers (Duffy et al., 2010, Duffy et al., 2004, Han et al., 2011, Han et al., 2008, Kita and Fraser, 2016, López et al., 2014, Sulem et al., 2007, Zhang et al., 2013). This apparent similarity in findings for objective and subjective measures of skin function is mirrored in studies where multiple measures of skin appearance and function are available in the same participants (Oyetakin-White et al., 2015).

We reason that subjective measures of perceived age may act as a proxy for underlying dermal integrity and photoprotective mechanisms. This study aims to characterize genetic associations with perceived age, focusing on understanding heritability; identify genetic loci; and understand whether genetic mechanisms regulating dermal integrity are shared with other risk factors or diseases.

Results

Participants

A GWAS was performed for perceived age. After final exclusions, analysis included 423,992 adult participants. Of these, 8,630 reported looking older than their biological age, 103,300 reported looking about their age, and 312,062 reported looking younger than their biological age. There were trends with both age and sex, where females were more likely to report looking young for their age than males, and older participants were more likely to report looking young for their age than young participants (Supplementary Table S1). As there was an imbalance of responses in these three groups, the effective statistical power of the experiment was smaller than the total sample size (see Supplementary Text S1 for an estimate of effective sample size).

Total heritable contribution

After final quality control, approximately 9.6 million single nucleotide variants (SNVs) with a minor allele frequency (MAF) of 0.1% or greater were tested for association. There was evidence for inflation in test statistics (genomic control factor = 1.49), which is typical for large studies of complex, polygenic traits and lower than that reported in recent studies of height and body mass index (BMI) (Yengo et al., 2018). Linkage disequilibrium (LD) score regression analysis estimated that 14% (SE, 0.6%) of variation in perceived age was due to the effects of common genetic variants and that polygenic heritability rather than inflationary bias was responsible for most of the inflation in genomic control factor (LD score regression ratio = 0.09).

Single variant findings

There was evidence for association at 5,395 SNVs representing 81 independent signals of association (P < 5 × 10−8). Of those, 74 represented, to our knowledge, previously unreported discoveries, whereas seven were in the region of loci previously reported for a skin appearance– or facial age–related trait (Figure 1). A subset of lead variants is presented in Table 1, and full results are provided in Supplementary Table S2.

Figure 1.

Figure 1

Manhattan plot of GWAS. Genomic regions independently meeting genome-wide significance are highlighted in magenta (to our knowledge previously unreported loci) or blue (positive controls). The red line at P = 5 × 10−8 indicates the conventional threshold for genome-wide significance. The y-axis scale is split and truncated at P = 1 × 10−250. GWAS, genome-wide association study.

Table 1.

Top 10 Independently Associated Lead Variants in GWAS

Chr SNP Position Effect Allele EAF Beta SE OR Locus P (Conditional)
9 rs520015 211762 C 0.51 0.070 0.0043 1.07 C9orf66
- DOCK8
1.2 × 10−58
8 rs10956486 130699140 T 0.68 −0.073 0.0046 0.93 GSDMC 2.6 × 10−55
3 rs61263161 126691104 G 0.83 −0.087 0.0058 0.92 CHCHD6 1.4 × 10−51
2 rs1438898 145714354 A 0.75 0.059 0.0050 1.06 AC074093.1 3.5 × 10−32
2 rs76032374 56058356 A 0.87 0.068 0.0064 1.07 EFEMP1 7.9 × 10−27
20 rs28897169 22100542 T 0.39 0.043 0.0044 1.04 LOC100270679 7.0 × 10−22
6 rs4869723 151579432 C 0.56 −0.040 0.0044 0.96 AKAP12 1.7 × 10−20
2 rs116254882 223025055 G 0.96 0.095 0.0107 1.10 PAX3 5.2 × 10−19
15 rs1550436 74221157 C 0.53 −0.039 0.0043 0.96 LOXL1 5.5 × 10−19
2 rs7590866 223087329 G 0.86 −0.054 0.0062 0.95 PAX3 2.3 × 10−18

Abbreviations: Chr, chromosome; EAF, effect allele frequency; GWAS, genome-wide association study; SNP, single nucleotide polymorphism; SNV, single nucleotide variant.

Each row contains a lead SNV representing an independent signal of association following a stepwise selection procedure with P < 5.0 × 10−8 after conditioning on previously reported signals of association. The position column contains genomic position based on build 37 (GRCh37.p13) of the human genome. The beta coefficient and accompanying SE are on a log-odds scale and have been exponentiated to provide an odds ratio for reference. ORs > 1 reflect increased odds of appearing young. The locus column includes the name of the gene nearest the lead SNV. Results for all independent lead variants are provided in Supplementary Table S2 with both conditional and unconditional P-values. Full results for all variants are provided as a link in the data access statement.

Of these genetic loci, the strongest statistical evidence was seen at C9orf66-DOCK8 with a lead signal carried by rs520015_C (effect allele frequency, 0.51; OR, 1.07; P = 1.0 × 10−58). Single nucleotide polymorphisms were mapped to genes using positional mapping tools in FUMA (Watanabe et al., 2017). Rs520015 was annotated as an intergenic variant in LD with a missense variant within C9orf66 (rs481905, r2 = 0.81) and multiple intronic variants within DOCK8 (e.g., rs2484966, r2 = 0.88).

Of the genetic loci discovered, the largest effect size of a minor allele was estimated for rs139356332_G (effect allele frequency, 0.98; OR, 0.90; P = 8.1 × 10−13), an uncommon intronic variant within MFAP4. This gene encodes an extracellular glycoprotein that is thought to contribute to the organization of elastin fiber components in the extracellular matrix (Pilecki et al., 2016).

The study replicated evidence for association at seven previously studied loci, with the strongest evidence at rs12203592_C, a common intronic variant within IRF4 with large effects on odds of appearing youthful (effect allele frequency, 0.78; OR, 1.22; P = 1.2 × 10−327). This locus has been reported for skin age–related traits, including pigmented spot severity, perceived skin aging, tanning ability, risk of sunburn, and tanning response to sun (Jacobs et al., 2015, Law et al., 2017, Visconti et al., 2018, Zhang et al., 2013).

In silico transcriptome-wide association analysis and fine mapping

The predicted expression of 25,812 gene transcripts was tested for association with perceived facial age. A total of 175 gene transcripts passed a Bonferroni-corrected P-value threshold (P < 1.9 × 10−6). There was inflation in these results (Supplementary Figure S1), which might represent polygenic association signal but could also be related to correlation in the predicted expression of adjacent genes seen using the S-PrediXcan method (Wainberg et al., 2019). There was high concordance with the results of single variant analysis, and nearly all transcripts were in genomic loci already highlighted by the single variant results. The strongest evidence for association was IRF4, where higher transcription was predicted to associate with lower odds of looking youthful (P = 1.6 × 10−77) (Supplementary Table S3). The FOCUS method (Mancuso et al., 2019) for probabilistic fine mapping was applied to help resolve correlation in adjacent predictions and nominate a credible set of biologically causal genes underlying this association signal. This was able to resolve some association signals with a high degree of confidence; for example, an association signal on chromosome 2 represented by rs1438898 in single variant results that mapped to a uncharacterized transcript AC074093.1 using positional mapping was mapped to ZEB2 with a high probability using fine mapping. Conversely, the fine mapping approach was unable to produce stable models for associated loci on chromosome 6, and results from this chromosome are not presented. The single gene with the highest posterior probability estimate for each locus is reported in Supplementary Table S4, and full results are included in Supplementary Table S5.

Enrichment in gene sets and tissue expression

Enrichment analysis was performed using DEPICT (Pers et al., 2015) implemented in the Complex Trait Genomics Virtual Lab (GCT-VL) (G Cuéllar-Partida, unpublished data). Analysis identified enrichment (false discovery rate [FDR] ≤ 0.05) in 23 predefined sets of functionally related genes, with the strongest enrichment signal seen for the NEK6 subnetwork (P = 3.2 × 10−7) (Supplementary Table S6). Enrichment in tissue-specific transcription in 209 tissues was tested but did not identify any enrichment beyond chance (PFDR > 0.05 for all tissues) (Supplementary Table S7).

Shared heritability and genetic causality proportions

Genetic correlation with 1,362 traits was estimated, of which 75 traits had evidence for shared heritability with perceived age after a Benjamini-Hochberg correction for multiple testing (PFDR < 0.05). All genetic correlation estimates lay within the range −0.5 to +0.5, suggesting that the genetic determinants of perceived age are not fully captured by other traits in the GCT-VL (G Cuéllar-Partida, unpublished data) catalog.

The strongest evidence for genetic correlation was seen with obesity-related traits, where genotypes that associated with greater adiposity overlapped with genotypes associated with reduced odds of appearing youthful (e.g., BMI: genetic correlation = –0.25, PFDR = 2.9 × 10−12 and waist circumference: genetic correlation = –0.23, PFDR = 2.3 × 10–13) (Supplementary Table S8).

These correlations might be due to shared genetic influences on perceived age but may be due to vertically pleiotropic pathway effects where one trait exerts a causal effect on the other. To investigate this further, genetic causality proportions (O’Connor and Price, 2018) were estimated for all pairs of traits with a significant genetic correlation using an online platform (S Haworth, unpublished data). The single trait with the strongest evidence for causal effect was BMI. The effects of greater BMI on reduced odds of appearing youthful were modeled to explain part of the genetic correlation with BMI (genetic causality proportion estimate = –0.64); however, neither this finding (PFDR = 0.07) nor any other causality proportions were considered significant after correction for multiple testing (Supplementary Table S9).

Discussion

This study used genome-wide analysis to investigate genetic contributions to perceived age using the rationale that failure of dermal repair mechanisms would lead to visibly aged skin. There was evidence for a polygenic heritable contribution to youthful appearance, and single variant analysis identified 74 loci to our knowledge previously unreported. These loci were enriched for gene sets encoding a range of regulatory networks, supporting the idea that a range of different biological processes are implicated in maintaining a youthful appearance. The NEK6 subnetwork, identified in gene set enrichment analysis, helps govern the initiation of mitosis and progression through the cell cycle and prevents cell senescence (Jee et al., 2010). We hypothesize that natural variation in genes encoding the NEK6 subnetwork leads to variation in the ability to compensate for age-related decline in tissues, resulting in variation in signs of aging. This is supported by enrichment in other regulatory networks with roles in growth signaling, including the SMAD2, SMAD4, and SMAD9 subnetworks. This mechanism is likely one of several diverse pathways that contribute to maintaining a youthful appearance; for example, we also observed enrichment in lipid-mediated signaling.

Although perceived age may capture variation in skin biology and response to environmental challenges, it is likely to also capture other biological factors such as nutritional status. To help investigate the degree of overlap with other traits and biological specificity of the phenotype, we estimated genetic correlations with traits and diseases in a hypothesis-free manner and identified genetic correlations with 75 other traits or diseases. These correlations were modest in magnitude, suggesting that perceived age provides a proxy for an underlying phenotype which has not been extensively explored yet by other GWASs. We followed up these genetic correlations to explore whether there are causal relationships (in either direction) between perceived age and genetically correlated traits but did not find strong evidence supporting this. This may reflect the limited statistical power of this follow-up analysis (the genetic correlations were of modest magnitude) or that the genetic overlap between perceived age is due predominantly to biological processes that have underlying relevance for many traits rather than causal pathways between these phenotypes.

Aside from capturing the response to environmental stressors, the analysis may also capture genetic associations with stressors. Factors such as smoking and UV exposure related to geographical location were traditionally considered to be purely environmental and therefore uncorrelated with genotype. However, an increasing body of evidence now points to the heritability of the home environment (Kong et al., 2018) and evidence for correlation between genetic data and both socioeconomic conditions (Tyrrell et al., 2017) and latitude (Haworth et al., 2019) in UK Biobank. Despite the reasons for caution, there was little evidence for inflationary bias in the primary results. The lead single variants show good concordance with previously published findings and appear relevant to dermal protection functions, suggesting that the results of the study primarily capture host susceptibility and response to pro-aging stimuli rather than host liability to be exposed to those stimuli.

The existence of genetic predictors of skin function and likely aging trajectory provides opportunities for research and clinical applications. The results may help prioritize relevant biology for a detailed molecular study of photoprotective mechanisms and nominate proteins whose function, if modulated by cosmetic or pharmacological products, might enhance photoprotection. Statistical power for investigations into longitudinal mechanics of skin aging might be boosted by recruiting based on genotype participants who are at greatest risk of accelerated aging. In the longer term, integration of insights from population-level and individual genetic information may pave the way to precision skincare.

Aside from the conceptual limitations of complexity using perceived age as a proxy for skin traits, one practical limitation of this investigation is the use of categorical data, which is cruder than previous approaches such as using a panel of volunteers to guess the age of a participant and compare that with actual age to generate a continuous measure (Liu et al., 2016). As the phenotype used here was self-reported and subjective, there will be some degree of misclassification. We modeled the likely impact of this on statistical power and FDR using simulations. These showed that misclassification in this study likely affected the statistical power and led to some degree of underestimation of effect size at truly associated SNVs but would not lead to false positive associations (Supplementary Text S2, Supplementary Figure S2). Despite these limitations, the study identified association at previously reported positive controls such as variants within IRF4, MC1R, and BNC2 with high levels of statistical evidence. This indicates that the large sample size was sufficient to overcome regression dilution bias introduced by misclassification. Although we believe that the properties of the phenotypic assessment in this study would lead to underreporting of association signals, we have not undertaken replication in an independent sample, which is a limitation of the study.

In conclusion, apparent age is a partially heritable trait. The polygenic association signal and results of gene set analysis suggest that diverse mechanisms act to preserve a youthful appearance. Biological and functional characterization of the single variant association signals identified in this study may be a useful way to gain improved understanding of skin biology or a step toward interventions that moderate the rate of age-related skin changes.

Materials and Methods

Participants and phenotypes

This study used data from UK Biobank, a project which recruited approximately 500,000 participants aged between 40–69 between 2006 and 2010 (Fry et al., 2017). Eligible participants were identified from health records in the UK National Health Service and invited to participate in one of 22 assessment centers, which were in densely populated regions of Great Britain. Participants took part in a baseline assessment including completion of questionnaires, physical measurement, donation of biological samples, and consent for subsequent follow-up via linkage to National Health Service records. In the questionnaire, participants were asked to respond to the question “Do people say that you look…?” The possible answers were “Younger than you are,” “Older than you are,” “About your age,” “Do not know,” or “Prefer not to answer.” For this analysis, participants were coded 1 if they reported that they looked younger, 0 if they reported that they looked older, and 0.5 if they reported that they looked their age. Participants who did not know or preferred not to answer the question were excluded from analysis.

Genotypes

Genotype data was generated using one of two genotyping arrays, the UK BiLEVE Axiom array and the UK Biobank Axiom array. Quality control and imputation were undertaken centrally by UK Biobank as described previously (C Bycroft, unpublished data). Following imputation, in-house quality control was undertaken to remove participants with poor quality data and to restrict analysis to participants of European ancestry, following a published protocol (Mitchell et al., 2017). Genotype data was filtered to a high-confidence set of SNVs by removing monomorphic or rare variants with MAF < 0.1%, removing structural variation such as insertion-deletions, removing sites with poor imputation quality using a graded filter (minimum INFO score > 0.3 for MAF > 3%, INFO > 0.6 for MAF 1–3%, INFO > 0.8 for MAF 0.5–1%, and INFO > 0.9 for MAF 0.1–0.5%) and removing sites not in the Haplotype Reference Consortium imputation panel (Mitchell et al., 2017).

Genome-wide association analysis

Genome-wide analysis was performed using a linear mixed model approach implemented in BOLT-LMM (Loh et al., 2018). This tests the relationship between genotype and phenotype while accounting for covariates (age, sex, and study participation center) and relatedness, following a published protocol (Elsworth et al., 2017). This approach was chosen because the linear mixed model approach is reported to achieve good control for potential confounding owing to population stratification in the UK Biobank sample (Loh et al., 2018). Genome-wide summary statistics on a linear scale were transformed into log ORs using a Taylor expansion series. ORs > 1 indicate greater odds of looking youthful.

SNV selection procedure and conditional analysis

SNVs with P < 5 × 10−8 were considered associated with perceived age, chosen as a threshold for genome-wide significance. Nearby SNVs are typically correlated through LD, meaning that genetic effect sizes and P-values of nearby SNVs are also correlated. Lead SNVs were defined after reducing the association signals down to a subset of approximately independent signals of association within single variant results using a stepwise model selection procedure implemented in GCTA (v 1.91.4) (Yang et al., 2012, Yang et al., 2011), which takes into account LD between different SNVs to select independently associated SNVs (--cojo-slct function). This subset of approximately independent signals was tested against previously reported association signals in an approximate conditional analysis (--cojo-cond function in GCTA) to identify which signals capture previously reported associations, using a list of SNVs reported for perceived facial age or related traits (Supplementary Table S9). SNVs that were conditionally independent of previously reported signals of association with P < 5 × 10−8 in conditional analysis were defined as lead signals and are reported in Supplementary Table S9.

Enrichment analysis

To test for enrichment in predefined gene sets or gene pathways, enrichment analysis was performed using the DEPICT approach (Pers et al., 2015) implemented in GCT-VL. Analysis used full genome-wide results, and associated loci were defined internally by DEPICT using a reference panel for LD estimation.

Estimation of heritability

To estimate variation in perceived facial age attributable to common genetic variants, heritability was estimated using univariate LD score regression (Bulik-Sullivan et al., 2015b) implemented in LD-Hub, an automated online resource (Zheng et al., 2017). Summary statistics of the GWAS were uploaded and results processed through a standardized procedure. This uses a subset of approximately 1 million common variants and reference LD data to estimate heritability attributable to common genetic variants (h2_LDSR) and assess for inflationary bias in GWAS results.

Estimation of genetic correlation and partial genetic causality

Genetic correlation was assessed against 1,362 traits in the GCT-VL catalog (G Cuéllar-Partida, unpublished data) using bivariate LD score regression (Bulik-Sullivan et al., 2015a). Genetic correlation summarizes the similarity in the heritable contribution to a pair of traits assessed across the whole genome; values near 1 or −1 indicate that two traits have substantial shared genetic associations, which have consistent and proportionate effects on both traits. Values near 0 indicate largely independent genetic determinants with little overlapping heritability between the two traits. Adjustment for multiple testing used a Benjamini-Hochberg procedure and correlations with FDR < 0.05 were reported.

For each trait with a detectable non-zero genetic correlation, latent causal variable models (O’Connor and Price, 2018) were fitted to help distinguish between genetic correlations resulting from horizontally pleiotropic genetic effects and genetic correlations resulting from causal relationships. Models were fitted using an automated online pipeline (S Haworth, unpublished data) implemented in the GCT-VL platform (https://genoma.io).

Imputed transcriptome-wide association study and transcriptome-informed fine mapping

To test the consequences of a range of gene transcripts on perceived facial age, tests for association with predicted gene expression were performed using S-PrediXcan (Barbeira et al., 2018). This assesses the mediating effects of expression levels on phenotypes by imputing transcriptome levels using pretrained models derived in datasets with measured gene expression. Analysis was performed using prefitted elastic net prediction models of gene expression levels in the 48 Genotype-Tissue Expression tissues (GTEx Consortium et al., 2013), which are available online (http://predictdb.org/). Summary results from GWAS were uploaded to the S-PrediXcan (Barbeira et al., 2018) web pipeline (https://cloud.hakyimlab.org/). Results from 48 tissues were combined using the TissueXcan method (Barbeira et al., 2019), which prioritizes the most relevant tissue transcripts overall, taking into account evidence from multiple tissue-specific predictions while accounting for correlation in gene transcription between different tissues and multiple testing. In parallel, analysis using the FOCUS method (Mancuso et al., 2019) was performed using the standalone python software provided by the authors of the method at https://github.com/bogdanlab/focus/blob/master/README.md. It references LD data from the 1,000 Genomes Project samples (European ancestry) and reference transcription data from the precompiled database, including data from multiple sources described at https://github.com/bogdanlab/focus/wiki.

Data availability statement

Full results of analysis are provided as supplementary datasets. Underlying source data are available through UK Biobank, an open-access resource for healthcare research. Data access procedures are described at http://www.ukbiobank.ac.uk/wp-content/uploads/2012/09/Access-Procedures-2011-1.pdf. Genome-wide summary statistics have been uploaded to the University of Bristol data repository, data.bris, and are publicly available at https://doi.org/10.5523/bris.21crwsnj4xwjm2g4qi8chathha.

Ethics Statements

Participants in UK Biobank gave written informed consent and a copy of the consent form is published online at http://www.ukbiobank.ac.uk/wp-content/uploads/2011/06/Consent_form.pdf. The UK Biobank study received ethical approval from the North West Multi-centre Research Ethics Committee (MREC), which covers the UK, as described at (http://www.ukbiobank.ac.uk/the-ethics-and-governance-council/).

ORCIDs

Victoria Roberts: https://orcid.org/0000-0001-7540-1579

Barry Main: https://orcid.org/0000-0003-0622-805X

Nicholas J. Timpson: https://orcid.org/0000-0002-7141-9189

Simon Haworth: https://orcid.org/0000-0001-7793-7326

Conflict of Interest

The authors state no conflict of interest.

Acknowledgments

This research has been conducted using the UK Biobank Resource under Application Number 40644. BM is funded by a National Institute for Health Research Clinical Lectureship. NJT is a Wellcome Trust Investigator (202802/Z/16/Z), is the principal investigator of the Avon Longitudinal Study of Parents and Children (MRC & WT 102215/2/13/2), is supported by the University of Bristol National Institute for Health Research Biomedical Research Centre (BRC-1215-20011) and the MRC Integrative Epidemiology Unit (MC_UU_12013/3), and works within the CRUK Integrative Cancer Epidemiology Programme (C18281/A19169). SH is funded by a National Institute for Health Research Academic Clinical Fellowship.

Author Contributions

Conceptualization: VR, BM, NT, SH; Formal Analysis: VR, SH; Investigation: VR, BM, NT, SH; Supervision: BM, NT, SH; Visualization: VR, SH; Writing - Original Draft Preparation: VR; Writing - Review and Editing: VR, BM, NT, SH

accepted manuscript published online 24 April 2020; corrected proof published online 22 May 2020

Footnotes

Supplementary material is linked to the online version of the paper at www.jidonline.org, and at https://doi.org/10.1016/j.jid.2020.03.970.

Supplementary Material

Supplementary Material
mmc1.pdf (1.2MB, pdf)
Supplementary Data
mmc2.xlsx (8.5MB, xlsx)

References

  1. Barbeira A.N., Dickinson S.P., Bonazzola R., Zheng J., Wheeler H.E., Torres J.M. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9:1825. doi: 10.1038/s41467-018-03621-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barbeira A.N., Pividori M., Zheng J., Wheeler H.E., Nicolae D.L., Im H.K. Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet. 2019;15 doi: 10.1371/journal.pgen.1007889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bulik-Sullivan B., Finucane H.K., Anttila V., Gusev A., Day F.R., Loh P.R. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47:1236–1241. doi: 10.1038/ng.3406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bulik-Sullivan B.K., Loh P.R., Finucane H.K., Ripke S., Yang J., Schizophrenia Working Group of the Psychiatric Genomics Consortium LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47:291–295. doi: 10.1038/ng.3211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Duffy D.L., Box N.F., Chen W., Palmer J.S., Montgomery G.W., James M.R. Interactive effects of MC1R and OCA2 on melanoma risk phenotypes. Hum Mol Genet. 2004;13:447–461. doi: 10.1093/hmg/ddh043. [DOI] [PubMed] [Google Scholar]
  6. Duffy D.L., Iles M.M., Glass D., Zhu G., Barrett J.H., Höiom V. IRF4 variants have age-specific effects on nevus count and predispose to melanoma. Am J Hum Genet. 2010;87:6–16. doi: 10.1016/j.ajhg.2010.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Elsworth B.L., Mitchell R., Raistrick C.A., Paternoster L., Hemani G., Gaunt T.R. MRC IEU UK Biobank GWAS pipeline version 1. 2017. https://data.bris.ac.uk/data/dataset/2fahpksont1zi26xosyamqo8rr (accessed 11 May 2020)
  8. Fry A., Littlejohns T.J., Sudlow C., Doherty N., Adamska L., Sprosen T. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am J Epidemiol. 2017;186:1026–1034. doi: 10.1093/aje/kwx246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. GTEx Consortium The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Han J., Kraft P., Nan H., Guo Q., Chen C., Qureshi A. A genome-wide association study identifies novel alleles associated with hair color and skin pigmentation. PLoS Genet. 2008;4 doi: 10.1371/journal.pgen.1000074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Han J., Qureshi A.A., Nan H., Zhang J., Song Y., Guo Q. A germline variant in the interferon regulatory factor 4 gene as a novel skin cancer risk locus. Cancer Res. 2011;71:1533–1539. doi: 10.1158/0008-5472.CAN-10-1818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Haworth S., Mitchell R., Corbin L., Wade K.H., Dudding T., Budu-Aggrey A. Apparent latent structure within the UK Biobank sample has implications for epidemiological analysis. Nat Commun. 2019;10:333. doi: 10.1038/s41467-018-08219-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Jacobs L.C., Hamer M.A., Gunn D.A., Deelen J., Lall J.S., van Heemst D. A genome-wide association study identifies the skin color genes IRF4, MC1R, ASIP, and BNC2 influencing facial pigmented Spots. J Invest Dermatol. 2015;135:1735–1742. doi: 10.1038/jid.2015.62. [DOI] [PubMed] [Google Scholar]
  14. Jee H.J., Kim A.J., Song N., Kim H.J., Kim M., Koh H. Nek6 overexpression antagonizes p53-induced senescence in human cancer cells. Cell Cycle. 2010;9:4703–4710. doi: 10.4161/cc.9.23.14059. [DOI] [PubMed] [Google Scholar]
  15. Kita R., Fraser H.B. Local adaptation of sun-exposure-dependent gene expression regulation in human skin. PLoS Genet. 2016;12 doi: 10.1371/journal.pgen.1006382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Kong A., Thorleifsson G., Frigge M.L., Vilhjalmsson B.J., Young A.I., Thorgeirsson T.E. The nature of nurture: effects of parental genotypes. Science. 2018;359:424–428. doi: 10.1126/science.aan6877. [DOI] [PubMed] [Google Scholar]
  17. Law M.H., Medland S.E., Zhu G., Yazar S., Viñuela A., Wallace L. Genome-wide association shows that pigmentation genes play a role in skin aging. J Invest Dermatol. 2017;137:1887–1894. doi: 10.1016/j.jid.2017.04.026. [DOI] [PubMed] [Google Scholar]
  18. Lee S.H., Jeong S.K., Ahn S.K. An update of the defensive barrier function of skin. Yonsei Med J. 2006;47:293–306. doi: 10.3349/ymj.2006.47.3.293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Liu F., Hamer M.A., Deelen J., Lall J.S., Jacobs L., van Heemst D. The MC1R gene and youthful looks. Curr Biol. 2016;26:1213–1220. doi: 10.1016/j.cub.2016.03.008. [DOI] [PubMed] [Google Scholar]
  20. Loh P.R., Kichaev G., Gazal S., Schoech A.P., Price A.L. Mixed-model association for biobank-scale datasets. Nat Genet. 2018;50:906–908. doi: 10.1038/s41588-018-0144-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. López S., García O., Yurrebaso I., Flores C., Acosta-Herrera M., Chen H. The interplay between natural selection and susceptibility to melanoma on allele 374F of SLC45A2 gene in a South European population. PLoS One. 2014;9 doi: 10.1371/journal.pone.0104367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Mancuso N., Freund M.K., Johnson R., Shi H., Kichaev G., Gusev A. Probabilistic fine-mapping of transcriptome-wide association studies. Nat Genet. 2019;51:675–682. doi: 10.1038/s41588-019-0367-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Mitchell R., Hemani G., Dudding T., Paternoster L. UK Biobank genetic data: MRC-IEU quality control, Version 1. 2017. https://data.bris.ac.uk/data/dataset/2fahpksont1zi26xosyamqo8rr (accessed 11 May 2020)
  24. Nan H., Kraft P., Qureshi A.A., Guo Q., Chen C., Hankinson S.E. Genome-wide association study of tanning phenotype in a population of European ancestry. J Invest Dermatol. 2009;129:2250–2257. doi: 10.1038/jid.2009.62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. O’Connor L.J., Price A.L. Distinguishing genetic correlation from causation across 52 diseases and complex traits. Nat Genet. 2018;50:1728–1734. doi: 10.1038/s41588-018-0255-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Oyetakin-White P., Suggs A., Koo B., Matsui M.S., Yarosh D., Cooper K.D. Does poor sleep quality affect skin ageing? Clin Exp Dermatol. 2015;40:17–22. doi: 10.1111/ced.12455. [DOI] [PubMed] [Google Scholar]
  27. Pers T.H., Karjalainen J.M., Chan Y., Westra H.J., Wood A.R., Yang J. Biological interpretation of genome-wide association studies using predicted gene functions. Nat Commun. 2015;6:5890. doi: 10.1038/ncomms6890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Pilecki B., Holm A.T., Schlosser A., Moeller J.B., Wohl A.P., Zuk A.V. Characterization of microfibrillar-associated protein 4 (MFAP4) as a tropoelastin- and fibrillin-binding protein involved in elastic fiber formation. J Biol Chem. 2016;291:1103–1114. doi: 10.1074/jbc.M115.681775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Sulem P., Gudbjartsson D.F., Stacey S.N., Helgason A., Rafnar T., Magnusson K.P. Genetic determinants of hair, eye and skin pigmentation in Europeans. Nat Genet. 2007;39:1443–1452. doi: 10.1038/ng.2007.13. [DOI] [PubMed] [Google Scholar]
  30. Tyrrell J., Wood A.R., Ames R.M., Yaghootkar H., Beaumont R.N., Jones S.E. Gene-obesogenic environment interactions in the UK Biobank study. Int J Epidemiol. 2017;46:559–575. doi: 10.1093/ije/dyw337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Visconti A., Duffy D.L., Liu F., Zhu G., Wu W., Chen Y. Genome-wide association study in 176,678 Europeans reveals genetic loci for tanning response to sun exposure. Nat Commun. 2018;9:1684. doi: 10.1038/s41467-018-04086-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Wainberg M., Sinnott-Armstrong N., Mancuso N., Barbeira A.N., Knowles D.A., Golan D. Opportunities and challenges for transcriptome-wide association studies. Nat Genet. 2019;51:592–599. doi: 10.1038/s41588-019-0385-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Watanabe K., Taskesen E., van Bochoven A., Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8:1826. doi: 10.1038/s41467-017-01261-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Williams H.C. Clinical practice. Atopic dermatitis. N Engl J Med. 2005;352:2314–2324. doi: 10.1056/NEJMcp042803. [DOI] [PubMed] [Google Scholar]
  35. Yang J., Ferreira T., Morris A.P., Medland S.E., Genetic Investigation of ANthropometric Traits (GIANT) Consortium, DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44:369–375. doi: 10.1038/ng.2213. [S1–3] [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Yang J.A., Lee S.H., Goddard M.E., Visscher P.M. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Yengo L., Sidorenko J., Kemper K.E., Zheng Z., Wood A.R., Weedon M.N. Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum Mol Genet. 2018;27:3641–3649. doi: 10.1093/hmg/ddy271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Zhang M., Song F., Liang L., Nan H., Zhang J., Liu H. Genome-wide association studies identify several new loci associated with pigmentation traits and skin cancer risk in European Americans. Hum Mol Genet. 2013;22:2948–2959. doi: 10.1093/hmg/ddt142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Zheng J., Erzurumluoglu A.M., Elsworth B.L., Kemp J.P., Howe L., Haycock P.C. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics. 2017;33:272–279. doi: 10.1093/bioinformatics/btw613. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material
mmc1.pdf (1.2MB, pdf)
Supplementary Data
mmc2.xlsx (8.5MB, xlsx)

Data Availability Statement

Full results of analysis are provided as supplementary datasets. Underlying source data are available through UK Biobank, an open-access resource for healthcare research. Data access procedures are described at http://www.ukbiobank.ac.uk/wp-content/uploads/2012/09/Access-Procedures-2011-1.pdf. Genome-wide summary statistics have been uploaded to the University of Bristol data repository, data.bris, and are publicly available at https://doi.org/10.5523/bris.21crwsnj4xwjm2g4qi8chathha.

RESOURCES