Skip to main content
American Journal of Human Genetics logoLink to American Journal of Human Genetics
. 2020 Feb 27;106(3):389–404. doi: 10.1016/j.ajhg.2020.02.006

Genome-wide Association Analysis in Humans Links Nucleotide Metabolism to Leukocyte Telomere Length

Chen Li 1,3,85, Svetlana Stoma 2,3,85, Luca A Lotta 1,85, Sophie Warner 2,85, Eva Albrecht 4, Alessandra Allione 5,6, Pascal P Arp 7, Linda Broer 7, Jessica L Buxton 8,9, Alexessander Da Silva Couto Alves 10,11, Joris Deelen 12,13, Iryna O Fedko 14, Scott D Gordon 15, Tao Jiang 16, Robert Karlsson 17, Nicola Kerrison 1, Taylor K Loe 18, Massimo Mangino 19,20, Yuri Milaneschi 21, Benjamin Miraglio 22, Natalia Pervjakova 23, Alessia Russo 5,6, Ida Surakka 22,24, Ashley van der Spek 25, Josine E Verhoeven 21, Najaf Amin 25, Marian Beekman 13, Alexandra I Blakemore 26,27, Federico Canzian 28, Stephen E Hamby 2,3, Jouke-Jan Hottenga 14, Peter D Jones 2, Pekka Jousilahti 29, Reedik Mägi 23, Sarah E Medland 15, Grant W Montgomery 30, Dale R Nyholt 15,31, Markus Perola 29,32, Kirsi H Pietiläinen 33,34, Veikko Salomaa 29, Elina Sillanpää 22,35, H Eka Suchiman 13, Diana van Heemst 36, Gonneke Willemsen 14, Antonio Agudo 37, Heiner Boeing 38, Dorret I Boomsma 14, Maria-Dolores Chirlaque 39,40, Guy Fagherazzi 41,42, Pietro Ferrari 43, Paul Franks 44,45, Christian Gieger 4,46,47, Johan Gunnar Eriksson 48,49,50, Marc Gunter 43, Sara Hägg 17, Iiris Hovatta 51,52, Liher Imaz 53,54, Jaakko Kaprio 22,55, Rudolf Kaaks 56, Timothy Key 57, Vittorio Krogh 58, Nicholas G Martin 15, Olle Melander 59, Andres Metspalu 23, Concha Moreno 60, N Charlotte Onland-Moret 61, Peter Nilsson 44, Ken K Ong 1,62, Kim Overvad 63,64, Domenico Palli 65, Salvatore Panico 66, Nancy L Pedersen 17, Brenda WJ H Penninx 21, J Ramón Quirós 67, Marjo Riitta Jarvelin 10,68, Miguel Rodríguez-Barranco 41,69,70, Robert A Scott 1, Gianluca Severi 71,72,73, P Eline Slagboom 12,13, Tim D Spector 19, Anne Tjonneland 74, Antonia Trichopoulou 75, Rosario Tumino 76,77, André G Uitterlinden 7, Yvonne T van der Schouw 61, Cornelia M van Duijn 25,78, Elisabete Weiderpass 43, Eros Lazzerini Denchi 18,79, Giuseppe Matullo 5,6, Adam S Butterworth 16,80,81,83,84, John Danesh 16,80,81,82,83,84, Nilesh J Samani 2,3, Nicholas J Wareham 1,85, Christopher P Nelson 2,3,85, Claudia Langenberg 1,85,, Veryan Codd 2,3,85,∗∗
PMCID: PMC7058826  PMID: 32109421

Abstract

Leukocyte telomere length (LTL) is a heritable biomarker of genomic aging. In this study, we perform a genome-wide meta-analysis of LTL by pooling densely genotyped and imputed association results across large-scale European-descent studies including up to 78,592 individuals. We identify 49 genomic regions at a false dicovery rate (FDR) < 0.05 threshold and prioritize genes at 31, with five highlighting nucleotide metabolism as an important regulator of LTL. We report six genome-wide significant loci in or near SENP7, MOB1B, CARMIL1, PRRC2A, TERF2, and RFWD3, and our results support recently identified PARP1, POT1, ATM, and MPHOSPH6 loci. Phenome-wide analyses in >350,000 UK Biobank participants suggest that genetically shorter telomere length increases the risk of hypothyroidism and decreases the risk of thyroid cancer, lymphoma, and a range of proliferative conditions. Our results replicate previously reported associations with increased risk of coronary artery disease and lower risk for multiple cancer types. Our findings substantially expand current knowledge on genes that regulate LTL and their impact on human health and disease.

Keywords: telomere length, biological aging, Mendelian randomisation, age-related disease

Introduction

Telomeres are DNA-protein complexes found at the ends of eukaryotic chromosomes, and they serve to maintain genomic stability and determine cellular lifespan.1 Telomere length (TL) declines with cellular divisions; this is due to the inability of DNA polymerase to fully replicate the 3′ end of the DNA strand (the “end replication problem”), and once a critically short TL is reached, the cell enters replicative senescence.2 Protein complexes, including the SHELTERIN complexes—which are comprised of TERF1 (MIM: 600951), TERF2 (MIM: 602027), POT1 (MIM: 606478), TERF2IP (MIM: 605061), TINF2 (MIM: 604319), ACD (MIM: 609377), and CST (CTC1 [MIM: 613129], STN1 [MIM: 613128], and TEN1 [MIM: 613130])—along with DNA helicases such as RTEL1 (MIM: 608833), bind telomeres and regulate TL and structure.3 In some cell types, such as stem and germline progenitor cells, TL is maintained by the enzyme telomerase, a ribonucleoprotein containing the RNA template TERC (MIM: 602322), a reverse transcriptase (TERT [MIM: 187270]), and accessory proteins (DKC1 [MIM: 300126], NOP10 [MIM: 606471], GAR1 [MIM: 606468], and NHP2 [MIM: 606470]).4

Severe telomere loss, through loss-of-function mutations of core telomere and telomerase components, leads to several diseases which share features such as bone marrow failure and organ damage. These “telomere syndromes” include dyskeratosis congenita (MIM: 305000), aplastic anemia (MIM: 609135), and idiopathic pulmonary fibrosis (MIM:614742) among others.5,6 While the prevalence of such syndromes varies, they are all relatively rare. One feature of these syndromes is premature aging.5 Along with shorter TL observed at older ages in cross sectional population studies, this has led to TL (most commonly measured in human leukocytes as leucocyte telomere length [LTL]) to be proposed as a marker of biological age. LTL has been shown to be associated with the risk of common age-related diseases, including coronary artery disease (CAD) and some cancers.7, 8, 9, 10, 11, 12 However, whether LTL (reflecting TL across tissues) was causally associated with disease or whether the observed associations may have been due to reverse causation or confounding was unclear.

LTL is both variable among individuals, from birth and throughout the life course, and highly heritable, with heritability estimates from 44%–86%.13,14 Identification of genetic determinants of LTL through a genome-wide association study (GWAS) has allowed further studies to suggest a causal role for LTL in several diseases, including CAD, abdominal aortic aneurysm, several cancers, interstitial lung disease, and celiac disease.15, 16, 17, 18, 19 However, these studies are limited due to the small number of genetic variants that have been identified that replicate between studies.15,20, 21, 22, 23, 24, 25 To further our understanding of LTL regulation and its relationship with disease, we have conducted a genome-wide association (GWA) meta-analysis of 78,592 individuals from the European Network for Genetic and Genomic Epidemiology (ENGAGE) study and from the European Prospective Investigation into Cancer and Nutrition (EPIC) Cardiovascular Disease (CVD) and InterAct studies.

Subjects and Methods

Full descriptions of the EPIC-CVD and EPIC-InterAct cohorts, along with the participating cohorts within the ENGAGE consortium, are given in the Supplemental Information.

LTL Measurements and QC Analysis

Mean LTL measurements were conducted using an established quantitative PCR technique which expressed TL as a ratio of the telomere repeat number (T) to a single-copy gene (S).26,27 The majority of the ENGAGE samples were included within our previous analysis.15 LTL measurements were standardized either by using a calibrator sample or by quantifying against a standard curve, depending on the laboratory (Table S1 and Supplemental Methods). Full details of the methodology employed by each laboratory, along with quality control (QC) parameters, is given in the Supplemental Information or is given in detail elsewhere.15 Because the use of different calibrator samples or of standard curves for quantification can lead to different ranges in the T/S ratios being observed between laboratories, we standardized LTL by using a z-transformation approach (z = (μ - μ0)/σ, μ, T/S ratio, μ0, the mean T/S ratio, σ, standard deviation [SD]).

Genotyping, GWAS Analysis, and Study-Level QC

Genotyping platforms and imputation methods and panels varied across participating study centers. Detailed information about these is provided in Figure S1 and Table S2. A GWAS was run within each study through the use of linear regression under an additive mode of inheritance with adjustment for age, sex, and any study-specific covariates, including batch, center, and genetic principle components. There are 21 studies contributing to ENGAGE. For the EPIC InterAct and CVD studies, association analyses were stratified based on genotyping platform and disease status, resulting in nine strata. Within each study or stratum, related samples (k > 0.088) were removed. Population stratification was estimated using the genomic control inflation factor λ and used to adjust the standard errors. Genetic variants were filtered on the basis of the published standards that included call rate >95%, Hardy–Weinberg equilibrium p < 1 × 10−6, imputation quality info-score >0.4 or R2 > 0.3, minor allele count ≧10, and standard error of association estimates ranging from 0 to 10.15,28,29 These data were taken forward to the meta-analysis.

Meta-analyses

GWAS summary statistics were combined via two steps of meta-analyses by using inverse variance weighting in GWAMA.30 We first combined all 21 ENGAGE studies together and separately combined the nine EPIC-InterAct and EPIC-CVD strata, where a genetic variant was retained if it had >40% of the available sample size within these two cohorts. Fixed effects were used except for variants with significant heterogeneity (Cochrane’s Q: p < 1 × 10−6), in which case random effects were used. Additional adjustment was made for genomic inflation (see Figure S2). In the second step, association estimates derived from the two separate meta-analyses estimated in the first step were combined using fixed effects inverse variance weighted meta-analyses. We estimated the FDR by estimating q-values31 for these data.

Conditional Association Analysis

Conditionally independent signals were identified via an approximate genome-wide stepwise method, using GCTA (Version 1.25.2),32,33 that allows for conditional analyses to be run on summary statistics without individual-level data. Summary statistics from the final meta-analysis were used as the input, with p value cut-offs at 5 × 10−8 (genome-wide significance) or 1.03 × 10−5 (equivalent to an FDR < 0.05). The model starts with the most significant SNP, adds in SNPs iteratively in a forward stepwise manner, and calculates conditional p values for all SNPs within the model. If the target SNP shows evidence of collinearity (correlation coefficients r2 > 0.9, with linkage disequilibrium (LD) estimated based on a random subcohort of 50,000 UK Biobank samples) with any of the SNPs selected into the model, the conditional p value of the target SNP was set to 1. The selection process was repeated until no more SNPs could be fitted into the model, i.e., there were no more SNPs that could reach the conditional p value thresholds (5 × 10−8 or 1.03 × 10−5, corresponding to the p value cut-offs in the input). Joint effects of all selected SNPs that fitted in the model were calculated and reported as independent variants’ effects. Regional plots of a 1Mb window flanking the locus sentinel variants (p < 5 × 10−8) were generated using LocusZoom34 with LD structure estimated in the UK Biobank subcohort (see Figure S3).

Gene Prioritization

Variant Annotation

Sentinel variants (conditional p < 1.03 × 10−5) and their proxies (r2 < 0.8) were annotated on the human reference genome sequence hg19 using Annovar (v2017July16).35 Their functional consequences on the protein sequences encoded by the nearest genes were cross-validated using definitions from RefGene,36 Ensembl gene annotation,37 GENCODE,38 and the University of California, Santa Cruz (UCSC) human genome database.39 These variants were also evaluated for features including evolutionary conservation (whether they reside in or specifically encode an conserved element based on multiple alignments across 46 vertebrate species), chromatin states predicted using Hidden Markov Models trained by CHIP-seq data from ENCODE (15 classified states across nine cell types), histone modification markers (active promoter: H3K4Me3, H3K9Ac; active enhancer: H3K4me1, H3K27Ac; active elongation: H3K36me3; and repressed promoters and broad regions: H3K27me3), and CTCF transcription factor binding sites across nine cell lines, conserved putative TFBS, and DNaseI hypersensitive areas curated from the ENCODE database.38 Variants within the exonic regions were further annotated with allele frequencies in seven ethnical groups (retrieved from the Exome Aggregation Consortium database) and functional effects prediction performed using a number of different algorithms. For non-coding variants, we performed integrated analysis with SNP Nexus IW scoring.40

Transcriptomic Data Integration

(1) With summary statistics, we performed a gene-level analysis, using S-PrediXcan, that links LTL to predicted gene expressions across 44 tissues (GTex v6p). It uses multivariate sparse regression models that integrate cis-SNPs within 2Mb windows around gene transcript boundaries in order to predict the corresponding gene expression levels. A detailed description of the method can be found elsewhere.41,42 In brief, individual SNP-LTL associations were weighted by SNP-gene (wlg) and SNP-SNP (σl/σg) association matrix, estimated from the PredictDB training set (zg=lgwlg(σl/σg)zl, for a gene (g); the set of SNPs (l) were selected from an elastic net model with a mixing parameter of 0.5). Protein-coding genes with qualified prediction model performance (average Pearson’s correlation coefficients r2 between predicted and observed gene expressions >0.01, FDR < 0.05) were included in our analysis. We considered a predicted gene expression to be significantly associated with LTL at a Bonferroni corrected p value threshold (p < 2.61 × 10−7), conservatively assuming association of each gene in each tissue as an independent test.

(2) For a given region significantly associated with LTL (FDR < 0.05), we tested whether the potential causal variants are shared between LTL and gene expressions by using COLOC Bayesian approach.43 Regions for testing were determined as 2Mb windows surrounding the sentinel variants. Regional summary statistics were extracted from this GWA meta-analysis for associations with LTL and GTex v744 for cis-eGenes (genes with significant expression quantitative trait loci [eQTLs], FDR < 0.05) located within or on the boundaries of LTL regions defined. We selected the default priors for this analysis. We set p1 = p2 = 10−4, meaning that 1 in 10,000 variants is associated with either trait (LTL or gene expression), as has been suggested by others.43 We set p12 = 10−5, meaning that 1 in 10 (p12/(p12 + p1)) variants that are associated with one trait is also associated with the other. This was chosen because sensitivity analyses have shown broadly consistent results between this setting and more stringent (p12 = 10−5) settings, while allowing greater power.45 Evidence for colocalization was assessed by comparing the posterior probability (PP) for two hypotheses: that the associations for both traits were driven by the same causal variants (hypothesis 4) and that they were driven by distinct ones (hypothesis 3). Strong evidence of a co-localized eQTL was defined as PP3 + PP4 ≥ 0.99 and PP4/PP3 ≥ 5, and suggestive evidence was defined as PP3 + PP4 ≥ 0.90 and PP4/PP3 ≥ 3, consistent with previous studies.46,47

Epigenomic (DNA Methylation) Data Integration

For genes whose expressions are modulated by epigenetic modifications, such as the methylation of transcriptional regulators in cis, linking genetic variants associated with cis-methylation probes (cis-meQTLs, FDR < 0.05) to LTL can help gene prioritization. For this: (1) We conducted a systematic search of LTL-associated sentinel variants and their proxies (r2 > 0.8) in multiple publicly available meQTL databases.48, 49, 50 (2) We also performed an epigenome-wide association analysis that integrated multiple variants’ associations in a regularized linear regression model which was algorithmically similar to the transcriptome-wide association analyses.51 A reference panel for meQTLs was constructed based on individuals in the EPIC-Norfolk cohort, with detailed description published elsewhere.52 Bonferroni correction was applied, accounting for the total number of CpG markers tested (p = 1.00 × 10−7).

Pathway Enrichment Analysis

Using two different approaches, we sought to identify pathways that are responsible for regulating TL.

PANTHER

A list of our prioritized genes at each locus (or the nearest gene where no prioritization was possible) was submitted for statistical overrepresentation testing (Fisher’s exact test) in Protein Analysis through Evolutionary Relationships (PANTHER).53 Pathways (Gene Ontology [GO] molecular function complete annotation dataset) were considered over-represented where FDR p < 0.05.

DEPICT

We also used a hypothesis-free, data-driven approach using Data-driven Expression Prioritized Integration for Complex Traits (DEPICT)54 to highlight reconstituted gene sets and tissue and/or cell types where LTL-associated loci were enriched. Summary statistics of uncorrelated SNPs (LD r2 ≦ 0.5) significantly associated with LTL at a genome-wide level (p < 5 × 10−8) were used as the input, and the HLA region (chr6:29691116–33054976) was excluded. DEPICT first defined each locus around the uncorrelated variants and selected the genes within the region. It then characterized gene functions based on pairwise co-regulation of gene expressions, and these gene functions were quantified as membership probabilities across the 14,461 reconstituted gene sets. Then for each gene set, it assessed the enrichment by testing whether the sum of membership scores of all genes within each LTL-associated locus was higher than that for a gene-density-matched random locus. Detailed description of gene set construction was published elsewhere.54 In brief, DEPICT leveraged a broad range of pre-defined pathway-oriented databases to construct gene sets (14,461), including GO terms,55 KEGG,56 REACTOME pathways,57 the experimentally derived protein-protein interaction (PPI) subnetwork,58 and the gene-phenotype matrix curated by Mouse Genetics Initiative.59 Correlations (r ≧ 0.3) between significant gene sets were visualized using CytoScape.60

Clinical Relevance of LTL

Mendelian Randomization

Using two-sample Mendelian randomization (MR)61 we investigated the potential effect of LTL on 122 diseases manually curated in the UK Biobank (Table S3).62 Diseases were selected where there were sufficient case numbers to detect an odds ratio >1.1 (Table S4). LTL was genetically proxied based on 52 independently associated variants (FDR < 0.05). Individual SNP effects on disease were tested using logistic regression in SNPTEST,63 adjusting for sex, age, the first five genetic principal components, and genotyping array within the UK Biobank. MR estimates were calculated using an inverse variance weighted MR approach. Sensitivity analyses were performed using median-based MR,64 MR-RAPS,65 MR-Eggers,66 and MR-Steigers67 to identify inconsistency in the MR estimates, account for weak instrument bias, highlight any evidence of directional pleiotropy, and estimate direction of the MR relationship, repectively.

LD Score Regression

Cross-trait linkage disequilibrium score regression (LDSC) analysis was used to measure genetic correlations between LTL and selected traits through the use of the LD Hub database (version 1.4.1).68 From the 832 available traits in LD Hub, we a priori selected traits of interest in order to remove redundancy and/or duplication within the analysis. We removed poorly defined traits and diseases, those without prior evidence of a genetic basis, and medications. We also removed lipid sub-fractions because we thought these unlikely to be relevant. We excluded studies with a sample size <1,000. Where multiple datasets for the same trait existed, we first prioritized datasets from large specialist consortia (where relevant factors would have been accounted for within the GWAS analysis) over the UK Biobank analyses conducted by the Neale group (where the GWAS was acknowledged to be a “quick and dirty” analysis). We then prioritized larger sample size, more recent studies, and diagnosed conditions over self-reported ones. We also removed traits with low heritability estimates within LD Hub, leaving us with 320 traits (information, including PMIDs of the selected studies, is given in the Results section).

Genome-wide summary statistics were used as the input, and standardized quality control was implemented within the software, including minor allele frequency (MAF) (>1% for HapMap3 and >5% for 1000 Genomes EUR-imputed SNPs), effective sample size (>0.67 times the 90th percentile of sample size), removal of insertions or deletions or structural variants, allelic alignment to 1000 Genomes, and removal of SNPs within the major histocompatibility complex (MHC) region.

Variants-based Cross-database Query

Independent variants and their strong proxies (r2 ≥ 0.8) were queried against publicly available GWAS databases; for this, we used PhenoScanner69 for computational efficiency. A list of GWAS results implemented in the software was previously published. Results were filtered to include associations with p < 1 × 10−6, in high LD (r2>0.8) with the most significant SNPs within the region, and manually curated to retain only the most recent and largest study per trait.

Results

Discovery of Genetic Determinants of LTL

Mean LTL was measured within each cohort by using a quantitative polymerase chain reaction (qPCR)-based method, which expresses TL as a ratio of telomere repeat content (T) to single-copy gene (S) within each sample (see Subjects and Methods, Supplemental Information, and Table S1). T/S ratios were z-standardized to harmonize differences in the quantification and calibration protocols between cohorts. Associations of shorter LTL with increasing age and male gender were observed as expected (Table S1).

Variants were assessed for association with mean LTL within each cohort through the use of additive models adjusted for age, gender, and cohort-specific covariates and then combined using inverse-variance-weighted meta-analysis (Table S2).

In total, 20 sentinel variants at 17 genomic loci were independently associated with LTL at a level of genome-wide statistical significance (p < 5 × 10−8, Table 1, Figure S1), including six loci that had not previously been associated with LTL (SENP7 [MIM: 612846], MOB1B [MIM:609282], CARMIL1 [MIM: 609593], PRRC2A [MIM: 142580], TERF2, and RFWD3 [MIM: 614151]). We also identified genome-wide significant variants in four recently reported loci from a Singaporean Chinese population (POT1, PARP1 [MIM: 173870], ATM [MIM:607585], and MPHOSPH6 [MIM:605500])70 and confirmed association at seven previously reported loci in European ancestry studies (TERC, NAF1 [MIM: 617868], TERT, STN1(OBFC1), DCAF4 [MIM: 616372], ZNF208 [MIM: 603977], and RTEL1).15,23 Two and three conditionally independent signals were detected within the TERT and RTEL1 loci, respectively (Table 1). Within the known loci, three variants within the DCAF4 (r2 = 0.05) and TERT (r2 < 0.5) loci were distinct from the previously reported sentinel variants, while five (TERC, NAF1, STN1, ZNF208, and RTEL1; r2 > 0.8; Table S5) were in high LD with the previously reported ones from European studies. For the loci identified in a Chinese ancestry population, we observed the same sentinel variant for PARP1 and high LD variants for ATM and MPHOSPH6 (r2 > 0.8) but a distinct sentinel for POT1 (r2 < 0.5, Table S5). While we observed a distinct sentinel for POT1, we cannot rule out the possibility that the association signal observed in this region could be shared. In that case, the sentinels identified in each population would be reflective of a third, as yet unidentified, variant that is the true causal variant in this region. For the RTEL1 locus, there are significant differences in LD structure between ancestral populations. All of the RTEL1 variants we report at genome-wide statistical significance are in low LD with those reported in Singaporean Chinese and in South Asians.25,70 Our novel variants are of lower frequency (MAF < 0.1) and either are reported as being monoallelic (monomorphic) or fall below the MAF threshold for analysis in the Southern Han Chinese (CHS) population (MAF < 0.01). This suggests that genetic variation in this region may be, in part, population specific or that the MAF is so low that we currently are unable to detect any association.

Table 1.

Independent Variants Associated with LTL at Genome-Wide Significance (5x10−8)

SNP Gene Chr Position (hg19) EA EAF Beta SE p Value
Previously Reported Loci

rs3219104 PARP1 1 226562621 C 0.83 0.042 0.006 9.60 × 10−11
rs10936600 TERC 3 169514585 T 0.24 −0.086 0.006 7.18 × 10−51
rs4691895 NAF1 4 164048199 C 0.78 0.058 0.006 1.58 × 10−21
rs7705526 TERT 5 1285974 A 0.33 0.082 0.006 5.34 × 10−45
rs2853677 TERT 5 1287194 A 0.59 −0.064 0.006 3.35 × 10−31
rs59294613 POT1 7 124554267 A 0.29 −0.041 0.006 1.17 × 10−13
rs9419958 STN1 (OBFC1) 10 105675946 C 0.86 −0.064 0.007 5.05 × 10−19
rs228595 ATM 11 108105593 A 0.42 −0.029 0.005 1.43 × 10−8
rs2302588 DCAF4 14 73404752 C 0.10 0.048 0.008 1.68 × 10−8
rs7194734 MPHOSPH6 16 82199980 T 0.78 −0.037 0.006 6.94 × 10−10
rs8105767 ZNF208 19 22215441 G 0.30 0.039 0.005 5.42 × 10−13
rs75691080 RTEL1/STMN3 20 62269750 T 0.09 −0.067 0.009 5.99 × 10−14
rs34978822 RTEL1 20 62291599 G 0.02 −0.140 0.023 7.26 × 10−10
rs73624724 RTEL1/ZBTB46 20 62436398 C 0.13 0.051 0.007 6.33 × 10−12

Additional Loci

rs55749605 SENP7 3 101232093 A 0.58 −0.037 0.007 2.45 × 10−8
rs13137667 MOB1B 4 71774347 C 0.96 0.077 0.014 2.43 × 10−8
rs34991172 CARMIL1 6 25480328 G 0.07 −0.061 0.011 6.19 × 10−9
rs2736176 PRRC2A 6 31587561 C 0.31 0.035 0.006 3.53 × 10−10
rs3785074 TERF2 16 69406986 G 0.26 0.035 0.006 4.64 × 10−10
rs62053580 RFWD3 16 74680074 G 0.17 −0.039 0.007 4.08 × 10−8

Gene—the closest or candidate gene (known telomere-related function) within the region. EA—effect allele. EAF—effect allele frequency within the study. Beta—the per-allele effect on z-scored LTL. SE—standard error.

Additional, independent signals detected using conditional analysis are included.

It has been shown that many loci that fall just below the conventional threshold of genome-wide significance are genuinely associated with the trait of interest and do subsequently reach the conventional threshold when sample size is increased.71 In an attempt to gain additional insight into the genetic determination of LTL in humans, we applied a less stringent FDR threshold to the data. An additional 32 variants met an FDR threshold of <0.05, totaling 52 variants that estimate ~2.93% of the variance in TL (Table S6).71 Within this FDR list, 5% of variants (2–3) are estimated to be false positives, although we are not able to determine which they are. While we believe that this FDR is acceptable, we advise that individual loci should be interpreted with some caution. These variants were located within separate loci from those reported above, with the exception of a fourth, independent signal in the RTEL1 locus. Although we did not replicate the previously reported ACYP2 (MIM: 102595) locus, this did remain within the variants identified at the FDR < 0.05 threshold. TYMS (MIM: 188350), identified as genome-wide significant in a trans-ethnic meta-analysis of Singaporean Chinese67 and in the previously reported ENGAGE analysis,15 is within our FDR < 0.05 identified loci. This was to be expected considering the substantial sample overlap of the ENGAGE data; however, our sentinel variant is distinct and not reported in the Dorajoo et al. study. Aligning our data with available summary statistics from the Dorajoo et al. study (Singaporean Chinese samples only), we see at least nominal support for the vast majority of our genome-wide significant loci, with the exception of STN1(OBFC1) and SENP7 (Table S7). Although SENP7 has not previously been reported, variants in high LD (r2 > 0.6) with our STN1 sentinel have been reported in other European populations.21,22 There is also support for many variants in our extended FDR list. However, it should be noted that data are not available for around half of our FDR < 0.05 loci, with most of these being either monoallelic or too low frequency to have been included within the analysis in the CHS population, again suggesting that several may be specific to the European population.

Prioritization of Likely Candidate Genes

We applied in silico prediction tools, leveraging large-scale human genomic data integrated with multi-tissue gene expression, transcriptional regulation, and DNA methylation data, coupled with knowledge-driven manual curation, to prioritisze the genes that are most likely influenced by the genetic variants within each locus. All 52 sentinel variants identified at GWS and FDR < 0.05 (listed in Table S6) plus their high LD proxies (r2 > 0.8) were taken forward into our in silico analyses. First, we annotated all variants for genomic location and location with respect to regulatory chromatin marks (Tables S8 and S9). This also identified variants that led to non-synonymous changes in nine loci. Of these, five loci contained variants with predicted damaging effects on protein function (Table S10). We also found evidence that variants were associated with changes in gene expression in multiple loci (Table S11), with several showing co-localization and evidence from two approaches. This data, along with prediction of functional non-coding variants (Table S12), methylation QTL data (Table S13), and curation of gene functions within the region (Supplemental Methods), are summarized in Table S14. The summary data were utilized to prioritize genes that are most likely influenced at each locus. Where the prioritization methods suggested multiple genes for a given locus, we prioritized based on the amount of evidence across all considered lines of enquiry stated above. We were able to prioritize genes at 15 of the 17 genome-wide significant loci and 16 at of the 32 FDR loci (Table S14).

Four of the prioritized genes for newly identified loci have known roles in telomere regulation (PARP1, POT1, ATM, and TERF2; Figure 1). PARP1 (poly(ADP-ribose) polymerase 1), a variant in high LD (r2 = 1.0) with our identified sentinel variant, causes a Val762Ala substitution (Table S10) which is known to reduce PARP1 activity.72 This variant was associated with shorter LTL, in agreement with studies showing that knockdown of PARP1 leads to telomere shortening.73 PARP1 catalyzes the poly(ADP-ribosyl)ation of proteins in several cellular pathways, including DNA repair.73 It interacts with TERF2 and it regulates the binding of TERF2 to telomeric DNA through this post-translational modification.74

Figure 1.

Figure 1

Loci with Established Roles in Telomere Biology

Candidate genes found in this study are shown in red. These include genes that encode components of the SHELTERIN complex (A), regulate the formation and activity of telomerase (B), and regulate telomere structure (C).

Three genes, DCAF4, SENP7, and RFWD3, prioritized based on deleterious protein coding changes (DCAF4, SENP7) or strong evidence linking to gene expression levels (RFWD3), are all involved in DNA damage repair.75, 76, 77 SENP7 has previously been demonstrated to bind damaged telomeres.78 Components of DNA damage response and repair pathways (such as ATM) have been shown to also play roles in telomere regulation.79 Mutations in RFWD3 cause Fanconi anemia (MIM: 617784), a disease linked to telomere shortening and/or abnormalities.80

The PRRC2A locus contains 11 genetically linked SNPs located across the MHC class III region, which is a highly polymorphic and gene-dense region with complex LD structure. BAG6 (MIM: 142590) and CSNK2B (MIM: 115441) were suggested as gene candidates for this region, supported by gene expression data (see Supplemental Information and Tables S11 and S14). BAG6 is linked to DNA damage signaling and apoptosis,81 while CSNK2B, a subunit of casein kinase 2, interacts with TERF1 and regulates TERF1 binding at telomeres.82

Pathway Enrichment

To investigate context-specific functional connections between prioritized genes of the identified loci and to suggest plausible biological roles of these genes in the TL regulation, we performed enrichment analyses for pathways and tissues through the use of DEPICT54 and PANTHER.53 DEPICT is a hypothesis-free, data-driven approach for which we used summary statistics of all uncorrelated SNPs (LD r2 ≦ 0.5) associated at p < 5 × 10−8 as input. For PANTHER, we assessed overrepresentation of genes within our loci within known pathways. To minimize noise, we used our prioritized genes as input, along with the closest gene to the sentinel SNP, where no prioritization was possible. In total, 55 genes were submitted to PANTHER, of which six were not available within PANTHER, leaving 49 within the analysis.

Over 300 reconstituted gene sets (DEPICT) were significantly enriched for the LTL loci (FDR < 0.05); these could be further clustered into 34 meta-gene sets, highlighting pathways that are involved in several major cellular activities, including DNA replication, transcription, and repair; cell cycle regulation; immune response; and intracellular trafficking (Figure 2A).

Figure 2.

Figure 2

Pathways Enriched for Telomere-Associated Genes

(A) Gene sets significantly (false discovery rate [FDR] < 0.05) enriched for prioritised LTL-associated genes. Color intensity of the nodes (gene sets), classified into three levels, reflects enrichment strengths (FDR). Edge width indicates Pearson correlation coefficient (r2) between each pair of the gene sets. Some of the most significantly associated gene sets include telomere maintenance along with DNA replication and repair pathways as may be expected. How other enriched pathways may influence LTL is unclear.

(B) Role of LTL-associated genes in nucleotide metabolism. Five enzymatic reactions and genes encoding the corresponding enzymes prioritized from this GWAS are highlighted in bold.

The PANTHER analysis identified a number of telomere-related pathways, including regulation of telomeric loop disassembly, t-circle formation, protein binding at telomeres, and single-strand break repair, as being the mostly highly overrepresented (Table S15). Among other expected pathways, cellular aging and senescence were also highlighted. Of note, nucleotide metabolism pathways were overrepresented (2′-deoxyribonucleotide metabolic process, deoxyribose phosphate metabolic process, and deoxyribonucleotide metabolic process; Figure 2B; Table S15). The genes matched to these pathways were TYMS, SAMHD1 (MIM: 606754), and SMUG1 (MIM: 607753). While TYMS is critical for deoxythymidine monophosphate (dTMP) biosynthesis, SAMHD1 controls deoxynucloeside triphosphate (dNTP) catabolism and SMUG1 removes misincorporated uracil from DNA.83, 84, 85 Although not highlighted in the pathway analysis, two further genes within other identified loci (TK1 [MIM: 188300] and DCK [MIM:125450]) are key regulators of deoxynucleoside monophosphate (dNMP) biosynthesis;85 this adds further support to the possibility that nucleotide metabolism is a key pathway in regulating LTL. dNTPs constitute the fundamental building blocks required for DNA replication and repair.86 Genetic perturbations that disrupt dNTP homeostasis have been shown to result in increased replication error, cell cycle arrest, and DNA-damage-induced apoptosis.85,87

Relationship between Genetically Determined TL and Disease

To further understand the clinical relevance of TL, we used the 52 independent variants identified at FDR < 0.05 as genetic instruments for TL, and we applied a two-sample MR approach using UK Biobank data.62 We manually curated 122 diseases available in the UK Biobank and examined their relationships with shorter TL (Tables S3 and S16). We observed nine associations which passed a Bonferroni corrected threshold (p < 4.1x10−4). These included novel findings of an increased risk of hypothyroidism, and decreased risk of thyroid cancer, lymphoma, and diseases of excessive growth (uterine fibroids, uterine polyps, and benign prostatic hyperplasia). We also confirmed findings for decreased risk of lung and skin cancer and leukemia for subjects with shorter TL (Figure 3, Table S16).16,18,88 We observed a further 30 nominally significant associations (p < 0.05), confirming previous MR findings of an increased risk of CAD, within the UK Biobank population (Figure 3, Table S16). Our results also provide genetic evidence for associations of shorter LTL with increased risk of rheumatoid arthritis, aortic valve stenosis, chronic obstructive pulmonary disease, and heart failure, all of which have previously been observationally associated with shorter LTL.89, 90, 91, 92 We also ran the MR analyses using only the genome-wide significant variants (Figure S4), and we did not lose any Bonferroni-significant hits, with only small differences in those diseases that are nominally associated. In our sensitivity analyses, effect estimates were consistent across MR methods. The MR-Steigers analysis indicated that the direction of the relationship is that TL influences disease risk. This analysis also indicated that this direction was estimated correctly for the majority of diseases (Table S16).

Figure 3.

Figure 3

Mendelian Randomization Results for the Effect of Shorter LTL on the Risk of 122 Diseases in the UK Biobank

Data shown are odds ratios and 95% confidence intervals for a 1 standard deviation shorter LTL. Diseases are classified into groups, as indicated by the boxing, and sorted alphabetically within disease group. Nominally significant (p < 0.05) associations estimated via inverse-variance-weighted Mendelian randomization are shown in green for a reduction in risk and purple for an increase in risk due to shorter LTL. O indicates nominal (p < 0.05) evidence of pleiotropy estimated by MR-Eggers intercept. Full results are also shown in Table S16 along with the full MR sensitivity analysis.

We next sought to explore human diseases and traits that share common genetic etiologies with LTL. We did this by performing LD score regression analyses to test for genetic correlations between TL and 320 curated traits and diseases (Table S17) within LD Hub.15,16 In comparison to the MR approach, these analyses utilize genome-wide genetic information rather than selected SNPs with the most significant associations. In agreement with our MR analyses, TL was negatively correlated with CAD (r = −0.17, p = 0.01, Table S17). Dyslipidaemia risk factors for CAD also showed concordant associations with shorter TL, including higher LDL and total cholesterol and lower HDL cholesterol (Table S17). These results are suggestive of a shared genetic architecture underlying TL, CAD, and CAD risk factors. However, these results would not survive correction for multiple testing.

We also examined individual locus-driven genetic correlations between TL and a variety of human phenotypes and diseases by using PhenoScanner69 to query 52 FDR sentinel variants and their closely related SNPs in LD (r2 ≧ 0.8) against publicly available GWAS databases. While some morbidities showed specific correlations to a single locus, others showed correlations to a broader spectrum of loci. For example, self-reported hypothyroidism or myxoedema exhibited a strong association particularly at the TERT locus, which was also exclusively responsible for several subtypes of ovarian cancers (Table S18). In contrast, blood cell traits and hematological diseases were implicated with a wider range of loci, including TERC, TERT, SENP7, ATM, BBOF1, and MROH8; this result is similar to those for the respiratory function and lung cancers that also involved multiple TL loci (Table S18).

Discussion

We identify 20 lead variants at a level of genome-wide significance and a further 32 at FDR < 0.05. Within established loci, we report a second, independent, association signal within the TERT locus and redefine the RTEL1 locus into three independent signals. By applying a range of in silico tools that integrate multiple lines of evidence, we were able to pinpoint likely influenced genes for the majority of independent lead variants (34 of 52), several of which represent key telomere-regulating pathways (including components of the telomerase complex, the telomere-binding SHELTERIN and CST complexes, and the DNA damage response [DDR] pathway).

Telomeres function to prevent the 3′ single-stranded overhang at the end of the chromosome from being detected as a double-stranded DNA break. This is achieved through binding of the SHELTERIN complex (TERF1, TERF2, TERF2IP, TINF2, ACD, and POT1), which acts to block activation of DDR pathways via several mechanisms.3 SHELTERIN also binds a number of accessory factors that facilitate processing and replication of the telomere, including the DNA helicase RTEL1.3 SHELTERIN also interacts with the CST complex that regulates telomerase access to the telomeric DNA (Figure 1C).3 The associated loci contain two of the SHELTERIN components (TERF2 and POT1), a regulator of TERF1, CSNK2B (PRRC2A locus),82 the helicase RTEL1, and the CST component STN1.

Although telomere-binding proteins and structure aim to inhibit activation of DDR pathways, there is also evidence of a paradoxical involvement of a number of DDR factors in TL maintenance; these factors include both of the prioritized genes, ATM and PARP1.73,93 TERF2 inhibits ATM activation and the classical non-homologous end joining (c-NHEJ) at telomeres, thus preventing synapsis of chromosome ends (Figure 1A).94 However, ATM activation is required for telomere elongation, potentially by regulating access of telomerase to the telomere end through ATM-mediated phosphorylation of TERF1.93 It is possible that other DDR regulators can impact TL maintenance by regulating telomeric chromatin states, T-loop dynamics, and single-stranded telomere overhang processing.79 Other prioritised genes (SENP7 and RFWD3) also function within DDR pathways; this suggests a plausible mechanism through which they may influence LTL.

The telomerase enzyme is capable of extending telomeres and/or compensating sequence loss due to the end replication problem in stem and reproductive cells.4 Associated loci include genes encoding the core telomerase components TERT and TERC along with the chaperone protein NAF1. NAF1 is required for TERC accumulation and its incorporation into the telomerase complex.95 After transcription, TERC undergoes complex 3′ processing to produce the mature 451bp template.96 This involves components of the RNA exosome complex, PARN (MIM: 604212) and TENT4B (MIM: 605540), among others; this process is not fully understood.97 In addition to variants within regions containing TERT, TERC, and NAF1, a prioritiszed gene from another locus (MPHOSPH6) is a component of the RNA exosome.98

Comparing our findings to those reported in a non-European study,70 we find support for our most significantly associated loci. For many of our FDR < 0.05 loci, we were unable to look for support from this study because our sentinel variants were either monoallelic or rare (MAF < 0.01) in the CHS population. Different LD structures in regions such as RTEL1, coupled with the reported absence of some of the variants in other ancestral populations, suggest that some of our reported variants may specific to Europeans. Adding additional support for the existence of population-specific rare variants regulating LTL is the discovery of two loci in the Singaporean Chinese study that are monoallelic in Europeans.70 Because both of these replicate within CHS subjects and are located within regions containing telomere-related genes, they are unlikely to be false positive findings. Future large-scale trans-ethnic meta-analyses will be critical in determining shared causal variants from population-specific rare variants. This is of key importance to downstream analyses using genetically determined LTL to investigate disease risk in different populations. However, the current lack of large-scale data on LTL in non-European cohorts is limiting.

Utilizing the prioritized gene list as well as the closest genes to the sentinel variants, we showed a number of pathways to be enriched for telomere-associated loci. Of note, we observed significant overrepresentation of genes in several nucleotide metabolism pathways (Table S15, Figure 2B). Key genes were highlighted by this function in both the biosynthesis (TYMS, TK1, and DCK) and catabolism (SAMHD1) of dNTPs. Biosynthesis of dNTPs occurs via two routes: de-novo synthesis and the nucleotide salvage pathway. Thymidine kinase (TK1) and deoxycytidine kinase (DCK) are the rate-limiting enzymes that catalyze the first step of the salvage pathway of nucleotide biosynthesis, converting deoxynucleosides to their monophosphate forms (dNMPs) before other enzymes facilitate further phosphorylation into deoxynucleodie diphophates (dNDPs) and dNTPs (Figure 2B).85 Thymidylate synthetase (TYMS) is considered to be a component of the de novo pathway, and is the key regulator of dTMP biosynthesis, converting deoxyuridine monophosphate (dUMP) to dTMP.85 However, because the dUMP substrates can be derived from either de novo synthesis or deamination of deoxycytidine monophosphate (dCMP) produced from the salvage pathway, it could be considered to function within both pathways (Figure 2B).85 Besides controlling biosynthetic pathways, the equilibrium of cellular dNTP levels is also achieved by regulating degradation of dNTPs, a key regulator of which is SAMHD1. It catalyzes the hydrolysis of dNTPs to deoxynucleosides and triphosphates, thereby preventing the accumulation of excess dNTPs (Figure 2B).81 Although the finely tuned dNTP supply system inhibits incorrect insertions of bases into DNA synthesis, potential errors are monitored by the product of another prioritized gene, the base excision repair enzyme, SMUG1, which removes uracil and oxidized derivatives from DNA molecules.84

A balanced cellular pool of dNTPs is required for DNA replication and repair and for maintaining proliferative capacity and genome stability. Low levels of dNTPs can induce replication stress, subsequently leading to increased mutation rates.99 A surplus of dNTPs, on the other hand, reduces replication fidelity, thus also causing higher levels of spontaneous mutagenesis.100 A dynamic balance between biosynthesis and catabolism is required to maintain an equilibrium. Because maintaining the balance of the intracellular dNTP pool is also fundamental to other pathways that are implicated in telomere homeostasis, including cellular proliferation and DNA repair, disruption of dNTP homeostasis may trigger a sequence of cellular events that interplay synergistically, leading to abnormalities of TL and genome instability.

By clustering our prioritized genes via their functional connections, we highlighted a number of pathways that were enriched for TL regulation, which included DNA replication, transcription, and repair; cell cycle regulation; immune response; and intracellular trafficking. However, we noted that because the gene prioritization was based on integration of bioinformatic evidence from a number of publicly available databases, which also laid the foundation for establishing the pathways used in the enrichment analyses, this approach may suffer from self-fulfilling circular arguments.

While supporting previous evidence linking shorter TL to an increased risk of CAD and lower risk of several cancers, we demonstrated additional associations between TL and thyroid disease, thyroid cancer, lymphoma, and several non-malignant neoplasms. Shorter TL was protective against all of these proliferative disorders, potentially through limiting cell proliferative capacity, which in turn reduces the occurrence of potential oncogenic mutations that can occur during DNA replication. Furthermore, we also provide evidence suggesting that shorter TL is potentially causally associated with increased risk of several cardiovascular, inflammatory, and respiratory disorders that have previously been linked to TL in observational studies. Our findings linking nucleotide metabolism to TL regulation could in part explain the link between TL and cancer and proliferative disorders. This would suggest that cells with longer TL have higher dNTP levels that lead to higher proliferation rates and reduced DNA replication fidelity leading to higher mutation rates.

In summary, our findings substantially expand current knowledge on the genetic determinants of LTL, and they elucidate genes and pathways that regulate telomere homeostasis and their potential impact on human diseases and cancer development.

Declaration of Interests

A.S.B. holds grants unrelated to this work from AstraZeneca, Merck, Novartis, Biogen, and Bioverativ/Sanofi.

J.D. reports personal fees and non-financial support from Merck Sharpe and Dohme UK Atherosclerosis; personal fees and non-financial support from Novartis Cardiovascular and Metabolic Advisory Board; personal fees and non-financial support from Pfizer Population Research Advisory Panel; and grants from the British Heart Foundation, the European Research Council, Merck, the NIHR, NHS Blood and Transplant, Novartis, Pfizer, the UK Medical Research Council, Health Data Research UK, and the Wellcome Trust outside the submitted work.

Acknowledgments

The ENGAGE Project was funded under the European Union Framework 7—Health Theme (HEALTH-F4-2007- 201413). The InterAct project received funding from the European Union (Integrated Project LSHM-CT-2006-037197 in the Framework Programme 6 of the European Community). The EPIC-CVD study was supported by core funding from the UK Medical Research Council (MR/L003120/1), the British Heart Foundation (RG/13/13/30194; RG/18/13/33946), the European Commission Framework Programme 7 (HEALTH-F2-2012-279233), and the National Institute for Health Research (Cambridge Biomedical Research Centre at the Cambridge University Hospitals National Health Service (NHS) Foundation Trust)[*]. C.P.N is funded by the British Heart Foundation (BHF). V.C., C.P.N., and N.J.S. are supported by the National Institute for Health Research (NIHR) Leicester Cardiovascular Biomedical Research Centre and N.J.S. holds an NIHR Senior Investigator award. Chen Li is support by a four-year Wellcome Trust PhD Studentship; C.L., L.A.L., and N.J.W. are funded by the Medical Research Council (MC_UU_12015/1). N.J.W. is an NIHR Senior Investigator. J.D. is funded by the NIHR (Senior Investigator Award).[*]. *The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health and Social Care. Cohort-specific and further acknowledgments are given in the Supplemental Information.

Published: February 27, 2020

Footnotes

Supplemental Data can be found online at https://doi.org/10.1016/j.ajhg.2020.02.006.

Contributor Information

Claudia Langenberg, Email: Claudia.Langenberg@mrc-epid.cam.ac.uk.

Veryan Codd, Email: vc15@leicester.ac.uk.

Web Resources

Supplemental Data

Document S1

Figures S1–S4, Supplemental Methods, Description of Individual Loci Associated with LTL, Supplemental Acknowledgments, Supplementary References

mmc1.pdf (1.9MB, pdf)
Document S2. Tables S1–S18
mmc2.xlsx (1.2MB, xlsx)
Document S3. Article plus Supplemental Information
mmc3.pdf (4MB, pdf)

References

  • 1.O’Sullivan R.J., Karlseder J. Telomeres: protecting chromosomes against genome instability. Nat. Rev. Mol. Cell Biol. 2010;11:171–181. doi: 10.1038/nrm2848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Allsopp R.C., Vaziri H., Patterson C., Goldstein S., Younglai E.V., Futcher A.B., Greider C.W., Harley C.B. Telomere length predicts replicative capacity of human fibroblasts. Proc. Natl. Acad. Sci. USA. 1992;89:10114–10118. doi: 10.1073/pnas.89.21.10114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.de Lange T. Shelterin-Mediated Telomere Protection. Annu. Rev. Genet. 2018;52:223–247. doi: 10.1146/annurev-genet-032918-021921. [DOI] [PubMed] [Google Scholar]
  • 4.Blackburn E.H., Collins K. Telomerase: an RNP enzyme synthesizes DNA. Cold Spring Harb. Perspect. Biol. 2011;3:a003558. doi: 10.1101/cshperspect.a003558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Armanios M., Blackburn E.H. The telomere syndromes. Nat. Rev. Genet. 2012;13:693–704. doi: 10.1038/nrg3246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Holohan B., Wright W.E., Shay J.W. Cell biology of disease: Telomeropathies: an emerging spectrum disorder. J. Cell Biol. 2014;205:289–299. doi: 10.1083/jcb.201401012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Brouilette S., Singh R.K., Thompson J.R., Goodall A.H., Samani N.J. White cell telomere length and risk of premature myocardial infarction. Arterioscler. Thromb. Vasc. Biol. 2003;23:842–846. doi: 10.1161/01.ATV.0000067426.96344.32. [DOI] [PubMed] [Google Scholar]
  • 8.Brouilette S.W., Moore J.S., McMahon A.D., Thompson J.R., Ford I., Shepherd J., Packard C.J., Samani N.J., West of Scotland Coronary Prevention Study Group Telomere length, risk of coronary heart disease, and statin treatment in the West of Scotland Primary Prevention Study: a nested case-control study. Lancet. 2007;369:107–114. doi: 10.1016/S0140-6736(07)60071-3. [DOI] [PubMed] [Google Scholar]
  • 9.Benetos A., Gardner J.P., Zureik M., Labat C., Xiaobin L., Adamopoulos C., Temmar M., Bean K.E., Thomas F., Aviv A. Short telomeres are associated with increased carotid atherosclerosis in hypertensive subjects. Hypertension. 2004;43:182–185. doi: 10.1161/01.HYP.0000113081.42868.f4. [DOI] [PubMed] [Google Scholar]
  • 10.Fitzpatrick A.L., Kronmal R.A., Gardner J.P., Psaty B.M., Jenny N.S., Tracy R.P., Walston J., Kimura M., Aviv A. Leukocyte telomere length and cardiovascular disease in the cardiovascular health study. Am. J. Epidemiol. 2007;165:14–21. doi: 10.1093/aje/kwj346. [DOI] [PubMed] [Google Scholar]
  • 11.Wentzensen I.M., Mirabello L., Pfeiffer R.M., Savage S.A. The association of telomere length and cancer: a meta-analysis. Cancer Epidemiol. Biomarkers Prev. 2011;20:1238–1250. doi: 10.1158/1055-9965.EPI-11-0005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zhu X., Han W., Xue W., Zou Y., Xie C., Du J., Jin G. The association between telomere length and cancer risk in population studies. Sci. Rep. 2016;6:22243. doi: 10.1038/srep22243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Njajou O.T., Cawthon R.M., Damcott C.M., Wu S.H., Ott S., Garant M.J., Blackburn E.H., Mitchell B.D., Shuldiner A.R., Hsueh W.C. Telomere length is paternally inherited and is associated with parental lifespan. Proc. Natl. Acad. Sci. USA. 2007;104:12135–12139. doi: 10.1073/pnas.0702703104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Broer L., Codd V., Nyholt D.R., Deelen J., Mangino M., Willemsen G., Albrecht E., Amin N., Beekman M., de Geus E.J. Meta-analysis of telomere length in 19,713 subjects reveals high heritability, stronger maternal inheritance and a paternal age effect. Eur. J. Hum. Genet. 2013;21:1163–1168. doi: 10.1038/ejhg.2012.303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Codd V., Nelson C.P., Albrecht E., Mangino M., Deelen J., Buxton J.L., Hottenga J.J., Fischer K., Esko T., Surakka I., CARDIoGRAM consortium Identification of seven loci affecting mean telomere length and their association with disease. Nat. Genet. 2013;45:422–427, e1–e2. doi: 10.1038/ng.2528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Haycock P.C., Burgess S., Nounu A., Zheng J., Okoli G.N., Bowden J., Wade K.H., Timpson N.J., Evans D.M., Willeit P., Telomeres Mendelian Randomization Collaboration Association Between Telomere Length and Risk of Cancer and Non-Neoplastic Diseases: A Mendelian Randomization Study. JAMA Oncol. 2017;3:636–651. doi: 10.1001/jamaoncol.2016.5945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zhan Y., Song C., Karlsson R., Tillander A., Reynolds C.A., Pedersen N.L., Hägg S. Telomere Length Shortening and Alzheimer Disease--A Mendelian Randomization Study. JAMA Neurol. 2015;72:1202–1203. doi: 10.1001/jamaneurol.2015.1513. [DOI] [PubMed] [Google Scholar]
  • 18.Zhang C., Doherty J.A., Burgess S., Hung R.J., Lindström S., Kraft P., Gong J., Amos C.I., Sellers T.A., Monteiro A.N., GECCO and GAME-ON Network: CORECT, DRIVE, ELLIPSE, FOCI, and TRICL Genetic determinants of telomere length and risk of common cancers: a Mendelian randomization study. Hum. Mol. Genet. 2015;24:5356–5366. doi: 10.1093/hmg/ddv252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Iles M.M., Bishop D.T., Taylor J.C., Hayward N.K., Brossard M., Cust A.E., Dunning A.M., Lee J.E., Moses E.K., Akslen L.A., AMFS Investigators. IBD investigators. QMEGA and QTWIN Investigators. SDH Study Group. GenoMEL Consortium The effect on melanoma risk of genes previously associated with telomere length. J. Natl. Cancer Inst. 2014;106:dju267. doi: 10.1093/jnci/dju267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Codd V., Mangino M., van der Harst P., Braund P.S., Kaiser M., Beveridge A.J., Rafelt S., Moore J., Nelson C., Soranzo N., Wellcome Trust Case Control Consortium Common variants near TERC are associated with mean telomere length. Nat. Genet. 2010;42:197–199. doi: 10.1038/ng.532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Levy D., Neuhausen S.L., Hunt S.C., Kimura M., Hwang S.J., Chen W., Bis J.C., Fitzpatrick A.L., Smith E., Johnson A.D. Genome-wide association identifies OBFC1 as a locus involved in human leukocyte telomere biology. Proc. Natl. Acad. Sci. USA. 2010;107:9293–9298. doi: 10.1073/pnas.0911494107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Pooley K.A., Bojesen S.E., Weischer M., Nielsen S.F., Thompson D., Amin Al Olama A., Michailidou K., Tyrer J.P., Benlloch S., Brown J. A genome-wide association scan (GWAS) for mean telomere length within the COGS project: identified loci show little association with hormone-related cancer risk. Hum. Mol. Genet. 2013;22:5056–5064. doi: 10.1093/hmg/ddt355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Mangino M., Christiansen L., Stone R., Hunt S.C., Horvath K., Eisenberg D.T., Kimura M., Petersen I., Kark J.D., Herbig U. DCAF4, a novel gene associated with leucocyte telomere length. J. Med. Genet. 2015;52:157–162. doi: 10.1136/jmedgenet-2014-102681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Mangino M., Hwang S.J., Spector T.D., Hunt S.C., Kimura M., Fitzpatrick A.L., Christiansen L., Petersen I., Elbers C.C., Harris T. Genome-wide meta-analysis points to CTC1 and ZNF676 as genes regulating telomere homeostasis in humans. Hum. Mol. Genet. 2012;21:5385–5394. doi: 10.1093/hmg/dds382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Delgado D.A., Zhang C., Chen L.S., Gao J., Roy S., Shinkle J., Sabarinathan M., Argos M., Tong L., Ahmed A. Genome-wide association study of telomere length among South Asians identifies a second RTEL1 association signal. J. Med. Genet. 2018;55:64–71. doi: 10.1136/jmedgenet-2017-104922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Cawthon R.M. Telomere measurement by quantitative PCR. Nucleic Acids Res. 2002;30:e47. doi: 10.1093/nar/30.10.e47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Cawthon R.M. Telomere length measurement by a novel monochrome multiplex quantitative PCR method. Nucleic Acids Res. 2009;37 doi: 10.1093/nar/gkn1027. e21–e21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Danesh J., Saracci R., Berglund G., Feskens E., Overvad K., Panico S., Thompson S., Fournier A., Clavel-Chapelon F., Canonico M., EPIC-Heart EPIC-Heart: the cardiovascular component of a prospective study of nutritional, lifestyle and biological factors in 520,000 middle-aged participants from 10 European countries. Eur. J. Epidemiol. 2007;22:129–141. doi: 10.1007/s10654-006-9096-8. [DOI] [PubMed] [Google Scholar]
  • 29.Langenberg C., Sharp S.J., Franks P.W., Scott R.A., Deloukas P., Forouhi N.G., Froguel P., Groop L.C., Hansen T., Palla L. Gene-lifestyle interaction and type 2 diabetes: the EPIC interact case-cohort study. PLoS Med. 2014;11:e1001647. doi: 10.1371/journal.pmed.1001647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Mägi R., Morris A.P. GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics. 2010;11:288. doi: 10.1186/1471-2105-11-288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Storey J.D. A direct approach to false discovery rates. J. R. Statist. Soc. B. 2002;64:479–498. [Google Scholar]
  • 32.Yang J., Lee S.H., Goddard M.E., Visscher P.M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Yang J., Ferreira T., Morris A.P., Medland S.E., Madden P.A., Heath A.C., Martin N.G., Montgomery G.W., Weedon M.N., Loos R.J., Genetic Investigation of ANthropometric Traits (GIANT) Consortium. DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 2012;44:369–375, S1–S3. doi: 10.1038/ng.2213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Pruim R.J., Welch R.P., Sanna S., Teslovich T.M., Chines P.S., Gliedt T.P., Boehnke M., Abecasis G.R., Willer C.J. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics. 2010;26:2336–2337. doi: 10.1093/bioinformatics/btq419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wang K., Li M., Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.O’Leary N.A., Wright M.W., Brister J.R., Ciufo S., Haddad D., McVeigh R., Rajput B., Robbertse B., Smith-White B., Ako-Adjei D. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44(D1):D733–D745. doi: 10.1093/nar/gkv1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Zerbino D.R., Achuthan P., Akanni W., Amode M.R., Barrell D., Bhai J., Billis K., Cummins C., Gall A., Girón C.G. Ensembl 2018. Nucleic Acids Res. 2018;46(D1):D754–D761. doi: 10.1093/nar/gkx1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Harrow J., Frankish A., Gonzalez J.M., Tapanari E., Diekhans M., Kokocinski F., Aken B.L., Barrell D., Zadissa A., Searle S. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 2012;22:1760–1774. doi: 10.1101/gr.135350.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Karolchik D., Baertsch R., Diekhans M., Furey T.S., Hinrichs A., Lu Y.T., Roskin K.M., Schwartz M., Sugnet C.W., Thomas D.J., University of California Santa Cruz The UCSC Genome Browser Database. Nucleic Acids Res. 2003;31:51–54. doi: 10.1093/nar/gkg129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wang J., Dayem Ullah A.Z., Chelala C. IW-Scoring: an Integrative Weighted Scoring framework for annotating and prioritizing genetic variations in the noncoding genome. Nucleic Acids Res. 2018;46:e47. doi: 10.1093/nar/gky057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Barbeira A.N., Dickinson S.P., Bonazzola R., Zheng J., Wheeler H.E., Torres J.M., Torstenson E.S., Shah K.P., Garcia T., Edwards T.L., GTEx Consortium Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat. Commun. 2018;9:1825. doi: 10.1038/s41467-018-03621-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Gamazon E.R., Wheeler H.E., Shah K.P., Mozaffari S.V., Aquino-Michaels K., Carroll R.J., Eyler A.E., Denny J.C., Nicolae D.L., Cox N.J., Im H.K., GTEx Consortium A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 2015;47:1091–1098. doi: 10.1038/ng.3367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Fortune M.D., Guo H., Burren O., Schofield E., Walker N.M., Ban M., Sawcer S.J., Bowes J., Worthington J., Barton A. Statistical colocalization of genetic risk variants for related autoimmune diseases in the context of common controls. Nat. Genet. 2015;47:839–846. doi: 10.1038/ng.3330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Battle A., Brown C.D., Engelhardt B.E., Montgomery S.B., GTEx Consortium. Laboratory, Data Analysis &Coordinating Center (LDACC)—Analysis Working Group. Statistical Methods groups—Analysis Working Group. Enhancing GTEx (eGTEx) groups. NIH Common Fund. NIH/NCI. NIH/NHGRI. NIH/NIMH. NIH/NIDA. Biospecimen Collection Source Site—NDRI. Biospecimen Collection Source Site—RPCI. Biospecimen Core Resource—VARI. Brain Bank Repository—University of Miami Brain Endowment Bank. Leidos Biomedical—Project Management. ELSI Study. Genome Browser Data Integration &Visualization—EBI. Genome Browser Data Integration &Visualization—UCSC Genomics Institute, University of California Santa Cruz. Lead analysts. Laboratory, Data Analysis &Coordinating Center (LDACC) NIH program management. Biospecimen collection. Pathology. eQTL manuscript working group Genetic effects on gene expression across human tissues. Nature. 2017;550:204–213. [Google Scholar]
  • 45.Giambartolomei C., Vukcevic D., Schadt E.E., Franke L., Hingorani A.D., Wallace C., Plagnol V. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Guo H., Fortune M.D., Burren O.S., Schofield E., Todd J.A., Wallace C. Integration of disease association and eQTL data using a Bayesian colocalisation approach highlights six candidate causal genes in immune-mediated diseases. Hum. Mol. Genet. 2015;24:3305–3313. doi: 10.1093/hmg/ddv077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Jin Y., Andersen G., Yorgov D., Ferrara T.M., Ben S., Brownson K.M., Holland P.J., Birlea S.A., Siebert J., Hartmann A. Genome-wide association studies of autoimmune vitiligo identify 23 new risk loci and highlight key pathways and regulatory variants. Nat. Genet. 2016;48:1418–1424. doi: 10.1038/ng.3680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Bonder M.J., Luijk R., Zhernakova D.V., Moed M., Deelen P., Vermaat M., van Iterson M., van Dijk F., van Galen M., Bot J., BIOS Consortium Disease variants alter transcription factor levels and methylation of their binding sites. Nat. Genet. 2017;49:131–138. doi: 10.1038/ng.3721. [DOI] [PubMed] [Google Scholar]
  • 49.Chen L., Ge B., Casale F.P., Vasquez L., Kwan T., Garrido-Martín D., Watt S., Yan Y., Kundu K., Ecker S. Genetic Drivers of Epigenetic and Transcriptional Variation in Human Immune Cells. Cell. 2016;167:1398–1414.e24. doi: 10.1016/j.cell.2016.10.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gaunt T.R., Shihab H.A., Hemani G., Min J.L., Woodward G., Lyttleton O., Zheng J., Duggirala A., McArdle W.L., Ho K. Systematic identification of genetic influences on methylation across the human life course. Genome Biol. 2016;17:61. doi: 10.1186/s13059-016-0926-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Gusev A., Ko A., Shi H., Bhatia G., Chung W., Penninx B.W.J.H., Jansen R., de Geus E.J.C., Boomsma D.I., Wright F.A. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 2016;48:245–252. doi: 10.1038/ng.3506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Wright D.J., Day F.R., Kerrison N.D., Zink F., Cardona A., Sulem P., Thompson D.J., Sigurjonsdottir S., Gudbjartsson D.F., Helgason A. Genetic variants associated with mosaic Y chromosome loss highlight cell cycle genes and overlap with cancer susceptibility. Nat. Genet. 2017;49:674–679. doi: 10.1038/ng.3821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Mi H., Muruganujan A., Ebert D., Huang X., Thomas P.D. PANTHER version 14: more genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Res. 2019;47(D1):D419–D426. doi: 10.1093/nar/gky1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Pers T.H., Karjalainen J.M., Chan Y., Westra H.J., Wood A.R., Yang J., Lui J.C., Vedantam S., Gustafsson S., Esko T., Genetic Investigation of ANthropometric Traits (GIANT) Consortium Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 2015;6:5890. doi: 10.1038/ncomms6890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T., The Gene Ontology Consortium Gene ontology: tool for the unification of biology. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Kanehisa M., Goto S., Sato Y., Furumichi M., Tanabe M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012;40:D109–D114. doi: 10.1093/nar/gkr988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Croft D., O’Kelly G., Wu G., Haw R., Gillespie M., Matthews L., Caudy M., Garapati P., Gopinath G., Jassal B. Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res. 2011;39:D691–D697. doi: 10.1093/nar/gkq1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Lage K., Karlberg E.O., Størling Z.M., Olason P.I., Pedersen A.G., Rigina O., Hinsby A.M., Tümer Z., Pociot F., Tommerup N. A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat. Biotechnol. 2007;25:309–316. doi: 10.1038/nbt1295. [DOI] [PubMed] [Google Scholar]
  • 59.Blake J.A., Eppig J.T., Kadin J.A., Richardson J.E., Smith C.L., Bult C.J., the Mouse Genome Database Group Mouse Genome Database (MGD)-2017: community knowledge resource for the laboratory mouse. Nucleic Acids Res. 2017;45(D1):D723–D729. doi: 10.1093/nar/gkw1040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D., Amin N., Schwikowski B., Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Burgess S., Butterworth A., Thompson S.G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 2013;37:658–665. doi: 10.1002/gepi.21758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Sudlow C., Gallacher J., Allen N., Beral V., Burton P., Danesh J., Downey P., Elliott P., Green J., Landray M. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:e1001779. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Marchini J., Howie B., Myers S., McVean G., Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 2007;39:906–913. doi: 10.1038/ng2088. [DOI] [PubMed] [Google Scholar]
  • 64.Bowden J., Davey Smith G., Haycock P.C., Burgess S. Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Zhao Q., Wang J., Hemani G., Bowden J., Small D.S. Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score. arXiv:1801.09652. 2018 [Google Scholar]
  • 66.Bowden J., Davey Smith G., Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 2015;44:512–525. doi: 10.1093/ije/dyv080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Hemani G., Tilling K., Davey Smith G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. 2017;13:e1007081. doi: 10.1371/journal.pgen.1007081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Zheng J., Erzurumluoglu A.M., Elsworth B.L., Kemp J.P., Howe L., Haycock P.C., Hemani G., Tansey K., Laurin C., Pourcain B.S., Early Genetics and Lifecourse Epidemiology (EAGLE) Eczema Consortium LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics. 2017;33:272–279. doi: 10.1093/bioinformatics/btw613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Staley J.R., Blackshaw J., Kamat M.A., Ellis S., Surendran P., Sun B.B., Paul D.S., Freitag D., Burgess S., Danesh J. PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics. 2016;32:3207–3209. doi: 10.1093/bioinformatics/btw373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Dorajoo R., Chang X., Gurung R.L., Li Z., Wang L., Wang R., Beckman K.B., Adams-Haduch J., M Y., Liu S. Loci for human leukocyte telomere length in the Singaporean Chinese population and trans-ethnic genetic studies. Nat. Commun. 2019;10:2491. doi: 10.1038/s41467-019-10443-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Nelson C.P., Goel A., Butterworth A.S., Kanoni S., Webb T.R., Marouli E., Zeng L., Ntalla I., Lai F.Y., Hopewell J.C., EPIC-CVD Consortium. CARDIoGRAMplusC4D. UK Biobank CardioMetabolic Consortium CHD working group Association analyses based on false discovery rate implicate new loci for coronary artery disease. Nat. Genet. 2017;49:1385–1391. doi: 10.1038/ng.3913. [DOI] [PubMed] [Google Scholar]
  • 72.Wang X.G., Wang Z.Q., Tong W.M., Shen Y. PARP1 Val762Ala polymorphism reduces enzymatic activity. Biochem. Biophys. Res. Commun. 2007;354:122–126. doi: 10.1016/j.bbrc.2006.12.162. [DOI] [PubMed] [Google Scholar]
  • 73.Beneke S., Cohausz O., Malanga M., Boukamp P., Althaus F., Bürkle A. Rapid regulation of telomere length is mediated by poly(ADP-ribose) polymerase-1. Nucleic Acids Res. 2008;36:6309–6317. doi: 10.1093/nar/gkn615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Gomez M., Wu J., Schreiber V., Dunlap J., Dantzer F., Wang Y., Liu Y. PARP1 Is a TRF2-associated poly(ADP-ribose)polymerase and protects eroded telomeres. Mol. Biol. Cell. 2006;17:1686–1696. doi: 10.1091/mbc.E05-07-0672. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Lee J., Zhou P. DCAFs, the missing link of the CUL4-DDB1 ubiquitin ligase. Mol. Cell. 2007;26:775–780. doi: 10.1016/j.molcel.2007.06.001. [DOI] [PubMed] [Google Scholar]
  • 76.Garvin A.J., Densham R.M., Blair-Reid S.A., Pratt K.M., Stone H.R., Weekes D., Lawrence K.J., Morris J.R. The deSUMOylase SENP7 promotes chromatin relaxation for homologous recombination DNA repair. EMBO Rep. 2013;14:975–983. doi: 10.1038/embor.2013.141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Liu S., Chu J., Yucer N., Leng M., Wang S.Y., Chen B.P., Hittelman W.N., Wang Y. RING finger and WD repeat domain 3 (RFWD3) associates with replication protein A (RPA) and facilitates RPA-mediated DNA damage response. J. Biol. Chem. 2011;286:22314–22322. doi: 10.1074/jbc.M111.222802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Bartocci C., Diedrich J.K., Ouzounov I., Li J., Piunti A., Pasini D., Yates J.R., 3rd, Lazzerini Denchi E. Isolation of chromatin from dysfunctional telomeres reveals an important role for Ring1b in NHEJ-mediated chromosome fusions. Cell Rep. 2014;7:1320–1332. doi: 10.1016/j.celrep.2014.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Arnoult N., Karlseder J. Complex interactions between the DNA-damage response and mammalian telomeres. Nat. Struct. Mol. Biol. 2015;22:859–866. doi: 10.1038/nsmb.3092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Knies K., Inano S., Ramírez M.J., Ishiai M., Surrallés J., Takata M., Schindler D. Biallelic mutations in the ubiquitin ligase RFWD3 cause Fanconi anemia. J. Clin. Invest. 2017;127:3013–3027. doi: 10.1172/JCI92069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Krenciute G., Liu S., Yucer N., Shi Y., Ortiz P., Liu Q., Kim B.J., Odejimi A.O., Leng M., Qin J., Wang Y. Nuclear BAG6-UBL4A-GET4 complex mediates DNA damage signaling and cell death. J. Biol. Chem. 2013;288:20547–20557. doi: 10.1074/jbc.M112.443416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Kim M.K., Kang M.R., Nam H.W., Bae Y.S., Kim Y.S., Chung I.K. Regulation of telomeric repeat binding factor 1 binding to telomeres by casein kinase 2-mediated phosphorylation. J. Biol. Chem. 2008;283:14144–14152. doi: 10.1074/jbc.M710065200. [DOI] [PubMed] [Google Scholar]
  • 83.Franzolin E., Pontarin G., Rampazzo C., Miazzi C., Ferraro P., Palumbo E., Reichard P., Bianchi V. The deoxynucleotide triphosphohydrolase SAMHD1 is a major regulator of DNA precursor pools in mammalian cells. Proc. Natl. Acad. Sci. USA. 2013;110:14272–14277. doi: 10.1073/pnas.1312033110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Jobert L., Skjeldam H.K., Dalhus B., Galashevskaya A., Vågbø C.B., Bjørås M., Nilsen H. The human base excision repair enzyme SMUG1 directly interacts with DKC1 and contributes to RNA quality control. Mol. Cell. 2013;49:339–345. doi: 10.1016/j.molcel.2012.11.010. [DOI] [PubMed] [Google Scholar]
  • 85.Irwin C.R., Hitt M.M., Evans D.H. Targeting Nucleotide Biosynthesis: A Strategy for Improving the Oncolytic Potential of DNA Viruses. Front. Oncol. 2017;7:229. doi: 10.3389/fonc.2017.00229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Reichard P. Interactions between deoxyribonucleotide and DNA synthesis. Annu. Rev. Biochem. 1988;57:349–374. doi: 10.1146/annurev.bi.57.070188.002025. [DOI] [PubMed] [Google Scholar]
  • 87.Bebenek K., Roberts J.D., Kunkel T.A. The effects of dNTP pool imbalances on frameshift fidelity during DNA replication. J. Biol. Chem. 1992;267:3589–3596. [PubMed] [Google Scholar]
  • 88.Ojha J., Codd V., Nelson C.P., Samani N.J., Smirnov I.V., Madsen N.R., Hansen H.M., de Smith A.J., Bracci P.M., Wiencke J.K., ENGAGE Consortium Telomere Group Genetic Variation Associated with Longer Telomere Length Increases Risk of Chronic Lymphocytic Leukemia. Cancer Epidemiol. Biomarkers Prev. 2016;25:1043–1049. doi: 10.1158/1055-9965.EPI-15-1329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Córdoba-Lanús E., Cazorla-Rivero S., Espinoza-Jiménez A., de-Torres J.P., Pajares M.J., Aguirre-Jaime A., Celli B., Casanova C. Telomere shortening and accelerated aging in COPD: findings from the BODE cohort. Respir. Res. 2017;18:59. doi: 10.1186/s12931-017-0547-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Kurz D.J., Kloeckener-Gruissem B., Akhmedov A., Eberli F.R., Bühler I., Berger W., Bertel O., Lüscher T.F. Degenerative aortic valve stenosis, but not coronary disease, is associated with shorter telomere length in the elderly. Arterioscler. Thromb. Vasc. Biol. 2006;26:e114–e117. doi: 10.1161/01.ATV.0000222961.24912.69. [DOI] [PubMed] [Google Scholar]
  • 91.Steer S.E., Williams F.M., Kato B., Gardner J.P., Norman P.J., Hall M.A., Kimura M., Vaughan R., Aviv A., Spector T.D. Reduced telomere length in rheumatoid arthritis is independent of disease activity and duration. Ann. Rheum. Dis. 2007;66:476–480. doi: 10.1136/ard.2006.059188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.van der Harst P., van der Steege G., de Boer R.A., Voors A.A., Hall A.S., Mulder M.J., van Gilst W.H., van Veldhuisen D.J., MERIT-HF Study Group Telomere length of circulating leukocytes is decreased in patients with chronic heart failure. J. Am. Coll. Cardiol. 2007;49:1459–1464. doi: 10.1016/j.jacc.2007.01.027. [DOI] [PubMed] [Google Scholar]
  • 93.Tong A.S., Stern J.L., Sfeir A., Kartawinata M., de Lange T., Zhu X.D., Bryan T.M. ATM and ATR Signaling Regulate the Recruitment of Human Telomerase to Telomeres. Cell Rep. 2015;13:1633–1646. doi: 10.1016/j.celrep.2015.10.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Denchi E.L., de Lange T. Protection of telomeres through independent control of ATM and ATR by TRF2 and POT1. Nature. 2007;448:1068–1071. doi: 10.1038/nature06065. [DOI] [PubMed] [Google Scholar]
  • 95.Egan E.D., Collins K. Biogenesis of telomerase ribonucleoproteins. RNA. 2012;18:1747–1759. doi: 10.1261/rna.034629.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Nguyen D., Grenier St-Sauveur V., Bergeron D., Dupuis-Sandoval F., Scott M.S., Bachand F. A Polyadenylation-Dependent 3′ End Maturation Pathway Is Required for the Synthesis of the Human Telomerase RNA. Cell Rep. 2015;13:2244–2257. doi: 10.1016/j.celrep.2015.11.003. [DOI] [PubMed] [Google Scholar]
  • 97.Boyraz B., Moon D.H., Segal M., Muosieyiri M.Z., Aykanat A., Tai A.K., Cahan P., Agarwal S. Posttranscriptional manipulation of TERC reverses molecular hallmarks of telomere disease. J. Clin. Invest. 2016;126:3377–3382. doi: 10.1172/JCI87547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Schilders G., Raijmakers R., Raats J.M.H., Pruijn G.J.M. MPP6 is an exosome-associated RNA-binding protein involved in 5.8S rRNA maturation. Nucleic Acids Res. 2005;33:6795–6804. doi: 10.1093/nar/gki982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Austin W.R., Armijo A.L., Campbell D.O., Singh A.S., Hsieh T., Nathanson D., Herschman H.R., Phelps M.E., Witte O.N., Czernin J., Radu C.G. Nucleoside salvage pathway kinases regulate hematopoiesis by linking nucleotide metabolism with replication stress. J. Exp. Med. 2012;209:2215–2228. doi: 10.1084/jem.20121061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Davidson M.B., Katou Y., Keszthelyi A., Sing T.L., Xia T., Ou J., Vaisica J.A., Thevakumaran N., Marjavaara L., Myers C.L. Endogenous DNA replication stress results in expansion of dNTP pools and a mutator phenotype. EMBO J. 2012;31:895–907. doi: 10.1038/emboj.2011.485. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1

Figures S1–S4, Supplemental Methods, Description of Individual Loci Associated with LTL, Supplemental Acknowledgments, Supplementary References

mmc1.pdf (1.9MB, pdf)
Document S2. Tables S1–S18
mmc2.xlsx (1.2MB, xlsx)
Document S3. Article plus Supplemental Information
mmc3.pdf (4MB, pdf)

Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics

RESOURCES