Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Sep 1.
Published in final edited form as: Nat Genet. 2017 Feb 6;49(3):426–432. doi: 10.1038/ng.3752

Genetic loci associated with chronic obstructive pulmonary disease overlap with loci for lung function and pulmonary fibrosis

Brian D Hobbs 1,2, Kim de Jong 3,4, Maxime Lamontagne 5, Yohan Bossé 5,6, Nick Shrine 7, María Soler Artigas 7, Louise V Wain 7, Ian P Hall 8, Victoria E Jackson 7, Annah B Wyss 9, Stephanie J London 9, Kari E North 10, Nora Franceschini 10, David P Strachan 11, Terri H Beaty 12, John E Hokanson 13, James D Crapo 14, Peter J Castaldi 1,15, Robert P Chase 1, Traci M Bartz 16,17,18, Susan R Heckbert 16,19,20, Bruce M Psaty 16,17,19,20,21, Sina A Gharib 22, Pieter Zanen 23, Jan W Lammers 23, Matthijs Oudkerk 24, H J Groen 25, Nicholas Locantore 26, Ruth Tal-Singer 26, Stephen I Rennard 27,28, Jørgen Vestbo 29, Wim Timens 30, Peter D Paré 31, Jeanne C Latourelle 32, Josée Dupuis 33,34, George T O’Connor 34,35, Jemma B Wilk 34, Woo Jin Kim 36, Mi Kyeong Lee 36, Yeon-Mok Oh 37, Judith M Vonk 3,4, Harry J de Koning 38, Shuguang Leng 39, Steven A Belinsky 39, Yohannes Tesfaigzi 39, Ani Manichaikul 40,41, Xin-Qun Wang 41, Stephen S Rich 40,41, R Graham Barr 42, David Sparrow 43, Augusto A Litonjua 1,2, Per Bakke 44, Amund Gulsvik 44, Lies Lahousse 45,46, Guy G Brusselle 45,46,47, Bruno H Stricker 45,48,49,50, André G Uitterlinden 45,49,50, Elizabeth J Ampleford 51, Eugene R Bleecker 51, Prescott G Woodruff 52, Deborah A Meyers 51, Dandi Qiao 1, David A Lomas 53, Jae-Joon Yim 54, Deog Kyeom Kim 55, Iwona Hawrylkiewicz 56, Pawel Sliwinski 56, Megan Hardin 1,2,28, Tasha E Fingerlin 57,58, David A Schwartz 57,59,60, Dirkje S Postma 4,25, William MacNee 61, Martin D Tobin 7,62, Edwin K Silverman 1,2, H Marike Boezen 3,4,64, Michael H Cho 1,2,64,*; COPDGene Investigators63; ECLIPSE Investigators63; LifeLines Investigators63; SPIROMICS Research Group63; International COPD Genetics Network Investigators63; UK BiLEVE Investigators63; International COPD Genetics Consortium63
PMCID: PMC5381275  NIHMSID: NIHMS854307  PMID: 28166215

Chronic obstructive pulmonary disease (COPD) is a leading cause of mortality worldwide1. We performed a genetic association in 15,256 cases and 47,936 controls, with replication of select top results (P < 5×10−6) in 9,498 cases and 9,748 controls. In the combined meta-analysis, we identified 22 loci at genome-wide significance, including 13 new associations with COPD. Nine of these 13 loci have been associated with lung function in general population samples27; however, 4 (EEFSEC, DSP, MTCL1, and SFTPD) are novel. We noted 2 loci shared with pulmonary fibrosis8,9 (FAM13A and DSP) but with opposite risk alleles for COPD. None of our loci overlapped with genome-wide associations for asthma; however, one locus has been implicated in the joint susceptibility to asthma and obesity10. We also identified genetic correlation between COPD and asthma. Our findings highlight novel loci, demonstrate the importance of specific lung function loci to COPD, and identify potential regions of genetic overlap between COPD and other respiratory diseases.

COPD is characterized by persistent and progressive airflow limitation diagnosed by lung function testing1. While cigarette smoking is the major risk factor, susceptibility is also influenced by genetics1113. We established the International COPD Genetics Consortium (ICGC) to coordinate efforts to find susceptibility loci14. We defined cases based on pre-bronchodilator evidence of moderate-to-severe airflow limitation by modified GOLD criteria15; controls had normal spirometry, and all analyses were adjusted for age, sex, and cigarette smoking (pack-years and smoking status). We performed a two-stage genome-wide association study (Figure 1). In Stage 1, we combined 26 cohorts (Supplementary Table 1 and 2) containing 63,192 individuals (15,256 COPD cases and 47,936 controls). We selected 79 loci with P < 5×10−6 and in analysis Stage 2, we tested them in the UK BiLEVE dataset (9,498 COPD cases and 9,748 controls) from the UK Biobank and performed an overall meta-analysis (Supplementary Table 3).

Figure 1.

Figure 1

Study design showing cohorts used in each stage of the analysis. ARIC = Atherosclerosis Risk in Communities Study, B58 = British 1958 Birth Cohort, CHS = Cardiovascular Health Study, COPACETIC = COPD Pathology: Addressing Critical gaps, Early Treatment & Diagnosis and Innovative Concepts, ECLIPSE = Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points, eQTL = Lung Expression Quantitative Trait Loci Study, FHS = Framingham Heart Study, KARE = Korean Association Resource project, MESA = Multi-Ethnic Study of Atherosclerosis, NETT-NAS = National Emphysema Treatment Trial/Normative Aging Study, RS = Rotterdam Study, SPIROMICS = Subpopulations and intermediate outcome measures in COPD study, EOCOPD = Boston Early-Onset COPD Study, ICGN = International COPD Genetics Network, TCGS = Transcontinental COPD Genetics Study, UK BiLEVE = UK Biobank Lung Exome Variant Evaluation; NHW = Non-Hispanic white, AA = African American, EA = European American. * Studies without genome-wide array genotyping (custom genotyping)

We identified 13 genome-wide significant (P < 5×10−8) associations in Stage 1. Following the Stage 2 analysis, an additional 9 loci achieved genome-wide significance in the overall meta-analysis (Table 1, Figure 2, Supplementary Figures 1 and 2). Analysis of only European ancestry (Supplementary Table 4) and only African ancestry (Supplementary Table 5 and Supplementary Figure 3) Stage 1 cohorts showed no unique association signals. Of the 22 genome-wide significant loci described in our study, 9 have been previously described as genome- (or exome-) wide significant in studies of COPD13,1619: HHIP, CHRNA5, HTR4, FAM13A, RIN3, TGFB2, GSTCD-NPNT, CYP2A6, and IL27-CCDC101. The remaining 13 loci have not been previously associated with COPD at genome- (or exome-) wide significance. Eight of these 13 loci: ADGRG6/GPR126, THSD4, ADAM19, TET2, CFDP1, AGER, ARMC2, and RARB have been previously described and replicated (Supplementary Table 6) in general population GWASs of two measures of lung function (FEV1 and FEV1/FVC) that are used in conjunction to diagnose COPD2,47,20,21. One locus near PID1 was previously associated with FEV1/FVC, but had not replicated in those studies4,6. Four loci are newly being described as genome-wide significant in association with either COPD or lung function: EEFSEC, DSP, MTCL1, and SFTPD (Figure 3).

Table 1.

Overall study results showing 22 loci with genome-wide significant P values in overall meta-analysis following UK BiLEVE Stage 2 analysis.

rsID Closest Gene Variant Annotation Locus Risk Allele Alt Allele Risk Allele Frequency Stage 1 Analysis Stage 1
P Value
UK BiLEVE (Stage 2) UK BiLEVE (Stage 2)
P Value
Overall Meta-Analysis
P Value
Mean Range OR 95% CI OR 95% CI
rs13141641 HHIP intergenic 4q31.21 T C 0.59 0.52–0.89 1.23 1.18–1.28 1.16E-24 1.21 1.16–1.27 8.15E-18 9.10E-41
rs17486278 CHRNA5 intronic 15q25.1 C A 0.35 0.24–0.44 1.22 1.18–1.27 2.61E-24 1.13 1.08–1.18 2.35E-07 1.77E-28
rs7733088 HTR4 intronic 5q32 G A 0.60 0.47–0.69 1.18 1.13–1.23 4.40E-14 1.18 1.13–1.23 1.78E-13 5.33E-26
rs9399401 ADGRG6 intronic 6q24.1 T C 0.72 0.61–0.75 1.14 1.09–1.19 3.59E-10 1.17 1.12–1.23 6.18E-11 1.81E-19
rs1441358 THSD4 intronic 15q23 G T 0.33 0.19–0.55 1.13 1.09–1.18 2.06E-10 1.12 1.07–1.17 6.87E-07 8.22E-16
rs6837671 FAM13A intronic 4q22.1 G A 0.41 0.36–0.58 1.16 1.11–1.20 1.02E-14 1.07 1.02–1.11 3.75E-03 7.48E-15
rs11727735 GSTCD intronic 4q24 A G 0.94 0.93–0.99 1.27 1.17–1.37 1.55E-08 1.25 1.14–1.36 4.93E-07 3.84E-14
rs754388 RIN3 intronic 14q32.12 C G 0.82 0.80–0.86 1.20 1.14–1.26 7.07E-12 1.11 1.05–1.17 1.85E-04 4.96E-14
rs113897301 ADAM19 intronic 5q33.3 AT A 0.17 0.05–0.19 1.20 1.13–1.26 4.52E-10 1.13 1.07–1.19 2.79E-05 1.58E-13
rs2047409* TET2 intronic 4q24 A G 0.62 0.22–0.65 1.10 1.06–1.15 1.58E-06 1.14 1.09–1.19 1.95E-08 2.46E-13
rs2955083 EEFSEC intronic 3q21.3 A T 0.88 0.85–0.89 1.20 1.12–1.27 2.00E-08 1.17 1.09–1.25 4.01E-06 4.16E-13
rs7186831* CFDP1 intergenic 16q23.1 A G 0.43 0.23–0.47 1.12 1.07–1.18 3.54E-06 1.12 1.07–1.17 6.63E-07 1.12E-11
rs10429950* TGFB2 intergenic 1q41 T C 0.73 0.22–0.77 1.12 1.07–1.16 1.83E-07 1.10 1.04–1.15 1.94E-04 1.66E-10
rs2070600* AGER coding 6p21.32 C T 0.95 0.85–0.99 1.28 1.15–1.41 3.54E-06 1.21 1.10–1.32 2.96E-05 5.94E-10
rs2806356* ARMC2 intronic 6q21 C T 0.18 0.05–0.24 1.12 1.07–1.18 2.84E-06 1.12 1.06–1.18 6.88E-05 8.34E-10
rs16825267* PID1 intergenic 2q36.3 C G 0.93 0.87–0.94 1.24 1.15–1.34 5.22E-08 1.13 1.04–1.22 2.27E-03 1.68E-09
rs2076295 DSP coding 6p24.3 T G 0.55 0.44–0.58 1.11 1.07–1.15 4.95E-08 1.06 1.02–1.11 7.45E-03 3.97E-09
rs647097* MTCL1 intronic 18p11.22 C T 0.27 0.26–0.40 1.11 1.06–1.15 3.03E-06 1.09 1.04–1.14 4.66E-04 6.14E-09
rs1529672 RARB intronic 3p24.2 C A 0.83 0.68–0.86 1.16 1.11–1.22 2.37E-09 1.05 0.99–1.11 9.95E-02 2.47E-08
rs721917* SFTPD coding 10q22.3 G A 0.42 0.39–0.63 1.09 1.05–1.13 2.11E-06 1.07 1.02–1.11 2.60E-03 2.49E-08
rs12459249* CYP2A6 intergenic 19q13.2 C T 0.66 0.62–0.70 1.13 1.07–1.18 2.89E-06 1.08 1.03–1.13 1.35E-03 3.42E-08

OR = odds ratio, CI = confidence interval.

*

Genome-wide significant in overall meta-analysis only

Figure 2.

Figure 2

Manhattan plot showing P values for Stage 1 analysis (small open diamonds) with overlay of overall meta-analysis P values for SNPs analyzed in UK BiLEVE Stage 2 analysis (filled circles). Gene names in gray are previously described COPD or lung function (FEV1 or FEV1/FVC) loci; black are novel loci discovered in this study. The Stage 1 cohorts with available genotyping data (Supplementary Figures 1a–v) and the UK BiLEVE cohort determined the sample size for each top variant. The red dashed line indicates the threshold for genome-wide significance (P value < 5×10−8).

Figure 3. a–d Regional association for novel loci.

Figure 3

LocusZoom plots showing regional association of variants at the four novel COPD loci. The point size is proportional to the sample size, where Stage 1 cohorts with available genotyping data (Supplementary Figures 1a–v) and the UK BiLEVE cohort determined the sample size for each top variant.

To explore the potential function and causal genes for our novel loci, in addition to using publicly available datasets and prioritization tools (Supplementary Table 7), we also examined a larger set of lung expression quantitative trait loci (eQTL) in 1038 subjects, including subjects with COPD22 (Supplementary Table 8). As eQTL are pervasive, we also attempted to determine whether our association signal co-localized23 with an eQTL signal in lung tissue (Supplementary Table 9). We found strong evidence of co-localization (posterior probability > 0.8) for DSP, a major protein of desmosomes required for epidermal integrity24, and MTCL1, important in epithelial-cell-specific microtubule stabilization25,26, and expressed in respiratory epithelial cells27. Variants in strong LD with our top MTCL1 variant rs647097 (NC_000018.9:g.8808464T>C) appear to have enhancer histone marks in fetal lung fibroblasts28,29. In contrast, we found no evidence of a strong eQTL signal or co-localization at our other two novel loci. At 3q21, EEFSEC is a potential candidate, as it is a paralog of TUFM, a top blood and lung eQTL gene for the 16p11.2/IL27 COPD susceptibility locus19, recently part of a novel COPD-related pathway involving NLRX13032. At 10q22, pulmonary surfactant-associated protein D (SFTPD) is the most likely candidate, as it is highly expressed in pneumocytes27, and sftpd (−/−) mice develop pulmonary emphysema33. SFTPD has been explored as a COPD biomarker34, and while rs721917 (NC_000010.10:g.81706324A>G) is not an eQTL, polymorphisms in SFTPD, including rs721917, may lead to decreased surfactant protein D levels35; though the association of SFTPD polymorphisms with COPD susceptibility have been inconsistent. Our analysis also led to some additional insights into other previously described loci. We found evidence of COPD association and eQTL statistical co-localization in lung tissue (posterior probability > 0.8) for THSD4, HHIP, AGER, CHRNA3, and RARB (Supplementary Table 9). Additional data on eQTLs (Supplementary Table 8), cohort-specific associations at each locus (Supplementary Figures 1a–v), fine mapping (Supplementary Note and Supplementary Table 10), causal gene (Supplementary Table 11 and 12), and other supportive analysis for previously described and novel loci can be found in the Supplementary Note.

We note that our top variant at DSP (rs2076295, NC_000006.11:g.7563232T>G) is also associated (P = 1.1×10−19) with pulmonary fibrosis8. Recently, a re-sequencing study36 at the DSP locus identified a second fibrosis-associated variant, rs2744371 (NC_000006.11:g.7554174A>C) with Pfibrosis = 0.002 and PCOPD = 0.04. We also note overlap at the FAM13A locus (top fibrosis SNP8, rs2609255 [NC_000004.11:g.89811195G>T]; Pfibrosis = 2.2×10−11, PCOPD = 1.9×10−7). We performed additional analysis to investigate genetic overlap using gwas-pw37 (see Supplementary Note). We confirmed overlap at the DSP and FAM13A loci with a posterior probability of > 0.99, and additionally discovered overlap near MAPT/KANSL1 (top fibrosis SNP8, rs1981997 [NC_000017.10:g.44056767G>A]; Pfibrosis = 8.87×10−14, PCOPD = 4.5×10−3) with posterior probability of 0.84. While the MAPT/KANSL1 locus did not reach genome-wide significance in our study, we note its independent discovery in a genome-wide association in extremes of lung function7. Notably, for all four of these variants (in DSP [2], FAM13A, and MAPT), the fibrosis risk allele is protective for development of COPD. Emphysema, a key component of COPD, and pulmonary fibrosis are both smoking-related lung diseases that have both shared and distinct pathophysiology3840, though genetic loci with opposing effects have not been previously described. Additional investigation of these loci as a well as a more comprehensive assessment of genetic overlap of COPD and pulmonary fibrosis may lead to insight into both disorders.

Because our analysis relied on a spirometric definition of COPD alone, we did not specifically exclude other causes of airway obstruction such as asthma, which can overlap with COPD in adults41. To define COPD, we used pre-bronchodilator spirometry, which was available across all cohorts, and we included at least moderately affected cases (FEV1 < 80% predicted). We examined the top set of genome-wide significant results in a subset of our largest cohorts with both pre- and post-bronchodilator data and densely imputed genotypes; overall, the effect sizes (mean difference = 0.001) and P values (mean log10 P value difference = 0.18) were similar (Supplementary Table 13 and Supplementary Figures 4 and 5). In addition, a recent GWAS of FEV1, FVC, and FEV1/FVC did not find substantial differences including and excluding subjects with asthma7. In the 79 variants tested in Stage 2, we found no significant difference in the OR for COPD association when including and excluding individuals with asthma (Supplementary Figure 6).

We examined COPD associations of genome-wide significant asthma (and asthma-associated traits) loci from the NHGRI-EBI GWAS Catalog42 (Supplementary Table 14). We also compared our COPD association results to the GABRIEL asthma study43 (Supplementary Tables 15). None of the genome-wide significant loci from asthma and COPD overlapped. Further, no asthma or COPD loci showed Bonferroni-adjusted (for number of look-ups) significant association with the other disease, though several loci showed nominal (P<0.05) significance. The 16p11.2 (CCDC101) locus has been described in the joint susceptibility to asthma and obesity10. COPD susceptibility is strongly related to cigarette smoking. Two of our loci (15q25 and 19q13) have been previously associated with smoking behavior44,45, though we found no additional evidence of overlap in genome-wide significant variants described in the NHGRI-EBI GWAS Catalog42 and Tobacco and Genetics Consortium GWAS45 (Supplementary Tables 16–18). We additionally evaluated overlap of our top 22 loci with COPD comorbidities (Supplementary Table 19) and radiographic imaging features (Supplementary Table 20).

In contrast to minimal overlap in genome-wide significant results with asthma and smoking, we discovered a significant overall genetic correlation of COPD with asthma (rgenetic = 0.38, P = 6.2×10−5) using LD score regression in our European-ancestry subjects46,47. We also assessed genetic correlation with population-based lung function, pulmonary fibrosis, smoking behavior, and two common COPD comorbidities, coronary artery disease and osteoporosis. We identified significant correlation of COPD with lung function and two aspects of smoking behavior, but not with common comorbidities or with pulmonary fibrosis (Figure 4). The lack of significant correlation of COPD with pulmonary fibrosis may indicate our overlapping loci for COPD and pulmonary fibrosis are not representative of a broader disease correlation; alternatively, it could reflect limited sample size or a mix of positive and negative genetic correlations across the genome for the diseases. In potential support of this latter hypothesis, and in contrast to the loci we describe in this study, are recent descriptions of rare variants in telomerase genes predisposing to both emphysema, a key feature of COPD, and pulmonary fibrosis40,48. Our analysis of partitioned heritability identified COPD genetic association enrichment in fetal lung tissue (coefficient P = 3.5×10−7); other bioinformatics analyses also support functional annotation of COPD associations to lung tissue or lung cell types (Supplementary Note).

Figure 4. Genetic correlation (using LD score regression) between COPD and other traits.

Figure 4

Shading and numbers represents strength of correlation. An asterisk indicates nominal (P < 0.05) significance, and a double asterisk indicates significant after Bonferroni correction for number of pairwise comparisons. fev1fvc and fev1 = lung function (FEV1/FVC ratio and FEV1 from CHARGE/SpiroMeta4, asthma taken from the asthma GWAS by the GABRIEL Consortium43, ild = pulmonary fibrosis from Fingerlin et al.8,9, bilSmk = subset of smokers in the UK BiLEVE study7, smkCpd = cigarettes per day smoking from the Tobacco and Genetics (TAG) Consortium45, smkFormer = current versus former smokers from TAG, smkOnset = age of smoking initiation from TAG, smkEver = ever versus never smoking from TAG. cad = coronary artery disease from the CARDIoGRAM study50, height51 and bmi (body mass index)52 from the GIANT consortium, bmdLumbar and bmdFemoral = lumbar and femoral bone mineral density, respectively, from the Genetic Factors for Osteoporosis (GeFOS) Consortium53.

Our study is, to our knowledge, the largest genome-wide association study of COPD cases to date and includes over 60,000 subjects (including 15,256 COPD cases) in the Stage 1 analysis. We chose to combine subjects of different ethnicities, hypothesizing that shared COPD risk factors across ethnicities would outweigh power loss due to heterogeneity. While methods have been developed that can more rigorously assess the degree of overlap and provide additional power in this setting49, none of our non-white cohorts were sufficiently sized or powered for these analyses. COPD is also a highly heterogeneous disease; whether a more precise phenotypic definition would result in greater power is not clear. We used a staged study design and examined overall meta-analysis P-values to determine genome-wide significance. Thus, 9 loci (TET2, CFDP1, TGFB2, AGER, ARMC2, PID1, MTCL1, SFTPD, and CYP2A6) from our Stage 1 analysis, which only reached genome-wide significance in either the Stage 2 UK BiLEVE analysis or the overall meta-analysis, should be further replicated. However, six of these 9 association signals are significant if we consider a Bonferroni correction (P < 6.3×10−4) for the 79 variants tested in Stage 2. Further, 8 of these 9 variants are more strongly associated in the overall meta-analysis compared to Stage 1; the exception is RARB, which has a previously reported association with both lung function4 and airflow obstruction21 (Table 1).

The majority of our significant loci overlap with lung function loci, strengthening the foundation for investigating the relationship of lung function variability in the general population to risk of developing COPD. These loci are unlikely to reflect susceptibility for asthma or for cigarette smoking; however, our association results as a whole show evidence of shared heritability with asthma (supporting investigation into shared genetic etiologies for these diseases) and cigarette smoking behavior (despite adjustment for smoking in our statistical model). We identified functional annotation enrichment for fetal lung cells, supporting a role for early life events contributing to future risk of COPD. Finally, we identify loci that overlap with pulmonary fibrosis, but with opposite risk alleles. Our study highlights the important contribution of genetic association studies to understanding COPD, not only by identifying novel loci, but also illustrating relationships with other pulmonary traits and diseases.

Data Availability Statement

The genome-wide association summary statistics generated in the Stage 1 analysis of the current study are available in the dbGaP repository, https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000179.v5.p2

The Stage 2 analysis summary statistics are available in Supplementary Table 3.

Online Methods

Study Cohorts

We invited investigators from 22 studies with genome-wide association data and COPD case-control or general population samples with spirometry to participate in a genome-wide association meta-analysis. Additionally, we included four cohorts with Illumina HumanExome v1.2 and custom genotyping based primarily on prior top results from a previously published COPD GWAS13, using results with P < 1×10−4 using plink ‘–clump’ on the COPDGene non-Hispanic whites to perform linkage disequilibrium pruning (r2 < 0.8), preferentially retaining both an imputed and genotyped top SNP at each locus. An additional group of variants was a candidate panel, based on results from a previous candidate gene analysis54, as well as variants identified in association with lung function (supplementing the existing content on the array, which included variants from previous genome-wide association studies), including the lead SNP and a 200kb region around that SNP pruned for variants with P < 0.01 and r2 < 0.8, and additional top-ranked SNPs for COPDGene-specific analyses for lung function, bronchodilator responsiveness, exacerbations, and SNPs from candidate genes.

The baseline characteristics of these 26 cohorts can be seen in Supplementary Table 1. Each cohort obtained approval from appropriate ethical/regulatory bodies; informed consent was obtained for all individuals. (Further cohort-specific methods can be found in the Data Supplement.) As most of these cohorts did not have post-bronchodilator spirometry, we used a modified definition of GOLD criteria based on pre-bronchodilator spirometry: forced expiratory volume in 1 second (FEV1) < 80% and FEV1 to forced vital capacity (FVC) ratio of < 0.7 for cases, and FEV1 > 80% and FEV1/FVC > 0.7 for controls. Logistic regression was performed in each cohort, adjusting for age, sex, pack-years of smoking, ever-smoking status, current-smoking status, and ancestry-based principal components, as appropriate for each study. Summary statistics were assessed using EasyQC55 version 10.1. More detailed cohort information, including cohort-specific methods, can be found in the Supplementary Note.

Genome-wide association quality control

Summary statistics, including effect allele and other allele oriented to the + strand, effect allele frequency, chromosome and position (hg19), and imputation quality were uploaded to a secure site at the Brigham and Women’s Hospital/Channing Division of Network Medicine. Quality control assessments included assessing allele frequencies versus 1000 Genomes reference, standard error versus sample size, and quantile-quantile plots. Variants with an imputation quality metric of < 0.3 (provided a higher threshold for imputation quality was not already implemented), a minor allele count (MAC) of < 20 using the effective sample size or the number of cases and adjusted for imputation quality where applicable, were set to missing. Variants were included for meta-analysis if they were present in at least 13 studies (those with European ancestry and at least 7 million markers passing all quality control filters).

Staged GWAS meta-analysis

In Stage 1 of the analysis, we used Metal56,57 version 2011-03-25 to perform a fixed-effects meta-analysis of genome-wide data from 22 studies and four additional COPD cohorts genotyped on an Illumina HumanExome v1.2 platform with custom content; this content included a set of COPD candidate genes and regions identified from prior COPD GWAS efforts13. We adjusted for inflation using genomic control correction in each study. We included study populations with subjects of non-European ancestry in the overall analysis, and additionally examined results limited to study populations of European ancestry. To identify variants to test for association in Stage 2 in the UK BiLEVE study, we selected top results (P < 5×10−6) from the Stage 1 meta-analysis. We selected one lead variant from the chromosome 15q25, FAM13, and HHIP regions, as all of these have been described in multiple COPD GWASs13,16,17,21. For the remainder of the regions, we performed linkage disequilibrium pruning using the PLINK1.9 –clump procedure with an r2 of 0.5, additionally examining these SNPs for the number of cohorts with passing quality control at each variant and including SNPs in strong LD (i.e., part of the same clump) with a lower degree of missingness. To identify independent results, we used GCTA-COJO58,59 on the Stage 1 meta-analysis for variants with P < 5×10−6 using the default distance of 10Mb. We used the COPDGene non-Hispanic whites (as the largest representative population) as the reference population for these analyses. An overall meta-analysis across the Stage 1 and Stage 2 (UK BiLEVE) cohorts was performed and variants with P < 5×10−8 were considered genome-wide significant (Figure 1).

Lung eQTL analysis

Lung expression quantitative trait loci (eQTL) were calculated from 1,111 human subjects who underwent lung surgery at three academic sites, Laval University, University of British Columbia (UBC), and University of Groningen, henceforth referred to as Laval, UBC, and Groningen, respectively. This lung eQTL dataset has been described previously22,60. Briefly, 66.7% to 91.2% of the individuals in this study were current or former smokers and 24.2% to 35.3% had moderate to severe COPD (GOLD spirometry grade 2 to 4). Whole-genome gene expression profiling in the lung was performed on a custom Affymetrix array (GPL10379). Microarray pre-processing and quality controls were described previously22,61,62. Probe sequences were mapped to the human genome (hg19) using Bowtie63 and probes not mapping to a coding region or having a common SNP (MAF ≥ 5%) in their sequence were removed. Expression data were adjusted for age, sex, and smoking status using residuals obtained with the robust fitting of linear models function (rlm) in the R statistical package MASS. Residual values deviating from the median by more than three standard deviations were filtered as outliers. Genotyping was carried on the Illumina Human 1M-Duo BeadChip array.

Twenty-one out of the 22 SNPs (in main manuscript Table 1) were genotyped or imputed in the three cohorts, i.e. Laval, UBC, and Groningen. One of the SNPs, rs7186831, was not well-imputed; a proxy, rs11865296 in modest linkage disequilibrium (r2 = 0.54, 1000 genomes phase 3, EUR) was used instead. These variants were tested for association with adjusted expression traits (43,465 probe sets) in the lung. SNPs within 1 Mb up and downstream of the transcription probe set were considered as local-eQTL. Distant-acting eQTLs were further than 1 Mb away or on a different chromosome. Association tests were carried with PLINK1.964,65 in each cohort and then meta-analyzed using Fisher’s method. All local eQTL with nominal P value < 0.05 in the meta-analysis were considered. To provide an additional overall estimate of eQTL significance, we considered a Bonferroni correction threshold ([0.05/(22 SNPs × 43,465 probe sets) = P value < 5.2 × 10−8]). Statistical analyses were performed in R3.2.366.

Co-localization Analysis

Co-localization of statistical signals between COPD genetic association and eQTL were examined using the coloc R package23. We used phenotypic summary statistics from individuals of European ancestry with genome-wide association data and all eQTL results and examined 500kb flanks around the top 22 genome-wide significant associations found in the overall meta-analysis (Table 1).

Sensitivity Analysis

To estimate the effect of using pre- instead of post-bronchodilator lung function on our results, we examined the top set of genome-wide significant results in our largest cohorts with both pre- and post-bronchodilator data and densely imputed genotypes (COPDGene NHW and AA, ECLIPSE, NETT-NAS, and Norway/GenKOLS). Since subjects from these cohorts (except for COPDGene) were included based on post-bronchodilator values, including all subjects with COPD based on post-bronchodilator spirometry would lead to larger sample sizes and make comparison of P-values more difficult. Thus, we chose a random sample of post-bronchodilator cases and controls that matched the number of pre-bronchodilator cases and controls. We performed logistic regression using these equal sized set of pre- and post-bronchodilator cohorts, and meta-analyzed the results.

Asthma overlap analysis

We assessed the overlap between our results and known asthma susceptibility loci. We downloaded information on genome-wide significant (P < 5×10−8) associations with asthma and asthma-related traits including asthma and hay fever, asthma (childhood onset), asthma (corticosteroid response), bronchodilator response in asthma, pulmonary function decline, and severe asthma in the NHGRI-EBI GWAS Catalog42. Additionally, we examined top associated variants (which were not genome-wide significant) in the susceptibility to the asthma-COPD overlap syndrome67. In all, we assessed the association statistics of 49 unique asthma-associated trait loci across 26 genomic regions in our Stage 1 meta-analysis results. We also examined the asthma association statistics of our top COPD loci from overall meta-analysis using publically available asthma GWAS data from the GABRIEL Consortium43. For COPD loci not present in the GABRIEL Consortium asthma GWAS data, we attempted to examine proxy SNPs in LD (r2 > 0.5, 1000 genomes phase 1 CEU) with our top COPD loci.

To examine the genetic correlation47 of COPD and asthma over the entire genome, we performed LD score regression46 using summary statistics from publically available asthma GWAS data from the GABRIEL Consortium43. For all comparisons using LD score regression, we filtered to HapMap3 variants, limited to European-ancestry subjects with genome-wide data, and filtered on missingness using default parameters in munge_sumstats.py. For the GABRIEL data, we required a variant to be present in at least 35 of the studies.

Smoking behavior overlap analysis

We downloaded information on genome-wide significant (P < 5×10−8) associations with the traits “nicotine dependence” and “smoking behaviour” in the NHGRI-EBI GWAS Catalog42. We assessed these top smoking-associated SNPs in our Stage 1 meta-analysis results. We also assessed overlap of smoking and COPD in the publically available summary statistics from the 2010 Tobacco and Genetics Consortium GWAS45. We evaluated our top COPD loci associations from overall meta-analysis with both cigarettes per day and ever-smoking traits. For COPD risk SNPs not directly analyzed in the Tobacco and Genetics Consortium GWAS, we attempted to examine proxy SNPs in LD (r2 > 0.5, 1000 genomes phase1 CEU) with our top COPD loci.

To examine the genetic correlation47 of COPD and smoking behaviours (cigarettes per day and ever-smoking status) over the entire genome, we performed LD score regression46 using summary statistics from our current COPD study as noted above and publically available summary statistics from the 2010 Tobacco and Genetics Consortium GWAS45.

Fine mapping analysis

We attempted to determine, at each locus, whether we could identify a potentially causal variant. We performed these analyses using European ancestry subjects alone, and in all subjects with genome-wide data, and excluded variants that were not present in at least 80% of the full sample. We assumed a single causal variant at each locus, examined a +/− 250kb region around the top variant, and calculated approximate Bayes factors using the method of Wakefield68 to determine the 95% credible set. While specific trans-ethnic mapping approaches69,70 can significantly assist in identifying causal loci, we found that the number of non-European samples in our study were likely insufficient to leverage these methods.

Functional enrichment analysis

To identify enriched cell types for our COPD associations, we applied LD score regression to GenoSkyline71 lung tissue annotations (the default LD score regression annotations collapse lung into the cardiovascular tissue type), as well as cell-type specific annotations from LD score regression46. We also performed analysis using only the 22 genome-wide significant loci and tested for enrichment of imputed chromatin marks from ROADMAP using HaploReg 4.129. Further, we applied a more sophisticated analysis adjusting for local linkage disequilibrium patterns, GoShifter72. Finally, we examined overlap with gene expression datasets using SNPsea73.

Additional pulmonary fibrosis and COPD overlap analysis

To further examine overlapping loci for COPD and pulmonary fibrosis, we combined summary statistics from our study and the pulmonary fibrosis GWAS by Fingerlin et al.8,9 using gwas-pw37.

Causal gene analysis

For the genome-wide significant loci from the overall meta-analysis, we explored potential causative genes at each association locus using the PrixFixe method74, assuming co-function of all significant loci. As required by the PrixFixe method, we assured our genome-wide significant loci were present in dbSNP v13775 and were represented HapMap76 phase III data; for loci not meeting these requirements, proxy SNPs from HapMap phase III were selected based on strongest LD (r2) with index SNP (see Supplementary Table 11 for details of the proxy variant used at each genome-wide significant locus).

COPD comorbidity overlap analysis

We assessed the overlap between our results and two common COPD comorbidities, coronary artery disease and osteoporosis (through bone mineral density traits). We downloaded information on genome-wide significant (P < 5×10−8) associations with these comorbidities as reported in the NHGRI-EBI GWAS Catalog42. We assessed the association statistics of these comorbid trait loci in our Stage 1 meta-analysis results.

Quantitative imaging overlap analysis

To explore the relationship between our top COPD-associated variants and imaging features of emphysema and airway thickness, we queried data from a GWAS of COPD quantitative imaging features77. For each genome-wide significant COPD susceptibility locus in our overall meta-analysis, we assessed the corresponding quantitative imaging GWAS effect size, effect direction, and P value for association with the following quantitative imaging traits: %LAA-950 (percentage low attenuation area, using a threshold of −950 Hounsfield units); Perc15 (value of Hounsfield units at the 15th percentile of the density histogram); Pi10 (airway wall area: the value for a hypothetical airway of 10 mm internal perimeter obtained by plotting a regression line of the square root of the airway wall area versus the airway internal perimeter); and WAP (percentage of the wall area compared to the total bronchial area).

Gene set enrichment analysis

As an attempt to minimize false positives in our gene set enrichment analysis, we divided the Stage 1 GWAS cohorts with full genome-wide data into two sets of roughly equal size. We then used i-GSEA4GWAS (http://gsea4gwas.psych.ac.cn/)78 for each of the two GWAS data sets to assess enrichment of COPD GWAS loci in BioCarta (http://cgap.nci.nih.gov/Pathways/BioCarta_Pathways) and KEGG79 pathways as well as gene ontology (GO) terms80,81. We first evaluated GO terms and pathways with a false-discovery rate (FDR) less than 5% in both analysis sets and then used a more stringent threshold of FDR < 1% to evaluate overlap of GO term and pathway enrichment in our two analysis sets.

Supplementary Material

Supplement

Acknowledgments

Please refer to the Supplementary Note for full acknowledgements.

Footnotes

Author Contributions:

B.D.H. & M.H.C. contributed to the study concept and design, data analysis, statistical support, and manuscript writing. K.d.J., A.B.W., S.J.L., & D.P.S. contributed to the study concept and design & data analysis. N.S. & M.S.A. contributed to the data analysis and statistical support. T.H.B. & J.E.H. contributed to the study concept and design and statistical support. L.L. contributed to the data collection, data analysis, and statistical support. K.E.N. contributed to data collection and data analysis. J.D.C., B.M.P., N.L., R.T.S., G.T.O., Y.T., R.G.B., S.I.R., P.B., A.G., P.G.W., D.A.M., D.A.S, & E.K.S. contributed to the study concept and design and to data collection. D.Q., T.A.F., M.L., Y.B., N.S., N.F., P.J.C., R.P.C., T.M.B., S.A.G., J.C.L., J.D., J.B.W., M.K.L., S.L., A.M., X.W., & E.J.A. contributed to the data analysis. L.V.W., I.P.H., P.D.P., D.S.P., W.M., M.D.T., & H.M.B. contributed to the study concept and design. S.R.H., M.O., J.V., P.A.D., W.J.K., Y.O., S.S.R., D.S., A.A.L., G.G.B., B.H.S., A.G.U., E.R.B., D.A.L., J.J.Y., D.K.K., I.H., P.S. & M.H. contributed to data collection. All authors, including those whose initials are not listed above, contributed to the critical review and editing of the manuscript and approved the final version of the manuscript.

Competing Financial Interest Statements:

I.P.H. has received grant support from Pfizer.

P.J.C. has received research funding from GSK.

B.P. serves on the DSMB of a clinical trial funded by the manufacturer and on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson.

N.L. and R.T-S. are shareholders and employees of GSK.

S.I.R. is a current employee and shareholder at AstraZeneca. He has served as a consultant, participated in advisory boards, received honorarium for speaking or grant support from: American Board of Internal Medicine, Advantage Healthcare, Almirall, American Thoracic Society, AstraZeneca, Baxter, Boehringer Ingelheim, Chiesi, ClearView Healthcare, Cleveland Clinic, Complete Medical Group, CSL, Dailchi Sankyo, Decision Resources, Forest, Gerson Lehman, Grifols, GroupH, Guidepoint Global, Haymarket, Huron Consulting, Inthought, Johnson and Johnson, Methodist Health System – Dallas, NCI Consulting, Novartis, Pearl, Penn Technology, Pfizer, PlanningShop, PSL FirstWord, Qwessential, Takeda, Theron and WebMD.

W.T. reports reports fees to the Department, all outside the submitted work, from Pfizer, GSK, Chiesi, Roche Diagnostics/Ventana, Biotest, Merck Sharp Dohme, Novartis, Lilly Oncology, Boehringer Ingelheim, and grants from Dutch Asthma Fund.

J.C.L. is currently an employee of GNS Healthcare in Cambridge, MA.

J.B.W. was employed by Pfizer during the time this research was performed.

P.B. has received consulting and lecture fees from AstraZeneca, Boehringer Ingelheim, Chiesi, Novartis, and Teva.

L.L. has performed consultancy for Boehringer Ingelheim GmbH, received an AstraZeneca Scientific Award and travel support from Novartis, European Respiratory Society, and the Belgian Respiratory Society.

P.G.W. has consulted for Amgen, Sanofi, Novartis, Genentech/Roche, Boehringer-Ingelheim, Neostem and has had research grants from Pfizer and Genentech.

D.L. received grant support, honoraria and consultancy fees from GSK for work on the ICGN and ECLIPSE studies, and was a member and then Chaired the GSK Respiratory Therapy Area Board (2009–2015).

M.H. is a current employee at AstraZeneca.

D.A.S. is serving on the scientific advisory boards of Apellis Pharmaceuticals and Pliant Therapeutics, and is the founder and owner of Eleven P15.

D.S.P. - The University of Groningen has received money for Professor Postma regarding a grant for research from Astra Zeneca, Chiesi, Genentec, GSK and Roche. Fees for consultancies were given to the University of Groningen by Astra Zeneca, Boehringer Ingelheim, Chiesi, GSK, Takeda and TEVA.

E.K.S. has received honoraria and consulting fees from Merck, grant support and consulting fees from GSK, and honoraria and travel support from Novartis.

M.H.C. has received grant support from GSK.

References

  • 1.Vestbo J, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease: GOLD executive summary. Am J Respir Crit Care Med. 2013;187:347–65. doi: 10.1164/rccm.201204-0596PP. [DOI] [PubMed] [Google Scholar]
  • 2.Hancock DB, et al. Meta-analyses of genome-wide association studies identify multiple loci associated with pulmonary function. Nat Genet. 2010;42:45–52. doi: 10.1038/ng.500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Repapi E, et al. Genome-wide association study identifies five loci associated with lung function. Nat Genet. 2010;42:36–44. doi: 10.1038/ng.501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Soler Artigas M, et al. Genome-wide association and large-scale follow up identifies 16 new loci influencing lung function. Nat Genet. 2011;43:1082–90. doi: 10.1038/ng.941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hancock DB, et al. Genome-wide joint meta-analysis of SNP and SNP-by-smoking interaction identifies novel loci for pulmonary function. PLoS Genet. 2012;8:e1003098. doi: 10.1371/journal.pgen.1003098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Soler Artigas M, et al. Sixteen new lung function signals identified through 1000 Genomes Project reference panel imputation. Nat Commun. 2015;6:8658. doi: 10.1038/ncomms9658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wain LV, et al. Novel insights into the genetics of smoking behaviour, lung function, and chronic obstructive pulmonary disease (UK BiLEVE): a genetic association study in UK Biobank. Lancet Respir Med. 2015;3:769–81. doi: 10.1016/S2213-2600(15)00283-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Fingerlin TE, et al. Genome-wide association study identifies multiple susceptibility loci for pulmonary fibrosis. Nat Genet. 2013;45:613–20. doi: 10.1038/ng.2609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Fingerlin TE, et al. Genome-wide imputation study identifies novel HLA locus for pulmonary fibrosis and potential role for auto-immunity in fibrotic idiopathic interstitial pneumonia. BMC Genet. 2016;17:74. doi: 10.1186/s12863-016-0377-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gonzalez JR, et al. A common 16p11.2 inversion underlies the joint susceptibility to asthma and obesity. Am J Hum Genet. 2014;94:361–72. doi: 10.1016/j.ajhg.2014.01.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Laurell CB, Eriksson S. The electrophoretic alpha-1-globulin pattern of serum in alpha-1-antitrypsin deficiency. Scandinavian Journal of Clinical and Laboratory Investigation. 1963;15:132–140. [Google Scholar]
  • 12.Silverman EK, et al. Genome-wide linkage analysis of severe, early-onset chronic obstructive pulmonary disease: airflow obstruction and chronic bronchitis phenotypes. Hum Mol Genet. 2002;11:623–32. doi: 10.1093/hmg/11.6.623. [DOI] [PubMed] [Google Scholar]
  • 13.Cho MH, et al. Risk loci for chronic obstructive pulmonary disease: a genome-wide association study and meta-analysis. Lancet Respir Med. 2014;2:214–25. doi: 10.1016/S2213-2600(14)70002-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Silverman EK, et al. Opportunities and challenges in the genetics of COPD 2010: an International COPD Genetics Conference report. COPD. 2011;8:121–35. doi: 10.3109/15412555.2011.558864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mannino DM, Buist AS. Global burden of COPD: risk factors, prevalence, and future trends. Lancet. 2007;370:765–73. doi: 10.1016/S0140-6736(07)61380-4. [DOI] [PubMed] [Google Scholar]
  • 16.Pillai SG, et al. A genome-wide association study in chronic obstructive pulmonary disease (COPD): identification of two major susceptibility loci. PLoS Genet. 2009;5:e1000421. doi: 10.1371/journal.pgen.1000421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Cho MH, et al. Variants in FAM13A are associated with chronic obstructive pulmonary disease. Nat Genet. 2010;42:200–2. doi: 10.1038/ng.535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Cho MH, et al. A genome-wide association study of COPD identifies a susceptibility locus on chromosome 19q13. Hum Mol Genet. 2012;21:947–57. doi: 10.1093/hmg/ddr524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Hobbs BD, et al. Exome Array Analysis Identifies a Common Variant in IL27 Associated with Chronic Obstructive Pulmonary Disease. Am J Respir Crit Care Med. 2016;194:48–57. doi: 10.1164/rccm.201510-2053OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wilk JB, et al. A genome-wide association study of pulmonary function measures in the Framingham Heart Study. PLoS Genet. 2009;5:e1000429. doi: 10.1371/journal.pgen.1000429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wilk JB, et al. Genome-wide association studies identify CHRNA5/3 and HTR4 in the development of airflow obstruction. Am J Respir Crit Care Med. 2012;186:622–32. doi: 10.1164/rccm.201202-0366OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Hao K, et al. Lung eQTLs to help reveal the molecular underpinnings of asthma. PLoS Genet. 2012;8:e1003029. doi: 10.1371/journal.pgen.1003029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Giambartolomei C, et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Vasioukhin V, Bowers E, Bauer C, Degenstein L, Fuchs E. Desmoplakin is essential in epidermal sheet formation. Nat Cell Biol. 2001;3:1076–85. doi: 10.1038/ncb1201-1076. [DOI] [PubMed] [Google Scholar]
  • 25.Sato Y, et al. The novel PAR-1-binding protein MTCL1 has crucial roles in organizing microtubules in polarizing epithelial cells. J Cell Sci. 2013;126:4671–83. doi: 10.1242/jcs.127845. [DOI] [PubMed] [Google Scholar]
  • 26.Sato Y, et al. MTCL1 crosslinks and stabilizes non-centrosomal microtubules on the Golgi membrane. Nat Commun. 2014;5:5266. doi: 10.1038/ncomms6266. [DOI] [PubMed] [Google Scholar]
  • 27.Uhlen M, et al. Proteomics. Tissue-based map of the human proteome. Science. 2015;347:1260419. doi: 10.1126/science.1260419. [DOI] [PubMed] [Google Scholar]
  • 28.Ward LD, Kellis M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 2012;40:D930–4. doi: 10.1093/nar/gkr917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ward LD, Kellis M. HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease. Nucleic Acids Res. 2015 doi: 10.1093/nar/gkv1340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lei Y, et al. The mitochondrial proteins NLRX1 and TUFM form a complex that regulates type I interferon and autophagy. Immunity. 2012;36:933–46. doi: 10.1016/j.immuni.2012.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Lei Y, Wen H, Ting JP. The NLR protein, NLRX1, and its partner, TUFM, reduce type I interferon, and enhance autophagy. Autophagy. 2013;9:432–3. doi: 10.4161/auto.23026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kang MJ, et al. Suppression of NLRX1 in chronic obstructive pulmonary disease. J Clin Invest. 2015;125:2458–62. doi: 10.1172/JCI71747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wert SE, et al. Increased metalloproteinase activity, oxidant production, and emphysema in surfactant protein D gene-inactivated mice. Proc Natl Acad Sci U S A. 2000;97:5972–7. doi: 10.1073/pnas.100448997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Lomas DA, et al. Serum surfactant protein D is steroid sensitive and associated with exacerbations of COPD. Eur Respir J. 2009;34:95–102. doi: 10.1183/09031936.00156508. [DOI] [PubMed] [Google Scholar]
  • 35.Foreman MG, et al. Polymorphisms in surfactant protein-D are associated with chronic obstructive pulmonary disease. Am J Respir Cell Mol Biol. 2011;44:316–22. doi: 10.1165/rcmb.2009-0360OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Mathai SK, et al. Desmoplakin Variants Are Associated with Idiopathic Pulmonary Fibrosis. Am J Respir Crit Care Med. 2016;193:1151–60. doi: 10.1164/rccm.201509-1863OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Pickrell JK, et al. Detection and interpretation of shared genetic influences on 42 human traits. Nat Genet. 2016;48:709–17. doi: 10.1038/ng.3570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Washko GR, et al. Lung volumes and emphysema in smokers with interstitial lung abnormalities. N Engl J Med. 2011;364:897–906. doi: 10.1056/NEJMoa1007285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Chilosi M, Poletti V, Rossi A. The pathogenesis of COPD and IPF: distinct horns of the same devil? Respir Res. 2012;13:3. doi: 10.1186/1465-9921-13-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Stanley SE, et al. Telomerase mutations in smokers with severe emphysema. J Clin Invest. 2015;125:563–70. doi: 10.1172/JCI78554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Soriano JB, et al. The proportional Venn diagram of obstructive lung disease: two approximations from the United States and the United Kingdom. Chest. 2003;124:474–81. doi: 10.1378/chest.124.2.474. [DOI] [PubMed] [Google Scholar]
  • 42.Welter D, et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 2014;42:D1001–6. doi: 10.1093/nar/gkt1229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Moffatt MF, et al. A large-scale, consortium-based genomewide association study of asthma. N Engl J Med. 2010;363:1211–21. doi: 10.1056/NEJMoa0906312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Thorgeirsson TE, et al. Sequence variants at CHRNB3-CHRNA6 and CYP2A6 affect smoking behavior. Nat Genet. 2010;42:448–53. doi: 10.1038/ng.573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Consortium, T.a.G. Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nat Genet. 2010;42:441–7. doi: 10.1038/ng.571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Finucane HK, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015;47:1228–35. doi: 10.1038/ng.3404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Bulik-Sullivan B, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47:1236–41. doi: 10.1038/ng.3406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Stanley SE, et al. Loss-of-function mutations in the RNA biogenesis factor NAF1 predispose to pulmonary fibrosis-emphysema. Sci Transl Med. 2016;8:351ra107. doi: 10.1126/scitranslmed.aaf7837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Coram MA, et al. Leveraging Multi-ethnic Evidence for Mapping Complex Traits in Minority Populations: An Empirical Bayes Approach. Am J Hum Genet. 2015;96:740–52. doi: 10.1016/j.ajhg.2015.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Consortium, C.A.D. et al. Large-scale association analysis identifies new risk loci for coronary artery disease. Nat Genet. 2013;45:25–33. doi: 10.1038/ng.2480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Wood AR, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46:1173–86. doi: 10.1038/ng.3097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Locke AE, et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518:197–206. doi: 10.1038/nature14177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Zheng HF, et al. Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture. Nature. 2015;526:112–7. doi: 10.1038/nature14878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Castaldi PJ, et al. The association of genome-wide significant spirometric loci with chronic obstructive pulmonary disease susceptibility. Am J Respir Cell Mol Biol. 2011;45:1147–53. doi: 10.1165/rcmb.2011-0055OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Winkler TW, et al. Quality control and conduct of genome-wide association meta-analyses. Nat Protoc. 2014;9:1192–212. doi: 10.1038/nprot.2014.071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Abecasis G, Li Y, Willer C. METAL MetaAnalysis Helper. Version release 2011-03-25. 2011 URL: http://genome.sph.umich.edu/wiki/METAL_Program.
  • 57.Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–1. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Yang J, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44:369–75. doi: 10.1038/ng.2213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Lamontagne M, et al. Refining susceptibility loci of chronic obstructive pulmonary disease with lung eqtls. PLoS One. 2013;8:e70220. doi: 10.1371/journal.pone.0070220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Bosse Y, et al. Molecular signature of smoking in human lung tissues. Cancer Res. 2012;72:3753–63. doi: 10.1158/0008-5472.CAN-12-1160. [DOI] [PubMed] [Google Scholar]
  • 62.Lamontagne M, et al. Genetic regulation of gene expression in the lung identifies CST3 and CD22 as potential causal genes for airflow obstruction. Thorax. 2014;69:997–1004. doi: 10.1136/thoraxjnl-2014-205630. [DOI] [PubMed] [Google Scholar]
  • 63.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Chang CC, et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing; Vienna, Austria: 2016. URL: http://www.R-project.org/ [Google Scholar]
  • 67.Hardin M, et al. The clinical and genetic features of COPD-asthma overlap syndrome. Eur Respir J. 2014;44:341–50. doi: 10.1183/09031936.00216013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Wakefield J. Bayes factors for genome-wide association studies: comparison with P-values. Genet Epidemiol. 2009;33:79–86. doi: 10.1002/gepi.20359. [DOI] [PubMed] [Google Scholar]
  • 69.Morris AP. Transethnic meta-analysis of genomewide association studies. Genet Epidemiol. 2011;35:809–22. doi: 10.1002/gepi.20630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Kichaev G, Pasaniuc B. Leveraging Functional-Annotation Data in Trans-ethnic Fine-Mapping Studies. Am J Hum Genet. 2015;97:260–71. doi: 10.1016/j.ajhg.2015.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Lu Q, Powles RL, Wang Q, He BJ, Zhao H. Integrative Tissue-Specific Functional Annotations in the Human Genome Provide Novel Insights on Many Complex Traits and Improve Signal Prioritization in Genome Wide Association Studies. PLoS Genet. 2016;12:e1005947. doi: 10.1371/journal.pgen.1005947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Trynka G, et al. Disentangling the Effects of Colocalizing Genomic Annotations to Functionally Prioritize Non-coding Variants within Complex-Trait Loci. Am J Hum Genet. 2015;97:139–52. doi: 10.1016/j.ajhg.2015.05.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Slowikowski K, Hu X, Raychaudhuri S. SNPsea: an algorithm to identify cell types, tissues and pathways affected by risk loci. Bioinformatics. 2014;30:2496–7. doi: 10.1093/bioinformatics/btu326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Tasan M, et al. Selecting causal genes from genome-wide association studies via functionally coherent subnetworks. Nat Methods. 2015;12:154–9. doi: 10.1038/nmeth.3215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Database of Single Nucleotide Polymorphisms (dbSNP) Bethesda, MD: National Center for Biotechnology Information, National Library of Medicine; (dbSNP Build ID: 137). URL: http://www.ncbi.nlm.nih.gov/SNP/ [Google Scholar]
  • 76.International HapMap, C. The International HapMap Project. Nature. 2003;426:789–96. doi: 10.1038/nature02168. [DOI] [PubMed] [Google Scholar]
  • 77.Cho MH, et al. A Genome-Wide Association Study of Emphysema and Airway Quantitative Imaging Phenotypes. Am J Respir Crit Care Med. 2015;192:559–69. doi: 10.1164/rccm.201501-0148OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Zhang K, Cui S, Chang S, Zhang L, Wang J. i-GSEA4GWAS: a web server for identification of pathways/gene sets associated with traits by applying an improved gene set enrichment analysis to genome-wide association study. Nucleic Acids Res. 2010;38:W90–5. doi: 10.1093/nar/gkq324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Ashburner M, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–9. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Gene Ontology, C. Gene Ontology Consortium: going forward. Nucleic Acids Res. 2015;43:D1049–56. doi: 10.1093/nar/gku1179. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement

Data Availability Statement

The genome-wide association summary statistics generated in the Stage 1 analysis of the current study are available in the dbGaP repository, https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000179.v5.p2

The Stage 2 analysis summary statistics are available in Supplementary Table 3.

RESOURCES