Skip to main content
BMC Genomics logoLink to BMC Genomics
. 2024 Dec 30;25:1255. doi: 10.1186/s12864-024-11165-6

Genome-wide analysis in northern Chinese twins identifies twelve new susceptibility loci for pulmonary function

Tong Wang 1, Weijing Wang 1, Chunsheng Xu 2, Xiaocao Tian 2,, Dongfeng Zhang 1,
PMCID: PMC11684132  PMID: 39736507

Abstract

Background

Previous genome-wide association studies (GWAS) have established association between genetic variants and pulmonary function across various ethnics, whereas such associations are scarcely reported in Chinese adults. Therefore, we conducted an GWAS to explore relationships between genetic variants and pulmonary function among middle-aged Chinese dizygotic twins and further validated the top variants using data from the UK Biobank (UKB).

Methods

In the discovery phase, 139 dizygotic twin pairs were drawn from the Qingdao Twin Registry. Pulmonary function was assessed using three parameters: forced expiratory volume the first second (FEV1), forced vital capacity (FVC), and FEV1/FVC ratio. GWAS was performed using GEMMA, Gene-based analysis was conducted by VEGAS2. And pathway enrichment analysis was performed using PASCAL. In the validation phase, Single-nucleotide polymorphisms (SNPs) with suggestive significance were examined through linear regression analysis of the additive effect model among 1573 Chinese ethnic participants from UKB.

Results

The median age of twin pairs in the study was 49 years. 3 SNPs (rs80345886, rs117883876, and 75139439) related to FEV1 achieved genome-wide significance. Moreover, 222, 150, and 73 SNPs surpassed suggestive evidence level (p < 1 × 10− 5) for FEV1, FVC, and FEV1/FVC, respectively. Among them, 16 SNPs located in TBC1D16 for FEV1, 25 SNPs located in GPR126 for FVC, and 2 SNPs located in CCDC110 for FEV1/FVC, the three genes were also revealed by gene-based analysis. Moreover, 12 novel SNPs related to pulmonary function were validated to reach the nominal significance level (p < 0.05) in the UKB, with some located in the TBC1D16, TAFA5, and MTHFD1L genes.

Conclusion

Our GWAS results on Chinese dizygotic twins provide new references for the genetic regulation on pulmonary function. Twelve novel susceptibility loci are considered as possible crucial to pulmonary function.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-024-11165-6.

Keywords: Pulmonary function, GWAS, Twins

Introduction

Pulmonary function, a heritable trait that reflects the physiological state of the lungs and airways [1], declines with age [2]. It can be assessed by spirometry with three key metrics including forced expiratory volume in one second (FEV1), forced vital capacity (FVC), and the FEV1/FVC ratio. Pulmonary function is a predictor of the incidence [3, 4] and mortality [5, 6] of cardiopulmonary diseases and is crucial for diagnosing and evaluating respiratory diseases such as asthma and chronic obstructive pulmonary disease (COPD). These lung diseases not only impair lung health but also contribute to the development of other diseases, including diabetes [7] and cardiovascular disease [8]. Therefore, understanding the underlying biological mechanisms and factors affecting pulmonary function is essential for the effective prevention and control of respiratory diseases.

Previous research based on family and twin studies has provided evidence that pulmonary function is influenced by genetic factors, with heritability estimates ranging from 0.3 to 0.9 [9, 10]. Researchers have also sought to identify genetic loci associated with pulmonary function through Genome-Wide Association Study (GWAS), which is an extremely effective method for identifying genetic loci associated with complex phenotypes or diseases [11]. To date, most GWASs related to pulmonary function have focused on European and American populations, identifying numerous genetic variants and genes, such as HHIP, GPR126, FAM13A, AGER, GSTCD, and PRDM11 [1215]. Additionally, GWAS involving Latin American and East Asian populations, including Koreans, Mongolians, Japanese and Chinese, have also identified genetic loci and genes related to pulmonary function, some of which overlap with those found in European and American populations (e.g., GPR126 [16], FAM13A [17], AGER [18]), while others are novel, such as KCNE2, PPT2, and TTLL2 [16, 17, 19].

Although some previous GWAS have focused on pulmonary function in East Asian populations, sequencing data from the Northeast Asian Reference Database reveal that despite a shared genetic foundation, distinct ancestral components exist among Koreans, Mongolians, Japanese, and mainland East Asians [20]. To our knowledge, only one study conducted with participants from the China has identified 18 novel loci associated with pulmonary function [21]. Therefore, further GWAS on the Chinese population are essential to expand our understanding of the genetic heterogeneity of pulmonary function across different ethnics [22] and to identify genetic variants associated with pulmonary function in the Chinese population. The twin-based GWAS design can excellently control population stratification and gene-environment correlation, identifying direct genetic effects [23]. Thus, in the current study, we conducted the GWAS based the Chinese twin samples to address specific genetic variants related to pulmonary function in Chinese populations.

Materials and methods

Sample

The twin samples were derived from Qingdao Twin Registry [24], and details of the sample recruitment procedure have been described in previous literatures [25]. Participants were excluded if they were pregnant, lactating, or had incomplete measurement of pulmonary function; the incomplete co-twin pairs were also dropped. The final sample consisted of 139 complete dizygotic (DZ) pairs (278 adults, 141 male and 137 female pairs) with a median age of 49 years (ranging from 40 to 70 years). Informed signed consents were achieved for all participants. Ethical approval for the study was granted by the Regional Ethics Committee of the Qingdao Centers for Disease Control and Prevention Institutional Review Boards, following the principles of the Helsinki Declaration.

Pulmonary function, including FEV1 and FVC, both measured in liters, was assessed by the electronic hand-held spirometer (Micro 0102). The FEV1/FVC ratio was calculated by dividing FEV1 by FVC. Details of measurement procedure have been described in previous literatures [26]. The three pulmonary function indices were transformed by Blom’s formula for normality.

Genotyping, quality control, and imputation

Genomic DNA extracted from whole blood of DZ twins were genotyped using Infinium Omni2.5Exome-8v1.2 genotyping BeadChip platform (Illumina Inc, San Diego, USA). Data from autosome and X chromosome were analyzed. Following quality control criteria [27, 28], we included SNPs with locus missing < 0.05, calling rate > 0.98, minor allele frequencies (MAF) > 0.01, and Hardy-Weinberg equilibrium (HWE) p-value > 1 × 10− 4.

Moreover, IMPUTE2 [29] software was used to impute the untyped SNPs based on the linkage disequilibrium (LD) information by using 1,000 Genomes Project Phase 3 East Asian as the reference panel. The filter criteria for imputed SNPs were INFO > 0.8, MAF > 0.05, and HWE > 1 × 10− 4. A total of 7,405,822 SNPs were finally obtained.

GWAS analysis

SNP-based analyses

GEMMA [30] was used to perform the effect of pulmonary function indices on SNP genotypes, adjusting for age, sex, height, smoking status (never, ever, and now), the first five principle components, and the relatedness matrix. The common Bonferroni-corrected threshold (p < 5 × 10− 8) was set as genome-wide statistical significance, and the threshold (p < 1 × 10− 5) was considered for suggestive evidence level. Locus Zoom applied to display regional information of GWAS results. We submitted suggestively significant SNPs located in non-coding region to Enhancer DB to search which enhancer these SNPs were located in, to GTEx project portal (https://www.gtexportal.org/home/) (Accessed data: August 2023; Data Release: Version 8) to identify expression quantitative trait loci (eQTL) related to these SNPs, to SNP2TFBS to find which transcript factor these SNPs affect.

Gene-based analysis

We conducted gene-based analyses using the VEGAS2 [31] tool on GWAS summary data, with linkage disequilibrium accounted for using the Han Chinese in Beijing, China (CHB) sub-population from the 1000 Genomes Project Asian Population. The analysis incorporated 26,056 unique VEGAS2 gene definitions, comprising 24,769 autosomal genes and 1,287 X-chromosomal genes. Significance thresholds were set at p < 1.92(0.05/26,056) × 10− 6 for multiple testing adjustment, and p < 0.05 for suggestive significance.

Pathway enrichment analysis

We conducted pathway enrichment analysis using the PASCAL tool [32], which provides a library of 1077 pathways sourced from the KEGG, REACTOME, and BIOCARTA databases through the Molecular Signatures Database (MsigDB). In our analysis, we aggregated SNP p-values from GWAS, adjusting for linkage disequilibrium (LD), to compute gene scores. These scores were then used to calculate pathway scores based on the gene scores within each of the pathways identified from the library. The significance of the pathway enrichment was assessed using either chi-squared or empirical scoring methods. We established statistical significance with a Bonferroni-corrected emp-p < 4.64 (0.05/1077) × 10− 5, and set a nominal significance level at emp-p < 0.05.

Validation analysis

To confirm the associations, we obtained the phenotype and genotype data from the third release of UK Biobank data (https://www.ukbiobank.ac.uk/) with Applications ID: 95,715. FEV1 and FVC value were calculated from spirometry blow and FEV1/FVC was calculated by these two indicators. The details about data collection, genotyping, and imputation have been described elsewhere [33, 34]. We restricted the validation sample to Chinese ethnic background (n = 1573). As the previous study described [28], the top SNPs were validated by linear regression analysis of the additive effect model, adjusting for age, sex, smoke status, height and the first 10 genetic PCs. A total of 352 SNPs for FEV1, FVC, and FEV1/FVC with p values < 1 × 10− 5 in the discovery set were typed in the UKB data set and selected for validation. The Bonferroni-corrected significance threshold was p < 1.42 × 10− 4 (0.05/352), and the nominal significance level was p < 0.05. Figure 1 showed the outline of overall study design and analysis steps. Moreover, we used the 12 identified susceptibility loci associated with pulmonary function to construct a polygenic risk score (PRS). Each SNP was weighted according to its effect size derived from the UK Biobank validation results. These weights were then applied to the corresponding SNPs in our target population (twin pairs) to calculate the cumulative PRS for each individual. Generalized estimating equation models were used to assess the association between the computed PRS and pulmonary function, adjusting for age, gender, smoking status, and height. Statistical analyses were performed utilizing R version 4.2.0. The code used in this study can be accessed at the following GitHub repository: https://github.com/TongWang0106/Pulmonary-function-GWAS .

Fig. 1.

Fig. 1

Flowchart of the overall study design and analysis steps

Results

Descriptive statistics of basic characteristics are shown in Supplementary Material 4: Table S1. Our study included 139 dizygotic twin pairs, with a median age of 49 years. The median level (interquartile range) pulmonary function values were 2.06 (0.97) for FEV1, 2.19(0.90) for FVC, and 0.97(0.10) for FEV1/FVC.

SNPs-based genome-wide association study

The association between pulmonary function and SNP genotypes was evaluated using the GEMMA software. The quantile-quantile plot (Fig. 2) illustrates the relationship between the observed and expected GWAS p-values. The deviation of points towards the upper right corner of the QQ plot in our results suggests the presence of significant genetic associations related to pulmonary function. The low level of genomic inflation factor (λ) ranging from 0.997 to 1.007 indicates no evidence of population stratification effects. As the Manhattan plot shows in Fig. 3 SNPs reached the genome-wide significance level for FEV1, and 222 SNPs for FEV1, 150 SNPs for FVC, 73 SNPs for FEV1/FVC reached suggestive association level (p < 1 × 10− 5, Supplementary Material 1: Table S2).

Fig. 2.

Fig. 2

Quantile-quantile (Q-Q) plot of genome-wide association study (GWAS) for FEV1 (a), FVC (b), and FEV1/FVC (c). The x-axis shows the -log10 of expected P-values of the association from the chi-square distribution, and the y-axis shows the -log10 of P-values from the observed chi-square distribution. The black dots represent the observed data with the top hit SNP being colored, and the red line is the expectation under the null hypothesis of no association

Fig. 3.

Fig. 3

Manhattan plot of genome-wide association study (GWAS) of FEV1 (a), FVC (b), and FEV1/FVC (c). The x-axis shows the numbers of autosomes and the X chromosome, and the y-axis shows the -log10 of P-values for statistical significance. The dots represent the SNPs. 222 SNPs for FEV1, 150 SNPs for FVC, 73 SNPs for FEV1/FVC reached suggestive association level

Table 1 displays the top 20 SNPs for pulmonary function. Among the top SNPs for FEV1, three SNPs exceeded genome-wide significance level, including rs80345886 and rs117883876 (p = 8.45 × 10− 10) near MTHFD2P6 gene, and rs75139439 located in the PARM1 gene (p = 2.31 × 10− 9). In addition, three SNPs (rs34982522, rs78238888, and rs78628444, p = 5.88 × 10− 8) were located in the enh8386(chr5: 14,024,025 − 14,042,149) region, which regulates DNAH5 gene. Seven SNPs, including rs4723047 (p = 8.25 × 10− 8) at ADCYAP1R1 gene on chromosome 7p14.3, were illustrated in the locus zoom plots (Supplementary Metarial 4: Figure S1). For FVC, the strongest associated SNP was rs9403380 (p = 1.27 × 10− 7) in the GPR126 gene. Nine other SNPs in the GPR126 gene located on chromosome 6q24.1, are shown in the locus zoom plots (Supplementary Metarial 4: Figure S2). As shown in Figure S2, there was no LD between the nine SNPs and rs9403380, suggesting that these SNPs might independently affect FVC. For FEV1/FVC, including the strongest associated SNP rs61918244 (p = 2.86 × 10− 7), 14 SNPs were located in enh14492 (chr12: 6,369,425-6,399,675) and enh14493 (chr12: 6,399,825-6,409,255), which regulates the PLEKHG6 gene. Supplementary Metarial 4: Figure S3 displays the locus zoom plot of chromosome 12p13.31 regions. In addition, rs4862535 (p = 2.13 × 10− 6) was significantly associated with the expression of CCDC110 gene in lung tissues (p = 4.90 × 10− 5, m-value = 0.934), and rs4950744 (p = 2.81 × 10− 6) was significantly associated with the expression of the TIMM17A gene in lung tissues (p = 1.30 × 10− 6, m-value = 1).

Table 1.

Top 20 SNPs (p < 1 × 10− 5) for association with the pulmonary function in genome-wide association study

SNP Chr band Chr: BP P-value Genes Nearest gene/Functional gene Functional annotation
FEV1
rs80345886 5p13.1 5:41975016 8.45E-10 MTHFD2P6
rs117883876 5p13.3 5:41981801 8.45E-10 MTHFD2P6
rs75139439 4q13.3 4:75872722 2.31E-09 PARM1
rs116821226 3q29 3:195195328 5.14E-08 PDX1,PRRX2 TF
rs191296044 5p13.1 5:41543923 5.23E-08 PLCXD3
rs34982522 5p15.2 5:14026049 5.88E-08 DNAH5 Enhancer
rs78238888 5p15.2 5:14010246 5.88E-08 DNAH5 Enhancer
rs78628444 5p15.2 5:14012334 5.88E-08 DNAH5 Enhancer
rs4723047 7p14.3 7:31139332 8.25E-08 ADCYAP1R1
rs2284225 7p14.3 7:31138503 8.79E-08 ADCYAP1R1
rs1008385 7p14.3 7:31151579 1.05E-07 PRRX2 TF
rs781590047 Xq26.1 23:128565706 1.06E-07 RNA5SP513
rs35031103 3q26.32 3:176086799 1.28E-07 RP11-71G7.1
rs6954589 7p14.3 7:31147965 1.34E-07 ADCYAP1R1
rs4540319 7p14.3 7:31149053 1.47E-07 ADCYAP1R1
rs4503014 7p14.3 7:31149140 1.47E-07 ADCYAP1R1
rs4723046 7p14.3 7:31139200 1.6E-07 ADCYAP1R1
rs10462025 5p13.1 5:41820770 1.62E-07 OXCT1
rs2302476 7p14.3 7:31146455 1.67E-07 ADCYAP1R1
rs117091220 5p14.1 5:27267776 1.8E-07 CTD-3007L5.1
FVC
rs9403380 6q24.1-6q24.2 6:142652416 1.27E-07 GPR126
rs116821226 3q29 3:195195328 1.30E-07 PDX1,PRRX2 TF
rs4540319 7p14.3 7:31149053 1.48E-07 ADCYAP1R1
rs4503014 7p14.3 7:31149140 1.48E-07 ADCYAP1R1
rs972982 6q24.1-6q24.2 6:142666861 1.65E-07 GPR126
rs9496346 6q24.1-6q24.2 6:142669338 1.65E-07 GPR126
rs6686611 1q42.12 1:225610906 2.05E-07 LBR
rs1008385 7p14.3 7:31151579 2.44E-07 PRRX2 TF
rs6954589 7p14.3 7:31147965 2.63E-07 ADCYAP1R1
rs78275852 19q12 19:29708754 3.29E-07 UQCRFS1
rs2302476 7p14.3 7:31146455 3.44E-07 ADCYAP1R1
rs6937121 6q24.1-6q24.2 6:142707133 3.69E-07 GPR126
rs80345886 5p13.1 5:41975016 3.71E-07 MTHFD2P69
rs117883876 5p13.1 5:41981801 3.71E-07 MTHFD2P69
rs9389985 6q24.1-6q24.2 6:142653898 3.99E-07 GPR126
rs2039986 6q24.1-6q24.2 6:142655238 3.99E-07 GPR126
rs1891308 6q24.1-6q24.2 6:142648235 4.29E-07 GPR126
rs2050157 6q24.1-6q24.2 6:142658162 5.27E-07 GPR126
rs6570507 6q24.1-6q24.2 6:142679572 5.27E-07 GPR126
rs7741741 6q24.1-6q24.2 6:142655801 5.28E-07 GPR126
FEV1/FVC
rs61918244 12p13.31 12:6403174 2.86E-07 PLEKHG6 Enhancer
rs61918203 12p13.31 12:6396589 6.47E-07 PLEKHG6 Enhancer
rs144870434 10q23.1 10:85877161 1.04E-06 GHITM Enhancer
rs111457230 12p13.31 12:6399669 1.57E-06 PLEKHG6 Enhancer
rs113872288 12p13.31 12:6399666 1.57E-06 PLEKHG6 Enhancer
rs1366756831 11p15.2 11:13815376 1.77E-06 LINC02548
rs16932500 12p13.31 12:6394730 1.95E-06 PLEKHG6 Enhancer
rs1990385 12p13.31 12:6394635 1.96E-06 PLEKHG6 Enhancer
rs1990386 12p13.31 12:6394443 1.96E-06 PLEKHG6 Enhancer
rs61918201 12p13.31 12:6394883 1.96E-06 PLEKHG6 Enhancer
rs4862535 4q35.1 4:186397351 2.13E-06 CCDC110 eQTL(lung)
rs61918243 12p13.31 12:6402906 2.14E-06 PLEKHG6 Enhancer
rs60560940 12p13.31 12:6402471 2.15E-06 PLEKHG6 Enhancer
rs61918242 12p13.31 12:6402692 2.15E-06 PLEKHG6 Enhancer
rs2363884 12p13.31 12:6394322 2.26E-06 PLEKHG6 Enhancer
rs1178679169 12p13.31 12:6398519 2.43E-06 PLEKHG6 Enhancer
rs61918206 12p13.31 12:6401304 2.59E-06 PLEKHG6 Enhancer
rs4950744 1q32.1 1:201923383 2.75E-06 TIMM17A eQTL(lung)
rs891603236 16q23.2 16:80616315 2.81E-06 LINC01227
rs11615534 12p13.31 12:6409706 2.87E-06 PLEKHG6

eQTL: expression Quantitative Trait Loci; TF: Transcription Factor

Gene based analysis

The results of gene-based analysis by VEGAS2 showed that no gene reached the genome-wide significance level (p < 1.92 × 10− 6). However, 698 genes for FEV1,724 genes for FVC, and 355 genes for FEV1/FVC ratio exceeded the suggestive evidence level (p < 0.05), as shown in Supplementary Material 2: Table S3. The top 20 genes for each of the three pulmonary function indicators, ranked by P values, are shown in Table 2. ADCYAP1R1 for FEV1 and FVC, CPR126 for FVC, and CCDC110 for FEV1/FVC had already been shown in the top 20 SNP annotated genes, whereas the other genes were novel, including TBC1D16, FKBP5, AQP4, RPL37, and CD1B.

Table 2.

The top 20 genes from gene-based analysis by using VEGAS2 tool

Chr Gene Number
of SNPs
Start position Stop position P-value Top-SNP Top-SNP P-value
FEV1
12 TESC 185 117,476,727 117,537,251 6.40E-05 rs7304889 3.75E-05
12 KRT5 28 52,908,358 52,914,243 1.35E-04 rs60314569 4.96E-05
17 ARHGDIA 5 79,825,596 79,829,282 1.57E-04 rs113301975 1.49E-04
23 MBNL3 43 131,503,342 131,623,996 1.71E-04 rs1952453 1.38E-04
6 GPR126 172 142,623,055 142,767,403 1.75E-04 rs2039986 1.60E-05
23 RAP2C 5 131,337,051 131,353,508 1.95E-04 rs5977658 2.00E-04
23 RAP2C-AS1 116 131,352,534 131,566,839 2.05E-04 rs1952453 1.38E-04
17 TBC1D16 279 77,906,141 78,009,657 2.23E-04 rs11653700 2.39E-06
17 FAM195B 18 79,780,236 79,791,170 3.41E-04 rs140438341 1.96E-04
6 FHL5 103 97,010,423 97,064,512 3.50E-04 rs12204342 2.56E-05
7 ADCYAP1R1 133 31,092,075 31,151,093 4.12E-04 rs4723047 8.25E-08
17 PPP1R27 2 79,791,367 79,792,926 4.25E-04 rs34856581 3.43E-04
7 SEPT7P2 79 45,763,385 45,808,617 4.54E-04 rs1004028 3.60E-04
10 FAM196A 106 128,933,689 128,994,422 5.58E-04 rs2486971 9.90E-05
17 P4HB 34 79,801,033 79,818,544 5.91E-04 rs111839893 2.94E-04
23 FRMD7 32 131,211,020 131,262,050 7.96E-04 rs5933079 5.97E-05
12 C12orf80 5 52,599,364 52,604,639 8.36E-04 rs4144987 3.06E-03
23 TAF7L 39 100,523,240 100,548,059 9.90E-04 rs5991927 3.01E-04
12 IRAK4 53 44,152,746 44,183,346 1.02E-03 rs4251463 3.90E-04
11 LOC440040 506 49,580,079 49,831,969 1.02E-03 rs11040429 1.72E-04
FVC
6 GPR126 172 142,623,055 142,767,403 1.50E-05 rs9403380 1.27E-07
6 FKBP5 225 35,541,361 35,696,360 1.80E-05 rs12153967 4.39E-06
13 SLC15A1 173 99,336,054 99,404,929 2.40E-05 rs1768042 2.97E-04
18 AQP4 17 24,432,007 24,445,716 2.60E-05 rs455671 5.10E-05
9 SETX 324 135,136,826 135,230,372 1.78E-04 rs508021 1.22E-04
12 C12orf80 5 52,599,364 52,604,639 2.67E-04 rs1317997 9.70E-04
12 IRAK4 53 44,152,746 44,183,346 3.45E-04 rs4251463 1.55E-04
11 LOC646813 36 50,368,317 50,379,802 3.57E-04 rs692671 2.69E-04
11 LOC440040 506 49,580,079 49,831,969 3.62E-04 rs10742953 9.12E-05
19 CABP5 83 48,532,639 48,547,311 3.82E-04 rs111784716 2.17E-04
23 RAP2C 5 131,337,051 131,353,508 3.85E-04 rs5977658 3.52E-04
7 ADCYAP1R1 133 31,092,075 31,151,093 3.91E-04 rs4503014 1.48E-07
23 RAP2C-AS1 116 131,352,534 131,566,839 4.08E-04 rs142507995 3.36E-04
7 DNAJB6 283 157,129,709 157,210,133 4.26E-04 rs12154512 3.07E-04
16 CALB2 88 71,392,615 71,424,342 4.86E-04 rs11545954 1.40E-05
23 MBNL3 43 131,503,342 131,623,996 6.17E-04 rs5977691 3.36E-04
10 FGF8 2 103,529,886 103,540,126 6.74E-04 rs1348870 1.57E-04
23 FRMD7 32 131,211,020 131,262,050 6.84E-04 rs5933079 1.03E-04
12 MKRN9P 5 88,176,662 88,178,488 7.17E-04 rs1675574 4.63E-04
19 SLC5A5 32 17,982,781 18,005,983 7.43E-04 rs7255433 7.70E-05
FEV1/FVC
5 RPL37 9 40,831,429 40,835,387 2.43E-04 rs2291782 8.52E-04
2 LOC101928113 6 86,116,402 86,119,375 3.86E-04 rs4313976 1.21E-04
2 LOC728730 234 39,664,556 39,828,484 6.32E-04 rs10210842 3.22E-05
1 KLHL17 25 895,966 901,099 7.10E-04 rs41285808 6.56E-05
1 CD1B 4 158,297,739 158,301,321 1.36E-03 rs962879 8.61E-04
2 C2orf81 4 74,641,302 74,644,844 1.42E-03 rs7571984 9.31E-04
3 GSK3B 263 119,540,801 119,813,264 1.52E-03 rs60235167 1.30E-04
3 BTD 77 15,642,858 15,687,328 1.59E-03 rs2455822 2.22E-04
1 C8A 153 57,320,442 57,383,894 1.77E-03 rs2284952 1.16E-03
2 GAL3ST2 62 242,716,239 242,743,702 2.01E-03 rs74704923 5.61E-04
3 BPESC1 39 138,823,026 138,844,005 2.05E-03 rs586043 1.38E-03
2 MDH1 6 63,815,742 63,834,330 2.09E-03 rs17618390 6.18E-04
4 CCDC110 76 186,366,337 186,392,913 2.19E-03 rs57923780 5.62E-06
5 GCNT4 7 74,323,288 74,326,724 2.25E-03 rs3811986 1.78E-03
3 LINC00606 11 10,801,168 10,805,877 2.41E-03 rs35131573 6.57E-04
3 RAD18 281 8,918,879 9,005,159 2.67E-03 rs579898 8.73E-04
3 Section 62 56 169,684,579 169,716,161 2.89E-03 rs3348 3.65E-04
1 HNRNPU 2 245,013,601 245,027,827 2.93E-03 rs3766527 7.68E-03
3 RNF168 48 196,195,656 196,230,639 3.20E-03 rs9869437 7.07E-04
3 DCBLD2 227 98,514,813 98,620,533 3.29E-03 rs2439237 7.02E-04

The content discussed in detail was bold

Pathway enrichment analysis

Pathway enrichment analysis identified 582, 540, and 633 pathways (emp-p < 0.05) associated with FEV1, FVC, and FEV1/FVC ratio, respectively. The top 20 pathways ranked by empirical p-value for FEV1, FVC, and FEV1/FVC ratio are shown in Table 3. For FEV1, significant pathways included signaling by NGF, olfactory signaling pathway, purine metabolism, P53 signaling pathway, and HIF pathway. For FVC, significant pathways included the PI3K cascade, FGFR signaling relevant pathway including negative regulation of FGFR signaling, FGFR ligand binding and activation. For FEV1/FVC, significant pathways included signaling to ERKS, NTHI pathway, WNT pathway, MET pathway, aquaporin mediated transport, etc. Additionally, 268 pathways were common to FEV1, FVC, and FEV1/FVC, including signaling by NGF, insulin signaling pathway, and aquaporin mediated transport (Supplementary Material 3: Table S4).

Table 3.

The top 20 pathways (Emp-p < 0.05) by using PASCAL tool

Pathway Chi2
P value
Emp
P value
FEV1
REACTOME signalling by NGF 3.22E-04 7.10E-05
REACTOME NGF signalling via TRKA from the plasma membrane 8.05E-04 1.44E-04
KEGG p53 signaling pathway 8.15E-04 1.61E-04
BIOCARTA IL7 pathway 7.04E-04 5.10E-04
BIOCARTA IL4 pathway 7.04E-04 7.40E-04
REACTOME olfactory signaling pathway 1.15E-03 7.60E-04
BIOCARTA IL2 pathway 7.04E-04 7.60E-04
BIOCARTA IL2RB pathway 7.04E-04 8.30E-04
REACTOME digestion of dietary carbohydrate 8.72E-04 1.02E-03
REACTOME activated AMPK stimulates fatty acid oxidation in muscle 1.68E-03 1.12E-03
REACTOME passive transport by aquaporins 1.33E-03 1.28E-03
BIOCARTA ARAP pathway 1.26E-03 1.30E-03
REACTOME amine derived hormones 1.41E-03 1.37E-03
KEGG insulin signaling pathway 1.86E-03 1.53E-03
REACTOME APC C CDC20 mediated degradation of mitotic proteins 1.61E-03 1.58E-03
KEGG spliceosome 1.44E-03 1.65E-03
KEGG olfactory transduction 2.56E-03 1.72E-03
REACTOME autodegradation of CDH1 by CDH1 APC C 1.61E-03 1.86E-03
KEGG small cell lung cancer 3.32E-03 1.88E-03
REACTOME collagen formation 5.69E-03 2.11E-03
FVC
REACTOME insulin receptor signalling cascade 2.37E-04 1.53E-04
REACTOME FRS2 mediated cascade 2.37E-04 1.68E-04
REACTOME signaling by insulin receptor 2.37E-04 1.85E-04
REACTOME signalling by NGF 1.35E-03 2.77E-04
REACTOME NGF signalling via TRKA from the plasma membrane 1.48E-03 3.11E-04
REACTOME tandem pore domain potassium channels 8.98E-04 3.24E-04
REACTOME collagen formation 1.14E-03 3.38E-04
REACTOME SHC mediated cascade 1.03E-03 4.47E-04
REACTOME signaling by FGFR mutants 1.03E-03 4.56E-04
REACTOME FGFR ligand binding and activation 1.03E-03 4.57E-04
REACTOME FGFR2C ligand binding and activation 1.03E-03 4.68E-04
REACTOME negative regulation of FGFR signaling 1.03E-03 4.71E-04
REACTOME PI3k cascade 1.03E-03 5.30E-04
REACTOME activated point mutants of FGFR2 1.03E-03 5.60E-04
REACTOME downstream signaling events of b cell receptor BCR 1.42E-03 6.10E-04
REACTOME phospholipase C mediated cascade 1.03E-03 6.30E-04
REACTOME signaling by the B cell receptor BCR 1.40E-03 6.40E-04
BIOCARTA IL7 pathway 7.04E-04 6.50E-04
BIOCARTA IL 4 pathway 7.04E-04 7.20E-04
BIOCARTA IL 2 pathway 7.04E-04 7.40E-04
FEV1/FVC
REACTOME influenza viral RNA transcription and replication 5.97E-04 1.77E-04
REACTOME apoptosis 8.12E-04 1.83E-04
KEGG ubiquitin mediated proteolysis 5.44E-04 1.96E-04
REACTOME p75 NTR receptor mediated signalling 1.42E-03 3.82E-04
KEGG WNT signaling pathway 1.16E-03 5.46E-04
BIOCARTA keratinocyte pathway 3.40E-03 6.90E-04
BIOCARTA TNFR1 pathway 4.70E-03 1.14E-03
BIOCARTA stress pathway 5.94E-03 1.30E-03
REACTOME peptide chain elongation 2.30E-03 1.35E-03
KEGG glycosphingolipid biosynthesis ganglio series 1.39E-03 1.38E-03
REACTOME SRP dependent cotranslational protein targeting to membrane 2.30E-03 1.39E-03
KEGG ribosome 2.30E-03 1.46E-03
REACTOME transferrin endocytosis and recycling 2.47E-03 1.49E-03
BIOCARTA TID pathway 6.30E-03 1.58E-03
BIOCARTA RELA pathway 1.80E-03 1.71E-03
REACTOME insulin receptor recycling 2.47E-03 1.78E-03
BIOCARTA p35 alzheimers pathway 1.59E-03 1.78E-03
BIOCARTA sodd pathway 1.80E-03 1.78E-03
BIOCARTA ceramide pathway 1.80E-03 1.82E-03
BIOCARTA NFKB pathway 1.80E-03 1.88E-03

Validation analysis

A total of 352 SNPs for the three pulmonary function indicators with p < 1 × 10− 5 in the discovery set were genotyped in the UK Biobank validation set and selected for validation. Although no SNP passed the Bonferroni correction threshold (p < 1.42 × 10− 4), six SNPs for FEV1 and FVC, and six SNPs for FEV1 exceeded the nominal significance level (p < 0.05). Compared to results published in the GWAS catalog, the 12 SNPs related to pulmonary function validated in this study are all novel. Two SNPs (rs10871500 and rs8070465) were located in the TBC1D16 gene, one (rs149612364) in the TAFA5 gene, and one (rs507108) in the MTHFD1L gene (Table 4). After adjusting for confounders, the GEE models indicated that the PRS was positively associated with FEV1 and FVC, with regression coefficients (β) of 1.49 (p < 0.001) and 1.33 (p < 0.001), respectively.

Table 4.

The SNPs with nominal significance (P < 0.05) in UK Biobank validation analysis

Discovery Validation
SNP Chr BP P value P value Enhancers Gene*
FEV1
rs35031103 3 176,086,799 1.28E-07 3.09E-02 -
rs34981006 3 176,087,566 2.58E-07 2.14E-02
rs7632772 3 176,103,902 4.25E-07 2.30E-02
rs34817694 3 176,118,748 4.38E-07 2.35E-02 enh113420
rs66475255 3 176,107,428 5.02E-07 2.10E-02
rs13084577 3 176,113,219 8.33E-07 4.76E-02
rs186195940 4 138,380,119 1.29E-06 1.29E-02
rs192813943 4 138,326,821 2.34E-06 2.86E-02
rs149612364 22 48,967,243 3.71E-06 1.45E-02 TAFA5
rs10871500 17 77,980,564 4.08E-06 2.14E-02 TBC1D16 enh17427

TBC1D16,

CCDC40, GAA

rs8070465 17 77,978,990 9.36E-06 2.47E-02 TBC1D16 enh17427

TBC1D16,

CCDC40, GAA

rs507108 6 151,327,692 5.75E-06 4.65E-02 MTHFD1L enh9447
FVC
rs35031103 3 176,086,799 2.42E-06 3.23E-02
rs34981006 3 176,087,566 4.73E-06 2.51E-02
rs34817694 3 176,118,748 4.87E-06 2.08E-02
rs7632772 3 176,103,902 7.40E-06 2.80E-02
rs66475255 3 176,107,428 8.70E-06 2.77E-02
rs186195940 4 138,380,119 9.26E-06 9.35E-03

*: Genes possibly regulated by Enhancers

Discussion

In this study, we conducted a GWAS of pulmonary function in northern Chinese twins. We identified 222 SNPs associated with FEV1, 150 SNPs associated with FVC, and 73 SNPs associated with the FEV1/FVC ratio, all of which reached a suggestive level of association. Among these, we corroborated the involvement of 52 SNPs located in 14 genes, including DNAH5, C5orf67 [35], CXCR4 [36, 37], ETV1 [38], MAP3K1 [35], MTHFD1L [39], TENM2 [38, 40], ZBTB43 [40], GPR126 [13, 16, 41, 42], AKAP6 [39, 40], FKBP5 [43], GPC5 [21, 39, 40], LRBA [39, 40], and TMOD1 [40], as supported by previous studies. Additionally, we validated 12 novel SNPs associated with pulmonary function in an independent UK Biobank cohort, with some located in the TBC1D16, TAFA5, and MTHFD1L genes. Gene-based and pathway enrichment analyses further identified hundreds of suggestively significant associations involving genes and pathways across FEV1, FVC, and the FEV1/FVC ratio, underscoring the complex genetic architecture underlying pulmonary function traits.

In this study, we identified 222, 150, and 73 SNPs associated with FEV1, FVC, and the FEV1/FVC ratio, respectively, that reached suggestive significance levels. For FEV1, three SNPs located in enhancers were found to regulate the transcription of the DNAH5 gene. This gene, previously associated with ciliary defects [38, 40, 44], was linked to FEV1 in our study. Additionally, two SNPs near the lesser-studied MTHFD2P6 pseudogene and one at the PARM1 gene, which may influence airway epithelium maintenance and repair [45], reached genome-wide significance. For FVC, 11 highly correlated SNPs within the GPR126 gene showed significant associations. GPR126, previously identified in GWAS studies as related to pulmonary function [16, 46, 47], is crucial for lung development [48] and may influence cell proliferation and airway remodeling [49]. Several SNPs located the ADCYAP1R1 gene showed associations with both FEV1 and FVC. This gene is widely distributed in airway nerve fibers and has significant anti-inflammatory and protective functions in the lungs [50, 51]. For the FEV1/FVC ratio, 14 SNPs in enhancers regulating the PLEKHG6 gene reached suggestive significance. Animal studies suggest that PLEKHG6 may contribute to lung protection against hypercapnia [52], and methylation of PLEKHG6 CpG sites, associated with NO2 exposure, may mediate the effects of air pollution on pulmonary function [53].

During the validation phase in the UK Biobank Chinese cohort, 12 novel SNPs were found to be nominally significant for lung function, including two in the TBC1D16 gene, one in the TAFA5 gene, and one in the MTHFD1L gene. TBC1D16 is a GTPase-activating protein that regulates Rab GTPases [54], influencing cell migration and growth, which may be related to alveolar development [55, 56]. The TAFA5 gene, previously associated with wheezing [57], which is the symptomatic manifestation of diseases that cause airway obstruction, also inhibits post-injury neointima formation [58] and may influence lung function. The MTHFD1L-encoded protein is involved in the synthesis of tetrahydrofolate in mitochondria, which could impact lung tissue repair and regeneration, and has been associated with lung function [39, 59]. Additionally, six SNPs located in non-coding regions suggest that these sites may represent functional coding variants or regulatory elements, warranting further exploration.

The gene-based analysis found 698, 724, and 355 genes exceeding the suggestive significance level for FEV1, FVC, and FEV1/FVC ratio, respectively. Some of these genes, such as GPR126, TBC1D16, and ADCYAP1R1, have been discussed above. Several genes of these genes might play a role in pulmonary function through the following mechanisms. FKBP51 encodes a major component of the glucocorticoid receptor and regulates its response to corticosteroids [60]. Glucocorticoids are involved in late fetal lung development and intrauterine lung maturation in humans [61, 62]. AQP4 encodes an aquaporins expressed in large- and small-airway epithelia, and AQP4 deletion affects lung water permeability [63]. RPL37, as a component of the 60 S ribosomes subunit, is a target gene of miR-4516 and forms the miR-4516/RPL37 pathway, which is involved in autophagy. This pathway plays a critical role in the cellular response to PM2.5 and metal components, potentially affecting pulmonary function [64]. CD1B is a lipid-presenting protein that may be influenced by the sphingosine pathway, which is relevant to COPD and other chronic inflammatory lung diseases [65].

Pathway enrichment analysis identified several biological pathways significantly associated with pulmonary function: 582 pathways for FEV1, 540 pathways for FVC, and 633 pathways for FEV1/FVC. Some of these pathways are biologically plausible. Signaling by nerve growth factor (NGF) was associated with FEV1 and FVC. As a member of the neurotrophin family, NGF is highly expressed in the lungs and can change protein expression levels and mediate cellular function, impacting the development of lung diseases such as pulmonary fibrosis and asthma [66]. Olfactory signaling pathway and olfactory transduction were also related to FEV1 and FVC [67]. Notably, olfactory transduction was the most significant KEGG pathway in a previous GWAS of pulmonary function in Koreans [68]. WNT pathway was related to FEV1/FVC and is essential for proper lung development [69, 70]. Additionally, WNT signaling pathway has been revealed in asthma pathogenesis in previous GWASs [68, 71]. Besides these plausible pathways, other pathways might also have biological associations with pulmonary function. More studies need to be conducted to verify these associations.

This study has several advantages. First, our study, based on Qingdao twin samples, provides valuable genetic evidence for the East Asian population. This contrasts with previous pulmonary function GWAS, which were conducted primarily on European populations. Second, we performed the pulmonary function GWAS in twin pairs. The twin-based GWAS approach effectively controls population stratification and passive gene-environment correlations due to the shared genetic and environmental factors of twins. This methodology enhances the accuracy of genetic association studies by enabling the detection of direct genetic effects [23], while simultaneously reducing the risk of false positives and clarifying indirect genetic influences.

However, our study still has some limitations. First, the relatively small sample size of our study may limit the ability to detect significant associations between genetic markers and pulmonary function, largely due to the difficulty in recruiting and identifying qualified twin samples. More studies are needed to confirm our results. Second, although 12 suggestive SNPs were identified and validated in our study, none reached the statistically significant level. Further research is necessary to validate these potential candidate biomarkers of pulmonary function. Third, since our sample is based on a limited number of Han Chinese twins from the Qingdao region, the generalizability of our findings may be constrained. Future studies with more diverse populations and larger sample sizes are needed to validate the broader applicability of our results.

In conclusion, we identified and validated 12 novel SNPs associated with pulmonary function, including those located in the TBC1D16, TAFA5, and MTHFD1L genes. These findings enhance our understanding of the genetic factors influencing pulmonary function and highlight potential biomarkers for future investigation. While these SNPs show promise, further studies are needed to confirm their roles and explore their potential as candidate biomarkers for pulmonary function.

Electronic supplementary material

Below is the link to the electronic supplementary material.

12864_2024_11165_MOESM1_ESM.xlsx (36.5KB, xlsx)

Supplementary Material 1: Table 2S All suggestively significant SNPs for association with the pulmonary function in genome-wide association study: FEV1, FVC and FEV1/FVC

12864_2024_11165_MOESM2_ESM.xlsx (137.5KB, xlsx)

Supplementary Material 2: Table S3:Nominal significant genes of pulmonary function from gene-based analysis by using VEGAS2 tool: FEV1, FVC and FEV1/FVC

12864_2024_11165_MOESM3_ESM.xlsx (46.1KB, xlsx)

Supplementary Material 3: Table S4: Common pathways for FEV1, FVC and FEV1/FVC by PASCAL tool

12864_2024_11165_MOESM4_ESM.pdf (414.3KB, pdf)

Supplementary Material 4: Table S1. Descriptive statistics of basic characteristics of the sample., Figure S1 Regional association plot showing signals around chromosomal loci (6q24.1) for genome-wide association study of FVC, Figure S2 Regional association plot showing signals around chromosomal loci (6q24.1) for genome-wide association study of FVC, Figure S3 Regional association plot showing signals around chromosomal loci (12p13.31) for genome-wide association study of FEV1/FVC.

Acknowledgements

The authors appreciate Dr. Gu Zhu for his technical guidance of data analysis, and all participants and contributors of Qingdao Twins Registry and the UK Biobank.

Author contributions

T.W conducted the analysis, wrote original draft, and prepare the figures and tables. W.W, C.X and X.T collected and provided data and other resources. D.Z helped make interpretation of data and substantively revised it. All authors reviewed the draft for intellectual content, and approved submission of the manuscript.

Funding

None.

Data availability

The dataset analysed during the current study are available in the European Variation Archive (EVA) repository (Accession No. PRJEB23749).

Declarations

Ethics approval and consent to participate

Informed written consents were obtained from all participants. Regional Ethics Committee of the Qingdao Centers for Disease Control and Prevention Institutional Review Boards has approved this study.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Xiaocao Tian, Email: txc_2006911@126.com.

Dongfeng Zhang, Email: zhangdf1961@126.com.

References

  • 1.Wilk JB, Djousse L, Arnett DK, Rich SS, Province MA, Hunt SC, Crapo RO, Higgins M, Myers RH. Evidence for major genes influencing pulmonary function in the NHLBI family heart study. Genet Epidemiol. 2000;19(1):81–94. [DOI] [PubMed] [Google Scholar]
  • 2.Weiss ST. Lung function and airway diseases. Nat Genet. 2010;42(1):14–6. [DOI] [PubMed] [Google Scholar]
  • 3.Tockman MS, Pearson JD, Fleg JL, Metter EJ, Kao SY, Rampal KG, Cruise LJ, Fozard JL. Rapid decline in FEV1. A new risk factor for coronary heart disease mortality. Am J Respir Crit Care Med. 1995;151(2 Pt 1):390–8. [DOI] [PubMed] [Google Scholar]
  • 4.Agustí A, Noell G, Brugada J, Faner R. Lung function in early adulthood and health in later life: a transgenerational cohort analysis. Lancet Respiratory Med. 2017;5(12):935–45. [DOI] [PubMed] [Google Scholar]
  • 5.Schünemann HJ, Dorn J, Grant BJ, Winkelstein W Jr., Trevisan M. Pulmonary function is a long-term predictor of mortality in the general population: 29-year follow-up of the Buffalo Health Study. Chest. 2000;118(3):656–64. [DOI] [PubMed] [Google Scholar]
  • 6.Sin DD, Wu L, Man SF. The relationship between reduced lung function and cardiovascular mortality: a population-based study and a systematic review of the literature. Chest. 2005;127(6):1952–9. [DOI] [PubMed] [Google Scholar]
  • 7.Gläser S, Krüger S, Merkel M, Bramlage P, Herth FJ. Chronic obstructive pulmonary disease and diabetes mellitus: a systematic review of the literature. Respir Int Rev Thorac Dis. 2015;89(3):253–64. [DOI] [PubMed] [Google Scholar]
  • 8.Cazzola M, Calzetta L, Bettoncelli G, Cricelli C, Romeo F, Matera MG, Rogliani P. Cardiovascular disease in asthma and COPD: a population-based retrospective cross-sectional study. Respir Med. 2012;106(2):249–56. [DOI] [PubMed] [Google Scholar]
  • 9.Yao TC, Du G, Han L, Sun Y, Hu D, Yang JJ, Mathias R, Roth LA, Rafaels N, Thompson EE, et al. Genome-wide association study of lung function phenotypes in a founder population. J Allergy Clin Immunol. 2014;133(1):248–55. e241-210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tian X, Xu C, Wu Y, Sun J, Duan H, Zhang D, Jiang B, Pang Z, Li S, Tan Q. Genetic and environmental influences on pulmonary function and muscle strength: the Chinese twin study of aging. Twin Res Hum Genet. 2017;20(1):53–9. [DOI] [PubMed] [Google Scholar]
  • 11.Visscher PM, Montgomery GW. Genome-wide association studies and human disease: from trickle to flood. JAMA. 2009;302(18):2028–9. [DOI] [PubMed] [Google Scholar]
  • 12.Wilk JB, Chen TH, Gottlieb DJ, Walter RE, Nagle MW, Brandler BJ, Myers RH, Borecki IB, Silverman EK, Weiss ST, et al. A genome-wide association study of pulmonary function measures in the Framingham Heart Study. PLoS Genet. 2009;5(3):e1000429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hancock DB, Eijgelsheim M, Wilk JB, Gharib SA, Loehr LR, Marciante KD, Franceschini N, van Durme YM, Chen TH, Barr RG, et al. Meta-analyses of genome-wide association studies identify multiple loci associated with pulmonary function. Nat Genet. 2010;42(1):45–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Repapi E, Sayers I, Wain LV, Burton PR, Johnson T, Obeidat M, Zhao JH, Ramasamy A, Zhai G, Vitart V, et al. Genome-wide association study identifies five loci associated with lung function. Nat Genet. 2010;42(1):36–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Loth DW, Soler Artigas M, Gharib SA, Wain LV, Franceschini N, Koch B, Pottinger TD, Smith AV, Duan Q, Oldmeadow C, et al. Genome-wide association analysis identifies six new loci associated with forced vital capacity. Nat Genet. 2014;46(7):669–77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Burkart KM, Sofer T, London SJ, Manichaikul A, Hartwig FP, Yan Q, Soler Artigas M, Avila L, Chen W, Davis Thomas S, et al. A Genome-Wide Association Study in Hispanics/Latinos Identifies Novel Signals for lung function. The Hispanic Community Health Study/Study of latinos. Am J Respir Crit Care Med. 2018;198(2):208–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kim WJ, Lee MK, Shin C, Cho NH, Lee SD, Oh YM, Sung J. Genome-wide association studies identify locus on 6p21 influencing lung function in the Korean population. Respirology. 2014;19(3):360–8. [DOI] [PubMed] [Google Scholar]
  • 18.Yamada M, Motoike IN, Kojima K, Fuse N, Hozawa A, Kuriyama S, Katsuoka F, Tadaka S, Shirota M, Sakurai M, et al. Genetic loci for lung function in Japanese adults with adjustment for exhaled nitric oxide levels as airway inflammation indicator. Commun Biology. 2021;4(1):1288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Son HY, Sohn SW, Im SH, Kim HJ, Lee MK, Gombojav B, Kwon HS, Park DS, Kim HL, Min KU, et al. Family-based Association study of pulmonary function in a Population in Northeast Asia. PLoS ONE. 2015;10(10):e0139716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Yoo SK, Kim CU, Kim HL, Kim S, Shin JY, Kim N, Yang JSW, Lo KW, Cho B, Matsuda F, et al. NARD: whole-genome reference panel of 1779 Northeast asians improves imputation accuracy of rare and low-frequency variants. Genome Med. 2019;11(1):64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zhu Z, Li J, Si J, Ma B, Shi H, Lv J, Cao W, Guo Y, Millwood IY, Walters RG et al. A large-scale genome-wide association analysis of lung function in the Chinese population identifies novel loci and highlights shared genetic aetiology with obesity. Eur Respir J 2021, 58(4). [DOI] [PMC free article] [PubMed]
  • 22.Sakaue S, Kanai M, Karjalainen J, Akiyama M, Kurki M, Matoba N, Takahashi A, Hirata M, Kubo M, Matsuda K, et al. Trans-biobank analysis with 676,000 individuals elucidates the association of polygenic risk scores of complex traits with human lifespan. Nat Med. 2020;26(4):542–8. [DOI] [PubMed] [Google Scholar]
  • 23.Friedman NP, Banich MT, Keller MC. Twin studies to GWAS: there and back again. Trends Cogn Sci. 2021;25(10):855–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Duan H, Ning F, Zhang D, Wang S, Zhang D, Tan Q, Tian X, Pang Z. The Qingdao Twin Registry: a status update. Twin Res Hum Genet. 2013;16(1):79–85. [DOI] [PubMed] [Google Scholar]
  • 25.Xu C, Zhang D, Tian X, Wu Y, Pang Z, Li S, Tan Q. Genetic and environmental basis in phenotype correlation between physical function and cognition in Aging Chinese Twins. Twin Res Hum Genet. 2017;20(1):60–5. [DOI] [PubMed] [Google Scholar]
  • 26.Wang T, Wang W, Li W, Duan H, Xu C, Tian X, Zhang D. Genome-wide DNA methylation analysis of pulmonary function in middle and old-aged Chinese monozygotic twins. Respir Res. 2021;22(1):300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Zhao B, Lu Q, Cheng Y, Belcher JM, Siew ED, Leaf DE, Body SC, Fox AA, Waikar SS, Collard CD, et al. A genome-wide Association study to identify single-nucleotide polymorphisms for acute kidney Injury. Am J Respir Crit Care Med. 2017;195(4):482–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sun J, Wang W, Zhang R, Duan H, Tian X, Xu C, Li X, Zhang D. Multivariate genome-wide association study of depression, cognition, and memory phenotypes and validation analysis identify 12 cross-ethnic variants. Translational Psychiatry. 2022;12(1):304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.van Leeuwen EM, Kanterakis A, Deelen P, Kattenberg MV, Genome of the, Netherlands C, Slagboom PE, de Bakker PI, Wijmenga C, Swertz MA, Boomsma DI et al. Population-specific genotype imputations using minimac or IMPUTE2. Nat Protoc 2015, 10(9):1285–1296. [DOI] [PubMed]
  • 30.Zhou X, Stephens M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet. 2012;44(7):821–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mishra A, Macgregor S. VEGAS2: Software for more flexible gene-based testing. Twin Res Hum Genet. 2015;18(1):86–91. [DOI] [PubMed] [Google Scholar]
  • 32.Lamparter D, Marbach D, Rueedi R, Kutalik Z, Bergmann S. Fast and rigorous computation of Gene and pathway scores from SNP-Based Summary statistics. PLoS Comput Biol. 2016;12(1):e1004714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, Downey P, Elliott P, Green J, Landray M, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, Motyer A, Vukcevic D, Delaneau O, O’Connell J, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562(7726):203–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wyss AB, Sofer T, Lee MK, Terzikhan N, Nguyen JN, Lahousse L, Latourelle JC, Smith AV, Bartz TM, Feitosa MF, et al. Multiethnic meta-analysis identifies ancestry-specific and cross-ancestry loci for pulmonary function. Nat Commun. 2018;9(1):2976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Wain LV, Shrine N, Artigas MS, Erzurumluoglu AM, Noyvert B, Bossini-Castillo L, Obeidat M, Henry AP, Portelli MA, Hall RJ, et al. Genome-wide association analyses for lung function and chronic obstructive pulmonary disease identify new loci and potential druggable targets. Nat Genet. 2017;49(3):416–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wain LV, Shrine N, Miller S, Jackson VE, Ntalla I, Soler Artigas M, Billington CK, Kheirallah AK, Allen R, Cook JP, et al. Novel insights into the genetics of smoking behaviour, lung function, and chronic obstructive pulmonary disease (UK BiLEVE): a genetic association study in UK Biobank. Lancet Respiratory Med. 2015;3(10):769–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lutz SM, Cho MH, Young K, Hersh CP, Castaldi PJ, McDonald ML, Regan E, Mattheisen M, DeMeo DL, Parker M, et al. A genome-wide association study identifies risk loci for spirometric measures among smokers of European and African ancestry. BMC Genet. 2015;16:138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Shrine N, Izquierdo AG, Chen J, Packer R, Hall RJ, Guyatt AL, Batini C, Thompson RJ, Pavuluri C, Malik V, et al. Multi-ancestry genome-wide association analyses improve resolution of genes and pathways influencing lung function and chronic obstructive pulmonary disease risk. Nat Genet. 2023;55(3):410–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kichaev G, Bhatia G, Loh PR, Gazal S, Burch K, Freund MK, Schoech A, Pasaniuc B, Price AL. Leveraging Polygenic Functional Enrichment to improve GWAS Power. Am J Hum Genet. 2019;104(1):65–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wilk JB, Shrine NR, Loehr LR, Zhao JH, Manichaikul A, Lopez LM, Smith AV, Heckbert SR, Smolonska J, Tang W, et al. Genome-wide association studies identify CHRNA5/3 and HTR4 in the development of airflow obstruction. Am J Respir Crit Care Med. 2012;186(7):622–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Shrine N, Guyatt AL, Erzurumluoglu AM, Jackson VE, Hobbs BD, Melbourne CA, Batini C, Fawcett KA, Song K, Sakornsakolpat P, et al. New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries. Nat Genet. 2019;51(3):481–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Sinkala M, Elsheikh SSM, Mbiyavanga M, Cullinan J, Mulder NJ. A genome-wide association study identifies distinct variants associated with pulmonary function among European and African ancestries from the UK Biobank. Commun Biology. 2023;6(1):49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lee JH, McDonald ML, Cho MH, Wan ES, Castaldi PJ, Hunninghake GM, Marchetti N, Lynch DA, Crapo JD, Lomas DA, et al. DNAH5 is associated with total lung capacity in chronic obstructive pulmonary disease. Respir Res. 2014;15(1):97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Chen H, Matsumoto K, Brockway BL, Rackley CR, Liang J, Lee JH, Jiang D, Noble PW, Randell SH, Kim CF, et al. Airway epithelial progenitors are region specific and show differential responses to bleomycin-induced lung injury. Stem Cells. 2012;30(9):1948–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Liu Y, Basty N, Whitcher B, Bell JD, Sorokin EP, van Bruggen N, Thomas EL, Cule M. Genetic architecture of 11 organ traits derived from abdominal MRI using deep learning. eLife 2021, 10. [DOI] [PMC free article] [PubMed]
  • 47.Imboden M, Bouzigon E, Curjuric I, Ramasamy A, Kumar A, Hancock DB, Wilk JB, Vonk JM, Thun GA, Siroux V, et al. Genome-wide association study of lung function decline in adults with and without asthma. J Allergy Clin Immunol. 2012;129(5):1218–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Miller S, Melen E, Merid SK, Hall IP, Sayers I. Genes associated with polymorphic variants predicting lung function are differentially expressed during human lung development. Respir Res. 2016;17(1):95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Hall RJ, O’Loughlin J, Billington CK, Thakker D, Hall IP, Sayers I. Functional genomics of GPR126 in airway smooth muscle and bronchial epithelial cells. FASEB J. 2021;35(7):e21300. [DOI] [PubMed] [Google Scholar]
  • 50.Busto R, Prieto JC, Bodega G, Zapatero J, Carrero I. Immunohistochemical localization and distribution of VIP/PACAP receptors in human lung. Peptides. 2000;21(2):265–9. [DOI] [PubMed] [Google Scholar]
  • 51.Chen W, Boutaoui N, Brehm JM, Han YY, Schmitz C, Cressley A, Acosta-Perez E, Alvarez M, Colon-Semidey A, Baccarelli AA, et al. ADCYAP1R1 and asthma in Puerto Rican children. Am J Respir Crit Care Med. 2013;187(6):584–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Li G, Zhou D, Vicencio AG, Ryu J, Xue J, Kanaan A, Gavrialov O, Haddad GG. Effect of carbon dioxide on neonatal mouse lung: a genomic approach. J Appl Physiol (1985). 2006;101(6):1556–64. [DOI] [PubMed] [Google Scholar]
  • 53.van der Plaat DA. Genes, DNA methylation and exposures underlying COPD in never-smokers.
  • 54.Goueli BS, Powell MB, Finger EC, Pfeffer SR. TBC1D16 is a Rab4A GTPase activating protein that regulates receptor recycling and EGF receptor signaling. Proc Natl Acad Sci USA. 2012;109(39):15787–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.McGowan SE, Holmes AJ, Mecham RP, Ritty TM. Arg-gly-asp-containing domains of fibrillins-1 and – 2 distinctly regulate lung fibroblast migration. Am J Respir Cell Mol Biol. 2008;38(4):435–45. [DOI] [PubMed] [Google Scholar]
  • 56.Vukmirica J, Monzo P, Le Marchand-Brustel Y, Cormont M. The Rab4A effector protein Rabip4 is involved in migration of NIH 3T3 fibroblasts. J Biol Chem. 2006;281(47):36360–8. [DOI] [PubMed] [Google Scholar]
  • 57.Granell R, Curtin JA, Haider S, Kitaba NT, Mathie SA, Gregory LG, Yates LL, Tutino M, Hankinson J, Perretti M et al. A meta-analysis of genome-wide association studies of childhood wheezing phenotypes identifies ANXA1 as a susceptibility locus for persistent wheezing. eLife 2023, 12. [DOI] [PMC free article] [PubMed]
  • 58.Wang Y, Chen D, Zhang Y, Wang P, Zheng C, Zhang S, Yu B, Zhang L, Zhao G, Ma B, et al, et al. Novel adipokine, FAM19A5, inhibits neointima formation after Injury through Sphingosine-1-Phosphate receptor 2. Circulation. 2018;138(1):48–63. [DOI] [PubMed] [Google Scholar]
  • 59.Wilk JB, Walter RE, Laramie JM, Gottlieb DJ, O’Connor GT. Framingham Heart Study genome-wide association: results for pulmonary function measures. BMC Med Genet. 2007;8(Suppl 1):S8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Alsaffar SF, Alqaraghuli HA, Yenzeel JH, Ghazi HF. The association of FKBP5 polymorphism with asthma susceptibility in asthmatic patients. J Basic Clin Physiol Pharmacol. 2021;32(4):479–84. [DOI] [PubMed] [Google Scholar]
  • 61.Bolt RJ, van Weissenbruch MM, Lafeber HN, Delemarre-van de Waal HA. Glucocorticoids and lung development in the fetus and preterm infant. Pediatr Pulmonol. 2001;32(1):76–91. [DOI] [PubMed] [Google Scholar]
  • 62.Fowden AL, Forhead AJ. Hormones as epigenetic signals in developmental programming. Exp Physiol. 2009;94(6):607–25. [DOI] [PubMed] [Google Scholar]
  • 63.Verkman AS, Matthay MA, Song Y. Aquaporin water channels and lung physiology. Am J Physiol Lung Cell Mol Physiol. 2000;278(5):L867–879. [DOI] [PubMed] [Google Scholar]
  • 64.Li X, Lv Y, Hao J, Sun H, Gao N, Zhang C, Lu R, Wang S, Yin L, Pu Y, et al. Role of microRNA-4516 involved autophagy associated with exposure to fine particulate matter. Oncotarget. 2016;7(29):45385–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Ween MP, White JB, Tran HB, Mukaro V, Jones C, Macowan M, Hodge G, Trim PJ, Snel MF, Hodge SJ. The role of oxidised self-lipids and alveolar macrophage CD1b expression in COPD. Sci Rep. 2021;11(1):4106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Liu P, Li S, Tang L. Nerve growth factor: a potential therapeutic target for Lung diseases. Int J Mol Sci 2021, 22(17). [DOI] [PMC free article] [PubMed]
  • 67.Kalbe B, Knobloch J, Schulz VM, Wecker C, Schlimm M, Scholz P, Jansen F, Stoelben E, Philippou S, Hecker E, et al. Olfactory receptors modulate physiological processes in human airway smooth muscle cells. Front Physiol. 2016;7:339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lee BY, Cho S, Shin DH, Kim H. Genome-wide association study of copy number variations associated with pulmonary function measures in Korea Associated Resource (KARE) cohorts. Genomics. 2011;97(2):101–5. [DOI] [PubMed] [Google Scholar]
  • 69.Lehmann M, Baarsma HA, Konigshoff M. WNT signaling in Lung Aging and Disease. Ann Am Thorac Soc. 2016;13(Suppl 5):S411–6. [DOI] [PubMed] [Google Scholar]
  • 70.Konigshoff M, Kramer M, Balsara N, Wilhelm J, Amarie OV, Jahn A, Rose F, Fink L, Seeger W, Schaefer L, et al. WNT1-inducible signaling protein-1 mediates pulmonary fibrosis in mice and is upregulated in humans with idiopathic pulmonary fibrosis. J Clin Invest. 2009;119(4):772–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Barreto-Luis A, Corrales A, Acosta-Herrera M, Gonzalez-Colino C, Cumplido J, Martinez-Tadeo J, Carracedo A, Villar J, Carrillo T, Pino-Yanes M, et al. A pathway-based association study reveals variants from wnt signalling genes contributing to asthma susceptibility. Clin Exp Allergy. 2017;47(5):618–26. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12864_2024_11165_MOESM1_ESM.xlsx (36.5KB, xlsx)

Supplementary Material 1: Table 2S All suggestively significant SNPs for association with the pulmonary function in genome-wide association study: FEV1, FVC and FEV1/FVC

12864_2024_11165_MOESM2_ESM.xlsx (137.5KB, xlsx)

Supplementary Material 2: Table S3:Nominal significant genes of pulmonary function from gene-based analysis by using VEGAS2 tool: FEV1, FVC and FEV1/FVC

12864_2024_11165_MOESM3_ESM.xlsx (46.1KB, xlsx)

Supplementary Material 3: Table S4: Common pathways for FEV1, FVC and FEV1/FVC by PASCAL tool

12864_2024_11165_MOESM4_ESM.pdf (414.3KB, pdf)

Supplementary Material 4: Table S1. Descriptive statistics of basic characteristics of the sample., Figure S1 Regional association plot showing signals around chromosomal loci (6q24.1) for genome-wide association study of FVC, Figure S2 Regional association plot showing signals around chromosomal loci (6q24.1) for genome-wide association study of FVC, Figure S3 Regional association plot showing signals around chromosomal loci (12p13.31) for genome-wide association study of FEV1/FVC.

Data Availability Statement

The dataset analysed during the current study are available in the European Variation Archive (EVA) repository (Accession No. PRJEB23749).


Articles from BMC Genomics are provided here courtesy of BMC

RESOURCES