Abstract
Background
Asthma, lung cancer (LC) and chronic obstructive pulmonary disease (COPD) are three respiratory diseases characterized by complex mechanisms underlying and genetic predispositions, with asthma having the highest calculated heritability. Despite efforts deployed in the last decades, only a small part of its heritability has been elucidated. It was hypothesized that shared genetic factors by these three diseases could help identify new asthma loci.
Methods
GWAS-nominated LC and COPD loci were selected among studies performed in Caucasian cohorts using the GWAS Catalog. Genetic analyses were carried out for these loci in the Saguenay–Lac-Saint-Jean (SLSJ) asthma familial cohort and then replicated in two independent cohorts (the Canadian Cohort Obstructive Lung Disease [CanCOLD] and the Epidemiological Study of the Genetics and Environment of Asthma [EGEA]).
Results
Analyses in the SLSJ cohort identified 2851 and 4702 genetic variants to be replicated in the CanCOLD and EGEA cohorts for LC and COPD loci respectively. Replication and meta-analyses allowed the association of one new locus with asthma, 2p24.3, from COPD studies. None was associated from LC studies reported.
Conclusions
The approach used in this study contributed to better understand the heritability of asthma with shared genetic backgrounds of respiratory diseases.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12890-022-01890-7.
Keywords: Asthma, COPD, Lung cancer, Genetics, 2p24.3, MYCN
Background
The World Health Organization (WHO; www.who.int) has stated that noncommunicable diseases are the world’s biggest killers, with 36 million deaths globally on an annual basis. Cancers account for 21% of these, particularly lung cancer (LC) which is the leading cause of death. Chronic respiratory diseases represent 12% of deaths, and include asthma and chronic obstructive pulmonary disease (COPD). These three noncommunicable diseases are characterized by the combination of complex mechanisms underlying and genetic predispositions, with asthma having the highest calculated heritability (55–90%) [1]. Efforts in the last decades have defined multiple genetic factors, with more than 100 loci linked to asthma in genome-wide association studies (GWAS) [1]. However, they explain only a small part of this heritability; undiscovered genes may reside in genetic loci common between asthma and other respiratory diseases such as LC and COPD.
Meta-analyses have shown an increased risk of developing LC for asthmatic individuals [2, 3]. However, it was recently demonstrated that these associations were probably biased by other respiratory or allergic comorbidities. In fact, when analyzing data from individuals with asthma only, an inverse relationship was found [3], suggesting a protection against LC for asthmatics. Links between asthma and COPD have also been studied, both diseases being frequent comorbidities with up to 55% with COPD having the other as well [4]. In recent years, an asthma-COPD overlap (ACO) syndrome has been described. However, as these two diseases are each characterized by complex interactions of environmental and genetic factors, it is now recognized that ACO is as heterogeneous as the two others and do not correspond to a syndrome definition [5]. Nonetheless, a genetic correlation of 0.38 has been observed between asthma and COPD [6].
In this study, we hypothesized that genetic factors shared by LC or COPD and asthma could help identify new loci of interest for the latter and thus contribute in defining the missing heritability of this complex trait.
Methods
A schematic view of the study design is presented in Fig. 1 with short description of cohorts, analyses performed including covariates, and significance thresholds.
Discovery cohort
The Saguenay–Lac-Saint-Jean asthma familial cohort (SLSJ cohort) [7] comprises 271 families ascertained through one proband (offspring) with allergic asthma for a total of 1394 individuals, among which 1214 have phenotypes and genotypes available. Genome-wide methylation and expression data are also accessible for 215 of these individuals. The general and respiratory health of all were evaluated using a standardized questionnaire (including questions about smoking status, age at asthma onset, etc.) as well as allergy testing and pulmonary function tests according to the American Thoracic Society guidelines (as the forced expiratory volume in 1 second [FEV1], the forced vital capacity [FVC], the FEV1/FVC ratio post bronchodilator [BD] treatment with inhalation of a 200 ng salbutamol dose, the BD reversibility [FEV1 post-BD minus FEV1 pre-BD], and the methacholine challenge [PC20]) [8, 9]. Participants were considered asthmatic if: (1) they had a reported history of asthma (validated by a physician), or (2) they presented asthma-related symptoms and positive PC20 (< 8 mg/ml) on recruitment. All participants (or kin or guardians of minors) gave informed written consent and the study was approved by the Centre intégré universitaire de santé et de services sociaux du Saguenay–Lac-Saint-Jean ethics committee (project #0002-001, 08-11-2005).
Replication cohorts
Each site of both replication cohorts obtained local Research Ethics Board approval for the study, and each participant gave signed informed consent. The Canadian Cohort Obstructive Lung Disease (CanCOLD) [10] is a prospective longitudinal study comprising 1800 individuals with assessment at the time of recruitment, 18 months, 3 years and so on. It involves two balanced subsets of COPD, mild and moderate-to-severe COPD, defined based on Global Initiative for Chronic Obstructive Lung Disease (GOLD) classification (https://goldcopd.org/) as well as many subsets of non-COPD individuals with normal post-bronchodilator spirometry including ever smokers, for those at risk, and never smokers, for healthy controls matching for sex and age. Asthma phenotype, as well as respiratory measures, is similar to the SLSJ cohort. Age at asthma onset is unavailable for this cohort.
The French Epidemiological study on the Genetics and Environment of Asthma (EGEA) [11] combines a family-based study on this condition and a case–control study consisting of three surveys over 20 years, from 1991 to 2013 (see https://egeanet.vjf.inserm.fr/index.php/en/ for details). The whole study includes 388 asthmatic probands and their 1317 first-degree relatives additional to 415 population-based controls (total of 2120 subjects). All EGEA participants have extensive phenotypic characterization through use of standardized questionnaires as well as physiological and biological tests, and data on many lifestyle and environmental factors. However, values for BD reversibility and FEV1/FVC ratio from a post-BD treatment were not available for the EGEA study.
Genetic data
Genetic data for the SLSJ cohort were collected from Illumina Human610-Quad BeadChip. A complete description of the quality control filtering and the imputation process is available in Madore et al. [12]. Briefly, the pre-phasing step has been performed with the Shapeit2 software [13] for imputation in turn by the Impute2 software [14] with samples from the 1000 Genomes Project (phase 3) [15] and the UK10K as reference. After quality control filtering, imputed genotypes were accessible for 1198 samples and according to availability of phenotypes and covariates, analyses have been performed on genotypes from 454 individuals for lung cancer loci and 317 for COPD loci.
Genotypes for CanCOLD were obtained from the Infinium Global Screening Array-24 v2.0 BeadChip. Imputations were carried out using the same quality control criteria, tools and reference samples as the SLSJ cohort. After quality control filtering and according to availability of phenotypes and covariates, analyses were made on genotypes from 1293 individuals for LC and COPD susceptibility loci.
Genetic data were available for 1904 EGEA participants (Illumina Human610-Quad BeadChip). Imputations were carried out using the Haplotype Reference Consortium panel (> 20 million SNPs). Stringent quality control was applied before and after the imputation, and imputed genotypes with posterior probabilities < 0.9 were set to missing. After quality control filtering and according to availability of phenotypes and covariates, analyses for lung cancer susceptibility loci were made on genotypes from 638 offsprings.
Genetic analyses
On November 19, 2019, GWAS Catalog (https://www.ebi.ac.uk/gwas/) was used to list all GWAS nominated loci associated with LC and COPD on Caucasian cohorts. The keywords “familial lung cancer”, “small cell carcinoma”, “non-small cell carcinoma (adenocarcinoma and squamous cell carcinoma)” and “pulmonary neuroendocrine tumor” were typed to search for LC loci and “chronic obstructive pulmonary disease” was for COPD loci. GWAS that are addressing treatment response were excluded. Regions of ± 100 kb around GWAS-associated genetic variants were used for analysis in the SLSJ cohort [7].
We carried out the MFQLS test implemented in the Workbench for Integrated Superfast Association study with Related Data (WISARD) toolkit [16, 17] in the SLSJ samples to identify single nucleotide polymorphisms (SNPs) or insertions/deletions associated with asthma at a p value < 0.05. MFQLS is a quasi-likelihood method of estimation extended from the Cochran-Armitage’s one and suited for familial cohorts considering kinship coefficients [16, 17]. From these, significance thresholds for replication were calculated with Nyholt’s method [18], in accordance with the number of independent genetic variants in LC and COPD susceptibility loci and were set at 1.98E−04 (259 variants) and 6.97E−05 (736 variants), respectively. For LC, analyses were made with the asthma phenotype using age at asthma onset, smoking status (smokers [smokers and ex-smokers who have stopped for less than a year], ex-smoker [if stopped for at least one year] and non-smoker), FEV1 and FVC in percentages of predicted value as covariates. For COPD, they were with the asthma phenotype and age at asthma onset, smoking status, BD reversibility and FEV1/FVC ratio from a post-BD treatment used as covariates.
Analyses in the CanCOLD cohort for lung cancer LC and COPD susceptibility loci were made using the logistic function in Plink1.9 with all covariates excepts age at asthma onset, which was unavailable. As for the EGEA study, they have been performed with the MFQLS test for lung cancer loci only, as BD reversibility and FEV1/FVC ratio from a post-BD were not available here for associations with COPD loci.
Results from the SLSJ discovery cohort and both replication ones were combined using the Stouffer’s approach implemented in the METAL software considering the weight of each cohort (http://csg.sph.umich.edu/abecasis/metal/) [19]. Direction of associations were not considered. As for Haploview, it was used to delimit the haplotype blocks in the regions associated with asthma using the analyses in combination with an algorithm taken from Gabriel et al. [20, 21].
Isolation of naïve CD4+ T cells as well as eosinophils, and RNA sequencing
To look at possible expression quantitative trait loci (eQTLs) between associated SNPs and gene expression in immune cell types involved in asthma pathophysiology, transcriptome data of naïve CD4+ T cells (units with CD3+, CD4+, CD45RA+, and CD45RO− markers) and eosinophils from 215 individuals of the SLSJ cohort were used [12]. Such analyses may help better understand possible functional impact of associated genetic variants. Those cells were isolated from 200 ml of blood samples and gene expression counts were extracted from sequencing data of 500 ng RNA samples. For details on the method, please see Madore et al. [12]. After quality control filtering and given availability of all covariates included in statistical models, transcriptomic data were accessible for 159 naïve CD4+ T cell samples and 139 eosinophil samples.
eQTL analyses
eQTLs analyses have been performed using the package MatrixEQTL in R [22] for gene expression counts with age at sampling, sex, smoking status, proportion of naïve CD4+ T cells or eosinophils, according to which sample types were used for analyses, and surrogate variables as covariates. Proportion of cell types were calculated with the methylome data collected from the same samples and the Houseman method [23] implemented in R package RnBeads [24] as previously described [12]. Surrogate variables were using the R package sva [25] and were added to the analysis model in order to take into account hidden confounders and relatedness between samples. The significance threshold was calculated with Nyholt’s method [18], in accordance with the number of independent genetic variants in LC and COPD susceptibility loci in a respective manner, multiplied by that of genes expressed by each cell type. Those thresholds were set at 1.18E−08 and 1.17E−08 for LC loci when analyzing naïve CD4+ T cells (259 independent genetic variants × 16,324 genes) and eosinophils (259 × 16,449), respectively, and at 4.16E−09 and 4.13E−09 for COPD loci for analyses in naïve CD4+ T cells (736 × 16,324) and eosinophils (736 × 16,449), respectively. Analyses were run for cis-eQTLs (maximum distance between gene and genetic variant ≤ 100 kb) and trans-eQTLs (distance between gene and genetic variant > 100 kb) separately.
Finally, the eQTL Catalogue (https://www.ebi.ac.uk/eqtl/), that includes data from several studies including the GTEx Portal, was used to look at eQTLs between associated SNPs and all tissue types.
Results
According to the GWAS Catalogue, 121 LC loci and 167 COPD loci have been selected for analyses (Additional file 1: Tables S1 and S2), representing 60,379 and 85,776 genetic variants in the SLSJ cohort respectively. Analyses have been performed in this cohort using the MFQLS method. Among the genetic variants tested, 2956 and 4893 for LC and COPD loci respectively had p values below or equal to 0.05 in SLSJ. Of these, 2851 and 4702 were available in at least one of two independent cohorts of Caucasian origins (CanCOLD or EGEA cohorts) and were selected for replication (Additional file 1: Tables S3 and S4). LC variants were tested for replication in both cohorts. A logistic model was used for the CanCOLD [10] which has a case–control design, and the MFQLS method was for the EGEA which has a familial design [11]. No variants were found significantly associated in the meta-analysis.
COPD genetic variants were only replicated in the CanCOLD [10] cohort, according to covariates availability. Among these, four were found significantly associated with asthma in the meta-analysis including both cohorts (p < 6.97E−05) (Table 1, Fig. 2). These (2p24.3 locus, chr2:15,969,655–15,977,866) are 63.5 kb downstream of the COPD-associated variant in the literature (rs10929386, not evaluated in the SLSJ cohort [best r2 with the nearest at < 0.01]) [26] and form a unique haplotype block (Fig. 2B). These variants are 10.3 kb upstream the MYCN proto-oncogene (MYCN) gene’s transcription start site. According to data from the SLSJ cohort, this haplotype block includes 34 genetic variants, all significant in the SLSJ cohort, among which 4 were genotyped and 30 were imputed (Additional file 1: Table S3). Looking at ENCODE and ORegAnno databases of regulatory elements, the haplotype block of 34 genetic variants spans several regulatory features. A DNAse 1 hypersensitivity site with a score of 1000/1000 was found at position chr2:15,980,166–15,980,675, a H3K27me3 modification site with a score of 789 out of 1000 at position chr2:15,975,986–15,976,193 and two transcription factor binding sites for E2F1 at position chr2:15,969,690–15,970,109 and JUND at position chr2:15,975,707–15,975,721.
Table 1.
CHR | SNPa | HGVS name | Genotyped or imputed | MAF SLSJ/UCSCb | Nearest gene | r2c | SLSJ cohort | CanCOLD | Combined (SLSJ, CanCOLD) | |||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Effect allele | P | Effect allele | P | Z-score | P | |||||||
2 | rs13382914 | g.15969655C>T | Imputed | 0.23/0.42 | 111,028 bp before MYCN | 0.995 | T | 0.011 | T | 0.001 | 4.014 | 5.98E−05 |
rs56003245 | g.15970221T>A | Imputed | 0.23/0.43 | 110,462 bp before MYCN | 0.995 | A | 0.018 | A | 0.001 | 3.990 | 6.62E−05 | |
rs61557349 | g.15970339A>G | Imputed | 0.23/0.42 | 110,344 bp before MYCN | 0.995 | G | 0.018 | G | 0.001 | − 3.990 | 6.62E−05 | |
rs4668471 | g.15977866A>T | Imputed | 0.23/0.53 | 102,817 bp before MYCN | 1.000 | T | 0.009 | T | 0.001 | − 4.056 | 4.99E−05 |
Chr, chromosome; CanCOLD, Canadian Cohort Obstructive Lung Disease; HGVS, human genome variation society; MAF, minor allele frequency; SLSJ cohort, Saguenay–Lac-Saint-Jean asthma familial cohort
aSNPs identified using data from dbSNP151
bHGVS names and distance from genes’ transcription start sites calculated from human genome version 19 (hg19, GRCh37). †MAF for the SLSJ cohort and the 1000 Genomes project available from UCSC Genome Browser (https://genome.ucsc.edu)
cR square between the most significant genetic variant and the three other ones
eQTL analyses were performed between SNPs of all GWAS-nominated loci for LC and COPD in naïve CD4+ T cell and eosinophil samples from the SLSJ cohort. Following analyses of LC loci, a total of 622 and 1682 significant cis- and trans-eQTLs were found for naïve CD4+ T cell samples respectively and a total of 204 and 1334 cis- and trans-eQTLs were found for eosinophil samples respectively (Additional file 1: Table S4). Following analyses of COPD loci, a total of 148 and 384 significant cis- and trans-eQTLs were found for naïve CD4+ T cell samples respectively and a total of 0 and 355 cis- and trans-eQTLs were found for eosinophil samples respectively. None were identified for the four associated SNP (Additional file 1: Table S5). No documented eQTLs were found in eQTL Catalogue for associated SNPs.
Discussion
Studying GWAS-nominated loci of LC and COPD, this study allowed to identify one new asthma locus (2p24.3 locus near MYCN gene). This gene encodes a transcription factor normally involved in development of multiple organs including the nervous system and lungs, and aberrantly functions as a known oncogene, predominantly in neuroblastoma [27]. Results for LC and COPD loci are in line with the insufficient evidence observed for common biological background between asthma and LC, compared to the known ties between asthma and COPD, as much for the high proportion of individuals affected by both diseases [4], as for the genetic correlation between them [6]. Even considering this correlation, a previous study of the UK Biobank team including 28,628 individuals with asthma and 9266 individuals with COPD didn’t find any common genes between results from gene-based association tests for these two diseases [28]. This could be due to the high phenotypic heterogeneity of both diseases and may explain why only one common locus was found in this study.
Neither the gene MYCN proximal to the genetic variants associated with asthma in our study, nor its respective locus, have previously been associated with asthma or allergy in the literature. Moreover, considering all tissues available, no documented eQTLs were found in the eQTL Catalogue for associated SNPs, nor in naïve CD4+ T cells or eosinophil samples from the SLSJ cohort. eQTLS could have helped explain association of the four SNPs with asthma through a possible functional link with proximal or distal genes. However, GTEx databases do not include all isolated cell types as circulating immune cells and thus, eQTLs may be found in others than the ones tested in the SLSJ cohort or listed in GTEx databases and the eQTL Catalogue. According to the ENCODE project, the haplotype block identified in the SLSJ cohort spans several regulatory features including binding sites for the transcription factors E2F1 and JUND. Gene expression studies have linked E2F1 expression levels with childhood and allergic asthma [29, 30] underlying possible implications of this new associated locus in asthma.
The analyses performed on eosinophil and naïve CD4+ T cell samples from the SLSJ cohort have identified several eQTLs for the genetic variants associated with asthma in the discovery cohort. Even though this study did not prove any link for these eQTLs with asthma, as the association of their respective variants has not been replicated successfully, these results are a contribution to the general knowledge on their integration with gene expression in human samples of isolated immune cells, including a rarely studied one, the eosinophils.
Conclusions
This study was based on the hypothesis that genetic backgrounds common between asthma and other respiratory diseases such as LC and COPD will lead to the identification of new heritable loci of interest for this disease. Indeed, our analyses using three cohorts have shown robust associations for a new locus at 2p24.3. Our results suggest that this one warrant further studies in additional independent cohorts as well as functional genetic analyses.
Supplementary Information
Acknowledgements
We wish to thank the participants for their valuable participation in this study. We also thank Dominique Fournier, scientific language expert, for the revision of this manuscript (http://www.serviceslinguistiquesdf.com/home) as well as Anne-Marie Boucher-Lafleur and Lucile Pain for the isolation of naïve CD4+ T cells and eosinophils. Catherine Laprise is part of the Quebec Respiratory Health Network (RHN; https://rsr-qc.ca/en/), the Director of the Centre intersectoriel en santé durable de l’UQAC as for the Canada Research Chair on Environment and Genetics of Respiratory Diseases and Allergy (http://www.chairs.gc.ca).
Abbreviations
- ACO
Asthma-COPD overlap
- BD
Bronchodilator
- CanCOLD
The Canadian Cohort Obstructive Lung Disease
- COPD
Chronic obstructive pulmonary disease
- EGEA
The French Epidemiological study on the Genetics and Environment of Asthma
- eQTL
Expression quantitative trait locus
- FEV1
Forced expiratory volume in 1 second
- FVC
Forced vital capacity
- GWAS
Genome-wide association study
- LC
Lung cancer
- MYCN
MYCN proto-oncogene gene
- PC20
Methacholine challenge that induce a 20% fall in FEV1
- SLSJ
Saguenay–Lac-Saint-Jean
- WHO
World Health Organization
- WISARD
Workbench for Integrated Superfast Association study with Related Data
Authors' contributions
AMM: analysis of data for the SLSJ cohort and the meta-analysis, interpretation of data and drafting the manuscript. YB: analysis of data for the CanCOLD cohort. PM-J: analysis of data for the EGEA cohort. EV: revising the manuscript. WLL: design of the study and revising the manuscript, E Bouzigon: analysis of data for the EGEA cohort and revising the manuscript. JB: design of the CanCOLD cohort, acquisition of data and revising the manuscript. CL: design of the study, design of the SLSJ cohort and acquisition of data, supervision of the analysis and contribution to paper redaction, revising the manuscript, final approval of the manuscript version to be published. All authors read and approved the final manuscript.
Funding
This study had the support from the Canada Research Chair on Environment and Genetics of Respiratory Diseases and Allergy and operating grants from the Canadian Institutes of Health Research (CIHR #133605; Laprise). The GWAS data were made available by the European Commission as part of GABRIEL (A multidisciplinary study to identify the genetic and environmental causes of asthma in the European Community), contract number 018996 under the Integrated Program LSH-2004-1.2.5-1 Post genomic approaches to understand the molecular basis of asthma aiming at a preventive or therapeutic control. This study makes use of data generated by the UK10K Consortium. A full list of the investigators who contributed in generating the data is available from www.UK10K.org. Funding for UK10K was provided by the Wellcome Trust under award WT091310.
Availability of data and materials
Datasets used or analyzed during the current study are available from the corresponding author on reasonable request. The data are not publicly available due to ethical reasons.
Declarations
Ethics approval and consent to participate
All participants (or kin or guardians of minors) gave informed written consent and the study was approved by the Centre intégré universitaire de santé et de services sociaux du Saguenay–Lac-Saint-Jean ethics committee (project #0002-001, 08-11-2005).
Consent for publication
Not applicable.
Competing interests
The authors have no competing interests to declare.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Hernandez-Pacheco N, Pino-Yanes M, Flores C. Genomic predictors of asthma phenotypes and treatment response. Front Pediatr. 2019;7:6. doi: 10.3389/fped.2019.00006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Rosenberger A, Bickeböller H, McCormack V, Brenner DR, Duell EJ, Tjønneland A, et al. Asthma and lung cancer risk: a systematic investigation by the International Lung Cancer Consortium. Carcinogenesis. 2012;33(3):587–597. doi: 10.1093/carcin/bgr307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Denholm R, Crellin E, Arvind A, Quint J. Asthma and lung cancer, after accounting for co-occurring respiratory diseases and allergic conditions: a systematic review protocol. BMJ Open. 2017;7(1):e013637. doi: 10.1136/bmjopen-2016-013637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Marsh SE, Travers J, Weatherall M, Williams MV, Aldington S, Shirtcliffe PM, et al. Proportional classifications of COPD phenotypes. Thorax. 2008;63(9):761–767. doi: 10.1136/thx.2007.089193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gaspar Marques J, Lobato M, Leiria Pinto P, Neuparth N, Carreiro MP. Asthma and COPD "overlap": a treatable trait or common several treatable-traits? Eur Ann Allergy Clin Immunol. 2020;52(4):148–159. doi: 10.23822/EurAnnACI.1764-1489.138. [DOI] [PubMed] [Google Scholar]
- 6.Morales E, Duffy D. Genetics and gene–environment interactions in childhood and adult onset asthma. Front Pediatr. 2019;7:499. doi: 10.3389/fped.2019.00499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Laprise C. The Saguenay–Lac-Saint-Jean asthma familial collection: the genetics of asthma in a young founder population. Genes Immun. 2014;15(4):247–255. doi: 10.1038/gene.2014.12. [DOI] [PubMed] [Google Scholar]
- 8.Standardization of Spirometry, 1994 Update. American Thoracic Society. Am J Respir Crit Care Med. 1995;152(3):1107–36. [DOI] [PubMed]
- 9.Standards for the diagnosis and care of patients with chronic obstructive pulmonary disease (COPD) and asthma. This official statement of the American Thoracic Society was adopted by the ATS Board of Directors, November 1986. Am Rev Respir Dis. 1987;136(1):225–44. [DOI] [PubMed]
- 10.Bourbeau J, Tan WC, Benedetti A, Aaron SD, Chapman KR, Coxson HO, et al. Canadian Cohort Obstructive Lung Disease (CanCOLD): fulfilling the need for longitudinal observational studies in COPD. COPD. 2014;11(2):125–132. doi: 10.3109/15412555.2012.665520. [DOI] [PubMed] [Google Scholar]
- 11.Kauffmann F, Dizier MH, Pin I, Paty E, Gormand F, Vervloet D, et al. Epidemiological study of the genetics and environment of asthma, bronchial hyperresponsiveness, and atopy: phenotype issues. Am J Respir Crit Care Med. 1997;156(4 Pt 2):S123–S129. doi: 10.1164/ajrccm.156.4.12tac9. [DOI] [PubMed] [Google Scholar]
- 12.Madore AM, Pain L, Boucher-Lafleur AM, Morin A, Meloche J, Simon MM, et al. Asthma-associated polymorphisms in 17q12-21 locus modulate methylation and gene expression of GSDMA in naïve CD4(+) T cells. J Genet Genomics. 2020;47(3):171–174. doi: 10.1016/j.jgg.2020.03.002. [DOI] [PubMed] [Google Scholar]
- 13.O'Connell J, Gurdasani D, Delaneau O, Pirastu N, Ulivi S, Cocca M, et al. A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet. 2014;10(4):e1004234. doi: 10.1371/journal.pgen.1004234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5(6):e1000529. doi: 10.1371/journal.pgen.1000529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Thornton T, McPeek MS. Case-control association testing with related individuals: a more powerful quasi-likelihood score test. Am J Hum Genet. 2007;81(2):321–337. doi: 10.1086/519497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lee S, Choi S, Qiao D, Cho M, Silverman EK, Park T, et al. WISARD: workbench for integrated superfast association studies for related datasets. BMC Med Genomics. 2018;11(Suppl 2):39. doi: 10.1186/s12920-018-0345-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Nyholt DR. A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. Am J Hum Genet. 2004;74(4):765–769. doi: 10.1086/383251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics (Oxford, England) 2010;26(17):2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21(2):263–265. doi: 10.1093/bioinformatics/bth457. [DOI] [PubMed] [Google Scholar]
- 21.Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al. The structure of haplotype blocks in the human genome. Science. 2002;296(5576):2225–2229. doi: 10.1126/science.1069424. [DOI] [PubMed] [Google Scholar]
- 22.Shabalin AA. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics. 2012;28(10):1353–1358. doi: 10.1093/bioinformatics/bts163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics. 2012;13:86. doi: 10.1186/1471-2105-13-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Assenov Y, Muller F, Lutsik P, Walter J, Lengauer T, Bock C. Comprehensive analysis of DNA methylation data with RnBeads. Nat Methods. 2014;11(11):1138–1140. doi: 10.1038/nmeth.3115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Leek JT, Storey JD. Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet. 2007;3(9):1724–1735. doi: 10.1371/journal.pgen.0030161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sakornsakolpat P, Prokopenko D, Lamontagne M, Reeve NF, Guyatt AL, Jackson VE, et al. Genetic landscape of chronic obstructive pulmonary disease identifies heterogeneous cell-type and phenotype associations. Nat Genet. 2019;51(3):494–505. doi: 10.1038/s41588-018-0342-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zhao E, Hou J, Cui H. Serine-glycine-one-carbon metabolism: vulnerabilities in MYCN-amplified neuroblastoma. Oncogenesis. 2020;9(2):14. doi: 10.1038/s41389-020-0200-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wang L, Fang R, Zhu M, Qin N, Wang Y, Fan J, et al. Integrated gene-based and pathway analyses using UK Biobank data identify novel genes for chronic respiratory diseases. Gene. 2021;767:145287. doi: 10.1016/j.gene.2020.145287. [DOI] [PubMed] [Google Scholar]
- 29.Xu W. Expression data analysis to identify biomarkers associated with asthma in children. Int J Genomics. 2014;2014:165175. doi: 10.1155/2014/165175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Yang IV, Tomfohr J, Singh J, Foss CM, Marshall HE, Que LG, et al. The clinical and environmental determinants of airway transcriptional profiles in allergic asthma. Am J Respir Crit Care Med. 2012;185(6):620–627. doi: 10.1164/rccm.201108-1503OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Datasets used or analyzed during the current study are available from the corresponding author on reasonable request. The data are not publicly available due to ethical reasons.