Abstract
Background
More than 200 asthma-associated genetic variants have been identified in genome-wide association studies (GWASs). Expression quantitative trait loci (eQTL) data resources can help identify causal genes of the GWAS signals, but it can be difficult to find an eQTL that reflects the disease state because most eQTL data are obtained from normal healthy subjects.
Methods
We performed a blood eQTL analysis using transcriptomic and genotypic data from 433 Korean asthma patients. To identify asthma-related genes, we carried out colocalization, Summary-based Mendelian Randomization (SMR) analysis, and Transcriptome-Wide Association Study (TWAS) using the results of asthma GWASs and eQTL data. In addition, we compared the results of disease eQTL data and asthma-related genes with two normal blood eQTL data from Genotype-Tissue Expression (GTEx) project and a Japanese study.
Results
We identified 340,274 cis-eQTL and 2,875 eGenes from asthmatic eQTL analysis. We compared the disease eQTL results with GTEx and a Japanese study and found that 64.1% of the 2,875 eGenes overlapped with the GTEx eGenes and 39.0% with the Japanese eGenes. Following the integrated analysis of the asthmatic eQTL data with asthma GWASs, using colocalization and SMR methods, we identified 15 asthma-related genes specific to the Korean asthmatic eQTL data.
Conclusions
We provided Korean asthmatic cis-eQTL data and identified asthma-related genes by integrating them with GWAS data. In addition, we suggested these asthma-related genes as therapeutic targets for asthma. We envisage that our findings will contribute to understanding the etiological mechanisms of asthma and provide novel therapeutic targets.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12920-023-01677-7.
Keywords: Asthma, Expression quantitative trait loci, Genome-wide association study, Colocalization, Summary-based Mendelian Randomization, Transcriptome-wide association study
Background
Asthma is one of the most common chronic respiratory diseases worldwide, which affected 262 million people in 2019 [1] and is a complex disease with multiple endotypes and different underlying pathological mechanisms [2]. Both genetic and environmental factors contribute to asthma development, and the heritability of asthma is 35–95% [3]. Large population-based genome-wide association studies (GWASs) have identified more than 200 lead single-nucleotide polymorphisms (SNPs) associated with asthma or asthma subtypes [4–10]. However, most asthma GWAS SNPs are located in non-coding regions; therefore, it is unclear which genes are causative, leading to a lack of understanding of the biological pathways involved in asthma.
Expression quantitative trait loci (eQTL) are genetic variants that affect the expression of genes and are discovered via association analyses between genetic variants and gene expression levels [11]. eQTL data have been used to identify causal genes from the associated SNPs of GWASs by linking the genetic variant to changes in the gene expression, which consequently helps to understand the underlying biological pathways [12–14]. To better serve this, several statistical methods have been developed to integrate GWAS results with eQTL data. COLOC is a statistical framework that estimates the probability of sharing the same causal genetic variants in GWASs and eQTL [15]. Summary-based Mendelian Randomization (SMR) is another method used to identify the causative genes from GWAS results by assessing the association between the variations in gene expressions and a trait; SMR is free from the confounding of non-genetic factors [16]. In addition, the transcriptome-wide association study (TWAS) is a statistical approach for identifying candidate causal genes using the results of GWASs and eQTL data [17]. In the TWAS analysis, gene expression levels are imputed by eQTLs for a gene, and the association between the imputed gene expression levels and a trait is tested to estimate the statistical significance of each gene-disease association. Recent studies have further integrated GWAS results with eQTL, protein quantitative trait loci (pQTL), and epigenetic data to identify target genes [18, 19].
The eQTL data resources for various human tissues have increased [20, 21]; however, most eQTL resources are constructed from healthy human subjects. The regulation of gene expression has been shown to be context-specific; therefore, gene expression may differ depending on disease states, as well as tissue and cell types [22, 23]. Large eQTL studies across various human tissues have been conducted by the Genotype-Tissue Expression (GTEx) consortium [20], and tissue- or cell-type eQTL databases have been analyzed to identify the causative genes from GWAS results [23, 24]. A few disease-specific eQTL databases have been constructed, however, the usefulness of these has not been evaluated well [25–27]. Certain eQTL effects may be influenced by disease status, therefore this eQTL would not be identified by the eQTL data constructed from healthy subjects.
In the present study, we performed an eQTL analysis using RNA-seq data of peripheral blood mononuclear cells (PBMCs) from 433 Korean patients with asthma, as well as their genotype data. We compared the results of the asthmatic eQTL analysis with two healthy blood eQTL data and found not only substantial overlaps but also unique eQTLs in the asthma patients. Furthermore, we integrated the asthma eQTLs with two asthma GWAS summaries and identified asthma susceptibility genes related to Korean asthmatic eQTL data. These asthma eQTL data can be used to further identify asthma susceptibility genes from asthma GWASs, resulting in enhanced knowledge regarding the regulatory pathways and biological processes of asthma.
Methods
Study subjects
A total of 436 Korean patients with asthma were obtained from the Allergy Asthma Center of the Asan Medical Center, Seoul, Republic of Korea between January 2005 and December 2018. The inclusion criteria were as follows: 1) symptoms such as dyspnea, wheezing or cough for more than three months; and 2) airway hyperresponsiveness, as indicated by a 20% reduction in forced expiratory volume in 1 s (FEV1) with a methacholine dose of 16 mg/mL [provocation concentration (PC) causing a 20% fall in FEV1; PC20] through a provocation test or airway reversibility, as indicated by an increase in FEV1 of > 12% (and at least 200 mL) following the inhalation of a short-acting β-agonist. Patients with severe lung damage, bronchiectasis or a history of lung resection were excluded.
Blood samples from each subject were collected for the genotyping and transcriptomic analysis. PBMCs were isolated from whole blood samples (2 tubes of 10 ml each) collected in heparin tubes (Becton, Dickinson and Company, USA) and purified according to the manufacturer’s recommendations for Lymphosep (Biowest, USA). The PBMCs were then stored in a -80°C deep freezer.
The study was approved by the Institutional Review Board of Asan Medical Center (2019–1676), adhered to the guidelines of the declaration of Helsinki. All the subjects or their legal guardians were informed about the study and they provided written informed consent. No potentially identifiable human data is presented in this study.
Genotyping and imputation
Korean asthma patients were genotyped using the Korea Biobank Array, which was designed by the Korea National Institute of Health (KNIH), and based on the UK Biobank Axiom Array [28]. Low-quality SNPs were removed based on the following exclusion criteria: (1) poor clustering in the SNPolisher analysis, (2) genotype call rates < 95%, and (3) Hardy–Weinberg equilibrium P-value (HWE P) < 1 × 10–6. The genotype call rates and HWE P-values were calculated using PLINK v.1.9.0 [29]. And we excluded samples based on the following exclusion criteria: (1) low genotype call rates (< 95%), (2) deviating ± 3 standard deviation (SD) from the mean heterozygosity rate, (3) gender discrepancy, and (4) cryptic relatedness. All sample quality control (QC) process were performed using PLINK v.1.9.0, and three individuals were filtered. After QC was conducted, SNP imputation was performed using the Michigan imputation server [30]. Phasing and genotype imputations were performed using Eagle2 [31] and Minimac4 [32], respectively, and the ‘non-European’ or ‘mixed’ populations of the Haplotype Reference Consortium (HRC) were used as the reference panel [33]. We filtered out the low imputation quality SNPs, from the imputed SNPs, as follows: (1) low imputation quality score (Rsq < 0.3), (2) duplicated SNP, (3) HWE P < 1 × 10–6, (4) genotype call rates < 95%, and (5) minor allele frequency (MAF) < 1%. Finally, 6,676,632 SNPs for 433 samples were used further analysis.
Transcriptome data processing
Total RNA sequencing was performed using the Swift RNA Library Kit and IDT Exome v2.0, according to the manufacturer’s instructions. Total RNA purity was determined by assaying 1 µL of the total RNA extract on a NanoDrop8000 spectrophotometer, and the integrity was checked using an Agilent Technologies 2100 Bioanalyzer, with an RNA integrity number value and the percentage of RNA fragments > 200 nt fragment distribution value. RNA sequencing was performed using an Illumina NovaSeq 6000 system, according to the protocols provided for 2 × 100 bp read sequencing. Quality trimming and adapter clipping of the reads were conducted using the Trimmomatic software [34]. Read alignment and quantification were performed using HISAT2 [35], SAM tools [36] and HTseq [37] with the Ensembl transcriptome reference (hg19/GRCh37).
Normalization and quality control of the gene expressions
The gene expressions were quantified in transcripts per million (TPM) using GENCODE v19 [38]. A gene was allowed for downstream analyses if its expression in at least 20% of the samples was greater than 0.1 TPM and the read counts were greater than six. The expression values were normalized between the samples using the trimmed mean of M-values in edgeR [39], and each gene was normalized across the samples using inverse normal transformation.
cis-eQTL analysis
To investigate the effects of genetic variation on gene expression in 433 Korean patients with asthma, we performed cis-eQTL analysis mapping using FastQTL [40]. After filtering out genes with low expression levels, we tested the association between normalized gene expression levels and genotypes of SNPs, adjusting for sex and 60 PEER factors according to the GTEx methods (https://gtexportal.org/home/methods) [41] as covariates. Only SNPs within a 1 Mb window from the transcription start site (TSS) of every gene were tested for cis-eQTL analysis.
FastQTL provides a permutation test to correct for multiple testing, therefore we performed an adaptive permutation scheme with the number of permutation iterations between 1,000 and 10,000, using the “–permute 1000 10,000” option. The beta distribution-adjusted empirical P-values were used to calculate false discovery rates (FDRs) using Storey’s method [42], and any gene with an FDR ≤ 5% was defined as an eGene whose expression was regulated by at least one SNP. To identify all the significant gene-SNP pairs associated with the expression of a particular gene, the nominal P-value thresholds of each eGene were defined as Pg, threshold = Fg,min−1(Pt), where Pt is the empirical P-value of the gene closest to the FDR of 5% and F−1 is the inverse cumulative distribution. All the SNPs with nominal P-values below the threshold for their target genes were defined as eSNPs that regulate the expression of certain genes.
Comparison with the blood eQTL data of normal subjects
We compared our eQTL results against existing blood eQTL data from GTEx [43] and a Japanese study [44] to determine whether our eQTL signals overlapped with those of normal subjects or were asthma patient-specific. GTEx contains DNA and RNA-seq data from 838 individuals over 49 tissues and provides 8,663 eGenes and 1,052,542 cis-eQTLs for whole blood samples in GTEx v7. The Japanese study conducted an eQTL analysis on five immune cells and the whole blood from 105 healthy Japanese volunteers and provided 3,386 eGenes and 335,813 cis-eQTLs for whole blood. We identified the overlapping eGenes and compared the direction of the allelic effects of the overlapped cis-eQTLs between the eQTL data resources.
Colocalization, SMR and TWAS analysis
To consider ethnic differences, we collected two asthma-GWAS summaries that originated from European (UK Biobank) [4] and Japanese ancestry [45] We then identified GWAS-lead SNPs in the genomic risk loci from each GWAS summary using the FUMA program, which provides functional annotation, visualization, and interpretation of GWAS results [46]. The GWAS-lead SNPs were clumped with following criteria: P < 5 × 10–8, distance between linkage disequilibrium (LD) blocks > 250 kb, and r2 < 0.1 from the 1,000 Genome Project phase 3 European panel [46]. Colocalization analyses between one of the three blood eQTL datasets (Korean asthma eQTL, GTEx, and Japanese) and the corresponding GWAS-lead SNPs were conducted using COLOC from the R package [15]. We selected target genes for the colocalization analyses with the following criteria: the TSS and the eQTL lead SNP of the gene were located within 0.5 Mb of the GWAS lead SNP in the GWAS risk locus. All the SNPs located between the GWAS lead SNP and the eQTL lead SNP or located in the 0.5 Mb extension on either side, were used for the analysis. We applied a posterior probability of colocalization (PP4) > 0.5 as the threshold for the colocalization significance. Significant GWAS-eQTL colocalization events were visualized using LocusCompareR in R [47].
We performed an SMR analysis with asthma-GWAS summary statistics and blood eQTLs using the SMR software [16]. The LD was estimated using a 1000 genome reference panel (European and East Asian) [48], and the major histocompatibility complex (MHC) region (chromosome 6:28.5 Mb ~ 33.5 Mb in hg19) was excluded from the analysis. We applied the Bonferroni correction and heterogeneity-independent instrument (HEIDI) threshold to exclude false positives and confounding effects.
In addition, we performed a TWAS analysis using the FUSION program [49]. Two inputs were required for FUSION program as follows: (1) GWAS summary statistics and (2) gene expression weights for each gene with significant heritability. We conducted the TWAS using two asthma-GWAS summaries from European and Japanese population, and two eQTL datasets, Korean asthmatic and GTEx eQTL data. The pre-computed gene expression weights for GTEx were provided in the FUSION program. For TWAS using Korean asthmatic data, we computed gene expression weight using individual genotype data and eQTL dataset, and succeeded in calculating gene expression weights for 1,160 genes that passed the gene expression heritability threshold (Ph2 < 0.01). However, due to the unavailability of individual genotype data for Japanese eQTL, we did not perform TWAS for Japanese eQTL. We applied the Bonferroni correction to the results of TWAS (P < 4.31 × 10–5) and performed replication analysis using the GTEx dataset for significant TWAS genes.
Druggable target genes
Using the drug-gene interaction database (DGIdb) [50], 15 asthma-related genes were mapped to licensed drugs to identify the druggable target genes. Information on drug indications for the target gene was obtained from CheMBL [51] and Drug Bank [52]. To identify the associations between the drugs and asthma, the identified drugs were queried on the Open Targets Platform [53]. We analyzed protein–protein interaction (PPI) between target genes using the STRING database [54].
Results
Cis-eQTL analysis of Korean asthma patients
A total of 433 Korean asthma patients were used for the eQTL analysis, and their basic characteristics are shown in Table 1. The mean age of these asthma patients, at diagnosis, was 45.3 ± 16.2 years, and 40.4% were male. Among them, 42.9% were current or ever smokers and 45.7% had atopy. The mean FEV1 and FEV1/forced vital capacity (FVC) were 75.2 ± 18.0% and 72.7 ± 13.4%, respectively. The mean blood eosinophil count and total IgE levels were 431.3 ± 448.6 (cells/µL) and 426.0 ± 686.7 IU/mL, respectively. Of the patients, 74.1% had blood eosinophils greater than or equal to 150 cells/µL, and 66.8% had total IgE greater than or equal to 100 IU/mL. Two hundred and twenty-five patients (53.8%) used a medium or high dose of inhaled corticosteroids (ICS).
Table 1.
Characteristics | Korean asthma patients mean ± SD / N (%) |
---|---|
Age, years | 49.5 ± 15.6 |
Sex (%) | |
Male | 175 (40.4) |
Age at diagnosis, yearsa | 45.3 ± 16.2 |
Age group at diagnosisa | |
0 ~ 19 | 23 |
≥ 20 | 338 |
Smoking status (%) | |
Current | 53 (12.2) |
Ever | 133 (30.7) |
Never | 247 (57.0) |
Atopy (%) | 198 (45.7) |
FEV1 pred.a (%) | 75.4 ± 18.0 |
< 80 | 223 (53.7) |
≥ 80 | 192 (46.3) |
FEV1/FVCa (%) | 72.8 ± 13.4 |
< 70 | 166 (40.0) |
≥ 70 | 249 (60.0) |
Blood eosinophil counta (cells/µL) | 431.3 ± 448.6 |
< 150 | 110 (25.9) |
≥ 150 | 314 (74.1) |
Dose of ICSa | |
Medium or high dose | 225 (53.8) |
Low dose | 193 (46.2) |
Total IgEa (IU/mL) | 426.0 ± 686.7 |
< 100 | 128 (33.2) |
≥ 100 | 257 (66.8) |
All data are presented as the mean ± standard deviation (SD) or the number of participants (%)
aparticipants available with this data were analyzed
FEV1 pred predicted forced expiratory volume in 1 s, FVC Forced vital capacity, ICS Inhaled corticosteroids
We performed the cis-eQTL analysis with RNA-seq of PBMCs from the asthma patients using FastQTL [40]. Following QC for genotype and gene expressions, 6,676,632 SNPs and 24,066 genes were chosen for the analysis. After applying FDR < 0.05 as an eQTL P-value threshold, 267,691 SNPs (eSNPs) had at least one target gene (eGene), and 2,875 genes (eGenes) had at least one expression-associated SNP (eSNP); consequently 340,274 eGene-eSNP pairs (cis-eQTLs) were discovered (Additional File 1: Table S1). Of the 267,691 eSNPs, 223,578 (83.5%) had one target gene, 29,794 (11.1%) had two target genes, 8,346 (3.1%) had three target genes, and 5,974 (2.2%) had more than four target genes (Fig. 1A). Among the 340,274 eGene-eSNP pairs, the distance from the TSS of the target gene to the eSNP was within 500 kb in 95.4% (324,541 pairs), within 250 kb in 85.3% (290,253 pairs), within 100 kb in 60.0% (204,174 pairs), and within 50 kb in 39.0% (132,669 pairs) (Fig. 1B). The 2,875 eGenes were composed of 2,562 protein-coding genes (89.1%), 257 pseudogenes (8.9%), 50 non-coding RNAs (1.7%), and 5 other types of genes (0.2%) (Fig. 1C).
We compared our blood eQTL results from asthmatic patients with healthy subjects. For this comparison, we used two blood eQTL datasets from a GTEx v7 study [43] and a Japanese study [44]. The total numbers of eGenes from the GTEx and Japanese datasets were 8,663 and 3,386, respectively. Among the 2,875 eGenes from the Korean asthmatic eQTL, 64.1% (1,844 of 2,875 eGenes) overlapped with the GTEx eGenes and 39.0% (1,122 of 2,875 eGenes) overlapped with the Japanese eGenes. For reference, 69.6% of the eGenes from the Japanese data (2,355/3,386) overlapped with the GTEx eQTL data. We also compared the direction of eSNP allelic effects on the target gene expressions between the asthmatic and normal eQTL data. There were 1,052,542 eGene-eSNP pairs in the GTEx and 335,813 eGene-eSNP pairs in the Japanese population. Among the 340,274 eGene-eSNP pairs from the Korean asthmatic eQTL data, 98,302 eGene-eSNP pairs were shared with GTEx and 71,533 eGene-eSNP pairs were shared with the Japanese pairs. Among the shared eGene-eSNP pairs, approximately 93.1% (91,508/98,302 pairs) for GTEx and 97.5% (69,758/71,533 pairs) for Japanese showed the same direction of allelic effects on gene expression as the Korean asthmatic eQTL (Fig. 2). As a reference, 142,923 eGene-eSNP pairs were shared between the GTEx and Japanese eQTL data, and 99.2% (141,842 pairs) showed allelic effects in the same direction (Fig. 2). As shown in Additional File 2: Table S2, we also found eQTL results showing allelic effects in the opposite direction, between the asthmatic and normal eQTL data as follows: 6.9% (6,794 pairs from 155 eGenes) of 98,302 pairs for GTEx, and 2.5% (1,775 pairs from 20 eGenes) of 71,533 pairs for the Japanese eQTL. Among the 155 eGenes from the GTEx data and 20 eGenes from the Japanese data, showing the opposite direction of the eQTL effect, 15 genes, including CEACAM21, NDE1, LGALS8, and AC124944.3, overlapped between them (Additional File 2: Table S2).
Identification of the asthma-susceptibility genes from the Korean asthmatic blood eQTL data using COLOC, SMR and TWAS analysis
We performed a colocalization analysis to identify the target genes corresponding to the genetic variants from the asthma GWAS using the asthmatic eQTL data. We applied the COLOC method [15] to the three blood cis-eQTL datasets (Korean asthma, GTEx, and Japanese) and two asthma-GWAS summary statistics using European participants [4] and East Asian ancestries [45]. The European asthma-GWAS analyzed 64,538 asthma cases and 239,321 controls from the UK Biobank and identified 148 lead SNPs [4], whereas the Japanese asthma-GWAS analyzed 8,216 cases and 201,592 controls from a Japanese cohort and identified 11 lead SNPs [45] (Additional File 2: Table S3 and S4). We conducted the colocalization analysis by applying the criteria described in the Methods section and defined the GWAS colocalized genes with a PP4 greater than 0.5.
We identified 39 genes in the Korean asthmatic eQTL data with evidence of colocalization between the eQTL and European asthma-GWAS signals (36 genes) or Japanese asthma-GWAS signals (4 genes) (Additional File 2: Table S5). Among 39 genes, SUOX gene were identified both in European and Japanese asthma-GWASs. In addition, 59 and 39 genes were identified as having significant colocalization, from the normal eQTLs of GTEx and the Japanese study, respectively, using both the European and Japanese asthma-GWAS summaries (Additional File 2: Table S5). Of the 39 colocalized genes identified in the Korean asthmatic eQTL data, 20 genes were also significant in the colocalization results of the GTEx or Japanese eQTL datasets, 15 genes were not available in the GTEx or Japanese eQTL datasets, and the remaining 9 genes were found to be significant only in the Korean asthmatic eQTL dataset (Table 2, Fig. 3, and Additional File 3: Fig. S1-S9). For example, an association signal around rs2070901 located on chromosome 1 showed a clear colocalization pattern with the FCRG3A gene only in the Korean asthmatic eQTL data (Fig. 3). In the normal eQTL data of GTEx and the Japanese study, strong eQTL lead variants were identified in the LD block, different from that of the asthmatic eQTL data, with modest GWAS P-values. As a reference, the colocalization results for GTEx and the Japanese study showed 17 overlapping genes (Additional File 2: Table S6). Among these 17 overlapping genes, 10 were also colocalized in the Korean asthmatic eQTL data. Among the remaining seven genes, four were not available, and three did not show significant colocalization in the Korean asthmatic eQTL data.
Table 2.
Study ID | Ensemble ID | Gene symbol | Var num | PP4 Korean asthma | PP4 GTEx | PP4 Japanese |
---|---|---|---|---|---|---|
GCST010042 | ENSG00000124920 | MYRF | 796 | 0.890 | 3.25 × 10–10 | 0.304 |
GCST010042 | ENSG00000185800 | DMWD | 1181 | 0.817 | 0.114 | 0.109 |
GCST010042 | ENSG00000186088 | GSAP | 1097 | 0.979 | 0.002 | 0.096 |
GCST010042 | ENSG00000173917 | HOXB2 | 961 | 0.927 | 0.116 | 0.074 |
GCST010042 | ENSG00000104147 | OIP5 | 991 | 0.755 | 0.211 | 0.069 |
GCST010042 | ENSG00000163608 | NEPRO | 1249 | 0.654 | 3.49 × 10–6 | 0.063 |
GCST010042 | ENSG00000203747 | FCGR3A | 1281 | 0.619 | 9.39 × 10–6 | 0.053 |
GCST010042 | ENSG00000197070 | ARRDC1 | 711 | 0.982 | 2.90 × 10–5 | 0.016 |
GCST010042 | ENSG00000260261 | AC124944.3 | 1163 | 0.994 | 0.143 | 6.58 × 10–4 |
Study ID ID of the paper which provided asthma-GWAS summary, Var num number of genetic variants used in the colocalization analysis, PP4 Posterior probability of colocalization, NA Not available because gene was out of analysis criteria or does not exist in the dataset
To identify asthma-associated genes using the asthmatic eQTL data, we additionally performed an SMR analysis using the asthmatic eQTL data and two asthma-GWAS summaries from Europe and Japan. SMR is generally used to discover genes satisfying the hypothesis that genetic variants affecting the risk of disease may act via changes in the expression levels of certain genes [16]. We applied the Bonferroni correction and HEIDI method for the SMR analysis, to reduce the likelihood of false-positive results.
After applying the Bonferroni correction (Psmr < 2.08 × 10–6) and HEIDI threshold (PHEIDI > 0.01), a total of four SMR genes were identified: three SMR genes (AHI, MED24, and DDX5) with the European asthma-GWAS summary and one SMR gene (RPS26) with the Japanese asthma-GWAS summary (Additional File 2: Table S7). We also performed an SMR analysis using the GTEx and Japanese eQTL datasets by applying the Bonferroni correction (Psmr < 2.57 × 10–6 for GTEx and < 2.27 × 10–6 for Japanese) and HEIDI threshold (PHEIDI > 0.01), and found 18 and one SMR genes, respectively (Additional File 2: Table S7).
Among the four SMR genes in the asthmatic eQTL data, two genes, DDX5 and MED24, were not identified in the SMR analysis of the two normal eQTL data, and have not been reported to be significantly associated with asthma in the SMR analysis. In addition, DDX5 were significant in both the colocalization and SMR analyses (Additional File 2: Table S5 and S7).
As another approach to identifying asthma-associated genes using the asthmatic eQTL data, we performed a TWAS analysis using the FUSION program [49]. TWAS is a gene-based association approach that investigates associations between genetically regulated gene expression and phenotype [17]. We carried out the TWAS analysis using two asthma-GWAS summaries from European and Japanese, and two eQTL datasets, Korean asthmatic and GTEx eQTL data. We tested 1,160 and 8,799 genes that passed the gene expression heritability threshold (Ph2 < 0.01) for Korean asthma and GTEx, respectively.
After applying the Bonferroni correction to the TWAS analysis using Korean asthmatic eQTL dataset, we identified 22 significant genes (P < 4.31 × 10–5). Of these 22 genes, 20 and 3 genes identified from the TWAS results using European and Japanese GWASs, respectively (Additional File 2: Table S8). In addition, the RPS26 gene was found to be significant in both European and Japanese asthma-GWASs. We then performed a replication analysis using GTEx data. Of the 22 TWAS genes, 6 were not available in the GTEx eQTL datasets. Among the remaining 16 genes, 11 were replicated in the GTEx TWAS analysis (P < 3.13 × 10–3 = 0.05/16), while five genes, including INPP5B, LCLAT1, MYRF, TUBG2 and CASTOR3, were significant only in the Korean asthmatic dataset. Notably, the MYRF gene was also identified in colocalization analysis (Additional File 2: Table S5 and S8).
Asthma-related genes specific to the Korean asthmatic blood eQTL data as therapeutic targets
In this study, using the asthmatic blood eQTL, we identified 15 asthma-related genes; two from the SMR, four from the TWAS, eight from the colocalization analysis, and one from both TWAS and colocalization analysis. To investigate the druggable genes, we mapped the 15 asthma-related genes onto licensed drugs or compounds from the DGIdb [50]. Of all the asthma-related genes, only FCGR3A (Fc gamma receptor IIIa) was found to interact with 24 drugs. We again investigated the 24 drugs in Drug Bank [52] and CheMBL [51], and confirmed that FCGR3A was associated with six asthma drugs that were approved or under clinical trials (Additional File 2: Table S9). Among these drugs, prednisolone, an approved asthma drug that targets the NR3C1, gene was found to interact with FCGR3A. We analyzed FCGR3A-NR3C1 interaction using STRING, a database of known and predicted protein–protein interactions. While no direct interaction was observed between FCGR3A and NR3C1, we identified an interaction network comprising the path FCGR3A—FCER1G—ZAP70—EGFR—NR3C1 based on known PPI results from curated databases and experiments. Additionally, three asthma drugs undergoing clinical trials were found to interact with FCGR3A, despite targeting TNF (adalimumab and etanercept) and PTGS1/PTGS2 (indomethacin). No direct interaction was observed between FCGR3A and TNF or between FCGR3A and PTGS1/PTGS2. However, similar to the PPI result of FCGR3A and NR3C1, we identified an interaction network comprising the path FCGR3A—FCER1G—ZAP70—TNF. Furthermore, we found an interaction network comprising the path FCGR3A—SRC—PTGS1/PTGS2 based on the known PPI results from curated databases and experiment.
Discussion
The main aim of this study was to identify asthma-related genes from asthmatic eQTL data, given that gene expression may change in a context-specific manner depending on the disease status. We identified 15 asthma-related genes specific to the asthmatic eQTL data, by three methods of colocalization, SMR and TWAS, using two asthma-GWAS summary statistics from European and Japanese populations, and suggested these genes as therapeutic targets for asthma.
Among the 9 colocalization genes specific to Korean asthma, FCGR3A is well known for its association with various innate immune responses [55], and has been reported to interact with certain approved asthma drugs (Additional File 2: Table S9). As shown in Fig. 3, FCGR3A was highly colocalized with the eQTL and GWAS variants only in the Korean asthmatic eQTL data. In the normal blood eQTL datasets, eQTL variants around the GWAS-lead SNP rs2070901 were not found to be significant. The eQTL P-values of rs2070901 were 0.0014, 0.53, and 0.07 in the Korean, GTEx, and Japanese, respectively. Additionally, ARRDC1 (arrestin domain containing 1) is located in the extracellular vesicles (EVs), which are known to contribute to the pathogenesis of asthma via various mechanisms related to both inflammation and pathological remodeling [56].
Interestingly, among the colocalization genes, four genes (MYRF, DMWD, GSAP, and NEPRO) were involved in neuronal survival and proliferation. MYRF (myelin regulatory factor) encodes a transcription factor required for myelination of the central nervous system. This gene is located near the fatty acid desaturase (FADS) gene cluster and is associated with an increased risk of inflammation [57]. DMWD (DM1 locus, WD repeat containing) is predicted to be located in the dendrites, nuclei, and perikarya of neurons, however its function is poorly characterized. The amino acid sequence of DMWD is similar to that of WD repeat domain 20 (WDR20), which is associated with childhood asthma [58]. In addition, GSAP (gamma-secretase activating protein) plays an important role in Alzheimer's disease by regulating lipid homeostasis and mitochondrial function [59], but its functional relevance in asthma is not largely unknown. NEPRO (nucleolus and neural progenitor protein) is predicted to be located in the nucleolus and is known to act as a Notch effector for the development and maintenance of neural progenitor cells in the neocortex [60]. We do not know how to link the functions of these genes to asthma, and future studies are required to determine their underlying mechanisms. The remaining colocalization genes belong to embryo development (HOXB2) [61], cell division (OIP5) [62], and one with an unknown function (AC124944.3).
Among the two asthma-related genes identified through the SMR analysis, MED24 is associated with increased concentrations of phosphatidylcholine, a metabolite that is positively associated with asthma [63]. In addition, DDX5 is known to be a hub gene of the cilia module in asthma and a regulator of ERBB2, which is associated with epithelial repair processes in asthma patients [64, 65].
Through a TWAS analysis, five genes related to asthma were identified, including the MYRF gene which was also identified in colocalization analysis. The INPP5B gene encodes an inositol polyphosphate-5-phosphatase B protein that regulates calcium signaling, which is involved in inflammatory cellular responses and interactive signaling pathways that mediate the development of asthma [66]. The LCLAT1 gene encodes a lysocardiolipin acyltransferase 1 protein that is involved in phosphatidylinositol acyl-chain remodeling. Researchers have found that changes in microRNA (miRNA) expression can contribute to the pathogenesis of asthma, and there is a negative correlation between LCLAT1 and miRNAs (miR-199b-5p and miR-223-3p) associated with asthma [67]. The CASTOR3 gene regulates amino acid metabolism, which plays a role in various anti-oxidant and immunological activities relevant to asthma pathogenesis. And differences in amino acid levels may be involved in the development of asthma [68]. Additionally, the TUBG2 gene encodes a tubulin gamma-2 chain protein, which is a major constituent of microtubules. Microtubules play a role in smooth muscle cell migration and airway remodeling a key feature of asthma [69].
This study had several limitations. First, our eQTL analysis was performed only in Korean asthma patients only, without comparable eQTL data from normal Koreans. Therefore, we compared our findings with two normal blood eQTL datasets of European and Japanese ancestry. Second, we used two asthma-GWAS summaries of European and Asian ancestry for the colocalization, SMR, and TWAS analysis, with different sample sizes (64,538 cases vs. 8,216 cases) for the asthma GWAS. Both of these limitations can affect the power of the analysis and contribute to bias related to ethnicity due to differences in LD structures between populations. To validate our findings, future analyses will require a comparison with normal Korean eQTL data and a larger sample size for the asthma-GWAS of East Asian ancestry. Third, through the drug target analysis, we discovered that the FCGR3A gene interacts with asthma drugs that have either been approved or are currently undergoing clinical trials. While we did not observe direct interactions between FCGR3A and drug target genes, we identified indirect interaction network based on the curated databases and experiments. These interactions could have potential implications for asthma research and treatment, but further investigation into these interactions could be necessary to fully understand their potential impact. Another limitation is that we used PBMCs, which may not fully capture the disease context. Using asthma-relevant tissues such as bronchial epithelial cells, airway smooth muscle cells, and nasal epithelial cells can provide more accurate insights into the molecular mechanisms of asthma and help identify eQTLs that are specific to these tissues. This limitation may affect our ability to identify eQTLs specific to asthma-relevant tissues and may impact the interpretation of our results, so our findings will need to be confirmed in additional asthma-relevant tissues.
Conclusions
This study provided asthma cis-eQTL data and identified 15 asthma-related genes by integrating them with GWAS data. We envisage that our findings will contribute to understanding the etiological mechanisms of asthma and provide novel therapeutic targets.
Supplementary Information
Acknowledgements
Not applicable.
Abbreviations
- GWASs
Genome-wide association studies
- SNPs
Single-nucleotide polymorphisms
- eQTL
Expression quantitative trait loci
- SMR
Summary-based Mendelian Randomization
- TWAS
Transcriptome-wide association study
- pQTL
Protein quantitative trait loci
- GTEx
Genotype-Tissue Expression
- PBMCs
Peripheral blood mononuclear cells
- FEV1
Forced expiratory volume in 1 second
- PC
Provocation concentration
- KNIH
Korea National Institute of Health
- HWE P
Hardy-Weinberg equilibrium P-value
- SD
Standard deviation
- QC
Quality control
- HRC
Haplotype Reference Consortium
- MAF
Minor allele frequency
- TPM
Transcripts per million
- TSS
Transcription start site
- FDRs
False discovery rates
- LD
Linkage disequilibrium
- PP4
Posterior probability of colocalization
- MHC
Major histocompatibility complex
- HEIDI
Heterogeneity-independent instrument
- DGIdb
Drug-gene interaction database
- PPI
Protein-protein interaction
- FVC
Forced vital capacity
- ICS
Inhaled corticosteroids
- CEACAM21
CEA cell adhesion molecule 21
- NDE1
nudE neurodevelopment protein 1
- LGALS8
Galectin 8
- SUOX
Sulfite oxidase
- FCGR3A
Fc gamma receptor IIIa
- AHI
Abelson helper integration site 1
- MED24
Mediator complex subunit 24
- DDX5
DEAD-box helicase 5
- RPS26
Ribosomal protein S26
- INPP5B
Inositol polyphosphate-5-phosphatase B
- LCLAT1
Lysocardiolipin acyltransferase 1
- MYRF
Myelin regulatory factor
- TUBG2
Tubulin gamma 2
- CASTOR3
CASTOR family member 3
- NR3C1
Nuclear receptor subfamily 3 group C member 1
- FCER1G
Fc epsilon receptor Ig
- ZAP70
Zeta chain of T cell receptor associated protein kinase 70
- EGFR
Epidermal growth factor receptor
- TNF
Tumor necrosis factor
- PTGS1
Prostaglandin-endoperoxide synthase 1
- PTGS2
Prostaglandin-endoperoxide synthase 2
- SRC
SRC proto-oncogene, non-receptor tyrosine kinase
- ARRDC1
Arrestin domain containing 1
- EVs
Extracellular vesicles
- FADS
Fatty acid desaturase
- DMWD
DM1 locus, WD repeat containing
- WDR20
WD repeat domain 20
- GSAP
Gamma-secretase activating protein
- NEPRO
Nucleolus and neural progenitor protein
- HOXB2
Homeobox B2
- OIP5
Opa interacting protein 5
- ERBB2
Erb-b2 receptor tyrosine kinase 2
- miRNA
MicroRNA
Authors’ contributions
All authors contributed in planning of the study and the analyses. DJK, JEL, TBK and BO conceived the original idea for the study. DJK and JEL drafted the manuscript. DJK, SYK, H-UJ and KP curated data, and DJK, JEL, HJ, H-UJ, and SYK conducted the statistical analyses. DJK, JEL, JYC, and EJB contributed to the analytical strategy and data visualization. HKK, J-OK, and SW provided guidance on study design and statistical analyses. All authors read and approved the final manuscript.
Funding
This research was supported by the Bio & Medical Technology Development Program of the National Research Foundation (NRF) funded by the Korean government (MSIT) (2019M3E5D3073365).
Availability of data and materials
The datasets supporting the conclusions of this article are included within the article and its additional files. And the RNA-seq data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession number GSE226825 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE226825).
Declarations
Ethics approval and consent to participate
The study was approved by the Institutional Review Board of Asan Medical Center (2019–1676), adhered to the guidelines of the declaration of Helsinki. All the subjects or their legal guardians were informed about the study and they provided written informed consent. No potentially identifiable human data is presented in this study.
Consent for publication
Not applicable.
Competing interests
DJK, JEL, H-UJ, JYC, HJ, SYK, HKK, J-OK, KP, SW, T-BK declare no conflicts of interest. EJB is an employee in Mendel Inc.. BO is Chief Executive Officer of Mendel Inc.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Dong Jun Kim and Ji Eun Lim contributed equally to this work.
Contributor Information
Tae-Bum Kim, Email: tbkim@amc.seoul.kr.
Bermseok Oh, Email: ohbs@khu.ac.kr.
References
- 1.Diseases GBD, Injuries C. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet. 2020;396(10258):1204–1222. doi: 10.1016/S0140-6736(20)30925-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Pembrey L, Barreto ML, Douwes J, Cooper P, Henderson J, Mpairwe H, Ardura-Garcia C, Chico M, Brooks C, Cruz AA et al: Understanding asthma phenotypes: the World Asthma Phenotypes (WASP) international collaboration. ERJ Open Res 2018, 4(3). [DOI] [PMC free article] [PubMed]
- 3.Ober C, Yao TC. The genetics of asthma and allergic disease: a 21st century perspective. Immunol Rev. 2011;242(1):10–30. doi: 10.1111/j.1600-065X.2011.01029.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Han Y, Jia Q, Jahani PS, Hurrell BP, Pan C, Huang P, Gukasyan J, Woodward NC, Eskin E, Gilliland FD, et al. Genome-wide analysis highlights contribution of immune system pathways to the genetic architecture of asthma. Nat Commun. 2020;11(1):1776. doi: 10.1038/s41467-020-15649-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ferreira MAR, Mathur R, Vonk JM, Szwajda A, Brumpton B, Granell R, Brew BK, Ullemar V, Lu Y, Jiang Y, et al. Genetic Architectures of Childhood- and Adult-Onset Asthma Are Partly Distinct. Am J Hum Genet. 2019;104(4):665–684. doi: 10.1016/j.ajhg.2019.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pividori M, Schoettler N, Nicolae DL, Ober C, Im HK. Shared and distinct genetic risk factors for childhood-onset and adult-onset asthma: genome-wide and transcriptome-wide studies. Lancet Respir Med. 2019;7(6):509–522. doi: 10.1016/S2213-2600(19)30055-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Olafsdottir TA, Theodors F, Bjarnadottir K, Bjornsdottir US, Agustsdottir AB, Stefansson OA, Ivarsdottir EV, Sigurdsson JK, Benonisdottir S, Eyjolfsson GI, et al. Eighty-eight variants highlight the role of T cell regulation and airway remodeling in asthma pathogenesis. Nat Commun. 2020;11(1):393. doi: 10.1038/s41467-019-14144-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Shrine N, Portelli MA, John C, Soler Artigas M, Bennett N, Hall R, Lewis J, Henry AP, Billington CK, Ahmad A, et al. Moderate-to-severe asthma in individuals of European ancestry: a genome-wide association study. Lancet Respir Med. 2019;7(1):20–34. doi: 10.1016/S2213-2600(18)30389-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Daya M, Rafaels N, Brunetti TM, Chavan S, Levin AM, Shetty A, Gignoux CR, Boorgula MP, Wojcik G, Campbell M, et al. Association study in African-admixed populations across the Americas recapitulates asthma risk loci in non-African populations. Nat Commun. 2019;10(1):880. doi: 10.1038/s41467-019-08469-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zhu Z, Lee PH, Chaffin MD, Chung W, Loh PR, Lu Q, Christiani DC, Liang L. A genome-wide cross-trait analysis from UK Biobank highlights the shared genetic architecture of asthma and allergic diseases. Nat Genet. 2018;50(6):857–864. doi: 10.1038/s41588-018-0121-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Jansen RC, Nap JP. Genetical genomics: the added value from segregation. Trends Genet. 2001;17(7):388–391. doi: 10.1016/s0168-9525(01)02310-1. [DOI] [PubMed] [Google Scholar]
- 12.Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, Torstenson ES, Shah KP, Garcia T, Edwards TL, et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9(1):1825. doi: 10.1038/s41467-018-03621-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gamazon ER, Segre AV, van de Bunt M, Wen X, Xi HS, Hormozdiari F, Ongen H, Konkashbaev A, Derks EM, Aguet F, et al. Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation. Nat Genet. 2018;50(7):956–967. doi: 10.1038/s41588-018-0154-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gusev A, Mancuso N, Won H, Kousi M, Finucane HK, Reshef Y, Song L, Safi A, Schizophrenia Working Group of the Psychiatric Genomics C, McCarroll S et al: Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights. Nat Genet 2018, 50(4):538-548. [DOI] [PMC free article] [PubMed]
- 15.Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, Plagnol V. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10(5):e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, Montgomery GW, Goddard ME, Wray NR, Visscher PM, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48(5):481–487. doi: 10.1038/ng.3538. [DOI] [PubMed] [Google Scholar]
- 17.Wainberg M, Sinnott-Armstrong N, Mancuso N, Barbeira AN, Knowles DA, Golan D, Ermel R, Ruusalepp A, Quertermous T, Hao K, et al. Opportunities and challenges for transcriptome-wide association studies. Nat Genet. 2019;51(4):592–599. doi: 10.1038/s41588-019-0385-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Mountjoy E, Schmidt EM, Carmona M, Schwartzentruber J, Peat G, Miranda A, Fumis L, Hayhurst J, Buniello A, Karim MA, et al. An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat Genet. 2021;53(11):1527–1533. doi: 10.1038/s41588-021-00945-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Richardson TG, Hemani G, Gaunt TR, Relton CL, Davey Smith G. A transcriptome-wide Mendelian randomization study to uncover tissue-dependent regulatory mechanisms across the human phenome. Nat Commun. 2020;11(1):185. doi: 10.1038/s41467-019-13921-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Consortium GT, Laboratory DA. Coordinating Center -Analysis Working G, Statistical Methods groups-Analysis Working G, Enhancing Gg, Fund NIHC, Nih/Nci, Nih/Nhgri, Nih/Nimh, Nih/Nida et al: Genetic effects on gene expression across human tissues. Nature. 2017;550(7675):204–213. doi: 10.1038/nature24277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Grundberg E, Small KS, Hedman AK, Nica AC, Buil A, Keildson S, Bell JT, Yang TP, Meduri E, Barrett A, et al. Mapping cis- and trans-regulatory effects across multiple tissues in twins. Nat Genet. 2012;44(10):1084–1089. doi: 10.1038/ng.2394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jonkers IH, Wijmenga C. Context-specific effects of genetic variants associated with autoimmune disease. Hum Mol Genet. 2017;26(R2):R185–R192. doi: 10.1093/hmg/ddx254. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zhernakova DV, Deelen P, Vermaat M, van Iterson M, van Galen M, Arindrarto W, van 't Hof P, Mei H, van Dijk F, Westra HJ, et al. Identification of context-dependent expression quantitative trait loci in whole blood. Nat Genet. 2017;49(1):139–145. doi: 10.1038/ng.3737. [DOI] [PubMed] [Google Scholar]
- 24.Patel D, Zhang X, Farrell JJ, Chung J, Stein TD, Lunetta KL, Farrer LA. Cell-type-specific expression quantitative trait loci associated with Alzheimer disease in blood and brain tissue. Transl Psychiatry. 2021;11(1):250. doi: 10.1038/s41398-021-01373-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ulrich A, Otero-Nunez P, Wharton J, Swietlik EM, Graf S, Morrell NW, Wang D, Lawrie A, Wilkins MR, Prokopenko I et al: Expression Quantitative Trait Locus Mapping in Pulmonary Arterial Hypertension. Genes (Basel) 2020, 11(11). [DOI] [PMC free article] [PubMed]
- 26.Yoo T, Joo SK, Kim HJ, Kim HY, Sim H, Lee J, Kim HH, Jung S, Lee Y, Jamialahmadi O, et al. Disease-specific eQTL screening reveals an anti-fibrotic effect of AGXT2 in non-alcoholic fatty liver disease. J Hepatol. 2021;75(3):514–523. doi: 10.1016/j.jhep.2021.04.011. [DOI] [PubMed] [Google Scholar]
- 27.Jung S, Liu W, Baek J, Moon JW, Ye BD, Lee HS, Park SH, Yang SK, Han B, Liu J, et al. Expression Quantitative Trait Loci (eQTL) Mapping in Korean Patients With Crohn's Disease and Identification of Potential Causal Genes Through Integration With Disease Associations. Front Genet. 2020;11:486. doi: 10.3389/fgene.2020.00486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Moon S, Kim YJ, Han S, Hwang MY, Shin DM, Park MY, Lu Y, Yoon K, Jang HM, Kim YK, et al. The Korea Biobank Array: Design and Identification of Coding Variants Associated with Blood Biochemical Traits. Sci Rep. 2019;9(1):1382. doi: 10.1038/s41598-018-37832-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Das S, Forer L, Schonherr S, Sidore C, Locke AE, Kwong A, Vrieze SI, Chew EY, Levy S, McGue M, et al. Next-generation genotype imputation service and methods. Nat Genet. 2016;48(10):1284–1287. doi: 10.1038/ng.3656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Loh PR, Danecek P, Palamara PF, Fuchsberger C, Y AR, H KF, Schoenherr S, Forer L, McCarthy S, Abecasis GR et al: Reference-based phasing using the Haplotype Reference Consortium panel. Nat Genet 2016, 48(11):1443–1448. [DOI] [PMC free article] [PubMed]
- 32.Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis GR. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet. 2012;44(8):955–959. doi: 10.1038/ng.2354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood AR, Teumer A, Kang HM, Fuchsberger C, Danecek P, Sharp K, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet. 2016;48(10):1279–1283. doi: 10.1038/ng.3643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019;37(8):907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. Genome Project Data Processing S: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Anders S, Pyl PT, Huber W. HTSeq–a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31(2):166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Frankish A, Diekhans M, Ferreira AM, Johnson R, Jungreis I, Loveland J, Mudge JM, Sisu C, Wright J, Armstrong J, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019;47(D1):D766–D773. doi: 10.1093/nar/gky955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Ongen H, Buil A, Brown AA, Dermitzakis ET, Delaneau O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics. 2016;32(10):1479–1485. doi: 10.1093/bioinformatics/btv722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Stegle O, Parts L, Durbin R, Winn J. A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies. PLoS Comput Biol. 2010;6(5):e1000770. doi: 10.1371/journal.pcbi.1000770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci U S A. 2003;100(16):9440–9445. doi: 10.1073/pnas.1530509100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Consortium GT The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45(6):580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ishigaki K, Kochi Y, Suzuki A, Tsuchida Y, Tsuchiya H, Sumitomo S, Yamaguchi K, Nagafuchi Y, Nakachi S, Kato R, et al. Polygenic burdens on cell-specific pathways underlie the risk of rheumatoid arthritis. Nat Genet. 2017;49(7):1120–1125. doi: 10.1038/ng.3885. [DOI] [PubMed] [Google Scholar]
- 45.Ishigaki K, Akiyama M, Kanai M, Takahashi A, Kawakami E, Sugishita H, Sakaue S, Matoba N, Low SK, Okada Y, et al. Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat Genet. 2020;52(7):669–679. doi: 10.1038/s41588-020-0640-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8(1):1826. doi: 10.1038/s41467-017-01261-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Liu B, Gloudemans MJ, Rao AS, Ingelsson E, Montgomery SB. Abundant associations with gene expression complicate GWAS follow-up. Nat Genet. 2019;51(5):768–769. doi: 10.1038/s41588-019-0404-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Genomes Project C. Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA. A map of human genome variation from population-scale sequencing. Nature. 2010;467(7319):1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, Jansen R, de Geus EJ, Boomsma DI, Wright FA, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48(3):245–252. doi: 10.1038/ng.3506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Cotto KC, Wagner AH, Feng YY, Kiwala S, Coffman AC, Spies G, Wollam A, Spies NC, Griffith OL, Griffith M. DGIdb 3.0: a redesign and expansion of the drug-gene interaction database. Nucleic Acids Res. 2018;46(D1):D1068–D1073. doi: 10.1093/nar/gkx1143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gaulton A, Hersey A, Nowotka M, Bento AP, Chambers J, Mendez D, Mutowo P, Atkinson F, Bellis LJ, Cibrian-Uhalte E, et al. The ChEMBL database in 2017. Nucleic Acids Res. 2017;45(D1):D945–D954. doi: 10.1093/nar/gkw1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, Sajed T, Johnson D, Li C, Sayeeda Z, et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 2018;46(D1):D1074–D1082. doi: 10.1093/nar/gkx1037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Ochoa D, Hercules A, Carmona M, Suveges D, Gonzalez-Uriarte A, Malangone C, Miranda A, Fumis L, Carvalho-Silva D, Spitzer M, et al. Open Targets Platform: supporting systematic drug-target identification and prioritisation. Nucleic Acids Res. 2021;49(D1):D1302–D1310. doi: 10.1093/nar/gkaa1027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Szklarczyk D, Kirsch R, Koutrouli M, Nastou K, Mehryary F, Hachilif R, Gable AL, Fang T, Doncheva NT, Pyysalo S, et al. The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023;51(D1):D638–D646. doi: 10.1093/nar/gkac1000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Pricop L, Redecha P, Teillaud JL, Frey J, Fridman WH, Sautes-Fridman C, Salmon JE. Differential modulation of stimulatory and inhibitory Fc gamma receptors on human monocytes by Th1 and Th2 cytokines. J Immunol. 2001;166(1):531–537. doi: 10.4049/jimmunol.166.1.531. [DOI] [PubMed] [Google Scholar]
- 56.Willart M, Hammad H. Lung dendritic cell-epithelial cell crosstalk in Th2 responses to allergens. Curr Opin Immunol. 2011;23(6):772–777. doi: 10.1016/j.coi.2011.09.008. [DOI] [PubMed] [Google Scholar]
- 57.Martinelli N, Girelli D, Malerba G, Guarini P, Illig T, Trabetti E, Sandri M, Friso S, Pizzolo F, Schaeffer L, et al. FADS genotypes and desaturase activity estimated by the ratio of arachidonic acid to linoleic acid are associated with inflammation and coronary artery disease. Am J Clin Nutr. 2008;88(4):941–949. doi: 10.1093/ajcn/88.4.941. [DOI] [PubMed] [Google Scholar]
- 58.Reese SE, Xu CJ, den Dekker HT, Lee MK, Sikdar S, Ruiz-Arenas C, Merid SK, Rezwan FI, Page CM, Ullemar V, et al. Epigenome-wide meta-analysis of DNA methylation and childhood asthma. J Allergy Clin Immunol. 2019;143(6):2062–2074. doi: 10.1016/j.jaci.2018.11.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Xu P, Chang JC, Zhou X, Wang W, Bamkole M, Wong E, Bettayeb K, Jiang LL, Huang T, Luo W et al: GSAP regulates lipid homeostasis and mitochondrial function associated with Alzheimer's disease. J Exp Med 2021, 218(8). [DOI] [PMC free article] [PubMed]
- 60.Muroyama Y, Saito T. Identification of Nepro, a gene required for the maintenance of neocortex neural progenitor cells downstream of Notch. Development. 2009;136(23):3889–3893. doi: 10.1242/dev.039180. [DOI] [PubMed] [Google Scholar]
- 61.Gehring WJ, Hiromi Y. Homeotic genes and the homeobox. Annu Rev Genet. 1986;20:147–173. doi: 10.1146/annurev.ge.20.120186.001051. [DOI] [PubMed] [Google Scholar]
- 62.Kim J, Abdelmohsen K, Yang X, De S, Grammatikakis I, Noh JH, Gorospe M. LncRNA OIP5-AS1/cyrano sponges RNA-binding protein HuR. Nucleic Acids Res. 2016;44(5):2378–2392. doi: 10.1093/nar/gkw017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Ried JS, Baurecht H, Stuckler F, Krumsiek J, Gieger C, Heinrich J, Kabesch M, Prehn C, Peters A, Rodriguez E, et al. Integrative genetic and metabolite profiling analysis suggests altered phosphatidylcholine metabolism in asthma. Allergy. 2013;68(5):629–636. doi: 10.1111/all.12110. [DOI] [PubMed] [Google Scholar]
- 64.Modena BD, Bleecker ER, Busse WW, Erzurum SC, Gaston BM, Jarjour NN, Meyers DA, Milosevic J, Tedrow JR, Wu W, et al. Gene Expression Correlated with Severe Asthma Characteristics Reveals Heterogeneous Mechanisms of Severe Disease. Am J Respir Crit Care Med. 2017;195(11):1449–1463. doi: 10.1164/rccm.201607-1407OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Inoue H, Hattori T, Zhou X, Etling EB, Modena BD, Trudeau JB, Holguin F, Wenzel SE. Dysfunctional ErbB2, an EGF receptor family member, hinders repair of airway epithelial cells from asthmatic patients. J Allergy Clin Immunol. 2019;143(6):2075–2085 e2010. doi: 10.1016/j.jaci.2018.11.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Truong L, Zheng YM, Kandhi S, Wang YX. Overview on Interactive Role of Inflammation, Reactive Oxygen Species, and Calcium Signaling in Asthma, COPD, and Pulmonary Hypertension. Adv Exp Med Biol. 2021;1304:147–164. doi: 10.1007/978-3-030-68748-9_9. [DOI] [PubMed] [Google Scholar]
- 67.Roffel MP, Boudewijn IM, van Nijnatten JLL, Faiz A, Vermeulen CJ, van Oosterhout AJ, Affleck K, Timens W, Bracke KR, Maes T et al: Identification of asthma-associated microRNAs in bronchial biopsies. Eur Respir J 2022, 59(3). [DOI] [PubMed]
- 68.Fogarty A, Broadfield E, Lewis S, Lawson N, Britton J. Amino acids and asthma: a case-control study. Eur Respir J. 2004;23(4):565–568. doi: 10.1183/09031936.04.00090404. [DOI] [PubMed] [Google Scholar]
- 69.Tang DD, Gerlach BD. The roles and regulation of the actin cytoskeleton, intermediate filaments and microtubules in smooth muscle cell migration. Respir Res. 2017;18(1):54. doi: 10.1186/s12931-017-0544-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets supporting the conclusions of this article are included within the article and its additional files. And the RNA-seq data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession number GSE226825 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE226825).