Skip to main content
Frontiers in Genetics logoLink to Frontiers in Genetics
. 2022 May 18;13:838518. doi: 10.3389/fgene.2022.838518

Dissecting Meta-Analysis in GWAS Era: Bayesian Framework for Gene/Subnetwork-Specific Meta-Analysis

Emile R Chimusa 1,*, Joel Defo  1
PMCID: PMC9159898  PMID: 35664319

Abstract

Over the past decades, advanced high-throughput technologies have continuously contributed to genome-wide association studies (GWASs). GWAS meta-analysis has been increasingly adopted, has cross-ancestry replicability, and has power to illuminate the genetic architecture of complex traits, informing about the reliability of estimation effects and their variability across human ancestries. However, detecting genetic variants that have low disease risk still poses a challenge. Designing a meta-analysis approach that combines the effect of various SNPs within genes or genes within pathways from multiple independent population GWASs may be helpful in identifying associations with small effect sizes and increasing the association power. Here, we proposed ancMETA, a Bayesian graph-based framework, to perform the gene/pathway-specific meta-analysis by combining the effect size of multiple SNPs within genes, and genes within subnetwork/pathways across multiple independent population GWASs to deconvolute the interactions between genes underlying the pathogenesis of complex diseases across human populations. We assessed the proposed framework on simulated datasets, and the results show that the proposed model holds promise for increasing statistical power for meta-analysis of genetic variants underlying the pathogenesis of complex diseases. To illustrate the proposed meta-analysis framework, we leverage seven different European bipolar disorder (BD) cohorts, and we identify variants in the angiotensinogen (AGT) gene to be significantly associated with BD across all 7 studies. We detect a commonly significant BD-specific subnetwork with the ESR1 gene as the main hub of a subnetwork, associated with neurotrophin signaling (p = 4e −14) and myometrial relaxation and contraction (p = 3e −08) pathways. ancMETA provides a new contribution to post-GWAS methodologies and holds promise for comprehensively examining interactions between genes underlying the pathogenesis of genetic diseases and also underlying ethnic differences.

Keywords: gene, meta-analysis, Bayesian, subnetwork, GWAS

1 Introduction

The main goals of trait mapping studies, including genome-wide association studies (GWASs), are to understand the genetic architecture of diseases, pinpoint the number of loci associated with a particular trait, and approximate the underlying heritability rate (Hirschhorn and Daly, 2003; Cantor et al., 2010; Wray et al., 2010; Garfield, 2020). Once the disease-causing variants and genes are identified, this information will help researchers who are working in clinical, medical, or public health fields to establish prevention strategies, predict risks, and adapt therapeutic measurements Hirschhorn and Daly (2003); Cantor et al. (2010); Tam et al. (2019); Nguyen and Eisman (2020); Legge et al. (2021). Regardless of the successes, GWASs are still confronting many challenges and limitations Hao et al., 2019); Zhu et al. (2017) and have received considerable criticism (Zuka et al., 2012; Cantor et al., 2010). The challenges faced by GWASs include 1) the translation of the associated loci into suitable biological hypotheses (Zuka et al., 2012); 2) the issue of missing or hidden heritability (Kang et al., 2010; Hirschhorn and Daly 2003), which has now been partially tackled; 3) the understanding of how multiple modestly associated variants within genes interact to influence a phenotype (Wang et al., 2007; Hao et al., 2019; Tam et al., 2019); 4) the imperfection of asymptotic distribution of the current mixed model association or logistic regression in the specific case of low-frequency variants (Chimusa et al., 2012; Hao et al., 2019); and 5) the inefficiency in distinguishing between inflation from bias (from cryptic relatedness and population stratification) to the true signal from polygenicity (Chimusa et al., 2012; Hao et al., 2019; Tam et al., 2019).

These limitations reflect a gap in our understanding of the mechanisms underlying the pathogenesis of complex traits and diseases. The major source of these shortcomings is the method of GWAS itself as restricted to a single-marker-based testing approach (Wang et al., 2007; Hao et al., 2019; Tam et al., 2019). Various post-GWAS approaches have been proposed to address the single-SNP-based GWAS limitation (Jia et al., 2011; Wu et al., 2009; Wang et al., 2010; Peng et al., 2008), which are different in many aspects, but all are driven by the need to extract useful information from the GWAS summary statistics. The GWAS meta-analysis has become an increasingly adopted method that leverages association summary statistics to fostering a culture of compulsory in silico replication to maintain reliability in genetics association findings Ilya et al. (2017); Duarte et al. (2019); Shen and Tseng (2010); Lu et al. (2018). A meta-analysis framework combines results from different GWAS cohorts and puts them in one analysis framework to recover signals that one single GWAS cohort study mighty be missed and address the between-study and between-population heterogeneity Kavvoura and Ioannidis (2008); Thompson et al. (2011). In the last decade, the use of meta-analysis method has increased due to different interests from both the medical researchers and statisticians Shi and Lee (2016); Fan et al. (2016); Turley et al. (2018). Recently, meta-analysis has shown remarkable discovery results and helped to more understand and validate association results from different studies. The meta-analysis is considered as post-genome-wide association study method Chimusa et al. (2012); Han and Eskin (2011); Lu et al. (2018); Shi and Lee (2016). Despite the instrumental findings from single-SNP-based meta-analysis, there remains a need for a single comprehensive analysis that can both aggregate from the diverse population GWAS and incorporate the effect of multiple markers and other potential factors at a gene or pathway level. Heterogeneity among the GWAS meta-analyses remains an issue, particularly when the number of studies increases Han and Eskin (2011); Kavvoura and Ioannidis (2008); Thompson et al. (2011). This raised challenge on the power of GWAS meta-analysis across diverse population cohorts of differing genetics ancestry. Moreover, critical caution is required since incomplete replication can also be informative as several studies reported lack of interpopulation replicability, indicating that some risk variants are population-specific Hirschhorn and Daly (2005); McCarthy et al. (2008); Newton-Cheh and Hirschhorn (2005). For example, comparing the Asian and European associations with major depression, the failure of replication is largely due to the difference in the partner of linkage disequilibrium (LD), which reduces power in one population since the proportion of attributable risk declines with a population-specific minor allele frequency Newton-Cheh and Hirschhorn (2005). A caveat, however, is that fewer GWASs conducted in the non-European ancestry usually constitute of fewer samples Newton-Cheh and Hirschhorn (2005), raising the question as to how the clinical utility of GWASs can be made equitable across multi-ethnic populations Martin et al. (2021); Torkamani et al. (2018) and, specifically, how to accurately predict health and disease risks in the African populations. Furthermore, variation of the cohort size across independent studies is challenging, especially when these studies have been conducted from distinct populations of different ancestries and patterns of LD Chimusa et al. (2012); Han and Eskin (2011); Kavvoura and Ioannidis (2008); Thompson et al. (2011).

While the factors may raise heterogeneity Chimusa et al. (2012); Han and Eskin (2011); Kavvoura and Ioannidis (2008); Thompson et al. (2011), designing a gene-based and subnetwork/pathway-based meta-analysis may be helpful in pooling information from multiple population GWASs and multiple variants within a gene or genes within pathways or subnetworks Shi and Lee (2016); Fan et al. (2016); Turley et al. (2018); Ilya et al. (2017); Duarte et al. (2019); Shen and Tseng (2010); Lu et al. (2018). This may reveal larger effects and provide valuable information to prioritize the most important results across human populations. We refer to this approach as gene- or subnetwork/pathway-specific meta-analysis. Similarly, the list of new post-GWAS tools, such as multi-marker analyses, which go beyond single SNP tests, or the inclusion of functional evidence to reweight GWAS results, is growing by the day Newton-Cheh and Hirschhorn (2005); Chimusa et al. (2012); Shi and Lee (2016); Fan et al. (2016); Turley et al. (2018); Ilya et al. (2017); Duarte et al. (2019); Shen and Tseng (2010); Lu et al. (2018). Although many methods for meta-analysis have been developed over the past decades, the methodology still faces significant limitations. In particular, the challenge of low statistical power is still unresolved, as demonstrated by the fact that meta-analyses have not necessarily resulted in an increased statistical power Li et al. (2012). This is, in part, due to the analysis methods failing to optimally account for the high degree of between-study heterogeneity that characterizes most meta-analyses of the datasets Shi and Lee (2016); Fan et al. (2016); Turley et al. (2018); Duarte et al. (2019); Lu et al. (2018). Apart from power considerations, another important challenge is the translation of statistical association in meta-analyses into biologically meaningful insights.

Here, we addressed some GWAS meta-analysis limitations by proposing a Bayesian graph-based framework to perform the gene/pathway-specific meta-analysis by combining the effect size of multiple SNPs within genes, and genes within subnetwork/pathways across multiple independent population GWAS to deconvolute the interactions between genes underlying the pathogenesis of complex diseases across diverse populations (Figure 1).

FIGURE 1.

FIGURE 1

Flowchart of ancMETA.

We assessed the proposed framework on simulated datasets, and the results show that the proposed model holds promise for increasing statistical power for meta-analysis of genetic variants underlying the pathogenesis of complex diseases. We illustrated it in 7 different European bipolar disorder (BD) cohorts, and we finally outlined the implications, challenges, and opportunities that cross-ancestry meta-analyses present in the GWAS era. The proposed method has been implemented in the ancMETA tool https://github.com/echimusa/ancMETA, providing a new contribution to post-GWAS methodologies, and holds promise for deconvoluting interactions between genes underlying the pathogenesis of genetic diseases and underlying ethnic differences.

2 Materials and Methods

2.1 Details of Gene/Subnetwork-Specific Meta-Analysis

Here, we discuss the proposed meta-analysis framework, ancMETA. It performs meta-analysis at two different levels by aggregating multiple independent population GWAS summary statistics datasets. It uses an integrative analysis through Bayesian posterior probability and combines the results into known biological protein–protein network datasets. Lastly, ancMETA performs the meta-analysis at the subnetwork level and identifies the most significant subnetworks to understand the biological pathways (Figure 1).

We describe six different steps of the proposed meta-analysis framework as follows:

Step 1: Collection of N-independent studies.

This step requires N ≥ 2 independent GWAS summary statistics datasets, from the same phenotype or trait. GWAS summary statistics are defined here as per-genetic locus effect sizes (log odds ratios) together with their standard errors, p-values, or z-scores for the affected–unaffected traits de Leeuw et al. (2015); Il-Youp and Wei (2015); Huang et al., 2016); Wang et al. (2017).

Step 2: Mapping SNPs to the associated genes.

This is an intermediate step, and it is similar to our previous approach Chimusa et al. (2015), where all the SNPs are mapped to their related genes. It is common practice to assign SNPs to the genes based on a distance cutoff, and the previous studies use a variety of cutoffs, such as distance from 2 to 500 Kbps Conti et al. (2009); Brodie et al. (2016); Chimusa et al. (2015); Wang et al. (2010, 2007). ancMETA allows users to specify the distance cutoffs of assigning SNPs down-/upstream to a specific gene (see the Supplementary Material). At this step, the combined statistical outcomes (i.e., the effect size and the standard error) are computed, as illustrated in Figure 1.

We assume that the ith study (i = 1, 2,…, N) has J i genes Gji(j=1,2,,Ji) , each having a set of specific SNPs, where N is the total number of studies. Let S=i,jSji denote the set of all SNPs, where Sji is the set of SNPs at the jth gene of the study i. For an SNP sSji , let βji(s) , ζji(s) , and pji(s) represent the effect size, the standard error, and the p-value, respectively. The proposed meta-analysis framework assumes that the SNPs related to one gene are correlated. This assumption is due to the fact that each gene contains a large number of SNPs mostly under LD; therefore, the magnitude of the statistical outcomes for each SNP within a gene from each study is considered to be approximately the same. Thus, the effect size and the p-value will follow a normal distribution. Let μji be the unknown true effect size at the jth gene Gji of study i (i = 1, 2,…, N; j = 1, 2,, J i ). Thus, a fixed-effects model can be applied to estimate the combined effect size at the gene level from each study i = 1, 2, … N. We construct a linear function of parameters μji as follows:

x=yβ,μ=sSjiβjisμji.

Let Vji(s) be the variance. Then, the estimation of the combined effect size is given as follows:

px|Bji,μji,Vji=Nx|Bjiμji,VjisSji12Vjisπexpβjisμji22Vjis. (1)

Note Eq. 1 is the likelihood when μ ≠ 0. Although the solution, which corresponds to the maximum true effect size μji at gene Gji , can be approximated from sampling approaches, Eq. 1 can be solved analytically by differentiating the log likelihood and setting it to 0, that is,

logpx|Bji,μji,Vjiμji=0

or

sSjiβjisμjiVjis=0. (2)

Eq. 2 can be written as

sSjiVjis1βjissSjiVjis1μji=0.

Hence, the estimated effect size of gene Gji is given as

μ^ji=sSjiVjis1βjissSjiVjis1.

Therefore, its standard error is derived as

sdμ^ji=sSjiVjis1.

Step 3: Meta-analysis at the gene level.

Let βji and ζji be the combined statistical outcomes (i.e., effect size and standard error, respectively). Since the effect size may be different across the studies, the heterogeneity between the studies may be high. A parameter σj2 , which accounts for between-study variability, is introduced. Because the maximum-likelihood method underestimates the variance when the number of studies is small DerSimonian and Laird (1986), we use the unbiased method of moments Borenstein et al. (2010) to compute the variance components as follows:

σj2=1Ni=1NβjiHj2,

where Hj=1Ni=1Nβji . It is assumed that the true effect size of the gene G j is μj+σj2 , where μ j is an unknown parameter. The same calculations as in step 2 are applied to estimate the effect size of the gene G j across the studies, which is calculated as follows:

μ^j=i=1Nσj2+Vji1βjii=1Nσj2+Vji1,

and its standard error is given as follows:

sdμ^j=i=1Nσj2+Vji1.

If the precision ρ i of the study ith for i = 1, 2,…, N is equal to 1(μi)2 , then the heterogeneity statistic, using Cochrane’s test (Q) is given as follows:

Q=100×iNρiμ^jθ2, (3)

where θ=ΣiNρiμ^jΣiNρi is a weighted estimator, and QχN12 under the null hypothesis that the between-study variance τ 2 = 0. If we consider the expectation of Eq. 3, the between-study variance is given as

τ2=QN1ΣiNρiΣiNρi2ΣiNρi.

Since the test statistics in Eq. 3 has N − 1 degree of freedom (df), then for Q < df (which means τ 2 < 0), the maximum between 0 and τ 2 is considered, so that τ 2 is non-negative. Recalling that in step 2, we pointed out that SNPs within one gene are correlated. Therefore, to perform meta-analysis, a fixed-effects model is used Borenstein et al. (2010).

Step 4: Mapping genes to biological networks.

For this step, all the genes and their related statistical outcomes and gene–gene combined LD are mapped into a PPI network database (Supplementary Material), which contains information on interactions among the genes. Therefore, the genes are considered as weighted nodes with their related statistical outcomes (i.e., effect size and standard error), and the edges represent the interactions between the genes (nodes), which are weighted with the combined LD, for each pair-wise genes (nodes) that have a link. More details can be found in Supplementary Materials. The details of the combined LD can be found in Supplementary Text S1.

Step 5: Meta-analysis at the subnetwork level. This step estimates the effect size and standard error of each subnetwork from . Supplementary Text S1 provides more details of different concepts used in . The procedure is the same as that of step 3, where it is assumed that there is a heterogeneity between the genes in each subnetwork. Let ρj2 be the between-gene heterogeneity, βji be the observed effect size, and ζji be its standard error. If μ is the unknown true effect size at the subnetwork level with K genes, the estimated effect size is given by p(x|βji,μji,Vji+ρj2)

GjiK12Vji+ρj2πeβjiμji22Vji+ρj2. (4)

Since Eq. 4 is considered to be the likelihood when μ ≠ 0, maximizing the log likelihood by solving

logpt|βji,μ,Vji+ρj2μji=0

yields

μ^ji=GjiKρj2+Vji1βjGjiKρj2+Vji1, (5)

and its standard error estimated effect size is given as

sdμ^ji=GjiKρj2+Vji1.

Step 6: Computing the overall statistical significance.

From step 5, we know that βji and ζji represent the observed effect size and the standard error, respectively, where μ^ji as the unknown effect was estimated using Equation 5. For this step, let us consider the Bayes’ theorem, which includes all the possible effect outcomes for N studies, and let us apply the symmetrical property of the Gaussian distribution. Therefore, the posterior probability is approximately given as

e^ji=πRNβji|μ^ji,ζjipμ^jidμ^ji1πNβji|0,ζji+πRNβji|μ^ji,ζjipμ^jidμ^ji=πNβji|X^ji,ζji+V^ji1πNμji|0,ζji+πNμji|X^ji,ζji+V^ji. (6)

However, a test statistic based on the weighted sum of chi-square can be used, under the null hypothesis that there is no association. Thus, from the fact that eji contains the information from all studies, it can be used as a prior weight in the weighted sum of the chi-square statistics as follows:

χ^j=i=1Ne^jie^jiμ^j2i=1Nζji+τ2,τ20,

where τ 2 is the between-study variance. Recalling that Equation 6 is not normally distributed, the p-values can be evaluated through a sampling method developed by Han and Eskin (2011). puts together all the steps of the proposed framework.

graphic file with name fgene-13-838518-fx1.jpg
graphic file with name fgene-13-838518-fx2.jpg

2.2 Evaluation of ancMETA From Simulation GWAS Data

The U.S. residents of northern and western European ancestry (CEU), Yoruba (YRI), and Mexican (MEX) populations from the HapMap3 Project were used to generate independent case–control studies. Details of these populations can be found in Supplementary Table S1. We independently performed a population growth model on each dataset mentioned previously (Supplementary Text S2). From the resulting expanded datasets, we generated three independent case–control datasets based on chromosomes 1 and 22 using HapGen2 Su et al. (2011). Here, we randomly selected 3 SNPs on chromosome 1 and 3 other SNPs on chromosome 22 to be simulated as causal disease SNPs with differing effect sizes under HapGen2 Su et al. (2011). The simulated disease effect size parameters of those SNPs are summarized in Supplementary Table S2. These parameters are chosen to fit small effect size in some studies and strong in others. We simulated 1,000, 3,000, and 950 cases and 1,000, 3,000, and 1,000 controls from the haplotype (combination of multiple SNPs) data of CEU, YRI, and MEX, respectively. We considered the simulated MEX GWAS dataset as our primary study. The simulation details can be found in Supplementary Text S2.

We conducted a GWAS on each dataset using EMMAX Kang et al. (2010). As expected and according to our simulation parameters (Supplementary Table S2), the GWAS results indicate significant, moderate, and weak signals of association in CEU, YRI, and MEX, respectively, as per simulation (Supplementary Table S3 and Supplementary Figure S1). We used the resulting GWAS summary statistics as input in our proposed model, implemented in “ancMETA,” to perform the gene- and subnetwork-specific meta-analysis (Supplementary Text S3).

At the gene level, the results in Table 1 and Supplementary Figure S2 show significant genes, including CBX7, LD0C1L, and ASTN1 associated with our simulated causal SNPs (Supplementary Table S2). This result indicates further increase in the effect size across these 3 simulated case–control studies, compared to single-SNP tests in GWAS based on EMMAX (Supplementary Table S3). Interestingly, at a subnetwork level, we observed a significant convergence of effect sizes of the simulated disease variants across all the studies (Figures 2A,B), particularly to CBX7 (Table 1). This supports the fact that the true effect risk-associated variants may differ across populations at the polymorphism level, but the effect may tend to convert at a similar magnitude at gene and pathway levels or overlap in the same biological subnetworks/pathways when aggregating several small polymorphism effects.

TABLE 1.

Meta-analysis at gene and subnetwork levels from the simulated GWAS summary statistics across 3 studies: results show convergence of effects across studies to the simulated causal variants.

Study p-values Study M-values
Gene #Study P Q Tau square MEX YRI CEU MEX YRI CEU
LDOC1L 3 1.33e-10 2.055 0.0007 0.012 0.00019 0.0003 0.98 0.91 0.81
PRKCZ 3 1.77e-11 2.193 0.001 0.001 0.047 0.001 0.40 0.41 0.41
CBX7 3 0.00000001 1.825 0 0.0001 0.001 0.0001 0.90 0.91 0.90
ASTN1 3 2.006e-8 2.261 0.001 0.0012 0.001 0.0006 0.81 0.91 0.91
Hub #Study P Q Tau Square MEX YRI CEU MEX YRI CEU
CBX7 3 4.25e-7 2.1 0.0002 0.0003 0.0004 0.0001 0.98 0.96 1.0
PRKCZ 3 1.93e-7 2.2 0.0002 0.019 0.04 0.021 0.59371 0.59 0.59
HNRNPA1 3 3e-08 2.09 0.00008 0.09 0.048 0.03 0.59 0.59 0.59
KRT18 3 4e-07 1.90 0 0.049 0.09 0.005 0.59 0.59 0.59

FIGURE 2.

FIGURE 2

Combined statistical outcomes at the subnetwork level, explaining causality relationship between the simulated phenotype and population variation population based on 3 independent simulated GWAS datasets from CEU, YRI, and MEX, respectively. (A) Top significant meta-analysis-based sub-network from ancMETA where nodes in blue colour denote critical hub and genes associated to the simulated causal SNPs. The size of a node denotes its statistical significance from small to large. (B-D) Forest plot of three gene-hub from the top 3 meta-analysis sub-networks produced from ancMETA.

2.3 Application to Seven European With Bipolar Disorder Cohorts

Here, we used seven European bipolar disorder (BD) GWAS summary statistics obtained from the NIMH data repository (Supplementary Table S1). These datasets include Irish (IRI), Scottish (SCT), 3 European American (EUA), Norwegian (NOR), and British (BRB) individuals. Supplementary Table S4 provides GWAS summary results of each of these studies. Leveraging the European reference panel from 1000 Genomes Project data, we first conducted the imputation LD fine-mapping from the obtained GWAS summary statistics using ImpG-Summary Pasaniuc et al. (1977) to possibly unravel more LD-SNP association (Supplementary Figure S3). The resulting imputation GWAS LD fine-mapping summary statistics were used as input for ancMETA. We performed the meta-analysis at a gene level across these 7 studies, and seven genes were identified (AGT CACNA1C, ESR1, NCAN, BDNF, BCR, and GSK3B), of which AGT has a strong effect across the European populations including Irish, Scottish, Norwegian, and British in contrast to the European American (Supplementary Figures S4,5 and Supplementary Table S5).

Leveraging the recent version of the human PPI network (IntAct release 239) from the IntAct database Kerrien et al. (2012); Orchard et al. (2014), we observed a significant connected subnetwork where the estrogen receptor 1 (ESR1) gene is the main hub. ESR1 interacts with AGT and is connected with the other gene hubs with known BD-associated genes such as CACNA1C, PLCG2, NCAN, BCR, and BDNF. Supplementary Table S5 summarizes the association of the top subnetwork across these 7 studies. The identified subnetwork (Figures 3A,B) is significantly associated with the neurotrophin signaling (p = 4e −14) and other interesting biological pathways such as myometrial relaxation and contraction (p = 3e −08), morphine addiction (p = 9.2e −08), DNA damage response (p = 3e −05), and alcoholism (p = 2e −05). In addition, the subnetwork (Figure 3C) is implicated in a positive regulation of transcription from RNA polymerase II promoter involved in neuronal differentiation (p = 7.7e −21) and is also associated with autosomal dominant (p = 2.8e −07) inheritance types of diseases (Figure 3C).

FIGURE 3.

FIGURE 3

Combined statistical outcomes at the subnetwork level, explaining causality relationship between the simulated phenotype and population variation based on 7 independent GWAS datasets from European bipolar cohorts. (A) The top significant meta-analysis-based sub-network where the size of a node denotes its statistical significance from small to large and large sized node are associated genes or genes interacting with known Bipolar genes. (B) Forest plot of the gene-hub from the top meta-analysis sub-networks in (A, C) Enrichment analysi based on pathways, biological process and human phenotype associated to the top significant meta-analysis-based sub-network in (A).

To evaluate the determined interconnectivity among AGT, CACNA1C, ESR1, NCAN, BDNF, BCR, and GSK3B as well as with the other protein-coding genes in Figure 3, and their association with BD, we leverage GeneMania Franz et al. (2018) to reconstruct the network shown in Figure 3 based on physical and co-expressed interactions. The result from the reconstructed co-expression and physical interaction network of genes in Figure 3A confirms that the above 7 genes detected by ancMETA (Figure 3A) are physically and co-expressly interconnected Figure 4 and as well as with other protein-coding genes in Figure 3A. From the enrichment analysis based on Enrichr Kuleshov et al. (2016), the network in Figure 4 is significantly associated with neurotrophin signaling and other interesting biological pathways such as ErbB signaling pathways (Figure 4). Interestingly, recent studies have revealed that complex ErbB signaling networks regulate the assembly of neural circuitry, myelination, neurotransmission, and synaptic plasticity Mei and Nave (2014). Evidence indicates that there is an optimal level of ErbB signaling in the brain, and a deviation from it impairs brain functions. The ErbB signaling pathway may provide therapeutic targets for specific neuropsychiatric symptoms, and dysregulation in the ErbB signaling pathway may explain abnormalities of neural precursor migration in BD Mei and Nave (2014).

FIGURE 4.

FIGURE 4

Network reconstructed from genes in Figure 3, based on physical and co-expressed interactions obtained GeneMania Franz et al. (2018) and the top pathways of the reconstructed network from Figure 3 obtained from various pathways databases in Enrichr Kuleshov et al. (2016).

3 Discussion and Conclusion

Designing a post-GWAS meta-analysis that leverages the combined effect of multiple SNPs within a gene or genes within subnetworks/pathways across multiple GWAS datasets may reveal consensus association signals and identify large effect sizes. This may further provide valuable information in prioritizing the most important results across different populations. Here, we proposed a Bayesian graph-based gene- and pathway-specific meta-analysis approach. We implemented the proposed model in ancMETA (Figure 1), which addresses the variation in effect size across several independent GWAS summary statistics from distinct populations of different ancestry background. We assessed ancMETA through the simulation of different ancestries in three different GWAS studies.

A well-reconstructed human protein–protein interaction network is a powerful tool in network biology and medicine research, which forms the basis for multi-omics and dynamic analyses Mortezaei and Tavallaei (2020). However, the topology of the network and their connectivity may be very sensible from various pathway analysis methods Mazandu and Mulder (2011) in reflecting the relationship between certain biological processes or densely connected multi-protein complexes of biological relevance Franz et al. (2018); Kuleshov et al. (2016). This makes it challenging to compare the different post-GWAS pathway-based or network-based methods Jia et al. (2011); Chimusa et al. (2015); Wang et al. (2015); Mishra and Macgregor (2015); Huang et al., 2016; Wang et al. (2017); de Leeuw et al. (2015).

In addition to the GWAS summary statistics, ancMETA allows users to use any weighted biological network and accepts a user-defined network. In case if users provide an unweighted biological network, ancMETA leverages case–control genotype datasets to construct the weights of the network (Supplementary Text S1), in which the current post-GWAS approaches Fan et al. (2016); Turley et al. (2018); Ilya et al. (2017); Duarte et al. (2019); Shen and Tseng (2010); Lu et al. (2018); Mishra and Macgregor (2015); Huang et al., 2016; Wang et al. (2017); de Leeuw et al. (2015); Jia et al. (2011); Wang et al. (2015) do not account for. Evidence shows that many disorders are “polygenic” (many genetic loci contribute to risk) and reflect disruptions in proteins that participate and involve complex interactions between genes Dudbridge (2016); Khera et al. (2018). In contrast to other tools Shi and Lee (2016); Fan et al. (2016); Turley et al. (2018); Ilya et al. (2017); Duarte et al. (2019); Shen and Tseng (2010); Lu et al. (2018), ancMETA leverages the advantage of topological properties of biological networks to ascertain the interaction of proteins/genes that can be involved in a pathway. Our method accounts for the correlation that exists between the SNPs within a gene or genes within pathways and introduces flexibility in estimating the gene-specific and subnetwork-specific effect size, which, to our knowledge, is a new contribution to post-GWAS methodologies. The proposed framework holds promise for comprehensively examining the interactions between genes underlying the pathogenesis of genetic diseases.

Some improvements need to be considered in future work, such as accurately modeling the convergence of the SNP signal to the related subnetworks and leveraging the weakly/moderately associated signals from different GWAS studies. It is worth mentioning that we have applied ancMETA on old BD GWAS with a limited sample size (see Supplementary Table S5). However, regardless of the trait or phenotype used by the user in using ancMETA, the validity of the outcome will benefit and be improved by employing the powered GWAS summary statistics from GWAS datasets associated with larger numbers of samples (cases/controls).

There is also a need to integrate summary-level data across multiple phenotypes to simultaneously capture the evidence of the aggregate-level pleiotropic association. The lack of accurate knowledge of complex traits and the sensitivity of human protein interaction network makes it challenging to directly compare the results from the different pathway analysis methods. Overall, the method implemented in the proposed framework highlights the value of identifying the effect size of pathways associated with a disease, which may be useful in understanding the pathogenesis, disease risk prediction, and susceptibility to genetic diseases.

The results obtained from the GWAS summary statistics on the European BD cohorts found association with seven genes, of which CACNA1C and NCAN have previously been implicated in BD through GWAS Bellantuono et al. (2007); Ferreira et al. (2008); Gordovez and McMahon (2020); O’Connell and Coombes (2021). This meta-analysis therefore strengthens these initial results and demonstrates that ancMETA can successfully validate GWAS findings. Interestingly, AGT has not reached GWAS significance in the previous studies despite being considered as a candidate in BD due to its role in the renin–angiotensin system Ferreira et al. (2008); Gordovez and McMahon (2020); O’Connell and Coombes (2021). The high significance for AGT from this analysis (p = 3.2e −19) therefore strengthens its association with BD and further highlights the potential impact of ancMETA as a useful tool in discovering additional small effect variants that may be missed in the single GWAS. Importantly, the very interesting AGT finding is due to the innovations made in addressing population variation and to the inclusion of a PPI framework. The results from the subnetwork analysis revealed a strong interaction of the hub gene ESR1 with AGT, as well as CACNA1C, NCAN, BCR, GSK3B, and BDNF, suggesting the fact that the AGT finding here is not merely indicative of a false positive, but rather that is valid. In addition to a moderate link to BD, ESR1 has been related to migraine onset, alcohol dependence, obsessive compulsive disorder, and postpartum depression Alonso et al. (2011). The pleiotropy and genetic overlap between BD and these and other psychiatric phenotypes is a considerable complex Carmiol et al. (2014), suggesting that this analysis may have identified a key hub network and genetic underpinnings in not only BD etiology but also across several psychiatric phenotypes.

Acknowledgments

We thank all study participants for the dataset used from NIMH data repository https://data-archive.nimh.nih.gov. Computations were performed using facilities provided by the University of Cape Town’s and Cape Town Centre for High Performance Computing team (https://www.chpc.ac.za/) and technical supports from Omics Data Solutions (https://www.omicsdatasolutions.com). Some of the authors are funded in part by the National Institutes of Health Common Fund under grant number U24HG006941. The content of this publication is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, Further inquiries can be directed to the corresponding author.

Ethics Statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

EC and JD contribute in editing and writing the manuscript. EC designed, implemented and wrote the script of the methods and the python code of ancMETA.

Funding

The authors are supported in part by National Institutes of Health Common Fund under grant number 1U2RTW012131-01 (COBIP), and National Research Foundation of South Africa for funding (NRF) [Grant RA171111285157/119056]. Responsibility for the information and views expressed in the article lies entirely with the authors.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.838518/full#supplementary-material

References

  1. Alonso P., Gratacòs M., Segalàs C., Escaramís G., Real E., Bayés M., et al. (2011). Variants in Estrogen Receptor Alpha Gene Are Associated with Phenotypical Expression of Obsessive-Compulsive Disorder. Psychoneuroendocrinology 36, 473–483. 10.1016/j.psyneuen.2010.07.022 [DOI] [PubMed] [Google Scholar]
  2. Bellantuono C., Barraco A., Rossi A., Goetz I. (2007). The Management of Bipolar Mania: a National Survey of Baseline Data from the Emblem Study in italy. BMC Psychiatry 7, 33. 10.1186/1471-244X-7-33 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Borenstein M., Hedges L. V., Higgins J. P. T., Rothstein H. R. (2010). A Basic Introduction to Fixed-Effect and Random-Effects Models for Meta-Analysis. Res. Synth. Method 1 (2), 97–111. 10.1002/jrsm.12 [DOI] [PubMed] [Google Scholar]
  4. Brodie A., Azaria J. R., Ofran Y. (2016). How Far from the Snp May the Causative Genes Be? Nucleic Acids Res. 44 (13), 6046–6054. 10.1093/nar/gkw500 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cantor R. M., Lange K., Sinsheimer J. S. (2010). Prioritizing GWAS Results: A Review of Statistical Methods and Recommendations for Their Application. Am. J. Hum. Genet. 86, 6–22. 10.1016/j.ajhg.2009.11.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Carmiol N., Peralta J. M., Almasy L., Contreras J., Pacheco A., Escamilla M. A., et al. (2014). Shared Genetic Factors Influence Risk for Bipolar Disorder and Alcohol Use Disorders. Eur. Psychiatr. 29 (5), 282–287. 10.1016/j.eurpsy.2013.10.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chimusa E. R., Mbiyavanga M., Mazandu G. K., Mulder N. J. (2015). Ancgwas: a post Genome-wide Association Study Method for Interaction, Pathway and Ancestry Analysis in Homogeneous and Admixed Populations. Bioinformatics 32 (4), 549–556. 10.1093/bioinformatics/btv619 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chimusa E. R., Zaitlen N., Daya M., Moller M., van Helden P. D., Mulder N. J., et al. (2014). Genome-wide Association Study of Ancestry-specific Tb Risk in the South African Coloured Population. Hum. Mol. Genet. 23, 796–809. 10.1093/hmg/ddt462 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Conti D., Lewinger J., Tyndale R., Benowitz N., Swan G., Thomas P. (2009). “Using Ontologies in Hierarchical Modeling of Genes and Exposure in Biological Pathways,”. Phenotypes, Endophenotypes, and Genetic Studies of Nicotine Dependence. Editor Swan G. E., 22. in press: NCI Monograph . [Google Scholar]
  10. de Leeuw C. A., Mooij J. M., Heskes T., Posthuma D. (2015). Magma: Generalized Gene-Set Analysis of Gwas Data. Plos Comput. Biol. 11 (4), e1004219. 10.1371/journal.pcbi.1004219 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. DerSimonian R., Laird N. (1986). Meta-analysis in Clinical Trials. Controlled Clin. Trials 7, 177–188. 10.1016/0197-2456(86)90046-2 [DOI] [PubMed] [Google Scholar]
  12. Duarte D. A. S., Newbold C. J., Detmann E., Silva F. F., Freitas P. H. F., Veroneze R., et al. (2019). Genome‐wide Association Studies Pathway‐based Meta‐analysis for Residual Feed Intake in Beef Cattle. Anim. Genet. 50, 150–153. 10.1111/age.12761 [DOI] [PubMed] [Google Scholar]
  13. Dudbridge F. (2016). Polygenic Epidemiology. Genet. Epidemiol. 40 (4), 268–272. 10.1002/gepi.21966 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Fan R., Wang Y., Chiu C.-y., Chen W., Ren H., Li Y., et al. (2016). Meta-analysis of Complex Diseases at Gene Level with Generalized Functional Linear Models. Genetics 202 (2), 457–470. 10.1534/genetics.115.180869 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Ferreira M. A. R., O’Donovan M., O’Donovan M. C., Meng Y. A., Jones I. R., Ruderfer D. M., et al. (2008). Collaborative Genome-wide Association Analysis Supports a Role for Ank3 and Cacna1c in Bipolar Disorder. Nat. Genet. 40, 1056–1058. 10.1038/ng.209 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Franz M., Rodriguez H., Lopes C., Zuberi K., Montojo J., Bader G. D., et al. (2018). Genemania Update 2018. Nucleic Acids Res. 46 (W1), W60–W64. 10.1093/nar/gky311 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Garfield V. (2021). Sleep Duration: A Review of Genome-wide Association Studies (Gwas) in Adults from 2007 to 2020. Sleep Med. Rev. 56, 101413. 10.1016/j.smrv.2020.101413 [DOI] [PubMed] [Google Scholar]
  18. Gordovez F. J. A., McMahon F. J. (2020). The Genetics of Bipolar Disorder. Mol. Psychiatry 25 (3), 544–559. 10.1038/s41380-019-0634-7 [DOI] [PubMed] [Google Scholar]
  19. Han B., Eskin E. (2011). Random-effects Model Aimed at Discovering Associations in Meta-Analysis of Genome-wide Association Studies. Am. J. Hum. Genet. 88, 586–598. 10.1016/j.ajhg.2011.04.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Hao Z., Jiang L., Gao J., Ye J., Zhao J., Li S., et al. (2019). Quick Approximation of Threshold Values for Genome-wide Association Studies. Brief Bioinform 20 (6), 2217–2223. 10.1093/bib/bby082 [DOI] [PubMed] [Google Scholar]
  21. Hirschhorn J. N., Daly M. J. (2003). Genome-wide Association Studies for Common Diseases and Complex Traits. Nat. Rev. Genet. 6 (2), 95–108. 10.1038/nrg1521 [DOI] [PubMed] [Google Scholar]
  22. Hirschhorn J. N., Daly M. J. (2005). Genome-wide Association Studies for Common Diseases and Complex Traits. Nat. Rev. Genet. 6, 95–108. 10.1038/nrg1521 [DOI] [PubMed] [Google Scholar]
  23. Huang J., Wang K., Wei P., Liu X., Liu X., Tan K., et al. (2016). Flags: A Flexible and Adaptive Association Test for Gene Sets Using Summary Statistics. Genetics 202 (3), 919–929. 10.1534/genetics.115.185009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Il-Youp K., Wei P. (2015). Adaptive Gene- and Pathway-Trait Association Testing with GWAS Summary Statistics. Bioinformatics 32, 1178–1184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Ilya Y., Arbeev K., Yashin A. (2017). Rqt: an R Package for Gene-Level Meta-Analysis. Bioinformatics 33 (19), 3129–3130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Jia P., Zheng S., Long J., Zheng W., Zhao Z. (2011). Dmgwas: Dense Module Searching for Genome-wide Association Studies in Protein-Protein Interaction Networks. Bioinformatics 27 (1), 95–102. 10.1093/bioinformatics/btq615 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kang H. M., Sul J. H., Service S. K., Zaitlen N. A., Kong S.-y., Freimer N. B., et al. (2010). Variance Component Model to Account for Sample Structure in Genome-wide Association Studies. Nat. Genet. 42, 348–354. 10.1038/ng.548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kavvoura F. K., Ioannidis J. P. A. (2008). Methods for Meta-Analysis in Genetic Association Studies: a Review of Their Potential and Pitfalls. Hum. Genet. 123, 1–14. 10.1007/s00439-007-0445-9 [DOI] [PubMed] [Google Scholar]
  29. Kerrien S., Aranda B., Breuza L., Bridge A., Broackes-Carter F., Chen C., et al. (2012). The Intact Molecular Interaction Database in 2012. Nucleic Acids Res. 40 (Database Issue), D841–D846. 10.1093/nar/gkr1088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Khera A. V., Chaffin M., Aragam K. G., Haas M. E., Roselli C., Choi S. H., et al. (2018). Genome-wide Polygenic Scores for Common Diseases Identify Individuals with Risk Equivalent to Monogenic Mutations. Nat. Genet. 50, 1219–1224. 10.1038/s41588-018-0183-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kuleshov M. V., Jones M. R., Rouillard A. D., Fernandez N. F., Duan Q., Wang Z., et al. (2016). Enrichr: a Comprehensive Gene Set Enrichment Analysis Web Server 2016 Update. Nucleic Acids Res. 44 (W1), W90–W97. 10.1093/nar/gkw377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Legge S. E., Santoro M. L., Periyasamy S., Okewole A., Arsalan A., Kowalec K. (2021). Genetic Architecture of Schizophrenia: a Review of Major Advancements. Psychol. Med. 51 (13), 2168–2177. 10.1017/s0033291720005334 [DOI] [PubMed] [Google Scholar]
  33. Li J., Guo Y.-f., Pei Y., Deng H.-W. (2012). The Impact of Imputation on Meta-Analysis of Genome-wide Association Studies. PLoS ONE 7 (4), e34486. 10.1371/journal.pone.0034486 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lu W., Wang X., Zhan X., Gazdar A. (2018). Meta-analysis Approaches to Combine Multiple Gene Set Enrichment Studies. Stat. Med. 37 (4), 659–672. 10.1002/sim.7540 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Martin A. R., Kanai M., Kamatani Y., Okada Y., Neale B. M., Daly M. J. (2021). Clinical Use of Current Polygenic Risk Scores May Exacerbate Health Disparities. Nat. Genet. 3251 (44), 584–591. 10.1038/s41588-019-0379-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Mazandu G. K., Mulder N. J. (2011). Generation and Analysis of Large-Scale Data-Driven mycobacterium Tuberculosis Functional Networks for Drug Target Identification. Adv. Bioinformatics 2011, 801478. 10.1155/2011/801478 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. McCarthy M. I., Abecasis G. R., Cardon L. R., Goldstein D. B., Little J., Ioannidis J. P. A., et al. (2008). Genome-wide Association Studies for Complex Traits: Consensus, Uncertainty and Challenges. Nat. Rev. Genet. 9, 356–369. 10.1038/nrg2344 [DOI] [PubMed] [Google Scholar]
  38. Mei L., Nave K.-A. (2014). Neuregulin-erbb Signaling in the Nervous System and Neuropsychiatric Diseases. Neuron 83 (1), 27–49. 10.1016/j.neuron.2014.06.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Mishra A., Macgregor S. (2015). Vegas2: Software for More Flexible Gene-Based Testing. Twin Res. Hum. Genet. 18 (1), 86–91. 10.1017/thg.2014.79 [DOI] [PubMed] [Google Scholar]
  40. Mortezaei Z., Tavallaei M. (2020). Recent Innovations and In-Depth Aspects of post-genome Wide Association Study (post-gwas) to Understand the Genetic Basis of Complex Phenotypes. Heredity 127, 485–497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Newton-Cheh C., Hirschhorn J. N. (2005). Genetic Association Studies of Complex Traits: Design and Analysis Issues. Mutat. Research/Fundamental Mol. Mech. Mutagenesis 573, 54–69. 10.1016/j.mrfmmm.2005.01.006 [DOI] [PubMed] [Google Scholar]
  42. Nguyen T. V., Eisman J. A. (2020). Post-gwas Polygenic Risk Score: Utility and Challenges. JBMR Plus 4 (11), :e10411. 10.1002/jbm4.10411 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. O’Connell K. S., Coombes B. J. (2021). Genetic Contributions to Bipolar Disorder: Current Status and Future Directions. Psychol. Med. 51 (13), 2156–2167. 10.1017/S0033291721001252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Orchard S., Ammari M., Aranda B., Breuza L., Briganti L., Broackes-Carter F., et al. (2014). The MIntAct Project-IntAct as a Common Curation Platform for 11 Molecular Interaction Databases. Nucl. Acids Res. 42, D358–D363. 10.1093/nar/gkt1115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Pasaniuc B., Zaitlen N., Shi H., Bhatia G., Gusev A., Pickrell J., et al. (2014). Fast and Accurate Imputation of Summary Statistics Enhances Evidence of Functional Enrichment. Bioinformatics 30 (20), 2906–2914. 10.1093/bioinformatics/btu416 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Peng G., Luo L., Siu H., Zhu Y., Hu P., Hong S., et al. (2008). Gene and Pathway-Based Second-Wave Analysis of Genome-wide Association Studies. Eur. J. Hum. Genet. 18, 111–117. 10.1038/ejhg.2009.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Shen K., Tseng G. C. (2010). Meta-analysis for Pathway Enrichment Analysis when Combining Multiple Genomic Studies. Bioinformatics 26, 1316–1323. 10.1093/bioinformatics/btq148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Shi J., Lee S. (2016). A Novel Random Effect Model for Gwas Meta-Analysis and its Application to Trans-ethnic Meta-Analysis. Biom 72 (3), 945–954. 10.1111/biom.12481 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Su Z., Marchini J., Donnelly P. (2011). Hapgen2: Simulation of Multiple Disease Snps. Bioinformatics 27 (16), 2304–2305. 10.1093/bioinformatics/btr341 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Tam V., Patel N., Turcotte M., Bossé Y., Paré G., Meyre D. (2019). Benefits and Limitations of Genome-wide Association Studies. Nat. Rev. Genet. 20 (8), 467–484. 10.1038/s41576-019-0127-1 [DOI] [PubMed] [Google Scholar]
  51. Thompson J. R., Attia J., Minelli C. (2011). The Meta-Analysis of Genome-wide Association Studies. Brief. Bioinformatics 12, 259–269. 10.1093/bib/bbr020 [DOI] [PubMed] [Google Scholar]
  52. Torkamani A., Wineinger N. E., Topol E. J. (2018). The Personal and Clinical Utility of Polygenic Risk Scores. Nat. Rev. Genet. 19 (9), 581–590. 10.1038/s41576-018-0018-x [DOI] [PubMed] [Google Scholar]
  53. Turley P., Walters R., Walters R. K., Maghzian O., Okbay A., Lee J. J., et al. (2018). Multi-trait Analysis of Genome-wide Association Summary Statistics Using Mtag. Nat. Genet. 50 (2), 229–237. 10.1038/s41588-017-0009-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Wang K., Li M., Bucan M. (2007). Pathway-based Approaches for Analysis of Genomewide Association Studies. Am. J. Hum. Genet. 81 (6), 1278–1283. 10.1086/522374 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Wang K., Li M., Hakonarson H. (2010). Analysing Biological Pathways in Genome-wide Association Studies. Nat. Rev. Genet. 11, 843–854. 10.1038/nrg2884 [DOI] [PubMed] [Google Scholar]
  56. Wang M., Huang J., Liu Y., Ma L., Potash J. B., Han S. (2017). Combat: A Combined Association Test for Genes Using Summary Statistics. Genetics 207 (3), 883–891. 10.1534/genetics.117.300257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Wang Q., Yu H., Zhao Z., Jia P. (2015). EW_dmGWAS: Edge-Weighted Dense Module Search for Genome-wide Association Studies and Gene Expression Profiles. Bioinformatics 31 (15), 2591–2594. 10.1093/bioinformatics/btv150 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Wray N. R., Pergadia M. L., Blackwood D. H. R., Penninx B. W. J. H., Gordon S. D., Nyholt D. R., et al. (2010). Genome-wide Association Study of Major Depressive Disorder: New Results, Meta-Analysis, and Lessons Learned. Mol. Psychiatry 17, 36–48. 10.1038/mp.2010.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Wu J., Vallenius T., Ovaska K., Westermarck J., Mäkelä T. P., Hautaniemi S. (2009). Integrated Network Analysis Platform for Protein-Protein Interactions. Nat. Methods 6, 75–77. 10.1038/nmeth.1282 [DOI] [PubMed] [Google Scholar]
  60. Zhu Y., Tazearslan C., Suh Y. (2017). Challenges and Progress in Interpretation of Non-coding Genetic Variants Associated with Human Disease. Exp. Biol. Med. (Maywood) 242 (13), 1325–1334. 10.1177/1535370217713750 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Zuk O., Hechter E., Sunyaev S. R., Lander E. S. (2012). The Mystery of Missing Heritability: Genetic Interactions Create Phantom Heritability. Proc. Natl. Acad. Sci. U.S.A. 109, 1193–1198. 10.1073/pnas.1119675109 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, Further inquiries can be directed to the corresponding author.


Articles from Frontiers in Genetics are provided here courtesy of Frontiers Media SA

RESOURCES