Evaluating statistical significance in a meta-analysis by using numerical integration

Yin-Chun Lin; Yu-Jen Liang; Hsin-Chou Yang

doi:10.1016/j.csbj.2022.06.055

. 2022 Jul 4;20:3615–3620. doi: 10.1016/j.csbj.2022.06.055

Evaluating statistical significance in a meta-analysis by using numerical integration

Yin-Chun Lin ^a, Yu-Jen Liang ^a, Hsin-Chou Yang ^a,^b,^c,^d,^⁎

PMCID: PMC9283883 PMID: 35860413

Graphical abstract

Keywords: Meta-analysis, P-value combination, Fisher’s method, Permutation, Decorrelation, Genome-wide association study, Transcriptome-wise association study

Abbreviations: GWAS, Genome-Wide Association Study; MHC, Major Histocompatibility Commplex; NARAC, North American Rheumatoid Arthritis Consortium; SNP, Single Nucleotide Polymorphism; TWAS, Transcriptome-Wide Association Study; WTCCC, Wellcome Trust Case Control Consortium

Abstract

Meta-analysis is a method for enhancing statistical power through the integration of information from multiple studies. Various methods for integrating p-values (i.e., statistical significance), including Fisher’s method under an independence assumption, the permutation method, and the decorrelation method, have been broadly used in bioinformatics and computational biotechnology studies. However, these methods have limitations related to statistical assumption, computing efficiency, and accuracy of statistical significance estimation. In this study, we proposed a numerical integration method and examined its theoretical properties. Simulation studies were conducted to evaluate its Type I error, statistical power, computational efficiency, and estimation accuracy, and the results were compared with those of other methods. The results demonstrate that our proposed method performs well in terms of Type I error, statistical power, computing efficiency (regardless of sample size), and statistical significance estimation accuracy. P-value data from multiple large-scale genome-wide association studies (GWASs) and transcriptome-wise association studies (TWASs) were analyzed. The results demonstrate that our proposed method can be used to identify critical genomic regions associated with rheumatoid arthritis and asthma, increase statistical significance in individual GWASs and TWASs, and control for false-positives more effectively than can Fisher’s method under an independence assumption. We created the software package Pbine, available at GitHub (https://github.com/Yinchun-Lin/Pbine).

1. Introduction

Fisher’s p-value combination method [1], which integrates statistical significance (i.e., p-value $p_{i}$ ) from many statistical hypothesis tests was originally proposed to examine a joint null hypothesis (i.e., an intersection of the individual null hypotheses) in a meta-analysis [2]. Test statistic of Fisher’s method is defined as negative two times the summation of log-transformed p-values (i.e., $F = - 2 \sum_{i} l n (p_{i})$ ) [1]. This method has been broadly applied in bioinformatics and computational biotechnology studies, such as the meta-analysis of genome-wide association study (GWAS) [3], [4], [5] and that of transcriptome-wide association study (TWAS) [6], [7], [8].

If all the p-values $\{p_{i}, i = 1, \dots, K\}$ follow Uniform (0,1) distribution under a null hypothesis independently, test statistic $F$ follows a chi-squared distribution with $2 K$ degrees of freedom. As such, the exact p-value ( $p_{F}$ ) of F can be derived. However, when the p-values are correlated, the mathematical derivation for the sampling distribution of $F$ becomes intractable. Remarkably, ignorance of the correlation of p-values will inflate false-positives in the subsequent statistical inference [9].

The permutation procedure [10], which involves non-parametric resampling without replacement based on a set of observed data, has been applied to generate a null distribution and calculate an empirical p-value ( $p_{P}$ ) of $F$ when the independence assumption of p-values is violated. The permutation procedure is simple in concept and robust to various correlation structures of p-values. However, this method has some limitations. For instance, in a GWAS with a large sample size, which is used to evaluate the phenotype–genotype relationship, permutations over the phenotype status of study samples require intensive computation. Permutations over single nucleotide polymorphisms (SNPs) may distort the inherent structure of SNPs (i.e., linkage disequilibrium). In addition, numerous permutations and intensive computation are required for a correction for multiple testing in GWASs and TWASs [11], [12].

The permutation procedure requires raw data (only p-value data itself cannot perform permutations), which are not always available. For instance, in a meta-analysis of GWAS, the p-values of single-locus association tests from public genomic databases (e.g., GWAS Catalog [13] and GWAS Central [14]) are available and can be combined to infer the genetic association between SNPs and a phenotype of interest. However, no raw genotype and phenotype data are provided in GWAS Catalog and GWAS Central.

Without the need for raw data, a decorrelation procedure [15] transforms dependent p-values into independent p-values only on the basis of p-value data and the correlation structure of the p-values. The distribution F can thus be derived and computed rapidly. However, in this study, we demonstrated that the order of the combination of p-values influences the results (the order-noninterchangeable property). For example, if $p_{1} = 0.9970$ and $p_{2} = 0.1903$ are combined and their correlation coefficient $r$ = 0.5, then the decorrelation procedure provides a p-value of F ( $p_{D}) = 0.0297$ ; however, if the order of the combination of p-values is reversed, then $p_{D} = 0.5060$ .

Considering the permutation and decorrelation methods’ limitations, we suggested a numerical integration method ( $p_{N}$ ) to evaluate the statistical significance of F. Herein, the theoretical properties of a p-value combination are examined. Type I error, statistical power, computational efficiency, and estimation accuracy were evaluated through simulation studies and compared with those of Fisher’s method under an independence assumption ( $p_{F}$ ), permutation ( $p_{P}$ ), and decorrelation ( $p_{D}$ ). Real-world examples of meta-analyses of GWASs and meta-analyses of TWASs are given. R codes Pbine are provided in GitHub at https://github.com/Yinchun-Lin/Pbine.

2. Methods

2.1. Proposed method

Let $R = \{p_{i}, i = 1, \dots, K\}$ denote a list of dependent p-values from K hypothesis tests with a correlation $c o r r (R) = Σ$ . Because the correlation is approximately invariant under monotone transformations [15], $R$ satisfies the following transformation T: $R = 1 - Φ {C Φ^{- 1} (1 - R^{#})}$ , where $C$ is the Cholesky factor of $Σ$ (i.e. $Σ = {C C}^{T}$ ), $R^{#}$ is a list of independent p-values, and $Φ$ denotes the cumulative distribution function of a standard normal random variable. Let $\{p_{i}^{*}, i = 1, \dots, K\}$ be the p-values calculated from the raw data observations for testing K null hypotheses. In our method, the p-value of F is calculated as follows:

p_{N} = \int_{D} J_{T^{- 1}},

where $J_{T^{- 1}}$ is the joint probability density function of $R$ (Text S1), $D$ is the region enclosed by $0 < p_{i} < 1, \forall i$ and $\prod p_{i} < \prod p_{i}^{*}$ . Noted that $p_{N}$ is fixed for any permutation on $\{p_{i}^{*}, i = 1, \dots, K\} .$

For reduced case $K = 2$ with correlation $0 < r < 1$ , then $C = (\begin{matrix} 1 & 0 \\ r & \sqrt{1 - r^{2}} \end{matrix})$ . The p-value of $F$ is derived (Text S1) as follows:

p_{N} = \iint_{D} \frac{1}{\sqrt{1 - r^{2}}} Φ' (\frac{Φ^{- 1} (1 - v) - r Φ^{- 1} (1 - u)}{\sqrt{1 - r^{2}}}) {(Φ^{- 1})}^{'} (1 - v) d (u, v),

(1)

where $D = \{(u, v) | 0 < u, v < 1, u v < p_{1}^{*} p_{2}^{*}\}$ and $Φ^{'}$ is the derivative of a cumulative distribution function of a standard normal variate.

A logarithm transformation of $p_{i}$ ’s was considered to prevent integration on curvilinear region $D$ . $p_{N}$ in Eq. (1) can be rewritten as follows:

p_{N} = 1 - \iint_{D^{'}} \frac{1}{\sqrt{1 - r^{2}}} Φ' (\frac{Φ^{- 1} (1 - e^{v'}) - r Φ^{- 1} (1 - e^{u^{'}})}{\sqrt{1 - r^{2}}}) {(Φ^{- 1})}^{'} (1 - e^{v'}) e^{u'} e^{v'} d (u', v')

where $(u^{'}, v^{'}) = (\ln (u), \ln (v))$ and $D^{'} = \{(u^{'}, v^{'}) | u^{'}, v^{'} < 0, u^{'} + v^{'} > \ln (p_{1}^{*}) + \ln (p_{2}^{*})\}$ .

If correlation of two p-values is $r = 0$ , $p_{N}$ in Eq. (1) can be reduced to Fisher’s method as follows:

p_{N} = p_{F} = p_{1}^{*} p_{2}^{*} - p_{1}^{*} p_{2}^{*} l n (p_{1}^{*} p_{2}^{*})

(2)

For $K = 2$ , the detailed derivation of $J_{T^{- 1}}$ (Text S1), the symmetric behavior and limiting behavior of $J_{T^{- 1}}$ (Text S2), and a comparison of the p-value of our proposed method ( $p_{N}$ ) with the individual p-values (Text S3 and Fig. S1) are provided.

2.2. Simulation studies

To evaluate the Type I error, power, and computation time of the p-value combination methods, bivariate linear regression models were applied to generate data in the simulation studies. Model parameters were assigned or estimated following the purposes of the simulation study. The p-values of our numerical integration method ( $p_{N}$ ), the Fisher’s method under an independence assumption ( $p_{F}$ ), and the decorrelation method ( $p_{D}$ ) were calculated and compared with the benchmark method – the permutation method ( $p_{P}$ ).

2.2.1. Evaluation of Type I error

A bivariate linear regression model was applied to generate two phenotypes ( $y_{ng}$ , $z_{ng}$ ) and gene expression ( $x_{ng}$ ) data in the simulation study as follows:

(\binom{y_{ng}}{z_{ng}}) = (\binom{{β_{j} x}_{ng}}{{γ_{j} x}_{ng}}) + (\binom{ε_{ng}}{φ_{ng}}), n = 1, \dots, N, g = 1, \dots, G

(3)

where N denotes the total number of samples and G indicates the total number of genes. Random error terms $ε_{ng}$ and $φ_{ng} = w ε_{ng} + (1 - w) ∊_{ng}$ have correlation coefficient $c o r r (ε_{ng}, φ_{ng}) = w$ . We assumed that $ε_{ng}$ and $∊_{ng}$ would be independent white noises following a standard normal distribution. Gene expression $x_{ng}$ was independently generated from $U n i f o r m (0, 1)$ .

Here, we considered the regression coefficient under the null hypothesis: $β_{j} = γ_{j} = 0$ ; sample size $N = 500$ ; gene number $G = 2, 000$ ; simulation replications $K = 2, 000$ . Given a specified $w$ , we generated the phenotypes and gene expression data. In each of the simulated datasets, p-values $p_{y}$ and $p_{z}$ were obtained by testing null hypotheses $H_{0 g}^{β} : β_{g} = 0$ and $H_{0 g}^{γ} : γ_{g} = 0$ , respectively, through Student’s t test.

The results demonstrate that the methods control Type I error well for uncorrelated p-values, i.e., $c o r r (p_{y}, p_{z}) = 0$ (Fig. 1A). For the positively correlated p-values $(p_{y}, p_{z})$ , Fisher’s method (red line) under an independence assumption ( $p_{F}$ ) exhibited an inflated Type I error particularly for an increased correlation of p-values. Our method $p_{N}$ and the decorrelation procedure $p_{D}$ exhibited a Type I error similar to that of the benchmark method $p_{P}$ ; the false-positive was 0.05–0.06. This indicates that p-value dependency must be considered when the sampling distribution of a p-value combination is derived. Therefore, our proposed method controls Type I errors well.

2.2.2. Evaluation of statistical power

A bivariate linear regression model was applied to generate two phenotypes ( $y_{ng}$ , $z_{ng}$ ) and gene expression ( $x_{ng}$ ) data in the simulation study as follows:

(\binom{y_{ng}}{z_{ng}}) = (\binom{{β_{j} x}_{ng}}{w {β_{j} x}_{ng}}) + (\binom{ε_{ng}}{φ_{ng}}), n = 1, \dots, N, g = 1, \dots, G

(4)

where N indicates the total number of samples and G indicates the total number of genes. That is, $z_{ng} = w y_{ng} + (1 - w) ∊_{ng}$ and $φ_{ng} = w ε_{ng} + (1 - w) ∊_{ng}$ , and $c o r r (y_{ng}, z_{ng}) = c o r r (ε_{ng}, φ_{ng}) = w$ . We assumed that random error terms $ε_{ng}$ and $∊_{ng}$ would follow an independent standard normal random distribution individually. Gene expression $x_{ng}$ was generated from $U n i f o r m (0, 1)$ independently. Here, we considered $β_{j} = 0.6$ ; $w = 0.1, 0.3, 0.5, 0.7, 0.9$ ; sample size $N = 500$ ; number of genes $G = 2, 000$ ; simulation replications $K = 2, 000$ . P-values $p_{y}$ and $p_{z}$ were obtained by testing null hypotheses $H_{0 g}^{β} : β_{g} = 0$ and $H_{0 g}^{γ} : γ_{g} = 0, w h e r e γ_{g} = w β_{g}$ . The relationship between $c o r r (ε_{ng}, φ_{ng}) = w$ and $c o r r (p_{y}, p_{z}) = r$ is discussed (Text S4 and Fig. S2).

The results demonstrated that the four methods have similar power and that the power increases with $r$ (Fig. 1B). $p_{F}$ has the highest power, particularly for a higher $r$ ; however, the high power is accompanied by an inflated Type I error as mentioned in the previous section (Fig. 1A). $p_{D}$ has the lowest power and deviates from the benchmark $p_{P}$ , particularly for a high $r$ . The proposed method $p_{N}$ has high power similar to that of the benchmark $p_{P}$ .

2.2.3. Evaluation of computation time

The simulation model for an evaluation of Type I error was then applied. We compared the computation time between the proposed method ( $p_{N}$ ) and the benchmark method ( $p_{P}$ ) with various sample sizes ( $N =$ 3,000, 5,000, and 7,000), gene numbers ( $G =$ 10,000, 20,000, and 30,000), correlation coefficients ( $w = 0.1, 0.5, a n d 0.9$ ), and permutation times ( $K =$ 2,000–20,000, with increments of 2,000). The results show that the computation time of the proposed method ( $p_{N}$ ) is unchanged with the sample size, number of genes, and correlation coefficient (Fig. 1C). The computation time for the permutation method ( $p_{P}$ ) increases with sample size, number of genes, and permutation time, but it did not change as the correlation coefficient increased; it required approximately 2,500 h to compute 30,000 pairs of p-value combinations. Our proposed method was more computationally efficient, requiring < 10 h for computation. Because Fisher’s method under an independence assumption ( $p_{F}$ ) and the decorrelation method ( $p_{D}$ ) do not involve complex re-sampling or integration procedures, they are computationally efficient.

2.2.4. Evaluation of estimation accuracy

We applied the model for an evaluation of Type I error with $G = 1, 000$ , $N = 500,$ and $K = 5, 000$ , and compared the estimation accuracy of the three p-value combination methods $p_{N}$ , $p_{F}$ , and $p_{D}$ with that of the benchmark permutation $p_{P}$ . Correlation coefficients $c o r r (p_{y}, p_{z}) = r = 0.1, 0.5, 0.9$ were considered. The results demonstrate that $p_{N}$ , $p_{F}$ , and $p_{D}$ deviate from the benchmark $p_{P}$ as $c o r r (p_{y}, p_{z}) = r$ increases (Fig. 1D–1F). In addition, $p_{D}$ exhibits a certain proportion of outliers (Fig. 1D–1F). The biased estimation can be explained by the order-noninterchangeable property (Text S5) and non-uniformity property (Fig. S3) of $p_{D}$ . Compared with $p_{F}$ and $p_{D}$ , the proposed method ( $p_{N}$ ) is closest to the benchmark $p_{P}$ . Thus, the proposed method provides a more accurate estimate than do $p_{F}$ and $p_{D}$ .

3. Real data applications

3.1. Meta-GWAS for rheumatoid arthritis

This meta-analysis identified SNPs associated with rheumatoid arthritis on the basis of two large-scale population-based GWASs – The North American Rheumatoid Arthritis Consortium (NARAC) data $(N_{c a s e} = 868, N_{c o n t r o l} = 1, 194)$ [16] and Wellcome Trust Case Control Consortium (WTCCC) data $(N_{c a s e} = 1, 999, N_{c o n t r o l} = 3, 002)$ [17]. In each of the two GWASs, a logistic regression analysis with covariate adjustment for sex and SNP coding based on an additive genetic model was performed to examine the genetic associations between rheumatoid arthritis disease status and individual SNP markers. At each SNP, p-values based on NARAC and WTCCC data were obtained separately.

We applied Fisher’s ( $p_{F}$ ) and our ( $p_{N}$ ) methods to combine the two p-values at each SNP locus in the NARAC and WTCCC data. There were 4,963 statistical tests (because of 4,963 SNPs on chromosome 6) in this meta-GWAS. Bonferroni’s correction [18] for multiple testing was performed to obtain adjusted p-values for $p_{F}$ and $p_{N}$ separately. The result demonstrate that both the methods could be used to identify the major histocompatibility complex (MHC) region on chromosome 6p21.3 (Fig. 2), which is strongly associated with rheumatoid arthritis [19], [20]. Our method ennabled us to identify the SNP rs9391858 (p = $3.495 \times 10^{- 6}$ ) truly associated with rheumatoid arthritis [21] (Fig. 2); however, the individual GWASs could not detect this SNP $p = 1.020 \times 10^{- 5}$ in NARAC and $p = 1.291 \times 10^{- 4}$ in WTCCC; Fig. S4). Fisher’s method but not ours identified six false-positive SNPs: rs2394102, rs11752073, rs12697946, rs9394169, rs3818528, and rs3130014 (Fig. 2).

Fig. 2 — **Manhattan plots for chromosome 6 in the *meta*-GWAS.** This *meta*-GWAS contained 4,963 SNPs on chromosome 6. The Fisher’s method ( $p_{F}$ ) and our method ( $p_{N}$ ) were employed. Each point indicates a SNP. The x-axis indicates physical position of a SNP. The y-axis indicates p-value in a scale of –log₁₀. The green lines indicate false-positive events identified by the Fisher’s method ( $p_{F}$ ) but not by our method ( $p_{N}$ ); they involved six SNPs: rs11752073, rs2394102, rs3130014, rs3818528, rs9394169, and rs12697946 (green line). The orange lines indicate false-positive events identified by both of the Fisher’s method ( $p_{F}$ ) and our method ( $p_{N}$ ). The light blue line indicates the SNP known to be associated with rheumatoid arthritis and identified by the Fisher’s method ( $p_{F}$ ) and our method ( $p_{N}$ ), but not by either of the two studies (Fig. S4). The red dashed line indicates the significance level after Bonferroni correction for multiple testing.

The real data analysis demonstrated that our method can be used to identify crucial genomic regions strongly associated with rheumatoid arthritis, detect some of rheumatoid arthritis-associated loci not detected by individual GWASs, and control for false-positive more efficiently than can Fisher’s method.

3.2. Meta-TWAS for rheumatoid arthritis

This meta-analysis was performed to identify genes differentially expressed in patients with rheumatoid arthritis and normal controls of European descent based on the two large-scale TWASs: Eyre et al. $(N_{c a s e} = 13, 838, N_{c o n t r o l} = 33, 742)$ [22] and Stahl et al. $(N_{c a s e} = 5, 539, N_{c o n t r o l} = 20, 169)$ [23]. In each TWAS, the gene-level p-values used to examine the association of rheumatoid arthritis disease status with gene expression were downloaded from webTWAS [24]. However, raw gene expression data were unavailable on the website. When the p-value data were downloaded, Elastic-net was used as a model for transcriptome data prediction in the MetaXcan framework [25].

We applied Fisher’s ( $p_{F}$ ) and our ( $p_{N}$ ) methods to combine the two p-values from the two TWASs. There were 3,175 statistical tests (because of 3,175 genes overlapping between the two studied TWAS datasets) in this meta-TWAS. Bonferroni’s correction [18] for multiple testing was performed to obtain adjusted p-values for $p_{F}$ and $p_{N}$ separately. The results indicate that both methods identified the MHC region as a key genomic region for rheumatoid arthritis (Fig. 3). Fisher’s method identified 11 genes outside the MHC region. Except ANKRD55 was a true-positive, all other 10 genes were false-positively identified. Our method did not have false-positives but failed to detect ANKRD55 ( $p_{N} = 4.410 \times 10^{- 5}$ ). Our and Fisher’s methods did not identify additional genes associated with rheumatoid arthritis detected by individual TWASs; nevertheless, our method demonstrated a more significant signal for several rheumatoid arthritis genes. For instance, in the TWAS of Eyre et al [22], the TWAS of Stahl et al [23], and this meta-TWAS, the p-values were $p = 1.93 \times 10^{- 6}$ , $p = 6.33 \times 10^{- 7}$ , and $p = 1.39 \times 10^{- 7}$ , respectively, for AFF3 on chromosome 2, and $p = 3.07 \times 10^{- 5}$ , $p = 1.30 \times 10^{- 5}$ , and $p = 4.09 \times 10^{- 6}$ , respectively, for IRF5 on chromosome 7 (Fig. S5). Thus, this real data analysis demonstrated the inference of our meta-TWAS.

Fig. 3 — **Manhattan plots for 22 autosomes in the *meta*-TWAS.** This *meta*-TWAS contained 3,175 genes overlapping between the two studied TWAS datasets. The Fisher’s method ( $p_{F}$ ) and our method ( $p_{N}$ ) were employed. Each point indicates a gene. The x-axis indicates the physical position of a gene in an autosome. The y-axis indicates p-value in a scale of –log₁₀. The 11 purple lines indicate the genes for which Fisher’s method ( $p_{F}$ ) reported genetic association but our method ( $p_{N}$ ) did not. All genes, *ANKRD55*, were false-positively detected. The red dashed line indicates the significance level after Bonferroni correction for multiple testing.

3.3. Meta-TWAS for asthma

We evaluated the performance of the proposed method in combining p-values from more than two studies. We downloaded the gene-level p-values data in four large-scale studies for asthma from webTWAS [24] – Canela-Xandri et al. [26] with $N_{c a s e} = 52, 269$ and $N_{c o n t r o l} = 399, 995$ ; Zhu et al. [27] with $N_{c a s e} = 14, 085$ and $N_{c o n t r o l} = 76, 768$ ; Zhu et al. [28] with $N_{c a s e} = 46, 802$ and $N_{c o n t r o l} = 347, 481$ ; and Demenais et al. [29] with $N_{c a s e} = 19, 954$ and $N_{c o n t r o l} = 107, 715$ . We analyzed 15 genes, including 10 asthma-associated genes (HLA-G, ATP6V1G2, TAP1, TRIM10, HLA-DRB1, LST1, HLA-DRB5, DDX39B, MSH5, and HLA-A) in the MHC region and five genes that they are located outside the MHC region and no studies have reported association of asthma with the genes (PHIP, EED, PYGB, SMARCD2, and BORCS8) (Table S1).

We applied Fisher’s ( $p_{F}$ ) and our ( $p_{N}$ ) methods to identify differentially expressed genes in this meta-TWAS. After applying Bonferroni’s adjustment for multiple testing correction, the adjusted p-values are provided (Table 1). Fisher’s method ( $p_{F}$ ) identified all the ten asthma-associated genes, but also identified a high proportion of false-positive genes. Our method ( $p_{N}$ ) identified most of the asthma-associated genes, except for HLA-DRB5 (adjusted p-value = 0.069) and controlled false positive well. The results suggest that our method performs well and better than Fisher’s method in combining more than two p-values.

Table 1.

Our method performs better than Fisher’s method when combining more than two p-values. Ten asthma related genes and five asthma unrelated gene were analyzed in this meta-TWAS. Adjusted p-values of Fisher’s method ( $p_{F}$ ), our method with equal weights ( $p_{N}$ ), and our method with unequal weights ( $p_{N}^{w}$ ) are provided. All the p-values were adjusted using Bonferroni’s correction for multiple testing. The numbers marked in bold indicate they are statistically significant.

	Gene name	Chromosome	$p_{F}$	$p_{N}^{w}$	$p_{N}$
Asthma	HLA-G	Chr. 6	0.0000127	0.0244885	0.0012742
Related	ATP6V1G2	Chr. 6	0.0005959	0.0030811	0.0168662
Genes	HLA-DRB5	Chr. 6	0.0059697	0.0386449	0.0689027
	TAP1	Chr. 6	0.0003276	0.0025073	0.0118823
	TRIM10	Chr. 6	0.0000009	0.0002722	0.0000589
	HLA-DRB1	Chr. 6	0.0000018	0.0004108	0.0003191
	LST1	Chr. 6	0.0000009	0.0002963	0.0000931
	DDX39B	Chr. 6	0.0000008	0.0001398	0.0000000
	MSH5	Chr. 6	0.0005163	0.0009750	0.0154954
	HLA-A	Chr. 6	0.0016827	0.0046854	0.0318339
Asthma	PHIP	Chr. 6	0.0418319	0.0986147	0.2299088
Unrelated	EED	Chr. 11	0.0094608	0.0100037	0.0915084
Genes	PYGB	Chr. 20	0.0124950	0.0805022	0.1086624
	SMARCD2	Chr. 17	0.0131833	0.0153122	0.1123372
	BORCS8	Chr. 19	0.0233820	0.1422346	0.1601654

Open in a new tab

Furthermore, our method can assign different weights to p-values in different TWASs. Here, the sample size of a study relative to the total sample size in the four TWASs was calculated as a weight; that is, a higher weight was assigned to a study with a higher sample size. Our weighted method ( $p_{N}^{w}$ ) can identify all the ten asthma-associated genes, including HLA-DRB5 that cannot be detected by the equal-weighted method ( $p_{N}$ ). However, $p_{N}^{w}$ obtained some false-positive findings, such as EED and SMARCD2 in this analysis.

4. Conclusion and discussion

In this study, we proposed a novel numerical integration method ( $p_{N}$ ) to evaluate statistical significance by combining correlated p-values from multiple studies in a meta-analysis. The proposed method is simple in concept and flexible to various correlation structures. Our theoretical investigation and simulation studies demonstrated that our proposed method performs well in terms of Type I error, statistical power, computing efficiency (regardless of the sample size), and statistical significance estimation accuracy. Real applications in large-scale GWASs and TWASs for rheumatoid arthritis and asthma facilitated efficient identification of critical genomic regions, such as the MHC region, and genes associated with rheumatoid arthritis and asthma, not reported in the previous GWASs or TWASs. We developed Pbine (https://github.com/Yinchun-Lin/Pbine) for meta-analysis based on a combination of p-values from multiple studies. Our method can combine p-values from more than two studies with different sample sizes and precision so as to different levels of importance and information. The incorporation of unequal weights into a p-value combination for more than two studies has been implemented into Pbine.

The results in our simulation studies and real data analyses demonstrated that our method outperforms Fisher’s method. We discussed the weakness of the decorrelation method – an order-noninterchangeable property. Although $p_{D}$ can be calculated in either an ascending order or a descending order of p-values, we showed that p-values of these two procedures in the decorrelation method violate the uniformity property under a null distribution – the ascending order method ( $p_{D A}$ ) tends to have more false negative and the descending order method ( $p_{D D}$ ) have more false positive, particularly at the case with a high between-study correlation of p-values (Fig. S3). When we applied $p_{D A}$ and $p_{D D}$ in the meta-GWAS and meta-TWAS for rheumatoid arthritis, we did find a number of false-positive and false-negative findings (Table S2).

In addition to the methods discussed in this paper, studies have reported other p-value combination methods [30]. Some of these methods depend on p-value independency assumptions [31], [32], parametric assumptions [33], [34], and mathematical approximations such as Satterthwaite’s approximation [35], [36]. These methods may be efficient in computation. However, when their assumptions are violated, their performance is negatively affected by inflated Type I error, particularly when significance level $α$ is low [37]. In addition, several methods have been developed on the basis of a generalization of Fisher’s product p-value method, such as the weighted [38], truncated [15], and rank-truncated [39] product p-value methods. Our method can be generalized to more complicated cases.

CRediT authorship contribution statement

Yin-Chun Lin: Methodology, Software, Formal analysis, Writing – original draft, Writing – review & editing. Yu-Jen Liang: Data curation, Resources. Hsin-Chou Yang: Conceptualization, Methodology, Writing – original draft, Writing – review & editing, Resources, Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This study was supported by a research grant from the Ministry of Science and Technology of Taiwan (MOST 109-2314-B-001-007-MY3). We thank NARAC and WTCCC for making their data on GWASs for rheumatoid arthritis available. The NARAC data were provided by Genetic Analysis Workshop 16 (R01 GM031575) and gathered with the support of grants from the National Institutes of Health (N01-AR-2-2263 and R01-AR-44422) and the National Arthritis Foundation. The WTCCC data were generated by the Wellcome Trust Case Control Consortium. A full list of the investigators who contributed to data generation is available at www.wtccc.org.uk. The Wellcome Trust (Award 079113) provided funding for that project.

Footnotes

^{Appendix A}

Supplementary data to this article can be found online at https://doi.org/10.1016/j.csbj.2022.06.055.

Appendix A. Supplementary data

The following are the Supplementary data to this article:

Supplementary data 1

mmc1.docx^{(1.4MB, docx)}

References

1.Fisher R.A. 4th ed. Oliver and Boyd; Edinburgh: 1932. Statistical methods for research workers. [Google Scholar]
2.Haidich A.B. Meta-analysis in medical research. Hippokratia. 2010;14(1):29–37. [PMC free article] [PubMed] [Google Scholar]
3.Evangelou E., Ioannidis J.P. Meta-analysis methods for genome-wide association studies and beyond. Nat Rev Genet. 2013;14(6):379–389. doi: 10.1038/nrg3472. [DOI] [PubMed] [Google Scholar]
4.Hagg S., et al. Gene-based meta-analysis of genome-wide association studies implicates new loci involved in obesity. Hum Mol Genet. 2015;24(23):6849–6860. doi: 10.1093/hmg/ddv379. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Springelkamp H., et al. Meta-analysis of genome-wide association studies identifies novel loci associated with optic disc morphology. Genet Epidemiol. 2015;39(3):207–216. doi: 10.1002/gepi.21886. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Scott E.N., et al. Transcriptome-wide association study uncovers the role of essential genes in anthracycline-induced cardiotoxicity. NPJ Genom Med. 2021;6(1):35. doi: 10.1038/s41525-021-00199-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Zeng P., et al. Aggregating multiple expression prediction models improves the power of transcriptome-wide association studies. Hum Mol Genet. 2021;30(10):939–951. doi: 10.1093/hmg/ddab056. [DOI] [PubMed] [Google Scholar]
8.Hu Y., et al. A statistical framework for cross-tissue transcriptome-wide association analysis. Nat Genet. 2019;51(3):568–576. doi: 10.1038/s41588-019-0345-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Li Q.Z., et al. Fisher's method of combining dependent statistics using generalizations of the gamma distribution with applications to genetic pleiotropic associations. Biostatistics. 2014;15(2):284–295. doi: 10.1093/biostatistics/kxt045. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Karlsson A. Permutation, parametric, and bootstrap tests of hypotheses. J Royal Stat Soc Series a-Stat Soc. 2006;169:171. [Google Scholar]
11.Hayes B. Overview of statistical methods for genome-wide association studies (GWAS) Methods Mol Biol. 2013;1019:149–169. doi: 10.1007/978-1-62703-447-0_6. [DOI] [PubMed] [Google Scholar]
12.Feng H., et al. Multitrait transcriptome-wide association study (TWAS) tests. Genet Epidemiol. 2021;45(6):563–576. doi: 10.1002/gepi.22391. [DOI] [PubMed] [Google Scholar]
13.MacArthur J., et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog) Nucleic Acids Res. 2017;45(D1):D896–D901. doi: 10.1093/nar/gkw1133. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Beck T., et al. GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies. Eur J Hum Genet. 2014;22(7):949–952. doi: 10.1038/ejhg.2013.274. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Zaykin D.V., et al. Truncated product method for combining P-values. Genet Epidemiol. 2002;22(2):170–185. doi: 10.1002/gepi.0042. [DOI] [PubMed] [Google Scholar]
16.Plenge R.M., et al. TRAF1-C5 as a risk locus for rheumatoid arthritis–a genomewide study. N Engl J Med. 2007;357(12):1199–1209. doi: 10.1056/NEJMoa073491. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Wellcome Trust Case Control C. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447(7145):661–678. doi: 10.1038/nature05911. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Bonferroni, C.E., Teoria statistica delle classi e calcolo delle probabilit ‘a. Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commerciali di Firenze, 1936. 8.
19.Newton J.L., et al. A review of the MHC genetics of rheumatoid arthritis. Genes Immun. 2004;5(3):151–157. doi: 10.1038/sj.gene.6364045. [DOI] [PubMed] [Google Scholar]
20.Matzaraki V., et al. The MHC locus and genetic susceptibility to autoimmune and infectious diseases. Genome Biol. 2017;18(1):76. doi: 10.1186/s13059-017-1207-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Zheng W., Rao S. Knowledge-based analysis of genetic associations of rheumatoid arthritis to inform studies searching for pleiotropic genes: a literature review and network analysis. Arthritis Res Ther. 2015;17:202. doi: 10.1186/s13075-015-0715-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Eyre S., et al. High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis. Nat Genet. 2012;44(12):1336–1340. doi: 10.1038/ng.2462. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Stahl E.A., et al. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat Genet. 2010;42(6):508–514. doi: 10.1038/ng.582. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Cao C., et al. webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study. Nucleic Acids Res. 2022;50(D1):D1123–D1130. doi: 10.1093/nar/gkab957. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Barbeira A.N., et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9(1):1825. doi: 10.1038/s41467-018-03621-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Canela-Xandri O., Rawlik K., Tenesa A. An atlas of genetic associations in UK Biobank. Nat Genet. 2018;50(11):1593–1599. doi: 10.1038/s41588-018-0248-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Zhu Z., et al. A genome-wide cross-trait analysis from UK Biobank highlights the shared genetic architecture of asthma and allergic diseases. Nat Genet. 2018;50(6):857–864. doi: 10.1038/s41588-018-0121-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Zhu Z., et al. Shared genetics of asthma and mental health disorders: a large-scale genome-wide cross-trait analysis. Eur Respir J. 2019;54(6) doi: 10.1183/13993003.01507-2019. [DOI] [PubMed] [Google Scholar]
29.Demenais F., et al. Multiancestry association study identifies new asthma risk loci that colocalize with immune-cell enhancer marks. Nat Genet. 2018;50(1):42–53. doi: 10.1038/s41588-017-0014-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Chen C.W., Yang H.C. OPATs: Omnibus P-value association tests. Brief Bioinform. 2019;20(1):1–14. doi: 10.1093/bib/bbx068. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Lipták T. On the combination of independent tests. A Magyar Tudományos Akadémia Matematikai Kutató Intézetének Közleményi. 1958;3:171–197. [Google Scholar]
32.Leroy Folks J. Handbook of Statistics. Elsevier; 1984. 6 Combination of independent tests; pp. 113–121. [Google Scholar]
33.Brown M.B. 400: A method for combining non-independent, one-sided tests of significance. Biometrics. 1975;31(4):987–992. [Google Scholar]
34.Kost J.T., McDermott M.P. Combining dependent P-values. Stat Probab Lett. 2002;60(2):183–190. [Google Scholar]
35.Satterthwaite F.E. An approximate distribution of estimates of variance components. Biometrics. 1946;2(6):110–114. [PubMed] [Google Scholar]
36.Dai H., Leeder J.S., Cui Y. A modified generalized Fisher method for combining probabilities from dependent tests. Front Genet. 2014;5:32. doi: 10.3389/fgene.2014.00032. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Zhang H., Wu Z. The generalized Fisher's combination and accurate p-value calculation under dependence. Biometrics. 2022 doi: 10.1111/biom.13634. [DOI] [PubMed] [Google Scholar]
38.Good I. On the weighted combination of significance tests. J Roy Stat Soc: Ser B (Methodol) 1955;17:264–265. [Google Scholar]
39.Dudbridge F., Koeleman B.P. Rank truncated product of P-values, with application to genomewide association scans. Genet Epidemiol. 2003;25(4):360–366. doi: 10.1002/gepi.10264. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data 1

mmc1.docx^{(1.4MB, docx)}

[b0005] 1.Fisher R.A. 4th ed. Oliver and Boyd; Edinburgh: 1932. Statistical methods for research workers. [Google Scholar]

[b0010] 2.Haidich A.B. Meta-analysis in medical research. Hippokratia. 2010;14(1):29–37. [PMC free article] [PubMed] [Google Scholar]

[b0015] 3.Evangelou E., Ioannidis J.P. Meta-analysis methods for genome-wide association studies and beyond. Nat Rev Genet. 2013;14(6):379–389. doi: 10.1038/nrg3472. [DOI] [PubMed] [Google Scholar]

[b0020] 4.Hagg S., et al. Gene-based meta-analysis of genome-wide association studies implicates new loci involved in obesity. Hum Mol Genet. 2015;24(23):6849–6860. doi: 10.1093/hmg/ddv379. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0025] 5.Springelkamp H., et al. Meta-analysis of genome-wide association studies identifies novel loci associated with optic disc morphology. Genet Epidemiol. 2015;39(3):207–216. doi: 10.1002/gepi.21886. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0030] 6.Scott E.N., et al. Transcriptome-wide association study uncovers the role of essential genes in anthracycline-induced cardiotoxicity. NPJ Genom Med. 2021;6(1):35. doi: 10.1038/s41525-021-00199-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0035] 7.Zeng P., et al. Aggregating multiple expression prediction models improves the power of transcriptome-wide association studies. Hum Mol Genet. 2021;30(10):939–951. doi: 10.1093/hmg/ddab056. [DOI] [PubMed] [Google Scholar]

[b0040] 8.Hu Y., et al. A statistical framework for cross-tissue transcriptome-wide association analysis. Nat Genet. 2019;51(3):568–576. doi: 10.1038/s41588-019-0345-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0045] 9.Li Q.Z., et al. Fisher's method of combining dependent statistics using generalizations of the gamma distribution with applications to genetic pleiotropic associations. Biostatistics. 2014;15(2):284–295. doi: 10.1093/biostatistics/kxt045. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0050] 10.Karlsson A. Permutation, parametric, and bootstrap tests of hypotheses. J Royal Stat Soc Series a-Stat Soc. 2006;169:171. [Google Scholar]

[b0055] 11.Hayes B. Overview of statistical methods for genome-wide association studies (GWAS) Methods Mol Biol. 2013;1019:149–169. doi: 10.1007/978-1-62703-447-0_6. [DOI] [PubMed] [Google Scholar]

[b0060] 12.Feng H., et al. Multitrait transcriptome-wide association study (TWAS) tests. Genet Epidemiol. 2021;45(6):563–576. doi: 10.1002/gepi.22391. [DOI] [PubMed] [Google Scholar]

[b0065] 13.MacArthur J., et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog) Nucleic Acids Res. 2017;45(D1):D896–D901. doi: 10.1093/nar/gkw1133. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0070] 14.Beck T., et al. GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies. Eur J Hum Genet. 2014;22(7):949–952. doi: 10.1038/ejhg.2013.274. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0075] 15.Zaykin D.V., et al. Truncated product method for combining P-values. Genet Epidemiol. 2002;22(2):170–185. doi: 10.1002/gepi.0042. [DOI] [PubMed] [Google Scholar]

[b0080] 16.Plenge R.M., et al. TRAF1-C5 as a risk locus for rheumatoid arthritis–a genomewide study. N Engl J Med. 2007;357(12):1199–1209. doi: 10.1056/NEJMoa073491. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0085] 17.Wellcome Trust Case Control C. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447(7145):661–678. doi: 10.1038/nature05911. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0090] 18.Bonferroni, C.E., Teoria statistica delle classi e calcolo delle probabilit ‘a. Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commerciali di Firenze, 1936. 8.

[b0095] 19.Newton J.L., et al. A review of the MHC genetics of rheumatoid arthritis. Genes Immun. 2004;5(3):151–157. doi: 10.1038/sj.gene.6364045. [DOI] [PubMed] [Google Scholar]

[b0100] 20.Matzaraki V., et al. The MHC locus and genetic susceptibility to autoimmune and infectious diseases. Genome Biol. 2017;18(1):76. doi: 10.1186/s13059-017-1207-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0105] 21.Zheng W., Rao S. Knowledge-based analysis of genetic associations of rheumatoid arthritis to inform studies searching for pleiotropic genes: a literature review and network analysis. Arthritis Res Ther. 2015;17:202. doi: 10.1186/s13075-015-0715-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0110] 22.Eyre S., et al. High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis. Nat Genet. 2012;44(12):1336–1340. doi: 10.1038/ng.2462. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0115] 23.Stahl E.A., et al. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat Genet. 2010;42(6):508–514. doi: 10.1038/ng.582. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0120] 24.Cao C., et al. webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study. Nucleic Acids Res. 2022;50(D1):D1123–D1130. doi: 10.1093/nar/gkab957. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0125] 25.Barbeira A.N., et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9(1):1825. doi: 10.1038/s41467-018-03621-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0130] 26.Canela-Xandri O., Rawlik K., Tenesa A. An atlas of genetic associations in UK Biobank. Nat Genet. 2018;50(11):1593–1599. doi: 10.1038/s41588-018-0248-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0135] 27.Zhu Z., et al. A genome-wide cross-trait analysis from UK Biobank highlights the shared genetic architecture of asthma and allergic diseases. Nat Genet. 2018;50(6):857–864. doi: 10.1038/s41588-018-0121-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0140] 28.Zhu Z., et al. Shared genetics of asthma and mental health disorders: a large-scale genome-wide cross-trait analysis. Eur Respir J. 2019;54(6) doi: 10.1183/13993003.01507-2019. [DOI] [PubMed] [Google Scholar]

[b0145] 29.Demenais F., et al. Multiancestry association study identifies new asthma risk loci that colocalize with immune-cell enhancer marks. Nat Genet. 2018;50(1):42–53. doi: 10.1038/s41588-017-0014-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0150] 30.Chen C.W., Yang H.C. OPATs: Omnibus P-value association tests. Brief Bioinform. 2019;20(1):1–14. doi: 10.1093/bib/bbx068. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0155] 31.Lipták T. On the combination of independent tests. A Magyar Tudományos Akadémia Matematikai Kutató Intézetének Közleményi. 1958;3:171–197. [Google Scholar]

[b0160] 32.Leroy Folks J. Handbook of Statistics. Elsevier; 1984. 6 Combination of independent tests; pp. 113–121. [Google Scholar]

[b0165] 33.Brown M.B. 400: A method for combining non-independent, one-sided tests of significance. Biometrics. 1975;31(4):987–992. [Google Scholar]

[b0170] 34.Kost J.T., McDermott M.P. Combining dependent P-values. Stat Probab Lett. 2002;60(2):183–190. [Google Scholar]

[b0175] 35.Satterthwaite F.E. An approximate distribution of estimates of variance components. Biometrics. 1946;2(6):110–114. [PubMed] [Google Scholar]

[b0180] 36.Dai H., Leeder J.S., Cui Y. A modified generalized Fisher method for combining probabilities from dependent tests. Front Genet. 2014;5:32. doi: 10.3389/fgene.2014.00032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0185] 37.Zhang H., Wu Z. The generalized Fisher's combination and accurate p-value calculation under dependence. Biometrics. 2022 doi: 10.1111/biom.13634. [DOI] [PubMed] [Google Scholar]

[b0190] 38.Good I. On the weighted combination of significance tests. J Roy Stat Soc: Ser B (Methodol) 1955;17:264–265. [Google Scholar]

[b0195] 39.Dudbridge F., Koeleman B.P. Rank truncated product of P-values, with application to genomewide association scans. Genet Epidemiol. 2003;25(4):360–366. doi: 10.1002/gepi.10264. [DOI] [PubMed] [Google Scholar]

PERMALINK

Evaluating statistical significance in a meta-analysis by using numerical integration

Yin-Chun Lin

Yu-Jen Liang

Hsin-Chou Yang

Graphical abstract

Abstract

1. Introduction