Skip to main content
. Author manuscript; available in PMC: 2018 Sep 1.
Published in final edited form as: Genet Epidemiol. 2017 Jun 4;41(6):498–510. doi: 10.1002/gepi.22052

Algorithm 1.

Procedure with P-values of gene-level statistics computed via asymptotic testing

  1. For each study k, compute the gene-level statistics based on association testing via GLM using isoform-specific expression:

    1. Given Tkg = 1, compute the score statistic Ukg and its estimated covariance matrix Vkg for each gene g in study k. If Tkg = 0, simply set the corresponding Ukg and Vkg to 0.

  2. Meta-analysis:

    1. For the FE approach, compute the gene-level statistic Qg using (2) for 1 ≤ gG, whose asymptotic reference distribution is χ2 with df = Ig. For the RE approach, estimate Bg from data and compute Qg using (4), whose asymptotic reference distribution is approximately χ2 with df = Ig + 1.

    2. Estimate P-values of each Qg, denoted by p(Qg), based on asymptotic testing.

  3. Set enrichment analysis:

    1. For each pathway p, compute the corrected OKS statistic ωp based on the rankings of [p(Qg)]g=1G, 1 ≤ pP.

    2. Permute gene labels N times and calculate the permuted statistics ωp(n), 1 ≤ nN, 1 ≤ pP.

    3. Estimate the P-value of pathway p: p(ωp)=n=1Np=1PT(ωp(n)ωp)/(N·P), 1 ≤ pP, where T(·) is the indicator function.

    4. Estimate the Q-value of pathway p using a smoothing method (Storey and Tibshirani, 2003) implemented in an R package named qvalue (Dabney et al., 2011). Pathways with q(ωp)δ are claimed to be enriched.