Skip to main content
. Author manuscript; available in PMC: 2021 Jan 5.
Published in final edited form as: Nat Neurosci. 2017 Dec;20(12):1661–1668. doi: 10.1038/s41593-017-0017-9

Figure 1. Statistical power in the noncoding genome.

Figure 1.

We estimated the power at a significance threshold (alpha) of 5 × 10−5, selected to account for 1,000 categories of noncoding variants, to detect an excess of noncoding variants at 122,500 risk loci in cases vs. controls as we varied the relative risk and risk:non-risk ratio, which represents annotation quality (Table S2). In a) we assessed the power for detecting an excess of de novo mutations in 5,000 cases vs. 5,000 controls as the relative risk increases. With a risk:non-risk ratio of 1:20, approximately equivalent to assessing protein truncating variants in the coding genome, we achieve >80% power with a relative risk of 5. In b) the power to detect an excess burden of rare variants (allele frequency ≤0.1%) is assessed in 20,000 cases vs. 20,000 controls. In c) we assessed the power to identify an excess of de novo mutations at a specific genomic locus, e.g. the noncoding region regulating a single gene. Consequently, we set the significance threshold (alpha) at 2.5×10−6. In d) we assessed the power to identify an excess of rare variants (allele frequency ≤0.1%) at a specific nucleotide (alpha = 1.7×10−11), since this yielded better power than testing for burden at a locus (alpha = 2.5×10−6).