Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2017 Mar 27;206(1):91–104. doi: 10.1534/genetics.117.200063

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

Copyright © 2017 by the Genetics Society of America

PMC Copyright notice

Mean- $R^{2}$ and calibration for imputation based on GeneImp. (A) Mean- $R^{2}$ as a function of window-size. Results are from chromosome 22. A smaller window and the Combined panel lead to higher mean- $R^{2},$ while more filtered haplotypes lead to very small gains. (B) Mean- $R^{2}$ as a function of MAF. Results are from the whole genome using $ℓ = 200$ filtered haplotypes. Single window-split corresponds to median window-size of 58.2 kb, average of two window-splits is taken over results with median window-sizes of 58.2 and 78.9 kb. Mean- $R^{2}$ increases as a function of the MAF, leveling-off around $MAF = 0.05.$ Averaging posterior probabilities from two window-splits leads to higher mean- $R^{2},$ especially for rarer SNPs. (C) Mean- $R^{2}$ in different chromosomes. Results are based on $ℓ = 200$ filtered haplotypes. Single window-split corresponds to median window-size of 58.2 kb, average of two window-splits is taken over results with median window-sizes of 58.2 and 78.9 kb. Imputation is marginally worse in shorter chromosomes. (D) Calibration of posterior probabilities from a single window-split corresponding to median window-size of 58.2 kb, and an average of two window-splits taken over results from median window-sizes of 58.2 and 78.9 kb. To evaluate calibration we split imputed genotypes into bins according to their posterior probability distribution. We plot the mean posterior probability in each bin (x-axis) against the percentage of correctly predicted genotypes in each bin (y-axis). Averaging across window-splits leads to well calibrated posterior probabilities (most points lie close to the diagonal), while imputation probabilities based on a single window-split are over-confident (points lie below the diagonal).