Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2021 May 20;28(5):435–451. doi: 10.1089/cmb.2020.0445

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© Brooks Paige, et al., 2021. Published by Mary Ann Liebert, Inc.

This Open Access article is distributed under the terms of the Creative Commons License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.

PMC Copyright notice

FIG. 4. — Accuracy at reconstruction of genomes x₀ using EM estimation and a noisy estimate $\hat{K}$ , as compared with a natural baseline that always predicts the most common variant at each SNP locus. We use this as a baseline, because without any additional information about $β_{M}$ and $β_{M + 1}$ , the most accurate prediction of the dog's genotype would be to predict the most common variant at each locus. Here, we define accuracy as the proportion of SNPs that are correctly identified in the dog that was found in the second GWAS study, but not the first. Each distribution is constructed from 500 experimental test points, in which we (1) took 10 random splits of the full dog dataset, assigning dogs to either the public or private dataset; (2) for each split, we tested the reconstruction 50 times, each time adding a different randomly sampled dog to the second GWAS study. The private dataset always has 1000 individuals; the public test dataset is of increasing size, improving performance. EM, expectation–maximization.