. 2022 Nov 16;23(6):bbac473. doi: 10.1093/bib/bbac473

© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Illustration of the kinship estimation. (A) The expected values of the kinship coefficient and the probability of zero-IBD sharing for relatives with varying degrees of relatedness. Each dot corresponds to a relationship. For each relatedness level, the expected kinship coefficient is shown on the y-axis and the expected zero-IBD sharing probability on the x-axis. (B) Reference population panels are used for computing the principal components and the population-specific centroid coordinates. The individuals in the query genotype matrix are first projected onto the reference panel components, and the projected coordinates are stored. The admixture rates are computed by comparing the population-specific centroids to the projected coordinates. The estimated admixture rates are used to compute individual-specific allele frequencies for each variant and each individual in the query genotype matrix. These individual-specific allele frequencies are used in the estimation of the correlation-based and distance-based kinship coefficients and the IBD-sharing probabilities. (C) Illustration of the decomposition and projection of a query individual. The pooled reference genotype matrix is decomposed by PCA and projected onto the top two components for the three reference populations. The centroid of each population is identified as the mean of the projected coordinates of the individuals in that population. The query individual is projected onto the two components, and the distances from its projection to the three centroids are used to estimate the admixture rates for this individual. Two components are used here for illustration purposes; the number of components that SIGFRIED uses can be changed by the user.
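
The steps in panels (B) and (C) can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not SIGFRIED's implementation: the function names are hypothetical, genotypes are assumed to be coded as 0/1/2 alternate-allele counts, and the caption does not say how the centroid distances are converted into admixture rates, so normalized inverse-distance weights are used here purely as a stand-in. The individual-specific allele frequency is taken to be the admixture-weighted mix of the per-population reference frequencies, which is also an assumption.

```python
import numpy as np

def fit_reference(ref_genotypes, ref_labels, n_components=2):
    """PCA on the pooled reference panel (rows: individuals, columns: variants)."""
    ref_genotypes = np.asarray(ref_genotypes, dtype=float)
    labels = np.asarray(ref_labels)
    mean = ref_genotypes.mean(axis=0)
    centered = ref_genotypes - mean
    # Rows of vt are the principal axes in variant space.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    projected = centered @ components.T
    populations = sorted(set(ref_labels))
    # Centroid = mean projected coordinate of each reference population.
    centroids = np.vstack([projected[labels == p].mean(axis=0) for p in populations])
    # Per-population allele frequency = mean genotype / 2 (diploid 0/1/2 coding).
    freqs = np.vstack([ref_genotypes[labels == p].mean(axis=0) / 2.0 for p in populations])
    return mean, components, populations, centroids, freqs

def admixture_rates(query_genotypes, mean, components, centroids, eps=1e-9):
    """Project query individuals and weight populations by inverse centroid distance.

    ASSUMPTION: inverse-distance weighting is a placeholder for whatever
    distance-to-rate mapping the actual method uses.
    """
    projected = (np.asarray(query_genotypes, dtype=float) - mean) @ components.T
    # dists[i, k] = Euclidean distance from query i to centroid k.
    dists = np.linalg.norm(projected[:, None, :] - centroids[None, :, :], axis=2)
    weights = 1.0 / (dists + eps)
    return weights / weights.sum(axis=1, keepdims=True)

def individual_allele_frequencies(rates, freqs):
    """Mix per-population frequencies with each individual's admixture rates."""
    return rates @ freqs

# Toy data (entirely synthetic): three reference populations, 200 variants.
rng = np.random.default_rng(0)
pop_freqs = {"A": 0.1, "B": 0.5, "C": 0.9}
ref_labels = [p for p in pop_freqs for _ in range(30)]
ref = np.vstack([rng.binomial(2, pop_freqs[p], size=(30, 200)) for p in pop_freqs])
query = rng.binomial(2, 0.1, size=(5, 200))  # query individuals drawn like population "A"

mean, components, populations, centroids, freqs = fit_reference(ref, ref_labels)
rates = admixture_rates(query, mean, components, centroids)
ind_freqs = individual_allele_frequencies(rates, freqs)
```

Here `rates` has one row per query individual and one column per reference population, and `ind_freqs` has one row per query individual and one column per variant. The `n_components` argument mirrors the caption's note that the number of components is user-configurable.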
