Skip to main content
American Journal of Human Genetics logoLink to American Journal of Human Genetics
. 2022 Feb 3;109(2):373. doi: 10.1016/j.ajhg.2022.01.007

Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort

Florian Privé , Hugues Aschard, Shai Carmi, Lasse Folkersen, Clive Hoggart, Paul F O’Reilly, Bjarni J Vilhjálmsson
PMCID: PMC8874215  PMID: 35120604

(The American Journal of Human Genetics 109, 12–23; January 6, 2022)

An unfortunate corruption of two equations on page 14 appeared in the version of this paper published on January 6. It has been corrected here and online. The publisher apologizes for this error.

New formula used in LDpred2

We also slightly modify the formula used in Privé et al.;32 we have previously used

seγˆj2=y˘γˆjG˘jTy˘γˆjG˘jnK1G˘jTG˘jy˘Ty˘nG˘jTG˘jvarynvarGj,

where γˆj is the marginal effect of variant j, and where y˘ and G˘j are the vectors of phenotypes and genotypes for variant j residualized from K covariates, e.g., centering them. The first approximation expects γˆj to be small, while the second approximation assumes the effects from covariates are small. However, we have found here that some variants can have very large effects, e.g., one variant explains about 30% of the variance in bilirubin log-concentration. Then, instead we compute

(y˘γˆjG˘j)T(y˘γˆjG˘j)=y˘Ty˘2γˆjG˘jTy˘+γˆj2G˘jTG˘j=y˘Ty˘γˆj2G˘jTG˘j,

which now gives

(nK1)se(γˆj)2=y˘Ty˘γˆj2G˘jTG˘jG˘jTG˘j=y˘Ty˘G˘jTG˘jγˆj2var(y˘)var(Gj)γˆj2,

finally giving (note the added term γˆj2)

sdGjsdy˘nseγˆj2+γˆj2. (Equation 1)

Figure S23 shows that the updated formula Equation 1 is better; we now use it in the code of LDpred2, and also recommend using it for the QC procedure proposed in Privé et al.32


Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics

RESOURCES