2023 Jul 25;14:4473. doi: 10.1038/s41467-023-40069-4

Table 2.

Results of the OLS and IV regressions explaining (residualized and standardized) educational attainment

| | OLS (UKB) | OLS (23andMe) | OLS (Meta-analysis) | ORIV (2-sample) | ORIV (Split-sample) | PGI-RC (default) | PGI-RC (GREML unc.) |
|---|---|---|---|---|---|---|---|
| Between-family results | | | | | | | |
| Polygenic index | 0.258*** (0.005) | 0.218*** (0.005) | 0.276*** (0.005) | 0.337*** (0.007) | 0.323*** (0.007) | 0.394*** (0.007) | 0.394*** (0.024) |
| First-stage estimate | | | | 0.498*** (0.005) | 0.489*** (0.005) | | |
| First-stage F-statistic | | | | 11,919 | 11,061 | | |
| Incremental R² | 6.7% | 4.7% | 7.6% | | | | |
| Family fixed effects | NO | NO | NO | NO | NO | NO | NO |
| N | 35,282 | 35,282 | 35,282 | 35,282 | 35,282 | 35,282 | 35,282 |
| Within-family results | | | | | | | |
| Polygenic index | 0.124*** (0.009) | 0.115*** (0.009) | 0.142*** (0.009) | 0.184*** (0.012) | 0.170*** (0.013) | | |
| First-stage estimate | | | | 0.460*** (0.006) | 0.436*** (0.006) | | |
| First-stage F-statistic | | | | 6,068 | 5,158 | | |
| Incremental R² | 1.5% | 1.3% | 2.0% | | | | |
| Family fixed effects | YES | YES | YES | YES | YES | | |
| N | 35,282 | 35,282 | 35,282 | 35,282 | 35,282 | | |

Notes: *p value < 0.10; **p value < 0.05; ***p value < 0.01 of a two-sided t test (OLS, PGI-RC) or two-sided z-test (ORIV), without adjustments for multiple comparisons. In all regressions the dependent variable is residualized educational attainment (EA, standardized to have mean 0 and standard deviation 1), where the residuals are obtained from a regression of EA on sex, year of birth, month of birth, sex interacted with year of birth, and the first 40 principal components of the genetic relationship matrix. Standard errors are robust and clustered at the family level, and in the case of ORIV also at the individual level. OLS (UKB) refers to the model with the PGI constructed using the UKB non-sibling sample (i.e., excluding all siblings and their relatives). OLS (23andMe) refers to the model with the PGI constructed using the 23andMe summary statistics. OLS (Meta-analysis) uses a PGI constructed from a meta-analysis of GWAS summary statistics of the UKB non-sibling sample and the 23andMe sample. ORIV (2-sample) refers to a two-stage least squares (2SLS) estimation using the PGIs from the UKB non-sibling sample and 23andMe as instrumental variables for each other. ORIV (Split-sample) refers to a 2SLS estimation where the summary statistics derive from a random split of the UKB sample. PGI-RC refers to the PGI repository correction, where (default) refers to the conventional application of the method, whereas (GREML unc.) refers to the case where we incorporate the uncertainty in the estimation of the SNP-based heritability.
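
To illustrate the idea of using two PGIs as instrumental variables for each other, the following is a minimal sketch on simulated data, not the authors' implementation. It assumes a stacked 2SLS setup with stack-specific intercepts and two noisy proxies of the same latent index. The function names (`two_stage_least_squares`, `oriv`), the simulated variables, and the noise levels are all hypothetical, and the sketch omits the residualization, PGI standardization, family fixed effects, and clustered standard errors described in the notes.

```python
import numpy as np


def two_stage_least_squares(y, X, Z):
    """Point estimates of 2SLS: project X on Z, then regress y on the projection."""
    X_hat = Z @ np.linalg.lstsq(Z, X, rcond=None)[0]
    return np.linalg.lstsq(X_hat, y, rcond=None)[0]


def oriv(y, pgi_a, pgi_b):
    """Stacked IV sketch: each PGI serves as the instrument for the other one."""
    n = len(y)
    ones, zeros = np.ones(n), np.zeros(n)
    y_stacked = np.concatenate([y, y])
    # Regressors: the stacked PGIs plus one intercept per stack.
    X = np.column_stack([
        np.concatenate([pgi_a, pgi_b]),
        np.concatenate([ones, zeros]),
        np.concatenate([zeros, ones]),
    ])
    # Instruments: pgi_b instruments pgi_a in stack 1, pgi_a instruments pgi_b in stack 2.
    Z = np.column_stack([
        np.concatenate([pgi_b, zeros]),
        np.concatenate([zeros, pgi_a]),
        np.concatenate([ones, zeros]),
        np.concatenate([zeros, ones]),
    ])
    return two_stage_least_squares(y_stacked, X, Z)[0]


# Simulated data (assumption): two PGIs measure the same latent index with
# independent measurement error of equal variance.
rng = np.random.default_rng(0)
n = 50_000
true_beta = 0.4
latent = rng.standard_normal(n)
y = true_beta * latent + rng.standard_normal(n)
pgi_a = latent + rng.standard_normal(n)
pgi_b = latent + rng.standard_normal(n)

ols_beta = np.polyfit(pgi_a, y, 1)[0]  # slope of y on a single noisy PGI
oriv_beta = oriv(y, pgi_a, pgi_b)
print(f"OLS: {ols_beta:.3f}, ORIV: {oriv_beta:.3f}, true: {true_beta}")
```

In this simulation the OLS slope is attenuated by measurement error in the single PGI, while the stacked IV estimate should land close to the simulated coefficient. This mirrors the qualitative pattern in the table, where the ORIV coefficients exceed the OLS coefficients.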