Skip to main content
. 2021 Jan 28;11:2518. doi: 10.1038/s41598-021-82000-1

Table 1.

Simulation results for genetic effect estimation of our method and the gold standard.

MAF Individual level data frequency (gold standard) Proposed method full dataset Proposed method subset dataset (20%) Proposed method within ± 20% of true value
β^ SE(β^) β^ SE(β^) β^ SE(β^) β^ SE(β^)
Continuous Y1 and continuous Y2
2% 0.710 0.146 0.710 0.146 0.709 0.145 0.709 0.145
5% 0.460 0.092 0.460 0.092 0.461 0.092 0.460 0.092
10% 0.332 0.065 0.332 0.065 0.331 0.065 0.332 0.065
25% 0.231 0.046 0.231 0.046 0.23 0.046 0.231 0.046
Continuous Y1 and binary Y2
2% 0.709 0.143 0.707 0.143 0.708 0.143 0.707 0.143
5% 0.460 0.091 0.455 0.091 0.457 0.09 0.455 0.090
10% 0.333 0.064 0.329 0.064 0.329 0.064 0.328 0.064
25% 0.228 0.046 0.222 0.046 0.223 0.046 0.221 0.046
Continuous Y2 and binary Y1
2% 0.597 1.072 0.561 1.101 0.571 1.144 0.541 1.057
5% 0.752 0.288 0.722 0.284 0.727 0.309 0.695 0.277
10% 0.809 0.203 0.781 0.200 0.782 0.215 0.750 0.192
25% 0.862 0.150 0.843 0.149 0.843 0.155 0.811 0.143
Binary Y1 and binary Y2
2% 0.867 0.364 0.786 0.340 0.784 0.342 0.784 0.340
5% 0.886 0.234 0.808 0.219 0.806 0.222 0.806 0.220
10% 0.842 0.172 0.782 0.162 0.784 0.165 0.780 0.162
25% 0.764 0.133 0.744 0.130 0.748 0.132 0.742 0.130

Number in the table represent averages over all simulation replicates.

MAF: minor allele frequency. Individual level data analysis is the gold standard for estimation. “Full dataset” means the relationship between the outcome and the covariate is estimated in the full sample of individuals, but the effect is estimated using our approximate approach. “Subset dataset” means the relationship between the traits is estimated by randomly selecting 200 individuals, or 20% of the total sample size. “Proposed method within ± 20% of true value” means the relationship between the outcome, and the covariate is a random estimate falling with 20% the true covariance between the traits. The latter scenario reflects what might happen when using estimates from published reports.