Simulation results from 1000 simulated case-control samples taken from a population with a disease rate of approximately 4.5%, and independent genetic and environmental variables, under the logistic model with gene–environment interaction. The results for G ∼ (0.6) and X ∼ (20, 1) is displayed on the left whereas the results for G ~ N(0, 1) and X ~
(20, 1) is on the right. Each replicate contains N1 = 1000 cases and N0 = 1000 controls, and is analyzed through two approaches, (1) “Logistic” is ordinary logistic regression, and (2) “Semi” is our semiparametric efficient estimator. Here, we list the sample mean (“mean”), the sample standard error (“se”), the mean estimated standard error (“est se”) and the coverage for the nominal 95% confidence intervals (“95%”) for both methods. In addition, we computed the mean squared error efficiency of the “Semi” method compared to the “Logistic” approach.