Skip to main content
. Author manuscript; available in PMC: 2020 Apr 3.
Published in final edited form as: J Am Stat Assoc. 2019 Apr 3;114(527):1038–1049. doi: 10.1080/01621459.2018.1529594

Table C3:

Candidate regressions included in super learner analysis of the RTS,S/AS01 data for estimation of iterated means. GLM = logistic regression; GAM = generalized additive model; GBM = gradient boosted machine. A variable selection was included in some candidate regression algorithms (third column). In some cases, variables were selected a-priori; in others, variables were selected based on their correlation with the outcome. Gradient boosted machine tuning parameters selected based on out-of-bag (OOB) error rate.

Algorithm Tuning parameters Covariates
GLM main terms all
GLM intercept only none
GLM main terms treatment
GLM main terms study site, treatment
GLM two-way interactions study site, treatment
Stepwise GLM two-way interactions five highest correlations
Forward stepwise GLM main terms all
GAM main terms, df=3 five highest correlations
GAM main terms, df=3 ten highest correlations
Random Forest mtry=5,ntree=1000 all
GBM OOB error rate all