Skip to main content
. Author manuscript; available in PMC: 2020 Apr 3.
Published in final edited form as: J Am Stat Assoc. 2019 Apr 3;114(527):1038–1049. doi: 10.1080/01621459.2018.1529594

Table C2:

Candidate regressions included in super learner analysis of the RTS,S/AS01 data for estimation of censoring distribution. GLM = logistic regression; GAM = generalized additive model; GBM = gradient boosted machine. A variable selection was included in some candidate regression algorithms (third column). In some cases, variables were selected a-priori; in others, variables were selected based on their correlation with the outcome.

Algorithm Tuning parameters Covariates
GLM intercept only none
GLM main terms all
GLM main terms five highest correlations
GLM main terms study site, treatment, time
GLM all two-way interactions study site, treatment, time
GAM main terms, df=3 five highest correlations
GAM main terms five highest correlations
GAM main terms study site, treatment, time
GAM all two-way interactions study site, treatment, time
Random forest mtry=5,ntree=1000 All
GBM int.depth=2, ntree=1000 All