Table C3:
Candidate regressions included in the super learner analysis of the RTS,S/AS01 data for estimation of the iterated means. GLM = generalized linear model (here, logistic regression); GAM = generalized additive model; GBM = gradient boosted machine. Some candidate regression algorithms included a variable selection step (third column): in some cases variables were selected a priori; in others they were selected based on their correlation with the outcome. Gradient boosted machine tuning parameters were selected based on the out-of-bag (OOB) error rate.
Algorithm | Tuning parameters | Covariates |
---|---|---|
GLM | main terms | all |
GLM | intercept only | none |
GLM | main terms | treatment |
GLM | main terms | study site, treatment |
GLM | two-way interactions | study site, treatment |
Stepwise GLM | two-way interactions | five highest correlations |
Forward stepwise GLM | main terms | all |
GAM | main terms, df=3 | five highest correlations |
GAM | main terms, df=3 | ten highest correlations |
Random forest | mtry=5, ntree=1000 | all |
GBM | selected by OOB error rate | all |
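
The correlation-based screening used for the "five highest correlations" and "ten highest correlations" rows can be illustrated with a minimal sketch. It assumes that covariates are ranked by the absolute value of their Pearson correlation with the outcome; the table does not state the correlation measure, so this is an assumption, and the function names and toy data below are hypothetical and not taken from the analysis code.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # A constant covariate carries no information; score it as 0.
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def screen_by_correlation(columns, outcome, k):
    """Return the names of the k covariates with the largest |correlation| with the outcome.

    `columns` maps a covariate name to its list of values (hypothetical layout).
    Ties keep the insertion order of `columns` because `sorted` is stable.
    """
    scores = {name: abs(pearson(values, outcome)) for name, values in columns.items()}
    ranked = sorted(scores, key=lambda name: -scores[name])
    return ranked[:k]


# Toy data, for illustration only (not from the RTS,S/AS01 trial).
outcome = [0, 0, 1, 1, 1, 0]
columns = {
    "age": [1, 2, 3, 4, 5, 6],
    "site": [0, 0, 1, 1, 1, 0],
    "const": [1, 1, 1, 1, 1, 1],
    "noise": [1, 0, 1, 0, 1, 0],
}

selected = screen_by_correlation(columns, outcome, k=2)
print(selected)  # ['site', 'noise']
```

In a super learner, a screening step of this kind would be applied within each cross-validation training fold, so that the selection does not use the held-out outcomes; the selected covariates are then passed to the candidate regression (for example, a GAM with main terms).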