Table 2.
Dependent Variable Sample used | Upper-bound for the regression based models | Random Forests |
---|---|---|
Total Reading Time | ||
All | .083 | .082 |
Test | .075 | .080 |
Skipping | ||
All | .029 | .027 |
Test | .022 | .026 |
Comprehension Score | ||
All | .236 | .174 |
Test | .171 | .173 |
The R2 of the full regression model with all predictors is reported as the upper bound for Dominance Analysis and Multimodel Inference. The R2 values for the All sample are based on the models trained on the entire dataset, and are calculated based either on the models’ fitted values to the entire dataset (for full regression model), or on the model's prediction to the out-of-bag samples (for Random Forests). The R2 values for the Test sample are based on the models trained on the 70% random training subsamples and are calculated using the models’ prediction to the remaining 30% testing subsamples, averaged over 100 runs.