
Table 1.

Algorithm for reinforcement learning trees

  1. Draw $M$ bootstrap samples from $\mathcal{D}$.

  2. For the $m$-th bootstrap sample, where $m \in \{1, \ldots, M\}$, fit one RLT model $\hat{f}_m$ using the following rules:

    a) At an internal node $A$, fit an embedded model $\hat{f}_A$ to the training data in $A$, restricted to the set of variables $\{1, 2, \ldots, p\} \setminus \mathcal{P}_A^d$, i.e., $\mathcal{P} \setminus \mathcal{P}_A^d$, where $\mathcal{P}_A^d$ is the set of muted variables at the current node $A$. Details are given in Section 2.4.

    b) Using $\hat{f}_A$, calculate the variable importance measure $\widehat{\text{VI}}_A(j)$ for each variable $X^{(j)}$, where $j \in \mathcal{P} \setminus \mathcal{P}_A^d$. Details are given in Section 2.5.

    c) Split node $A$ into two daughter nodes using the variable(s) with the highest variable importance measure (Section 2.7).

    d) Update the set of muted variables $\mathcal{P}_A^d$ for the two daughter nodes by adding the variables with the lowest variable importance measures at the current node. Details are given in Section 2.6.

    e) Apply a)–d) to each daughter node until the node sample size is smaller than a pre-specified value $n_{\min}$.

  3. Average the $M$ trees to obtain the final model $\hat{f} = M^{-1} \sum_{m=1}^{M} \hat{f}_m$. For classification, $\hat{f} = I\big(0.5 < M^{-1} \sum_{m=1}^{M} \hat{f}_m\big)$.
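To make the recursion in Table 1 concrete, here is a minimal Python sketch of steps 1–3 under simplifying assumptions: the embedded model $\hat{f}_A$ is stood in for by a small random forest whose impurity-based importances play the role of $\widehat{\text{VI}}_A(j)$, the split is a plain median cut rather than the rule of Section 2.7, and one variable is muted per split. The names `fit_rlt_tree`, `rlt_forest`, `n_min`, and `mute_count` are hypothetical; this illustrates the control flow only, not the authors' implementation.

```python
# Sketch of Table 1 for regression; all parameter names are hypothetical.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def fit_rlt_tree(X, y, active, n_min=10, mute_count=1):
    """Grow one RLT tree; `active` is the non-muted set P \\ P_A^d."""
    if len(y) < n_min or not active:
        return {"leaf": True, "pred": float(y.mean())}

    # a)-b): fit an embedded model on the non-muted variables and read off
    # its importance scores as stand-ins for VI_A(j).
    emb = RandomForestRegressor(n_estimators=50, random_state=0)
    emb.fit(X[:, active], y)
    vi = emb.feature_importances_

    # c): split on the most important active variable (median cut here).
    j = active[int(np.argmax(vi))]
    cut = float(np.median(X[:, j]))
    left = X[:, j] <= cut
    if left.all() or not left.any():      # degenerate split -> make a leaf
        return {"leaf": True, "pred": float(y.mean())}

    # d): mute the lowest-importance variables for both daughter nodes.
    muted = {active[k] for k in np.argsort(vi)[:mute_count]}
    child_active = [a for a in active if a not in muted]

    # e): recurse on each daughter until fewer than n_min samples remain.
    return {"leaf": False, "var": j, "cut": cut,
            "L": fit_rlt_tree(X[left], y[left], child_active, n_min, mute_count),
            "R": fit_rlt_tree(X[~left], y[~left], child_active, n_min, mute_count)}

def predict_tree(node, x):
    while not node["leaf"]:
        node = node["L"] if x[node["var"]] <= node["cut"] else node["R"]
    return node["pred"]

def rlt_forest(X, y, M=50, **kw):
    """Steps 1 and 3: fit M trees on bootstrap samples, then average."""
    rng = np.random.default_rng(0)
    trees = []
    for _ in range(M):
        idx = rng.integers(0, len(y), size=len(y))   # one bootstrap sample
        trees.append(fit_rlt_tree(X[idx], y[idx], list(range(X.shape[1])), **kw))
    return lambda x: float(np.mean([predict_tree(t, x) for t in trees]))
```

A fitted ensemble is used as `f = rlt_forest(X_train, y_train); f(x_new)`. For classification, the same ensemble average would be thresholded at 0.5, as in step 3.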
