Skip to main content
. 2018 Jul 28;21(1):69–85. doi: 10.1093/biostatistics/kxy032

Fig. 1.

Fig. 1.

This is a diagram of a binary regression tree. On the vertical axis, we have the tier, Inline graphic, i.e., in each tier, we may have nodes Inline graphic. This tree has branch decisions (diamonds) at nodes Inline graphic and terminal leaf nodes (circles) at Inline graphic with corresponding output values of Inline graphic. At branches Inline graphic, the splitting variable is Inline graphic and the cutpoint is Inline graphic, i.e., if Inline graphic, then go to node Inline graphic, otherwise, Inline graphic. Note that nodes Inline graphic (in gray) do not appear since node Inline graphic is a leaf. This tree is denoted by Inline graphic and a corresponding regression function Inline graphic where Inline graphic is a vector of covariates, Inline graphic. Inline graphic represent the branch decision rules of the form Inline graphic and is composed of ordered triples, Inline graphic: Inline graphic for the node, Inline graphic for covariate Inline graphic and Inline graphic for the cutpoint Inline graphic. So, here we have Inline graphic. Inline graphic represents leaves and is composed of ordered pairs, Inline graphic: Inline graphic for the node and Inline graphic for the outcome value. So, Inline graphic. The regression function, Inline graphic, climbs the tree. For example, suppose Inline graphic and Inline graphic, then Inline graphic.