Algorithm 1 BRF learning
1. T: the maximum number of decision trees to grow for the BRF
   D: the maximum depth to which trees are extended
   M: the number of classes
   Sn: training set, including positive (river and lake) and negative (land, mountain and building) samples with their labels and weights, {(x1, y1, w1), …, (xN, yN, wN)}; xi ∈ X, yi ∈ {1, …, M}
   Initialize the sample weights to wi = 1/N
2. For t = 1 to T do
     Select a subset s from the training set Sn.
     Grow an unpruned tree from the subset s, using the corresponding sample weights.
     For d = 1 to D do
       Each internal node randomly selects p variables and determines the best split function using only these variables.
       Loop: Taking the selected variables vp in turn, the split function f(vp) iteratively splits the training data into left (Il) and right (Ir) subsets using Equation (6).
         The threshold t of the split function f(vp) is chosen at random within a given range.
         Compute the information gain ΔG of the split function f(vp).
         If ΔG is maximal, then determine f(vp) as the best split function for node d;
         else goto Loop.
     End For
     Store the class probability distribution P(C | lt) at the leaf nodes.
     Output: a weak decision tree.
     Estimate the class labels of the training data with the trained decision tree.
     Calculate the error εt of the decision tree.
     Compute the weight αt of the t-th decision tree.
     If αt > 0, then
       update the weights of the training samples;
     else
       reject the decision tree.
   End For
3. Final output: a BRF consisting of N decision trees (N ≤ T)
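
The inner loop of Algorithm 1 (for d = 1 to D: draw p variables at random, draw a threshold at random, and keep the split with maximal information gain ΔG) can be illustrated with the short sketch below. This is a minimal illustration rather than the paper's implementation: the Gini-based gain, the helper names `gini` and `best_random_split`, and the uniform threshold range are assumptions, since the listing defers the split criterion to Equation (6) and does not state the threshold range.

```python
import numpy as np

def gini(labels, weights):
    """Weighted Gini impurity of a label set (assumed impurity measure)."""
    total = weights.sum()
    if total == 0:
        return 0.0
    probs = np.array([weights[labels == c].sum() for c in np.unique(labels)]) / total
    return 1.0 - np.sum(probs ** 2)

def best_random_split(X, y, w, p, rng):
    """Among p randomly chosen variables, each with one randomly drawn threshold,
    return the (variable, threshold) pair with maximal information gain."""
    parent = gini(y, w)
    best_v, best_t, best_gain = None, None, -np.inf
    for v in rng.choice(X.shape[1], size=p, replace=False):    # p random variables
        t = rng.uniform(X[:, v].min(), X[:, v].max())          # random threshold
        left = X[:, v] <= t                                    # I_l / I_r partition
        right = ~left
        # Information gain: parent impurity minus weighted child impurity.
        child = (w[left].sum() * gini(y[left], w[left]) +
                 w[right].sum() * gini(y[right], w[right])) / w.sum()
        gain = parent - child
        if gain > best_gain:                                   # keep the maximal ΔG
            best_v, best_t, best_gain = v, t, gain
    return best_v, best_t, best_gain
```

The listing phrases this as a loop that exits once ΔG is maximal and otherwise jumps back; the sketch evaluates all p candidates and keeps the best one, which amounts to the same comparison.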
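
The boosting wrapper of step 2 (subset selection, weak-tree training, error εt, tree weight αt, weight update or rejection) might look like the following sketch. It is a sketch under stated assumptions, not the paper's method: scikit-learn's DecisionTreeClassifier with splitter="random" and max_features=p stands in for the weak tree, and εt, αt, and the weight update follow the standard multiclass AdaBoost (SAMME) formulas because the listing omits the exact expressions; the name train_brf and the subset_ratio parameter are likewise hypothetical.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def train_brf(X, y, T=100, D=10, p="sqrt", subset_ratio=0.7, seed=0):
    rng = np.random.default_rng(seed)
    N = len(y)
    M = len(np.unique(y))                  # number of classes
    w = np.full(N, 1.0 / N)                # initialize sample weights to 1/N
    trees, alphas = [], []

    for _ in range(T):
        # Select a subset s of Sn, sampling according to the current weights.
        idx = rng.choice(N, size=int(subset_ratio * N), replace=True, p=w / w.sum())

        # Grow an unpruned tree of depth <= D; each node considers p random
        # variables with random thresholds (stand-in for the inner loop over d).
        tree = DecisionTreeClassifier(max_depth=D, max_features=p, splitter="random",
                                      random_state=int(rng.integers(1 << 31)))
        tree.fit(X[idx], y[idx])

        # Estimate the class labels of the training data with the weak tree.
        miss = tree.predict(X) != y

        # Weighted error eps_t and tree weight alpha_t (SAMME form, assumed);
        # alpha_t > 0 only if the tree beats random guessing among M classes.
        eps = float(np.dot(w, miss)) / w.sum()
        if eps >= 1.0 - 1.0 / M:
            continue                       # alpha_t <= 0: reject the decision tree
        alpha = np.log((1.0 - eps) / max(eps, 1e-12)) + np.log(M - 1)

        # Update and renormalize the training sample weights.
        w *= np.exp(alpha * miss)
        w /= w.sum()

        trees.append(tree)
        alphas.append(alpha)

    return trees, alphas                   # the BRF: N accepted trees, N <= T
```

At prediction time the ensemble label would typically be the class receiving the largest αt-weighted vote over the accepted trees; the listing specifies only training, so this voting rule is likewise an assumption.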