|
Algorithm 1  The R-MOTiFS Algorithm
  Load dataset and preprocess
  Initialize SCALAR, BUDGET   // scaling factor & number of MCTS simulations (hyperparameters)
|
|
function R-MOTiFS (featuresList)
    create rootNode
    maxReward, bestFeatureSubset ← UCTSEARCH (rootNode)
    if maxReward is greater than optimalReward then
        optimalReward ← maxReward
        optimalFeatureSubset ← bestFeatureSubset
        R-MOTiFS (bestFeatureSubset)
    else
        return (optimalReward, optimalFeatureSubset)
|
function UCTSEARCH (rootNode)
    Initialize maxReward, bestFeatureSubset
    while within computational budget do
        frontNode ← TREEPOLICY (rootNode)
        reward, featureSubset ← DEFAULTPOLICY (frontNode.state)
        BACKUP (frontNode, reward)
        if reward is greater than maxReward then
            maxReward ← reward
            bestFeatureSubset ← featureSubset
    return (maxReward, bestFeatureSubset)
|
function TREEPOLICY (node)
    while node is non-terminal do
        if node not fully expanded then
            return EXPAND (node)
        else
            node ← BESTCHILD (node, SCALAR)
    return node
|
|
function EXPAND (node)
    choose a ∈ untried actions from A(node.state)
    add a newChild with f(node.state, a)
    return newChild
|
|
function BESTCHILD (node, c)
    // [Reconstructed — formula lost in extraction; standard UCT child selection,
    //  consistent with node.reward / node.visits maintained by BACKUP and the
    //  exploration constant SCALAR passed from TREEPOLICY.]
    return argmax over children v' of node of
        [ v'.reward / v'.visits  +  c · sqrt( 2 · ln(node.visits) / v'.visits ) ]
|
function DEFAULTPOLICY (state)
    while state is non-terminal do
        choose a ∈ A(state) uniformly at random
        state ← f(state, a)
    traverse state.path
        if a_i is equal to f_{i+1} then
            featureSubset ← INCLUDE (f_{i+1})
    reward ← REWARD (featureSubset)
    return (reward, featureSubset)
|
function BACKUP (node, reward)
    while node is not null do
        node.visits ← node.visits + 1
        if reward > node.reward then
            node.reward ← reward
        node ← node.parent
    return
|