Skip to main content
. 2020 Mar 27;8(2):77. doi: 10.3390/healthcare8020077
Algorithm 2. Action selection
 Input: Stg, Pgt
 Output: A patient with maximum Q-value
1: rn random ( ):
2:  if rn ϵ then
3:   action random (Pgt)
4:  else
5:   action maxQvalue (Stg,Pgt)
6:  end if