Algorithm 2. Action selection
Input: Q-values Q, exploration rate ε
Output: A patient with maximum Q-value
rn ← random(0, 1)
if rn < ε then
    action ← random
else
    action ← maxQvalue
end if
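
The selection rule in Algorithm 2 reads as a standard epsilon-greedy policy, and a minimal Python sketch of it might look like the following. The function name `select_action`, the `q_values` dictionary (patient identifier mapped to Q-value for the current state), and the `epsilon` and `rng` parameters are assumptions made for illustration; they are not taken from the source.

```python
import random


def select_action(q_values, epsilon, rng=random):
    """Epsilon-greedy selection over candidate patients.

    q_values: dict mapping a patient identifier to its Q-value
              (hypothetical representation, assumed for this sketch).
    epsilon:  exploration probability in [0, 1].
    rng:      source of randomness; the random module or a random.Random.
    """
    if not q_values:
        raise ValueError("q_values must contain at least one patient")

    rn = rng.random()  # uniform draw in [0.0, 1.0)
    if rn < epsilon:
        # Explore: pick any patient uniformly at random.
        return rng.choice(list(q_values))
    # Exploit: pick the patient with the largest Q-value.
    return max(q_values, key=q_values.get)


if __name__ == "__main__":
    q = {"patient_a": 0.2, "patient_b": 0.9, "patient_c": 0.5}
    print(select_action(q, epsilon=0.1))
```

With `epsilon=0` the sketch always exploits, and with `epsilon=1` it always explores, which makes the two branches easy to check in isolation.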