| Algorithm 2 Generation algorithm for the safe decision controller |
|
Input: Output: Optimal safety policy 1: Init () 2: 3: 4: while do 5: // Iterative selection of optimal value 6: 7: // Select action 8: 9: while do // monitor 10: 11: if then 12: 13: else 14: 15: break 16: end if 17: end while 18: // Update parameters 19: Go to the next state 20: end while |