View full-text article in PMC Entropy (Basel). 2023 Jan 30;25(2):251. doi: 10.3390/e25020251 Search in PMC Search in PubMed View in NLM Catalog Add to search Copyright and License information © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). PMC Copyright notice Algorithm 3 Stochastic Rollout Algorithm 1:Input:(x0′,x0)∈Sk′×Sk 2:fork=0toN−1do 3: for each (uk′,uk)∈Uk′(xk′)×Uk(xk′,xk) do 4: repeat 5: wk∼pWk, mk∼pMk 6: xk+1′←vk(xk′,uk′,wk),xk+1←fk(xk,uk,mk) 7: for i=k+1 to N−1 do 8: {μ^i′(xi′),μ^i(xi′,xi)}←Generate_base_policies(xi′,xi) 9: wi∼pWi, mi∼pMi 10: xi+1′←vi(xi′,μ^i′(xi′),wi),xi+1←fi(xi,μ^i(xi′,xi),mi) 11: end for 12: Store: hk(xk,uk,mk)+J˜k+1(xk+1′,xk+1) 13: until a selected criterion is met 14: Q˜k(xk′,xk,uk′,uk)←EwkEmkhk(xk,uk,mk)+J˜k+1(xk+1′,xk+1) 15: end for 16: J^k(xk′,xk)←maxuk′∈Uk′(xk′)maxuk∈Uk(xk′,xk)Q˜k(xk′,xk,uk′,uk) 17: μ¯k(xk′,xk)←uk*,μ¯k′(xk′)←uk′* 18: wk∼pWk, mk∼pMk 19: xk+1′←vk(xk′,μ¯k′(xk′),wk),xk+1←fk(xk,μ¯k(xk′,xk),mk) 20:end for