Entropy (Basel). 2021 Apr 29;23(5):551. doi: 10.3390/e23050551

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Algorithm 1 EM algorithm for DEC-POMDP

$k \leftarrow 0$; initialize $\theta_k$.
$T_{\max} \leftarrow \left\lceil \log\bigl((1-\gamma)\varepsilon\bigr) / \log\gamma - 1 \right\rceil$
while $\theta_k$ or $J(\theta_k)$ has not converged do
  Calculate $p(x', z' \mid x, z; \theta_k)$ by Equation (20).
  // E step
  $\alpha_0(x, z; \theta_k) \leftarrow p_0(x, z; \nu_k)$
  $\beta_0(x, z; \theta_k) \leftarrow \bar{r}(x, z; \pi_k)$
  for $t = 1, 2, \ldots, T_{\max}$ do
    $\alpha_t(x, z; \theta_k) \leftarrow \sum_{x', z'} p(x, z \mid x', z'; \theta_k)\, \alpha_{t-1}(x', z'; \theta_k)$
    $\beta_t(x, z; \theta_k) \leftarrow \sum_{x', z'} \beta_{t-1}(x', z'; \theta_k)\, p(x', z' \mid x, z; \theta_k)$
  end for
  $F(x, z; \theta_k) \leftarrow \sum_{t=0}^{T_{\max}} \gamma^t \alpha_t(x, z; \theta_k)$
  $V(x, z; \theta_k) \leftarrow \sum_{t=0}^{T_{\max}} \gamma^t \beta_t(x, z; \theta_k)$
  // M step
  Update $\theta_k$ to $\theta_{k+1}$ by Equations (9)–(11).
  $k \leftarrow k + 1$
end while
return $\theta_k$
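The E step of Algorithm 1 can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions: each pair (x, z) is flattened into a single integer index `s`, and the transition probability from Equation (20) is supplied as a precomputed row-stochastic table `P[s][s_next]`. The names `t_max`, `e_step`, `P`, `p0`, and `r` are hypothetical, and the sketch reads the horizon formula as ⌈log((1−γ)ε)/log γ − 1⌉. The M step (Equations (9)–(11)) is not covered.

```python
import math


def t_max(gamma, eps):
    """Truncation horizon, assuming T_max = ceil(log((1 - gamma) * eps) / log(gamma) - 1)."""
    return max(0, math.ceil(math.log((1.0 - gamma) * eps) / math.log(gamma) - 1.0))


def e_step(P, p0, r, gamma, T):
    """Forward/backward recursions of the E step over flattened states s = (x, z).

    P[s][s_next] : assumed precomputed p(s_next | s; theta_k)
    p0[s]        : initial distribution p0(x, z; nu_k)
    r[s]         : expected reward r_bar(x, z; pi_k)
    Returns (F, V), the discounted sums of alpha_t and beta_t for t = 0..T.
    """
    n = len(p0)
    alpha = list(p0)   # alpha_0
    beta = list(r)     # beta_0
    F = list(alpha)    # t = 0 term of F
    V = list(beta)     # t = 0 term of V
    discount = 1.0
    for _ in range(T):
        # alpha_t(s) = sum_{s'} p(s | s') * alpha_{t-1}(s')
        alpha = [sum(P[sp][s] * alpha[sp] for sp in range(n)) for s in range(n)]
        # beta_t(s) = sum_{s'} beta_{t-1}(s') * p(s' | s)
        beta = [sum(P[s][sp] * beta[sp] for sp in range(n)) for s in range(n)]
        discount *= gamma
        for s in range(n):
            F[s] += discount * alpha[s]
            V[s] += discount * beta[s]
    return F, V
```

A useful sanity check on any such implementation is that the two recursions give the same truncated objective: summing `p0[s] * V[s]` over states should equal summing `F[s] * r[s]`, since both expand to the same discounted sum of expected rewards.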