Entropy (Basel). 2021 Apr 29;23(5):551. doi: 10.3390/e23050551

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Algorithm 3 Modified Bellman EM algorithm (MBEM)

  k ← 0; initialize θ_k.
  F(x,z;θ_{k−1}) ← p_0(x,z;ν_k)
  V(x,z;θ_{k−1}) ← r̄(x,z;π_k)
  while θ_k or J(θ_k) has not converged do
    Calculate p(x′,z′|x,z;θ_k) by Equation (20).
    //— E step —//
    F_0(x,z) ← F(x,z;θ_{k−1})
    V_0(x,z) ← V(x,z;θ_{k−1})
    L ← 0
    repeat
      F_{L+1}(x,z) ← A_{θ_k} F_L(x,z)
      V_{L+1}(x,z) ← B_{θ_k} V_L(x,z)
      L ← L + 1
    until max{‖F_L − F_{L−1}‖_1, ‖V_L − V_{L−1}‖_∞} < ((1−γ)/γ) ε
    //— M step —//
    Update θ_k to θ_{k+1} by Equations (9)–(11).
    k ← k + 1
  end while
  return θ_k
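The E step of Algorithm 3 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation. The joint state (x,z) is flattened to a single index, `P[s][t]` stands for p(x′,z′|x,z;θ_k) from state `s` to state `t`, and the operators are assumed to take the forms A_θ F = p_0 + γ Pᵀ F and B_θ V = r̄ + γ P V, which are not spelled out in the algorithm box itself. The function names `forward_op`, `backward_op`, and `e_step`, and the `max_iter` safeguard, are hypothetical.

```python
def forward_op(F, P, p0, gamma):
    # Assumed form of A_theta: (A F)(s) = p0(s) + gamma * sum_t p(s | t) F(t)
    n = len(F)
    return [p0[s] + gamma * sum(P[t][s] * F[t] for t in range(n)) for s in range(n)]


def backward_op(V, P, r, gamma):
    # Assumed form of B_theta: (B V)(s) = r(s) + gamma * sum_t p(t | s) V(t)
    n = len(V)
    return [r[s] + gamma * sum(P[s][t] * V[t] for t in range(n)) for s in range(n)]


def e_step(P, p0, r, F_init, V_init, gamma, eps, max_iter=100000):
    """Iterate both operators until the stopping rule of Algorithm 3 holds.

    F_init and V_init play the role of F(.; theta_{k-1}) and V(.; theta_{k-1}),
    i.e. the warm start taken from the previous EM iteration.
    Returns (F, V, L), where L is the number of sweeps performed.
    """
    tol = (1.0 - gamma) / gamma * eps
    F_prev, V_prev = list(F_init), list(V_init)
    for L in range(1, max_iter + 1):
        F = forward_op(F_prev, P, p0, gamma)
        V = backward_op(V_prev, P, r, gamma)
        gap_F = sum(abs(a - b) for a, b in zip(F, F_prev))   # L1 norm
        gap_V = max(abs(a - b) for a, b in zip(V, V_prev))   # sup norm
        if max(gap_F, gap_V) < tol:
            return F, V, L
        F_prev, V_prev = F, V
    raise RuntimeError("E step did not converge within max_iter sweeps")


if __name__ == "__main__":
    # Toy two-state chain; all numbers are made up for illustration.
    P = [[0.9, 0.1], [0.2, 0.8]]
    F, V, L = e_step(P, p0=[1.0, 0.0], r=[0.0, 1.0],
                     F_init=[1.0, 0.0], V_init=[0.0, 1.0],
                     gamma=0.5, eps=1e-9)
    print(L, F, V)
```

Under these assumed operator forms both maps are γ-contractions (in the L1 norm for F and the sup norm for V), so stopping once the successive difference falls below ((1−γ)/γ) ε keeps the returned iterate within ε of the fixed point. Passing the previous iteration's F and V as `F_init` and `V_init` is what lets later EM iterations finish in fewer sweeps.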