View full-text article in PMC Entropy (Basel). 2022 Apr 2;24(4):501. doi: 10.3390/e24040501 Search in PMC Search in PubMed View in NLM Catalog Add to search Copyright and License information © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). PMC Copyright notice Algorithm 1 DIMCO training procedure. Initialize network parameters θ repeat Sample a batch {(xn,yn)} Compute logits {li,jn}=f(xn;θ) via (1) Compute probs {pi,jn}=softmax({li,jn}) via (2) Update θ to minimize loss L({pi,jn},{yn}) via (10) until converged