Skip to main content
. 2025 Jan 2;16:203. doi: 10.1038/s41467-024-55525-y

Fig. 4. Illustration of error decomposition and the approach of Electron Configuration models with Stacked Generalization (ECSG) to limited data problems.

Fig. 4

a Decomposition of error between expected risk and empirical risk. (b, c) show how ECSG solves limited data problems by augmenting data and restricting hypothesis space using domain knowledge. Triangles represent starting points; the blue stars (h^) denote the optimal hypothesis; the green four-pointed stars (h*) and the orange squares (hI) represent the hypotheses that minimize expected risk and empirical risk, respectively, within hypothesis space H. The area enclosed by the dotted line (H~) and hI~ correspond to the hypothesis space and the resulting hypothesis after incorporating diverse knowledge sources. Eapp refers to the error between the optimal hypothesis in H and the global hypothesis, while Eest represents the error between hI or hI~ and h*.