Skip to main content
. Author manuscript; available in PMC: 2024 Jun 4.
Published in final edited form as: Nat Biomed Eng. 2023 May 1;7(6):811–829. doi: 10.1038/s41551-023-01034-0

Extended Data Fig. 10 |. Ensembling improves XGBoost attributions more than explicit regularization.

Extended Data Fig. 10 |

Using the synthetic datasets with real AML gene expression features, we compare the increase in AUFDC seen with explicit regularization, such as per-tree column dropout and L1 regularization, with ensembling. For the synthetic datasets with AML features and the AND true function, we see that ensembles improve AUFDC significantly more than column dropout (a, two-sided Mann−Whitney U-test, U=2.83, P=4.7×103) and L1 regularization (c, U=3.00, P=2.7×103). For the synthetic datasets with AML features and the multiplicative true function, we see that ensembles improve AUFDC significantly more than column dropout (b, U=4.04, 5.25 × 10−5) and L1 regularization (d, U=4.87, P=1.12×106).