Skip to main content
. 2020 Jun 10;120(16):8066–8129. doi: 10.1021/acs.chemrev.0c00004

Figure 40.

Figure 40

Summary plot of SHAP feature importance for a GBDT model, trained using pore properties descriptors (POV: pore occupiable volume, Di: diameter of the largest included sphere, Df: diameter of the largest free sphere, Dif: diameter of the largest included sphere along the free path) to predict N2 uptake from the CO2/N2 mixture data from Boyd et al.13 Note that we chose the N2 uptake as one expects that the pore geometry is more important than the chemistry, which simplifies the example. The violins in this plot show the distributions of the importance, i.e., the spread of the SHAP values (along the abscissa) and how many samples we have for different SHAP values (the thickness of the violin). The coloring encodes the value of the features, red meaning high feature values whereas blue represents low feature values (e.g., high vs low density). The SHAP value is shown on the abscissa and reflects how a particular feature (one feature per row) with a value represented by the color impacts the prediction. For example, a high density (red color in the second row) leads to lower predictions for N2 uptake (indicated by negative SHAP values).