2023 Feb 3;13(7):4623–4640. doi: 10.1039/d2ra07007c

Predicted active and inactive docked poses for the active compounds of the testing set, based on GA/SHAP-selected features.

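Predicted number of active and inactive docked poses (XGBoost and RF models)

| Compound^a | XGBoost active poses | XGBoost inactive poses | XGBoost % active poses^b | RF active poses | RF inactive poses | RF % active poses^b |
|---|---|---|---|---|---|---|
| 10 | 105 | 14 | 88.2 | 98 | 21 | 82.4 |
| 22 | 113 | 21 | 84.3 | 99 | 35 | 73.9 |
| 34 | 86 | 22 | 79.6 | 88 | 20 | 81.5 |
| 49 | 76 | 20 | 79.2 | 73 | 23 | 76.0 |
| 54 | 60 | 29 | 67.4 | 57 | 32 | 64.0 |
| 66 | 76 | 20 | 79.2 | 75 | 21 | 78.1 |
| 72 | 98 | 23 | 81.0 | 87 | 34 | 71.9 |
| 78 | 77 | 2 | 97.5 | 65 | 14 | 82.3 |
| 107 | 54 | 17 | 76.1 | 58 | 13 | 81.7 |
| 113 | 57 | 6 | 90.5 | 56 | 7 | 88.9 |
| 115 | 90 | 2 | 97.9 | 73 | 19 | 79.3 |
| 116 | 67 | 22 | 75.3 | 65 | 24 | 73.0 |
| 126 | 97 | 2 | 98.0 | 80 | 19 | 80.8 |
| 127 | 66 | 1 | 98.5 | 55 | 12 | 82.1 |
| 133 | 51 | 2 | 96.2 | 41 | 12 | 77.4 |
| 135 | 65 | 0 | 100 | 56 | 9 | 86.2 |
| 146 | 82 | 5 | 94.3 | 73 | 14 | 83.9 |
| 148 | 54 | 3 | 94.7 | 45 | 12 | 78.9 |
| 156 | 90 | 7 | 92.8 | 85 | 12 | 87.6 |
| 162 | 62 | 5 | 92.5 | 59 | 8 | 88.1 |
| 250^c | 10 | 32 | 23.8 | 24 | 18 | 57.1 |
| 312 | 0 | 1 | 0 | 1 | 0 | 100 |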
^a Compounds' numbers and bioactivities are as in Table S1.

^b Determined by dividing the number of poses labeled as “active” by the total number of poses (labeled as “active” and “inactive”).

^c The percent active poses of this compound (IC50 = 5000 nM) was used as the threshold to classify screened compounds into potential active and inactive STAT3 inhibitors in both the GA-RF and GA-XGBoost models.
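
The calculation in footnote b and the thresholding in footnote c can be illustrated with a minimal Python sketch. The function names `percent_active` and `is_potential_active` are hypothetical and do not come from the paper. The pose counts for compound 250 are copied from the table. The choice to treat a compound at or above the threshold as a potential active is an assumption, since the footnote does not state whether the comparison is inclusive.

```python
def percent_active(active_poses: int, inactive_poses: int) -> float:
    """Percentage of docked poses labeled "active" (footnote b)."""
    total = active_poses + inactive_poses
    if total == 0:
        raise ValueError("compound has no docked poses")
    return 100.0 * active_poses / total


# (active, inactive) pose counts of threshold compound 250, from the table.
THRESHOLD_COUNTS = {"XGBoost": (10, 32), "RF": (24, 18)}


def is_potential_active(active_poses: int, inactive_poses: int, model: str) -> bool:
    """Compare a compound's % active poses with that of compound 250.

    Assumption: a value equal to or above the threshold counts as
    "potential active"; the source does not specify inclusivity.
    """
    threshold = percent_active(*THRESHOLD_COUNTS[model])
    return percent_active(active_poses, inactive_poses) >= threshold


# Compound 10 under the XGBoost model: 105 active and 14 inactive poses.
print(round(percent_active(105, 14), 1))        # 88.2
print(is_potential_active(105, 14, "XGBoost"))  # True
```

Under this sketch the thresholds are about 23.8% for XGBoost and 57.1% for RF, matching the table row for compound 250.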