Skip to main content
. 2021 Jun 3;12:3307. doi: 10.1038/s41467-021-23165-1

Fig. 8. Machine learning-based kinase activity predictions.

Fig. 8

a Comparison of single-dose inhibition assay (at 1 µM) against multi-dose Kd assay activities across 475 compound-target pairs (395 from Round 2 and 81 from the post-Challenge experiments). The red points indicate false negatives and blue points false positives when using the cut-offs of pKd = 6 and inhibition = 80% among the 394 Round 2 pairs (including 75 pairs with inhibition >80% but that showed no activity in the dose-response assays, i.e, pKd = 5). The green points indicate the new 81 pairs profiled post-Challenge solely based on the ensemble model predictions, regardless of their inhibition levels. The black trace is the expected %inhibition rate based on measured pKd’s, estimated using the maximum ligand concentration of 1 µM both for the single-dose and dose-response assays (see Methods). bd Multi-dose (left) and single-dose (right) assays for kinases tested with TPKI-30, GSK1379763, and PFE-PKIS14. Green points indicate the new experimental validations based on the ensemble model predictions, whereas black points come from Round 2 data. Blue points indicate false positive predictions based either on predictive models or single-dose testing. e Predictive accuracy of the top-performing ensemble model (average predicted pKd), top-performing Q.E.D model and single-dose assay (at 1 µM), when classifying subsets of the 475 pairs into the true activity classes with measured pKd less or higher than 6. The y-axis indicates the area under the receiver operating characteristic (ROC) curve (AU-ROC) as a function of the single-dose inhibition% levels, x-axis the pairs with inhibition >x%, and the dashed black curve the percentage of all pairs that passed that single-dose activity threshold. The combined model trace corresponds to the average of measured and expected inhibition values, where the latter was calculated based on the mean ensemble of the top-performing model pKd predictions (Q.E.D, DMIS_DK and AI Winter is Coming). See Supplementary Fig. 16 for the corresponding analysis with precision-recall (PR) metric, and Supplementary Fig. 17 for the ROC and PR curves for all the 475 pairs. Source data are provided as a Source Data file54.