Skip to main content
. 2022 Mar 31;7(14):12138–12146. doi: 10.1021/acsomega.2c00424

Figure 2.

Figure 2

Overall statistics of the datasets used in the study. (A,B) A general overview of multiple interactions in the datasets is depicted for drugs and targets. (C) Estimation of the sequence thresholds for drug SMILES and protein sequences. The length of SMILE strings and protein sequences does not exceed a value of about 150 and 2000 for any dataset. (D) Number of data points and an estimate of their activities for each of the three interaction groups.