Abstract
The constitutive androstane receptor (CAR) plays pivotal roles in drug-induced liver injury through the transcriptional regulation of drug-metabolizing enzymes and transporters; identifying regulatory factors for CAR activation is therefore important for understanding its mechanisms. Most previous studies on CAR activation and its toxicity have relied on in vivo or in vitro analyses, which are expensive, time consuming, and require many animals. Here, we developed a computational model that predicts CAR agonists using the Toxicology in the 21st Century (Tox21) 10k library. In addition, we evaluated the prediction performance of a novel deep learning (DL)-based quantitative structure-activity relationship approach called DeepSnap-DL, in which omnidirectional snapshots portraying the three-dimensional (3D) structures of chemical compounds are generated as input. The CAR prediction model built with the DeepSnap-DL approach, using chemical structures generated and optimized by the 3D structure generator CORINA, outperformed existing methods based on molecular descriptors. These results indicate that preparing suitable 3D chemical structures as input data is important for achieving high performance with the DeepSnap-DL approach and may enable the identification of CAR modulators.
Keywords: chemical structure, constitutive androstane receptor (CAR), DeepSnap, Tox21, deep learning, QSAR
1. Introduction
The Toxicology in the 21st Century (Tox21) project is a multi-agency collaborative consortium of the National Institutes of Health, the US Environmental Protection Agency, the National Toxicology Program, the National Center for Advancing Translational Sciences, and the Food and Drug Administration, which focuses on developing and evaluating novel, efficient methods for toxicity assessment and mechanistic insight while reducing time, costs, and the use of animals [1,2]. To characterize agonist and antagonist effects on toxicity pathways using quantitative high-throughput screening based on in vitro human cell-based assays [2,3,4], the program has profiled a library of more than 10,000 chemical compounds, including commercial chemicals, pesticides, food additives, and drugs, referred to as the Tox21 10K library. The fundamental concept of the Tox21 program is the adverse outcome pathway (AOP), a structured, linear representation of biological events. An AOP links a series of causally connected key events (KEs) between a molecular initiating event (MIE), triggered by environmental pollutants, drugs, food additives, and pesticides, and an adverse outcome (AO), including organ toxicity, such as hepatotoxicity, and toxicological endpoints, such as mutagenicity, carcinogenicity, cytotoxicity, immunotoxicity, and neurotoxicity; these linkages are called key event relationships (KERs) [5,6,7,8,9,10,11,12,13,14,15,16]. With the MIEs of toxicity development in mind, a broad range of important genes in signaling pathways, covering nuclear receptors, stress response, genotoxicity, and cell death, have been selected in the Tox21 project [17,18,19,20]. Among them, the constitutive androstane receptor (CAR; nuclear receptor subfamily 1, group I, member 3, NR1I3) is a member of the nuclear receptor superfamily that is expressed mainly in the liver. The CAR is a ligand-activated transcription factor that regulates the transcription of cellular responses and is involved not only in the metabolism and transport of numerous xenobiotics and endogenous chemicals as part of coordinated detoxification systems, but also in energy metabolism, tumor propagation, and cancer therapy [4,21,22,23]. Despite this biological importance, the construction of in silico models to predict the CAR activity of chemical compounds is complicated by the receptor's constitutive activity under normal conditions, which makes it difficult to identify the structural characteristics important for ligand binding [24,25]. Furthermore, most toxicological assessments of novel chemical compounds with agonist or antagonist activities rely on animal models, which are expensive, time-consuming, and difficult to translate to humans. Thus, a potential alternative to traditional animal toxicity evaluations is required [26,27]. Although in silico approaches have long attracted interest as an alternative to animal testing, conventional machine learning (ML) approaches for building precise prediction models, such as random forest (RF), support vector machine (SVM), and neural networks (NNs), require feature selection to avoid the combinatorial optimization problem and find a suitable combination, as well as feature learning to reduce the overfitting caused by too many molecular descriptors; both demand considerable craftsmanship and are time-consuming [28,29,30].
Furthermore, a prediction model using two-dimensional (2D) descriptors may have difficulty recognizing differences arising from molecular chirality [31]. Thus, the automated classification of chemical compounds based on molecular descriptors is not straightforward, owing to the difficulty of feature extraction.
Recently, deep learning (DL), an ML algorithm based on convolutional neural networks (CNNs), has been shown to automatically capture the spatial and temporal dependencies in an image through the application of relevant filters; such networks are useful tools for building high-performance prediction models owing to their powerful feature discrimination capability [32,33,34,35]. Numerous studies comparing DL with conventional ML methods have reported better performance for DL, suggesting that DL can be used to build prediction models for endpoints such as chemical toxicity [36,37,38,39]. One important factor in the high performance of prediction models is access to large datasets; the DL method uses transfer learning, defined as a system to recognize and apply knowledge learned in a previous task, and its performance has been shown to improve in proportion to the number of input datasets when regularization techniques, such as dropout, early stopping, and weight decay, are applied [40,41]. Another factor is the quality of the data sources: if imprecise representations of chemical structures or inaccurate activity information are used to design a prediction model, a large chemical space may exhibit activity cliffs, that is, discontinuous regions in the structure-activity surface [38,40,42,43]. Thus, the performance of the DL method may depend on both the quantity and the quality of the input datasets. Moreover, no standard way of preparing suitable chemical structure input data for DL has been established. To address this, a novel technique for capturing molecular features from molecular structure input data has been reported [44]. In that study, three-dimensional (3D) optimized molecular structures, which can be rotated at any arbitrary angle on the x-, y-, and z-axes, were photographed as ball-and-stick models with different colors representing the corresponding atoms, so that as much structural information as possible is automatically input into the DL models; this approach is called DeepSnap [44]. When compared with state-of-the-art methods using RF and DL with 3D or extended-connectivity fingerprint descriptors, the DL prediction model using image input data produced by DeepSnap achieved a higher receiver operating characteristic (ROC)-area under the curve (AUC) value [44]. In addition, optimization of several DeepSnap parameters has been shown to involve optimal thresholds for obtaining the best performance of the prediction models [45]. However, previous studies of the DeepSnap-DL method have not fully characterized its prediction performance. Given the remarkably high performance of CNNs, with their promising robustness and self-learning capability [46,47,48], it could be hypothesized that the DeepSnap-DL technique enables prediction models with high performance and high throughput for chemical toxicity.
Therefore, this study aimed to assess the input datasets and parameters of the DeepSnap-DL method and to investigate how they affect the prediction performance for CAR activity. Herein, combinations of various cleaning rules, such as adjusting protonation states and coordinating washed species when washing a molecular operating environment (MOE) database, were investigated. Consequently, when chemical structures produced by CORINA were applied as input data in the DeepSnap-DL method, very high prediction performance was obtained and the calculation cost was reduced. These results indicate that conformational sampling is important for building high-performance prediction models with the DeepSnap-DL approach.
2. Results and Discussion
2.1. Contributions of Parameters for Prediction Performance in the DeepSnap-DL Approach
We previously investigated the ranges of parameter values in the DeepSnap-DL process that attain high performance of the prediction models for CAR activity [45]. In addition, to analyze the influence of different splits into Tra, Val, and Test datasets, we randomly divided the input data, a total of 7141 chemical compounds, into 32 kinds of ratios, from Tra:Val:Test = 1:1:1 to 28:28:1 (Table S1, Figure S1), and then built 32 prediction models using the DeepSnap-DL approach (Table S2). The average loss (Tra), loss (Val), Acc (Val), AUC, and BAC were 0.192 ± 0.160, 0.216 ± 0.013, 91.80 ± 0.56, 0.785 ± 0.030, and 0.803 ± 0.057, respectively (Table S2). The minimum loss (Tra) and loss (Val) and the maximum Acc (Val), AUC (Test), and BAC (Test) were obtained at Tra:Val:Test ratios of 20:20:1, 12:1:1, 8:1:1, 26:26:1, and 26:26:1, respectively (Table S2). Therefore, the Tra:Val:Test ratio of 26:26:1, which gave the maximum AUC and BAC, was selected for the next analysis. To investigate in detail how the angles used to capture Jmol-generated images in the DeepSnap approach contribute to the prediction performance, the angles on the x-, y-, and z-axes were optimized using a total of 7141 nonoverlapping chemical compounds, consisting of 768 active and 6373 inactive compounds, prepared by eliminating identical chemical structures registered under different annotation numbers (Table S3). The 3D chemical structures were curated and optimized to generate a single low-energy conformation using the CORINA classic software within the MOE modeling program. A total of 92 different angle increments on the x-, y-, and z-axes, from (360°, 360°, 360°) to (38°, 38°, 38°) (Table S4), which produce from one to one thousand pictures per 3D structure in the DeepSnap approach, were assessed for prediction performance using a Tra:Val:Test ratio of 26:26:1. The mean AUC and BAC values over the 92 angles were 0.839 ± 0.031 and 0.789 ± 0.030, respectively (Figure 1, Figure S2), and the maximum AUC and BAC values were 0.910 and 0.867, both at an angle of 176° (Figure 1, Figure S2). These results indicate that there is an appropriate angle that yields high prediction performance. Furthermore, to assess the contribution made by the depiction of the chemical structures to the prediction performance, the bond radius (BR) parameter in the DeepSnap process was examined.
Using the nonoverlapping chemical compounds in the DeepSnap approach, we examined a total of 101 BR values, ranging from 3 to 30 mÅ, for the Jmol-generated ball-and-stick structure models at an angle of 176°. The mean AUC and BAC values for the 101 BRs were 0.871 ± 0.021 and 0.813 ± 0.043, respectively (Figure S3a,b), and the maximum AUC and BAC values were 0.914 at a BR of 14.5 mÅ and 0.879 at a BR of 17.2 mÅ, respectively (Figure S3a,b). Moreover, to investigate the contributions of the DL hyperparameters to the prediction performance, a total of 68 batch sizes (BSs) from 1 to 600 and 250 learning rates (LRs) from 0.036 to 0.00002 were fine-tuned using the nonoverlapping chemical compounds. The mean AUC and BAC values were 0.882 ± 0.019 and 0.816 ± 0.054 over the 68 BSs (Figure S3a,b), and 0.885 ± 0.021 and 0.829 ± 0.027 over the 250 LRs (Figure S3c,d), respectively. The maximum AUC and BAC values were 0.918 at BS:108 and 0.867 at BS:11 (Figure S4a,b), and 0.925 at LR:0.00061 and 0.881 at LR:0.00062 (Figure S3c,d), respectively. In addition, to assess the contributions of the two DNNs, AlexNet and GoogLeNet, together with the DL parameters, the AUC and BAC were calculated across a range of BSs for each network. For AlexNet, the mean AUC and BAC values over the 29 BSs used were 0.710 ± 0.070 and 0.503 ± 0.210, respectively, and the maximum AUC and BAC values were 0.857 at BS:5 and 0.796 at BSs of 35 and 40, respectively (Figure S5a,b). It has been reported previously that GoogLeNet shows higher AUC and BAC values than AlexNet [49]. The mean AUC and BAC values over the 29 BSs used with GoogLeNet were 0.769 ± 0.038 and 0.737 ± 0.114, respectively, and the maximum AUC and BAC values were 0.886 and 0.819, both at BS:1 (Figure S5a,b). Therefore, GoogLeNet was selected for the next analysis.
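The batch-size and learning-rate scans described above were run as parameter sweeps over repeated training runs. The sketch below shows the general pattern as a small grid for compactness; train_and_evaluate is a hypothetical placeholder for one DeepSnap-DL training run in DIGITS followed by test-set evaluation (here it returns random numbers so that the snippet runs), not part of the actual pipeline.

```python
# Sketch of a batch-size/learning-rate sweep; train_and_evaluate is a hypothetical
# placeholder for one DeepSnap-DL training run followed by test-set evaluation.
import random

def train_and_evaluate(batch_size, learning_rate):
    # In practice this would train GoogLeNet in DIGITS on the DeepSnap images
    # and return the AUC and BAC on the external test dataset.
    return random.random(), random.random()  # placeholder values so the sketch runs

batch_sizes = [1, 11, 108, 600]                      # a few of the 68 batch sizes examined
learning_rates = [0.00002, 0.00061, 0.0008, 0.036]   # a few of the 250 learning rates

results = []
for bs in batch_sizes:
    for lr in learning_rates:
        auc, bac = train_and_evaluate(bs, lr)
        results.append({"batch_size": bs, "learning_rate": lr, "AUC": auc, "BAC": bac})

best = max(results, key=lambda r: r["AUC"])
print("Best setting by AUC:", best)
```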
2.2. Contributions of Conformational Sampling of Chemical Compounds for Prediction Performance in the DeepSnap-DL Approach
Since numerous compounds exist in tautomeric forms in solution [50,51], it is difficult to describe a suitable molecular structure as a single variant of the structure. Thus, to investigate the contribution of conformational sampling of the 3D chemical structures to the prediction performance, combinations of various cleaning rules, adjusting the protonation states (none, dominant, neutralize) and coordinating washed species (depict 2D, rebuild 3D, CORINA) in the MOE database wash treatment, were used to produce the 3D structures. The AUC for each of nine LRs from 0.0001 to 0.001 under ten kinds of cleaning rules, combining the protonation states and coordinated washed species, was calculated using GoogLeNet (Figure 2a, Figure S6). The mean AUC values over the nine LRs were 0.945 ± 0.009 (none_2D), 0.901 ± 0.033 (domi_2D), 0.941 ± 0.009 (neut_2D), 0.965 ± 0.004 (none_3D), 0.984 ± 0.001 (domi_3D), 0.934 ± 0.014 (neut_3D), 0.992 ± 0.002 (none_CORINA), 0.986 ± 0.009 (domi_CORINA), 0.995 ± 0.005 (neut_CORINA), and 0.906 ± 0.026 (neut_3D+neut_CORINA) (Figure 2b). Next, to investigate the effect of dataset splits on the prediction performance under the neut_CORINA wash condition, the dataset was split into a total of 30 kinds of ratios of Tra, Val, and Test datasets, Tra:Val:Test = X:X:1 or X:1:1, where X denotes a variable integer (Table S5). The mean AUC and BAC values over the 30 splits were 0.999 ± 0.001 and 0.996 ± 0.003, respectively (Table S5). The mean AUC values were 0.999 ± 0.001 (X:X:1) and 0.998 ± 0.002 (X:1:1), and the mean BAC values were 0.997 ± 0.003 (X:X:1) and 0.995 ± 0.004 (X:1:1) (Table S5). Meanwhile, to assess the contribution of the angles in DeepSnap to the prediction performance, a total of 16 angles from (360°, 360°, 360°) to (60°, 60°, 60°), corresponding to from one to 216 pictures, were analyzed at three ratios, Tra:Val:Test = 8:8:1, 16:16:1, and 21:21:1, using the neut_CORINA wash condition (Table S6). The mean AUC and BAC values over the 16 angles in GoogLeNet were 0.9994 ± 0.0022 (8:8:1, AUC), 0.9996 ± 0.0012 (16:16:1, AUC), 0.9990 ± 0.0024 (21:21:1, AUC), 0.9965 ± 0.0092 (8:8:1, BAC), 0.9974 ± 0.0070 (16:16:1, BAC), and 0.9688 ± 0.1161 (21:21:1, BAC) (Table S6). Therefore, to investigate the contribution of conformational sampling to the prediction performance at angles of (280°, 280°, 280°) in DeepSnap, the ten combinations of different cleaning rules were analyzed at two ratios, Tra:Val:Test = 1:1:1 and 4:4:1 (Table 1, Table S7). Consistent with the results at angles of (176°, 176°, 176°), the neut_CORINA wash condition exhibited the highest performance: AUC of 0.998 ± 0.002 and 0.999 ± 0.001, BAC of 0.991 ± 0.004 and 0.993 ± 0.005, Acc of 0.991 ± 0.006 and 0.993 ± 0.005, F of 0.958 ± 0.024 and 0.969 ± 0.021, and MCC of 0.954 ± 0.026 and 0.966 ± 0.023 at Tra:Val:Test = 1:1:1 and 4:4:1, respectively (Table 1, Table S7). Next, to study the contribution of the angles in DeepSnap-DL to the prediction performance across split ratios, three angles, (176°, 176°, 176°), (280°, 280°, 280°), and (360°, 360°, 360°), were examined at eight ratios from Tra:Val:Test = 1:1:1 to 8:8:1 using the neut_CORINA wash condition (Table 2, Figure S7a–g). Prediction model quality and reliability depend on various conditions based on the similarity between the training and test datasets, including prediction space coverage and the applicability domain [52,53,54,55]. Thus, we randomly shuffled and split the datasets into N groups, then took one group as the held-out test dataset to evaluate model performance and used the remaining groups as the Tra and Val datasets. Moreover, as a control for nonspecific CAR activity scoring, datasets with randomly permuted activity labels were used (Table 2, 280°PT in Figure S7). The performance at angles of (176°, 176°, 176°) and (280°, 280°, 280°) was consistently high regardless of the Tra:Val:Test ratio (Figure S7a–g). The performance at angles of (360°, 360°, 360°) was intermediate among 280°PT, 176°, and 280° and increased with the Tra:Val ratio (Table 2, Figure S7a–g). DL models have shown higher prediction performance but also higher calculation costs than traditional ML methods, such as RF and the SVM, owing to model structure complexity and the gradient descent algorithm [56]. Furthermore, to analyze how combinations of angles, that is, the number of images per molecule, in DeepSnap affect the prediction performance, a total of 14 combinations of pictures with various angles, comprising three different picture numbers, were examined using the optimized parameters (angle: 280°, MPS: 100, ZF: 100, AT: 23%, BR: 14.5 mÅ, BMD: 0.4 Å, BT: 0.8 Å, LR: 0.0008, BS: 108, GoogLeNet) at a ratio of Tra:Val:Test = 1:1:1 (Table 3). The highest performance was observed with the four pictures at (0°, 0°, 0°), (280°, 0°, 0°), (0°, 280°, 0°), and (0°, 0°, 280°): 0.991 ± 0.003 (BAC), 0.971 ± 0.010 (F), and 0.967 ± 0.012 (MCC) (Table 3, Table S8). These findings indicate the possibility of reducing the calculation cost while maintaining high prediction performance.
Table 1.
Train:Val:Test | Protonation | Coordinate | Protonation | Coordinate | AUC Average | AUC SD | Acc Average | Acc SD | MCC Average | MCC SD
---|---|---|---|---|---|---|---|---|---|---
1:1:1 | none | 2D | | | 0.930 | 0.006 | 0.967 | 0.007 | 0.821 | 0.035
1:1:1 | dominant | 2D | | | 0.904 | 0.011 | 0.926 | 0.048 | 0.668 | 0.131
1:1:1 | neutralize | 2D | | | 0.890 | 0.006 | 0.919 | 0.032 | 0.619 | 0.115
1:1:1 | none | 3D | | | 0.907 | 0.008 | 0.797 | 0.035 | 0.440 | 0.019
1:1:1 | dominant | 3D | | | 0.971 | 0.003 | 0.927 | 0.001 | 0.734 | 0.005
1:1:1 | neutralize | 3D | | | 0.924 | 0.007 | 0.969 | 0.003 | 0.827 | 0.017
1:1:1 | none | CORINA | | | 0.989 | 0.003 | 0.958 | 0.003 | 0.826 | 0.012
1:1:1 | dominant | CORINA | | | 0.996 | 0.002 | 0.982 | 0.005 | 0.914 | 0.021
1:1:1 | neutralize | CORINA | | | 0.998 | 0.002 | 0.991 | 0.006 | 0.954 | 0.026
1:1:1 | neutralize | 3D | neutralize | CORINA | 0.798 | 0.016 | 0.707 | 0.020 | 0.302 | 0.018
4:4:1 | none | 2D | | | 0.923 | 0.024 | 0.959 | 0.029 | 0.798 | 0.107
4:4:1 | dominant | 2D | | | 0.906 | 0.013 | 0.894 | 0.069 | 0.609 | 0.139
4:4:1 | neutralize | 2D | | | 0.898 | 0.019 | 0.903 | 0.059 | 0.621 | 0.125
4:4:1 | none | 3D | | | 0.911 | 0.009 | 0.801 | 0.043 | 0.458 | 0.033
4:4:1 | dominant | 3D | | | 0.972 | 0.003 | 0.928 | 0.012 | 0.739 | 0.030
4:4:1 | neutralize | 3D | | | 0.927 | 0.011 | 0.971 | 0.002 | 0.839 | 0.010
4:4:1 | none | CORINA | | | 0.990 | 0.003 | 0.957 | 0.009 | 0.821 | 0.029
4:4:1 | dominant | CORINA | | | 0.997 | 0.001 | 0.985 | 0.003 | 0.927 | 0.015
4:4:1 | neutralize | CORINA | | | 0.999 | 0.001 | 0.993 | 0.005 | 0.966 | 0.023
4:4:1 | neutralize | 3D | neutralize | CORINA | 0.802 | 0.014 | 0.684 | 0.043 | 0.311 | 0.021
Parameters (angle: 280°, MPS: 100, ZF: 100, AT: 23%, BR: 14.5 mÅ, BMD: 0.4 Å, BT: 0.8 Å, LR: 0.0008, BS: 108, GoogLeNet); n = 3 or 9 for 1:1:1 or 4:4:1, respectively. Maximum values for AUC, accuracy in the test dataset (Acc), and Matthews correlation coefficient (MCC) in each dataset are indicated in bold.
Table 2.
Metric | Train:Val:Test | N | 176° Average | 176° SD | 280° Average | 280° SD | 360° Average | 360° SD | 280°PT Average | 280°PT SD
---|---|---|---|---|---|---|---|---|---|---
AUC | 1:1:1 | 3 | 1.000 | 0.000 | 0.998 | 0.002 | 0.932 | 0.027 | 0.537 | 0.009
AUC | 2:2:1 | 5 | 0.999 | 0.001 | 0.998 | 0.001 | 0.964 | 0.005 | 0.522 | 0.013
AUC | 3:3:1 | 6 | 0.999 | 0.000 | 0.998 | 0.001 | 0.972 | 0.009 | 0.544 | 0.019
AUC | 4:4:1 | 9 | 0.998 | 0.003 | 0.999 | 0.001 | 0.979 | 0.005 | 0.545 | 0.027
AUC | 5:5:1 | 11 | 0.998 | 0.003 | 0.998 | 0.002 | 0.983 | 0.005 | 0.534 | 0.016
AUC | 6:6:1 | 13 | 0.999 | 0.001 | 0.998 | 0.002 | 0.983 | 0.008 | 0.529 | 0.022
AUC | 7:7:1 | 15 | 0.998 | 0.002 | 0.998 | 0.002 | 0.982 | 0.007 | 0.555 | 0.043
AUC | 8:8:1 | 17 | 0.999 | 0.003 | 0.998 | 0.003 | 0.983 | 0.009 | 0.552 | 0.044
Acc | 1:1:1 | 3 | 0.997 | 0.001 | 0.991 | 0.006 | 0.851 | 0.037 | 0.422 | 0.009
Acc | 2:2:1 | 5 | 0.995 | 0.002 | 0.993 | 0.005 | 0.898 | 0.005 | 0.554 | 0.013
Acc | 3:3:1 | 6 | 0.993 | 0.006 | 0.988 | 0.008 | 0.918 | 0.034 | 0.555 | 0.019
Acc | 4:4:1 | 9 | 0.995 | 0.003 | 0.993 | 0.005 | 0.925 | 0.020 | 0.449 | 0.027
Acc | 5:5:1 | 11 | 0.993 | 0.004 | 0.992 | 0.004 | 0.934 | 0.022 | 0.507 | 0.016
Acc | 6:6:1 | 13 | 0.995 | 0.002 | 0.993 | 0.007 | 0.942 | 0.022 | 0.498 | 0.022
Acc | 7:7:1 | 15 | 0.994 | 0.003 | 0.993 | 0.007 | 0.934 | 0.030 | 0.513 | 0.043
Acc | 8:8:1 | 17 | 0.996 | 0.003 | 0.992 | 0.009 | 0.931 | 0.049 | 0.527 | 0.044
MCC | 1:1:1 | 3 | 0.986 | 0.006 | 0.954 | 0.026 | 0.547 | 0.074 | 0.018 | 0.073
MCC | 2:2:1 | 5 | 0.977 | 0.012 | 0.966 | 0.022 | 0.647 | 0.016 | 0.018 | 0.047
MCC | 3:3:1 | 6 | 0.966 | 0.028 | 0.942 | 0.037 | 0.705 | 0.074 | 0.025 | 0.065
MCC | 4:4:1 | 9 | 0.976 | 0.015 | 0.966 | 0.023 | 0.723 | 0.049 | 0.078 | 0.022
MCC | 5:5:1 | 11 | 0.967 | 0.017 | 0.962 | 0.018 | 0.749 | 0.055 | 0.057 | 0.055
MCC | 6:6:1 | 13 | 0.976 | 0.012 | 0.970 | 0.028 | 0.768 | 0.060 | 0.062 | 0.049
MCC | 7:7:1 | 15 | 0.970 | 0.012 | 0.966 | 0.031 | 0.755 | 0.072 | 0.069 | 0.079
MCC | 8:8:1 | 17 | 0.978 | 0.016 | 0.961 | 0.041 | 0.749 | 0.103 | 0.060 | 0.092
Parameters (MPS: 100, ZF: 100, AT: 23%, BR: 14.5 mÅ, BMD: 0.4 Å, BT: 0.8 Å, LR: 0.0008, BS: 108, GoogLeNet). Protonation: neutralize; coordinate: CORINA. Train:Val:Test: ratio of training, validation, and test datasets; N: number of external test datasets; Average: means of accuracy in the test dataset (Acc), AUC, and Matthews correlation coefficient (MCC) over N; SD: corresponding standard deviations over N; 280°PT: permutation test with randomly permuted activity scores at the 280° angle.
Table 3.
No. of Pictures | Pic1 (x°, y°, z°) | Pic2 (x°, y°, z°) | Pic3 (x°, y°, z°) | Pic4 (x°, y°, z°) | AUC Average | AUC SD | Acc Average | Acc SD | MCC Average | MCC SD
---|---|---|---|---|---|---|---|---|---|---
4 | 0,0,0 | 280,0,0 | 0,280,0 | 0,0,280 | 0.999 | 0.000 | 0.994 | 0.002 | 0.967 | 0.012
4 | 280,280,280 | 0,280,280 | 280,0,280 | 280,280,0 | 0.998 | 0.002 | 0.988 | 0.004 | 0.941 | 0.021
4 | 0,0,0 | 0,280,280 | 280,0,280 | 280,280,0 | 0.998 | 0.001 | 0.990 | 0.003 | 0.952 | 0.014
4 | 0,0,0 | 280,0,0 | 280,0,280 | 280,280,0 | 0.997 | 0.003 | 0.988 | 0.006 | 0.943 | 0.027
4 | 0,0,0 | 280,0,0 | 0,280,0 | 280,280,0 | 0.996 | 0.002 | 0.991 | 0.004 | 0.953 | 0.018
3 | - | 280,0,0 | 0,280,0 | 0,0,280 | 0.995 | 0.004 | 0.984 | 0.006 | 0.921 | 0.027
3 | 0,0,0 | - | 0,280,0 | 0,0,280 | 0.998 | 0.001 | 0.987 | 0.005 | 0.935 | 0.024
3 | 0,0,0 | 280,0,0 | - | 0,0,280 | 0.998 | 0.001 | 0.988 | 0.008 | 0.943 | 0.037
3 | 0,0,0 | 280,0,0 | 0,280,0 | - | 0.995 | 0.002 | 0.984 | 0.007 | 0.921 | 0.032
2 | 0,0,0 | 280,0,0 | - | - | 0.995 | 0.002 | 0.976 | 0.012 | 0.890 | 0.048
2 | 0,0,0 | - | 0,280,0 | - | 0.993 | 0.002 | 0.970 | 0.015 | 0.864 | 0.055
2 | 0,0,0 | - | - | 0,0,280 | 0.996 | 0.000 | 0.978 | 0.009 | 0.896 | 0.034
2 | - | 280,0,0 | 0,280,0 | - | 0.982 | 0.008 | 0.960 | 0.006 | 0.817 | 0.010
2 | - | - | 0,280,0 | 0,0,280 | 0.998 | 0.001 | 0.986 | 0.002 | 0.931 | 0.010
Parameters (angle: 280°, MPS: 100, ZF: 100, AT: 23%, BR: 14.5 mÅ, BMD: 0.4 Å, BT: 0.8 Å, LR: 0.0008, BS: 108, GoogLeNet). Maximum values for AUC, accuracy in the test dataset (Acc), and Matthews correlation coefficient (MCC) are indicated in bold. Wash in MOE (protonation states: neutralize; coordinating washed species: CORINA).
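For reference, the hold-out scheme described earlier in this subsection (shuffling the compounds, forming N groups, holding one group out as the external test set, and using the remainder as Tra and Val data) can be sketched as follows; the compound table layout is hypothetical.

```python
# Sketch of the N-group hold-out scheme: shuffle, form N groups, hold one out as the
# external test set, and split the remainder into training and validation data.
# The compound table layout is hypothetical.
import numpy as np
import pandas as pd

def holdout_splits(compounds: pd.DataFrame, n_groups: int, seed: int = 0):
    shuffled = compounds.sample(frac=1.0, random_state=seed).reset_index(drop=True)
    groups = np.array_split(shuffled, n_groups)
    for i in range(n_groups):
        test = groups[i]                                  # held-out external test set
        rest = pd.concat(groups[:i] + groups[i + 1:])
        train = rest.iloc[: len(rest) // 2]               # remainder split evenly, e.g., Tra:Val = 1:1
        val = rest.iloc[len(rest) // 2:]
        yield train, val, test

# Example: Tra:Val:Test = 1:1:1 corresponds to three groups,
# for train, val, test in holdout_splits(compounds, n_groups=3): ...
```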
Generally, the construction of high-performance quantitative structure–activity relationship (QSAR) prediction models can be hindered when the endpoint is complex [57,58] or when structurally similar molecules exhibit large differences in potency, that is, activity cliffs, where small chemical modifications lead to profound effects on biological activity [59,60,61]. However, if a sufficient number of chemical compounds of adequate structural quality covering complex endpoints and slight structural differences is available, QSAR analysis could overcome this issue. Because activity cliffs represent the structural relationships between small chemical modifications and large changes in potency, they may serve as principal indicators for QSAR if these chemical changes or substitution sites are identified during compound optimization [38,62]. From this perspective, the DeepSnap-DL approach, which can extract appropriate features of a chemical structure from numerous input data for DL, may be considered a QSAR approach with high prediction performance, because DL can automatically transform low-level features into higher, more abstract levels as a feature extractor and learn from numerous datasets [47,63]. In this study, the datasets from the Tox21 10k library were imbalanced with respect to the CAR agonist activity scores (active: 10.8% and inactive: 89.2% of the nonoverlapping samples, Table S3). To correctly evaluate the performance of prediction models constructed from imbalanced datasets, we used the BAC, F value, and MCC as evaluation criteria. Regarding the coordinates used in 3D structure preparation, high performance was observed at two different data split ratios, 1:1:1 and 4:4:1, when CORINA was employed within the DeepSnap-DL approach, as demonstrated in Table 1. When these preparation procedures for the 3D chemical structures are applied, dataset imbalance and differences in chemical space coverage across dataset splits do not appear to adversely affect the robustness of the prediction model for CAR activity.
2.3. The Prediction Performance of the DeepSnap-DL Approach Compared with the Conventional ML
To evaluate the prediction performance of the DeepSnap approach against conventional ML, two ML approaches, RF and extreme gradient boosting (XGB), were used to build prediction models from the nonoverlapping chemical compound library. Using the molecular descriptor calculation software MORDRED, which is non-copyleft open-source software, a total of 836 descriptors were extracted (Table S9a,b) [64]. This software is highly flexible and can calculate descriptors over uncommon ranges without modifying the source code [64]; in addition, new molecular descriptors can be created using its descriptor arithmetic feature. A total of 7140 chemical compounds for the CAR agonist were split randomly into Tra and Test datasets at a 1:1 ratio. Based on the quantitative molecular descriptors extracted by MORDRED, the two classification and regression tools, RF and XGB, were applied to predict compound activity, and ten RF and ten XGB prediction models were built by optimizing parameters including max_depth, nEstimators, and max_features (Table 4). The highest mean AUC values of RF and XGB over five independent tests were 0.8842 ± 0.0052 (max_depth: 20, nEstimators: 1000, max_features: 120) and 0.8890 ± 0.0072 (max_depth: 3, nEstimators: 5000, max_features: 60), respectively (Table 4). These results demonstrate that the DeepSnap-DL method outperformed the traditional ML methods in constructing a prediction model for the CAR agonist, although RF and XGB still showed relatively high prediction ability. This suggests that adjustment of the input data is a critical step in building a prediction model. Moreover, the feature extraction and selection performed manually in conventional ML methods are carried out automatically, without human intervention, in the DeepSnap-DL method, so the prediction model can achieve both high throughput and high performance. On the other hand, since a large amount of supervised data is required for training when building a prediction model, it may be necessary to further reduce the calculation cost.
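As an illustration of this descriptor-based workflow, the sketch below calculates Mordred descriptors from SMILES and fits RF and XGB classifiers with settings loosely following Table 4. The file and column names are hypothetical, all SMILES are assumed to parse, and because XGBoost has no direct max_features parameter, colsample_bytree is used here as a rough analogue.

```python
# Sketch: Mordred descriptors + RF/XGB classifiers, loosely following the Table 4 settings.
# File name and column names are hypothetical; all SMILES are assumed to parse.
import pandas as pd
from rdkit import Chem
from mordred import Calculator, descriptors
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score
from xgboost import XGBClassifier

data = pd.read_csv("car_agonist_smiles.csv")              # hypothetical: SMILES + binary label
mols = [Chem.MolFromSmiles(s) for s in data["smiles"]]

calc = Calculator(descriptors, ignore_3D=True)            # 2D Mordred descriptors
X = calc.pandas(mols).select_dtypes("number").fillna(0)   # keep numeric columns only
y = data["label"]

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.5, stratify=y, random_state=0)

rf = RandomForestClassifier(max_depth=20, n_estimators=1000, max_features=120, random_state=0)
# XGBoost has no max_features; colsample_bytree is used as a rough analogue of 60 features.
xgb = XGBClassifier(max_depth=3, n_estimators=5000, colsample_bytree=60 / X.shape[1])

for name, model in [("RF", rf), ("XGB", xgb)]:
    model.fit(X_tr, y_tr)
    auc = roc_auc_score(y_te, model.predict_proba(X_te)[:, 1])
    print(f"{name} test AUC: {auc:.4f}")
```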
Table 4.
Model # | AUC Average | AUC SD | Max_Depth | nEstimators | Max_Features
---|---|---|---|---|---
XGB_1 | 0.8855 | 0.0071 | 3 | 100 | 29 |
XGB_2 | 0.8862 | 0.0095 | 3 | 500 | 29 |
XGB_3 | 0.8854 | 0.0073 | 3 | 1000 | 29 |
XGB_4 | 0.8885 | 0.0033 | 3 | 5000 | 29 |
XGB_5 | 0.8883 | 0.0040 | 30 | 1000 | 29 |
XGB_6 | 0.8872 | 0.0089 | 3 | 5000 | 40 |
XGB_7 | 0.8851 | 0.0026 | 3 | 5000 | 50 |
XGB_8 | 0.8890 | 0.0072 | 3 | 5000 | 60 |
XGB_9 | 0.8873 | 0.0069 | 3 | 5000 | 100 |
XGB_10 | 0.8835 | 0.0075 | 3 | 5000 | 120 |
RF_1 | 0.8069 | 0.0193 | 2 | 10 | 29 |
RF_2 | 0.8314 | 0.0287 | 2 | 100 | 29 |
RF_3 | 0.8416 | 0.0252 | 2 | 1000 | 29 |
RF_4 | 0.8803 | 0.0053 | 20 | 1000 | 29 |
RF_5 | 0.8781 | 0.0104 | 200 | 1000 | 29 |
RF_6 | 0.8780 | 0.0083 | 20 | 5000 | 29 |
RF_7 | 0.8702 | 0.0067 | 20 | 1000 | 5 |
RF_8 | 0.8813 | 0.0032 | 20 | 1000 | 80 |
RF_9 | 0.8842 | 0.0052 | 20 | 1000 | 120 |
RF_10 | 0.8807 | 0.0055 | 20 | 1000 | 250 |
Average: means of AUCs over 5 independent tests. SD: standard deviations of AUCs over 5 independent tests. The maximum AUC value for each model type is indicated in bold.
3. Materials and Methods
3.1. Data
As reported previously [45], a total of 9523 chemical structures and the corresponding CAR activity scores were downloaded in the simplified molecular input line entry system (SMILES) format from the PubChem database (AID 1224892), derived from the Tox21 10k library (Table S3). The library registers some identical chemical compounds under different ID numbers with different activity scores (below or above a score of 40). Therefore, chemical compounds with indefinite activity criteria and/or nonorganic compounds were eliminated, and a total of 7141 chemicals for the CAR were selected as nonoverlapping input data (Table S3). In the primary screening of the Tox21 program, the CAR activity scores ranged from 0% to 100% based on compound concentration-response analysis as follows: % Activity = ((Vcompound − Vdmso)/(Vpos − Vdmso)) × 100, where Vcompound, Vdmso, and Vpos denote the compound well value, the median value of the DMSO-only wells, and the median value of the positive control wells in the reporter gene assay, respectively; these were then corrected using compound-free control plates, that is, DMSO-only plates, measured at the beginning and end of the compound plate measurements [18,65]. Concentration-response titration points for each compound were fitted to a four-parameter Hill equation, yielding the concentration of half-maximal activity (AC50) and the maximal response (efficacy) [18,66]. The activity scores were grouped into the following three classes: (1) zero, (2) from 1 to 39, and (3) from 40 to 100, representing inactive, inconclusive, and active compounds, respectively. In this study, compounds with activity scores from 40 to 100 were defined as active and those from 0 to 39 as inactive (Table S3). For the permutation test for nonspecific CAR activity scoring, activity scores randomly assigned to the chemical compounds were used. The MOE 2018 application software program (MOLSIS Inc., Tokyo, Japan) was then used to replace the molecular geometry of each compound with lower-energy 3D coordinates with rotatable torsions, optimized by a two-step method: a cyclic embedder based on distance geometry followed by refinement (called rebuild 3D), in which the washed species are replaced by those generated by the cyclic 3D embedder. Alternatively, the external program CORINA classic (Molecular Networks GmbH, Nürnberg, Germany, https://www.mn-am.com/products/corina) was used to define a single stable conformation, and the 3D structures were finally saved in the SDF file format, as described previously [45,67,68]. If depict 2D is chosen, the coordinates of the washed species are replaced by the results of the 2D depiction layout algorithm. In addition, to investigate whether each structure is in a suitable form for the subsequent KEs triggered by MIEs, the protonation states were adjusted. If the protonation menu was set to neutralize, charged species were replaced such that (1) all atoms are neutral, (2) the species is neutral overall, and (3) the least charge-bearing form of the structure is used. If it was set to dominant, the molecule was replaced with the dominant protomer/tautomer at a specified pH, here pH 7.
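A minimal sketch of the activity-score handling described above, covering the % Activity formula and the binary labeling used in this study (scores of 40-100 treated as active, 0-39 as inactive), is given below; the file and column names are hypothetical.

```python
# Sketch of the activity-score handling described above.
# File and column names are hypothetical.
import pandas as pd

def percent_activity(v_compound: float, v_dmso: float, v_pos: float) -> float:
    # % Activity = ((Vcompound - Vdmso) / (Vpos - Vdmso)) x 100
    return (v_compound - v_dmso) / (v_pos - v_dmso) * 100

def label(score: float) -> str:
    # Tox21 classes: 0 = inactive, 1-39 = inconclusive, 40-100 = active;
    # in this study, 40-100 is treated as active and 0-39 as inactive.
    return "active" if score >= 40 else "inactive"

scores = pd.read_csv("AID_1224892_scores.csv")          # hypothetical export of AID 1224892
scores["label"] = scores["activity_score"].apply(label)
print(scores["label"].value_counts())
```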
The following ten combinations of protonation states (none, dominant, neutralize) and coordinating washed species (depict 2D, rebuild 3D, CORINA) in washing the MOE database were investigated: none_2D (none, depict 2D), domi_2D (dominant, depict 2D), neut_2D (neutralize, depict 2D), none_3D (none, rebuild 3D), domi_3D (dominant, rebuild 3D), neut_3D (neutralize, rebuild 3D), none_CORINA (none, CORINA), domi_CORINA (dominant, CORINA), neut_CORINA (neutralize, CORINA), and neut_3D + neut_CORINA (neutralize, rebuild 3D, followed by neutralize, CORINA).
3.2. DeepSnap
The 3D chemical structures were depicted as 3D ball-and-stick models, with different colors corresponding to different atoms, using Jmol, an open-source Java viewer for 3D structures [69,70,71]. Snapshots were captured automatically at selected angle increments on the x-, y-, and z-axes, saved as 256 × 256 pixel PNG files (RGB), and split into three types of datasets, that is, training (Tra), validation (Val), and test (Test) datasets, as depicted in Figure S8 [44,45]. During the DeepSnap depiction process, several parameters were set to produce molecular images suitable for classification in the next step, such as the image pixel size, image format (png or jpg), number of molecules per SDF file to split into (MPS), zoom factor (ZF, %), atom size for the van der Waals radius (AT, %), bond radius (BR, mÅ), minimum bond distance (MBD), and bond tolerance (BT).
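The number of snapshots per molecule follows from the chosen angle increment: reading the procedure above as rotations of 0°, inc, 2 × inc, ... (below 360°) about each axis, an increment of 360° gives a single image, 280° gives 8 images, 176° gives 27, and 60° gives 216, consistent with the picture counts reported in the Results. The sketch below reflects this reading rather than the published DeepSnap source code.

```python
# Sketch: enumerate DeepSnap camera angles for a given increment (degrees).
# This reflects our reading of the snapshot procedure (rotations of 0, inc, 2*inc, ...
# below 360 degrees about each of the x-, y-, and z-axes), not the DeepSnap source code.
from itertools import product

def axis_angles(increment: float):
    angle, angles = 0.0, []
    while angle < 360:
        angles.append(angle)
        angle += increment
    return angles

def snapshot_angles(increment: float):
    # All (x, y, z) rotation combinations for one molecule.
    return list(product(axis_angles(increment), repeat=3))

for inc in (360, 280, 176, 60):
    print(f"increment {inc}°: {len(snapshot_angles(inc))} snapshots per molecule")
# -> 1, 8, 27, and 216 snapshots, respectively
```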
3.3. ML Models
The following three ML models were used to construct prediction models for CAR activity: (1) DL, (2) RF, and (3) extreme gradient boosting (XGBoost, denoted here as XGB). For DL, all the 2D PNG image files produced by DeepSnap were resized to 256 × 256 pixels as input data using the NVIDIA DL GPU Training System (DIGITS) version 4.0.0 software (NVIDIA, Santa Clara, CA, USA), which provides data management, design and training of NNs on a four-GPU system (Tesla V100-PCIE, 31.7 GB), real-time monitoring of model performance, and selection of the best-performing model from the results [49]. To rapidly train and fine-tune highly accurate deep neural networks (DNNs) on the input Tra and Val datasets, and to build the prediction model by transfer learning based on image classification, pretrained open-source DL models in Caffe were used as open-source software on the CentOS Linux 7.3.1611 distribution. AlexNet is a CNN developed at the University of Toronto, as described in a previous study [72]. Its architecture comprises eight pretrained layers, including five convolutional/pooling layers that convert the feature volume and reduce its size; this is achieved by max pooling, which compresses the image by selecting the maximum value in each region as the representative value. Convolutional/pooling layer I converted the previous volume (224 × 224 × 3) to (11 × 11 × 3), and convolutional/pooling layers II, III, IV, and V converted the outputs of layers I, II, III, and IV to (5 × 5 × 48), (3 × 3 × 256), (3 × 3 × 192), and (3 × 3 × 192), respectively. The fully connected layers, with 4096 neurons each, make the final connections between feature values, and some activations are set to zero (dropout) to suppress overfitting (Figure S9) [72,73,74]. This study used a GoogLeNet model comprising 22 layers built from complex and deep CNN architectures called "Inception" modules, which concatenate filters of different sizes and dimensions into a single new filter and introduce sparsity and multiscale information in one block; the network comprises two convolutional layers, two pooling layers, and nine "Inception" modules, each containing six convolution layers and one pooling layer (Figure S10) [49]. The filters and weights of GoogLeNet were iteratively updated via error back-propagation, which converts the loss into gradients used to correct the last layer and propagate the correcting gradients backward [73,75]. For RF and XGB, molecular descriptors were calculated using the Python package Mordred (https://github.com/mordred-descriptor/mordred) [64]. Classification experiments were conducted in the Python programming language using the RF and XGB classifier implementations provided by the scikit-learn and rdkit Python packages [76,77,78,79].
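The transfer-learning step itself was run with Caffe models through DIGITS, as described above; purely as an illustration of the same idea (replacing the final classification layer of a pretrained GoogLeNet and fine-tuning it on the two-class snapshot images), the sketch below uses PyTorch/torchvision as a stand-in, with hypothetical directory names and an arbitrary epoch count.

```python
# Illustrative transfer-learning sketch using PyTorch/torchvision as a stand-in for the
# Caffe/DIGITS pipeline described in the text; paths and epoch count are hypothetical.
import torch
from torch import nn, optim
from torchvision import datasets, models, transforms

tfm = transforms.Compose([transforms.Resize((224, 224)), transforms.ToTensor()])
train_ds = datasets.ImageFolder("deepsnap_images/train", transform=tfm)  # hypothetical folders
train_dl = torch.utils.data.DataLoader(train_ds, batch_size=108, shuffle=True)

model = models.googlenet(weights=models.GoogLeNet_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 2)      # two classes: active / inactive

criterion = nn.CrossEntropyLoss()
optimizer = optim.SGD(model.parameters(), lr=0.0008, momentum=0.9)  # LR from the optimized settings

model.train()
for epoch in range(30):                            # arbitrary epoch count for illustration
    for images, labels in train_dl:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
```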
3.4. Evaluation of the Predictive Model
Using the external test dataset, the probabilities of the prediction results were computed by applying the designed DL prediction models. Since probabilities are calculated for each image captured from the different angles along the x-, y-, and z-axes for one molecule during the DeepSnap-DL process, the medians of these predicted values were used as the representative values for the target molecules, as described previously [80]. Based on the sensitivity, the true positive rate, that is, the fraction of all positive samples (true positives plus false negatives) identified as positive, and the specificity, the true negative rate, that is, the fraction of all negative samples (true negatives plus false positives) identified as negative, a confusion matrix of the predicted and experimentally defined labels was used to construct the ROC curve and calculate the AUC using JMP Pro 14 statistical discovery software (SAS Institute Inc., Cary, NC, USA), as reported previously [45]. Thus, where TP, FN, TN, and FP denote true positives, false negatives, true negatives, and false positives, respectively:
Sensitivity = ΣTPs/(ΣTPs + ΣFNs), (1)
Specificity = ΣTNs/(ΣTNs + ΣFPs). (2)
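As noted above, the per-image probabilities are reduced to one value per molecule by taking the median before thresholding; a minimal sketch of that aggregation, with hypothetical column names, is:

```python
# Sketch: aggregate per-image predicted probabilities to one value per molecule by
# taking the median, then apply a cut-off to obtain the predicted label.
# The prediction table layout is hypothetical.
import pandas as pd

preds = pd.read_csv("per_image_predictions.csv")        # columns: molecule_id, prob_active
per_molecule = preds.groupby("molecule_id")["prob_active"].median()

cutoff = 0.5                                             # in practice chosen from the ROC analysis
predicted_label = (per_molecule >= cutoff).astype(int)   # 1 = active, 0 = inactive
print(predicted_label.value_counts())
```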
Additionally, since the proportions of "active" and "inactive" compounds in the library's activity scores are biased [18], the balanced accuracy (BAC), accuracy in the test dataset (Acc), F value, and Matthews correlation coefficient (MCC) were also used, applying a cut-off point calculated with the JMP Pro 14 statistical discovery software, to avoid a biased evaluation of the imbalanced data. Thus, it follows that
BAC = (Sensitivity + Specificity)/2, (3)
Accuracy = (TP + TN)/(TP + FP + TN + FN), (4)
Precision = TP/(TP + FP), (5)
Recall = TP/(TP + FN), (6)
F value = 2 × Recall × Precision/(Recall + Precision), (7)
MCC = (TP × TN − FP × FN)/√((TP + FP)(TP + FN)(TN + FP)(TN + FN)). (8)
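For reference, all of the metrics in Equations (1)-(8) can be computed directly from the confusion-matrix counts; a small sketch (with arbitrary example counts) follows.

```python
# Sketch: evaluation metrics from Equations (1)-(8), computed from raw confusion-matrix counts.
from math import sqrt

def metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    sensitivity = tp / (tp + fn)                       # Eq. (1), also the recall of Eq. (6)
    specificity = tn / (tn + fp)                       # Eq. (2)
    bac = (sensitivity + specificity) / 2              # Eq. (3)
    accuracy = (tp + tn) / (tp + fp + tn + fn)         # Eq. (4)
    precision = tp / (tp + fp)                         # Eq. (5)
    f_value = 2 * sensitivity * precision / (sensitivity + precision)                   # Eq. (7)
    mcc = (tp * tn - fp * fn) / sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))     # Eq. (8)
    return {"Sensitivity": sensitivity, "Specificity": specificity,
            "BAC": bac, "Acc": accuracy, "F": f_value, "MCC": mcc}

print(metrics(tp=70, fp=20, tn=600, fn=10))            # arbitrary example counts
```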
For RF and XGB, the AUC was calculated using the scikit-learn Python package [77]. The prediction models were optimized over potentially important parameters (https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html) [80,81,82].
4. Conclusions
This study proposed a DL-based strategy to predict CAR agonists, a target of extensive study owing to its importance in the toxicological and pharmaceutical fields, by applying the DeepSnap-DL approach to the Tox21 10k library. Compared with conventional ML methods, including RF and XGBoost, the DeepSnap-DL approach showed superior prediction performance when suitable 3D chemical structures produced by CORINA were used as input data for the DL. A similar DeepSnap-DL strategy could also be applied to other MIE targets and KE pathways to understand the mechanisms by which chemical toxicants cause AOs. Overall, we conclude that the optimal use of 3D structures in the DeepSnap-DL approach can yield prediction models with very high performance for CAR agonists.
Acknowledgments
The environmental setting for Python in Ubuntu was supported by Shunichi Sasaki.
Abbreviations
2D | Two-dimensional |
3D | Three-dimensional |
Acc | Accuracy in the test dataset |
AO | Adverse outcome |
AOP | Adverse outcome pathway |
AUC | Area under the curve |
AT | Atom size for the van der Waals radius |
AV | Accuracy in the validation dataset |
BAC | Balanced accuracy |
BMD | Bound minimum distance |
BR | Bond radius |
BSs | Batch sizes |
BT | Bond tolerance |
CAR | Constitutive androstane receptor |
CNN | Convolutional neural network |
DIGITS | Deep learning GPU training system |
DL | Deep learning |
DNNs | Deep neural networks |
F | F value |
KEs | Key events |
KER | Key event relationship |
LR | Learning rate |
LV | Loss in the validation dataset |
MBD | Minimum bond distance |
MCC | Matthews correlation coefficient |
MIE | Molecular initiating event |
ML | Machine learning |
MOE | Molecular operating environment |
MPS | Number of molecules per SDF file to split into |
NN | Neural network |
RF | Random Forest |
ROC | Receiver operating characteristic |
SMILES | Simplified molecular input line entry system |
SVM | Support vector machine |
Tox21 | Toxicology in the 21st century |
XGB | eXtreme Gradient Boosting |
ZF | Zoom factor |
Supplementary Materials
Supplementary materials can be found at https://www.mdpi.com/1422-0067/20/19/4855/s1.
Author Contributions
Conceptualization, Y.M. and Y.U.; methodology, Y.M. and Y.U.; formal analysis for DL and ML, Y.M.; project administration and supervision, Y.U.; writing—original draft, Y.M.; writing—review and editing, Y.U.; approved the final manuscript, Y.M. and Y.U.; funding acquisition, Y.U.
Funding
This study was funded in part by grants from the Long-Range Research Initiative, Japan Chemical Industry Association (16_PT01-02) and the Ministry of Economy, Trade and Industry, AI-SHIPS (AI-based Substances Hazardous Integrated Prediction System) project (20180314ZaiSei8).
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Thomas R.S., Paules R.S., Simeonov A., Fitzpatrick S.C., Crofton K.M., Casey W.M., Mendrick D.L. The US Federal Tox21 Program: A strategic and operational plan for continued leadership. ALTEX. 2018;35:163–168. doi: 10.14573/altex.1803011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Xia M., Huang R., Shi Q., Boyd W.A., Zhao J., Sun N., Rice J.R., Dunlap P.E., Hackstadt A.J., Bridge M.F., et al. Comprehensive Analyses and Prioritization of Tox21 10K Chemicals Affecting Mitochondrial Function by in-Depth Mechanistic Studies. Environ. Health Perspect. 2018;126:077010. doi: 10.1289/EHP2589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Sipes N.S., Wambaugh J.F., Pearce R., Auerbach S.S., Wetmore B.A., Hsieh J.H., Shapiro A.J., Svoboda D., DeVito M.J., Ferguson S.S. An Intuitive Approach for Predicting Potential Human Health Risk with the Tox21 10k Library. Environ. Sci. Technol. 2017;51:10786–10796. doi: 10.1021/acs.est.7b00650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lynch C., Mackowiak B., Huang R., Li L., Heyward S., Sakamuru S., Wang H., Xia M. Identification of Modulators That Activate the Constitutive Androstane Receptor From the Tox21 10K Compound Library. Toxicol. Sci. 2019;167:282–292. doi: 10.1093/toxsci/kfy242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ankley G.T., Bennett R.S., Erickson R.J., Hoff D.J., Hornung M.W., Johnson R.D., Mount D.R., Nichols J.W., Russom C.L., Schmieder P.K., et al. Adverse outcome pathways: A conceptual framework to support ecotoxicology research and risk assessment. Environ. Toxicol. Chem. 2010;29:730–741. doi: 10.1002/etc.34. [DOI] [PubMed] [Google Scholar]
- 6.Vinken M. The adverse outcome pathway concept: A pragmatic tool in toxicology. Toxicology. 2013;312:158–165. doi: 10.1016/j.tox.2013.08.011. [DOI] [PubMed] [Google Scholar]
- 7.Villeneuve D.L., Crump D., Garcia-Reyero N., Hecker M., Hutchinson T.H., LaLone C.A., Landesmann B., Lettieri T., Munn S., Nepelska M., et al. Adverse outcome pathway development II: Best practices. Toxicol. Sci. 2014;142:321–330. doi: 10.1093/toxsci/kfu200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bal-Price A., Crofton K.M., Sachana M., Shafer T.J., Behl M., Forsby A., Hargreaves A., Landesmann B., Lein P.J., Louisse J., et al. Putative adverse outcome pathways relevant to neurotoxicity. Crit. Rev. Toxicol. 2015;45:83–91. doi: 10.3109/10408444.2014.981331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bal-Price A., Lein P.J., Keil K.P., Sethi S., Shafer T., Barenys M., Fritsche E., Sachana M., Meek M.E.B. Developing and applying the adverse outcome pathway concept for understanding and predicting neurotoxicity. Neurotoxicology. 2017;59:240–255. doi: 10.1016/j.neuro.2016.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Perkins E.J., Antczak P., Burgoon L., Falciani F., Garcia-Reyero N., Gutsell S., Hodges G., Kienzler A., Knapen D., McBride M., et al. Adverse Outcome Pathways for Regulatory Applications: Examination of Four Case Studies With Different Degrees of Completeness and Scientific Confidence. Toxicol. Sci. 2015;148:14–25. doi: 10.1093/toxsci/kfv181. [DOI] [PubMed] [Google Scholar]
- 11.El-Masri H., Kleinstreuer N., Hines R.N., Adams L., Tal T., Isaacs K., Wetmore B.A., Tan Y.M. Integration of Life-Stage Physiologically Based Pharmacokinetic Models with Adverse Outcome Pathways and Environmental Exposure Models to Screen for Environmental Hazards. Toxicol. Sci. 2016;152:230–243. doi: 10.1093/toxsci/kfw082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Leist M., Ghallab A., Graepel R., Marchan R., Hassan R., Bennekou S.H., Limonciel A., Vinken M., Schildknecht S., Waldmann T., et al. Adverse outcome pathways: Opportunities, limitations and open questions. Arch. Toxicol. 2017;91:3477–3505. doi: 10.1007/s00204-017-2045-3. [DOI] [PubMed] [Google Scholar]
- 13.Davis A.P., Wiegers T.C., Wiegers J., Johnson R.J., Sciaky D., Grondin C.J., Mattingly C.J. Chemical-Induced Phenotypes at CTD Help Inform the Predisease State and Construct Adverse Outcome Pathways. Toxicol. Sci. 2018;165:145–156. doi: 10.1093/toxsci/kfy131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Terron A., Bal-Price A., Paini A., Monnet-Tschudi F., Bennekou S.H., EFSA WG EPI1 Members. Leist M., Schildknecht S. An adverse outcome pathway for parkinsonian motor deficits associated with mitochondrial complex I inhibition. Arch. Toxicol. 2018;92:41–82. doi: 10.1007/s00204-017-2133-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wang D. Infer the in vivo point of departure with ToxCast in vitro assay data using a robust learning approach. Arch. Toxicol. 2018;92:2913–2922. doi: 10.1007/s00204-018-2260-6. [DOI] [PubMed] [Google Scholar]
- 16.Song C., Charli A., Luo J., Riaz Z., Jin H., Anantharam V., Kanthasamy A., Kanthasamy A.G. Mechanistic Interplay between Autophagy and Apoptotic Signaling in Endosulfan-induced Dopaminergic Neurotoxicity: Relevance to the Adverse OutcomePathway in Pesticide Neurotoxicity. Toxicol. Sci. 2019;169:333–352. doi: 10.1093/toxsci/kfz049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Huang R., Sakamuru S., Martin M.T., Reif D.M., Judson R.S., Houck K.A., Casey W., Hsieh J.H., Shockley K.R., Ceger P., et al. Profiling of the Tox21 10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci. Rep. 2014:4. doi: 10.1038/srep05664. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Huang R., Xia M., Sakamuru S., Zhao J., Shahane S.A., Attene-Ramos M., Zhao T., Austin C.P., Simeonov A. Modelling the Tox21 10 K chemical profiles for in vivo toxicity prediction and mechanism characterization. Nat. Commun. 2016;7:10425. doi: 10.1038/ncomms10425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mahadevan B., Snyder R.D., Waters M.D., Benz R.D., Kemper R.A., Tice R.R., Richard A.M. Genetic toxicology in the 21st century: Reflections and future directions. Environ. Mol. Mutagen. 2011;52:339–354. doi: 10.1002/em.20653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Attene-Ramos M.S., Miller N., Huang R., Michael S., Itkin M., Kavlock R.J., Austin C.P., Shinn P., Simeonov A., Tice R.R., et al. The Tox21 robotic platform for the assessment of environmental chemicals--from vision to reality. Drug Discov. Today. 2013;18:716–723. doi: 10.1016/j.drudis.2013.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cherian M.T., Chai S.C., Chen T. Small-molecule modulators of the constitutive androstane receptor. Expert Opin. Drug Metab. Toxicol. 2015;11:1099–1114. doi: 10.1517/17425255.2015.1043887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kobayashi K., Hashimoto M., Honkakoski P., Negishi M. Regulation of gene expression by CAR: An update. Arch. Toxicol. 2015;89:1045–1055. doi: 10.1007/s00204-015-1522-9. [DOI] [PubMed] [Google Scholar]
- 23.Mackowiak B., Wang H. Mechanisms of xenobiotic receptor activation: Direct vs. indirect. Biochim. Biophys. Acta. 2016;1859:1130–1140. doi: 10.1016/j.bbagrm.2016.02.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lee K., You H., Choi J., No K.T. Development of pharmacophore-based classification model for activators of constitutive androstane receptor. Drug Metab. Pharmacokinet. 2017;32:172–178. doi: 10.1016/j.dmpk.2016.11.005. [DOI] [PubMed] [Google Scholar]
- 25.Kato H., Yamaotsu N., Iwazaki N., Okamura S., Kume T., Hirono S. Precise prediction of activators for the human constitutive androstane receptorusing structure-based three-dimensional quantitative structure-activity relationship methods. Drug Metab. Pharmacokinet. 2017;32:179–188. doi: 10.1016/j.dmpk.2017.02.001. [DOI] [PubMed] [Google Scholar]
- 26.Freires I.A., Sardi J.C., de Castro R.D., Rosalen P.L. Alternative Animal and Non-Animal Models for Drug Discovery and Development: Bonus or Burden? Pharm. Res. 2017;34:681–686. doi: 10.1007/s11095-016-2069-z. [DOI] [PubMed] [Google Scholar]
- 27.Alves-Pimenta S., Colaço B., Oliveira P.A., Venâncio C. Biological Concerns on the Selection of Animal Models for Teratogenic Testing. Methods Mol. Biol. 2018;1797:61–93. doi: 10.1007/978-1-4939-7883-0_3. [DOI] [PubMed] [Google Scholar]
- 28.Ponzoni I., Sebastián-Pérez V., Requena-Triguero C., Roca C., Martínez M.J., Cravero F., Díaz M.F., Páez J.A., Arrayás R.G., Adrio J., et al. Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery. Sci. Rep. 2017;7:2403. doi: 10.1038/s41598-017-02114-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Xia L.Y., Wang Y.W., Meng D.Y., Yao X.J., Chai H., Liang Y. Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure. Int. J. Mol. Sci. 2017;19:30. doi: 10.3390/ijms19010030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Khan P.M., Roy K. Current approaches for choosing feature selection and learning algorithms in quantitative structure-activity relationships (QSAR) Expert Opin. Drug Discov. 2018;13:1075–1089. doi: 10.1080/17460441.2018.1542428. [DOI] [PubMed] [Google Scholar]
- 31. Moriwaki H., Tian Y.S., Kawashita N., Takagi T. Three-Dimensional Classification Structure-Activity Relationship Analysis Using Convolutional Neural Network. Chem. Pharm. Bull. 2019;67:426–432. doi: 10.1248/cpb.c18-00757.
- 32. Xu Y., Pei J., Lai L. Deep Learning Based Regression and Multiclass Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction. J. Chem. Inf. Model. 2017;57:2672–2685. doi: 10.1021/acs.jcim.7b00244.
- 33. Fernandez M., Ban F., Woo G., Hsing M., Yamazaki T., LeBlanc E., Rennie P.S., Welch W.J., Cherkasov A. Toxic Colors: The Use of Deep Learning for Predicting Toxicity of Compounds Merely from Their Graphic Images. J. Chem. Inf. Model. 2018;58:1533–1543. doi: 10.1021/acs.jcim.8b00338.
- 34. Tang W., Chen J., Wang Z., Xie H., Hong H. Deep learning for predicting toxicity of chemicals: A mini review. J. Environ. Sci. Health C Environ. Carcinog. Ecotoxicol. Rev. 2018;36:252–271. doi: 10.1080/10590501.2018.1537563.
- 35. Cai C., Guo P., Zhou Y., Zhou J., Wang Q., Zhang F., Fang J., Cheng F. Deep Learning-Based Prediction of Drug-Induced Cardiotoxicity. J. Chem. Inf. Model. 2019;59:1073–1084. doi: 10.1021/acs.jcim.8b00769.
- 36. Koutsoukas A., Monaghan K.J., Li X., Huan J. Deep-learning: Investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data. J. Cheminform. 2017;9:42. doi: 10.1186/s13321-017-0226-y.
- 37. Lenselink E.B., Ten Dijke N., Bongers B., Papadatos G., van Vlijmen H.W.T., Kowalczyk W., IJzerman A.P., van Westen G.J.P. Beyond the hype: Deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set. J. Cheminform. 2017;9:45. doi: 10.1186/s13321-017-0232-0.
- 38. Winkler D.A., Le T.C. Performance of Deep and Shallow Neural Networks, the Universal Approximation Theorem, Activity Cliffs, and QSAR. Mol. Inform. 2017;36. doi: 10.1002/minf.201600118.
- 39. Liu R., Wang H., Glover K.P., Feasel M.G., Wallqvist A. Dissecting Machine-Learning Prediction of Molecular Activity: Is an Applicability Domain Needed for Quantitative Structure-Activity Relationship Models Based on Deep Neural Networks? J. Chem. Inf. Model. 2019;59:117–126. doi: 10.1021/acs.jcim.8b00348.
- 40. Simões R.S., Maltarollo V.G., Oliveira P.R., Honorio K.M. Transfer and Multi-task Learning in QSAR Modeling: Advances and Challenges. Front. Pharm. 2018;9:74. doi: 10.3389/fphar.2018.00074.
- 41. Park J.G., Jo S. Bayesian Weight Decay on Bounded Approximation for Deep Convolutional Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2019;30:2866–2875. doi: 10.1109/TNNLS.2018.2886995.
- 42. Cruz-Monteagudo M., Medina-Franco J.L., Pérez-Castillo Y., Nicolotti O., Cordeiro M.N., Borges F. Activity cliffs in drug discovery: Dr Jekyll or Mr Hyde? Drug Discov. Today. 2014;19:1069–1080. doi: 10.1016/j.drudis.2014.02.003.
- 43. Horvath D., Marcou G., Varnek A., Kayastha S., de la Vega de León A., Bajorath J. Prediction of Activity Cliffs Using Condensed Graphs of Reaction Representations, Descriptor Recombination, Support Vector Machine Classification, and Support Vector Regression. J. Chem. Inf. Model. 2016;56:1631–1640. doi: 10.1021/acs.jcim.6b00359.
- 44. Uesawa Y. Quantitative structure-activity relationship analysis using deep learning based on a novel molecular image input technique. Bioorg. Med. Chem. Lett. 2018;28:3400–3403. doi: 10.1016/j.bmcl.2018.08.032.
- 45. Matsuzaka Y., Uesawa Y. Optimization of a Deep-Learning Method Based on the Classification of Images Generated by Parameterized Deep Snap a Novel Molecular-Image-Input Technique for Quantitative Structure-Activity Relationship (QSAR) Analysis. Front. Bioeng. Biotechnol. 2019;7:65. doi: 10.3389/fbioe.2019.00065.
- 46. Xu M., Papageorgiou D.P., Abidi S.Z., Dao M., Zhao H., Karniadakis G.E. A deep convolutional neural network for classification of red blood cells in sickle cell anemia. PLoS Comput. Biol. 2017;13:e1005746. doi: 10.1371/journal.pcbi.1005746.
- 47. Yang Y., Ye Z., Su Y., Zhao Q., Li X., Ouyang D. Deep learning for in vitro prediction of pharmaceutical formulations. Acta Pharm. Sin. B. 2019;9:177–185. doi: 10.1016/j.apsb.2018.09.010.
- 48. Cui Y., Dong Q., Hong D., Wang X. Predicting protein-ligand binding residues with deep convolutional neural networks. BMC Bioinform. 2019;20:93. doi: 10.1186/s12859-019-2672-1.
- 49. Yang Y., Yan L.F., Zhang X., Han Y., Nan H.Y., Hu Y.C., Hu B., Yan S.L., Zhang J., Cheng D.L., et al. Glioma Grading on Conventional MR Images: A Deep Learning Study With Transfer Learning. Front. Neurosci. 2018;12:804. doi: 10.3389/fnins.2018.00804.
- 50. Dybiec K., Molchanov S., Gryff-Keller A. Structure of neutral molecules and monoanions of selected oxopurines in aqueous solutions as studied by NMR spectroscopy and theoretical calculations. J. Phys. Chem. A. 2011;115:2057–2064. doi: 10.1021/jp110888m.
- 51. Wang L., Liu Y., Zhang Y., Yasin A., Zhang L. Investigating Stability and Tautomerization of Gossypol-A Spectroscopy Study. Molecules. 2019;24:1286. doi: 10.3390/molecules24071286.
- 52. Parthiban V., Gromiha M.M., Hoppe C., Schomburg D. Structural analysis and prediction of protein mutant stability using distance and torsion potentials: Role of secondary structure and solvent accessibility. Proteins. 2007;66:41–52. doi: 10.1002/prot.21115.
- 53. Mervin L.H., Afzal A.M., Drakakis G., Lewis R., Engkvist O., Bender A. Target prediction utilising negative bioactivity data covering large chemical space. J. Cheminform. 2015;7:51. doi: 10.1186/s13321-015-0098-y.
- 54. Klingspohn W., Mathea M., Ter Laak A., Heinrich N., Baumann K. Efficiency of different measures for defining the applicability domain of classification models. J. Cheminform. 2017;9:44. doi: 10.1186/s13321-017-0230-2.
- 55. Huang R., Xia M., Sakamuru S., Zhao J., Lynch C., Zhao T., Zhu H., Austin C.P., Simeonov A. Expanding biological space coverage enhances the prediction of drug adverse effects in human using in vitro activity profiles. Sci. Rep. 2018;8:3783. doi: 10.1038/s41598-018-22046-w.
- 56. Jang Y., Kim S., Kim K., Lee D. Deep learning-based classification with improved time resolution for physical activities of children. PeerJ. 2018;6:e5764. doi: 10.7717/peerj.5764.
- 57. Low Y., Uehara T., Minowa Y., Yamada H., Ohno Y., Urushidani T., Sedykh A., Muratov E., Kuz’min V., Fourches D., et al. Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches. Chem. Res. Toxicol. 2011;24:1251–1262. doi: 10.1021/tx200148a.
- 58. Marzo M., Kulkarni S., Manganaro A., Roncaglioni A., Wu S., Barton-Maclaren T.S., Lester C., Benfenati E. Integrating in silico models to enhance predictivity for developmental toxicity. Toxicology. 2016;370:127–137. doi: 10.1016/j.tox.2016.09.015.
- 59. Maggiora G.M. On outliers and activity cliffs: Why QSAR often disappoints. J. Chem. Inf. Model. 2006;46:1535. doi: 10.1021/ci060117s.
- 60. Stumpfe D., Bajorath J. Exploring activity cliffs in medicinal chemistry. J. Med. Chem. 2012;55:2932–2942. doi: 10.1021/jm201706b.
- 61. Stumpfe D., Hu H., Bajorath J. Introducing a new category of activity cliffs with chemical modifications at multiple sites and rationalizing contributions of individual substitutions. Bioorg. Med. Chem. 2019;27:3605–3612. doi: 10.1016/j.bmc.2019.06.045.
- 62. Bajorath J. Representation and identification of activity cliffs. Expert Opin. Drug Discov. 2017;12:879–883. doi: 10.1080/17460441.2017.1353494.
- 63. LeCun Y., Bengio Y., Hinton G. Deep learning. Nature. 2015;521:436–444. doi: 10.1038/nature14539.
- 64. Moriwaki H., Tian Y.S., Kawashita N., Takagi T. Mordred: A molecular descriptor calculator. J. Cheminform. 2018;10:4. doi: 10.1186/s13321-018-0258-y.
- 65. Sakamuru S., Li X., Attene-Ramos M.S., Huang R., Lu J., Shou L., Shen M., Tice R.R., Austin C.P., Xia M. Application of a homogenous membrane potential assay to assess mitochondrial function. Physiol. Genom. 2012;44:495–503. doi: 10.1152/physiolgenomics.00161.2011.
- 66. Inglese J., Auld D.S., Jadhav A., Johnson R.L., Simeonov A., Yasgar A., Zheng W., Austin C.P. Quantitative high-throughput screening: A titration-based approach that efficiently identifies biological activities in large chemical libraries. Proc. Natl. Acad. Sci. USA. 2006;103:11473–11478. doi: 10.1073/pnas.0604348103.
- 67. Chen I.J., Foloppe N. Conformational sampling of druglike molecules with MOE and catalyst: Implications for pharmacophore modeling and virtual screening. J. Chem. Inf. Model. 2008;48:1773–1791. doi: 10.1021/ci800130k.
- 68. Agrafiotis D.K., Gibbs A.C., Zhu F., Izrailev S., Martin E. Conformational sampling of bioactive molecules: A comparative study. J. Chem. Inf. Model. 2007;47:1067–1086. doi: 10.1021/ci6005454.
- 69. Hanson R.M. Jmol SMILES and Jmol SMARTS: Specifications and applications. J. Cheminform. 2016;8:50. doi: 10.1186/s13321-016-0160-4.
- 70. Scalfani V.F., Williams A.J., Tkachenko V., Karapetyan K., Pshenichnov A., Hanson R.M., Liddie J.M., Bara J.E. Programmatic conversion of crystal structures into 3D printable files using Jmol. J. Cheminform. 2016;8:66. doi: 10.1186/s13321-016-0181-z.
- 71. Hanson R.M., Lu X.J. DSSR-enhanced visualization of nucleic acid structures in Jmol. Nucleic Acids Res. 2017;45:W528–W533. doi: 10.1093/nar/gkx365.
- 72. Krizhevsky A., Sutskever I., Hinton G.E. ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012;1:1097–1105. doi: 10.1145/3065386.
- 73. Szegedy C., Liu W., Jia Y., Sermanet P., Reed S., Anguelov D., Erhan D., Vanhoucke V., Rabinovich A. Going Deeper with Convolutions; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; Boston, MA, USA, 7–12 June 2015; pp. 1–9.
- 74. Rawat W., Wang Z. Deep convolutional neural networks for image classification: A comprehensive review. Neural Comput. 2017;29:2352–2449. doi: 10.1162/neco_a_00990.
- 75. Karri S.P., Chakraborty D., Chatterjee J. Transfer learning based classification of optical coherence tomography images with diabetic macular edema and dry age-related macular degeneration. Biomed. Opt. Express. 2017;8:579–592. doi: 10.1364/BOE.8.000579.
- 76. Saubern S., Guha R., Baell J.B. KNIME Workflow to Assess PAINS Filters in SMARTS Format. Comparison of RDKit and Indigo Cheminformatics Libraries. Mol. Inform. 2011;30:847–850. doi: 10.1002/minf.201100076.
- 77. Abraham A., Pedregosa F., Eickenberg M., Gervais P., Mueller A., Kossaifi J., Gramfort A., Thirion B., Varoquaux G. Machine learning for neuroimaging with scikit-learn. Front. Neuroinform. 2014;8:14. doi: 10.3389/fninf.2014.00014.
- 78. Kensert A., Alvarsson J., Norinder U., Spjuth O. Evaluating parameters for ligand-based modeling with random forest on sparse data sets. J. Cheminform. 2018;10:49. doi: 10.1186/s13321-018-0304-9.
- 79. Sandino J., Gonzalez F., Mengersen K., Gaston K.J. UAVs and Machine Learning Revolutionising Invasive Grass and Vegetation Surveys in Remote Arid Lands. Sensors. 2018;18:605. doi: 10.3390/s18020605.
- 80. Hongjaisee S., Nantasenamat C., Carraway T.S., Shoombuatong W. HIVCoR: A sequence-based tool for predicting HIV-1 CRF01_AE coreceptor usage. Comput. Biol. Chem. 2019;80:419–432. doi: 10.1016/j.compbiolchem.2019.05.006.
- 81. Laengsri V., Nantasenamat C., Schaduangrat N., Nuchnoi P., Prachayasittikul V., Shoombuatong W. TargetAntiAngio: A Sequence-Based Tool for the Prediction and Analysis of Anti-Angiogenic Peptides. Int. J. Mol. Sci. 2019;20:2950. doi: 10.3390/ijms20122950.
- 82. Shoombuatong W., Schaduangrat N., Pratiwi R., Nantasenamat C. THPep: A machine learning-based approach for predicting tumor homing peptides. Comput. Biol. Chem. 2019;80:441–451. doi: 10.1016/j.compbiolchem.2019.05.008.